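Hello everyone, how can I split a string on specific delimiters (e.g. , | ; .)? For example, StringTokenizer itr = new StringTokenizer(value.toString()); splits the string on whitespace by default. How can I make it use other delimiters instead? Thanks in advance for any replies.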
public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable> {
    private final static IntWritable one = new IntWritable(1);
    private Text word = new Text();

    public void map(Object key, Text value, Context context) throws IOException, InterruptedException {
        StringTokenizer itr = new StringTokenizer(value.toString());
        while (itr.hasMoreTokens()) {
            word.set(itr.nextToken());
            context.write(word, one);
        }
    }
}
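One possible approach, sketched outside of Hadoop so it can be run on its own: java.util.StringTokenizer has a two-argument constructor, StringTokenizer(String str, String delim), where every character in delim is treated as a separator. The class name DelimiterSplit and the helper tokens below are hypothetical names used only for illustration; in the mapper above, the equivalent change would presumably be new StringTokenizer(value.toString(), ",|;.").

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Hypothetical helper class, not part of Hadoop or the original post.
class DelimiterSplit {
    // Splits text on any single character contained in delims.
    // StringTokenizer skips empty tokens between adjacent delimiters.
    static List<String> tokens(String text, String delims) {
        List<String> out = new ArrayList<>();
        StringTokenizer itr = new StringTokenizer(text, delims);
        while (itr.hasMoreTokens()) {
            out.add(itr.nextToken());
        }
        return out;
    }

    public static void main(String[] args) {
        // Each of , | ; . acts as a separator here.
        System.out.println(tokens("a,b|c;d.e", ",|;.")); // prints [a, b, c, d, e]
    }
}
```

Note that the delimiter argument is a set of characters, not a regular expression, so | and . need no escaping. If empty fields between consecutive delimiters matter, String.split("[,|;.]") may be a better fit, since StringTokenizer drops them.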