@Bindable(prefix="Tokenizer") public final class Tokenizer extends Object
This class saves the following results to the PreprocessingContext:
| Modifier and Type | Field and Description |
|---|---|
| Collection<String> | documentFields: Textual fields of documents that should be tokenized and parsed for clustering. |
| Constructor and Description |
|---|
| Tokenizer() |
| Modifier and Type | Method and Description |
|---|---|
| void | tokenize(PreprocessingContext context): Performs tokenization and saves the results to the context. |
@Input @Attribute @Label(value="Document fields") @Level(value=ADVANCED) @Group(value="Preprocessing") public Collection<String> documentFields
public void tokenize(PreprocessingContext context)
Performs tokenization and saves the results to the context.
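To illustrate what restricting tokenization to a set of textual document fields means, here is a minimal, self-contained sketch. It is not the Carrot2 implementation and does not use PreprocessingContext. The class FieldTokenizerSketch, the map-based document, the field names "title", "summary" and "url", and the splitting rule are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in, NOT the Carrot2 Tokenizer: it only illustrates
// tokenizing a chosen set of textual document fields.
public class FieldTokenizerSketch {

    // Splits the selected fields of one document into lowercase word tokens.
    // Fields missing from the document are skipped; fields not listed in
    // documentFields are never looked at.
    public static Map<String, List<String>> tokenize(Map<String, String> document,
                                                     Collection<String> documentFields) {
        Map<String, List<String>> tokensByField = new LinkedHashMap<>();
        for (String field : documentFields) {
            String text = document.get(field);
            if (text == null) {
                continue;
            }
            List<String> tokens = new ArrayList<>();
            // Assumed rule: split on runs of characters that are neither letters nor digits.
            for (String token : text.split("[^\\p{L}\\p{N}]+")) {
                if (!token.isEmpty()) {
                    tokens.add(token.toLowerCase(Locale.ROOT));
                }
            }
            tokensByField.put(field, tokens);
        }
        return tokensByField;
    }

    public static void main(String[] args) {
        Map<String, String> document = new LinkedHashMap<>();
        document.put("title", "Data Mining");
        document.put("summary", "Clustering of search results.");
        document.put("url", "http://example.com/");

        // Only "title" and "summary" are tokenized; "url" is ignored.
        System.out.println(tokenize(document, Arrays.asList("title", "summary")));
    }
}
```

In this sketch the documentFields argument plays the role of the documentFields attribute described above: any field not named in it contributes no tokens.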