Modifier and Type | Method and Description |
---|---|
void |
NGramTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
DefaultTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
Tokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor)
Set the token pre process
|
void |
DefaultStreamTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
Modifier and Type | Class and Description |
---|---|
class |
CommonPreprocessor |
class |
EndingPreProcessor
Gets rid of endings:
ed,ing, ly, s, .
|
class |
LowCasePreProcessor |
Modifier and Type | Method and Description |
---|---|
TokenPreProcess |
NGramTokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
TokenPreProcess |
DefaultTokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
TokenPreProcess |
TokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
Modifier and Type | Method and Description |
---|---|
void |
NGramTokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor) |
void |
DefaultTokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor) |
void |
TokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used
with every tokenizer
|
Copyright © 2017. All rights reserved.