Modifier and Type | Method and Description |
---|---|
void |
DefaultStreamTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
DefaultTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
BertWordPieceTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
NGramTokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor) |
void |
Tokenizer.setTokenPreProcessor(TokenPreProcess tokenPreProcessor)
Set the token pre process
|
Constructor and Description |
---|
BertWordPieceStreamTokenizer(InputStream tokens,
Charset encoding,
NavigableMap<String,Integer> vocab,
TokenPreProcess preTokenizePreProcessor,
TokenPreProcess tokenPreProcess) |
BertWordPieceTokenizer(String tokens,
NavigableMap<String,Integer> vocab,
TokenPreProcess preTokenizePreProcessor,
TokenPreProcess tokenPreProcess) |
Modifier and Type | Class and Description |
---|---|
class |
BertWordPiecePreProcessor |
class |
CommonPreprocessor |
class |
CompositePreProcessor |
class |
EndingPreProcessor
Gets rid of endings:
ed,ing, ly, s, .
|
class |
LowCasePreProcessor |
Constructor and Description |
---|
CompositePreProcessor(TokenPreProcess... preProcessors) |
Modifier and Type | Method and Description |
---|---|
TokenPreProcess |
NGramTokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
TokenPreProcess |
DefaultTokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
TokenPreProcess |
TokenizerFactory.getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance
|
Modifier and Type | Method and Description |
---|---|
void |
NGramTokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor) |
void |
DefaultTokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor) |
void |
TokenizerFactory.setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used
with every tokenizer
|
Constructor and Description |
---|
BertWordPieceTokenizerFactory(NavigableMap<String,Integer> vocab,
TokenPreProcess preTokenizePreProcessor) |
Copyright © 2022. All rights reserved.