Class NGramTokenizerFactory
- java.lang.Object
-
- org.deeplearning4j.text.tokenization.tokenizerfactory.NGramTokenizerFactory
-
- All Implemented Interfaces:
TokenizerFactory
public class NGramTokenizerFactory extends Object implements TokenizerFactory
- Author:
- sonali
-
-
Constructor Summary
Constructors Constructor Description NGramTokenizerFactory(TokenizerFactory tokenizerFactory, Integer minN, Integer maxN)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description Tokenizer
create(InputStream toTokenize)
Create a tokenizer based on an input streamTokenizer
create(String toTokenize)
The tokenizer to createComplexTokenPreProcess
getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instancevoid
setTokenPreProcessor(TokenPreProcess preProcessor)
Sets a token pre processor to be used with every tokenizer
-
-
-
Constructor Detail
-
NGramTokenizerFactory
public NGramTokenizerFactory(TokenizerFactory tokenizerFactory, Integer minN, Integer maxN)
-
-
Method Detail
-
create
public Tokenizer create(String toTokenize)
Description copied from interface:TokenizerFactory
The tokenizer to createComplex- Specified by:
create
in interfaceTokenizerFactory
- Parameters:
toTokenize
- the string to createComplex the tokenizer with- Returns:
- the new tokenizer
-
create
public Tokenizer create(InputStream toTokenize)
Description copied from interface:TokenizerFactory
Create a tokenizer based on an input stream- Specified by:
create
in interfaceTokenizerFactory
- Returns:
-
setTokenPreProcessor
public void setTokenPreProcessor(TokenPreProcess preProcessor)
Description copied from interface:TokenizerFactory
Sets a token pre processor to be used with every tokenizer- Specified by:
setTokenPreProcessor
in interfaceTokenizerFactory
- Parameters:
preProcessor
- the token pre processor to use
-
getTokenPreProcessor
public TokenPreProcess getTokenPreProcessor()
Returns TokenPreProcessor set for this TokenizerFactory instance- Specified by:
getTokenPreProcessor
in interfaceTokenizerFactory
- Returns:
- TokenPreProcessor instance, or null if no preprocessor was defined
-
-