Class DefaultTokenizerFactory
java.lang.Object
    org.deeplearning4j.text.tokenization.tokenizerfactory.DefaultTokenizerFactory
All Implemented Interfaces: TokenizerFactory
public class DefaultTokenizerFactory extends Object implements TokenizerFactory
Default tokenizer factory, based on a string tokenizer or a stream tokenizer.
Author: Adam Gibson
Constructor Summary
Constructor | Description
DefaultTokenizerFactory() |
Method Summary
Modifier and Type | Method | Description
Tokenizer | create(InputStream toTokenize) | Creates a tokenizer based on an input stream
Tokenizer | create(String toTokenize) | Creates a tokenizer for the given string
TokenPreProcess | getTokenPreProcessor() | Returns the token pre-processor set for this TokenizerFactory instance
void | setTokenPreProcessor(TokenPreProcess preProcessor) | Sets a token pre-processor to be used with every tokenizer
Method Detail
create
public Tokenizer create(String toTokenize)
Description copied from interface: TokenizerFactory
Creates a tokenizer for the given string.
Specified by: create in interface TokenizerFactory
Parameters: toTokenize - the string to create the tokenizer with
Returns: the new tokenizer
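As an illustration of the "string tokenizer" idea behind create(String), here is a minimal sketch that splits a string on whitespace with java.util.StringTokenizer. The class StringFactorySketch is a hypothetical stand-in written for this page so that it compiles without the library on the classpath. It is not the DefaultTokenizerFactory implementation, and returning a plain List instead of a Tokenizer is a simplification.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Hypothetical stand-in, not the library class: shows the general shape of a
// factory method that turns a String into a sequence of tokens.
class StringFactorySketch {

    // Splits on whitespace using StringTokenizer's default delimiters;
    // runs of consecutive delimiters produce no empty tokens.
    static List<String> create(String toTokenize) {
        StringTokenizer st = new StringTokenizer(toTokenize);
        List<String> tokens = new ArrayList<>();
        while (st.hasMoreTokens()) {
            tokens.add(st.nextToken());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [The, quick, brown, fox]
        System.out.println(create("The quick  brown fox"));
    }
}
```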
create
public Tokenizer create(InputStream toTokenize)
Description copied from interface: TokenizerFactory
Creates a tokenizer based on an input stream.
Specified by: create in interface TokenizerFactory
Returns: the new tokenizer
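To illustrate stream-based tokenization, the sketch below reads word tokens from an InputStream with java.io.StreamTokenizer. That the "stream tokenizer" mentioned in the class description behaves like this JDK class is an assumption. StreamFactorySketch is a hypothetical stand-in, and UTF-8 decoding and the decision to keep only word tokens are choices made for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.StreamTokenizer;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in, not the library class.
class StreamFactorySketch {

    // Reads the stream to its end and collects every word token.
    // Numbers and punctuation are ignored in this sketch.
    static List<String> tokens(InputStream toTokenize) {
        StreamTokenizer st = new StreamTokenizer(
                new InputStreamReader(toTokenize, StandardCharsets.UTF_8));
        List<String> out = new ArrayList<>();
        try {
            while (st.nextToken() != StreamTokenizer.TT_EOF) {
                if (st.ttype == StreamTokenizer.TT_WORD) {
                    out.add(st.sval);
                }
            }
        } catch (IOException e) {
            // Rethrow unchecked so callers need no throws clause.
            throw new UncheckedIOException(e);
        }
        return out;
    }

    public static void main(String[] args) {
        InputStream in = new ByteArrayInputStream(
                "hello stream world".getBytes(StandardCharsets.UTF_8));
        // prints [hello, stream, world]
        System.out.println(tokens(in));
    }
}
```

A stream-based variant can be preferable when the text is too large to hold in memory as a single String, since tokens are produced as the input is read.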
setTokenPreProcessor
public void setTokenPreProcessor(TokenPreProcess preProcessor)
Description copied from interface: TokenizerFactory
Sets a token pre-processor to be used with every tokenizer.
Specified by: setTokenPreProcessor in interface TokenizerFactory
Parameters: preProcessor - the token pre-processor to use
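The phrase "used with every tokenizer" can be pictured with the following sketch: the factory remembers one pre-processor and applies it to each token of every tokenizer it creates afterwards. PreProcessingFactorySketch is a hypothetical stand-in, and UnaryOperator<String> is used here in place of the TokenPreProcess type so that the example needs only the JDK. The lower-casing pre-processor is likewise just an example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.StringTokenizer;
import java.util.function.UnaryOperator;

// Hypothetical stand-in, not the library class.
class PreProcessingFactorySketch {

    // Stand-in for the token pre-processor; null means none has been set.
    private UnaryOperator<String> preProcessor;

    void setTokenPreProcessor(UnaryOperator<String> preProcessor) {
        this.preProcessor = preProcessor;
    }

    // Every call sees the pre-processor currently stored on the factory.
    List<String> create(String toTokenize) {
        StringTokenizer st = new StringTokenizer(toTokenize);
        List<String> tokens = new ArrayList<>();
        while (st.hasMoreTokens()) {
            String token = st.nextToken();
            tokens.add(preProcessor == null ? token : preProcessor.apply(token));
        }
        return tokens;
    }

    public static void main(String[] args) {
        PreProcessingFactorySketch factory = new PreProcessingFactorySketch();
        factory.setTokenPreProcessor(t -> t.toLowerCase(Locale.ROOT));
        System.out.println(factory.create("Hello World")); // prints [hello, world]
        System.out.println(factory.create("SECOND Call")); // prints [second, call]
    }
}
```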
getTokenPreProcessor
public TokenPreProcess getTokenPreProcessor()
Returns the token pre-processor set for this TokenizerFactory instance.
Specified by: getTokenPreProcessor in interface TokenizerFactory
Returns: the TokenPreProcess instance, or null if no pre-processor was defined
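Because the getter may return null, callers need a guard before using the result. The sketch below shows one way to write that guard with java.util.Optional. PreProcessorLookupSketch and applyIfPresent are hypothetical names introduced for this example, and UnaryOperator<String> again stands in for the TokenPreProcess type.

```java
import java.util.Optional;
import java.util.function.UnaryOperator;

// Hypothetical stand-in, not the library class.
class PreProcessorLookupSketch {

    private UnaryOperator<String> preProcessor; // stays null until set

    void setTokenPreProcessor(UnaryOperator<String> preProcessor) {
        this.preProcessor = preProcessor;
    }

    // Mirrors the documented contract: the instance, or null if none was defined.
    UnaryOperator<String> getTokenPreProcessor() {
        return preProcessor;
    }

    // Caller-side guard: applies the pre-processor when one is set,
    // otherwise returns the token unchanged.
    static String applyIfPresent(PreProcessorLookupSketch factory, String token) {
        return Optional.ofNullable(factory.getTokenPreProcessor())
                .map(p -> p.apply(token))
                .orElse(token);
    }

    public static void main(String[] args) {
        PreProcessorLookupSketch factory = new PreProcessorLookupSketch();
        System.out.println(applyIfPresent(factory, " raw "));  // prints " raw " unchanged
        factory.setTokenPreProcessor(t -> t.trim());
        System.out.println(applyIfPresent(factory, " raw "));  // prints "raw"
    }
}
```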