Add all lines from the input File to this lexicon. The File contains multiple newline-separated lexicon entries.
Add all lines from the input String to this lexicon. The String contains multiple newline-separated lexicon entries.
Add all lines from the input Source to this lexicon. The Source is assumed to contain multiple newline-separated lexicon entries.
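The bulk-loading behaviour described above can be sketched as follows. This is a minimal illustration, not the library's implementation: `LineLexiconSketch`, `addEntry`, and `addLines` are hypothetical names, and the whitespace tokenizer and lowercasing lemmatizer are assumptions made only for the example.

```scala
import scala.io.Source
import scala.collection.mutable

// Hypothetical stand-in for a lexicon: each entry is stored as a list of normalized tokens.
class LineLexiconSketch {
  private val entries = mutable.Set[List[String]]()

  // Assumed preprocessing: split on whitespace, then lowercase each token.
  private def normalize(s: String): List[String] =
    s.trim.split("\\s+").toList.filter(_.nonEmpty).map(_.toLowerCase)

  // Add one entry, which may be a multi-word phrase. Blank input is ignored.
  def addEntry(phrase: String): Unit = {
    val words = normalize(phrase)
    if (words.nonEmpty) entries += words
  }

  // Treat every line of a newline-separated Source as a separate entry.
  def addLines(source: Source): Unit =
    for (line <- source.getLines()) addEntry(line)

  // Treat every line of a newline-separated String as a separate entry.
  def addLines(text: String): Unit =
    addLines(Source.fromString(text))

  def size: Int = entries.size

  def containsWords(words: Seq[String]): Boolean =
    entries.contains(words.toList.map(_.toLowerCase))
}
```

Under these assumptions, calling `addLines("New York\nboston")` stores two entries, one of which is the two-token phrase `new york`. A File could be handled the same way by wrapping it with `Source.fromFile`.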
Tokenize and lemmatize the input String and add it as a single entry to the Lexicon
Tokenizes and lemmatizes query.string, then checks if the sequence is in the lexicon
Tokenizes and lemmatizes the string of each entry in 'query', then checks if the sequence is in the lexicon
Is the input String in the lexicon? The input is tokenized and lemmatized; if the tokenizer indicates that it is a multi-word phrase, it is processed by containsWords, otherwise by containsWord.
Checks whether the lexicon contains this already-lemmatized/tokenized single word
Checks whether the lexicon contains this already-lemmatized/tokenized phrase, where 'words' can be either a single word or a multi-word expression.
Is this single word in the lexicon? The input String will not be processed by tokenizer, but will be processed by the lemmatizer.
Is the pre-tokenized sequence of words in the lexicon? Each of the input words will be processed by the lemmatizer.
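One way the query methods described above might layer on top of each other is sketched below. The class `QuerySketch`, its `add` method, and the names `containsLemmatizedWord` and `containsLemmatizedWords` are hypothetical; only `containsWord` and `containsWords` are named in the text. The tokenizer and lemmatizer are passed in as plain functions, which is an assumption made to keep the example self-contained.

```scala
import scala.collection.mutable

// Hypothetical sketch of the query layering; not the library's actual definitions.
class QuerySketch(
    tokenize: String => Seq[String], // assumed string segmenter
    lemmatize: String => String      // assumed lemmatizer
) {
  private val entries = mutable.Set[List[String]]()

  // Tokenize and lemmatize the phrase, then store it as a single entry.
  def add(phrase: String): Unit =
    entries += tokenize(phrase).map(lemmatize).toList

  // Lookups for input that is already tokenized and lemmatized.
  def containsLemmatizedWord(word: String): Boolean = entries.contains(List(word))
  def containsLemmatizedWords(words: Seq[String]): Boolean = entries.contains(words.toList)

  // Lookups for pre-tokenized input: only the lemmatizer is applied.
  def containsWord(word: String): Boolean = containsLemmatizedWord(lemmatize(word))
  def containsWords(words: Seq[String]): Boolean = containsLemmatizedWords(words.map(lemmatize))

  // Lookup for raw input: tokenize first, then dispatch on the number of tokens.
  def contains(untokenized: String): Boolean = {
    val words = tokenize(untokenized)
    if (words.length == 1) containsWord(words.head) else containsWords(words)
  }
}
```

With a whitespace tokenizer and a lowercasing lemmatizer, an entry added as "New York" would be found by `contains("new york")` and by `containsWords(Seq("New", "York"))`, while `containsLemmatizedWord("Boston")` would miss an entry stored as `boston` because that method skips lemmatization.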
The string lemmatizer that simplifies lexicon entries and queries before searching for a match. For example, a common lemmatizer is one that lowercases all strings.
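As a small illustration of the lowercasing case mentioned above, the sketch below applies one lemmatizer to both the stored entries and the query. `LemmatizerSketch`, `LowercaseLemmatizerSketch`, and `lookup` are hypothetical names introduced for this example and are not taken from the library.

```scala
// Hypothetical lemmatizer interface: maps a word to its simplified form.
trait LemmatizerSketch {
  def lemmatize(word: String): String
}

// A lemmatizer that lowercases every string.
object LowercaseLemmatizerSketch extends LemmatizerSketch {
  def lemmatize(word: String): String = word.toLowerCase
}

// Apply the same lemmatizer to the entries and to the query before comparing them.
def lookup(entries: Set[String], query: String, lem: LemmatizerSketch): Boolean =
  entries.map(lem.lemmatize).contains(lem.lemmatize(query))
```

Because both sides pass through the same function, differences in capitalization between an entry and a query no longer prevent a match.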
An identifier for this lexicon, suitable for adding as a category to a FeatureVectorVariable[String].
Return length of match, or -1 if no match.
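The return convention above (a length on success, -1 on failure) can be illustrated with a hedged sketch. The function name `matchLength`, its parameters, and the choice to prefer the longest matching phrase are all assumptions; the source states only the return value.

```scala
// Hypothetical helper: length of the longest lexicon phrase that begins at
// position `start` in `tokens`, or -1 if no phrase begins there.
// Assumes the entries and the tokens have already been lemmatized.
def matchLength(entries: Set[List[String]], tokens: IndexedSeq[String], start: Int): Int = {
  val maxLen = if (entries.isEmpty) 0 else entries.map(_.length).max
  // Try the longest candidate first and shrink until a phrase matches.
  var len = math.min(maxLen, tokens.length - start)
  while (len > 0) {
    if (entries.contains(tokens.slice(start, start + len).toList)) return len
    len -= 1
  }
  -1
}
```

Returning -1 rather than 0 lets a caller distinguish "no match" from any valid length with a single integer comparison.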
The string segmenter that breaks lexicon entries and queries into (potentially) multi-word phrases.