Index

$ A B C D E F G H I J L M N O P R S T U V W 
All Classes and Interfaces|All Packages|Serialized Form

$

$ - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation $

A

Abbreviations - Interface in smile.nlp.dictionary
A dictionary interface for abbreviations.
add(Text) - Method in class smile.nlp.SimpleCorpus
Adds a document to the corpus.
addAnchor(String) - Method in interface smile.nlp.AnchorText
Adds a link label to the anchor text.
addAnchor(String) - Method in class smile.nlp.SimpleText
 
addChild(K[], V, int) - Method in class smile.nlp.Trie.Node
Adds a child.
AnchorText - Interface in smile.nlp
The anchor text is the visible, clickable text in a hyperlink.
apply(String) - Method in class smile.nlp.embedding.Word2Vec
Returns the embedding vector of a word.
apply(String) - Method in interface smile.nlp.tokenizer.Tokenizer
 
avgDocSize() - Method in interface smile.nlp.Corpus
Returns the average size of documents in the corpus.
avgDocSize() - Method in class smile.nlp.SimpleCorpus
 

B

Bigram - Class in smile.nlp
Bigrams or digrams are groups of two words, and are very commonly used as the basis for simple statistical analysis of text.
Bigram - Class in smile.nlp.collocation
Collocations are expressions of multiple words which commonly co-occur.
Bigram(String, String) - Constructor for class smile.nlp.Bigram
Constructor.
Bigram(String, String, int, double) - Constructor for class smile.nlp.collocation.Bigram
Constructor.
bigrams() - Method in interface smile.nlp.Corpus
Returns the iterator over the bigrams in the corpus.
bigrams() - Method in class smile.nlp.SimpleCorpus
 
BM25 - Class in smile.nlp.relevance
The BM25 weighting scheme, often called Okapi weighting, after the system in which it was first implemented, was developed as a way of building a probabilistic model sensitive to term frequency and document length while not introducing too many additional parameters into the model.
BM25() - Constructor for class smile.nlp.relevance.BM25
Default constructor with k1 = 1.2, b = 0.75, delta = 1.0.
BM25(double, double, double) - Constructor for class smile.nlp.relevance.BM25
Constructor.
body - Variable in class smile.nlp.Text
The text body.
BreakIteratorSentenceSplitter - Class in smile.nlp.tokenizer
A sentence splitter based on the java.text.BreakIterator, which supports multiple natural languages (selected by locale setting).
BreakIteratorSentenceSplitter() - Constructor for class smile.nlp.tokenizer.BreakIteratorSentenceSplitter
Constructor for the default locale.
BreakIteratorSentenceSplitter(Locale) - Constructor for class smile.nlp.tokenizer.BreakIteratorSentenceSplitter
Constructor for the given locale.
BreakIteratorTokenizer - Class in smile.nlp.tokenizer
A word tokenizer based on the java.text.BreakIterator, which supports multiple natural languages (selected by locale setting).
BreakIteratorTokenizer() - Constructor for class smile.nlp.tokenizer.BreakIteratorTokenizer
Constructor for the default locale.
BreakIteratorTokenizer(Locale) - Constructor for class smile.nlp.tokenizer.BreakIteratorTokenizer
Constructor for the given locale.

C

CC - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Coordinating conjunction.
CD - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Cardinal number.
CLOSING_PARENTHESIS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ) ] }
CLOSING_QUOTATION - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ' or ''
COLON - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ; : ...
COMMA - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ,
compareTo(Bigram) - Method in class smile.nlp.collocation.Bigram
 
compareTo(NGram) - Method in class smile.nlp.collocation.NGram
 
compareTo(Relevance) - Method in class smile.nlp.relevance.Relevance
 
COMPREHENSIVE - Enum constant in enum class smile.nlp.dictionary.EnglishStopWords
A very long list of stop words.
CONCISE - Enum constant in enum class smile.nlp.dictionary.EnglishDictionary
A concise dictionary of common terms in English.
contains(String) - Method in interface smile.nlp.dictionary.Dictionary
Returns true if this dictionary contains the specified word.
contains(String) - Method in enum class smile.nlp.dictionary.EnglishDictionary
 
contains(String) - Method in class smile.nlp.dictionary.EnglishPunctuations
 
contains(String) - Method in enum class smile.nlp.dictionary.EnglishStopWords
 
contains(String) - Method in class smile.nlp.dictionary.SimpleDictionary
 
CooccurrenceKeywords - Interface in smile.nlp.keyword
Keyword extraction from a single document using word co-occurrence statistical information.
Corpus - Interface in smile.nlp
A corpus is a collection of documents.
count - Variable in class smile.nlp.collocation.Bigram
The frequency of bigram in the corpus.
count - Variable in class smile.nlp.collocation.NGram
The frequency of n-gram in the corpus.
count(String) - Method in interface smile.nlp.Corpus
Returns the total frequency of the term in the corpus.
count(String) - Method in class smile.nlp.SimpleCorpus
 
count(Bigram) - Method in interface smile.nlp.Corpus
Returns the total frequency of the bigram in the corpus.
count(Bigram) - Method in class smile.nlp.SimpleCorpus
 

D

DASH - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation -
DEFAULT - Enum constant in enum class smile.nlp.dictionary.EnglishStopWords
Default stop words list.
Dictionary - Interface in smile.nlp.dictionary
A dictionary is a set of words in some natural language.
dimension() - Method in class smile.nlp.embedding.Word2Vec
Returns the dimension of embedding vector space.
DT - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Determiner.

E

EnglishDictionary - Enum Class in smile.nlp.dictionary
A concise dictionary of common terms in English.
EnglishPOSLexicon - Class in smile.nlp.pos
An English lexicon with part-of-speech tags.
EnglishPunctuations - Class in smile.nlp.dictionary
Punctuation marks in English.
EnglishStopWords - Enum Class in smile.nlp.dictionary
Several sets of English stop words.
equals(Object) - Method in class smile.nlp.Bigram
 
equals(Object) - Method in class smile.nlp.NGram
 
equals(Object) - Method in class smile.nlp.SimpleText
 
EX - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Existential there.

F

fit(String[][], PennTreebankPOS[][]) - Static method in class smile.nlp.pos.HMMPOSTagger
Fits an HMM POS tagger by maximum likelihood estimation.
FW - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Foreign word.

G

get(String) - Method in class smile.nlp.embedding.Word2Vec
Returns the embedding vector of a word.
get(String) - Static method in class smile.nlp.pos.EnglishPOSLexicon
Returns the part-of-speech tags for given word, or null if the word does not exist in the dictionary.
get(K) - Method in class smile.nlp.Trie
Returns the node of a given key.
get(K[]) - Method in class smile.nlp.Trie
Returns the associated value of a given key.
getAbbreviation(String) - Method in interface smile.nlp.dictionary.Abbreviations
Returns the abbreviation for a word.
getAnchor() - Method in interface smile.nlp.AnchorText
Returns the anchor text if any.
getAnchor() - Method in class smile.nlp.SimpleText
Returns the anchor text if any.
getChild(K) - Method in class smile.nlp.Trie.Node
Returns the child with the key.
getChild(K[], int) - Method in class smile.nlp.Trie.Node
Returns the value matching the key sequence.
getDefault() - Static method in class smile.nlp.pos.HMMPOSTagger
Returns the default English POS tagger.
getFull(String) - Method in interface smile.nlp.dictionary.Abbreviations
Returns the full word of an abbreviation.
getInstance() - Static method in class smile.nlp.dictionary.EnglishPunctuations
Returns the singleton instance.
getInstance() - Static method in class smile.nlp.normalizer.SimpleNormalizer
Returns the singleton instance.
getInstance() - Static method in class smile.nlp.tokenizer.PennTreebankTokenizer
Returns the singleton instance.
getInstance() - Static method in class smile.nlp.tokenizer.SimpleParagraphSplitter
Returns the singleton instance.
getInstance() - Static method in class smile.nlp.tokenizer.SimpleSentenceSplitter
Returns the singleton instance.
getKey() - Method in class smile.nlp.Trie.Node
Returns the key.
getValue() - Method in class smile.nlp.Trie.Node
Returns the value.
getValue(String) - Static method in enum class smile.nlp.pos.PennTreebankPOS
Returns an enum value from a string.
GloVe - Class in smile.nlp.embedding
Global Vectors for Word Representation.
GloVe() - Constructor for class smile.nlp.embedding.GloVe
 
GOOGLE - Enum constant in enum class smile.nlp.dictionary.EnglishStopWords
The stop words list used by Google.

H

hashCode() - Method in class smile.nlp.Bigram
 
hashCode() - Method in class smile.nlp.NGram
 
hashCode() - Method in class smile.nlp.SimpleText
 
HMMPOSTagger - Class in smile.nlp.pos
Part-of-speech tagging with hidden Markov model.
HMMPOSTagger() - Constructor for class smile.nlp.pos.HMMPOSTagger
Constructor.

I

id - Variable in class smile.nlp.Text
The id of document in the corpus.
IN - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Preposition or subordinating conjunction.
iterator() - Method in interface smile.nlp.dictionary.Dictionary
Returns an iterator over the words in this dictionary.
iterator() - Method in enum class smile.nlp.dictionary.EnglishDictionary
 
iterator() - Method in class smile.nlp.dictionary.EnglishPunctuations
 
iterator() - Method in enum class smile.nlp.dictionary.EnglishStopWords
 
iterator() - Method in class smile.nlp.dictionary.SimpleDictionary
 

J

JJ - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adjective.
JJR - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adjective, comparative.
JJS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adjective, superlative.

L

LancasterStemmer - Class in smile.nlp.stemmer
The Paice/Husk Lancaster stemming algorithm.
LancasterStemmer() - Constructor for class smile.nlp.stemmer.LancasterStemmer
Constructor with default rules.
LancasterStemmer(boolean) - Constructor for class smile.nlp.stemmer.LancasterStemmer
Constructor with default rules.
LancasterStemmer(InputStream) - Constructor for class smile.nlp.stemmer.LancasterStemmer
Constructor with customized rules.
LancasterStemmer(InputStream, boolean) - Constructor for class smile.nlp.stemmer.LancasterStemmer
Constructor with customized rules.
LS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
List item marker.

M

main(String[]) - Static method in class smile.nlp.pos.HMMPOSTagger
Train the default model on WSJ and BROWN datasets.
maxtf() - Method in class smile.nlp.SimpleText
 
maxtf() - Method in interface smile.nlp.TextTerms
Returns the maximum term frequency over all terms in the document.
MD - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Modal verb.
MYSQL - Enum constant in enum class smile.nlp.dictionary.EnglishStopWords
The stop words list used by MySQL FullText feature.

N

nbigram() - Method in interface smile.nlp.Corpus
Returns the number of bigrams in the corpus.
nbigram() - Method in class smile.nlp.SimpleCorpus
 
ndoc() - Method in interface smile.nlp.Corpus
Returns the number of documents in the corpus.
ndoc() - Method in class smile.nlp.SimpleCorpus
 
NGram - Class in smile.nlp.collocation
An n-gram is a contiguous sequence of n words from a given sequence of text.
NGram - Class in smile.nlp
An n-gram is a contiguous sequence of n words from a given sequence of text.
NGram(String[]) - Constructor for class smile.nlp.NGram
Constructor.
NGram(String[], int) - Constructor for class smile.nlp.collocation.NGram
Constructor.
NN - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Noun, singular or mass.
NNP - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Proper noun, singular.
NNPS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Proper noun, plural.
NNS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Noun, plural.
Node(K) - Constructor for class smile.nlp.Trie.Node
Constructor.
normalize(String) - Method in interface smile.nlp.normalizer.Normalizer
Normalize the given string.
normalize(String) - Method in class smile.nlp.normalizer.SimpleNormalizer
 
Normalizer - Interface in smile.nlp.normalizer
Normalization transforms text into a canonical form by removing unwanted variations.
nterm() - Method in interface smile.nlp.Corpus
Returns the number of unique terms in the corpus.
nterm() - Method in class smile.nlp.SimpleCorpus
 

O

of(String) - Static method in interface smile.nlp.keyword.CooccurrenceKeywords
Returns the top 10 keywords.
of(String, int) - Static method in interface smile.nlp.keyword.CooccurrenceKeywords
Returns a given number of top keywords.
of(Path) - Static method in class smile.nlp.embedding.GloVe
Loads a GloVe model.
of(Path) - Static method in class smile.nlp.embedding.Word2Vec
Loads a pre-trained word2vec model from binary file of ByteOrder.LITTLE_ENDIAN.
of(Path, ByteOrder) - Static method in class smile.nlp.embedding.Word2Vec
Loads a pre-trained word2vec model from binary file.
of(Collection<String[]>, int, int) - Static method in class smile.nlp.collocation.NGram
Extracts n-gram phrases by an Apiori-like algorithm.
of(Corpus, double, int) - Static method in class smile.nlp.collocation.Bigram
Finds bigram collocations in the given corpus whose p-value is less than the given threshold.
of(Corpus, int, int) - Static method in class smile.nlp.collocation.Bigram
Finds top k bigram collocations in the given corpus.
open - Variable in enum class smile.nlp.pos.PennTreebankPOS
True if the POS is a open class.
OPENING_PARENTHESIS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ( [ {
OPENING_QUOTATION - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation ` or ``

P

ParagraphSplitter - Interface in smile.nlp.tokenizer
A paragraph splitter segments text into paragraphs.
PDT - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Predeterminer.
PennTreebankPOS - Enum Class in smile.nlp.pos
The Penn Treebank Tag set.
PennTreebankTokenizer - Class in smile.nlp.tokenizer
A word tokenizer that tokenizes English sentences using the conventions used by the Penn Treebank.
PorterStemmer - Class in smile.nlp.stemmer
Porter's stemming algorithm.
PorterStemmer() - Constructor for class smile.nlp.stemmer.PorterStemmer
Constructor.
POS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Possessive ending.
POSTagger - Interface in smile.nlp.pos
Part-of-speech tagging (POS tagging) is the process of marking up the words in a sentence as corresponding to a particular part of speech.
POUND - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Punctuation #
PRP - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Personal pronoun.
PRP$ - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Possessive pronoun.
Punctuations - Interface in smile.nlp.dictionary
Punctuation marks are symbols that indicate the structure and organization of written language, as well as intonation and pauses to be observed when reading aloud.
put(K[], V) - Method in class smile.nlp.Trie
Add a key with associated value to the trie.

R

rank(int, int, long, long) - Method in class smile.nlp.relevance.TFIDF
Returns the relevance score between a term and a document based on a corpus.
rank(Corpus, TextTerms, String[], int[], int) - Method in class smile.nlp.relevance.BM25
 
rank(Corpus, TextTerms, String[], int[], int) - Method in interface smile.nlp.relevance.RelevanceRanker
Returns the relevance score between a set of terms and a document based on a corpus.
rank(Corpus, TextTerms, String[], int[], int) - Method in class smile.nlp.relevance.TFIDF
 
rank(Corpus, TextTerms, String, int, int) - Method in class smile.nlp.relevance.BM25
 
rank(Corpus, TextTerms, String, int, int) - Method in interface smile.nlp.relevance.RelevanceRanker
Returns the relevance score between a term and a document based on a corpus.
rank(Corpus, TextTerms, String, int, int) - Method in class smile.nlp.relevance.TFIDF
 
RB - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adverb.
RBR - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adverb, comparative.
RBS - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Adverb, superlative.
read(Path, List<String[]>, List<PennTreebankPOS[]>) - Static method in class smile.nlp.pos.HMMPOSTagger
Load training data from a corpora.
Relevance - Class in smile.nlp.relevance
In the context of information retrieval, relevance denotes how well a retrieved set of documents meets the information need of the user.
Relevance(Text, double) - Constructor for class smile.nlp.relevance.Relevance
Constructor.
RelevanceRanker - Interface in smile.nlp.relevance
An interface to provide relevance ranking algorithm.
RP - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Particle.

S

score - Variable in class smile.nlp.collocation.Bigram
The chi-square statistical score of the collocation.
score - Variable in class smile.nlp.relevance.Relevance
The relevance score.
score(double, int, double, long, long) - Method in class smile.nlp.relevance.BM25
Returns the relevance score between a term and a document based on a corpus.
score(double, long, long) - Method in class smile.nlp.relevance.BM25
Returns the relevance score between a term and a document based on a corpus.
score(int, int, double, int, int, double, int, int, double, long, long) - Method in class smile.nlp.relevance.BM25
Returns the relevance score between a term and a document based on a corpus.
search(String) - Method in interface smile.nlp.Corpus
Returns the iterator over the set of documents containing the given term.
search(String) - Method in class smile.nlp.SimpleCorpus
 
search(RelevanceRanker, String) - Method in interface smile.nlp.Corpus
Returns the iterator over the set of documents containing the given term in descending order of relevance.
search(RelevanceRanker, String) - Method in class smile.nlp.SimpleCorpus
 
search(RelevanceRanker, String[]) - Method in interface smile.nlp.Corpus
Returns the iterator over the set of documents containing (at least one of) the given terms in descending order of relevance.
search(RelevanceRanker, String[]) - Method in class smile.nlp.SimpleCorpus
 
SENT - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Sentence-break punctuation .
SentenceSplitter - Interface in smile.nlp.tokenizer
A sentence splitter segments text into sentences (a string of words satisfying the grammatical rules of a language).
setAnchor(String) - Method in interface smile.nlp.AnchorText
Sets the anchor text.
setAnchor(String) - Method in class smile.nlp.SimpleText
Sets the anchor text.
SimpleCorpus - Class in smile.nlp
An in-memory text corpus.
SimpleCorpus() - Constructor for class smile.nlp.SimpleCorpus
Constructor.
SimpleCorpus(SentenceSplitter, Tokenizer, StopWords, Punctuations) - Constructor for class smile.nlp.SimpleCorpus
Constructor.
SimpleDictionary - Class in smile.nlp.dictionary
A simple implementation of dictionary interface.
SimpleDictionary(String) - Constructor for class smile.nlp.dictionary.SimpleDictionary
Constructor.
SimpleNormalizer - Class in smile.nlp.normalizer
A baseline normalizer for processing Unicode text.
SimpleParagraphSplitter - Class in smile.nlp.tokenizer
This is a simple paragraph splitter.
SimpleSentenceSplitter - Class in smile.nlp.tokenizer
This is a simple sentence splitter for English.
SimpleText - Class in smile.nlp
A list-of-words representation of documents.
SimpleText(String, String, String, String[]) - Constructor for class smile.nlp.SimpleText
Constructor.
SimpleTokenizer - Class in smile.nlp.tokenizer
A word tokenizer that tokenizes English sentences with some differences from TreebankWordTokenizer, notably on handling not-contractions.
SimpleTokenizer() - Constructor for class smile.nlp.tokenizer.SimpleTokenizer
Constructor.
SimpleTokenizer(boolean) - Constructor for class smile.nlp.tokenizer.SimpleTokenizer
Constructor.
size() - Method in interface smile.nlp.Corpus
Returns the number of words in the corpus.
size() - Method in interface smile.nlp.dictionary.Dictionary
Returns the number of words in this dictionary.
size() - Method in enum class smile.nlp.dictionary.EnglishDictionary
 
size() - Method in class smile.nlp.dictionary.EnglishPunctuations
 
size() - Method in enum class smile.nlp.dictionary.EnglishStopWords
 
size() - Method in class smile.nlp.dictionary.SimpleDictionary
 
size() - Method in class smile.nlp.SimpleCorpus
 
size() - Method in class smile.nlp.SimpleText
 
size() - Method in interface smile.nlp.TextTerms
Returns the number of words.
size() - Method in class smile.nlp.Trie
Returns the number of entries.
smile.nlp - package smile.nlp
Natural language processing.
smile.nlp.collocation - package smile.nlp.collocation
Collocation finding algorithms.
smile.nlp.dictionary - package smile.nlp.dictionary
Common dictionaries such as stop words, punctuation, common English words, etc.
smile.nlp.embedding - package smile.nlp.embedding
Word embedding.
smile.nlp.keyword - package smile.nlp.keyword
Keyword extraction.
smile.nlp.normalizer - package smile.nlp.normalizer
Text normalization.
smile.nlp.pos - package smile.nlp.pos
Part-of-speech taggers.
smile.nlp.relevance - package smile.nlp.relevance
Term-document relevance ranking algorithms.
smile.nlp.stemmer - package smile.nlp.stemmer
English word stemmer algorithms.
smile.nlp.tokenizer - package smile.nlp.tokenizer
Sentence splitter and word tokenizer.
split(String) - Method in class smile.nlp.tokenizer.BreakIteratorSentenceSplitter
 
split(String) - Method in class smile.nlp.tokenizer.BreakIteratorTokenizer
 
split(String) - Method in interface smile.nlp.tokenizer.ParagraphSplitter
Splits the text into paragraphs.
split(String) - Method in class smile.nlp.tokenizer.PennTreebankTokenizer
 
split(String) - Method in interface smile.nlp.tokenizer.SentenceSplitter
Splits the text into sentences.
split(String) - Method in class smile.nlp.tokenizer.SimpleParagraphSplitter
 
split(String) - Method in class smile.nlp.tokenizer.SimpleSentenceSplitter
 
split(String) - Method in class smile.nlp.tokenizer.SimpleTokenizer
 
split(String) - Method in interface smile.nlp.tokenizer.Tokenizer
Splits the string into a list of tokens.
stem(String) - Method in class smile.nlp.stemmer.LancasterStemmer
 
stem(String) - Method in class smile.nlp.stemmer.PorterStemmer
 
stem(String) - Method in interface smile.nlp.stemmer.Stemmer
Transforms a word into its root form.
Stemmer - Interface in smile.nlp.stemmer
A Stemmer transforms a word into its root form.
StopWords - Interface in smile.nlp.dictionary
A set of stop words in some language.
stripPluralParticiple(String) - Method in class smile.nlp.stemmer.PorterStemmer
Removes plurals and participles.
SYM - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Symbol.

T

tag(String[]) - Method in class smile.nlp.pos.HMMPOSTagger
 
tag(String[]) - Method in interface smile.nlp.pos.POSTagger
Tags the sentence in the form of a sequence of words.
terms() - Method in interface smile.nlp.Corpus
Returns the iterator over the terms in the corpus.
terms() - Method in class smile.nlp.SimpleCorpus
 
text - Variable in class smile.nlp.relevance.Relevance
The document to rank.
Text - Class in smile.nlp
A minimal interface of text in the corpus.
Text(String) - Constructor for class smile.nlp.Text
Constructor.
Text(String, String) - Constructor for class smile.nlp.Text
Constructor.
Text(String, String, String) - Constructor for class smile.nlp.Text
Constructor.
TextTerms - Interface in smile.nlp
The terms in a text.
tf(String) - Method in class smile.nlp.SimpleText
 
tf(String) - Method in interface smile.nlp.TextTerms
Returns the term frequency.
TFIDF - Class in smile.nlp.relevance
The tf-idf weight (term frequency-inverse document frequency) is a weight often used in information retrieval and text mining.
TFIDF() - Constructor for class smile.nlp.relevance.TFIDF
Constructor.
TFIDF(double) - Constructor for class smile.nlp.relevance.TFIDF
Constructor.
title - Variable in class smile.nlp.Text
The title of document;
TO - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
to.
Tokenizer - Interface in smile.nlp.tokenizer
A token is a string of characters, categorized according to the rules as a symbol.
toString() - Method in class smile.nlp.Bigram
 
toString() - Method in class smile.nlp.collocation.Bigram
 
toString() - Method in class smile.nlp.collocation.NGram
 
toString() - Method in class smile.nlp.NGram
 
toString() - Method in class smile.nlp.SimpleText
 
Trie<K,V> - Class in smile.nlp
A trie, also called digital tree or prefix tree, is an ordered tree data structure that is used to store a dynamic set or associative array where the keys are usually strings.
Trie() - Constructor for class smile.nlp.Trie
Constructor.
Trie(int) - Constructor for class smile.nlp.Trie
Constructor.
Trie.Node - Class in smile.nlp
The nodes in the trie.

U

UH - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Interjection.
unique() - Method in class smile.nlp.SimpleText
 
unique() - Method in interface smile.nlp.TextTerms
Returns the iterator of unique words.

V

valueOf(String) - Static method in enum class smile.nlp.dictionary.EnglishDictionary
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class smile.nlp.dictionary.EnglishStopWords
Returns the enum constant of this class with the specified name.
valueOf(String) - Static method in enum class smile.nlp.pos.PennTreebankPOS
Returns the enum constant of this class with the specified name.
values() - Static method in enum class smile.nlp.dictionary.EnglishDictionary
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class smile.nlp.dictionary.EnglishStopWords
Returns an array containing the constants of this enum class, in the order they are declared.
values() - Static method in enum class smile.nlp.pos.PennTreebankPOS
Returns an array containing the constants of this enum class, in the order they are declared.
VB - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, base form.
VBD - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, past tense.
VBG - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, gerund or present participle.
VBN - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, past participle.
VBP - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, non-3rd person singular present.
VBZ - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Verb, 3rd person singular present.
vectors - Variable in class smile.nlp.embedding.Word2Vec
The vector space.

W

w1 - Variable in class smile.nlp.Bigram
Immutable first word of bigram.
w2 - Variable in class smile.nlp.Bigram
Immutable second word of bigram.
walkin(File, List<File>) - Static method in class smile.nlp.pos.HMMPOSTagger
Recursive function to descend into the directory tree and find all the files that end with ".POS"
WDT - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Wh-determiner.
Word2Vec - Class in smile.nlp.embedding
Word2vec is a group of related models that are used to produce word embeddings.
Word2Vec(String[], float[][]) - Constructor for class smile.nlp.embedding.Word2Vec
Constructor.
words - Variable in class smile.nlp.embedding.Word2Vec
The vocabulary.
words - Variable in class smile.nlp.NGram
Immutable word sequences.
words() - Method in class smile.nlp.SimpleText
 
words() - Method in interface smile.nlp.TextTerms
Returns the iterator of the words of the document.
WP - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Wh-pronoun.
WP$ - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Possessive wh-pronoun.
WRB - Enum constant in enum class smile.nlp.pos.PennTreebankPOS
Wh-adverb.
$ A B C D E F G H I J L M N O P R S T U V W 
All Classes and Interfaces|All Packages|Serialized Form