FastText (deeplearning4j-nlp 1.0.0-M2 API)

java.lang.Object
- org.deeplearning4j.models.fasttext.FastText

All Implemented Interfaces:

Serializable, WordVectors, org.deeplearning4j.nn.weights.embeddings.EmbeddingInitializer
```
public class FastText
extends Object
implements WordVectors, Serializable
```
See Also:

Serialized Form

Constructor Summary

Constructors
Constructor and Description

FastText()

FastText(File modelPath)

Constructors
Constructor and Description
`FastText()`
`FastText(File modelPath)`

Method Summary

All Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`Map<String,Double>`	`accuracy(List<String> questions)` Accuracy based on questions which are a space separated list of strings where the first word is the query word, the next 2 words are negative, and the last word is the predicted word to be nearest
`void`	`fit()`
`int`	`getContextWindowSize()`
`int`	`getDimension()`
`int`	`getEpoch()`
`String`	`getLabelPrefix()`
`double`	`getLearningRate()`
`String`	`getLossName()`
`String`	`getModelName()`
`int`	`getNegativesNumber()`
`int`	`getNumberOfBuckets()`
`String`	`getUNK()`
`int`	`getWordNgrams()`
`double[]`	`getWordVector(String word)` Get the word vector for a given matrix
`org.nd4j.linalg.api.ndarray.INDArray`	`getWordVectorMatrix(String word)` Get the word vector for a given matrix
`org.nd4j.linalg.api.ndarray.INDArray`	`getWordVectorMatrixNormalized(String word)` Returns the word vector divided by the norm2 of the array
`org.nd4j.linalg.api.ndarray.INDArray`	`getWordVectors(Collection<String> labels)` This method returns 2D array, where each row represents corresponding word/label
`org.nd4j.linalg.api.ndarray.INDArray`	`getWordVectorsMean(Collection<String> labels)` This method returns mean vector, built from words/labels passed in
`boolean`	`hasWord(String word)` Returns true if the model has this word in the vocab
`int`	`indexOf(String word)`
`boolean`	`jsonSerializable()`
`void`	`loadBinaryModel(String modelPath)`
`void`	`loadIterator()`
`void`	`loadPretrainedVectors(File vectorsFile)`
`void`	`loadWeightsInto(org.nd4j.linalg.api.ndarray.INDArray array)`
`WeightLookupTable`	`lookupTable()` Lookup table for the vectors
`boolean`	`outOfVocabularySupported()` Does implementation vectorize words absent in vocabulary
`String`	`predict(String text)`
`org.nd4j.common.primitives.Pair<String,Float>`	`predictProbability(String text)`
`void`	`setModelUtils(ModelUtils utils)` Specifies ModelUtils to be used to access model
`void`	`setUNK(String input)`
`double`	`similarity(String word, String word2)` Returns the similarity of 2 words
`List<String>`	`similarWordsInVocabTo(String word, double accuracy)` Find all words with a similar characters in the vocab
`void`	`test(File testFile)`
`void`	`unloadBinaryModel()`
`int`	`vectorSize()`
`VocabCache`	`vocab()` Vocab for the vectors
`long`	`vocabSize()`
`Collection<String>`	`wordsNearest(Collection<String> positive, Collection<String> negative, int top)` Words nearest based on positive and negative words
`Collection<String>`	`wordsNearest(org.nd4j.linalg.api.ndarray.INDArray words, int top)`
`Collection<String>`	`wordsNearest(String word, int n)` Get the top n words most similar to the given word
`Collection<String>`	`wordsNearestSum(Collection<String> positive, Collection<String> negative, int top)` Words nearest based on positive and negative words
`Collection<String>`	`wordsNearestSum(org.nd4j.linalg.api.ndarray.INDArray words, int top)`
`Collection<String>`	`wordsNearestSum(String word, int n)` Get the top n words most similar to the given word

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Constructor Detail
  - FastText
```
public FastText(File modelPath)
```
  - FastText
```
public FastText()
```
- Method Detail
  - fit
```
public void fit()
```
  - loadIterator
```
public void loadIterator()
```
  - loadPretrainedVectors
```
public void loadPretrainedVectors(File vectorsFile)
```
  - loadBinaryModel
```
public void loadBinaryModel(String modelPath)
```
  - unloadBinaryModel
```
public void unloadBinaryModel()
```
  - test
```
public void test(File testFile)
```
  - predict
```
public String predict(String text)
```
  - predictProbability
```
public org.nd4j.common.primitives.Pair<String,Float> predictProbability(String text)
```
  - vocab
```
public VocabCache vocab()
```
    Description copied from interface: WordVectors
    
    Vocab for the vectors
    
    Specified by:
    
    vocab in interface WordVectors
    
    Returns:
  - vocabSize
```
public long vocabSize()
```
    Specified by:
    
    vocabSize in interface org.deeplearning4j.nn.weights.embeddings.EmbeddingInitializer
  - getUNK
```
public String getUNK()
```
    Specified by:
    
    getUNK in interface WordVectors
  - setUNK
```
public void setUNK(String input)
```
    Specified by:
    
    setUNK in interface WordVectors
  - getWordVector
```
public double[] getWordVector(String word)
```
    Description copied from interface: WordVectors
    
    Get the word vector for a given matrix
    
    Specified by:
    
    getWordVector in interface WordVectors
    
    Parameters:
    
    word - the word to get the matrix for
    
    Returns:
    
    the ndarray for this word
  - getWordVectorMatrixNormalized
```
public org.nd4j.linalg.api.ndarray.INDArray getWordVectorMatrixNormalized(String word)
```
    Description copied from interface: WordVectors
    
    Returns the word vector divided by the norm2 of the array
    
    Specified by:
    
    getWordVectorMatrixNormalized in interface WordVectors
    
    Parameters:
    
    word - the word to get the matrix for
    
    Returns:
    
    the looked up matrix
  - getWordVectorMatrix
```
public org.nd4j.linalg.api.ndarray.INDArray getWordVectorMatrix(String word)
```
    Description copied from interface: WordVectors
    
    Get the word vector for a given matrix
    
    Specified by:
    
    getWordVectorMatrix in interface WordVectors
    
    Parameters:
    
    word - the word to get the matrix for
    
    Returns:
    
    the ndarray for this word
  - getWordVectors
```
public org.nd4j.linalg.api.ndarray.INDArray getWordVectors(Collection<String> labels)
```
    Description copied from interface: WordVectors
    
    This method returns 2D array, where each row represents corresponding word/label
    
    Specified by:
    
    getWordVectors in interface WordVectors
    
    Returns:
  - getWordVectorsMean
```
public org.nd4j.linalg.api.ndarray.INDArray getWordVectorsMean(Collection<String> labels)
```
    Description copied from interface: WordVectors
    
    This method returns mean vector, built from words/labels passed in
    
    Specified by:
    
    getWordVectorsMean in interface WordVectors
    
    Returns:
  - hasWord
```
public boolean hasWord(String word)
```
    Description copied from interface: WordVectors
    
    Returns true if the model has this word in the vocab
    
    Specified by:
    
    hasWord in interface WordVectors
    
    Parameters:
    
    word - the word to test for
    
    Returns:
    
    true if the model has the word in the vocab
  - wordsNearest
```
public Collection<String> wordsNearest(org.nd4j.linalg.api.ndarray.INDArray words,
                                       int top)
```
    Specified by:
    
    wordsNearest in interface WordVectors
  - wordsNearestSum
```
public Collection<String> wordsNearestSum(org.nd4j.linalg.api.ndarray.INDArray words,
                                          int top)
```
    Specified by:
    
    wordsNearestSum in interface WordVectors
  - wordsNearestSum
```
public Collection<String> wordsNearestSum(String word,
                                          int n)
```
    Description copied from interface: WordVectors
    
    Get the top n words most similar to the given word
    
    Specified by:
    
    wordsNearestSum in interface WordVectors
    
    Parameters:
    
    word - the word to compare
    
    n - the n to get
    
    Returns:
    
    the top n words
  - wordsNearestSum
```
public Collection<String> wordsNearestSum(Collection<String> positive,
                                          Collection<String> negative,
                                          int top)
```
    Description copied from interface: WordVectors
    
    Words nearest based on positive and negative words
    
    Specified by:
    
    wordsNearestSum in interface WordVectors
    
    Parameters:
    
    positive - the positive words
    
    negative - the negative words
    
    top - the top n words
    
    Returns:
    
    the words nearest the mean of the words
  - accuracy
```
public Map<String,Double> accuracy(List<String> questions)
```
    Description copied from interface: WordVectors
    
    Accuracy based on questions which are a space separated list of strings where the first word is the query word, the next 2 words are negative, and the last word is the predicted word to be nearest
    
    Specified by:
    
    accuracy in interface WordVectors
    
    Parameters:
    
    questions - the questions to ask
    
    Returns:
    
    the accuracy based on these questions
  - indexOf
```
public int indexOf(String word)
```
    Specified by:
    
    indexOf in interface WordVectors
  - similarWordsInVocabTo
```
public List<String> similarWordsInVocabTo(String word,
                                          double accuracy)
```
    Description copied from interface: WordVectors
    
    Find all words with a similar characters in the vocab
    
    Specified by:
    
    similarWordsInVocabTo in interface WordVectors
    
    Parameters:
    
    word - the word to compare
    
    accuracy - the accuracy: 0 to 1
    
    Returns:
    
    the list of words that are similar in the vocab
  - wordsNearest
```
public Collection<String> wordsNearest(Collection<String> positive,
                                       Collection<String> negative,
                                       int top)
```
    Description copied from interface: WordVectors
    
    Words nearest based on positive and negative words
    
    Specified by:
    
    wordsNearest in interface WordVectors
    
    Parameters:
    
    positive - the positive words
    
    negative - the negative words
    
    top - the top n words
    
    Returns:
    
    the words nearest the mean of the words
  - wordsNearest
```
public Collection<String> wordsNearest(String word,
                                       int n)
```
    Description copied from interface: WordVectors
    
    Get the top n words most similar to the given word
    
    Specified by:
    
    wordsNearest in interface WordVectors
    
    Parameters:
    
    word - the word to compare
    
    n - the n to get
    
    Returns:
    
    the top n words
  - similarity
```
public double similarity(String word,
                         String word2)
```
    Description copied from interface: WordVectors
    
    Returns the similarity of 2 words
    
    Specified by:
    
    similarity in interface WordVectors
    
    Parameters:
    
    word - the first word
    
    word2 - the second word
    
    Returns:
    
    a normalized similarity (cosine similarity)
  - lookupTable
```
public WeightLookupTable lookupTable()
```
    Description copied from interface: WordVectors
    
    Lookup table for the vectors
    
    Specified by:
    
    lookupTable in interface WordVectors
    
    Returns:
  - setModelUtils
```
public void setModelUtils(ModelUtils utils)
```
    Description copied from interface: WordVectors
    
    Specifies ModelUtils to be used to access model
    
    Specified by:
    
    setModelUtils in interface WordVectors
  - loadWeightsInto
```
public void loadWeightsInto(org.nd4j.linalg.api.ndarray.INDArray array)
```
    Specified by:
    
    loadWeightsInto in interface org.deeplearning4j.nn.weights.embeddings.EmbeddingInitializer
  - vectorSize
```
public int vectorSize()
```
    Specified by:
    
    vectorSize in interface org.deeplearning4j.nn.weights.embeddings.EmbeddingInitializer
  - jsonSerializable
```
public boolean jsonSerializable()
```
    Specified by:
    
    jsonSerializable in interface org.deeplearning4j.nn.weights.embeddings.EmbeddingInitializer
  - getLearningRate
```
public double getLearningRate()
```
  - getDimension
```
public int getDimension()
```
  - getContextWindowSize
```
public int getContextWindowSize()
```
  - getEpoch
```
public int getEpoch()
```
  - getNegativesNumber
```
public int getNegativesNumber()
```
  - getWordNgrams
```
public int getWordNgrams()
```
  - getLossName
```
public String getLossName()
```
  - getModelName
```
public String getModelName()
```
  - getNumberOfBuckets
```
public int getNumberOfBuckets()
```
  - getLabelPrefix
```
public String getLabelPrefix()
```
  - outOfVocabularySupported
```
public boolean outOfVocabularySupported()
```
    Description copied from interface: WordVectors
    
    Does implementation vectorize words absent in vocabulary
    
    Specified by:
    
    outOfVocabularySupported in interface WordVectors
    
    Returns:
    
    boolean

Class FastText

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Constructor Detail

FastText

FastText

Method Detail

fit

loadIterator

loadPretrainedVectors

loadBinaryModel

unloadBinaryModel

test

predict

predictProbability

vocab

vocabSize

getUNK

setUNK

getWordVector

getWordVectorMatrixNormalized

getWordVectorMatrix

getWordVectors

getWordVectorsMean

hasWord

wordsNearest

wordsNearestSum

wordsNearestSum

wordsNearestSum

accuracy

indexOf

similarWordsInVocabTo

wordsNearest

wordsNearest

similarity

lookupTable

setModelUtils

loadWeightsInto

vectorSize

jsonSerializable

getLearningRate

getDimension

getContextWindowSize

getEpoch

getNegativesNumber

getWordNgrams

getLossName

getModelName

getNumberOfBuckets

getLabelPrefix

outOfVocabularySupported