Class Word2Vec

    • Constructor Detail

      • Word2Vec

        public Word2Vec()
    • Method Detail

      • setTokenizerFactory

        public void setTokenizerFactory​(@NonNull
                                        @NonNull TokenizerFactory tokenizerFactory)
        This method defines TokenizerFactory instance to be using during model building
        Parameters:
        tokenizerFactory - TokenizerFactory instance
      • setSentenceIterator

        public void setSentenceIterator​(@NonNull
                                        @NonNull SentenceIterator iterator)
        This method defines SentenceIterator instance, that will be used as training corpus source
        Parameters:
        iterator - SentenceIterator instance
      • setSequenceIterator

        public void setSequenceIterator​(@NonNull
                                        @NonNull SequenceIterator<VocabWord> iterator)
        This method defines SequenceIterator instance, that will be used as training corpus source. Main difference with other iterators here: it allows you to pass already tokenized Sequence for training
        Parameters:
        iterator -
      • toJson

        public String toJson()
                      throws org.nd4j.shade.jackson.core.JsonProcessingException
        Throws:
        org.nd4j.shade.jackson.core.JsonProcessingException