Class UniversalTextTokenizer

    • Constructor Detail

      • UniversalTextTokenizer

        public UniversalTextTokenizer()
      • UniversalTextTokenizer

        public UniversalTextTokenizer​(Languages... languages)
    • Method Detail

      • getLanguage

        public Languages getLanguage()
        Description copied from interface: TextTokenizer
        Gets the language for the tokenizer.
        Returns:
        the language for this tokenizer.
      • stopWords

        public Set<String> stopWords()
        Description copied from interface: TextTokenizer
        Gets all stop-words for a language.
        Returns:
        the set of all stop-words.