Package

com.johnsnowlabs.nlp

annotators

Permalink

package annotators

Visibility
  1. Public
  2. All

Type Members

  1. class ChunkTokenizer extends Tokenizer

    Permalink
  2. class ChunkTokenizerModel extends TokenizerModel

    Permalink
  3. class Chunker extends AnnotatorModel[Chunker]

    Permalink
  4. class DateMatcher extends AnnotatorModel[DateMatcher]

    Permalink

    Matches standard date formats into a provided format

  5. class Lemmatizer extends AnnotatorApproach[LemmatizerModel]

    Permalink

    Class to find standarized lemmas from words.

    Class to find standarized lemmas from words. Uses a user-provided or default dictionary.

  6. class LemmatizerModel extends AnnotatorModel[LemmatizerModel]

    Permalink
  7. class NGramGenerator extends AnnotatorModel[NGramGenerator]

    Permalink

    A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK).

    A feature transformer that converts the input array of strings (annotatorType TOKEN) into an array of n-grams (annotatorType CHUNK). Null values in the input array are ignored. It returns an array of n-grams where each n-gram is represented by a space-separated string of words.

    When the input is empty, an empty array is returned. When the input array length is less than n (number of elements per n-gram), no n-grams are returned.

  8. class Normalizer extends AnnotatorApproach[NormalizerModel]

    Permalink

    Annotator that cleans out tokens.

    Annotator that cleans out tokens. Requires stems, hence tokens

  9. class NormalizerModel extends AnnotatorModel[NormalizerModel]

    Permalink
  10. trait ReadablePretrainedLemmatizer extends ParamsAndFeaturesReadable[LemmatizerModel] with HasPretrained[LemmatizerModel]

    Permalink
  11. trait ReadablePretrainedTextMatcher extends ParamsAndFeaturesReadable[TextMatcherModel] with HasPretrained[TextMatcherModel]

    Permalink
  12. trait ReadablePretrainedTokenizer extends ParamsAndFeaturesReadable[TokenizerModel] with HasPretrained[TokenizerModel]

    Permalink
  13. class RegexMatcher extends AnnotatorApproach[RegexMatcherModel]

    Permalink
  14. class RegexMatcherModel extends AnnotatorModel[RegexMatcherModel]

    Permalink

    Matches regular expressions and maps them to specified values optionally provided Rules are provided from external source file

  15. class SimpleTokenizer extends AnnotatorModel[SimpleTokenizer]

    Permalink
  16. class Stemmer extends AnnotatorModel[Stemmer]

    Permalink

    Hard stemming of words for cut-of into standard word references

  17. class StopWordsCleaner extends AnnotatorModel[StopWordsCleaner]

    Permalink
  18. class TextMatcher extends AnnotatorApproach[TextMatcherModel]

    Permalink
  19. class TextMatcherModel extends AnnotatorModel[TextMatcherModel]

    Permalink

    Extracts entities out of provided phrases

  20. class Token2Chunk extends AnnotatorModel[Token2Chunk]

    Permalink
  21. class Tokenizer extends AnnotatorApproach[TokenizerModel]

    Permalink
  22. class TokenizerModel extends AnnotatorModel[TokenizerModel]

    Permalink

    Tokenizes raw text into word pieces, tokens.

Value Members

  1. object ChunkTokenizer extends DefaultParamsReadable[ChunkTokenizer] with Serializable

    Permalink
  2. object ChunkTokenizerModel extends ParamsAndFeaturesReadable[ChunkTokenizerModel] with Serializable

    Permalink
  3. object Chunker extends DefaultParamsReadable[Chunker] with Serializable

    Permalink
  4. object DateMatcher extends DefaultParamsReadable[DateMatcher] with Serializable

    Permalink
  5. object EnglishStemmer

    Permalink
  6. object Lemmatizer extends DefaultParamsReadable[Lemmatizer] with Serializable

    Permalink
  7. object LemmatizerModel extends ReadablePretrainedLemmatizer with Serializable

    Permalink
  8. object NGramGenerator extends ParamsAndFeaturesReadable[NGramGenerator] with Serializable

    Permalink
  9. object Normalizer extends DefaultParamsReadable[Normalizer] with Serializable

    Permalink
  10. object NormalizerModel extends ParamsAndFeaturesReadable[NormalizerModel] with Serializable

    Permalink
  11. object RegexMatcher extends DefaultParamsReadable[RegexMatcher] with Serializable

    Permalink
  12. object RegexMatcherModel extends ParamsAndFeaturesReadable[RegexMatcherModel] with Serializable

    Permalink
  13. object Stemmer extends DefaultParamsReadable[Stemmer] with Serializable

    Permalink
  14. object StopWordsCleaner extends ParamsAndFeaturesReadable[StopWordsCleaner] with Serializable

    Permalink
  15. object TextMatcher extends DefaultParamsReadable[TextMatcher] with Serializable

    Permalink
  16. object TextMatcherModel extends ReadablePretrainedTextMatcher with Serializable

    Permalink
  17. object Token2Chunk extends DefaultParamsReadable[Token2Chunk] with Serializable

    Permalink
  18. object Tokenizer extends DefaultParamsReadable[Tokenizer] with Serializable

    Permalink
  19. object TokenizerModel extends ReadablePretrainedTokenizer with Serializable

    Permalink
  20. package common

    Permalink
  21. package ner

    Permalink
  22. package param

    Permalink
  23. package parser

    Permalink
  24. package pos

    Permalink
  25. package sbd

    Permalink
  26. package sda

    Permalink
  27. package spell

    Permalink

Ungrouped