A representation of a chunked token.
A chunker takes POS-tagged text and adds a chunk tag to each token, specifying whether a noun or verb phrase is starting or continuing.
A constituency parser turns a sentence into a constituency tree, a structure that resembles chunking but is hierarchical.
A trait for a tool that produces a dependency graph, such as the Stanford dependency parser.
A representation of the constituency parse.
A representation of a part-of-speech-tagged token.
A POS tagger takes tokenized input and associates a part-of-speech tag with each token.
A sentencer breaks text into sentences.
A stemmer takes a string token and produces a normalized form.
The simplest representation of a token.
A tokenizer takes a sentence string as input and splits it into tokens (words) along token boundaries.
Shared utilities for making Factorie work.
An object that provides a function to generate a hash code from multiple hashable parts.
A trivial stemmer that doesn't apply a stemming algorithm.
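
The descriptions above suggest a layering: a plain token, a POS-tagged token built on it, and a chunked token built on that. The following is a minimal sketch of how such pieces could fit together, not the library's actual API. Every name in it (PipelineSketch, Token, PostaggedToken, ChunkedToken, tokenize, identityStem, combineHashes) is hypothetical, the whitespace tokenizer is a deliberately naive stand-in, and the tags in the usage note are written by hand rather than produced by a real tagger or chunker.

```scala
// Illustrative sketch only: all names here are hypothetical, not the library's API.
object PipelineSketch {
  // Fold the hash codes of several parts into one value (31-multiplier scheme).
  // `##` is used instead of hashCode so that null parts do not throw.
  def combineHashes(parts: Any*): Int =
    parts.foldLeft(17)((acc, part) => 31 * acc + part.##)

  // Simplest token: the surface string and its character offset in the sentence.
  case class Token(string: String, offset: Int)

  // A token plus its part-of-speech tag (for example a Penn Treebank tag).
  case class PostaggedToken(token: Token, postag: String)

  // A POS-tagged token plus a chunk tag such as "B-NP" (phrase starts)
  // or "I-NP" (phrase continues).
  case class ChunkedToken(tagged: PostaggedToken, chunk: String)

  // Naive tokenizer: every run of non-whitespace characters becomes a token.
  def tokenize(sentence: String): List[Token] =
    "\\S+".r.findAllMatchIn(sentence).map(m => Token(m.matched, m.start)).toList

  // Trivial stemmer: returns the token unchanged.
  def identityStem(token: String): String = token
}
```

A possible usage, with hand-assigned tags for illustration:

```scala
import PipelineSketch._

val tokens  = tokenize("The cat sat")
val tagged  = tokens.zip(List("DT", "NN", "VBD")).map { case (t, p) => PostaggedToken(t, p) }
val chunked = tagged.zip(List("B-NP", "I-NP", "B-VP")).map { case (t, c) => ChunkedToken(t, c) }
```

Wrapping each representation in the next keeps the earlier information (string, offset, POS tag) available at every stage; a real implementation might use inheritance instead.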