A Chunker takes POS-tagged text and adds a chunk tag to each token, specifying whether a noun or verb phrase is starting or continuing.
A constituency parser turns a sentence into a constituency tree, a structure that is somewhat like chunking but hierarchical.
A trait for a tool that produces a dependency graph, such as the Stanford dependency parser. Subclasses should override dependencyGraphPostagged.
A representation of the constituency parse.
A representation of a part-of-speech tagged token. POS tokens use Penn Treebank-style tags.
A POS tagger takes tokenized input and associates a part-of-speech tag with each token.
A sentencer breaks text into sentences.
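As a rough illustration of what a sentencer does, the sketch below splits text with the JDK's `java.text.BreakIterator`. This is a swapped-in technique, not the library's implementation; the object name `BreakIteratorSentencer` and the method `sentences` are hypothetical.

```scala
import java.text.BreakIterator
import java.util.Locale

// Hypothetical sketch: sentence splitting with the JDK's BreakIterator,
// not the sentencer this library actually ships.
object BreakIteratorSentencer {
  def sentences(text: String): List[String] = {
    val boundaries = BreakIterator.getSentenceInstance(Locale.US)
    boundaries.setText(text)
    val found = List.newBuilder[String]
    var start = boundaries.first()
    var end = boundaries.next()
    // Walk consecutive boundary pairs until the iterator reports DONE.
    while (end != BreakIterator.DONE) {
      val sentence = text.substring(start, end).trim
      if (sentence.nonEmpty) found += sentence
      start = end
      end = boundaries.next()
    }
    found.result()
  }
}
```

Calling `BreakIteratorSentencer.sentences("Hello world. How are you?")` should yield one string per sentence.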
A stemmer takes a string token and produces a normalized form.
The simplest representation of a token. A token has a string and a character offset in the original text.
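A minimal sketch of such a token, assuming nothing beyond the two fields described above. The names `SimpleToken`, `end`, and `SimpleTokenExample` are hypothetical and are not taken from the library.

```scala
// Hypothetical sketch; SimpleToken is an illustrative name, not the library's class.
final case class SimpleToken(string: String, offset: Int) {
  // Exclusive end offset of the token in the original text.
  def end: Int = offset + string.length
}

object SimpleTokenExample {
  val text = "Chunkers tag phrases."
  val token = SimpleToken("tag", 9)
  // The character offset lets us recover the token from the source text.
  val recovered: String = text.substring(token.offset, token.end)
}
```

Keeping the offset alongside the string is what allows later stages to map annotations back onto the original text.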
A tokenizer takes a sentence string as input and separates words (tokens) along word (token) boundaries.
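To make the input and output concrete, here is a deliberately naive sketch that splits on whitespace only and records each token's character offset. Real tokenizers also separate punctuation and handle contractions; `OffsetToken` and `WhitespaceTokenizer` are hypothetical names, not the library's API.

```scala
// Hypothetical sketch of a tokenizer; it splits on whitespace only.
final case class OffsetToken(string: String, offset: Int)

object WhitespaceTokenizer {
  private val NonWhitespace = """\S+""".r

  // Each maximal run of non-whitespace characters becomes one token,
  // paired with the index at which it starts in the sentence.
  def tokenize(sentence: String): List[OffsetToken] =
    NonWhitespace.findAllMatchIn(sentence)
      .map(m => OffsetToken(m.matched, m.start))
      .toList
}
```

For the sentence `"The cat sat"` this produces three tokens starting at offsets 0, 4, and 8.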
Shared utilities for making Factorie work. These are probably not generally useful.
This object provides a function to generate a hash code out of multiple hashable parts.
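One common way to build a hash code from several parts is the multiply-and-add scheme sketched below. The source does not say which scheme the library uses, so the seed 17, the multiplier 31, and the name `HashCodeSketch` are all assumptions.

```scala
// Hypothetical sketch; the library's own combining scheme may differ.
object HashCodeSketch {
  // Fold the parts left to right: multiply the running hash by 31 and
  // add the next part's hash. `##` is Scala's null-safe hash method.
  def apply(parts: Any*): Int =
    parts.foldLeft(17)((acc, part) => 31 * acc + part.##)
}
```

Because each step multiplies the running value before adding, the result depends on the order of the parts, so `HashCodeSketch("a", 1)` and `HashCodeSketch(1, "a")` differ.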
A trivial stemmer that doesn't apply a stemming algorithm.
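The relationship between a stemmer and a trivial stemmer can be sketched as follows, assuming the trivial one returns its input unchanged (the source does not state this explicitly). `SimpleStemmer`, `IdentityStemmerSketch`, and `LowercasingStemmerSketch` are hypothetical names.

```scala
// Hypothetical sketch of a stemmer interface, not the library's trait.
trait SimpleStemmer {
  def stem(token: String): String
}

// Applies no algorithm: the normalized form is the token itself (assumed behavior).
object IdentityStemmerSketch extends SimpleStemmer {
  def stem(token: String): String = token
}

// A stand-in normalizer for contrast; lowercasing is not real stemming.
object LowercasingStemmerSketch extends SimpleStemmer {
  def stem(token: String): String = token.toLowerCase
}
```

A pass-through implementation like this is useful wherever a stemmer is required by an interface but no normalization is wanted.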
A representation of a chunked token. A chunked token has all the aspects of a POS-tagged token along with a chunk tag.
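The layering of token representations can be sketched as a small class hierarchy in which each level adds one field. The class names, the field names, and the `B-`/`I-` chunk tag convention below are assumptions for illustration; the library's actual classes may be shaped differently.

```scala
// Hypothetical sketch of the token layering; all names are illustrative.
class BasicToken(val string: String, val offset: Int)

// Adds a Penn Treebank-style part-of-speech tag.
class TaggedToken(string: String, offset: Int, val postag: String)
  extends BasicToken(string, offset)

// Adds a chunk tag; "B-NP" marks the start of a noun phrase and
// "I-NP" its continuation (an assumed tagging convention).
class PhraseToken(string: String, offset: Int, postag: String, val chunk: String)
  extends TaggedToken(string, offset, postag)

object PhraseTokenExample {
  // "The cat sat" with illustrative tags.
  val tokens: List[PhraseToken] = List(
    new PhraseToken("The", 0, "DT", "B-NP"),
    new PhraseToken("cat", 4, "NN", "I-NP"),
    new PhraseToken("sat", 8, "VBD", "B-VP")
  )
}
```

Because each level extends the previous one, code written against the plain token type also accepts tagged and chunked tokens.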