org.allenai.nlpstack.parse.poly

core

package core

Visibility
  1. Public
  2. All

Type Members

  1. case class AnnotatedSentence(sentence: Sentence, annotation: IndexedSeq[FeatureVector]) extends Product with Serializable

    An AnnotatedSentence is a sentence whose tokens are each annotated with a feature vector.

    An AnnotatedSentence is a sentence whose tokens are each annotated with a feature vector.

    sentence

    the unannotated sentence

    annotation

    an indexed sequence, of which the nth element is the feature vector for the nth token of the sentence

  2. case class BrownClustersTagger(clusters: Seq[BrownClusters]) extends SentenceTransform with Product with Serializable

    The BrownClustersTagger tags the tokens of a sentence with their Brown clusters.

  3. case class DirectedGraph(nodes: IndexedSeq[DirectedGraphNode], edgesByNode: IndexedSeq[Seq[DirectedGraphEdge]]) extends Product with Serializable

    A directed graph.

    A directed graph.

    The index of a node is its position in the nodes list (counting from 0).

    nodes

    the nodes of the directed graph

    edgesByNode

    each node's outgoing edges

  4. case class DirectedGraphEdge(from: Int, to: Int, labels: Map[Symbol, String]) extends Product with Serializable

    An edge of a directed graph.

    An edge of a directed graph.

    from

    index of the node at the edge's tail

    to

    index of the node at the edge's head

    labels

    a labeling of this edge

  5. case class DirectedGraphNode(labels: Map[Symbol, String]) extends Product with Serializable

    A node of a directed graph.

    A node of a directed graph.

    labels

    a labeling of this node

  6. case class GoogleUnigramDepLabelTagger(googleNgram: DatastoreGoogleNGram) extends SentenceTransform with Product with Serializable

    Frequency Distribution of dependency labels for tokens based on Google Ngram's Nodes (unigrams).

  7. case class GoogleUnigramPostagTagger(googleNgram: DatastoreGoogleNGram) extends SentenceTransform with Product with Serializable

    Frequency Distribution of POS tags for words based on Google Ngram's Nodes (unigrams).

  8. case class Position(components: Seq[Int]) extends Product with Serializable

    A Position is a essentially a pointer to a node in a rooted, directed tree.

    A Position is a essentially a pointer to a node in a rooted, directed tree. It is a sequence of non-negative integers.

    It is probably easiest to describe the concept by example. The root of the tree is the empty sequence. The first child of the root is Seq(0) (we count from zero). The third child of the first child of the root is Seq(2, 0). The seventh child of the third child of the first child of the root is Seq(6, 2, 0). Etc.

    components

    the sequence of integers corresponding to a node's position in a tree

  9. case class PositionTree(nodes: Seq[(Position, PositionTreeNode)]) extends Product with Serializable

    A PositionTree is a rooted, directed tree, implemented as a map from positions to nodes.

    A PositionTree is a rooted, directed tree, implemented as a map from positions to nodes.

    nodes

    a sequence of (position, node) pairs

  10. case class PositionTreeNode(labels: Map[Symbol, String]) extends Product with Serializable

    A node of a position tree.

    A node of a position tree.

    labels

    a map from label names to label values

  11. case class Sentence(tokens: IndexedSeq[Token]) extends MarbleBlock with Product with Serializable

    A Sentence is a sequence of tokens.

    A Sentence is a sequence of tokens.

    tokens

    the sequence of tokens in the sentence

  12. trait SentenceSource extends AnyRef

    A data source for Sentence objects.

  13. trait SentenceTransform extends AnyRef

  14. class SubstitutionNode extends PositionTreeNode

  15. class SubstitutionTree extends PositionTree

  16. case class Token(word: Symbol, properties: Map[Symbol, Set[Symbol]] = Map()) extends Product with Serializable

    A Token is the basic atom of a sentence.

    A Token is the basic atom of a sentence.

    word

    the surface form of the token

  17. case class VerbnetTagger(verbnet: Verbnet) extends SentenceTransform with Product with Serializable

    The VerbnetTagger tags the tokens of an input sentence with their Verbnet classes.

Value Members

  1. object AnnotatedSentence extends Serializable

  2. object ConstituencyParse

  3. object FactorieLemmatizer extends SentenceTransform with Product with Serializable

    The FactorieLemmatizer tags the tokens of an input sentence with their lemmas, according to the Factorie lemmatizer.

  4. object FactorieSentenceTagger extends SentenceTransform with Product with Serializable

    The FactorieSentenceTagger tags an input sentence with automatic part-of-speech tags from the Factorie tagger.

  5. object LexicalPropertiesTagger extends SentenceTransform with Product with Serializable

    The LexicalPropertiesTagger tags the tokens of an input sentence with lexical properties like whether the first letter is capitalized, or whether it contains numeric digits.

  6. object NexusToken extends Token

    The NexusToken is the "zeroth" token of a dependency parse.

  7. object Position extends Serializable

  8. object Sentence extends Serializable

  9. object SentenceTransform

  10. object StanfordSentenceTagger extends SentenceTransform with Product with Serializable

    The StanfordSentenceTagger tags an input sentence with automatic part-of-speech tags from the Stanford tagger.

  11. object Token extends Serializable

  12. object Util

  13. object WordClusters

Ungrouped