core

Type Members

case class AnnotatedSentence(sentence: Sentence, annotation: IndexedSeq[FeatureVector]) extends Product with Serializable

An AnnotatedSentence is a sentence whose tokens are each annotated with a feature vector.
sentence
the unannotated sentence
annotation
an indexed sequence, of which the nth element is the feature vector for the nth token of the sentence
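The index alignment between tokens and feature vectors can be sketched as follows. This is a minimal illustration, not the library's code: the case classes are local stand-ins that copy the constructor signatures listed on this page, `FeatureVector` is not described here so a plain map is assumed in its place, and the `aligned` helper is hypothetical.

```scala
object AnnotatedSentenceSketch {
  // Local stand-ins mirroring the documented constructor signatures.
  final case class Token(word: Symbol, properties: Map[Symbol, Set[Symbol]] = Map())
  final case class Sentence(tokens: IndexedSeq[Token])

  // Assumption: FeatureVector is not described on this page, so a map from
  // feature name to value stands in for it.
  type FeatureVector = Map[String, Double]

  final case class AnnotatedSentence(sentence: Sentence, annotation: IndexedSeq[FeatureVector])

  // Hypothetical helper: pair the nth token with the nth feature vector.
  def aligned(as: AnnotatedSentence): IndexedSeq[(Token, FeatureVector)] = {
    require(
      as.annotation.length == as.sentence.tokens.length,
      "expected exactly one feature vector per token"
    )
    as.sentence.tokens.zip(as.annotation)
  }

  // Feature names here are made up for the example.
  val example: AnnotatedSentence = AnnotatedSentence(
    Sentence(Vector(Token(Symbol("dogs")), Token(Symbol("bark")))),
    Vector(Map("isNoun" -> 1.0), Map("isNoun" -> 0.0))
  )

  def main(args: Array[String]): Unit =
    aligned(example).foreach { case (token, features) =>
      println(s"${token.word.name}: $features")
    }
}
```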
case class BrownClustersTagger(clusters: Seq[BrownClusters]) extends SentenceTransform with Product with Serializable

The BrownClustersTagger tags the tokens of a sentence with their Brown clusters.
case class DirectedGraph(nodes: IndexedSeq[DirectedGraphNode], edgesByNode: IndexedSeq[Seq[DirectedGraphEdge]]) extends Product with Serializable

A directed graph.
The index of a node is its position in the nodes list (counting from 0).
nodes
the nodes of the directed graph
edgesByNode
each node's outgoing edges
case class DirectedGraphEdge(from: Int, to: Int, labels: Map[Symbol, String]) extends Product with Serializable

An edge of a directed graph.
from
index of the node at the edge's tail
to
index of the node at the edge's head
labels
a labeling of this edge
case class DirectedGraphNode(labels: Map[Symbol, String]) extends Product with Serializable

A node of a directed graph.
labels
a labeling of this node
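To make the index-based representation concrete, here is a small sketch that builds a three-node graph. The case classes are local stand-ins copying the signatures above. It assumes that `edgesByNode(i)` holds the outgoing edges of the node at index `i`, which is how the parameter descriptions read. The label keys (`name`, `rel`) and the `successors` helper are hypothetical.

```scala
object DirectedGraphSketch {
  // Local stand-ins mirroring the documented constructor signatures.
  final case class DirectedGraphNode(labels: Map[Symbol, String])
  final case class DirectedGraphEdge(from: Int, to: Int, labels: Map[Symbol, String])
  final case class DirectedGraph(
    nodes: IndexedSeq[DirectedGraphNode],
    edgesByNode: IndexedSeq[Seq[DirectedGraphEdge]]
  )

  // Hypothetical label keys, chosen only for this example.
  private val Name = Symbol("name")
  private val Rel = Symbol("rel")

  val graph: DirectedGraph = DirectedGraph(
    nodes = Vector(
      DirectedGraphNode(Map(Name -> "a")), // index 0
      DirectedGraphNode(Map(Name -> "b")), // index 1
      DirectedGraphNode(Map(Name -> "c"))  // index 2
    ),
    // Assumption: entry i lists the edges whose tail is node i.
    edgesByNode = Vector(
      Seq(
        DirectedGraphEdge(from = 0, to = 1, labels = Map(Rel -> "x")),
        DirectedGraphEdge(from = 0, to = 2, labels = Map(Rel -> "y"))
      ),
      Seq(DirectedGraphEdge(from = 1, to = 2, labels = Map(Rel -> "z"))),
      Seq.empty
    )
  )

  // Hypothetical helper: indices of the nodes one outgoing edge away.
  def successors(g: DirectedGraph, node: Int): Seq[Int] =
    g.edgesByNode(node).map(_.to)

  def main(args: Array[String]): Unit =
    graph.nodes.indices.foreach { i =>
      println(s"node $i -> ${successors(graph, i).mkString(", ")}")
    }
}
```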
case class GoogleUnigramDepLabelTagger(googleNgram: DatastoreGoogleNGram) extends SentenceTransform with Product with Serializable

Frequency distribution of dependency labels for tokens, based on Google Ngram's Nodes (unigrams).
case class GoogleUnigramPostagTagger(googleNgram: DatastoreGoogleNGram) extends SentenceTransform with Product with Serializable

Frequency distribution of POS tags for words, based on Google Ngram's Nodes (unigrams).
case class Position(components: Seq[Int]) extends Product with Serializable

A Position is essentially a pointer to a node in a rooted, directed tree. It is a sequence of non-negative integers.
The concept is easiest to describe by example. The root of the tree is the empty sequence. The first child of the root is Seq(0) (we count from zero). The third child of the first child of the root is Seq(2, 0). The seventh child of the third child of the first child of the root is Seq(6, 2, 0). In each case, the child's index is placed in front of its parent's sequence.
components
the sequence of integers corresponding to a node's position in a tree
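The examples above can be reproduced with a short sketch. The `Position` class below is a local stand-in with the same constructor signature, and the `child` and `parent` helpers are hypothetical names introduced for illustration. The sketch assumes, based on the examples, that a child's index is prepended to its parent's components.

```scala
object PositionSketch {
  // Local stand-in mirroring the documented constructor signature.
  final case class Position(components: Seq[Int])

  // The root of the tree is the empty sequence.
  val root: Position = Position(Seq.empty)

  // Hypothetical helper: position of the nth child (counting from zero).
  // The index is prepended, matching Seq(2, 0) for "third child of first child".
  def child(parent: Position, n: Int): Position = {
    require(n >= 0, "child indices are non-negative")
    Position(n +: parent.components)
  }

  // Hypothetical helper: position of the parent, or None for the root.
  def parent(p: Position): Option[Position] =
    if (p.components.isEmpty) None else Some(Position(p.components.tail))

  def main(args: Array[String]): Unit = {
    val firstChild = child(root, 0)            // equals Position(Seq(0))
    val thirdOfFirst = child(firstChild, 2)    // equals Position(Seq(2, 0))
    val seventhOfThird = child(thirdOfFirst, 6) // equals Position(Seq(6, 2, 0))
    println(Seq(firstChild, thirdOfFirst, seventhOfThird).mkString("\n"))
  }
}
```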
case class PositionTree(nodes: Seq[(Position, PositionTreeNode)]) extends Product with Serializable

A PositionTree is a rooted, directed tree, implemented as a map from positions to nodes.
nodes
a sequence of (position, node) pairs
case class PositionTreeNode(labels: Map[Symbol, String]) extends Product with Serializable

A node of a position tree.
labels
a map from label names to label values
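As a rough illustration of a tree stored as (position, node) pairs, the sketch below builds a four-node tree and looks nodes up by position. The case classes are local stand-ins copying the signatures above. The `label` key and the `nodeAt` and `childrenOf` helpers are hypothetical, and positions follow the prepend convention suggested by the Position examples.

```scala
object PositionTreeSketch {
  // Local stand-ins mirroring the documented constructor signatures.
  final case class Position(components: Seq[Int])
  final case class PositionTreeNode(labels: Map[Symbol, String])
  final case class PositionTree(nodes: Seq[(Position, PositionTreeNode)])

  // Hypothetical label name, chosen only for this example.
  private val Label = Symbol("label")

  val tree: PositionTree = PositionTree(Seq(
    Position(Seq.empty) -> PositionTreeNode(Map(Label -> "S")),  // root
    Position(Seq(0)) -> PositionTreeNode(Map(Label -> "NP")),    // first child of root
    Position(Seq(1)) -> PositionTreeNode(Map(Label -> "VP")),    // second child of root
    Position(Seq(0, 1)) -> PositionTreeNode(Map(Label -> "V"))   // first child of VP
  ))

  // Hypothetical helper: treat the pairs as a map and look up one position.
  def nodeAt(t: PositionTree, p: Position): Option[PositionTreeNode] =
    t.nodes.toMap.get(p)

  // Hypothetical helper: positions of p's children, ordered by child index.
  def childrenOf(t: PositionTree, p: Position): Seq[Position] =
    t.nodes
      .map(_._1)
      .filter(q => q.components.nonEmpty && q.components.tail == p.components)
      .sortBy(_.components.head)

  def main(args: Array[String]): Unit = {
    println(nodeAt(tree, Position(Seq(0, 1))))
    println(childrenOf(tree, Position(Seq.empty)))
  }
}
```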
case class Sentence(tokens: IndexedSeq[Token]) extends MarbleBlock with Product with Serializable

A Sentence is a sequence of tokens.
tokens
the sequence of tokens in the sentence
trait SentenceSource extends AnyRef

A data source for Sentence objects.
trait SentenceTransform extends AnyRef
class SubstitutionNode extends PositionTreeNode
class SubstitutionTree extends PositionTree
case class Token(word: Symbol, properties: Map[Symbol, Set[Symbol]] = Map()) extends Product with Serializable

A Token is the basic atom of a sentence.
word
the surface form of the token
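A minimal sketch of building a Sentence from Tokens follows. The case classes are local stand-ins with the signatures listed on this page (the `MarbleBlock` parent of Sentence is omitted because it is not described here). The property name `pos`, its value, and the `withProperty` and `words` helpers are hypothetical.

```scala
object SentenceSketch {
  // Local stand-ins mirroring the documented constructor signatures.
  final case class Token(word: Symbol, properties: Map[Symbol, Set[Symbol]] = Map())
  final case class Sentence(tokens: IndexedSeq[Token])

  // Hypothetical helper: add one value under a property name,
  // keeping any values already stored under that name.
  def withProperty(token: Token, name: Symbol, value: Symbol): Token = {
    val existing = token.properties.getOrElse(name, Set.empty[Symbol])
    token.copy(properties = token.properties.updated(name, existing + value))
  }

  val sentence: Sentence = Sentence(Vector(
    Token(Symbol("The")), // properties default to an empty map
    withProperty(Token(Symbol("cat")), Symbol("pos"), Symbol("NN")),
    Token(Symbol("sleeps"))
  ))

  // Hypothetical helper: the surface forms of the tokens, in order.
  def words(s: Sentence): IndexedSeq[String] = s.tokens.map(_.word.name)

  def main(args: Array[String]): Unit =
    println(words(sentence).mkString(" "))
}
```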
case class VerbnetTagger(verbnet: Verbnet) extends SentenceTransform with Product with Serializable

The VerbnetTagger tags the tokens of an input sentence with their Verbnet classes.

Value Members

object AnnotatedSentence extends Serializable
object ConstituencyParse
object FactorieLemmatizer extends SentenceTransform with Product with Serializable

The FactorieLemmatizer tags the tokens of an input sentence with their lemmas, according to the Factorie lemmatizer.
object FactorieSentenceTagger extends SentenceTransform with Product with Serializable

The FactorieSentenceTagger tags an input sentence with automatic part-of-speech tags from the Factorie tagger.
object LexicalPropertiesTagger extends SentenceTransform with Product with Serializable

The LexicalPropertiesTagger tags the tokens of an input sentence with lexical properties like whether the first letter is capitalized, or whether it contains numeric digits.
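The kind of tagging described here might look like the sketch below. It is not the library's implementation. This page lists no members for SentenceTransform, so the `transform` method is an assumption, as are the `CapsAndDigitsTagger` object, the `lexical` property name, and the `firstCap` and `hasDigit` values.

```scala
object LexicalPropertiesSketch {
  // Local stand-ins mirroring the documented constructor signatures.
  final case class Token(word: Symbol, properties: Map[Symbol, Set[Symbol]] = Map())
  final case class Sentence(tokens: IndexedSeq[Token])

  // Assumption: a transform maps one Sentence to another. The real trait's
  // members are not shown on this page.
  trait SentenceTransform {
    def transform(sentence: Sentence): Sentence
  }

  // Hypothetical tagger that records two lexical properties per token.
  object CapsAndDigitsTagger extends SentenceTransform {
    private val Lexical = Symbol("lexical")

    private def propertiesOf(word: String): Set[Symbol] = {
      val caps =
        if (word.headOption.exists(_.isUpper)) Set(Symbol("firstCap")) else Set.empty[Symbol]
      val digits =
        if (word.exists(_.isDigit)) Set(Symbol("hasDigit")) else Set.empty[Symbol]
      caps ++ digits
    }

    // Returns a new sentence; the input sentence is left unchanged.
    def transform(sentence: Sentence): Sentence =
      Sentence(sentence.tokens.map { token =>
        token.copy(properties =
          token.properties.updated(Lexical, propertiesOf(token.word.name)))
      })
  }

  def main(args: Array[String]): Unit = {
    val input = Sentence(Vector(Token(Symbol("Route")), Token(Symbol("66"))))
    CapsAndDigitsTagger.transform(input).tokens.foreach(println)
  }
}
```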
object NexusToken extends Token

The NexusToken is the "zeroth" token of a dependency parse.
object Position extends Serializable
object Sentence extends Serializable
object SentenceTransform
object StanfordSentenceTagger extends SentenceTransform with Product with Serializable

The StanfordSentenceTagger tags an input sentence with automatic part-of-speech tags from the Stanford tagger.
object Token extends Serializable
object Util
object WordClusters
