Trait

org.mitre.jcarafe.crf

TextSeqGen

Related Doc: package crf

Permalink

trait TextSeqGen extends SeqGen[String] with FactoredSeqGen[String] with XmlConversions

Deserialization functionality for handling text input. This uses a customized lexer to identify tokens within a body of text, identifies sentence/zone boundaries and produce annotations as inline tags. It does not use an XML parser, however, and will therefore produce and consume files that do not conform to XML.

Linear Supertypes
XmlConversions, FactoredSeqGen[String], SeqGen[String], Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. TextSeqGen
  2. XmlConversions
  3. FactoredSeqGen
  4. SeqGen
  5. Serializable
  6. Serializable
  7. AnyRef
  8. Any
  1. Hide All
  2. Show all
Visibility
  1. Public
  2. All

Type Members

  1. type DeserializationT = TextSeqDeserialization

    Permalink
    Definition Classes
    TextSeqGenSeqGen
  2. abstract type FRepT <: FeatureRep[String]

    Permalink
    Definition Classes
    SeqGen
  3. type Seqs = IndexedSeq[SourceSequence[String]]

    Permalink
    Definition Classes
    SeqGen
  4. type Src = ObsSource[String]

    Permalink
    Definition Classes
    SeqGen

Abstract Value Members

  1. abstract val boundaries: Tagset

    Permalink
    Definition Classes
    SeqGen
  2. abstract def extractFeatures(src: SourceSequence[String]): InstanceSequence

    Permalink
    Definition Classes
    SeqGen
  3. abstract def extractFeatures(spSeqs: Seqs): IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  4. abstract val frep: FRepT

    Permalink
    Definition Classes
    SeqGen
  5. abstract def getNumberOfFeatures: Int

    Permalink
    Definition Classes
    SeqGen
  6. abstract def processSupportingFeatures(src: SourceSequence[String]): InstanceSequence

    Permalink
    Definition Classes
    SeqGen

Concrete Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. val MAX_SEQ_LEN: Int

    Permalink
  5. object StateCache

    Permalink
    Definition Classes
    FactoredSeqGen
  6. var addBeginStates: Boolean

    Permalink
    Definition Classes
    SeqGen
  7. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  8. def cleanUp(): Unit

    Permalink
    Definition Classes
    SeqGen
  9. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. def countFeatureTypes(spSeqs: Seqs): Unit

    Permalink
    Definition Classes
    SeqGen
  11. def countFeatureTypes(src: SourceSequence[String]): Unit

    Permalink
    Definition Classes
    SeqGen
  12. def countFeatureTypesFromFiles: Unit

    Permalink
    Definition Classes
    SeqGen
  13. def createDistributionalSource(dist: List[(AbstractLabel, Double)], obs: String, beg: Boolean, i: Map[String, String]): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  14. def createInstancesFromFiles: IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  15. def createSeqsFromFiles: IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  16. def createSeqsWithInput(dseq: IndexedSeq[DeserializationT]): IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  17. def createSeqsWithInput(d: DeserializationT): IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  18. def createSource(o: String, i: Map[String, String]): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  19. def createSource(l: AbstractLabel, o: String, i: Map[String, String]): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  20. def createSource(l: AbstractLabel, o: String, beg: Boolean, i: Map[String, String]): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  21. def createSource(l: AbstractLabel, o: String): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  22. def createSource(l: AbstractLabel, o: String, beg: Boolean): ObsSource[String]

    Permalink
    Definition Classes
    SeqGen
  23. def createSourceI(i: Int, o: String, b: Boolean, m: Option[Map[String, String]]): ObsSource[String]

    Permalink
    Attributes
    protected
    Definition Classes
    SeqGen
  24. def createSourceSequence(ss: Seq[ObsSource[String]], st: Int, en: Int): SourceSequence[String]

    Permalink
    Definition Classes
    SeqGen
  25. def createSourceSequence(ss: Seq[ObsSource[String]]): SourceSequence[String]

    Permalink

    Creates a source sequence.

    Creates a source sequence. Subclasses can over-ride this to add additional infor regarding sequences

    returns

    SourceSequence[Obs] - observation sequence

    Definition Classes
    SeqGen
  26. def createSourcesFromFiles: IndexedSeq[Seqs]

    Permalink
    Definition Classes
    SeqGen
  27. def deserializeFromFile(file: String): DeserializationT

    Permalink

    Create a deserialization from a file path provided as a String

    Create a deserialization from a file path provided as a String

    file

    File path as a string

    returns

    A deserialized object

    Definition Classes
    TextSeqGenSeqGen
  28. def deserializeFromFile(file: File): DeserializationT

    Permalink

    Create a deserialization from a File object

    Create a deserialization from a File object

    file

    File object

    returns

    A deserialized object

    Definition Classes
    SeqGen
  29. def deserializeFromRawString(s: String): DeserializationT

    Permalink
  30. def deserializeFromString(s: String): DeserializationT

    Permalink

    Create a deserialized object from a string containing a serialized representation

    Create a deserialized object from a string containing a serialized representation

    returns

    A deserialized object

    Definition Classes
    TextSeqGenSeqGen
  31. def deserializeFromTokenSeq(seq: Seq[String]): DeserializationT

    Permalink

    Create a deserialized object from a list of tokens

    Create a deserialized object from a list of tokens

    returns

    A deserialized object

    Definition Classes
    TextSeqGenSeqGen
  32. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  33. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  34. def evaluateSequences(seqs: Seq[InstanceSequence]): Unit

    Permalink
    Definition Classes
    SeqGen
  35. def extractFeaturesSeq(sourcePairSeqsSeq: IndexedSeq[Seqs]): IndexedSeq[InstanceSequence]

    Permalink
    Definition Classes
    SeqGen
  36. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  37. def gatherFeatureTables(d: DeserializationT): Unit

    Permalink
    Definition Classes
    SeqGen
  38. def getAccuracy: Double

    Permalink
    Definition Classes
    SeqGen
  39. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  40. def getIndex(l: AbstractLabel): Int

    Permalink
    Attributes
    protected
    Definition Classes
    SeqGen
  41. def getInducedFeatureMap: Option[InducedFeatureMap]

    Permalink

    Return the lexicon used by the feature manager

    Return the lexicon used by the feature manager

    returns

    An optional lexicon

    Definition Classes
    SeqGen
  42. def getLAlphabet: Alphabet[AbstractLabel]

    Permalink

    Return the label Alphabet to use.

    Return the label Alphabet to use. Should be overrided with a different alphabet when doing recoding, for example.

    returns

    A label alphabet to pass to the trainer/decoder

    Definition Classes
    SeqGen
  43. def getLabelAndAttrsFromTag(t: String): (String, Map[String, String])

    Permalink
  44. def getLexicon: Option[BloomLexicon]

    Permalink

    Return the lexicon used by the feature manager

    Return the lexicon used by the feature manager

    returns

    An optional lexicon

    Definition Classes
    SeqGen
  45. def getMaxSegmentSize: Int

    Permalink
    Definition Classes
    SeqGen
  46. def getModelName: String

    Permalink

    Return the name of the feature manager instance (i.e.

    Return the name of the feature manager instance (i.e. feature set name)

    returns

    A feature set name String

    Definition Classes
    SeqGen
  47. def getNumberOfNeuralFeatures: Int

    Permalink
    Definition Classes
    SeqGen
  48. def getNumberOfStates: Int

    Permalink

    Return the number of states in the model (when the size is fixed across the dataset)

    Return the number of states in the model (when the size is fixed across the dataset)

    returns

    Number of states in the model

    Definition Classes
    SeqGen
  49. def getState(l: AbstractLabel, b: Boolean): AbstractLabel

    Permalink
    Definition Classes
    SeqGen
  50. def getWordProps: Option[WordProperties]

    Permalink

    Return the lexicon used by the feature manager

    Return the lexicon used by the feature manager

    returns

    An optional lexicon

    Definition Classes
    SeqGen
  51. def getWordScores: Option[WordScores]

    Permalink

    Return the lexicon used by the feature manager

    Return the lexicon used by the feature manager

    returns

    An optional lexicon

    Definition Classes
    SeqGen
  52. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  53. var ignoreFlag: Boolean

    Permalink
  54. lazy val invLa: Map[Int, AbstractLabel]

    Permalink
    Definition Classes
    SeqGen
  55. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  56. val lAlphabet: Alphabet[AbstractLabel]

    Permalink
    Definition Classes
    SeqGen
  57. lazy val lexAttributedTagSet: Boolean

    Permalink
  58. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  59. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  60. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  61. val opts: Options

    Permalink

    A set of command-line options passed in by user

    A set of command-line options passed in by user

    Definition Classes
    SeqGen
  62. val other: AbstractLabel

    Permalink
    Definition Classes
    FactoredSeqGen
  63. def otherIndex: Option[Int]

    Permalink
    Definition Classes
    SeqGen
  64. def printElemAsSingleTag(empty: Boolean, x: Node): String

    Permalink
    Definition Classes
    XmlConversions
  65. val printExistingTags: Boolean

    Permalink
    Definition Classes
    SeqGen
  66. val recode: Boolean

    Permalink
    Definition Classes
    SeqGen
  67. val recodeAlphabet: Alphabet[AbstractLabel]

    Permalink
    Definition Classes
    SeqGen
  68. def reset(): Unit

    Permalink
    Definition Classes
    SeqGen
  69. def seqToWriter(d: DeserializationT, iSeq: InstanceSequence, os: OutputStreamWriter): Unit

    Permalink
  70. def seqsToAnnotations(d: DeserializationT, seqs: Seq[InstanceSequence]): Map[AbstractLabel, ListBuffer[Annotation]]

    Permalink
  71. def seqsToDeserialized(d: DeserializationT, seqs: Seq[InstanceSequence]): DeserializationT

    Permalink
  72. def seqsToFile(d: DeserializationT, seqs: Seq[InstanceSequence], f: File): Unit

    Permalink
  73. def seqsToSlicedWriter(d: DeserializationT, dseq: SourceSequence[String], iseqs: InstanceSequence, os: OutputStreamWriter): Unit

    Permalink
  74. def seqsToStream(d: DeserializationT, seqs: Seq[InstanceSequence], ostr: OutputStream, close: Boolean = true): Unit

    Permalink
  75. def seqsToString(d: DeserializationT, seqs: Seq[InstanceSequence]): String

    Permalink
  76. def seqsToWriter(d: DeserializationT, seqs: Seq[InstanceSequence], os: OutputStreamWriter, close: Boolean = true): Unit

    Permalink
  77. implicit def seqsToXmlAtts(x: Map[String, String]): MetaData

    Permalink
    Definition Classes
    XmlConversions
  78. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  79. def toSources(d: DeserializationT): Seqs

    Permalink

    Computes a sequence of sequences of ObsSource objects from a given deserialized object

    Computes a sequence of sequences of ObsSource objects from a given deserialized object

    returns

    A sequence of sequences of ObsSource objects

    Definition Classes
    TextSeqGenSeqGen
  80. def toSources(file: File): Seqs

    Permalink
    Definition Classes
    SeqGen
  81. def toSources(file: String): Seqs

    Permalink

    Takes a file path (as string) and converts it to a sequence of sequences of ObsSource objects

    Takes a file path (as string) and converts it to a sequence of sequences of ObsSource objects

    file

    A file path as a string

    returns

    A sequence of sequences of ObsSource objects that represent observations and auxiliary information.

    Definition Classes
    SeqGen
  82. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  83. var totalIncorrectTok: Int

    Permalink
    Definition Classes
    SeqGen
  84. var totalTokCnt: Int

    Permalink
    Definition Classes
    SeqGen
  85. val unrecodeAlphabet: Alphabet[AbstractLabel]

    Permalink
    Definition Classes
    SeqGen
  86. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  87. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  88. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  89. def writeTok(write: Boolean, t: String, os: OutputStreamWriter, lexInfo: Option[Map[String, String]] = None): Unit

    Permalink
  90. implicit def xmlAttsToSeq(x: MetaData): Map[String, String]

    Permalink
    Definition Classes
    XmlConversions

Inherited from XmlConversions

Inherited from FactoredSeqGen[String]

Inherited from SeqGen[String]

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped