
org.clulab.processors.clu.bio

BioTokenizerPreProcessor

Related Docs: object BioTokenizerPreProcessor | package bio


class BioTokenizerPreProcessor extends TokenizerPreProcessor

Preprocesses bio text, including Unicode normalization and removal of figure and table references. User: mihais. Date: 9/10/17.
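
As a rough illustration of the Unicode normalization step, the sketch below replaces individual characters through a lookup table, echoing the shape of the `unicodes: Map[Char, String]` member and the `replaceUnicodeWithAscii(origText: String): String` signature listed on this page. The object name `UnicodeSketch` and the four table entries are assumptions made for the example; the library's actual table and logic are not shown here.

```scala
object UnicodeSketch {
  // Hypothetical entries, chosen for illustration only.
  val unicodes: Map[Char, String] = Map(
    '\u03b1' -> "alpha", // Greek small letter alpha
    '\u03b2' -> "beta",  // Greek small letter beta
    '\u2013' -> "-",     // en dash
    '\u00a0' -> " "      // no-break space
  )

  // Replace every mapped character with its ASCII string; copy all other characters unchanged.
  def replaceUnicodeWithAscii(origText: String): String =
    origText.map(c => unicodes.getOrElse(c, c.toString)).mkString
}
```

For example, `UnicodeSketch.replaceUnicodeWithAscii("TGF\u03b2 1\u20132")` yields `"TGFbeta 1-2"`. Mapping a `Char` to a `String` (rather than to another `Char`) lets one symbol expand to a spelled-out name.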

Linear Supertypes
  1. TokenizerPreProcessor
  2. AnyRef
  3. Any

Instance Constructors

  1. new BioTokenizerPreProcessor(removeFigTabReferences: Boolean, removeBibReferences: Boolean)
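
The two constructor flags suggest that each cleanup stage can be switched on or off independently. The sketch below shows one way such gating might look. `FlagGatedPreProcessor` is a hypothetical stand-in, not the clulab class, and both regular expressions are placeholders assumed for the example.

```scala
// Hypothetical stand-in that only mirrors the two constructor flags.
class FlagGatedPreProcessor(removeFigTabReferences: Boolean, removeBibReferences: Boolean) {
  // Placeholder patterns, assumed for illustration only.
  private val figTab = """\s*\((?:Fig(?:ure)?|Table)\.?\s*\d+[A-Za-z]?\)""".r
  private val bib = """\s*\([A-Z][A-Za-z]+ et al\.,? \d{4}\)""".r

  // Run each stage only when its flag is set; otherwise pass the text through.
  def process(origText: String): String = {
    val noFigTab =
      if (removeFigTabReferences) figTab.replaceAllIn(origText, "") else origText
    if (removeBibReferences) bib.replaceAllIn(noFigTab, "") else noFigTab
  }
}
```

With both flags set to `false`, `process` returns its input unchanged.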


Value Members

  1. final def !=(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  5. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  7. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  8. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  9. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  10. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  11. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  12. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  13. final def notify(): Unit

    Definition Classes
    AnyRef
  14. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  15. def process(origText: String): String

  16. def removeBibRefs(text: String): String

    Removes bibliographic references from the given text string if the removeBibReferences flag is set for this class instance.

    returns

    The original text or the cleaned text, depending on the setting of the removeBibReferences flag.

  17. def removeFigTabRefs(pattern: Pattern, text: String): String

    Removes references to tables and figures.

    pattern

    Fig/Tab pattern

    text

    The original text

    returns

    The cleaned text

  18. def removeFigureAndTableReferences(origText: String): String

  19. def replaceUnicodeWithAscii(origText: String): String

  20. def stringHasBibRef(stringInParens: String): Boolean

    Tells whether the given parentheses-bounded string contains a bibliographic reference.

  21. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  22. def toString(): String

    Definition Classes
    AnyRef → Any
  23. val unicodes: Map[Char, String]

  24. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
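
The reference-removal members listed above (`removeFigTabRefs`, `stringHasBibRef`, `removeBibRefs`) can be pictured with a small regex-based sketch. Everything here is an assumption for illustration: the object name `RefRemovalSketch`, the figure/table pattern, and the "et al or four-digit year" heuristic are not taken from the library, and the removeBibReferences flag check is left out.

```scala
import java.util.regex.{Matcher, Pattern}

object RefRemovalSketch {
  // Assumed pattern for parenthesized figure/table mentions.
  val FigTabPattern: Pattern = Pattern.compile(
    """\s*\((?:Supplementary\s+)?(?:Fig(?:ure)?s?|Tables?)\b\.?\s*[^()]*\)""",
    Pattern.CASE_INSENSITIVE)

  // Same shape as removeFigTabRefs(pattern, text): delete every match of the supplied pattern.
  def removeFigTabRefs(pattern: Pattern, text: String): String =
    pattern.matcher(text).replaceAll("")

  // Assumed heuristic: treat the string as a citation if it has "et al" or a four-digit year.
  private val BibCue = Pattern.compile("""\bet al\b|\b(?:19|20)\d{2}\b""")

  def stringHasBibRef(stringInParens: String): Boolean =
    BibCue.matcher(stringInParens).find()

  // Matches a non-nested parenthesized span plus any whitespace before it.
  private val Parens = Pattern.compile("""\s*\(([^()]*)\)""")

  // Drop each parenthesized span that stringHasBibRef flags; keep all others as they are.
  def removeBibRefs(text: String): String = {
    val m = Parens.matcher(text)
    val sb = new StringBuffer
    while (m.find()) {
      val replacement =
        if (stringHasBibRef(m.group(1))) "" else Matcher.quoteReplacement(m.group())
      m.appendReplacement(sb, replacement)
    }
    m.appendTail(sb)
    sb.toString
  }
}
```

Passing the pattern as an argument, as the `removeFigTabRefs(pattern: Pattern, text: String)` signature does, lets the same helper serve several reference styles. Splitting the citation test into a predicate keeps the scanning loop independent of the heuristic.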
