public interface NCCustomWord
NCCustomParser.parse(List)
method. This token is loosely
similar to NCToken
interface.NCModel.getParser()
Modifier and Type | Method and Description |
---|---|
int |
getEndCharIndex()
Gets end character index of this token.
|
String |
getLemma()
Gets the lemma of this token, i.e.
|
String |
getNormText()
Gets normalized user input text for this token.
|
String |
getOrigText()
Gets original user input text for this token.
|
String |
getPos()
Gets Penn Treebank POS tag for this token.
|
String |
getPosDescription()
Gets description of Penn Treebank POS tag.
|
int |
getStartCharIndex()
Gets start character index of this token.
|
String |
getStem()
Gets the stem of this token.
|
boolean |
isBracketed()
Gets whether or not this token is surrounded by any of
'[', ']', '{', '}', '(', ')' brackets. |
boolean |
isQuoted()
Gets whether or not this token is surrounded by single or double quotes.
|
boolean |
isStopword()
Gets whether or not this token is a stopword.
|
String getNormText()
String getOrigText()
int getStartCharIndex()
int getEndCharIndex()
String getPos()
'---'
synthetic tag to indicate a POS tag for multiword tokens.
Learn more at http://www.ling.upenn.edu/courses/Fall_2003/ling001/penn_treebank_pos.htmlString getPosDescription()
String getLemma()
String getStem()
boolean isStopword()
a, the, can, of, about, over
, etc. are typical
stopwords in English. NLPCraft has built-in set of stopwords.boolean isBracketed()
'[', ']', '{', '}', '(', ')'
brackets.'[', ']', '{', '}', '(', ')'
brackets.boolean isQuoted()
Copyright © 2013-2019 NLPCraft Project. All rights reserved.