public class BertWordPieceStreamTokenizer extends BertWordPieceTokenizer
splitPattern
Constructor and Description |
---|
BertWordPieceStreamTokenizer(InputStream tokens,
Charset encoding,
NavigableMap<String,Integer> vocab,
TokenPreProcess preTokenizePreProcessor,
TokenPreProcess tokenPreProcess) |
Modifier and Type | Method and Description |
---|---|
static String |
readAndClose(InputStream is,
Charset encoding) |
checkIfEmpty, countTokens, findLongestSubstring, getTokens, hasMoreTokens, nextToken, setTokenPreProcessor
public BertWordPieceStreamTokenizer(InputStream tokens, Charset encoding, NavigableMap<String,Integer> vocab, TokenPreProcess preTokenizePreProcessor, TokenPreProcess tokenPreProcess)
public static String readAndClose(InputStream is, Charset encoding)
Copyright © 2022. All rights reserved.