public class BertWordPieceStreamTokenizer extends BertWordPieceTokenizer
Fields inherited from class BertWordPieceTokenizer: splitPattern

| Constructor and Description |
|---|
| BertWordPieceStreamTokenizer(InputStream tokens, Charset encoding, NavigableMap<String,Integer> vocab, TokenPreProcess preTokenizePreProcessor, TokenPreProcess tokenPreProcess) |

| Modifier and Type | Method and Description |
|---|---|
| static String | readAndClose(InputStream is, Charset encoding) |

Methods inherited from class BertWordPieceTokenizer: checkIfEmpty, countTokens, findLongestSubstring, getTokens, hasMoreTokens, nextToken, setTokenPreProcessor

public BertWordPieceStreamTokenizer(InputStream tokens, Charset encoding, NavigableMap<String,Integer> vocab, TokenPreProcess preTokenizePreProcessor, TokenPreProcess tokenPreProcess)

public static String readAndClose(InputStream is, Charset encoding)
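This page gives only the signature of readAndClose, so the following is a minimal sketch of the behaviour the name suggests: read the whole stream, decode it with the given charset, and close the stream. It is not the library's implementation. The class name ReadAndCloseSketch and the choice to wrap IOException in UncheckedIOException are assumptions made for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical stand-in class; not part of the documented library.
class ReadAndCloseSketch {

    // Assumed behaviour: drain the stream, decode it, and always close it.
    public static String readAndClose(InputStream is, Charset encoding) {
        // try-with-resources closes the stream on success and on failure.
        try (InputStream in = is) {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            // Decode once at the end so multi-byte characters that straddle
            // a chunk boundary are not corrupted.
            return new String(buffer.toByteArray(), encoding);
        } catch (IOException e) {
            // Assumption: surface I/O failures as unchecked exceptions.
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        InputStream is = new ByteArrayInputStream(
                "unaffable world".getBytes(StandardCharsets.UTF_8));
        System.out.println(readAndClose(is, StandardCharsets.UTF_8));
    }
}
```

A helper of this shape lets a caller turn the stream into a single String before tokenizing it, without having to manage the stream's lifetime separately.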
Copyright © 2022. All rights reserved.