Modifier and Type | Field and Description |
---|---|
protected SentenceIterator | Builder.sentenceIterator |
protected SentenceIterator | BaseTextVectorizer.sentenceIterator |
Modifier and Type | Method and Description |
---|---|
SentenceIterator | BaseTextVectorizer.getSentenceIterator() |
Modifier and Type | Method and Description |
---|---|
Builder | Builder.iterate(SentenceIterator sentenceIterator) |
void | BaseTextVectorizer.setSentenceIterator(SentenceIterator sentenceIterator) |
Constructor and Description |
---|
BagOfWordsVectorizer(VocabCache cache, TokenizerFactory tokenizerFactory, List<String> stopWords, int minWordFrequency, DocumentIterator docIter, SentenceIterator sentenceIterator, List<String> labels, InvertedIndex index, int batchSize, double sample, boolean stem, boolean cleanup) |
BaseTextVectorizer(VocabCache cache, TokenizerFactory tokenizerFactory, List<String> stopWords, int minWordFrequency, DocumentIterator docIter, SentenceIterator sentenceIterator, List<String> labels, InvertedIndex index, int batchSize, double sample, boolean stem, boolean cleanup) |
TfidfVectorizer(VocabCache cache, TokenizerFactory tokenizerFactory, List<String> stopWords, int minWordFrequency, DocumentIterator docIter, SentenceIterator sentenceIterator, List<String> labels, InvertedIndex index, int batchSize, double sample, boolean stem, boolean cleanup) |
Modifier and Type | Field and Description |
---|---|
protected SentenceIterator | Glove.Builder.sentenceIterator |
Modifier and Type | Method and Description |
---|---|
Glove.Builder | Glove.Builder.iterate(SentenceIterator iterator) |
LegacyGlove.Builder | LegacyGlove.Builder.iterate(SentenceIterator sentenceIterator) |
CoOccurrences.Builder | CoOccurrences.Builder.iterate(SentenceIterator sentenceIterator) |
Constructor and Description |
---|
CoOccurrences(TokenizerFactory tokenizerFactory, SentenceIterator sentenceIterator, int windowSize, VocabCache cache, CounterMap<String,String> coOCurreneCounts, boolean symmetric) |
LegacyGlove(VocabCache cache, SentenceIterator sentenceIterator, TextVectorizer textVectorizer, TokenizerFactory tokenizerFactory, GloveWeightLookupTable lookupTable, int layerSize, double learningRate, double xMax, int windowSize, CoOccurrences coOccurrences, List<String> stopWords, boolean stem, int batchSize, int minWordFrequency, double maxCount, int iterations, boolean symmetric, org.nd4j.linalg.api.rng.Random gen, boolean shuffle, long seed, int numWorkers): Deprecated. |
Modifier and Type | Method and Description |
---|---|
ParagraphVectors.Builder | ParagraphVectors.Builder.iterate(SentenceIterator iterator): Feeds a SentenceIterator that contains the training corpus into ParagraphVectors. |
Modifier and Type | Method and Description |
---|---|
SentenceTransformer.Builder | SentenceTransformer.Builder.iterator(SentenceIterator iterator) |
Modifier and Type | Field and Description |
---|---|
protected SentenceIterator | Word2Vec.sentenceIter |
protected SentenceIterator | Word2Vec.Builder.sentenceIterator |
Modifier and Type | Method and Description |
---|---|
Word2Vec.Builder | Word2Vec.Builder.iterate(SentenceIterator iterator): Feeds a SentenceIterator that contains the training corpus into Word2Vec. |
void | Word2Vec.setSentenceIter(SentenceIterator iterator) |
Constructor and Description |
---|
SentenceJobIterator(SentenceIterator iterator) |
Constructor and Description |
---|
Builder(SentenceIterator iterator): Assumes that each sentence in this iterator is a separate document or paragraph. |
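
The "one sentence per document" assumption above can be illustrated with a minimal, self-contained sketch. It does not use the library's classes: `LabelledDoc`, `SentencePerDocumentSketch`, and the `DOC_%d` label template are hypothetical names chosen for illustration, and the actual builder may generate labels differently.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical value type: one sentence treated as its own document.
class LabelledDoc {
    final String label;
    final String content;

    LabelledDoc(String label, String content) {
        this.label = label;
        this.content = content;
    }
}

class SentencePerDocumentSketch {
    // Assigns a generated label to every sentence, in order.
    // The template (for example "DOC_%d") is an assumption, not a library default.
    static List<LabelledDoc> label(List<String> sentences, String template) {
        List<LabelledDoc> docs = new ArrayList<>();
        for (int i = 0; i < sentences.size(); i++) {
            docs.add(new LabelledDoc(String.format(template, i), sentences.get(i)));
        }
        return docs;
    }

    public static void main(String[] args) {
        List<String> corpus = Arrays.asList("the cat sat", "the dog ran");
        for (LabelledDoc doc : label(corpus, "DOC_%d")) {
            System.out.println(doc.label + " -> " + doc.content);
        }
    }
}
```

Each sentence ends up with its own label, which is what lets a paragraph-level model learn one vector per sentence.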
Modifier and Type | Class and Description |
---|---|
class | AggregatingSentenceIterator: Simple wrapper that aggregates several SentenceIterators into a single flow. |
class | BaseSentenceIterator: Creates a baseline default. |
class | BasicLineIterator: Primitive single-line iterator, without any options involved. |
class | CollectionSentenceIterator |
class | FileSentenceIterator |
class | LineSentenceIterator: Treats each line as a sentence. |
class | LuceneSentenceIterator: Lucene sentence iterator. |
class | PrefetchingSentenceIterator: Wrapper over a SentenceIterator that prefetches sentences from the wrapped iterator in the background. Useful when the SentencePreProcessor implementation, or the whole pipeline behind the iterator, is CPU intensive. |
class | StreamLineIterator: Simple class for iterating over an InputStream line by line, returning each line as a string. Note that this class is NOT thread safe. |
class | SynchronizedSentenceIterator: Simple synchronized wrapper for SentenceIterator implementations. |
class | UimaSentenceIterator: Iterates over and returns sentences based on the passed-in analysis engine. |
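
As a rough illustration of the line-oriented iterators listed above, the following sketch reads an InputStream and returns each line as one sentence. It is a stand-in written for this page, not the library code: the `LineSource` interface and the `StreamLineSketch` class are hypothetical, and only the `hasNext()`/`nextSentence()` naming is borrowed from SentenceIterator.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.NoSuchElementException;

// Hypothetical stand-in for the iteration contract; not the library interface itself.
interface LineSource {
    boolean hasNext();
    String nextSentence();
}

// Treats every line of the stream as one sentence. Not thread safe.
class StreamLineSketch implements LineSource {
    private final BufferedReader reader;
    private String buffered; // next line, read ahead so hasNext() is accurate

    StreamLineSketch(InputStream in) {
        this.reader = new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8));
        advance();
    }

    // Reads one line ahead; buffered becomes null at end of stream.
    private void advance() {
        try {
            buffered = reader.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    @Override
    public boolean hasNext() {
        return buffered != null;
    }

    @Override
    public String nextSentence() {
        if (buffered == null) {
            throw new NoSuchElementException("no more lines");
        }
        String current = buffered;
        advance();
        return current;
    }
}

class StreamLineDemo {
    public static void main(String[] args) {
        byte[] corpus = "first sentence\nsecond sentence\n".getBytes(StandardCharsets.UTF_8);
        LineSource source = new StreamLineSketch(new ByteArrayInputStream(corpus));
        while (source.hasNext()) {
            System.out.println(source.nextSentence());
        }
    }
}
```

Because the reader and the one-line buffer are shared mutable state, concurrent callers would need external synchronization, which matches the thread-safety note on StreamLineIterator.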
Modifier and Type | Method and Description |
---|---|
static SentenceIterator | UimaSentenceIterator.createWithPath(String path): Creates a UIMA sentence iterator with the given path. |
Modifier and Type | Method and Description |
---|---|
AggregatingSentenceIterator.Builder | AggregatingSentenceIterator.Builder.addSentenceIterator(SentenceIterator iterator) |
Modifier and Type | Method and Description |
---|---|
AggregatingSentenceIterator.Builder | AggregatingSentenceIterator.Builder.addSentenceIterators(Collection<SentenceIterator> iterator) |
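
The two builder methods above suggest how several iterators can be combined into one flow. The sketch below shows one way this could work, draining the sources one after another. All types here (`SentenceSource`, `ListSource`, `AggregatingSketch`) are hypothetical stand-ins, and sequential draining is an assumption: this page does not state the order in which the real class visits its sources.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collection;
import java.util.List;
import java.util.NoSuchElementException;

// Hypothetical stand-in for the iteration contract; not the library interface itself.
interface SentenceSource {
    boolean hasNext();
    String nextSentence();
    void reset();
}

// In-memory source used only for the demonstration.
class ListSource implements SentenceSource {
    private final List<String> sentences;
    private int position = 0;

    ListSource(String... sentences) { this.sentences = Arrays.asList(sentences); }

    public boolean hasNext() { return position < sentences.size(); }
    public String nextSentence() { return sentences.get(position++); }
    public void reset() { position = 0; }
}

// Presents several sources as a single flow by draining them in insertion order.
class AggregatingSketch implements SentenceSource {
    private final List<SentenceSource> sources;
    private int current = 0;

    private AggregatingSketch(List<SentenceSource> sources) { this.sources = sources; }

    public boolean hasNext() {
        // Skip over exhausted (or empty) sources.
        while (current < sources.size() && !sources.get(current).hasNext()) {
            current++;
        }
        return current < sources.size();
    }

    public String nextSentence() {
        if (!hasNext()) {
            throw new NoSuchElementException("all sources are exhausted");
        }
        return sources.get(current).nextSentence();
    }

    public void reset() {
        for (SentenceSource source : sources) {
            source.reset();
        }
        current = 0;
    }

    static class Builder {
        private final List<SentenceSource> sources = new ArrayList<>();

        Builder addSentenceIterator(SentenceSource source) {
            sources.add(source);
            return this;
        }

        Builder addSentenceIterators(Collection<? extends SentenceSource> more) {
            sources.addAll(more);
            return this;
        }

        AggregatingSketch build() {
            return new AggregatingSketch(new ArrayList<>(sources));
        }
    }
}
```

A caller would chain `addSentenceIterator` and `addSentenceIterators` on the builder, call `build()`, and then consume the result like any single source.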
Constructor and Description |
---|
Builder(SentenceIterator iterator) |
SynchronizedSentenceIterator(SentenceIterator iterator) |
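
To make the idea of a synchronized wrapper concrete, here is a minimal sketch that guards every call to a wrapped source with the wrapper's own lock. The names `SentenceSource`, `CountingSource`, and `SynchronizedSketch` are hypothetical, and `pollSentence()` is an addition made for this example, not a method of the library class.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for the iteration contract; not the library interface itself.
interface SentenceSource {
    boolean hasNext();
    String nextSentence();
}

// Plain, unsynchronized source used only for the demonstration.
class CountingSource implements SentenceSource {
    private final int total;
    private int next = 0;

    CountingSource(int total) { this.total = total; }

    public boolean hasNext() { return next < total; }
    public String nextSentence() { return "sentence " + next++; }
}

// Serializes all access to the wrapped source through one monitor.
class SynchronizedSketch implements SentenceSource {
    private final SentenceSource delegate;

    SynchronizedSketch(SentenceSource delegate) { this.delegate = delegate; }

    @Override
    public synchronized boolean hasNext() { return delegate.hasNext(); }

    @Override
    public synchronized String nextSentence() { return delegate.nextSentence(); }

    // Hypothetical helper: checks and takes under a single lock acquisition.
    // Returns null once the source is exhausted.
    public synchronized String pollSentence() {
        return delegate.hasNext() ? delegate.nextSentence() : null;
    }
}

class SynchronizedDemo {
    // Drains the source from several threads and returns the distinct sentences seen.
    static Set<String> drain(SynchronizedSketch source, int threadCount) {
        Set<String> seen = ConcurrentHashMap.newKeySet();
        List<Thread> threads = new ArrayList<>();
        for (int i = 0; i < threadCount; i++) {
            Thread worker = new Thread(() -> {
                String sentence;
                while ((sentence = source.pollSentence()) != null) {
                    seen.add(sentence);
                }
            });
            threads.add(worker);
            worker.start();
        }
        for (Thread worker : threads) {
            try {
                worker.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return seen;
    }

    public static void main(String[] args) {
        Set<String> seen = drain(new SynchronizedSketch(new CountingSource(1000)), 4);
        System.out.println("distinct sentences: " + seen.size());
    }
}
```

Synchronizing `hasNext()` and `nextSentence()` individually does not make the pair atomic: two threads can both see `hasNext()` return true for the last sentence. The single-call `pollSentence()` helper in this sketch avoids that race, which is why the workers use it.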
Constructor and Description |
---|
SentenceIteratorConverter(SentenceIterator iterator) |
SentenceIteratorConverter(SentenceIterator iterator, LabelsSource generator) |
Modifier and Type | Interface and Description |
---|---|
interface | LabelAwareSentenceIterator: SentenceIterator that is aware of its label. |
Modifier and Type | Class and Description |
---|---|
class | LabelAwareFileSentenceIterator: Label-aware sentence iterator. |
class | LabelAwareListSentenceIterator: Iterates over an input stream with the textual format: label delimiter text |
class | LabelAwareUimaSentenceIterator: UIMA sentence iterator that is aware of the current file. |
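
The `label delimiter text` format mentioned for LabelAwareListSentenceIterator can be parsed along the following lines. This is only a sketch under assumptions: the `LabeledSentence` and `LabeledLineParser` names are hypothetical, the delimiter is passed in by the caller rather than taken from any library default, and only the first occurrence of the delimiter is treated as the separator.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical value type holding one parsed line.
class LabeledSentence {
    final String label;
    final String text;

    LabeledSentence(String label, String text) {
        this.label = label;
        this.text = text;
    }
}

class LabeledLineParser {
    // Splits "label<delimiter>text" at the first delimiter occurrence.
    static LabeledSentence parseLine(String line, String delimiter) {
        int index = line.indexOf(delimiter);
        if (index < 0) {
            throw new IllegalArgumentException("missing delimiter in line: " + line);
        }
        String label = line.substring(0, index).trim();
        String text = line.substring(index + delimiter.length()).trim();
        return new LabeledSentence(label, text);
    }

    // Parses every non-blank line; blank lines are skipped.
    static List<LabeledSentence> parseAll(List<String> lines, String delimiter) {
        List<LabeledSentence> parsed = new ArrayList<>();
        for (String line : lines) {
            if (!line.trim().isEmpty()) {
                parsed.add(parseLine(line, delimiter));
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("positive\tI liked it", "", "negative\tNot for me");
        for (LabeledSentence sentence : parseAll(lines, "\t")) {
            System.out.println(sentence.label + " => " + sentence.text);
        }
    }
}
```

Splitting at the first delimiter only keeps any later delimiter characters inside the sentence text, which is a design choice of this sketch rather than documented behavior.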
Copyright © 2016. All Rights Reserved.