EnglishAnalyzerBuilder
- Source:
- AnalyzerBuilder.scala
Type members
Types
Value members
Concrete methods
Build the Analyzer wrapped inside a Resource.
Build the Analyzer wrapped inside a Resource.
- Source:
- AnalyzerBuilder.scala
Adds the Porter Stemmer to the end of the analyzer pipeline and enables lowercasing.
Stemming reduces words like jumping
and jumps
to their root word jump
.
NOTE: Lowercasing is forced as it is required by most Lucene stemmers.
Adds the Porter Stemmer to the end of the analyzer pipeline and enables lowercasing.
Stemming reduces words like jumping
and jumps
to their root word jump
.
NOTE: Lowercasing is forced as it is required by most Lucene stemmers.
- Source:
- AnalyzerBuilder.scala
Inherited methods
Build a tokenizing function that uses the Analyzer and collects tokens in a vector
Build a tokenizing function that uses the Analyzer and collects tokens in a vector
- Inherited from:
- AnalyzerBuilder
- Source:
- AnalyzerBuilder.scala
Adds an ASCII folding stage to the analyzer pipeline ASCII folding converts alphanumeric and symbolic Unicode characters into their ASCII equivalents, if one exists.
Adds an ASCII folding stage to the analyzer pipeline ASCII folding converts alphanumeric and symbolic Unicode characters into their ASCII equivalents, if one exists.
- Inherited from:
- AnalyzerBuilder
- Source:
- AnalyzerBuilder.scala
Adds a stop filter stage to analyzer pipeline for non-empty sets.
Adds a stop filter stage to analyzer pipeline for non-empty sets.
- Inherited from:
- AnalyzerBuilder
- Source:
- AnalyzerBuilder.scala
Adds a lowercasing stage to the analyzer pipeline
Adds a lowercasing stage to the analyzer pipeline
- Inherited from:
- AnalyzerBuilder
- Source:
- AnalyzerBuilder.scala
Concrete fields
A convenience value for debugging or investigating, to inspect the Lucene default stop words.
This set is immutable, and unused; it is the underlying Lucene CharArraySet
that we use to
build the default StopFilter
A convenience value for debugging or investigating, to inspect the Lucene default stop words.
This set is immutable, and unused; it is the underlying Lucene CharArraySet
that we use to
build the default StopFilter
- Source:
- AnalyzerBuilder.scala