org.zouzias.spark.lucenerdd.facets
Default value for topK queries
Default value for topK queries
List of string fields *not* to be analyzed
List of string fields *not* to be analyzed
RDD compute method.
RDD compute method.
Deduplication of self
Deduplication of self
Search query mapper function
Number of results to deduplication
Method to perform linkage
Lucene generic query
Lucene generic query
Faceted query with multiple facets
Faceted query with multiple facets
Lucene query string
Fields on which to compute facets
Number of results
Number of faceted results
Faceted query
Faceted query
Lucene query string
Field on which to compute facet
Number of results
Number of faceted results
Return all document fields
Return all document fields
Lucene fuzzy query
Lucene fuzzy query
Name of field
Query text
Fuzziness, edit distance
Number of documents to return
Entity linkage via Lucene query over all elements of an RDD.
Entity linkage via Lucene query over all elements of an RDD.
A type
RDD to be linked
Function that generates a search query for each element of other
Method to perform linkage, default value from configuration
an RDD of Tuple2 that contains the linked search Lucene documents in the second Note: Currently the query strings of the other RDD are collected to the driver and broadcast to the workers.
Entity linkage via Lucene query over all elements of an RDD.
Entity linkage via Lucene query over all elements of an RDD.
A type
RDD to be linked
Function that generates a Lucene Query object for each element of other
Method to perform linkage
an RDD of Tuple2 that contains the linked search Lucene Document in the second position
Entity linkage via Lucene query over all elements of an RDD.
Entity linkage via Lucene query over all elements of an RDD.
DataFrame to be linked
Function that generates a search query for each element of other
Method to perform linkage
an RDD of Tuple2 that contains the linked search Lucene documents in the second
Lucene's More Like This (MLT) functionality
Lucene's More Like This (MLT) functionality
Field name
Query text
Minimum term frequency
Minimum document frequency
Number of returned documents
Maps partition results
Maps partition results
Function to apply on each partition / distributed index
Lucene phrase Query
Lucene phrase Query
Name of field
Query text
Number of documents to return
Lucene prefix query
Lucene prefix query
Name of field
Prefix query text
Number of documents to return
Generic query using Lucene's query parser
Set the name for the RDD; By default set to "LuceneRDD"
Set the name for the RDD; By default set to "LuceneRDD"
Lucene term query
Lucene term query
Name of field
Term to search on
Number of documents to return
Return Term vector for a Lucene field
Return Term vector for a Lucene field
Field name for term vectors
Lucene field that contains unique id: default set to None, in which case id equals (docId, partitionId)
RDD of term vector entries, i.e., (document id, term as String, term frequency in document)
LuceneRDD with faceted functionality