Class/Object

org.zouzias.spark.lucenerdd

LuceneRDD

Related Docs: object LuceneRDD | package lucenerdd

Permalink

class LuceneRDD[T] extends RDD[T] with SparkScoreDocAggregatable

Spark RDD with Lucene's query capabilities (term, prefix, fuzzy, phrase query)

Linear Supertypes
SparkScoreDocAggregatable, LuceneRDDConfigurable, RDD[T], Logging, Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. LuceneRDD
  2. SparkScoreDocAggregatable
  3. LuceneRDDConfigurable
  4. RDD
  5. Logging
  6. Serializable
  7. Serializable
  8. AnyRef
  9. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new LuceneRDD(partitionsRDD: RDD[AbstractLuceneRDDPartition[T]])(implicit arg0: ClassTag[T])

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. def ++(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  4. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  5. val DefaultFacetNum: Int

    Permalink
    Attributes
    protected
    Definition Classes
    LuceneRDDConfigurable
  6. val DefaultTopK: Int

    Permalink

    Default value for topK queries

    Default value for topK queries

    Attributes
    protected
    Definition Classes
    LuceneRDDConfigurable
  7. val MaxDefaultTopKValue: Int

    Permalink
    Attributes
    protected
    Definition Classes
    LuceneRDDConfigurable
  8. val SparkDocTopKMonoid: TopKMonoid[SparkScoreDoc]

    Permalink
    Attributes
    protected
    Definition Classes
    SparkScoreDocAggregatable
  9. def aggregate[U](zeroValue: U)(seqOp: (U, T) ⇒ U, combOp: (U, U) ⇒ U)(implicit arg0: ClassTag[U]): U

    Permalink
    Definition Classes
    RDD
  10. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  11. def cache(): LuceneRDD.this.type

    Permalink
    Definition Classes
    RDD
  12. def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)]

    Permalink
    Definition Classes
    RDD
  13. def checkpoint(): Unit

    Permalink
    Definition Classes
    RDD
  14. def clearDependencies(): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    RDD
  15. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  16. def close(): Unit

    Permalink
  17. def coalesce(numPartitions: Int, shuffle: Boolean)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  18. def collect[U](f: PartialFunction[T, U])(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  19. def collect(): Array[T]

    Permalink
    Definition Classes
    RDD
  20. def compute(part: Partition, context: TaskContext): Iterator[T]

    Permalink

    RDD compute method.

    RDD compute method.

    Definition Classes
    LuceneRDD → RDD
  21. val config: Config

    Permalink
    Definition Classes
    LuceneRDDConfigurable
  22. def context: SparkContext

    Permalink
    Definition Classes
    RDD
  23. def count(): Long

    Permalink
    Definition Classes
    LuceneRDD → RDD
  24. def countApprox(timeout: Long, confidence: Double): PartialResult[BoundedDouble]

    Permalink
    Definition Classes
    RDD
    Annotations
    @Experimental()
  25. def countApproxDistinct(relativeSD: Double): Long

    Permalink
    Definition Classes
    RDD
  26. def countApproxDistinct(p: Int, sp: Int): Long

    Permalink
    Definition Classes
    RDD
    Annotations
    @Experimental()
  27. def countByValue()(implicit ord: Ordering[T]): Map[T, Long]

    Permalink
    Definition Classes
    RDD
  28. def countByValueApprox(timeout: Long, confidence: Double)(implicit ord: Ordering[T]): PartialResult[Map[T, BoundedDouble]]

    Permalink
    Definition Classes
    RDD
    Annotations
    @Experimental()
  29. final def dependencies: Seq[Dependency[_]]

    Permalink
    Definition Classes
    RDD
  30. def distinct(): RDD[T]

    Permalink
    Definition Classes
    RDD
  31. def distinct(numPartitions: Int)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  32. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  33. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  34. def exists(elem: T): Boolean

    Permalink
  35. def exists(doc: Map[String, String]): Boolean

    Permalink

    Lucene generic query

  36. def facetQueries(searchString: String, facetFields: Seq[String], topK: Int = DefaultTopK, facetNum: Int = DefaultFacetNum): (Iterable[SparkScoreDoc], Map[String, SparkFacetResult])

    Permalink

    Faceted query with multiple facets

  37. def facetQuery(searchString: String, facetField: String, topK: Int = DefaultTopK, facetNum: Int = DefaultFacetNum): (Iterable[SparkScoreDoc], SparkFacetResult)

    Permalink

    Faceted query

  38. def fields(): Set[String]

    Permalink

    Return all document fields

  39. def filter(pred: (T) ⇒ Boolean): LuceneRDD[T]

    Permalink
    Definition Classes
    LuceneRDD → RDD
  40. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  41. def first(): T

    Permalink
    Definition Classes
    RDD
  42. def firstParent[U](implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  43. def flatMap[U](f: (T) ⇒ TraversableOnce[U])(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  44. def fold(zeroValue: T)(op: (T, T) ⇒ T): T

    Permalink
    Definition Classes
    RDD
  45. def foreach(f: (T) ⇒ Unit): Unit

    Permalink
    Definition Classes
    RDD
  46. def foreachPartition(f: (Iterator[T]) ⇒ Unit): Unit

    Permalink
    Definition Classes
    RDD
  47. def fuzzyQuery(fieldName: String, query: String, maxEdits: Int, topK: Int = DefaultTopK): Iterable[SparkScoreDoc]

    Permalink

    Lucene fuzzy query

    Lucene fuzzy query

    fieldName

    Name of field

    query

    Query text

    maxEdits

    Fuzziness, edit distance

    topK

    Number of documents to return

  48. def getCheckpointFile: Option[String]

    Permalink
    Definition Classes
    RDD
  49. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  50. def getDependencies: Seq[Dependency[_]]

    Permalink
    Attributes
    protected
    Definition Classes
    RDD
  51. def getPartitions: Array[Partition]

    Permalink
    Attributes
    protected
    Definition Classes
    LuceneRDD → RDD
  52. def getPreferredLocations(s: Partition): Seq[String]

    Permalink
    Attributes
    protected
    Definition Classes
    LuceneRDD → RDD
  53. def getStorageLevel: StorageLevel

    Permalink
    Definition Classes
    RDD
  54. def glom(): RDD[Array[T]]

    Permalink
    Definition Classes
    RDD
  55. def groupBy[K](f: (T) ⇒ K, p: Partitioner)(implicit kt: ClassTag[K], ord: Ordering[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  56. def groupBy[K](f: (T) ⇒ K, numPartitions: Int)(implicit kt: ClassTag[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  57. def groupBy[K](f: (T) ⇒ K)(implicit kt: ClassTag[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  58. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  59. val id: Int

    Permalink
    Definition Classes
    RDD
  60. def intersection(other: RDD[T], numPartitions: Int): RDD[T]

    Permalink
    Definition Classes
    RDD
  61. def intersection(other: RDD[T], partitioner: Partitioner)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  62. def intersection(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  63. def isCheckpointed: Boolean

    Permalink
    Definition Classes
    RDD
  64. def isEmpty(): Boolean

    Permalink
    Definition Classes
    RDD
  65. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  66. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  67. final def iterator(split: Partition, context: TaskContext): Iterator[T]

    Permalink
    Definition Classes
    RDD
  68. def keyBy[K](f: (T) ⇒ K): RDD[(K, T)]

    Permalink
    Definition Classes
    RDD
  69. def localCheckpoint(): LuceneRDD.this.type

    Permalink
    Definition Classes
    RDD
  70. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  71. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  72. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  73. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  74. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  75. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  76. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  77. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  78. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  79. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  80. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  81. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  82. def map[U](f: (T) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  83. def mapPartitions[U](f: (Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  84. def mapPartitionsWithIndex[U](f: (Int, Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  85. def max()(implicit ord: Ordering[T]): T

    Permalink
    Definition Classes
    RDD
  86. def min()(implicit ord: Ordering[T]): T

    Permalink
    Definition Classes
    RDD
  87. var name: String

    Permalink
    Definition Classes
    RDD
  88. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  89. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  90. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  91. def parent[U](j: Int)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  92. val partitioner: Option[Partitioner]

    Permalink
    Definition Classes
    RDD
  93. final def partitions: Array[Partition]

    Permalink
    Definition Classes
    RDD
  94. def persist(newLevel: StorageLevel): LuceneRDD.this.type

    Permalink
    Definition Classes
    LuceneRDD → RDD
  95. def persist(): LuceneRDD.this.type

    Permalink
    Definition Classes
    RDD
  96. def phraseQuery(fieldName: String, query: String, topK: Int = DefaultTopK): Iterable[SparkScoreDoc]

    Permalink

    Lucene phrase Query

    Lucene phrase Query

    fieldName

    Name of field

    query

    Query text

    topK

    Number of documents to return

  97. def pipe(command: Seq[String], env: Map[String, String], printPipeContext: ((String) ⇒ Unit) ⇒ Unit, printRDDElement: (T, (String) ⇒ Unit) ⇒ Unit, separateWorkingDir: Boolean): RDD[String]

    Permalink
    Definition Classes
    RDD
  98. def pipe(command: String, env: Map[String, String]): RDD[String]

    Permalink
    Definition Classes
    RDD
  99. def pipe(command: String): RDD[String]

    Permalink
    Definition Classes
    RDD
  100. final def preferredLocations(split: Partition): Seq[String]

    Permalink
    Definition Classes
    RDD
  101. def prefixQuery(fieldName: String, query: String, topK: Int = DefaultTopK): Iterable[SparkScoreDoc]

    Permalink

    Lucene prefix query

    Lucene prefix query

    fieldName

    Name of field

    query

    Prefix query text

    topK

    Number of documents to return

  102. def query(searchString: String, topK: Int = DefaultTopK): Iterable[SparkScoreDoc]

    Permalink

    Generic query using Lucene's query parser

    Generic query using Lucene's query parser

    searchString

    Query String

  103. def randomSplit(weights: Array[Double], seed: Long): Array[RDD[T]]

    Permalink
    Definition Classes
    RDD
  104. def reduce(f: (T, T) ⇒ T): T

    Permalink
    Definition Classes
    RDD
  105. def repartition(numPartitions: Int)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  106. def sample(withReplacement: Boolean, fraction: Double, seed: Long): RDD[T]

    Permalink
    Definition Classes
    RDD
  107. def saveAsObjectFile(path: String): Unit

    Permalink
    Definition Classes
    RDD
  108. def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit

    Permalink
    Definition Classes
    RDD
  109. def saveAsTextFile(path: String): Unit

    Permalink
    Definition Classes
    RDD
  110. def setName(_name: String): LuceneRDD.this.type

    Permalink
    Definition Classes
    LuceneRDD → RDD
  111. def sortBy[K](f: (T) ⇒ K, ascending: Boolean, numPartitions: Int)(implicit ord: Ordering[K], ctag: ClassTag[K]): RDD[T]

    Permalink
    Definition Classes
    RDD
  112. def sparkContext: SparkContext

    Permalink
    Definition Classes
    RDD
  113. def subtract(other: RDD[T], p: Partitioner)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  114. def subtract(other: RDD[T], numPartitions: Int): RDD[T]

    Permalink
    Definition Classes
    RDD
  115. def subtract(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  116. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  117. def take(num: Int): Array[T]

    Permalink
    Definition Classes
    RDD
  118. def takeOrdered(num: Int)(implicit ord: Ordering[T]): Array[T]

    Permalink
    Definition Classes
    RDD
  119. def takeSample(withReplacement: Boolean, num: Int, seed: Long): Array[T]

    Permalink
    Definition Classes
    RDD
  120. def termQuery(fieldName: String, query: String, topK: Int = DefaultTopK): Iterable[SparkScoreDoc]

    Permalink

    Lucene term query

    Lucene term query

    fieldName

    Name of field

    query

    Term to search on

    topK

    Number of documents to return

  121. def toDebugString: String

    Permalink
    Definition Classes
    RDD
  122. def toJavaRDD(): JavaRDD[T]

    Permalink
    Definition Classes
    RDD
  123. def toLocalIterator: Iterator[T]

    Permalink
    Definition Classes
    RDD
  124. def toString(): String

    Permalink
    Definition Classes
    RDD → AnyRef → Any
  125. def top(num: Int)(implicit ord: Ordering[T]): Array[T]

    Permalink
    Definition Classes
    RDD
  126. def treeAggregate[U](zeroValue: U)(seqOp: (U, T) ⇒ U, combOp: (U, U) ⇒ U, depth: Int)(implicit arg0: ClassTag[U]): U

    Permalink
    Definition Classes
    RDD
  127. def treeReduce(f: (T, T) ⇒ T, depth: Int): T

    Permalink
    Definition Classes
    RDD
  128. def union(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  129. def unpersist(blocking: Boolean = true): LuceneRDD.this.type

    Permalink
    Definition Classes
    LuceneRDD → RDD
  130. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  131. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  132. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  133. def zip[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)]

    Permalink
    Definition Classes
    RDD
  134. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D])(f: (Iterator[T], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  135. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  136. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C])(f: (Iterator[T], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  137. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  138. def zipPartitions[B, V](rdd2: RDD[B])(f: (Iterator[T], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  139. def zipPartitions[B, V](rdd2: RDD[B], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  140. def zipWithIndex(): RDD[(T, Long)]

    Permalink
    Definition Classes
    RDD
  141. def zipWithUniqueId(): RDD[(T, Long)]

    Permalink
    Definition Classes
    RDD

Deprecated Value Members

  1. def filterWith[A](constructA: (Int) ⇒ A)(p: (T, A) ⇒ Boolean): RDD[T]

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and filter

  2. def flatMapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (T, A) ⇒ Seq[U])(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and flatMap

  3. def foreachWith[A](constructA: (Int) ⇒ A)(f: (T, A) ⇒ Unit): Unit

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex and foreach

  4. def mapPartitionsWithContext[U](f: (TaskContext, Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
    Annotations
    @DeveloperApi() @deprecated
    Deprecated

    (Since version 1.2.0) use TaskContext.get

  5. def mapPartitionsWithSplit[U](f: (Int, Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 0.7.0) use mapPartitionsWithIndex

  6. def mapWith[A, U](constructA: (Int) ⇒ A, preservesPartitioning: Boolean)(f: (T, A) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use mapPartitionsWithIndex

  7. def toArray(): Array[T]

    Permalink
    Definition Classes
    RDD
    Annotations
    @deprecated
    Deprecated

    (Since version 1.0.0) use collect

Inherited from SparkScoreDocAggregatable

Inherited from LuceneRDDConfigurable

Inherited from RDD[T]

Inherited from Logging

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped