Class/Object

com.memsql.spark.connector.rdd

MemSQLRDD

Related Docs: object MemSQLRDD | package rdd

Permalink

case class MemSQLRDD[T](sc: SparkContext, cluster: MemSQLCluster, sql: String, sqlParams: Seq[Any] = Nil, databaseName: Option[String] = None, mapRow: (ResultSet) ⇒ T = MemSQLRDD.resultSetToArray, disablePartitionPushdown: Boolean = false, enableStreaming: Boolean = false)(implicit evidence$1: ClassTag[T]) extends RDD[T] with Product with Serializable

An org.apache.spark.rdd.RDD that can read data from a MemSQL database based on a SQL query.

If the given query supports it, this RDD will read data directly from the MemSQL cluster's leaf nodes rather than from the master aggregator, which typically results in much faster reads. However, if the given query does not support this (e.g. queries involving joins or GROUP BY operations), the results will be returned in a single partition.

cluster

A connected MemSQLCluster instance.

sql

The text of the query. Can be a prepared statement template, in which case parameters from sqlParams are substituted.

sqlParams

The parameters of the query if sql is a template.

databaseName

Optionally provide a database name for this RDD. This is required for Partition Pushdown

mapRow

A function from a ResultSet to a single row of the desired result type(s). This should only call getInt, getString, etc; the RDD takes care of calling next. The default maps a ResultSet to an array of Any.

Linear Supertypes
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. MemSQLRDD
  2. Product
  3. Equals
  4. RDD
  5. Logging
  6. Serializable
  7. Serializable
  8. AnyRef
  9. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MemSQLRDD(sc: SparkContext, cluster: MemSQLCluster, sql: String, sqlParams: Seq[Any] = Nil, databaseName: Option[String] = None, mapRow: (ResultSet) ⇒ T = MemSQLRDD.resultSetToArray, disablePartitionPushdown: Boolean = false, enableStreaming: Boolean = false)(implicit arg0: ClassTag[T])

    Permalink

    cluster

    A connected MemSQLCluster instance.

    sql

    The text of the query. Can be a prepared statement template, in which case parameters from sqlParams are substituted.

    sqlParams

    The parameters of the query if sql is a template.

    databaseName

    Optionally provide a database name for this RDD. This is required for Partition Pushdown

    mapRow

    A function from a ResultSet to a single row of the desired result type(s). This should only call getInt, getString, etc; the RDD takes care of calling next. The default maps a ResultSet to an array of Any.

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. def ++(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  4. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  5. def aggregate[U](zeroValue: U)(seqOp: (U, T) ⇒ U, combOp: (U, U) ⇒ U)(implicit arg0: ClassTag[U]): U

    Permalink
    Definition Classes
    RDD
  6. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  7. def cache(): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  8. def cartesian[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)]

    Permalink
    Definition Classes
    RDD
  9. def checkpoint(): Unit

    Permalink
    Definition Classes
    RDD
  10. def clearDependencies(): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    RDD
  11. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  12. val cluster: MemSQLCluster

    Permalink

    A connected MemSQLCluster instance.

  13. def coalesce(numPartitions: Int, shuffle: Boolean, partitionCoalescer: Option[PartitionCoalescer])(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  14. def collect[U](f: PartialFunction[T, U])(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  15. def collect(): Array[T]

    Permalink
    Definition Classes
    RDD
  16. def compute(sparkPartition: Partition, context: TaskContext): Iterator[T]

    Permalink
    Definition Classes
    MemSQLRDD → RDD
  17. def context: SparkContext

    Permalink
    Definition Classes
    RDD
  18. def count(): Long

    Permalink
    Definition Classes
    RDD
  19. def countApprox(timeout: Long, confidence: Double): PartialResult[BoundedDouble]

    Permalink
    Definition Classes
    RDD
  20. def countApproxDistinct(relativeSD: Double): Long

    Permalink
    Definition Classes
    RDD
  21. def countApproxDistinct(p: Int, sp: Int): Long

    Permalink
    Definition Classes
    RDD
  22. def countByValue()(implicit ord: Ordering[T]): Map[T, Long]

    Permalink
    Definition Classes
    RDD
  23. def countByValueApprox(timeout: Long, confidence: Double)(implicit ord: Ordering[T]): PartialResult[Map[T, BoundedDouble]]

    Permalink
    Definition Classes
    RDD
  24. val databaseName: Option[String]

    Permalink

    Optionally provide a database name for this RDD.

    Optionally provide a database name for this RDD. This is required for Partition Pushdown

  25. final def dependencies: Seq[Dependency[_]]

    Permalink
    Definition Classes
    RDD
  26. val disablePartitionPushdown: Boolean

    Permalink
  27. def distinct(): RDD[T]

    Permalink
    Definition Classes
    RDD
  28. def distinct(numPartitions: Int)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  29. val enableStreaming: Boolean

    Permalink
  30. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  31. def filter(f: (T) ⇒ Boolean): RDD[T]

    Permalink
    Definition Classes
    RDD
  32. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  33. def first(): T

    Permalink
    Definition Classes
    RDD
  34. def firstParent[U](implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  35. def flatMap[U](f: (T) ⇒ TraversableOnce[U])(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  36. def fold(zeroValue: T)(op: (T, T) ⇒ T): T

    Permalink
    Definition Classes
    RDD
  37. def foreach(f: (T) ⇒ Unit): Unit

    Permalink
    Definition Classes
    RDD
  38. def foreachPartition(f: (Iterator[T]) ⇒ Unit): Unit

    Permalink
    Definition Classes
    RDD
  39. def getCheckpointFile: Option[String]

    Permalink
    Definition Classes
    RDD
  40. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  41. def getDependencies: Seq[Dependency[_]]

    Permalink
    Attributes
    protected
    Definition Classes
    RDD
  42. final def getNumPartitions: Int

    Permalink
    Definition Classes
    RDD
    Annotations
    @Since( "1.6.0" )
  43. def getPartitions: Array[Partition]

    Permalink
    Definition Classes
    MemSQLRDD → RDD
  44. def getPreferredLocations(sparkPartition: Partition): Seq[String]

    Permalink
    Definition Classes
    MemSQLRDD → RDD
  45. def getStorageLevel: StorageLevel

    Permalink
    Definition Classes
    RDD
  46. def glom(): RDD[Array[T]]

    Permalink
    Definition Classes
    RDD
  47. def groupBy[K](f: (T) ⇒ K, p: Partitioner)(implicit kt: ClassTag[K], ord: Ordering[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  48. def groupBy[K](f: (T) ⇒ K, numPartitions: Int)(implicit kt: ClassTag[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  49. def groupBy[K](f: (T) ⇒ K)(implicit kt: ClassTag[K]): RDD[(K, Iterable[T])]

    Permalink
    Definition Classes
    RDD
  50. val id: Int

    Permalink
    Definition Classes
    RDD
  51. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  52. def intersection(other: RDD[T], numPartitions: Int): RDD[T]

    Permalink
    Definition Classes
    RDD
  53. def intersection(other: RDD[T], partitioner: Partitioner)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  54. def intersection(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  55. def isCheckpointed: Boolean

    Permalink
    Definition Classes
    RDD
  56. def isEmpty(): Boolean

    Permalink
    Definition Classes
    RDD
  57. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  58. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  59. final def iterator(split: Partition, context: TaskContext): Iterator[T]

    Permalink
    Definition Classes
    RDD
  60. def keyBy[K](f: (T) ⇒ K): RDD[(K, T)]

    Permalink
    Definition Classes
    RDD
  61. def localCheckpoint(): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  62. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  63. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  64. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  65. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  66. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  67. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  68. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  69. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  70. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  71. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  72. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  73. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  74. def map[U](f: (T) ⇒ U)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  75. def mapPartitions[U](f: (Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  76. def mapPartitionsWithIndex[U](f: (Int, Iterator[T]) ⇒ Iterator[U], preservesPartitioning: Boolean)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Definition Classes
    RDD
  77. val mapRow: (ResultSet) ⇒ T

    Permalink

    A function from a ResultSet to a single row of the desired result type(s).

    A function from a ResultSet to a single row of the desired result type(s). This should only call getInt, getString, etc; the RDD takes care of calling next. The default maps a ResultSet to an array of Any.

  78. def max()(implicit ord: Ordering[T]): T

    Permalink
    Definition Classes
    RDD
  79. def min()(implicit ord: Ordering[T]): T

    Permalink
    Definition Classes
    RDD
  80. var name: String

    Permalink
    Definition Classes
    RDD
  81. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  82. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  83. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  84. def parent[U](j: Int)(implicit arg0: ClassTag[U]): RDD[U]

    Permalink
    Attributes
    protected[org.apache.spark]
    Definition Classes
    RDD
  85. val partitioner: Option[Partitioner]

    Permalink
    Definition Classes
    RDD
  86. final def partitions: Array[Partition]

    Permalink
    Definition Classes
    RDD
  87. def persist(): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  88. def persist(newLevel: StorageLevel): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  89. def pipe(command: Seq[String], env: Map[String, String], printPipeContext: ((String) ⇒ Unit) ⇒ Unit, printRDDElement: (T, (String) ⇒ Unit) ⇒ Unit, separateWorkingDir: Boolean, bufferSize: Int, encoding: String): RDD[String]

    Permalink
    Definition Classes
    RDD
  90. def pipe(command: String, env: Map[String, String]): RDD[String]

    Permalink
    Definition Classes
    RDD
  91. def pipe(command: String): RDD[String]

    Permalink
    Definition Classes
    RDD
  92. final def preferredLocations(split: Partition): Seq[String]

    Permalink
    Definition Classes
    RDD
  93. def randomSplit(weights: Array[Double], seed: Long): Array[RDD[T]]

    Permalink
    Definition Classes
    RDD
  94. def reduce(f: (T, T) ⇒ T): T

    Permalink
    Definition Classes
    RDD
  95. def repartition(numPartitions: Int)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  96. def sample(withReplacement: Boolean, fraction: Double, seed: Long): RDD[T]

    Permalink
    Definition Classes
    RDD
  97. def saveAsObjectFile(path: String): Unit

    Permalink
    Definition Classes
    RDD
  98. def saveAsTextFile(path: String, codec: Class[_ <: CompressionCodec]): Unit

    Permalink
    Definition Classes
    RDD
  99. def saveAsTextFile(path: String): Unit

    Permalink
    Definition Classes
    RDD
  100. val sc: SparkContext

    Permalink
  101. def setName(_name: String): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  102. def sortBy[K](f: (T) ⇒ K, ascending: Boolean, numPartitions: Int)(implicit ord: Ordering[K], ctag: ClassTag[K]): RDD[T]

    Permalink
    Definition Classes
    RDD
  103. def sparkContext: SparkContext

    Permalink
    Definition Classes
    RDD
  104. val sql: String

    Permalink

    The text of the query.

    The text of the query. Can be a prepared statement template, in which case parameters from sqlParams are substituted.

  105. val sqlParams: Seq[Any]

    Permalink

    The parameters of the query if sql is a template.

  106. def subtract(other: RDD[T], p: Partitioner)(implicit ord: Ordering[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  107. def subtract(other: RDD[T], numPartitions: Int): RDD[T]

    Permalink
    Definition Classes
    RDD
  108. def subtract(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  109. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  110. def take(num: Int): Array[T]

    Permalink
    Definition Classes
    RDD
  111. def takeOrdered(num: Int)(implicit ord: Ordering[T]): Array[T]

    Permalink
    Definition Classes
    RDD
  112. def takeSample(withReplacement: Boolean, num: Int, seed: Long): Array[T]

    Permalink
    Definition Classes
    RDD
  113. def toDebugString: String

    Permalink
    Definition Classes
    RDD
  114. def toJavaRDD(): JavaRDD[T]

    Permalink
    Definition Classes
    RDD
  115. def toLocalIterator: Iterator[T]

    Permalink
    Definition Classes
    RDD
  116. def toString(): String

    Permalink
    Definition Classes
    RDD → AnyRef → Any
  117. def top(num: Int)(implicit ord: Ordering[T]): Array[T]

    Permalink
    Definition Classes
    RDD
  118. def treeAggregate[U](zeroValue: U)(seqOp: (U, T) ⇒ U, combOp: (U, U) ⇒ U, depth: Int)(implicit arg0: ClassTag[U]): U

    Permalink
    Definition Classes
    RDD
  119. def treeReduce(f: (T, T) ⇒ T, depth: Int): T

    Permalink
    Definition Classes
    RDD
  120. def union(other: RDD[T]): RDD[T]

    Permalink
    Definition Classes
    RDD
  121. def unpersist(blocking: Boolean): MemSQLRDD.this.type

    Permalink
    Definition Classes
    RDD
  122. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  123. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  124. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  125. def zip[U](other: RDD[U])(implicit arg0: ClassTag[U]): RDD[(T, U)]

    Permalink
    Definition Classes
    RDD
  126. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D])(f: (Iterator[T], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  127. def zipPartitions[B, C, D, V](rdd2: RDD[B], rdd3: RDD[C], rdd4: RDD[D], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B], Iterator[C], Iterator[D]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[D], arg3: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  128. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C])(f: (Iterator[T], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  129. def zipPartitions[B, C, V](rdd2: RDD[B], rdd3: RDD[C], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B], Iterator[C]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[C], arg2: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  130. def zipPartitions[B, V](rdd2: RDD[B])(f: (Iterator[T], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  131. def zipPartitions[B, V](rdd2: RDD[B], preservesPartitioning: Boolean)(f: (Iterator[T], Iterator[B]) ⇒ Iterator[V])(implicit arg0: ClassTag[B], arg1: ClassTag[V]): RDD[V]

    Permalink
    Definition Classes
    RDD
  132. def zipWithIndex(): RDD[(T, Long)]

    Permalink
    Definition Classes
    RDD
  133. def zipWithUniqueId(): RDD[(T, Long)]

    Permalink
    Definition Classes
    RDD

Inherited from Product

Inherited from Equals

Inherited from RDD[T]

Inherited from Logging

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped