Trait frameless.TypedDatasetForwarded

Related Doc: package frameless

trait TypedDatasetForwarded[T] extends AnyRef

This trait implements TypedDataset methods that have the same signature as their Dataset equivalents. Each method simply forwards the call to the underlying Dataset.

Documentation marked "apache/spark" is thanks to the apache/spark contributors at https://github.com/apache/spark, licensed under the Apache License 2.0, available at http://www.apache.org/licenses/LICENSE-2.0

Self Type: TypedDataset[T]
Linear Supertypes: AnyRef, Any
Known Subclasses: TypedDataset

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes: AnyRef → Any
  2. final def ##(): Int
    Definition Classes: AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes: AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes: Any
  5. def cache(): TypedDataset[T]

    Persist this TypedDataset with the default storage level (MEMORY_AND_DISK).

    apache/spark
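
    Example (a minimal, hedged sketch: the case class Person, the local SparkSession, and the dataset ds are illustrative assumptions, not part of this API; later examples in this list reuse them):

      import frameless.TypedDataset
      import org.apache.spark.sql.SparkSession

      case class Person(name: String, age: Int)

      implicit val spark: SparkSession =
        SparkSession.builder().master("local[*]").getOrCreate()
      import spark.implicits._

      // Assumed sample data; the duplicate row matters for the set-operation examples below.
      val ds: TypedDataset[Person] =
        TypedDataset.create(Seq(Person("Ada", 36), Person("Grace", 45), Person("Ada", 36)))

      val cached: TypedDataset[Person] = ds.cache()  // default storage level: MEMORY_AND_DISK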

  6. def clone(): AnyRef
    Attributes: protected[java.lang]
    Definition Classes: AnyRef
    Annotations: @throws( ... )
  7. def coalesce(numPartitions: Int): TypedDataset[T]

    Returns a new TypedDataset that has exactly numPartitions partitions. Similar to coalesce defined on an RDD, this operation results in a narrow dependency; e.g. if you go from 1000 partitions to 100 partitions, there will not be a shuffle; instead each of the 100 new partitions will claim 10 of the current partitions.

    apache/spark
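
    A hedged sketch, reusing the assumed ds: TypedDataset[Person] from the cache example:

      val single: TypedDataset[Person] = ds.coalesce(1)  // narrow dependency: no shuffle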

  8. def distinct: TypedDataset[T]

    Returns a new TypedDataset that contains only the unique elements of this TypedDataset.

    Note that equality checking is performed directly on the encoded representation of the data and thus is not affected by a custom equals function defined on T.

    apache/spark
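
    A hedged sketch, reusing the assumed ds from the cache example (its duplicate Person("Ada", 36) row is dropped):

      val deduped: TypedDataset[Person] = ds.distinct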

  9. final def eq(arg0: AnyRef): Boolean
    Definition Classes: AnyRef
  10. def equals(arg0: Any): Boolean
    Definition Classes: AnyRef → Any
  11. def except(other: TypedDataset[T]): TypedDataset[T]

    Returns a new Dataset containing rows in this Dataset but not in another Dataset. This is equivalent to EXCEPT in SQL.

    Note that equality checking is performed directly on the encoded representation of the data and thus is not affected by a custom equals function defined on T.

    apache/spark
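
    A hedged sketch, reusing the assumed Person/ds from the cache example; others is a second illustrative dataset:

      val others: TypedDataset[Person] = TypedDataset.create(Seq(Person("Ada", 36)))
      val remaining: TypedDataset[Person] = ds.except(others)  // rows in ds but not in others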

  12. def explain(extended: Boolean = false): Unit

    Prints the plans (logical and physical) to the console for debugging purposes.

    apache/spark
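
    A hedged sketch, reusing the assumed ds from the cache example:

      ds.explain()                 // physical plan only
      ds.explain(extended = true)  // logical and physical plans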

  13. def filter(func: (T) ⇒ Boolean): TypedDataset[T]

    Returns a new TypedDataset that only contains elements where func returns true.

    apache/spark
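
    A hedged sketch, reusing the assumed Person/ds from the cache example:

      val adults: TypedDataset[Person] = ds.filter(_.age >= 18)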

  14. def finalize(): Unit
    Attributes: protected[java.lang]
    Definition Classes: AnyRef
    Annotations: @throws( classOf[java.lang.Throwable] )
  15. def flatMap[U](func: (T) ⇒ TraversableOnce[U])(implicit arg0: TypedEncoder[U]): TypedDataset[U]

    Returns a new TypedDataset by first applying a function to all elements of this TypedDataset, and then flattening the results.

    apache/spark
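
    A hedged sketch, reusing the assumed ds from the cache example:

      // One output row per whitespace-separated token of each name.
      val tokens: TypedDataset[String] = ds.flatMap(_.name.split(" ").toList)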

  16. final def getClass(): Class[_]
    Definition Classes: AnyRef → Any
  17. def hashCode(): Int
    Definition Classes: AnyRef → Any
  18. def intersect(other: TypedDataset[T]): TypedDataset[T]

    Returns a new TypedDataset that contains only the elements of this TypedDataset that are also present in other.

    Note that equality checking is performed directly on the encoded representation of the data and thus is not affected by a custom equals function defined on T.

    apache/spark
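
    A hedged sketch, reusing the assumed ds and the others dataset from the except example:

      val common: TypedDataset[Person] = ds.intersect(others)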

  19. final def isInstanceOf[T0]: Boolean
    Definition Classes: Any
  20. def limit(n: Int): TypedDataset[T]

    Returns a new Dataset by taking the first n rows. The difference between this function and head is that head is an action and returns an array (by triggering query execution) while limit returns a new Dataset.

    apache/spark
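
    A hedged sketch, reusing the assumed ds from the cache example:

      val firstTwo: TypedDataset[Person] = ds.limit(2)  // a transformation; no job is triggered yet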

  21. def map[U](func: (T) ⇒ U)(implicit arg0: TypedEncoder[U]): TypedDataset[U]

    Returns a new TypedDataset that contains the result of applying func to each element.

    apache/spark
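
    A hedged sketch, reusing the assumed ds from the cache example; the TypedEncoder[String] instance is resolved implicitly:

      val names: TypedDataset[String] = ds.map(_.name)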

  22. def mapPartitions[U](func: (Iterator[T]) ⇒ Iterator[U])(implicit arg0: TypedEncoder[U]): TypedDataset[U]

    Returns a new TypedDataset that contains the result of applying func to each partition.

    apache/spark
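
    A hedged sketch, reusing the assumed ds from the cache example:

      // The function runs once per partition over an Iterator of its elements.
      val upper: TypedDataset[String] = ds.mapPartitions(_.map(_.name.toUpperCase))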

  23. final def ne(arg0: AnyRef): Boolean
    Definition Classes: AnyRef
  24. final def notify(): Unit
    Definition Classes: AnyRef
  25. final def notifyAll(): Unit
    Definition Classes: AnyRef
  26. def persist(newLevel: StorageLevel = StorageLevel.MEMORY_AND_DISK): TypedDataset[T]

    Persist this TypedDataset with the given storage level.

    newLevel
      One of: MEMORY_ONLY, MEMORY_AND_DISK, MEMORY_ONLY_SER, MEMORY_AND_DISK_SER, DISK_ONLY, MEMORY_ONLY_2, MEMORY_AND_DISK_2, etc.

    apache/spark
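
    A hedged sketch, reusing the assumed ds from the cache example:

      import org.apache.spark.storage.StorageLevel

      val onDisk: TypedDataset[Person] = ds.persist(StorageLevel.DISK_ONLY)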

  27. def printSchema(): Unit

    Prints the schema of the underlying Dataset to the console in a nice tree format.

    apache/spark
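
    A hedged sketch, reusing the assumed ds from the cache example:

      ds.printSchema()  // prints a tree such as: root |-- name: string |-- age: integer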

  28. def rdd: RDD[T]

    Converts this TypedDataset to an RDD.

    apache/spark
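
    A hedged sketch, reusing the assumed Person/ds from the cache example:

      import org.apache.spark.rdd.RDD

      val asRdd: RDD[Person] = ds.rdd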

  29. def repartition(numPartitions: Int): TypedDataset[T]

    Returns a new TypedDataset that has exactly numPartitions partitions.

    apache/spark
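
    A hedged sketch, reusing the assumed ds from the cache example (unlike coalesce, this can shuffle data across the cluster):

      val rebalanced: TypedDataset[Person] = ds.repartition(8)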

  30. def sample(withReplacement: Boolean, fraction: Double, seed: Long = Random.nextLong): TypedDataset[T]

    Returns a new TypedDataset by sampling a fraction of records.

    apache/spark
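
    A hedged sketch, reusing the assumed ds from the cache example; fixing the seed makes the sample reproducible:

      val roughlyHalf: TypedDataset[Person] = ds.sample(withReplacement = false, fraction = 0.5, seed = 42L)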

  31. def schema: StructType

    Returns the schema of this Dataset.

    apache/spark
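
    A hedged sketch, reusing the assumed ds from the cache example:

      import org.apache.spark.sql.types.StructType

      val st: StructType = ds.schema  // one StructField per field of Person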

  32. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes: AnyRef
  33. def toDF(): DataFrame

    Converts this strongly typed collection of data to a generic DataFrame. In contrast to the strongly typed objects that Dataset operations work on, a DataFrame returns generic Row objects that allow fields to be accessed by ordinal or name.

    apache/spark
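
    A hedged sketch, reusing the assumed ds from the cache example:

      import org.apache.spark.sql.DataFrame

      val df: DataFrame = ds.toDF()  // untyped: fields are accessed by ordinal or name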

  34. def toString(): String
    Definition Classes: TypedDatasetForwarded → AnyRef → Any
  35. def transform[U](t: (TypedDataset[T]) ⇒ TypedDataset[U]): TypedDataset[U]

    Concise syntax for chaining custom transformations.

    apache/spark
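
    A hedged sketch, reusing the assumed Person/ds from the cache example; adults and names are illustrative helper functions:

      def adults(in: TypedDataset[Person]): TypedDataset[Person] = in.filter(_.age >= 18)
      def names(in: TypedDataset[Person]): TypedDataset[String] = in.map(_.name)

      val chained: TypedDataset[String] = ds.transform(adults).transform(names)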

  36. def union(other: TypedDataset[T]): TypedDataset[T]

    Returns a new TypedDataset that contains the elements of both this and the other TypedDataset combined.

    Note that this function is not a typical set union operation, in that it does not eliminate duplicate items. As such, it is analogous to UNION ALL in SQL.

    apache/spark
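
    A hedged sketch, reusing the assumed ds and the others dataset from the except example; duplicates are kept, as with UNION ALL:

      val combined: TypedDataset[Person] = ds.union(others)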

  37. def unpersist(blocking: Boolean = false): TypedDataset[T]

    Mark the TypedDataset as non-persistent, and remove all blocks for it from memory and disk.

    blocking
      Whether to block until all blocks are deleted.

    apache/spark
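
    A hedged sketch, reusing the assumed cached dataset from the cache example:

      val uncached: TypedDataset[Person] = cached.unpersist(blocking = true)  // wait until blocks are removed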

  38. final def wait(): Unit
    Definition Classes: AnyRef
    Annotations: @throws( ... )
  39. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes: AnyRef
    Annotations: @throws( ... )
  40. final def wait(arg0: Long): Unit
    Definition Classes: AnyRef
    Annotations: @throws( ... )
