Trait

com.holdenkarau.spark.testing

DatasetSuiteBaseLike

Related Doc: package testing

Permalink

trait DatasetSuiteBaseLike extends DataFrameSuiteBaseLike

Linear Supertypes
DataFrameSuiteBaseLike, Serializable, Serializable, TestSuiteLike, SparkContextProvider, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. DatasetSuiteBaseLike
  2. DataFrameSuiteBaseLike
  3. Serializable
  4. Serializable
  5. TestSuiteLike
  6. SparkContextProvider
  7. AnyRef
  8. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def assert[U](message: String, expected: U, actual: U)(implicit CT: ClassTag[U]): Unit

    Permalink
    Definition Classes
    TestSuiteLike
  2. abstract def assert[U](expected: U, actual: U)(implicit CT: ClassTag[U]): Unit

    Permalink
    Definition Classes
    TestSuiteLike
  3. abstract def assertEmpty[U](arr: Array[U])(implicit CT: ClassTag[U]): Unit

    Permalink
    Definition Classes
    TestSuiteLike
  4. abstract def assertTrue(expected: Boolean): Unit

    Permalink
    Definition Classes
    TestSuiteLike
  5. abstract def fail(message: String): Unit

    Permalink
    Definition Classes
    TestSuiteLike
  6. abstract def sc: SparkContext

    Permalink
    Definition Classes
    SparkContextProvider

Concrete Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def appID: String

    Permalink
    Definition Classes
    SparkContextProvider
  5. def appName: String

    Permalink
    Definition Classes
    SparkContextProvider
  6. def approxEquals(r1: Row, r2: Row, tol: Double, tolTimestamp: Duration): Boolean

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  7. def approxEquals(r1: Row, r2: Row, tolTimestamp: Duration): Boolean

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  8. def approxEquals(r1: Row, r2: Row, tol: Double): Boolean

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  9. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  10. def assertDataFrameApproximateEquals(expected: DataFrame, result: DataFrame, tol: Double, tolTimestamp: Duration, customShow: (DataFrame) ⇒ Unit = _.show()): Unit

    Permalink

    Compares if two DataFrames are equal, checks that the schemas are the same.

    Compares if two DataFrames are equal, checks that the schemas are the same. When comparing inexact fields uses tol & tolTimestamp.

    tol

    max acceptable numeric tolerance, should be less than 1.

    tolTimestamp

    max acceptable timestamp tolerance.

    customShow

    unit function to customize the show method when dataframes are not equal. IE: df.show(false) or df.toJSON.show(false).

    Definition Classes
    DataFrameSuiteBaseLike
  11. def assertDataFrameDataEquals(expected: DataFrame, result: DataFrame): Unit

    Permalink

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other.

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other. The resulting DataFrame should be empty inferring the two DataFrames have the same elements. Does not compare the schema.

    Definition Classes
    DataFrameSuiteBaseLike
  12. def assertDataFrameEquals(expected: DataFrame, result: DataFrame, customShow: (DataFrame) ⇒ Unit = _.show()): Unit

    Permalink

    Compares if two DataFrames are equal, checks the schema and then if that matches checks if the rows are equal.

    Compares if two DataFrames are equal, checks the schema and then if that matches checks if the rows are equal.

    customShow

    unit function to customize the show method when dataframes are not equal. IE: df.show(false) or df.toJSON.show(false).

    Definition Classes
    DataFrameSuiteBaseLike
  13. def assertDataFrameNoOrderEquals(expected: DataFrame, result: DataFrame): Unit

    Permalink

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other.

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other. The resulting DataFrame should be empty inferring the two DataFrames have the same elements. Also verifies that the schema is identical.

    Definition Classes
    DataFrameSuiteBaseLike
  14. def assertDatasetApproximateEquals[U](expected: Dataset[U], result: Dataset[U], tol: Double, tolTimestamp: Duration, customShow: (DataFrame) ⇒ Unit = _.show())(implicit UCT: ClassTag[U]): Unit

    Permalink

    Compares if two Datasets are equal, Datasets should have the same type.

    Compares if two Datasets are equal, Datasets should have the same type. When comparing inexact fields uses tol & tolTimestamp.

    tol

    max acceptable tolerance for numeric (between(0, 1))

    tolTimestamp

    max acceptable timestamp tolerance.

    customShow

    unit function to customize the show method when dataframes are not equal. IE: df.show(false) or df.toJSON.show(false).

  15. def assertDatasetEquals[U](expected: Dataset[U], result: Dataset[U])(implicit UCT: ClassTag[U]): Unit

    Permalink

    Check if two Datasets are equals, Datasets should have the same type.

    Check if two Datasets are equals, Datasets should have the same type. This method could be customized by overriding equals method for the given class type.

  16. def assertSchemasEqual(expected: StructType, result: StructType): Unit

    Permalink

    Compare if two schemas are equal, ignoring autoGeneratedAlias magic

    Compare if two schemas are equal, ignoring autoGeneratedAlias magic

    Definition Classes
    DataFrameSuiteBaseLike
  17. def assertSmallDataFrameDataEquals(expected: DataFrame, result: DataFrame): Unit

    Permalink

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other.

    Compares if two DataFrames are equal without caring about order of rows, by finding elements in one DataFrame that is not in the other. Similar to the function assertDataFrameDataEquals but for small DataFrame that can be collected in memory for the comparison.

    Definition Classes
    DataFrameSuiteBaseLike
  18. def builder(): Builder

    Permalink

    Constructs a configuration for hive or iceberg, where the metastore is located in a temp directory.

    Constructs a configuration for hive or iceberg, where the metastore is located in a temp directory.

    Definition Classes
    DataFrameSuiteBaseLike
  19. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @IntrinsicCandidate() @throws( ... )
  20. def conf: SparkConf

    Permalink
    Definition Classes
    SparkContextProvider
  21. def enableHiveSupport: Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    DataFrameSuiteBaseLike
  22. def enableIcebergSupport: Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    DataFrameSuiteBaseLike
  23. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  24. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  25. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
    Annotations
    @IntrinsicCandidate()
  26. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
    Annotations
    @IntrinsicCandidate()
  27. val icebergWarehouse: String

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  28. implicit def impSqlContext: SQLContext

    Permalink
    Attributes
    protected
    Definition Classes
    DataFrameSuiteBaseLike
  29. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  30. lazy val localMetastorePath: String

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  31. lazy val localWarehousePath: String

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  32. val maxUnequalRowsToShow: Int

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  33. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  34. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @IntrinsicCandidate()
  35. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @IntrinsicCandidate()
  36. def setup(sc: SparkContext): Unit

    Permalink

    Setup work to be called when creating a new SparkContext.

    Setup work to be called when creating a new SparkContext. Default implementation currently sets a checkpoint directory.

    This _should_ be called by the context provider automatically.

    Definition Classes
    SparkContextProvider
  37. lazy val spark: SparkSession

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  38. def sqlBeforeAllTestCases(): Unit

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  39. lazy val sqlContext: SQLContext

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  40. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  41. lazy val tempDir: File

    Permalink
    Definition Classes
    DataFrameSuiteBaseLike
  42. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  43. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  44. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  45. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Deprecated Value Members

  1. def assertDataFrameApproximateEquals(expected: DataFrame, result: DataFrame, tol: Double): Unit

    Permalink

    Compares if two DataFrames are equal, checks that the schemas are the same.

    Compares if two DataFrames are equal, checks that the schemas are the same. When comparing inexact fields uses tol.

    tol

    max acceptable tolerance for numeric (between(0, 1)) & timestamp (millis).

    Definition Classes
    DataFrameSuiteBaseLike
    Annotations
    @deprecated
    Deprecated

    (Since version 1.5.0) Use assertDataFrameApproximateEquals with timestamp tolerance

  2. def assertDatasetApproximateEquals[U](expected: Dataset[U], result: Dataset[U], tol: Double)(implicit UCT: ClassTag[U]): Unit

    Permalink

    Compares if two Datasets are equal, Datasets should have the same type.

    Compares if two Datasets are equal, Datasets should have the same type. When comparing inexact fields uses tol.

    tol

    max acceptable tolerance for numeric (between(0, 1)) & timestamp (millis).

    Annotations
    @deprecated
    Deprecated

    (Since version 1.5.0) Use assertDatasetApproximateEquals with timestamp tolerance

  3. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @Deprecated @deprecated @throws( classOf[java.lang.Throwable] )
    Deprecated

    (Since version ) see corresponding Javadoc for more information.

Inherited from DataFrameSuiteBaseLike

Inherited from Serializable

Inherited from Serializable

Inherited from TestSuiteLike

Inherited from SparkContextProvider

Inherited from AnyRef

Inherited from Any

Ungrouped