Package

com.holdenkarau.spark

testing

Permalink

package testing

Visibility
  1. Public
  2. All

Type Members

  1. class BatchCountListener extends StreamingListener

    Permalink
  2. class ColumnGenerator extends Serializable

    Permalink
  3. trait DataFrameSuiteBase extends TestSuite with SharedSparkContext with DataFrameSuiteBaseLike

    Permalink

    :: Experimental :: Base class for testing Spark DataFrames.

  4. trait DataFrameSuiteBaseLike extends SparkContextProvider with TestSuiteLike with Serializable

    Permalink
  5. class HDFSCluster extends HDFSClusterLike

    Permalink
  6. trait HDFSClusterLike extends AnyRef

    Permalink
  7. class JavaDataFrameSuiteBase extends SharedJavaSparkContext with DataFrameSuiteBaseLike with JavaTestSuite

    Permalink
  8. class JavaStreamingSuiteBase extends JavaSuiteBase with StreamingSuiteCommon

    Permalink

    This is the base trait for Spark Streaming testsuite.

    This is the base trait for Spark Streaming testsuite. This provides basic functionality to run user-defined set of input on user-defined stream operations, and verify the output. This implementation is designed to work with JUnit for java users.

    Note: this always uses the manual clock

  9. class JavaSuiteBase extends SharedJavaSparkContext

    Permalink
  10. trait JavaTestSuite extends TestSuiteLike

    Permalink
  11. trait LocalSparkContext extends BeforeAndAfterEach with BeforeAndAfterAll

    Permalink

    Manages a local sc SparkContext variable, correctly stopping it after each test.

  12. trait PerTestSparkContext extends LocalSparkContext with BeforeAndAfterEach

    Permalink

    Provides a local sc SparkContext variable, correctly stopping it after each test.

    Provides a local sc SparkContext variable, correctly stopping it after each test. The stopping logic is provided in LocalSparkContext

  13. class PerfListener extends SparkListener

    Permalink

    This listener collects basic execution time information to be used in micro type performance tests.

    This listener collects basic execution time information to be used in micro type performance tests. Be careful imposing strict limits as there is a large amount of variability.

  14. trait RDDComparisons extends RDDComparisonsLike with TestSuite

    Permalink
  15. trait RDDComparisonsLike extends TestSuiteLike

    Permalink
  16. class SharedJavaSparkContext extends SparkContextProvider

    Permalink
  17. trait SharedMiniCluster extends BeforeAndAfterAll with HDFSClusterLike with YARNClusterLike

    Permalink

    Shares an HDFS MiniCluster based SparkContext between all tests in a suite and closes it at the end.

    Shares an HDFS MiniCluster based SparkContext between all tests in a suite and closes it at the end. This requires that the env variable SPARK_HOME is set. Further more if this is used, all Spark tests must run against the yarn mini cluster (see https://issues.apache.org/jira/browse/SPARK-10812 for details).

  18. trait SharedSparkContext extends BeforeAndAfterAll with SparkContextProvider

    Permalink

    Shares a local SparkContext between all tests in a suite and closes it at the end.

  19. trait SparkContextProvider extends AnyRef

    Permalink
  20. trait StreamingActionBase extends StreamingSuiteBase

    Permalink

    Methods for testing Spark actions.

    Methods for testing Spark actions. Because actions don't return a DStream, you will need to verify the results of your test against mocks.

  21. trait StreamingSuiteBase extends BeforeAndAfterAll with Logging with StreamingSuiteCommon with SharedSparkContext

    Permalink

    This is the base trait for Spark Streaming testsuites.

    This is the base trait for Spark Streaming testsuites. This provides basic functionality to run user-defined set of input on user-defined stream operations, and verify the output.

  22. class TestInputStream[T] extends FriendlyInputDStream[T]

    Permalink

    This is a input stream just for the testsuites.

    This is a input stream just for the testsuites. This is equivalent to a checkpointable, replayable, reliable message queue like Kafka. It requires a sequence as input, and returns the i_th element at the i_th batch unde manual clock. Based on TestInputStream class from TestSuiteBase in the Apache Spark project.

  23. class TestOutputStream[T] extends Serializable

    Permalink

    This is a output stream just for testing.

    This is a output stream just for testing.

    The buffer contains a sequence of RDD's, each containing a sequence of items

  24. trait TestSuite extends TestSuiteLike

    Permalink
  25. trait TestSuiteLike extends AnyRef

    Permalink
  26. class YARNCluster extends YARNClusterLike

    Permalink

    Shares an HDFS MiniCluster based SparkContext between all tests in a suite and closes it at the end.

    Shares an HDFS MiniCluster based SparkContext between all tests in a suite and closes it at the end. This requires that the env variable SPARK_HOME is set. Further more if this is used, all Spark tests must run against the yarn mini cluster (see https://issues.apache.org/jira/browse/SPARK-10812 for details).

  27. trait YARNClusterLike extends AnyRef

    Permalink

Value Members

  1. object DataFrameSuiteBase extends Serializable

    Permalink
  2. object DataframeGenerator

    Permalink
  3. object JavaRDDComparisons extends RDDComparisonsLike with JavaTestSuite

    Permalink
  4. object LocalSparkContext

    Permalink
  5. object RDDGenerator

    Permalink
    Annotations
    @Experimental()
  6. object SQLContextProvider

    Permalink
  7. object Utils extends Logging

    Permalink

Ungrouped