Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Enable periodic checkpointing of RDDs of this SchemaDStream
Enable periodic checkpointing of RDDs of this SchemaDStream
Time interval after which generated RDD will be checkpointed
Returns all column names as an array.
Method that generates a RDD for the given time
Method that generates a RDD for the given time
List of parent DStreams on which this DStream depends on
List of parent DStreams on which this DStream depends on
Return a new SchemaDStream containing only the elements that satisfy a predicate.
Return a new SchemaDStream containing only the elements that satisfy a predicate.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist the RDDs of this SchemaDStream with the given storage level
Persist the RDDs of this SchemaDStream with the given storage level
Registers this SchemaDStream as a table in the catalog.
Return a new SchemaDStream with an increased or decreased level of parallelism.
Return a new SchemaDStream with an increased or decreased level of parallelism. Each RDD in the returned SchemaDStream has exactly numPartitions partitions.
Returns the schema of this SchemaDStream (represented by a StructType).
Time interval after which the DStream generates a RDD
Time interval after which the DStream generates a RDD
A SQL based DStream with support for schema/Product This class offers the ability to manipulate SQL query on DStreams It is similar to SchemaRDD, which offers the similar functions Internally, RDD of each batch duration is treated as a small table and CQs are evaluated on those small tables Some of the abstraction and code is borrowed from the project: https://github.com/Intel-bigdata/spark-streamingsql