Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Enable periodic checkpointing of RDDs of this SchemaDStream
Enable periodic checkpointing of RDDs of this SchemaDStream
Time interval after which generated RDD will be checkpointed
Returns all column names as an array.
Method that generates a RDD for the given time
Method that generates a RDD for the given time
List of parent DStreams on which this SchemaDStream depends on
List of parent DStreams on which this SchemaDStream depends on
Return a new SchemaDStream containing only the elements that satisfy a predicate.
Return a new SchemaDStream containing only the elements that satisfy a predicate.
Return a new DStream by applying a function to all elements of this SchemaDStream, and then flattening the results
Return a new DStream by applying a function to all elements of this SchemaDStream, and then flattening the results
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Apply a function to each DataFrame in this SchemaDStream.
Apply a function to each DataFrame in this SchemaDStream. This is an output operator, so 'this' SchemaDStream will be registered as an output stream and therefore materialized.
Return a new DStream in which each RDD is generated by applying glom() to each RDD of this SchemaDStream.
Return a new DStream in which each RDD is generated by applying glom() to each RDD of this SchemaDStream. Applying glom() to an RDD coalesces all elements within each partition into an array.
Return a new DStream by applying a function to all elements of this SchemaDStream.
Return a new DStream by applying a function to all elements of this SchemaDStream.
Return a new DStream in which each RDD is generated by applying mapPartitions() to each RDDs of this SchemaDStream.
Return a new DStream in which each RDD is generated by applying mapPartitions() to each RDDs of this SchemaDStream. Applying mapPartitions() to an RDD applies a function to each partition of the RDD.
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist RDDs of this SchemaDStream with the default storage level (MEMORY_ONLY_SER)
Persist the RDDs of this SchemaDStream with the given storage level
Persist the RDDs of this SchemaDStream with the given storage level
Registers this SchemaDStream as a table in the catalog.
Return a new SchemaDStream with an increased or decreased level of parallelism.
Return a new SchemaDStream with an increased or decreased level of parallelism. Each RDD in the returned SchemaDStream has exactly numPartitions partitions.
Returns the schema of this SchemaDStream (represented by a StructType).
Time interval after which the SchemaDStream generates a RDD
Time interval after which the SchemaDStream generates a RDD
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' DStream and 'other' DStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' DStream and 'other' DStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream and 'other' SchemaDStream.
Return a new DStream in which each RDD is generated by applying a function on each RDD of 'this' SchemaDStream and 'other' SchemaDStream.
A SQL based DStream with support for schema/Product This class offers the ability to manipulate SQL query on DStreams It is similar to SchemaRDD, which offers the similar functions Internally, RDD of each batch duration is treated as a small table and CQs are evaluated on those small tables Some of the abstraction and code is borrowed from the project: https://github.com/Intel-bigdata/spark-streamingsql