Primary method to transform the source data stream into the output data stream. The output of this method is passed into sink(). This method must be overridden by subclasses.
input data stream created by source()
implicit Flink job config
output data stream
A source data stream for the control events.
implicit Flink config
a data stream of control events.
A source data stream for the data events.
implicit Flink config
a data stream of data events.
A pipeline for transforming a single stream. Passes the output of source() through transform(), then passes the result to maybeSink(), which forwards it to sink() unless the job is being tested. Returns the output data stream to facilitate testing.
implicit Flink job config
data output stream
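The source() to transform() to maybeSink() flow described above can be sketched without any Flink dependency. This is a minimal illustration under stated assumptions, not the library's implementation: a List stands in for a data stream, Config stands in for the implicit job config, and the names SingleStreamJob, flow, sunk, and UpperCaseJob are hypothetical.

```scala
// Sketch only: a List plays the role of a data stream, and Config is a
// hypothetical stand-in for the implicit Flink job config.
object PipelineSketch {
  final case class Config(mockEdges: Boolean)

  abstract class SingleStreamJob[IN, OUT] {
    // Records handed to sink(); kept only so the sketch is observable.
    val sunk = scala.collection.mutable.ListBuffer.empty[OUT]

    def source()(implicit config: Config): List[IN]

    // Must be overridden by subclasses.
    def transform(in: List[IN])(implicit config: Config): List[OUT]

    def sink(out: List[OUT])(implicit config: Config): Unit = {
      sunk ++= out
      ()
    }

    // Forwards to sink() only when edges are not mocked (i.e., not testing).
    def maybeSink(out: List[OUT])(implicit config: Config): Unit =
      if (!config.mockEdges) sink(out)

    // source() -> transform() -> maybeSink(); returns the output for tests.
    // The method name "flow" is an assumption made for this sketch.
    def flow()(implicit config: Config): List[OUT] = {
      val out = transform(source())
      maybeSink(out)
      out
    }
  }

  // A toy subclass that upper-cases each record.
  class UpperCaseJob extends SingleStreamJob[String, String] {
    def source()(implicit config: Config): List[String] = List("a", "b")
    def transform(in: List[String])(implicit config: Config): List[String] =
      in.map(_.toUpperCase)
  }
}
```

With Config(mockEdges = true) the toy job still returns its transformed output, but nothing reaches sink(), which is what lets a test inspect the result without touching real outputs.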
The output stream is passed to sink() only if FlinkConfig.mockEdges evaluates to false (i.e., you're not testing).
the output data stream to pass into sink()
implicit Flink job config
Writes the transformed data stream to configured output sinks.
a transformed stream from transform()
implicit Flink job config
Generates a stream of data records filtered by the control stream. This method does not generally need to be overridden in subclasses. It interleaves the data and control streams to produce a single stream of DataOrControl objects, then uses a stateful flat map to determine when to emit the data records. It remembers the last control time and state, and updates them when the state changes.
implicit Flink config
data stream of data records
A simple Flink job that interleaves a data stream and a control stream and creates a source of data records that occur during active control periods.
As an example, let on represent an active control, off represent an inactive control, and d represent data elements. Then the following stream:
    d1 d2 on d3 d4 d5 off d6 d7
would output:
    d3 d4 d5
the data type
the control type
the output stream element type
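The filtering behavior can be sketched with plain Scala collections. This is an illustration under stated assumptions, not the job's actual code: a List stands in for the interleaved stream, the types Data and Control are hypothetical stand-ins for the generic data and control types, the state is assumed to start inactive, and the control time that the real job remembers is omitted.

```scala
// Sketch only: models the stateful flat map over an interleaved stream.
object FilterByControlSketch {
  // Hypothetical element types for the interleaved stream.
  sealed trait DataOrControl
  final case class Data(value: String) extends DataOrControl
  final case class Control(active: Boolean) extends DataOrControl

  // Emits data records only while the most recent control was active.
  def filterByControl(interleaved: List[DataOrControl]): List[Data] = {
    // Accumulator: (last control state, emitted records in reverse order).
    // Assumption: the state is inactive until the first control arrives.
    val (_, emitted) = interleaved.foldLeft((false, List.empty[Data])) {
      case ((_, out), Control(active)) => (active, out)   // update the state
      case ((true, out), d: Data)      => (true, d :: out) // active: emit
      case ((false, out), _: Data)     => (false, out)     // inactive: drop
    }
    emitted.reverse
  }

  val on: DataOrControl  = Control(active = true)
  val off: DataOrControl = Control(active = false)

  // The stream d1 d2 on d3 d4 d5 off d6 d7.
  val example: List[DataOrControl] = List(
    Data("d1"), Data("d2"), on,
    Data("d3"), Data("d4"), Data("d5"), off,
    Data("d6"), Data("d7")
  )

  // Yields List("d3", "d4", "d5").
  val exampleOutput: List[String] = filterByControl(example).map(_.value)
}
```

A fold is used here because the only state carried between elements is the last control value; in a real streaming job that state would live in managed operator state rather than in a local accumulator.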