| Package | Description | 
|---|---|
| org.apache.flink.api.java | |
| org.apache.flink.api.java.operators | |
| org.apache.flink.api.java.operators.join | |
| org.apache.flink.api.java.utils | |
| Modifier and Type | Method and Description | 
|---|---|
<X> DataSet<X> | 
DataSet.runOperation(CustomUnaryOperation<T,X> operation)
Runs a CustomUnaryOperation on the data set. | 
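
The listing shows that a `CustomUnaryOperation` exposes `setInput(...)` and `createResult()`, and that `runOperation` takes such an operation and returns a new data set. The sketch below models that two-step contract in plain Java, with `List` standing in for `DataSet`. It assumes `runOperation` simply hands the input to `setInput` and then returns `createResult()`. The names `UnaryOperationSketch`, `RunOperationSketch`, and `Lengths` are hypothetical and not part of Flink.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Plain-Java stand-in for the two-step contract; not the Flink interface itself. */
interface UnaryOperationSketch<IN, OUT> {
    void setInput(List<IN> input);   // mirrors CustomUnaryOperation.setInput
    List<OUT> createResult();        // mirrors CustomUnaryOperation.createResult
}

final class RunOperationSketch {
    /** Assumed behavior: pass the data to the operation, then ask it for its result. */
    static <T, X> List<X> runOperation(List<T> data, UnaryOperationSketch<T, X> operation) {
        operation.setInput(data);
        return operation.createResult();
    }

    /** Hypothetical operation that replaces each string by its length. */
    static final class Lengths implements UnaryOperationSketch<String, Integer> {
        private List<String> input;

        @Override
        public void setInput(List<String> input) {
            this.input = input;
        }

        @Override
        public List<Integer> createResult() {
            List<Integer> out = new ArrayList<>();
            for (String s : input) {
                out.add(s.length());
            }
            return out;
        }
    }

    public static void main(String[] args) {
        // prints [1, 3]
        System.out.println(runOperation(Arrays.asList("a", "bcd"), new Lengths()));
    }
}
```
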
| Modifier and Type | Method and Description | 
|---|---|
protected static void | 
DataSet.checkSameExecutionContext(DataSet<?> set1,
                         DataSet<?> set2)  | 
<R> CoGroupOperator.CoGroupOperatorSets<T,R> | 
DataSet.coGroup(DataSet<R> other)
Initiates a CoGroup transformation. 
 | 
<R> CrossOperator.DefaultCross<T,R> | 
DataSet.cross(DataSet<R> other)
Initiates a Cross transformation. 
 | 
<R> CrossOperator.DefaultCross<T,R> | 
DataSet.crossWithHuge(DataSet<R> other)
Initiates a Cross transformation. 
 | 
<R> CrossOperator.DefaultCross<T,R> | 
DataSet.crossWithTiny(DataSet<R> other)
Initiates a Cross transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.fullOuterJoin(DataSet<R> other)
Initiates a Full Outer Join transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.fullOuterJoin(DataSet<R> other,
             org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint strategy)
Initiates a Full Outer Join transformation. 
 | 
<R> DeltaIteration<T,R> | 
DataSet.iterateDelta(DataSet<R> workset,
            int maxIterations,
            int... keyPositions)
Initiates a delta iteration. 
 | 
<R> JoinOperator.JoinOperatorSets<T,R> | 
DataSet.join(DataSet<R> other)
Initiates a Join transformation. 
 | 
<R> JoinOperator.JoinOperatorSets<T,R> | 
DataSet.join(DataSet<R> other,
    org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint strategy)
Initiates a Join transformation. 
 | 
<R> JoinOperator.JoinOperatorSets<T,R> | 
DataSet.joinWithHuge(DataSet<R> other)
Initiates a Join transformation. 
 | 
<R> JoinOperator.JoinOperatorSets<T,R> | 
DataSet.joinWithTiny(DataSet<R> other)
Initiates a Join transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.leftOuterJoin(DataSet<R> other)
Initiates a Left Outer Join transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.leftOuterJoin(DataSet<R> other,
             org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint strategy)
Initiates a Left Outer Join transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.rightOuterJoin(DataSet<R> other)
Initiates a Right Outer Join transformation. 
 | 
<R> JoinOperatorSetsBase<T,R> | 
DataSet.rightOuterJoin(DataSet<R> other,
              org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint strategy)
Initiates a Right Outer Join transformation. 
 | 
UnionOperator<T> | 
DataSet.union(DataSet<T> other)
Creates a union of this DataSet with another DataSet. 
 | 
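
To make the difference between the inner and outer join variants above concrete, here is a small plain-Java model of left outer join semantics. It does not use the Flink API. `Rec` and `OuterJoinSketch` are hypothetical names, and the nested loop is only an illustration, not how Flink executes joins. The sketch assumes that an unmatched left element is passed to the join function together with `null` for the right side.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.BiFunction;

final class OuterJoinSketch {
    /** A keyed record; stands in for something like a Tuple2<Integer, String>. */
    static final class Rec {
        final int key;
        final String value;

        Rec(int key, String value) {
            this.key = key;
            this.value = value;
        }
    }

    /** Every left record appears at least once; unmatched ones are paired with null. */
    static <R> List<R> leftOuterJoin(List<Rec> left, List<Rec> right,
                                     BiFunction<Rec, Rec, R> joinFunction) {
        List<R> out = new ArrayList<>();
        for (Rec l : left) {
            boolean matched = false;
            for (Rec r : right) {
                if (l.key == r.key) {
                    out.add(joinFunction.apply(l, r));
                    matched = true;
                }
            }
            if (!matched) {
                out.add(joinFunction.apply(l, null)); // no partner on the right side
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Rec> left = Arrays.asList(new Rec(1, "a"), new Rec(2, "b"));
        List<Rec> right = Arrays.asList(new Rec(1, "x"));
        // prints [a/x, b/none]
        System.out.println(leftOuterJoin(left, right,
                (l, r) -> l.value + "/" + (r == null ? "none" : r.value)));
    }
}
```

A right outer join is the mirror image (every right record is kept), and a full outer join keeps unmatched records from both sides.
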
| Modifier and Type | Class and Description | 
|---|---|
class  | 
AggregateOperator<IN>
This operator represents the application of an "aggregate" operation on a data set, and the result
 data set produced by the function. 
 | 
class  | 
BulkIterationResultSet<T>
Resulting DataSet of bulk iterations. | 
class  | 
CoGroupOperator<I1,I2,OUT>
A DataSet that is the result of a CoGroup transformation. | 
class  | 
CoGroupRawOperator<I1,I2,OUT>
A DataSet that is the result of a CoGroup transformation. | 
class  | 
CrossOperator<I1,I2,OUT>
A DataSet that is the result of a Cross transformation. | 
static class  | 
CrossOperator.DefaultCross<I1,I2>
A Cross transformation that wraps pairs of crossed elements into Tuple2. | 
static class  | 
CrossOperator.ProjectCross<I1,I2,OUT extends org.apache.flink.api.java.tuple.Tuple>
A Cross transformation that projects crossing elements or fields of crossing Tuples into result Tuples. | 
class  | 
DataSource<OUT>
An operation that creates a new data set (data source). 
 | 
static class  | 
DeltaIteration.SolutionSetPlaceHolder<ST>
A DataSet that acts as a placeholder for the solution set during the iteration. | 
static class  | 
DeltaIteration.WorksetPlaceHolder<WT>
A DataSet that acts as a placeholder for the workset during the iteration. | 
class  | 
DeltaIterationResultSet<ST,WT>
Resulting DataSet of a delta iteration operation. | 
class  | 
DistinctOperator<T>
This operator represents the application of a "distinct" function on a data set, and the result
 data set produced by the function. 
 | 
class  | 
FilterOperator<T>
This operator represents the application of a "filter" function on a data set, and the result
 data set produced by the function. 
 | 
class  | 
FlatMapOperator<IN,OUT>
This operator represents the application of a "flatMap" function on a data set, and the result
 data set produced by the function. 
 | 
class  | 
GroupCombineOperator<IN,OUT>
This operator behaves like the GroupReduceOperator with Combine but only runs the Combine part
 which reduces all data locally in their partitions. 
 | 
class  | 
GroupReduceOperator<IN,OUT>
This operator represents the application of a "reduceGroup" function on a data set, and the
 result data set produced by the function. 
 | 
class  | 
IterativeDataSet<T>
The IterativeDataSet represents the start of an iteration. 
 | 
class  | 
JoinOperator<I1,I2,OUT>
A DataSet that is the result of a Join transformation. | 
static class  | 
JoinOperator.DefaultJoin<I1,I2>
A Join transformation that wraps pairs of joining elements into Tuple2. | 
static class  | 
JoinOperator.EquiJoin<I1,I2,OUT>
A Join transformation that applies a JoinFunction on each pair of joining elements. | 
static class  | 
JoinOperator.ProjectJoin<I1,I2,OUT extends org.apache.flink.api.java.tuple.Tuple>
A Join transformation that projects joining elements or fields of joining Tuples into result Tuples. | 
class  | 
MapOperator<IN,OUT>
This operator represents the application of a "map" function on a data set, and the result data
 set produced by the function. 
 | 
class  | 
MapPartitionOperator<IN,OUT>
This operator represents the application of a "mapPartition" function on a data set, and the
 result data set produced by the function. 
 | 
class  | 
NoOpOperator<IN>
This operator will be ignored during translation. 
 | 
class  | 
Operator<OUT,O extends Operator<OUT,O>>
Base class of all operators in the Java API. 
 | 
class  | 
PartitionOperator<T>
This operator represents a partitioning. 
 | 
class  | 
ProjectOperator<IN,OUT extends org.apache.flink.api.java.tuple.Tuple>
This operator represents the application of a projection operation on a data set, and the result
 data set produced by the function. 
 | 
class  | 
ReduceOperator<IN>
This operator represents the application of a "reduce" function on a data set, and the result
 data set produced by the function. 
 | 
class  | 
SingleInputOperator<IN,OUT,O extends SingleInputOperator<IN,OUT,O>>
Base class for operations that operate on a single input data set. 
 | 
class  | 
SingleInputUdfOperator<IN,OUT,O extends SingleInputUdfOperator<IN,OUT,O>>
The SingleInputUdfOperator is the base class of all unary operators that execute
 user-defined functions (UDFs). 
 | 
class  | 
SortPartitionOperator<T>
This operator represents a DataSet with locally sorted partitions. 
 | 
class  | 
TwoInputOperator<IN1,IN2,OUT,O extends TwoInputOperator<IN1,IN2,OUT,O>>
Base class for operations that operate on two input data sets. 
 | 
class  | 
TwoInputUdfOperator<IN1,IN2,OUT,O extends TwoInputUdfOperator<IN1,IN2,OUT,O>>
The TwoInputUdfOperator is the base class of all binary operators that execute
 user-defined functions (UDFs). 
 | 
class  | 
UnionOperator<T>
Java API operator for union of two data sets. 
 | 
| Modifier and Type | Field and Description | 
|---|---|
protected DataSet<T> | 
Grouping.inputDataSet  | 
| Modifier and Type | Method and Description | 
|---|---|
DataSet<ST> | 
DeltaIteration.closeWith(DataSet<ST> solutionSetDelta,
         DataSet<WT> newWorkset)
Closes the delta iteration. 
 | 
DataSet<T> | 
IterativeDataSet.closeWith(DataSet<T> iterationResult)
Closes the iteration. 
 | 
DataSet<T> | 
IterativeDataSet.closeWith(DataSet<T> iterationResult,
         DataSet<?> terminationCriterion)
Closes the iteration and specifies a termination criterion. 
 | 
DataSet<OUT> | 
CustomUnaryOperation.createResult()  | 
DataSet<T> | 
DataSink.getDataSet()  | 
DataSet<ST> | 
DeltaIteration.getInitialSolutionSet()
Gets the initial solution set. 
 | 
DataSet<WT> | 
DeltaIteration.getInitialWorkset()
Gets the initial workset. 
 | 
DataSet<IN> | 
NoOpOperator.getInput()  | 
DataSet<IN> | 
SingleInputOperator.getInput()
Gets the data set that this operation uses as its input. 
 | 
DataSet<IN1> | 
TwoInputOperator.getInput1()
Gets the data set that this operation uses as its first input. 
 | 
DataSet<IN2> | 
TwoInputOperator.getInput2()
Gets the data set that this operation uses as its second input. 
 | 
DataSet<T> | 
Grouping.getInputDataSet()
Returns the input DataSet of a grouping operation, that is the one before the grouping. 
 | 
DataSet<T> | 
BulkIterationResultSet.getNextPartialSolution()  | 
DataSet<ST> | 
DeltaIterationResultSet.getNextSolutionSet()  | 
DataSet<WT> | 
DeltaIterationResultSet.getNextWorkset()  | 
DataSet<?> | 
BulkIterationResultSet.getTerminationCriterion()  | 
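
The two `IterativeDataSet.closeWith` variants above close a bulk iteration, optionally with a termination criterion. The following plain-Java model sketches that control flow. It assumes the loop stops either after `maxIterations` steps or as soon as the criterion data set, here derived from the newest partial solution, is empty. `BulkIterationSketch` and its parameters are hypothetical names, `List` stands in for `DataSet`, and no Flink classes are used.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;
import java.util.function.UnaryOperator;
import java.util.stream.Collectors;

final class BulkIterationSketch {
    /**
     * Applies the step function repeatedly. Pass null as terminationCriterion
     * to model the closeWith(iterationResult) variant without a criterion.
     */
    static <T> List<T> iterate(List<T> initial, int maxIterations,
                               UnaryOperator<List<T>> step,
                               Function<List<T>, List<T>> terminationCriterion) {
        List<T> partial = initial;
        for (int i = 0; i < maxIterations; i++) {
            partial = step.apply(partial); // next partial solution
            if (terminationCriterion != null
                    && terminationCriterion.apply(partial).isEmpty()) {
                break; // an empty criterion ends the iteration early
            }
        }
        return partial;
    }

    public static void main(String[] args) {
        List<Integer> start = Arrays.asList(1, 2, 3);
        List<Integer> result = iterate(start, 100,
                xs -> xs.stream().map(x -> x * 2).collect(Collectors.toList()),
                // keep iterating while any element is still below 10
                xs -> xs.stream().filter(x -> x < 10).collect(Collectors.toList()));
        // prints [16, 32, 48]
        System.out.println(result);
    }
}
```
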
| Modifier and Type | Method and Description | 
|---|---|
Map<String,DataSet<?>> | 
TwoInputUdfOperator.getBroadcastSets()  | 
Map<String,DataSet<?>> | 
SingleInputUdfOperator.getBroadcastSets()  | 
Map<String,DataSet<?>> | 
UdfOperator.getBroadcastSets()
Gets the broadcast sets (name and data set) that have been added to context of the UDF. 
 | 
| Modifier and Type | Method and Description | 
|---|---|
DataSet<ST> | 
DeltaIteration.closeWith(DataSet<ST> solutionSetDelta,
         DataSet<WT> newWorkset)
Closes the delta iteration. 
 | 
DataSet<T> | 
IterativeDataSet.closeWith(DataSet<T> iterationResult)
Closes the iteration. 
 | 
DataSet<T> | 
IterativeDataSet.closeWith(DataSet<T> iterationResult,
         DataSet<?> terminationCriterion)
Closes the iteration and specifies a termination criterion. 
 | 
void | 
CustomUnaryOperation.setInput(DataSet<IN> inputData)  | 
void | 
NoOpOperator.setInput(DataSet<IN> input)  | 
O | 
TwoInputUdfOperator.withBroadcastSet(DataSet<?> data,
                String name)  | 
O | 
SingleInputUdfOperator.withBroadcastSet(DataSet<?> data,
                String name)  | 
O | 
UdfOperator.withBroadcastSet(DataSet<?> data,
                String name)
Adds a certain data set as a broadcast set to this operator. 
 | 
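
`DeltaIteration.closeWith(solutionSetDelta, newWorkset)` takes two data sets: the changes to merge into the solution set and the workset for the next round. The plain-Java model below illustrates that idea under stated assumptions: the solution set is keyed, delta entries replace entries with the same key, and the loop ends when the workset is empty or `maxIterations` is reached. `DeltaIterationSketch`, `Step`, and `minLabels` are hypothetical names, and the example graph is made up for illustration. No Flink classes are used.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class DeltaIterationSketch {
    /** One round: reads solution set and workset, returns the delta, fills nextWorkset. */
    interface Step<K, V, W> {
        Map<K, V> apply(Map<K, V> solutionSet, List<W> workset, List<W> nextWorkset);
    }

    static <K, V, W> Map<K, V> iterateDelta(Map<K, V> initialSolution, List<W> initialWorkset,
                                            int maxIterations, Step<K, V, W> step) {
        Map<K, V> solution = new HashMap<>(initialSolution);
        List<W> workset = initialWorkset;
        for (int i = 0; i < maxIterations && !workset.isEmpty(); i++) {
            List<W> next = new ArrayList<>();
            Map<K, V> delta = step.apply(solution, workset, next);
            solution.putAll(delta); // delta entries replace entries with the same key
            workset = next;         // only changed elements are processed again
        }
        return solution;
    }

    /** Hypothetical example: propagate the smallest vertex id along the edges 1-2 and 2-3. */
    static Map<Integer, Integer> minLabels() {
        Map<Integer, int[]> neighbors = new HashMap<>();
        neighbors.put(1, new int[] {2});
        neighbors.put(2, new int[] {1, 3});
        neighbors.put(3, new int[] {2});

        Map<Integer, Integer> labels = new HashMap<>();
        List<int[]> workset = new ArrayList<>(); // entries are {vertex, candidateLabel}
        for (int v : neighbors.keySet()) {
            labels.put(v, v);
            for (int n : neighbors.get(v)) {
                workset.add(new int[] {n, v});
            }
        }

        return iterateDelta(labels, workset, 10, (solution, work, next) -> {
            Map<Integer, Integer> delta = new HashMap<>();
            for (int[] c : work) {
                int current = delta.getOrDefault(c[0], solution.get(c[0]));
                if (c[1] < current) {
                    delta.put(c[0], c[1]); // smaller label found for this vertex
                }
            }
            for (Map.Entry<Integer, Integer> e : delta.entrySet()) {
                for (int n : neighbors.get(e.getKey())) {
                    next.add(new int[] {n, e.getValue()}); // notify neighbors of the change
                }
            }
            return delta;
        });
    }

    public static void main(String[] args) {
        // prints {1=1, 2=1, 3=1}
        System.out.println(minLabels());
    }
}
```

The design point the sketch tries to show is that each round only touches elements in the workset, so work shrinks as the solution converges.
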
| Constructor and Description | 
|---|
AggregateOperator(DataSet<IN> input,
                 Aggregations function,
                 int field,
                 String aggregateLocationName)
Non grouped aggregation. 
 | 
CoGroupOperator(DataSet<I1> input1,
               DataSet<I2> input2,
               org.apache.flink.api.common.operators.Keys<I1> keys1,
               org.apache.flink.api.common.operators.Keys<I2> keys2,
               org.apache.flink.api.common.functions.CoGroupFunction<I1,I2,OUT> function,
               org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
               List<org.apache.commons.lang3.tuple.Pair<Integer,org.apache.flink.api.common.operators.Order>> groupSortKeyOrderFirst,
               List<org.apache.commons.lang3.tuple.Pair<Integer,org.apache.flink.api.common.operators.Order>> groupSortKeyOrderSecond,
               org.apache.flink.api.common.functions.Partitioner<?> customPartitioner,
               String defaultName)  | 
CoGroupOperator(DataSet<I1> input1,
               DataSet<I2> input2,
               org.apache.flink.api.common.operators.Keys<I1> keys1,
               org.apache.flink.api.common.operators.Keys<I2> keys2,
               org.apache.flink.api.common.functions.CoGroupFunction<I1,I2,OUT> function,
               org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
               org.apache.flink.api.common.functions.Partitioner<?> customPartitioner,
               String defaultName)  | 
CoGroupOperatorSets(DataSet<I1> input1,
                   DataSet<I2> input2)  | 
CoGroupRawOperator(DataSet<I1> input1,
                  DataSet<I2> input2,
                  org.apache.flink.api.common.operators.Keys<I1> keys1,
                  org.apache.flink.api.common.operators.Keys<I2> keys2,
                  org.apache.flink.api.common.functions.CoGroupFunction<I1,I2,OUT> function,
                  org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
                  String defaultName)  | 
CrossOperator(DataSet<I1> input1,
             DataSet<I2> input2,
             org.apache.flink.api.common.functions.CrossFunction<I1,I2,OUT> function,
             org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
             org.apache.flink.api.common.operators.base.CrossOperatorBase.CrossHint hint,
             String defaultName)  | 
CrossProjection(DataSet<I1> ds1,
               DataSet<I2> ds2,
               int[] firstFieldIndexes,
               int[] secondFieldIndexes,
               org.apache.flink.api.common.operators.base.CrossOperatorBase.CrossHint hint)  | 
DataSink(DataSet<T> data,
        org.apache.flink.api.common.io.OutputFormat<T> format,
        org.apache.flink.api.common.typeinfo.TypeInformation<T> type)  | 
DefaultCross(DataSet<I1> input1,
            DataSet<I2> input2,
            org.apache.flink.api.common.operators.base.CrossOperatorBase.CrossHint hint,
            String defaultName)  | 
DefaultJoin(DataSet<I1> input1,
           DataSet<I2> input2,
           org.apache.flink.api.common.operators.Keys<I1> keys1,
           org.apache.flink.api.common.operators.Keys<I2> keys2,
           org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
           String joinLocationName,
           JoinType type)  | 
DeltaIteration(ExecutionEnvironment context,
              org.apache.flink.api.common.typeinfo.TypeInformation<ST> type,
              DataSet<ST> solutionSet,
              DataSet<WT> workset,
              org.apache.flink.api.common.operators.Keys<ST> keys,
              int maxIterations)  | 
DistinctOperator(DataSet<T> input,
                org.apache.flink.api.common.operators.Keys<T> keys,
                String distinctLocationName)  | 
EquiJoin(DataSet<I1> input1,
        DataSet<I2> input2,
        org.apache.flink.api.common.operators.Keys<I1> keys1,
        org.apache.flink.api.common.operators.Keys<I2> keys2,
        org.apache.flink.api.common.functions.FlatJoinFunction<I1,I2,OUT> generatedFunction,
        org.apache.flink.api.common.functions.JoinFunction<I1,I2,OUT> function,
        org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
        org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
        String joinLocationName)  | 
EquiJoin(DataSet<I1> input1,
        DataSet<I2> input2,
        org.apache.flink.api.common.operators.Keys<I1> keys1,
        org.apache.flink.api.common.operators.Keys<I2> keys2,
        org.apache.flink.api.common.functions.FlatJoinFunction<I1,I2,OUT> generatedFunction,
        org.apache.flink.api.common.functions.JoinFunction<I1,I2,OUT> function,
        org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
        org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
        String joinLocationName,
        JoinType type)  | 
EquiJoin(DataSet<I1> input1,
        DataSet<I2> input2,
        org.apache.flink.api.common.operators.Keys<I1> keys1,
        org.apache.flink.api.common.operators.Keys<I2> keys2,
        org.apache.flink.api.common.functions.FlatJoinFunction<I1,I2,OUT> function,
        org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
        org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
        String joinLocationName)  | 
EquiJoin(DataSet<I1> input1,
        DataSet<I2> input2,
        org.apache.flink.api.common.operators.Keys<I1> keys1,
        org.apache.flink.api.common.operators.Keys<I2> keys2,
        org.apache.flink.api.common.functions.FlatJoinFunction<I1,I2,OUT> function,
        org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
        org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
        String joinLocationName,
        JoinType type)  | 
FilterOperator(DataSet<T> input,
              org.apache.flink.api.common.functions.FilterFunction<T> function,
              String defaultName)  | 
FlatMapOperator(DataSet<IN> input,
               org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType,
               org.apache.flink.api.common.functions.FlatMapFunction<IN,OUT> function,
               String defaultName)  | 
GroupCombineOperator(DataSet<IN> input,
                    org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType,
                    org.apache.flink.api.common.functions.GroupCombineFunction<IN,OUT> function,
                    String defaultName)
Constructor for a non-grouped reduce (all reduce). 
 | 
Grouping(DataSet<T> set,
        org.apache.flink.api.common.operators.Keys<T> keys)  | 
GroupReduceOperator(DataSet<IN> input,
                   org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType,
                   org.apache.flink.api.common.functions.GroupReduceFunction<IN,OUT> function,
                   String defaultName)
Constructor for a non-grouped reduce (all reduce). 
 | 
IterativeDataSet(ExecutionEnvironment context,
                org.apache.flink.api.common.typeinfo.TypeInformation<T> type,
                DataSet<T> input,
                int maxIterations)  | 
JoinOperator(DataSet<I1> input1,
            DataSet<I2> input2,
            org.apache.flink.api.common.operators.Keys<I1> keys1,
            org.apache.flink.api.common.operators.Keys<I2> keys2,
            org.apache.flink.api.common.typeinfo.TypeInformation<OUT> returnType,
            org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
            JoinType type)  | 
JoinOperatorSets(DataSet<I1> input1,
                DataSet<I2> input2)  | 
JoinOperatorSets(DataSet<I1> input1,
                DataSet<I2> input2,
                org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint)  | 
MapOperator(DataSet<IN> input,
           org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType,
           org.apache.flink.api.common.functions.MapFunction<IN,OUT> function,
           String defaultName)  | 
MapPartitionOperator(DataSet<IN> input,
                    org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType,
                    org.apache.flink.api.common.functions.MapPartitionFunction<IN,OUT> function,
                    String defaultName)  | 
NoOpOperator(DataSet<IN> input,
            org.apache.flink.api.common.typeinfo.TypeInformation<IN> resultType)  | 
PartitionOperator(DataSet<T> input,
                 org.apache.flink.api.common.operators.Keys<T> pKeys,
                 org.apache.flink.api.common.functions.Partitioner<?> customPartitioner,
                 String partitionLocationName)  | 
PartitionOperator(DataSet<T> input,
                 org.apache.flink.api.common.operators.Keys<T> pKeys,
                 org.apache.flink.api.common.functions.Partitioner<P> customPartitioner,
                 org.apache.flink.api.common.typeinfo.TypeInformation<P> partitionerTypeInfo,
                 String partitionLocationName)  | 
PartitionOperator(DataSet<T> input,
                 org.apache.flink.api.common.operators.base.PartitionOperatorBase.PartitionMethod pMethod,
                 org.apache.flink.api.common.operators.Keys<T> pKeys,
                 org.apache.flink.api.common.distributions.DataDistribution distribution,
                 String partitionLocationName)  | 
PartitionOperator(DataSet<T> input,
                 org.apache.flink.api.common.operators.base.PartitionOperatorBase.PartitionMethod pMethod,
                 org.apache.flink.api.common.operators.Keys<T> pKeys,
                 String partitionLocationName)  | 
PartitionOperator(DataSet<T> input,
                 org.apache.flink.api.common.operators.base.PartitionOperatorBase.PartitionMethod pMethod,
                 String partitionLocationName)  | 
ProjectCross(DataSet<I1> input1,
            DataSet<I2> input2,
            int[] fields,
            boolean[] isFromFirst,
            org.apache.flink.api.java.typeutils.TupleTypeInfo<OUT> returnType,
            CrossOperator.CrossProjection<I1,I2> crossProjection,
            org.apache.flink.api.common.operators.base.CrossOperatorBase.CrossHint hint)  | 
ProjectCross(DataSet<I1> input1,
            DataSet<I2> input2,
            int[] fields,
            boolean[] isFromFirst,
            org.apache.flink.api.java.typeutils.TupleTypeInfo<OUT> returnType,
            org.apache.flink.api.common.operators.base.CrossOperatorBase.CrossHint hint)  | 
Projection(DataSet<T> ds,
          int[] fieldIndexes)  | 
ProjectJoin(DataSet<I1> input1,
           DataSet<I2> input2,
           org.apache.flink.api.common.operators.Keys<I1> keys1,
           org.apache.flink.api.common.operators.Keys<I2> keys2,
           org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
           int[] fields,
           boolean[] isFromFirst,
           org.apache.flink.api.java.typeutils.TupleTypeInfo<OUT> returnType)  | 
ProjectJoin(DataSet<I1> input1,
           DataSet<I2> input2,
           org.apache.flink.api.common.operators.Keys<I1> keys1,
           org.apache.flink.api.common.operators.Keys<I2> keys2,
           org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
           int[] fields,
           boolean[] isFromFirst,
           org.apache.flink.api.java.typeutils.TupleTypeInfo<OUT> returnType,
           org.apache.flink.api.java.operators.JoinOperator.JoinProjection<I1,I2> joinProj)  | 
ProjectOperator(DataSet<IN> input,
               int[] fields,
               org.apache.flink.api.java.typeutils.TupleTypeInfo<OUT> returnType)  | 
ReduceOperator(DataSet<IN> input,
              org.apache.flink.api.common.functions.ReduceFunction<IN> function,
              String defaultName)
Constructor for the reduce-all case (in contrast to the reduce-per-group case). 
 | 
SingleInputOperator(DataSet<IN> input,
                   org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType)  | 
SingleInputUdfOperator(DataSet<IN> input,
                      org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType)
Creates a new operator with the given data set as input. 
 | 
SortedGrouping(DataSet<T> set,
              org.apache.flink.api.common.operators.Keys<T> keys,
              int field,
              org.apache.flink.api.common.operators.Order order)  | 
SortedGrouping(DataSet<T> set,
              org.apache.flink.api.common.operators.Keys<T> keys,
              org.apache.flink.api.common.operators.Keys.SelectorFunctionKeys<T,K> keySelector,
              org.apache.flink.api.common.operators.Order order)  | 
SortedGrouping(DataSet<T> set,
              org.apache.flink.api.common.operators.Keys<T> keys,
              String field,
              org.apache.flink.api.common.operators.Order order)  | 
SortPartitionOperator(DataSet<T> dataSet,
                     int sortField,
                     org.apache.flink.api.common.operators.Order sortOrder,
                     String sortLocationName)  | 
SortPartitionOperator(DataSet<T> dataSet,
                     org.apache.flink.api.common.operators.Keys.SelectorFunctionKeys<T,K> sortKey,
                     org.apache.flink.api.common.operators.Order sortOrder,
                     String sortLocationName)  | 
SortPartitionOperator(DataSet<T> dataSet,
                     String sortField,
                     org.apache.flink.api.common.operators.Order sortOrder,
                     String sortLocationName)  | 
TwoInputOperator(DataSet<IN1> input1,
                DataSet<IN2> input2,
                org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType)  | 
TwoInputUdfOperator(DataSet<IN1> input1,
                   DataSet<IN2> input2,
                   org.apache.flink.api.common.typeinfo.TypeInformation<OUT> resultType)
Creates a new operator with the two given data sets as inputs. 
 | 
UnionOperator(DataSet<T> input1,
             DataSet<T> input2,
             String unionLocationName)
Creates an operator that produces the union of the two given data sets. 
 | 
UnsortedGrouping(DataSet<T> set,
                org.apache.flink.api.common.operators.Keys<T> keys)  | 
| Modifier and Type | Field and Description | 
|---|---|
protected DataSet<I1> | 
JoinOperatorSetsBase.input1  | 
protected DataSet<I2> | 
JoinOperatorSetsBase.input2  | 
| Constructor and Description | 
|---|
JoinOperatorSetsBase(DataSet<I1> input1,
                    DataSet<I2> input2)  | 
JoinOperatorSetsBase(DataSet<I1> input1,
                    DataSet<I2> input2,
                    org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint)  | 
JoinOperatorSetsBase(DataSet<I1> input1,
                    DataSet<I2> input2,
                    org.apache.flink.api.common.operators.base.JoinOperatorBase.JoinHint hint,
                    JoinType type)  | 
| Modifier and Type | Method and Description | 
|---|---|
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Integer,Long>> | 
DataSetUtils.countElementsPerPartition(DataSet<T> input)
Iterates over all elements in each partition to count the number of elements per partition. 
 | 
static <T> DataSet<T> | 
DataSetUtils.sampleWithSize(DataSet<T> input,
              boolean withReplacement,
              int numSamples)
Generates a sample of the DataSet that contains a fixed number of elements. 
 | 
static <T> DataSet<T> | 
DataSetUtils.sampleWithSize(DataSet<T> input,
              boolean withReplacement,
              int numSamples,
              long seed)
Generates a sample of the DataSet that contains a fixed number of elements. 
 | 
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Long,T>> | 
DataSetUtils.zipWithIndex(DataSet<T> input)
Assigns a unique Long value to each element in the input data set. | 
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Long,T>> | 
DataSetUtils.zipWithUniqueId(DataSet<T> input)
Assigns a unique Long value to each element in the input data set, as described below. | 
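
The relationship between `countElementsPerPartition` and `zipWithIndex` can be illustrated with a minimal plain-Java sketch. It is not Flink code and does not claim to reproduce Flink's implementation: partitions are simulated as nested lists, and the class name `ZipWithIndexSketch` and the `index:element` string output are assumptions made for illustration. The idea shown is that per-partition counts yield a start offset for each partition, after which elements can be numbered consecutively.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of the Flink API.
public class ZipWithIndexSketch {

    /** Step 1: count the elements held by each simulated partition. */
    static long[] countElementsPerPartition(List<List<String>> partitions) {
        long[] counts = new long[partitions.size()];
        for (int p = 0; p < partitions.size(); p++) {
            counts[p] = partitions.get(p).size();
        }
        return counts;
    }

    /** Step 2: derive a start offset per partition, then number its elements consecutively. */
    static List<String> zipWithIndex(List<List<String>> partitions) {
        long[] counts = countElementsPerPartition(partitions);
        List<String> result = new ArrayList<>();
        long offset = 0;
        for (int p = 0; p < partitions.size(); p++) {
            long index = offset;
            for (String element : partitions.get(p)) {
                result.add(index + ":" + element);
                index++;
            }
            // The next partition starts where this one ended.
            offset += counts[p];
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> partitions = Arrays.asList(
                Arrays.asList("a", "b"),
                Arrays.asList("c"),
                Arrays.asList("d", "e"));
        System.out.println(zipWithIndex(partitions)); // prints [0:a, 1:b, 2:c, 3:d, 4:e]
    }
}
```

In a real job the counting step requires a pass over the data before indices can be assigned, which is the cost that a scheme such as `zipWithUniqueId` avoids by giving up consecutive numbering.
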
| Modifier and Type | Method and Description | 
|---|---|
static <T> Utils.ChecksumHashCode | 
DataSetUtils.checksumHashCode(DataSet<T> input)
Deprecated. Replaced with org.apache.flink.graph.asm.dataset.ChecksumHashCode in Gelly. | 
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Integer,Long>> | 
DataSetUtils.countElementsPerPartition(DataSet<T> input)
Iterates over all elements in each partition to count the number of elements per partition. 
 | 
static <T> PartitionOperator<T> | 
DataSetUtils.partitionByRange(DataSet<T> input,
                org.apache.flink.api.common.distributions.DataDistribution distribution,
                int... fields)
Range-partitions a DataSet on the specified tuple field positions. 
 | 
static <T,K extends Comparable<K>> PartitionOperator<T> | 
DataSetUtils.partitionByRange(DataSet<T> input,
                org.apache.flink.api.common.distributions.DataDistribution distribution,
                org.apache.flink.api.java.functions.KeySelector<T,K> keyExtractor)
Range-partitions a DataSet using the specified key selector function. 
 | 
static <T> PartitionOperator<T> | 
DataSetUtils.partitionByRange(DataSet<T> input,
                org.apache.flink.api.common.distributions.DataDistribution distribution,
                String... fields)
Range-partitions a DataSet on the specified fields. 
 | 
static <T> MapPartitionOperator<T,T> | 
DataSetUtils.sample(DataSet<T> input,
      boolean withReplacement,
      double fraction)
Generates a sample of the DataSet, selecting each element with the given probability fraction. 
 | 
static <T> MapPartitionOperator<T,T> | 
DataSetUtils.sample(DataSet<T> input,
      boolean withReplacement,
      double fraction,
      long seed)
Generates a sample of the DataSet, selecting each element with the given probability fraction. 
 | 
static <T> DataSet<T> | 
DataSetUtils.sampleWithSize(DataSet<T> input,
              boolean withReplacement,
              int numSamples)
Generates a sample of the DataSet that contains a fixed number of elements. 
 | 
static <T> DataSet<T> | 
DataSetUtils.sampleWithSize(DataSet<T> input,
              boolean withReplacement,
              int numSamples,
              long seed)
Generates a sample of the DataSet that contains a fixed number of elements. 
 | 
static <R extends org.apache.flink.api.java.tuple.Tuple,T extends org.apache.flink.api.java.tuple.Tuple> R | 
DataSetUtils.summarize(DataSet<T> input)
Summarizes a DataSet of Tuples by collecting single-pass statistics for all columns. 
 | 
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Long,T>> | 
DataSetUtils.zipWithIndex(DataSet<T> input)
Assigns a unique Long value to each element in the input data set. | 
static <T> DataSet<org.apache.flink.api.java.tuple.Tuple2<Long,T>> | 
DataSetUtils.zipWithUniqueId(DataSet<T> input)
Assigns a unique Long value to each element in the input data set, as described below. | 
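
To make the `sampleWithSize(input, withReplacement, numSamples, seed)` contract concrete, here is a minimal plain-Java sketch of fixed-size sampling without replacement using reservoir sampling. It is an illustration under stated assumptions rather than Flink's implementation: the class name `SampleWithSizeSketch` is hypothetical, the input is an in-memory list instead of a `DataSet`, and the with-replacement variant is not covered.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Hypothetical illustration class; not part of the Flink API.
public class SampleWithSizeSketch {

    /**
     * Returns min(numSamples, input.size()) elements chosen without replacement.
     * The same seed always produces the same sample for the same input.
     */
    static <T> List<T> sampleWithSize(List<T> input, int numSamples, long seed) {
        if (numSamples < 0) {
            throw new IllegalArgumentException("numSamples must be non-negative");
        }
        Random random = new Random(seed);
        List<T> reservoir = new ArrayList<>(numSamples);
        for (int i = 0; i < input.size(); i++) {
            if (i < numSamples) {
                // Fill the reservoir with the first numSamples elements.
                reservoir.add(input.get(i));
            } else {
                // Element i replaces a random slot with probability numSamples / (i + 1).
                int j = random.nextInt(i + 1);
                if (j < numSamples) {
                    reservoir.set(j, input.get(i));
                }
            }
        }
        return reservoir;
    }

    public static void main(String[] args) {
        List<Integer> input = new ArrayList<>();
        for (int i = 0; i < 100; i++) {
            input.add(i);
        }
        System.out.println(sampleWithSize(input, 5, 42L));
    }
}
```

Passing an explicit seed, as the four-argument overload allows, makes a sample reproducible across runs, which is useful in tests.
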
Copyright © 2014–2022 The Apache Software Foundation. All rights reserved.