Applies an optional additional transformation
Applies columnBlackList and columnWhitelist
Applies type casting from decimal to integral/float
Applies the transformers
Applies the execution mode to partition values
Applies filterClauseExpr
Applies all the transformations above
Check the plausibility of the latest timestamp of a DataFrame against a given timestamp. Throws an exception if the check fails.
timestamp to compare with
DataFrame to compare with
the timestamp column of the dataframe
Enriches SparkSubFeed with DataFrame if not existing
input data object.
input SubFeed.
Remove all columns in blacklist from a DataFrame.
columns to remove
DataFrame to be filtered
DataFrame with all columns in blacklist removed
Filter DataFrame with given partition values
DataFrame to filter
partition values to use as filter condition
filtered DataFrame
Removes all columns from a DataFrame except those specified in whitelist.
columns to keep
DataFrame to be filtered
DataFrame with all columns removed except those specified in whitelist
applies multiple transformations to a sequence of subfeeds
Replace all special characters in a String with underscores. Used to get valid temp view names.
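The replacement described above can be sketched in plain Scala. This is a minimal illustration, not the library's implementation: the object name `TempViewNames`, the function name, and the assumption that "special" means any character outside `[a-zA-Z0-9_]` are all hypothetical.

```scala
object TempViewNames {
  // Assumption: a "special character" is anything other than an ASCII letter,
  // digit or underscore. Each such character is replaced by one underscore.
  def replaceSpecialCharactersWithUnderscore(str: String): String =
    str.replaceAll("[^a-zA-Z0-9_]", "_")
}
```

Under that assumption, an identifier such as `my-data.object 1` becomes `my_data_object_1`, which can be used as a temp view name.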
search common inits between two partition column definitions
search greatest common init between two partition column definitions
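An "init" here is a leading prefix of a partition column list. The sketch below shows one way the two searches could work on plain `Seq[String]` values. The object name `PartitionInits`, both signatures, and the choice to ignore the empty prefix are assumptions made for illustration.

```scala
object PartitionInits {
  // All non-empty prefixes ("inits") shared by both partition column lists,
  // ordered from longest to shortest.
  def searchCommonInits(a: Seq[String], b: Seq[String]): Seq[Seq[String]] =
    a.inits.toSeq.intersect(b.inits.toSeq).filter(_.nonEmpty)

  // The longest shared prefix, or None if the lists share no leading column.
  def searchGreatestCommonInit(a: Seq[String], b: Seq[String]): Option[Seq[String]] = {
    val common = searchCommonInits(a, b)
    if (common.isEmpty) None else Some(common.maxBy(_.size))
  }
}
```

For the definitions `Seq("dt", "region", "shop")` and `Seq("dt", "region", "type")`, the common inits are `Seq("dt", "region")` and `Seq("dt")`, and the greatest common init is `Seq("dt", "region")`.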
transform sequence of subfeeds
create util literal column from
Updates the partition values of a SubFeed to match the partition columns of an output, removing from the partition values any columns that do not exist in the output. In addition, the transformed DataFrame is validated to include the output's partition columns.
output DataObject
SubFeed with transformed DataFrame
SubFeed with updated partition values.
Validate that DataFrame contains a given list of columns, throwing an exception otherwise.
DataFrame to validate
Columns that must exist in DataFrame
name to mention in exception
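The column validation above comes down to a set difference on column names. The following sketch works on plain column name sequences (for a Spark DataFrame, `df.columns` would supply them) so that it runs without Spark. The object name `ColumnValidation`, the parameter names, and the use of `IllegalArgumentException` are assumptions, since the source does not say which exception is thrown.

```scala
object ColumnValidation {
  // dfColumns: column names present in the DataFrame (e.g. df.columns.toSeq)
  // required:  columns that must exist
  // role:      name to mention in the exception message
  def validateColumnsExist(dfColumns: Seq[String], required: Seq[String], role: String): Unit = {
    val missing = required.diff(dfColumns)
    // Assumption: the exception type is illustrative only.
    if (missing.nonEmpty)
      throw new IllegalArgumentException(
        s"DataFrame is missing $role columns: ${missing.mkString(", ")}")
  }
}
```

The call returns normally when every required column is present and throws otherwise, listing the missing columns together with the given name.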