Aggregate function: returns the approximate number of distinct items in a group.
Aggregate function: returns the approximate number of distinct items in a group.
maximum estimation error allowed (default = 0.05) apache/spark
Aggregate function: returns the approximate number of distinct items in a group.
Aggregate function: returns the approximate number of distinct items in a group.
Aggregate function: returns the average of the values in a group.
Aggregate function: returns the average of the values in a group.
apache/spark
Aggregate function: returns a list of objects with duplicates.
Aggregate function: returns a list of objects with duplicates.
apache/spark
Aggregate function: returns a set of objects with duplicate elements eliminated.
Aggregate function: returns a set of objects with duplicate elements eliminated.
apache/spark
Aggregate function: returns the Pearson Correlation Coefficient for two columns.
Aggregate function: returns the Pearson Correlation Coefficient for two columns.
In Spark corr always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Corr.scala#L95 apache/spark
Aggregate function: returns the number of items in a group for which the selected column is not null.
Aggregate function: returns the number of items in a group for which the selected column is not null.
apache/spark
Aggregate function: returns the number of items in a group.
Aggregate function: returns the number of items in a group.
apache/spark
Aggregate function: returns the number of distinct items in a group.
Aggregate function: returns the number of distinct items in a group.
apache/spark
Aggregate function: returns the covariance of two collumns.
Aggregate function: returns the covariance of two collumns.
In Spark covar_pop always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Covariance.scala#L82 apache/spark
Aggregate function: returns the covariance of two columns.
Aggregate function: returns the covariance of two columns.
In Spark covar_samp always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Covariance.scala#L93 apache/spark
Aggregate function: returns the first value in a group.
Aggregate function: returns the first value in a group.
The function by default returns the first values it sees. It will return the first non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
apache/spark
Aggregate function: returns the kurtosis of a column.
Aggregate function: returns the kurtosis of a column.
In Spark kurtosis always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#L220 apache/spark
Aggregate function: returns the last value in a group.
Aggregate function: returns the last value in a group.
The function by default returns the last values it sees. It will return the last non-null value it sees when ignoreNulls is set to true. If all values are null, then null is returned.
apache/spark
Aggregate function: returns the maximum value of the column in a group.
Aggregate function: returns the maximum value of the column in a group.
apache/spark
Aggregate function: returns the minimum value of the column in a group.
Aggregate function: returns the minimum value of the column in a group.
apache/spark
Aggregate function: returns the skewness of a column.
Aggregate function: returns the skewness of a column.
In Spark skewness always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#L200 apache/spark
Aggregate function: returns the sample standard deviation.
Aggregate function: returns the sample standard deviation.
In Spark stddev always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#155 apache/spark
Aggregate function: returns the standard deviation of a column by population.
Aggregate function: returns the standard deviation of a column by population.
In Spark stddev always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#L143 apache/spark
Aggregate function: returns the standard deviation of a column by sample.
Aggregate function: returns the standard deviation of a column by sample.
In Spark stddev always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#L160 apache/spark
Aggregate function: returns the sum of all values in the given column.
Aggregate function: returns the sum of all values in the given column.
apache/spark
Aggregate function: returns the sum of distinct values in the column.
Aggregate function: returns the sum of distinct values in the column.
apache/spark
Aggregate function: returns the unbiased variance of the values in a group.
Aggregate function: returns the unbiased variance of the values in a group.
In Spark variance always returns Double https://github.com/apache/spark/blob/4a3c09601ba69f7d49d1946bb6f20f5cfe453031/sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/CentralMomentAgg.scala#186 apache/spark