Case class ContinuousMetric with all corresponding Metrics
Case class ContinuousMetric with all corresponding Metrics
: the name of the variable
: the metric function
Function to compute the Dataframe with Category, Count and Frequencies obtain from the initial Dataframe
Function to compute the Dataframe with Category, Count and Frequencies obtain from the initial Dataframe
: column of the variable.
: initial DataFrame.
(Column, DataFrame) : tuple2 of the column of the variable and the initial Dataframe
Function to compute the DataFrame metrics by row
Function to compute the DataFrame metrics by row
: initial DataFrame.
: name list of all variables.
: list of metrics you want to calculate.
DataFrame : DataFrame metric of all variables by row.
Function to compute and to combine all the partial DataFrame metric by variable (to get one DataFrame by row).
Function to compute and to combine all the partial DataFrame metric by variable (to get one DataFrame by row).
: initial DataFrame.
: name of the variable.
: list of metrics you want to calculate.
DataFrame : DataFrame with alle the metric by variable by row
List of all available metrics
Customize catCountFreq for discrete variable
Customize Category for discrete variable
Customize Category for discrete variable
: couple of name of the variable and the dataframe obtain from categoryCountFreqDataframe()
Column : the computed value of the function metricCategory
Customize Count Discrete for discrete variable
Customize Count Discrete for discrete variable
: couple of name of the variable and the dataframe obtain from categoryCountFreqDataframe()
Column : the computed value of the function metricCountDiscret
Customize CountDistinct for discrete variable
Customize CountDistinct for discrete variable
: couple of name of the variable and the dataframe obtain from categoryCountFreqDataframe()
Column : the computed value of the function metricCountDistinct
Customize missing values
Customize missing values
: the column
Integer : the number of missing values, NaN values and null values
Customize number of Missing Values for discrete variable
Customize number of Missing Values for discrete variable
: couple of name of the variable and the dataframe obtain from categoryCountFreqDataframe()
Column : the computed value of the function metricMissingValues
Customize Count Distinct for discrete variable
Customize Count Distinct for discrete variable
: couple of name of the variable and the dataframe obtain from categoryCountFreqDataframe()
Column : the computed value of the function metricCountDistinct
Customize mean of the column e
Customize mean of the column e
: the column
Integer : the computed value of the mean
Customize Median of the column e
Customize Median of the column e
: the column
Integer : the computed value of the Median
Customize function metric in the case continuous variabes used for : mean, variance and stddev
Customize function metric in the case continuous variabes used for : mean, variance and stddev
: the column
: the name of the metric
: the metric function
: the computed value of the function
Customize Metric Discret for discrete variable
Customize Metric Discret for discrete variable
: name of the column
: the dataframe obtain from categoryCountFreqDataframe()
: te metric name
: the metric function
Column : the computed value of the function
Customize function metric in the case continuous variabes used for : percentile 25, median and percentile75
Customize function metric in the case continuous variabes used for : percentile 25, median and percentile75
: the column
: the name of the metric
: the metric function
: the approximation method
: the value to pass to stat_method
Customize Stddev of the column e
Customize Stddev of the column e
: the name of the column
Integer : the computed value of the Stddev
Customize variance of the column e
Customize variance of the column e
: the name of the column
Integer : the computed value of the variance
Function to compute the Dataframe metric by variable
Function to compute the Dataframe metric by variable
: tuple of column variable and the Dataframe with Category, Count and Frequencies obtain from categoryCountFreqDataframe()
: list of metrics you want to calculate.
Dataframe : with all the values of discrete metrics
List of all available metrics.
Function to extract the column that contains the list of struct cat_count_freq
Function to extract the column that contains the list of category
Function to extract the column that contains the list of category
: the data frame obtain from categoryCountFreqDataframe()
Column : of that contain the list of category values
Function to extract the column that contains the list of CountDiscret
Function to extract the column that contains the list of CountDiscret
: the data frame obtain from categoryCountFreqDataframe()
Column : of that contain the list of CountDiscrete values
Function to extract the column that contains the list of CountDistinct
Function to extract the column that contains the list of CountDistinct
: the data frame obtain from categoryCountFreqDataframe()
Column : of that contain the list of CountDistinct values
Function to extract the column that contains the list of frequencies
Function to extract the column that contains the list of frequencies
: the data frame obtain from categoryCountFreqDataframe()
Column : of that contain the list of frequencies values
Function to extract the column that contains the list of number of Missing values
Function to extract the column that contains the list of number of Missing values
: the data frame obtain from categoryCountFreqDataframe()
Column : of that contain the list of Missing Values values
Customize percentile of order 0.25 of the column e
Customize percentile of order 0.25 of the column e
: the column
Integer : the computed value of the percentile of order 0.25
Customize percentile of order 0.75 of the column e
Customize percentile of order 0.75 of the column e
: the column
Integer : the computed value of the percentile of order 0.75
Function to regroup and reformat all metrics for a given variable
Function to regroup and reformat all metrics for a given variable
: the name of the column.
: the DataFrame of all the computed metrics for each variable by columns.
: the DataFrame metric associated to the variable (namecol).