Package

za.co.absa.hermes

datasetComparison

package datasetComparison

Type Members

  1. final case class BadProvidedSchema(diffSchema: String, cause: Throwable = None.orNull) extends DatasetComparisonException with Product with Serializable

  2. case class ComparisonResult(refRowCount: Long, newRowCount: Long, refDuplicateCount: Long, newDuplicateCount: Long, passedCount: Long, usedSchemaSelector: List[Column], resultDF: Option[DataFrame], diffCount: Long = 0, passedOptions: String = "", additionalInfo: Map[String, String] = Map.empty) extends Product with Serializable

    refRowCount

    Row count of the reference data.

    newRowCount

    Row count of the new data.

    usedSchemaSelector

    Selector used to align schemas, created from the reference data schema.

    resultDF

    Result DataFrame. If None, there were no differences between the reference and new data.

    diffCount

    Number of differences, if there are any.

    passedOptions

    Raw options passed to the job by the user. Might be empty if the comparison is used as a library.

  3. class DatasetComparator extends AnyRef

    The core class of the DatasetComparison module. Use this class when using DatasetComparison as a library. To run DatasetComparison as a Spark job, use DatasetComparisonJob instead.

  4. class DatasetComparisonException extends Exception

  5. final case class DatasetsDifferException(refPath: String, stdPath: String, outPath: String, expectedCount: Long, actualCount: Long, cause: Throwable = None.orNull) extends DatasetComparisonException with Product with Serializable

  6. final case class DuplicateRowsInDF(countRef: Long, countNew: Long) extends DatasetComparisonException with Product with Serializable

  7. final case class MissingArgumentException(message: String, cause: Throwable = None.orNull) extends Exception with Product with Serializable

  8. final case class SchemasDifferException(diffSchema: String, cause: Throwable = None.orNull) extends DatasetComparisonException with Product with Serializable

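The exception types listed above fall into two groups: most extend DatasetComparisonException, while MissingArgumentException extends Exception directly. The following is a minimal sketch of how calling code might tell them apart with pattern matching. The classes here are stand-ins that copy the signatures from this page so the example compiles without the library; the constructor of the stand-in DatasetComparisonException, the message strings, and the `describe` helper are assumptions made for illustration and are not part of the documented API.

```scala
object ExceptionHandlingSketch {
  // Stand-in base class. The (msg, cause) constructor is an assumption;
  // this page only shows "class DatasetComparisonException extends Exception".
  class DatasetComparisonException(msg: String, cause: Throwable = None.orNull)
    extends Exception(msg, cause)

  // Stand-ins with the same parameter lists as the listed case classes.
  // The message texts passed to the base class are invented for this sketch.
  final case class SchemasDifferException(diffSchema: String, cause: Throwable = None.orNull)
    extends DatasetComparisonException(s"Schemas differ: $diffSchema", cause)

  final case class DuplicateRowsInDF(countRef: Long, countNew: Long)
    extends DatasetComparisonException(s"Duplicate rows: ref=$countRef, new=$countNew")

  // Note: extends Exception, not DatasetComparisonException, as on this page.
  final case class MissingArgumentException(message: String, cause: Throwable = None.orNull)
    extends Exception(message, cause)

  // Hypothetical helper: maps a failure to a short human-readable description.
  // Specific cases come first, the shared base type acts as a catch-all.
  def describe(t: Throwable): String = t match {
    case DuplicateRowsInDF(ref, nw)       => s"duplicate rows: reference=$ref, new=$nw"
    case SchemasDifferException(diff, _)  => s"schema mismatch: $diff"
    case e: DatasetComparisonException    => s"comparison failed: ${e.getMessage}"
    case MissingArgumentException(msg, _) => s"bad invocation: $msg"
    case other                            => s"unexpected: ${other.getClass.getSimpleName}"
  }
}
```

Because MissingArgumentException sits outside the DatasetComparisonException hierarchy, a handler that matches only on the base type would not see it, which is why the sketch gives it a separate case.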

Value Members

  1. object ComparisonResultProtocol extends DefaultJsonProtocol

  2. object DatasetComparisonJob

  3. package cliUtils

  4. package config

  5. package dataFrame

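As an illustration of how the ComparisonResult fields documented under Type Members might be read, here is a small sketch. It uses a hypothetical stand-in case class, `ResultStandIn`, with the same field names and defaults but with the Spark types replaced by plain Scala types (List[Column] becomes List[String], Option[DataFrame] becomes Option[Seq[String]]), so it runs without Spark. The `summarize` helper and its output format are assumptions. The only behaviour taken from this page is that an empty resultDF means no differences were found.

```scala
object ComparisonResultSketch {
  // Hypothetical stand-in for ComparisonResult with Spark types swapped out.
  final case class ResultStandIn(
    refRowCount: Long,
    newRowCount: Long,
    refDuplicateCount: Long,
    newDuplicateCount: Long,
    passedCount: Long,
    usedSchemaSelector: List[String],   // List[Column] in the real class
    resultDF: Option[Seq[String]],      // Option[DataFrame] in the real class
    diffCount: Long = 0,
    passedOptions: String = "",
    additionalInfo: Map[String, String] = Map.empty)

  // Hypothetical helper: None in resultDF is treated as "no differences",
  // following the field description; otherwise diffCount is reported.
  def summarize(r: ResultStandIn): String = r.resultDF match {
    case None =>
      s"no differences: passed=${r.passedCount}, ref rows=${r.refRowCount}, new rows=${r.newRowCount}"
    case Some(_) =>
      s"${r.diffCount} differences: ref rows=${r.refRowCount}, new rows=${r.newRowCount}"
  }
}
```

The meaning of passedCount is not described on this page, so the sketch only echoes its value rather than deriving anything from it.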
