Object

ai.chronon.spark

JoinUtils

Related Doc: package spark

Permalink

object JoinUtils

Linear Supertypes
AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. JoinUtils
  2. AnyRef
  3. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. def coalescedJoin(leftDf: DataFrame, rightDf: DataFrame, keys: Seq[String], joinType: String = "left"): DataFrame

    Permalink

    * join left and right dataframes, merging any shared columns if exists by the coalesce rule.

    * join left and right dataframes, merging any shared columns if exists by the coalesce rule. fails if there is any data type mismatch between shared columns.

    The order of output joined dataframe is:

    • all keys
    • all columns on left (incl. both shared and non-shared) in the original order of left
    • all columns on right that are NOT shared by left, in the original order of right
  7. val contains_any: UserDefinedFunction

    Permalink
  8. def createLatestLabelView(viewName: String, baseView: String, tableUtils: TableUtils, propertiesOverride: Map[String, String] = null): Unit

    Permalink

    * Method to create a view with latest available label_ds for a given ds.

    * Method to create a view with latest available label_ds for a given ds. This view is built on top of final label view which has all label versions available. This view will inherit the final label view properties as well.

  9. def createOrReplaceView(viewName: String, leftTable: String, rightTable: String, joinKeys: Array[String], tableUtils: TableUtils, viewProperties: Map[String, String] = null, labelColumnPrefix: String = Constants.LabelColumnPrefix): Unit

    Permalink

    * Method to create or replace a view for feature table joining with labels.

    * Method to create or replace a view for feature table joining with labels. Label columns will be prefixed with "label" or custom prefix for easy identification

  10. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  11. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  12. def filterColumns(df: DataFrame, filter: Seq[String]): DataFrame

    Permalink
  13. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  14. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  15. def getLatestLabelMapping(tableName: String, tableUtils: TableUtils): Map[String, Seq[PartitionRange]]

    Permalink

    compute the mapping label_ds -> PartitionRange of ds which has this label_ds as latest version

    compute the mapping label_ds -> PartitionRange of ds which has this label_ds as latest version

    • Get all partitions from table
    • For each ds, find the latest available label_ds
    • Reverse the mapping and get the ds partition range for each label version(label_ds)
    returns

    Mapping of the label ds -> partition ranges of ds which has this label available as latest

  16. def getRangesToFill(leftSource: Source, tableUtils: TableUtils, endPartition: String): PartitionRange

    Permalink

    * Compute partition range to be filled for given join conf

  17. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  18. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  19. def leftDf(joinConf: api.Join, range: PartitionRange, tableUtils: TableUtils, allowEmpty: Boolean = false): Option[DataFrame]

    Permalink

    * Util methods for join computation

  20. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  21. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  22. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. val set_add: UserDefinedFunction

    Permalink
  24. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  25. def tablesToRecompute(joinConf: api.Join, outputTable: String, tableUtils: TableUtils): Seq[String]

    Permalink
  26. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  27. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  28. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  29. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from AnyRef

Inherited from Any

Ungrouped