Class/Object

com.salesforce.op.utils.spark

OpVectorColumnMetadata

Related Docs: object OpVectorColumnMetadata | package spark

Permalink

case class OpVectorColumnMetadata(parentFeatureName: Seq[String], parentFeatureType: Seq[String], grouping: Option[String], indicatorValue: Option[String] = None, descriptorValue: Option[String] = None, index: Int = 0) extends JsonLike with Product with Serializable

Represents the metadata a column in a vector.

Because we expect every vector column to have been produced by some vectorization process, we provide the name of the feature that led to this column.

Also note that each column's indicator value should be unique, meaning that they represent mutually exclusive values. The output of a hashing vectorizer, for instance, does not produce mutually exclusive values.

parentFeatureName

The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)

parentFeatureType

The type of the parent feature(s) for the column

grouping

The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None

indicatorValue

A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted

descriptorValue

A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None

index

Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata

Linear Supertypes
Product, Equals, JsonLike, Serializable, Serializable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. OpVectorColumnMetadata
  2. Product
  3. Equals
  4. JsonLike
  5. Serializable
  6. Serializable
  7. AnyRef
  8. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new OpVectorColumnMetadata(parentFeatureName: Seq[String], parentFeatureType: Seq[String], grouping: Option[String], indicatorValue: Option[String] = None, descriptorValue: Option[String] = None, index: Int = 0)

    Permalink

    parentFeatureName

    The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)

    parentFeatureType

    The type of the parent feature(s) for the column

    grouping

    The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None

    indicatorValue

    A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted

    descriptorValue

    A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None

    index

    Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. val descriptorValue: Option[String]

    Permalink

    A name for a value that is continuous (not a binary indicator) eg for geolocation (lat, lon, accuracy) or for dates that have been converted to a circular representation the time window and x or y coordinate, otherwise None

  7. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  8. def featureGroup(): Option[String]

    Permalink

    Get the feature grouping qualified by the parent feature name

    Get the feature grouping qualified by the parent feature name

    returns

    Optional string of feature grouping

  9. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  11. val grouping: Option[String]

    Permalink

    The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys).

    The name of the group an column belongs to (usually the parent feature, but in the case of Maps, this is the keys). Every other column in the same vector that has this grouping should be mutually exclusive to this one. If there is no grouping then this field is None

  12. def hasParentOfSubType[O <: FeatureType](implicit tt: scala.reflect.api.JavaUniverse.TypeTag[O]): Boolean

    Permalink

    Does column have parent features of which are subtypes of feature type O

  13. def hasParentOfType[O <: FeatureType](implicit tt: scala.reflect.api.JavaUniverse.TypeTag[O]): Boolean

    Permalink

    Does column have parent features of specified feature type O

  14. val index: Int

    Permalink

    Index of the vector this info is associated with (this is updated when OpVectorColumnMetadata is passed into OpVectorMetadata

  15. val indicatorValue: Option[String]

    Permalink

    A name for an binary indicator value (null indicator or result of a pivot or whatever that value is), otherwise None eg this is none when the column is from a numeric group that is not pivoted

  16. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  17. def isNullIndicator: Boolean

    Permalink

    Is this column corresponds to a null-encoded value

    Is this column corresponds to a null-encoded value

    returns

    true if this column corresponds to a null-encoded value

  18. def isOtherIndicator: Boolean

    Permalink

    Is this column corresponds the other category of a one hot encoded categorical

    Is this column corresponds the other category of a one hot encoded categorical

    returns

    true if this column corresponds to the other category of a one hot encoded categorical

  19. def makeColName(): String

    Permalink

    Make unique name for this column

    Make unique name for this column

    returns

    String name for this column

  20. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  21. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  22. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  23. val parentFeatureName: Seq[String]

    Permalink

    The name of the parent feature(s) for the column.

    The name of the parent feature(s) for the column. Usually a column has one parent feature, but can have many (eg. in the case of multiple Text columns being vectorized using a shared hash space)

  24. val parentFeatureType: Seq[String]

    Permalink

    The type of the parent feature(s) for the column

  25. def parentNamesWithMapKeys(): Seq[String]

    Permalink

    Return parent features names with the key (grouping) from any map parents included in name

    Return parent features names with the key (grouping) from any map parents included in name

    returns

    Sequence of parent feature names, simple names when features are not maps, names plus keys for columns with map parent features

  26. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  27. def toJson(pretty: Boolean): String

    Permalink
    Definition Classes
    JsonLike
  28. def toMetadata(): Metadata

    Permalink

    Convert this column into Spark metadata.

    Convert this column into Spark metadata.

    returns

    column Spark metadata

  29. def toMetadata(ind: Array[Int]): Metadata

    Permalink

    Convert this column into Spark metadata.

    Convert this column into Spark metadata.

    ind

    Indexes of vector columns that match this OpVectorColumnMetadata

    returns

    column Spark metadata

  30. def toString(): String

    Permalink
    Definition Classes
    JsonLike → AnyRef → Any
  31. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  32. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  33. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Product

Inherited from Equals

Inherited from JsonLike

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped