Package

org.spark

anonymizer

Permalink

package anonymizer

Visibility
  1. Public
  2. All

Type Members

  1. class FirstNameConverter extends NameConverter with Serializable

    Permalink
  2. class FullNameConverter extends NameConverter with Serializable

    Permalink
  3. class LastNameConverter extends NameConverter with Serializable

    Permalink
  4. abstract class NameConverter extends Serializable

    Permalink
  5. trait NameDatabase extends AnyRef

    Permalink
  6. class StringNameDatabase extends NameDatabase with Serializable

    Permalink
  7. class WebNameDatabase extends NameDatabase with Serializable

    Permalink

Value Members

  1. object Anonymizer extends Serializable

    Permalink

    Anonymizes selected columns in a dataframe while preserving format.

    Anonymizes selected columns in a dataframe while preserving format.

    To anonymize selected columns in a dataframe:

    import org.spark.Anonymizer.Extensions

    val df = input_df.anonymize((p => Array("col1", "col2").contains(p)))

    To anonymize all columns in a dataframe: val df = input_df.anonymize()

    To anonymize all columns in a dataframe except one: val df = input_df.anonymize((p => p != "id"))

    To anonymize a single column:

    import org.spark.Anonymizer.Extensions

    df.withColumn("anonymized_col1", Anonymizer.AnonymizeStringUdf($"col1"))

  2. object DataFrame

    Permalink
  3. object DefaultNameDatabase extends NameDatabase with Serializable

    Permalink

Ungrouped