Object/Class

com.microsoft.azure.cosmosdb.spark

CosmosDBSpark

Related Docs: class CosmosDBSpark | package spark

Permalink

object CosmosDBSpark extends CosmosDBLoggingTrait with Serializable

The CosmosDBSpark allow fast creation of RDDs, DataFrames or Datasets from CosmosDBSpark.

Since

1.0

Linear Supertypes
Serializable, Serializable, CosmosDBLoggingTrait, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. CosmosDBSpark
  2. Serializable
  3. Serializable
  4. CosmosDBLoggingTrait
  5. AnyRef
  6. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Type Members

  1. class Builder extends AnyRef

    Permalink

    Builder for configuring and creating a CosmosDBSpark

    Builder for configuring and creating a CosmosDBSpark

    It requires a SparkSession or the SparkContext

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def builder(): Builder

    Permalink

    Create a builder for configuring the CosmosDBSpark

    Create a builder for configuring the CosmosDBSpark

    returns

    a CosmosDBSession Builder

  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. val defaultSource: String

    Permalink

    The default source string for creating DataFrames from CosmosDB

  8. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  9. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  10. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  12. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  13. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  14. var lastUpsertSetting: Option[Boolean]

    Permalink

    For verfication purpose

  15. var lastWritingBatchSize: Option[Int]

    Permalink
  16. def load(jsc: JavaSparkContext, readConfig: Config): JavaCosmosDBRDD

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    jsc

    the Spark context containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  17. def load(jsc: JavaSparkContext): JavaCosmosDBRDD

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    jsc

    the Spark context containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  18. def load[D](sparkSession: SparkSession, readConfig: Config, clazz: Class[D]): Dataset[D]

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    D

    The bean class defining the schema for the data

    sparkSession

    the SparkSession containing the CosmosDB connection configuration

    clazz

    the class of the data contained in the RDD

    returns

    a CosmosDBRDD

  19. def load[D <: Product](sparkSession: SparkSession, readConfig: Config)(implicit arg0: scala.reflect.api.JavaUniverse.TypeTag[D]): DataFrame

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    D

    The optional class defining the schema for the data

    sparkSession

    the SparkSession containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  20. def load[D <: Product](sparkSession: SparkSession)(implicit arg0: scala.reflect.api.JavaUniverse.TypeTag[D]): DataFrame

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    D

    The optional class defining the schema for the data

    sparkSession

    the SparkSession containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  21. def load(sc: SparkContext, readConfig: Config): CosmosDBRDD

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    sc

    the Spark context containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  22. def load(sc: SparkContext): CosmosDBRDD

    Permalink

    Load data from CosmosDB

    Load data from CosmosDB

    sc

    the Spark context containing the CosmosDB connection configuration

    returns

    a CosmosDBRDD

  23. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  24. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  25. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  26. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  27. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  28. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  29. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  30. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  31. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  32. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  33. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  34. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    CosmosDBLoggingTrait
  35. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  36. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  37. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  38. val random: Random

    Permalink
  39. def read(sparkSession: SparkSession): DataFrameReader

    Permalink

    Creates a DataFrameReader with CosmosDB as the source

    Creates a DataFrameReader with CosmosDB as the source

    sparkSession

    the SparkSession

    returns

    the DataFrameReader

  40. def save[D](javaRDD: JavaRDD[D], writeConfig: Config, clazz: Class[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the writeConfig for the database information Requires a codec for the data type

    D

    the type of the data in the RDD

    javaRDD

    the RDD data to save to CosmosDB

    writeConfig

    the com.microsoft.azure.cosmosdb.spark.config.Config

    clazz

    the class of the data contained in the RDD

    returns

    the javaRDD

  41. def save(javaRDD: JavaRDD[documentdb.Document], writeConfig: Config): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database information

    javaRDD

    the RDD data to save to CosmosDB

    writeConfig

    the com.microsoft.azure.cosmosdb.spark.config.Config

    returns

    the javaRDD

  42. def save[D](javaRDD: JavaRDD[D], clazz: Class[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database and collection information Requires a codec for the data type

    D

    the type of the data in the RDD

    javaRDD

    the RDD data to save to CosmosDB

    clazz

    the class of the data contained in the RDD

    returns

    the javaRDD

  43. def save(javaRDD: JavaRDD[documentdb.Document]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database and collection information

    javaRDD

    the RDD data to save to CosmosDB

    returns

    the javaRDD

  44. def save(dataFrameWriter: DataFrameWriter[_], writeConfig: Config): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    dataFrameWriter

    the DataFrameWriter save to CosmosDB

    writeConfig

    the writeConfig

  45. def save(dataFrameWriter: DataFrameWriter[_]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database and collection information

    dataFrameWriter

    the DataFrameWriter save to CosmosDB

  46. def save[D](dataset: Dataset[D], writeConfig: Config)(implicit arg0: ClassTag[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Note: If the dataFrame contains an _id field the data will upserted and replace any existing documents in the collection.

    dataset

    the dataset to save to CosmosDB

    writeConfig

    the writeConfig

    Since

    1.1.0

  47. def save[D](dataset: Dataset[D])(implicit arg0: ClassTag[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database and collection information

    Note: If the dataFrame contains an _id field the data will upserted and replace any existing documents in the collection.

    dataset

    the dataset to save to CosmosDB

    Since

    1.1.0

  48. def save[D](rdd: RDD[D], writeConfig: Config)(implicit arg0: ClassTag[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    D

    the type of the data in the RDD

    rdd

    the RDD data to save to CosmosDB

    writeConfig

    the writeConfig

  49. def save[D](rdd: RDD[D])(implicit arg0: ClassTag[D]): Unit

    Permalink

    Save data to CosmosDB

    Save data to CosmosDB

    Uses the SparkConf for the database and collection information Requires a codec for the data type

    D

    the type of the data in the RDD

    rdd

    the RDD data to save to CosmosDB

  50. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  51. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  52. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  53. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  54. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  55. def write(dataFrame: DataFrame): DataFrameWriter[Row]

    Permalink

    Creates a DataFrameWriter with the CosmosDB underlying output data source.

    Creates a DataFrameWriter with the CosmosDB underlying output data source.

    dataFrame

    the DataFrame to convert into a DataFrameWriter

    returns

    the DataFrameWriter

Inherited from Serializable

Inherited from Serializable

Inherited from CosmosDBLoggingTrait

Inherited from AnyRef

Inherited from Any

Ungrouped