Class

org.apache.spark.sql.execution.datasources

PartitioningAwareFileCatalog

Related Doc: package datasources

Permalink

abstract class PartitioningAwareFileCatalog extends FileCatalog with Logging

An abstract class that represents FileCatalogs that are aware of partitioned tables. It provides the necessary methods to parse partition data based on a set of files.

Linear Supertypes
Logging, FileCatalog, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. PartitioningAwareFileCatalog
  2. Logging
  3. FileCatalog
  4. AnyRef
  5. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new PartitioningAwareFileCatalog(sparkSession: SparkSession, parameters: Map[String, String], partitionSchema: Option[StructType])

    Permalink

    parameters

    as set of options to control partition discovery

    partitionSchema

    an optional partition schema that will be use to provide types for the discovered partitions

Abstract Value Members

  1. abstract def leafDirToChildrenFiles: Map[Path, Array[FileStatus]]

    Permalink
    Attributes
    protected
  2. abstract def leafFiles: LinkedHashMap[Path, FileStatus]

    Permalink
    Attributes
    protected
  3. abstract def partitionSpec(): PartitionSpec

    Permalink

    Returns the specification of the partitions inferred from the data.

    Returns the specification of the partitions inferred from the data.

    Definition Classes
    FileCatalog
  4. abstract def paths: Seq[Path]

    Permalink

    Returns the list of input paths from which the catalog will get files.

    Returns the list of input paths from which the catalog will get files.

    Definition Classes
    FileCatalog
  5. abstract def refresh(): Unit

    Permalink

    Refresh the file listing

    Refresh the file listing

    Definition Classes
    FileCatalog

Concrete Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def allFiles(): Seq[FileStatus]

    Permalink

    Returns all the valid files.

    Returns all the valid files.

    Definition Classes
    PartitioningAwareFileCatalogFileCatalog
  5. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  8. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  9. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  10. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  11. val hadoopConf: Configuration

    Permalink
    Attributes
    protected
  12. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  13. def inferPartitioning(): PartitionSpec

    Permalink
    Attributes
    protected
  14. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  15. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  16. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  17. def listFiles(filters: Seq[Expression]): Seq[Partition]

    Permalink

    Returns all valid files grouped into partitions when the data is partitioned.

    Returns all valid files grouped into partitions when the data is partitioned. If the data is unpartitioned, this will return a single partition with no partition values.

    filters

    The filters used to prune which partitions are returned. These filters must only refer to partition columns and this method will only return files where these predicates are guaranteed to evaluate to true. Thus, these filters will not need to be evaluated again on the returned data.

    Definition Classes
    PartitioningAwareFileCatalogFileCatalog
  18. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  19. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  20. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  21. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  22. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  23. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  24. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  25. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  26. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  27. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  28. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  29. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  30. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  31. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  32. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  33. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  34. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  35. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  36. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  37. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from Logging

Inherited from FileCatalog

Inherited from AnyRef

Inherited from Any

Ungrouped