A FileIndex for a metastore catalog table.
Create a table and optionally insert some data into it.
Create or replace a local/global temporary view with the given data source.
The main class responsible for representing a pluggable Data Source in Spark SQL.
Replaces generic operations with specific variants that are designed to work with Spark SQL Data Sources.
A Strategy for planning scans over data sources defined using the sources API.
Used to read and write data stored in files to/from the InternalRow format.
An interface for objects capable of enumerating the root paths of a relation as well as the partitions of a relation subject to some pruning expressions.
A collection of file blocks that should be read as a single task (possibly from multiple partitioned directories).
An RDD that scans a list of file partitions.
A cache of the leaf files of partition directories.
Replaces a CatalogRelation with a data source table if its table provider is not Hive.
An adaptor from a PartitionedFile to an Iterator of Text, which are all of the lines in that file.
Acts as a container for all of the metadata required to read from a datasource.
A FileIndex that generates the list of files to process by recursively listing all the files present in paths.
Inserts the results of query into a relation that extends InsertableRelation.
A command for writing data to a HadoopFsRelation.
Used to link a BaseRelation into a logical query plan.
OutputWriter is used together with HadoopFsRelation for persisting rows to the underlying file system.
A factory that produces OutputWriters.
A collection of data files from a partitioned relation, along with the partition values in the form of an InternalRow.
Holds a directory in a partitioned collection of files as well as the partition values in the form of a Row.
A part (i.
An abstract class that represents FileIndexes that are aware of partitioned tables.
Preprocess CreateTable, to do some normalization and checking.
Preprocess the InsertIntoTable plan.
An adaptor from a Hadoop RecordReader to an Iterator over the values returned.
Tries to replace UnresolvedRelations if the plan is a direct query on files.
A variant of HadoopMapReduceCommitProtocol that allows specifying the actual Hadoop output committer using an option specified in SQLConf.
Saves the results of query into a data source.
The base class for file formats that are based on text files.
A helper object for writing FileFormat data out to a location.
A strategy for planning scans over collections of files that might be partitioned or bucketed by user specified columns.
Use FileStatusCache.getOrCreate() to construct a globally shared file status cache.
A rule to check whether the functions are supported only when Hive support is enabled.
A non-caching implementation used when partition file status caching is disabled.
A rule to do various checks before inserting into or writing to a data source table.