Class ParquetInputSplit

  • All Implemented Interfaces:
    org.apache.hadoop.io.Writable

    @Private
    @Deprecated
    public class ParquetInputSplit
    extends org.apache.hadoop.mapreduce.lib.input.FileSplit
    implements org.apache.hadoop.io.Writable
    Deprecated.
    will be removed in 2.0.0. use org.apache.hadoop.mapred.FileSplit instead.
    An input split for the Parquet format It contains the information to read one block of the file. This class is private to the ParquetInputFormat. Backward compatibility is not maintained.
    • Constructor Detail

      • ParquetInputSplit

        public ParquetInputSplit()
        Deprecated.
        Writables must have a parameterless constructor
      • ParquetInputSplit

        @Deprecated
        public ParquetInputSplit​(org.apache.hadoop.fs.Path path,
                                 long start,
                                 long length,
                                 String[] hosts,
                                 List<BlockMetaData> blocks,
                                 String requestedSchema,
                                 String fileSchema,
                                 Map<String,​String> extraMetadata,
                                 Map<String,​String> readSupportMetadata)
        Deprecated.
        Parameters:
        path - a Path
        start - split start location
        length - split length
        hosts - locality information for this split
        blocks - Parquet blocks in this split
        requestedSchema - the requested schema
        fileSchema - the file schema
        extraMetadata - string map of file metadata
        readSupportMetadata - string map of metadata from read support
      • ParquetInputSplit

        public ParquetInputSplit​(org.apache.hadoop.fs.Path file,
                                 long start,
                                 long end,
                                 long length,
                                 String[] hosts,
                                 long[] rowGroupOffsets)
        Deprecated.
        Parameters:
        file - the path of the file for that split
        start - the start offset in the file
        end - the end offset in the file
        length - the actual size in bytes that we expect to read
        hosts - the hosts with the replicas of this data
        rowGroupOffsets - the offsets of the rowgroups selected if loaded on the client
    • Method Detail

      • getBlocks

        @Deprecated
        public List<BlockMetaData> getBlocks()
        Deprecated.
        the file footer is no longer read before creating input splits
        Returns:
        the block meta data
      • getFileSchema

        @Deprecated
        public String getFileSchema()
        Deprecated.
        the file footer is no longer read before creating input splits
        Returns:
        the file schema
      • getEnd

        public long getEnd()
        Deprecated.
        Returns:
        the end offset of that split
      • getExtraMetadata

        @Deprecated
        public Map<String,​String> getExtraMetadata()
        Deprecated.
        will be removed in 2.0.0. the file footer is no longer read before creating input splits
        Returns:
        app specific metadata from the file
      • getRowGroupOffsets

        public long[] getRowGroupOffsets()
        Deprecated.
        will be removed in 2.0.0.
        Returns:
        the offsets of the row group selected if this has been determined on the client side
      • toString

        public String toString()
        Deprecated.
        Overrides:
        toString in class org.apache.hadoop.mapreduce.lib.input.FileSplit
      • readFields

        public void readFields​(DataInput hin)
                        throws IOException
        Deprecated.
        Specified by:
        readFields in interface org.apache.hadoop.io.Writable
        Overrides:
        readFields in class org.apache.hadoop.mapreduce.lib.input.FileSplit
        Throws:
        IOException
      • write

        public void write​(DataOutput hout)
                   throws IOException
        Deprecated.
        Specified by:
        write in interface org.apache.hadoop.io.Writable
        Overrides:
        write in class org.apache.hadoop.mapreduce.lib.input.FileSplit
        Throws:
        IOException