The ADAMContext provides functions on top of a SparkContext for loading genomic data.
Argument configuration for saving any output format.
An abstract class that extends GenomicRDD and where the underlying data are Avro IndexedRecords.
An abstract class describing a GenomicRDD where:
Partition a genome into a set of bins.
A trait describing a GenomicRDD that also supports the Spark SQL APIs.
GenomicPositionPartitioner partitions ReferencePosition objects into separate, spatially-coherent regions of the genome.
A trait that wraps an RDD of genomic data with helpful metadata.
A partitioner for ReferenceRegion-keyed data.
Formats data going into a pipe to an invoked process.
A trait for singleton objects that build an InFormatter from a GenomicRDD.
Implements an inner region join where the left side of the join is broadcast.
Performs an inner region join, followed logically by grouping by the right value.
An abstract class that extends the MultisampleGenomicRDD trait, where the data are Avro IndexedRecords.
A trait describing a GenomicRDD with data from multiple samples.
Deserializes data coming out of a pipe from an invoked process.
Repartitions objects that are keyed by a ReferencePosition or ReferenceRegion into a single partition per contig.
A trait describing a join in the genomic coordinate space between two RDDs where the values are keyed by a ReferenceRegion.
Implements a right outer region join where the left side of the join is broadcast.
Performs a right outer region join, followed logically by grouping by the right value.
A trait describing join implementations that are based on a sort-merge join.
Implements a shuffle free (broadcast) region join.
This singleton provides an implicit conversion from a SparkContext to the ADAMContext, as well as implicit functions for the Pipe API.
Helper for creating genomic position partitioners.
Helper object for creating GenomicRegionPartitioners.