Trait

org.bdgenomics.adam.rdd

VictimlessSortedIntervalPartitionJoin

Related Doc: package rdd

Permalink

sealed trait VictimlessSortedIntervalPartitionJoin[T, U, RT, RU] extends ShuffleRegionJoin[T, U, RT, RU]

Linear Supertypes
ShuffleRegionJoin[T, U, RT, RU], RegionJoin[T, U, RT, RU], Serializable, Serializable, AnyRef, Any
Known Subclasses
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. VictimlessSortedIntervalPartitionJoin
  2. ShuffleRegionJoin
  3. RegionJoin
  4. Serializable
  5. Serializable
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def emptyFn(left: Iterator[(ReferenceRegion, T)], right: Iterator[(ReferenceRegion, U)]): Iterator[(RT, RU)]

    Permalink
    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin
  2. abstract val leftRdd: RDD[(ReferenceRegion, T)]

    Permalink
    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin
  3. abstract def postProcessHits(iter: Iterable[U], currentLeft: T): Iterable[(RT, RU)]

    Permalink
    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin
  4. abstract val rightRdd: RDD[(ReferenceRegion, U)]

    Permalink
    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin

Concrete Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def advanceCache(cache: SetTheoryCache[U, RT, RU], right: BufferedIterator[(ReferenceRegion, U)], until: ReferenceRegion): Unit

    Permalink

    Adds elements from right to cache based on the next region encountered.

    Adds elements from right to cache based on the next region encountered.

    cache

    The cache for this partition.

    right

    The right iterator.

    until

    The next region to join with.

    Attributes
    protected
    Definition Classes
    VictimlessSortedIntervalPartitionJoinShuffleRegionJoin
  5. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  6. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  7. def compute(): RDD[(RT, RU)]

    Permalink

    Performs a region join between two RDDs (shuffle join).

    Performs a region join between two RDDs (shuffle join). All data should be pre-shuffled and copartitioned.

    returns

    An RDD of joins (x, y), where x is from leftRDD, y is from rightRDD, and the region corresponding to x overlaps the region corresponding to y.

    Definition Classes
    ShuffleRegionJoin
  8. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  9. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  10. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  11. def finalizeHits(cache: SetTheoryCache[U, RT, RU], right: BufferedIterator[(ReferenceRegion, U)]): Iterable[(RT, RU)]

    Permalink

    Computes all victims for the partition.

    Computes all victims for the partition. NOTE: These are victimless joins so we have no victims.

    cache

    The cache for this partition.

    right

    The right iterator.

    returns

    An empty iterator.

    Attributes
    protected
    Definition Classes
    VictimlessSortedIntervalPartitionJoinShuffleRegionJoin
  12. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  15. def makeIterator(leftIter: Iterator[(ReferenceRegion, T)], rightIter: Iterator[(ReferenceRegion, U)]): Iterator[(RT, RU)]

    Permalink
    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin
  16. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  17. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  18. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  19. def partitionAndJoin(left: RDD[(ReferenceRegion, T)], right: RDD[(ReferenceRegion, U)]): RDD[(RT, RU)]

    Permalink

    Performs a region join between two RDDs.

    Performs a region join between two RDDs.

    returns

    An RDD of pairs (x, y), where x is from baseRDD, y is from joinedRDD, and the region corresponding to x overlaps the region corresponding to y.

    Definition Classes
    ShuffleRegionJoinRegionJoin
  20. def processHits(cache: SetTheoryCache[U, RT, RU], currentLeft: T, currentLeftRegion: ReferenceRegion): Iterable[(RT, RU)]

    Permalink

    Process hits for a given object in left.

    Process hits for a given object in left.

    cache

    The cache containing potential hits.

    currentLeft

    The current object from the left

    currentLeftRegion

    The ReferenceRegion of currentLeft.

    returns

    An iterator containing all hits, formatted by postProcessHits.

    Attributes
    protected
    Definition Classes
    ShuffleRegionJoin
  21. def pruneCache(cache: SetTheoryCache[U, RT, RU], to: ReferenceRegion): Unit

    Permalink

    Removes elements from cache in place that do not meet the condition for the next region.

    Removes elements from cache in place that do not meet the condition for the next region.

    cache

    The cache for this partition.

    to

    The next region in the left iterator.

    Attributes
    protected
    Definition Classes
    VictimlessSortedIntervalPartitionJoinShuffleRegionJoin
    Note

    At one point these were all variables and we built new collections and reassigned the pointers every time. We fixed this by using trimStart() and ++=() to improve performance. Overall, we see roughly 25% improvement in runtime by doing things this way.

  22. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  23. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  24. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  25. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  26. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from ShuffleRegionJoin[T, U, RT, RU]

Inherited from RegionJoin[T, U, RT, RU]

Inherited from Serializable

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped