Contains multiple implementations of a 'region join', an operation that joins two sets of regions based on the spatial overlap between the regions.
Extends the ShuffleRegionJoin trait to implement a full outer join.
Partition a genome into a set of bins.
GenomicPositionPartitioner partitions ReferencePosition objects into separate, spatially-coherent regions of the genome.
Formats data going into a pipe to an invoked process.
Extends the BroadcastRegionJoin trait to implement an inner join.
Extends the ShuffleRegionJoin trait to implement an inner join.
Extends the ShuffleRegionJoin trait to implement an inner join followed by grouping by the left value.
Extends the ShuffleRegionJoin trait to implement a left outer join.
Deserializes data coming out of a pipe from an invoked process.
Repartitions objects that are keyed by a ReferencePosition or ReferenceRegion into a single partition per contig.
Extends the BroadcastRegionJoin trait to implement a right outer join.
Extends the ShuffleRegionJoin trait to implement a right outer join.
Extends the ShuffleRegionJoin trait to implement a right outer join followed by grouping by all non-null left values.
Helper object to merge sharded files together.