Perform a full outer join with a SideInput[Map[K, Iterable[W]]].
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashFullOuterJoin(si)
val joined2 = pairSColl2Left.hashFullOuterJoin(si)
Perform a full outer join by replicating rhs to all workers. The right side should be tiny and fit in memory.
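The semantics of a replicated full outer join can be modeled on plain Scala collections. The sketch below is not Scio code: HashFullOuterJoinSketch and its method are hypothetical names, Seq stands in for SCollection, and the output shape (K, (Option[V], Option[W])) is an assumption. It is meant only to illustrate why the right side has to fit in memory: it becomes an in-memory map that every left element is looked up against.

```scala
// Local model only: Seq stands in for SCollection; this is not the Scio implementation.
object HashFullOuterJoinSketch {
  def hashFullOuterJoin[K, V, W](
    left: Seq[(K, V)],
    right: Seq[(K, W)]
  ): Seq[(K, (Option[V], Option[W]))] = {
    // The "replicated" right side: an in-memory multimap from key to all its values.
    val rightMap: Map[K, Seq[W]] =
      right.groupBy(_._1).map { case (k, kws) => k -> kws.map(_._2) }

    // Every left pair is emitted, matched against each right value or padded with None.
    val fromLeft: Seq[(K, (Option[V], Option[W]))] = left.flatMap { case (k, v) =>
      rightMap.get(k) match {
        case Some(ws) => ws.map(w => (k, (Option(v), Option(w))))
        case None     => Seq((k, (Option(v), Option.empty[W])))
      }
    }

    // Right pairs whose key never appears on the left are emitted with None on the left.
    val leftKeys: Set[K] = left.map(_._1).toSet
    val rightOnly: Seq[(K, (Option[V], Option[W]))] = right.collect {
      case (k, w) if !leftKeys.contains(k) => (k, (Option.empty[V], Option(w)))
    }

    fromLeft ++ rightOnly
  }
}
```

In this model, a key present on only one side still produces output, with None filling the missing side.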
Return an SCollection with the pairs from this whose keys are in the SideSet rhs.
Unlike SCollection.intersection, this preserves duplicates in this.
Return an SCollection with the pairs from this whose keys are in rhs, given rhs is small enough to fit in memory.
Unlike SCollection.intersection, this preserves duplicates in this.
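The duplicate-preserving behavior can be illustrated with a small local model. This is a sketch under assumptions rather than Scio code: HashIntersectByKeySketch is a hypothetical name, Seq stands in for SCollection, and rhs is assumed to be a collection of keys that is collapsed into an in-memory Set.

```scala
// Local model only: Seq stands in for SCollection; this is not the Scio implementation.
object HashIntersectByKeySketch {
  def hashIntersectByKey[K, V](left: Seq[(K, V)], rhsKeys: Seq[K]): Seq[(K, V)] = {
    // The small side is reduced to a set of keys held in memory.
    val keySet: Set[K] = rhsKeys.toSet
    // A plain filter keeps every matching left pair, including repeated ones.
    left.filter { case (k, _) => keySet.contains(k) }
  }
}
```

Because the left side is only filtered, a pair that occurs twice on the left occurs twice in the result, regardless of how often its key occurs in rhs.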
Perform an inner join with a MultiMap SideInput[Map[K, Iterable[V]]].
The right side is tiny and fits in memory. The SideInput can be reused for multiple joins.
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashJoin(si)
val joined2 = pairSColl2Left.hashJoin(si)
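The benefit of reusing one side input across joins can be sketched with plain collections. Everything here is a local stand-in and an assumption: HashJoinSideInputSketch, toMultiMap and this hashJoin are hypothetical names, a Map[K, Iterable[W]] plays the role of the side input, and Seq plays the role of SCollection.

```scala
// Local model only: a Map stands in for the side input; this is not the Scio implementation.
object HashJoinSideInputSketch {
  // Stand-in for building the multimap side input: group the small right side once.
  def toMultiMap[K, W](right: Seq[(K, W)]): Map[K, Iterable[W]] =
    right.groupBy(_._1).map { case (k, kws) => k -> kws.map(_._2) }

  // Inner join against the prebuilt multimap; left pairs without a match are dropped.
  def hashJoin[K, V, W](left: Seq[(K, V)], side: Map[K, Iterable[W]]): Seq[(K, (V, W))] =
    left.flatMap { case (k, v) =>
      side.getOrElse(k, Iterable.empty[W]).map(w => (k, (v, w)))
    }
}
```

In this model the grouping work happens once in toMultiMap, and each additional join against the same map is only a lookup per left element.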
Perform an inner join by replicating rhs to all workers. The right side should be tiny and fit in memory.
Perform a left outer join with a MultiMap SideInput[Map[K, Iterable[V]]].
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashLeftOuterJoin(si)
val joined2 = pairSColl2Left.hashLeftOuterJoin(si)
Perform a left outer join by replicating rhs to all workers. The right side should be tiny and fit in memory.
The tiny SCollection[(K, W)] treated as the right side of the join.
val si = pairSCollRight // Should be tiny
val joined = pairSColl1Left.hashLeftOuterJoin(pairSCollRight)
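What a left outer join against a small right side produces can be modeled locally. The following is a sketch, not Scio code: HashLeftOuterJoinSketch is a hypothetical name, Seq stands in for SCollection, and the output shape (K, (V, Option[W])) is an assumption.

```scala
// Local model only: Seq stands in for SCollection; this is not the Scio implementation.
object HashLeftOuterJoinSketch {
  def hashLeftOuterJoin[K, V, W](
    left: Seq[(K, V)],
    right: Seq[(K, W)]
  ): Seq[(K, (V, Option[W]))] = {
    // The tiny right side is grouped into an in-memory multimap.
    val rightMap: Map[K, Seq[W]] =
      right.groupBy(_._1).map { case (k, kws) => k -> kws.map(_._2) }

    // Every left pair survives: once per matching right value, or once with None.
    left.flatMap { case (k, v) =>
      rightMap.get(k) match {
        case Some(ws) => ws.map(w => (k, (v, Option(w))))
        case None     => Seq((k, (v, Option.empty[W])))
      }
    }
  }
}
```

Right-side keys that never occur on the left produce no output in this model, which is the difference from the full outer join.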
Return an SCollection with the pairs from this whose keys are not in SCollection[K] rhs.
rhs must be small enough to fit into memory.
Return an SCollection with the pairs from this whose keys are not in SideInput[Set] rhs.
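Subtracting by key is the complement of intersecting by key, which a small local model makes concrete. This is a sketch under assumptions, not Scio code: HashSubtractByKeySketch is a hypothetical name, Seq stands in for SCollection, and an in-memory Set of keys stands in for the set-valued side input.

```scala
// Local model only: a Set stands in for the side input; this is not the Scio implementation.
object HashSubtractByKeySketch {
  def hashSubtractByKey[K, V](left: Seq[(K, V)], rhsKeys: Set[K]): Seq[(K, V)] =
    // Keep only the left pairs whose key is absent from the in-memory key set.
    left.filterNot { case (k, _) => rhsKeys.contains(k) }
}
```

As with the intersection model, the left side is only filtered, so surviving duplicates are kept.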
Perform a full outer join with a SideMap.
SideMaps are deprecated in favor of SideInput[Map[K, Iterable[W]]].
Example replacement:
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashFullOuterJoin(si)
val joined2 = pairSColl2Left.hashFullOuterJoin(si)
(Since version 0.8.0) Use SCollection[(K, V)]#hashFullOuterJoin(rhs) or SCollection[(K, V)]#hashFullOuterJoin(rhs.asMultiMapSingletonSideInput) instead.
Return an SCollection with the pairs from this whose keys are in the SideSet rhs.
Unlike SCollection.intersection, this preserves duplicates in this.
(Since version 0.8.0) Use SCollection[(K, V)]#hashIntersectByKey(rhs.asSetSingletonSideInput) instead.
Perform an inner join with a SideMap.
SideMaps are deprecated in favor of SideInput[Map[K, Iterable[W]]].
Example replacement:
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashJoin(si)
val joined2 = pairSColl2Left.hashJoin(si)
(Since version 0.8.0) Use SCollection[(K, V)]#hashJoin(rhs) or SCollection[(K, V)]#hashJoin(rhs.asMultiMapSingletonSideInput) instead.
Perform a left outer join with a SideMap.
SideMaps are deprecated in favor of SideInput[Map[K, Iterable[W]]].
Example replacement:
val si = pairSCollRight.asMultiMapSingletonSideInput
val joined1 = pairSColl1Left.hashLeftOuterJoin(si)
val joined2 = pairSColl2Left.hashLeftOuterJoin(si)
(Since version 0.8.0) Use SCollection[(K, V)]#hashLeftOuterJoin(pairSColl) or SCollection[(K, V)]#hashLeftOuterJoin(pairSColl.asMultiMapSingletonSideInput) instead.
Perform a left outer join by replicating rhs to all workers. The right side should be tiny and fit in memory.
(Since version 0.8.0) Use SCollection[(K, V)]#hashLeftOuterJoin(pairSColl) instead.
val si = pairSCollRight // Should be tiny
val joined = pairSColl1Left.hashLeftOuterJoin(pairSCollRight)
(Since version 0.8.0) Use SCollection[(K, V)]#asMultiMapSingletonSideInput instead.
Extra functions available on SCollections of (key, value) pairs for hash-based joins, provided through an implicit conversion.