public static class HL7v2IO.ListHL7v2Messages extends org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PBegin,org.apache.beam.sdk.values.PCollection<HL7v2Message>>
This transform is optimized for splitting of message.list calls for large batches of historical data and assumes rather continuous stream of sendTimes.
Note on Benchmarking: The default initial splitting on day will make more queries than necessary when used with very small data sets (or very sparse data sets in the sendTime dimension). If you are looking to get an accurate benchmark be sure to use sufficient volume of data with messages that span sendTimes over a realistic time range (days)
Implementation includes overhead for:
withInitialSplitDuration(Duration)
Modifier and Type | Method and Description |
---|---|
org.apache.beam.sdk.values.PCollection<HL7v2Message> |
expand(org.apache.beam.sdk.values.PBegin input) |
HL7v2IO.ListHL7v2Messages |
withInitialSplitDuration(org.joda.time.Duration initialSplitDuration) |
public HL7v2IO.ListHL7v2Messages withInitialSplitDuration(org.joda.time.Duration initialSplitDuration)
public org.apache.beam.sdk.values.PCollection<HL7v2Message> expand(org.apache.beam.sdk.values.PBegin input)
expand
in class org.apache.beam.sdk.transforms.PTransform<org.apache.beam.sdk.values.PBegin,org.apache.beam.sdk.values.PCollection<HL7v2Message>>