Interface | Description |
---|---|
CollectMultipleMetricsSpark.SparkCollectorProvider | |
MetricsCollectorSpark<T extends MetricsArgumentCollection> |
Each metrics collector has to be able to run from 4 different contexts:
- a standalone walker tool
- the org.broadinstitute.hellbender.metrics.analysis.CollectMultipleMetrics walker tool
- a standalone Spark tool
- the CollectMultipleMetricsSpark tool
In order to allow a single collector implementation to be shared across all of these
contexts (standalone and CollectMultiple, Spark and non-Spark), collectors should be
factored into the following classes, where X in the class names represents the specific
type of metrics being collected:
- XMetrics extends MetricBase: defines the aggregate metrics being collected
- XMetricsArgumentCollection extends MetricsArgumentCollection: defines the parameters for XMetrics
- XMetricsCollector: processes a single read and provides a reduce/combine operation. For multi-level collectors, XMetricsCollector is composed of several classes:
  - XMetricsCollector extends MultiLevelReducibleCollector<XMetrics, HISTOGRAM_KEY, XMetricsCollectorArgs, XMetricsPerUnitCollector>
  - XMetricsPerUnitCollector: the per-level collector; implements PerUnitMetricCollector (supplied as a type argument to MultiLevelReducibleCollector)
- XMetricsCollectorSpark: adapter/bridge between an RDD and the (read-based) XMetricsCollector; implements MetricsCollectorSpark
- CollectXMetrics extends org.broadinstitute.hellbender.metrics.analysis.SinglePassSamProgram
- CollectXMetricsSpark extends MetricsCollectorSparkTool |
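The factoring described above can be sketched in plain Java. This is a minimal illustration under stated assumptions, not GATK code: ReadLengthMetrics, ReadLengthArgs, and ReadLengthCollector are hypothetical stand-ins for XMetrics, XMetricsArgumentCollection, and XMetricsCollector; reads are plain base strings; and nested lists stand in for RDD partitions. It shows why a collector that handles one read at a time and exposes a combine step can serve both the walker and the Spark contexts.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical sketch of the collector factoring; none of these are GATK classes. */
public class CollectorFactoringSketch {

    /** Plays the role of XMetrics: the aggregate values being collected. */
    static final class ReadLengthMetrics {
        long readCount;
        long totalBases;

        double meanReadLength() {
            return readCount == 0 ? 0.0 : (double) totalBases / readCount;
        }
    }

    /** Plays the role of XMetricsArgumentCollection: parameters for the collector. */
    static final class ReadLengthArgs {
        final int minLength;

        ReadLengthArgs(int minLength) {
            this.minLength = minLength;
        }
    }

    /** Plays the role of XMetricsCollector: handles one read at a time and can be combined. */
    static final class ReadLengthCollector {
        private final ReadLengthArgs args;
        private final ReadLengthMetrics metrics = new ReadLengthMetrics();

        ReadLengthCollector(ReadLengthArgs args) {
            this.args = args;
        }

        /** Processes a single read, skipping reads shorter than the configured minimum. */
        void acceptRead(String bases) {
            if (bases.length() >= args.minLength) {
                metrics.readCount++;
                metrics.totalBases += bases.length();
            }
        }

        /** Reduce/combine step: folds another collector's partial result into this one. */
        ReadLengthCollector combine(ReadLengthCollector other) {
            metrics.readCount += other.metrics.readCount;
            metrics.totalBases += other.metrics.totalBases;
            return this;
        }

        ReadLengthMetrics result() {
            return metrics;
        }
    }

    /** Walker-style context: a single collector sees every read in order. */
    static ReadLengthMetrics collectSequentially(List<String> reads, ReadLengthArgs args) {
        ReadLengthCollector collector = new ReadLengthCollector(args);
        for (String read : reads) {
            collector.acceptRead(read);
        }
        return collector.result();
    }

    /** Spark-style context: one collector per partition, then the partial results are combined. */
    static ReadLengthMetrics collectPartitioned(List<List<String>> partitions, ReadLengthArgs args) {
        ReadLengthCollector combined = new ReadLengthCollector(args);
        for (List<String> partition : partitions) {
            ReadLengthCollector partial = new ReadLengthCollector(args);
            for (String read : partition) {
                partial.acceptRead(read);
            }
            combined.combine(partial);
        }
        return combined.result();
    }

    public static void main(String[] argv) {
        ReadLengthArgs args = new ReadLengthArgs(3);
        ReadLengthMetrics walker = collectSequentially(Arrays.asList("ACGT", "AC", "ACGTAC"), args);
        ReadLengthMetrics spark = collectPartitioned(
                Arrays.asList(Arrays.asList("ACGT", "AC"), Arrays.asList("ACGTAC")), args);
        System.out.println("walker: reads=" + walker.readCount + " mean=" + walker.meanReadLength());
        System.out.println("spark:  reads=" + spark.readCount + " mean=" + spark.meanReadLength());
    }
}
```

Both paths reuse the same collector class and differ only in how reads are fed to it, which is the point of keeping the per-read logic and the combine step in XMetricsCollector and leaving the RDD handling to a thin XMetricsCollectorSpark adapter.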
Class | Description |
---|---|
CollectBaseDistributionByCycleSpark | Collects base distribution per cycle in SAM/BAM/CRAM file(s). |
CollectInsertSizeMetricsSpark | Collects insert size distribution information in alignment data. |
CollectMultipleMetricsSpark | Runs multiple metrics collection modules for a given alignment file. |
CollectQualityYieldMetricsSpark | Collects quality yield metrics in SAM/BAM/CRAM file(s). |
InsertSizeMetricsCollectorSpark | Worker class that collects insert size metrics, adds the metrics to a file, and provides accessors to the stats of groups at different levels. |
MeanQualityByCycleSpark | Program to generate a data table and chart of mean quality by cycle from a BAM file. |
MetricsCollectorSparkTool<T extends MetricsArgumentCollection> | Base class for standalone Spark metrics collector tools. |
QualityScoreDistributionSpark | Charts quality score distribution within a BAM file. |
QualityYieldMetricsCollectorSpark | QualityYieldMetricsCollector for Spark. |
Enum | Description |
---|---|
CollectMultipleMetricsSpark.SparkCollectors |