@DocumentedFeature public final class PreprocessIntervals extends GATKTool
The input intervals are first checked for overlapping intervals, which are merged. The resulting intervals are then padded. The padded intervals are then split into bins. Finally, bins that contain only Ns are filtered out.
interval-merging-rule
must be set to IntervalMergingRule.OVERLAPPING_ONLY
and all other common arguments for interval padding or merging must be set to their defaults.
If no intervals are specified, then each contig will be assumed to be a single interval and binned accordingly;
this produces bins appropriate for whole genome sequencing analyses.
padding
to specify the size of each of the regions added to both ends of the intervals that result
after overlapping intervals have been merged. Do not use the common interval-padding
argument.
Intervals that would overlap after padding by the specified amount are instead only
padded until they are adjacent.
gatk PreprocessIntervals \ -R reference.fa \ -L intervals.interval_list \ --bin-length 0 \ --padding 250 \ -O preprocessed_intervals.interval_listTo generate consecutive bins of 1000 bases from the reference (e.g., for whole genome sequencing analyses):
gatk PreprocessIntervals \ -R reference.fa \ --bin-length 1000 \ --padding 0 \ -O preprocessed_intervals.interval_list
Modifier and Type | Field and Description |
---|---|
static java.lang.String |
BIN_LENGTH_LONG_NAME |
static java.lang.String |
PADDING_LONG_NAME |
addOutputSAMProgramRecord, addOutputVCFCommandLine, cloudIndexPrefetchBuffer, cloudPrefetchBuffer, createOutputBamIndex, createOutputBamMD5, createOutputVariantIndex, createOutputVariantMD5, disableBamIndexCaching, features, intervalArgumentCollection, lenientVCFProcessing, outputSitesOnlyVCFs, progressMeter, readArguments, referenceArguments, SECONDS_BETWEEN_PROGRESS_UPDATES_NAME, seqValidationArguments
GATK_CONFIG_FILE, logger, NIO_MAX_REOPENS, NIO_PROJECT_FOR_REQUESTER_PAYS, QUIET, specialArgumentsCollection, tmpDir, useJdkDeflater, useJdkInflater, VERBOSITY
Constructor and Description |
---|
PreprocessIntervals() |
Modifier and Type | Method and Description |
---|---|
void |
onTraversalStart()
Operations performed just prior to the start of traversal.
|
java.lang.Object |
onTraversalSuccess()
Operations performed immediately after a successful traversal (ie when no uncaught exceptions were thrown during the traversal).
|
boolean |
requiresReference()
Does this tool require reference data? Traversals types and/or tools that do should override to return true.
|
void |
traverse()
A complete traversal from start to finish.
|
addFeatureInputsAfterInitialization, closeTool, createSAMWriter, createSAMWriter, createVCFWriter, createVCFWriter, directlyAccessEngineFeatureManager, directlyAccessEngineReadsDataSource, directlyAccessEngineReferenceDataSource, doWork, getBestAvailableSequenceDictionary, getDefaultCloudIndexPrefetchBufferSize, getDefaultCloudPrefetchBufferSize, getDefaultReadFilters, getDefaultToolVCFHeaderLines, getDefaultVariantAnnotationGroups, getDefaultVariantAnnotations, getGenomicsDBOptions, getHeaderForFeatures, getHeaderForReads, getHeaderForSAMWriter, getMasterSequenceDictionary, getPluginDescriptors, getProgressMeterRecordLabel, getReferenceDictionary, getSequenceDictionaryValidationArgumentCollection, getToolName, getTransformedReadStream, getTraversalIntervals, hasFeatures, hasReads, hasReference, hasUserSuppliedIntervals, makePostReadFilterTransformer, makePreReadFilterTransformer, makeReadFilter, makeVariantAnnotations, onShutdown, onStartup, requiresFeatures, requiresIntervals, requiresReads, transformTraversalIntervals, useVariantAnnotations
customCommandLineValidation, getCommandLine, getCommandLineParser, getDefaultHeaders, getMetricsFile, getSupportInformation, getToolkitName, getToolkitShortName, getToolStatusWarning, getUsage, getVersion, instanceMain, instanceMainPostParseArgs, isBetaFeature, isExperimentalFeature, parseArgs, printLibraryVersions, printSettings, printStartupMessage, runTool, setDefaultHeaders, warnOnToolStatus
public static final java.lang.String BIN_LENGTH_LONG_NAME
public static final java.lang.String PADDING_LONG_NAME
public boolean requiresReference()
GATKTool
requiresReference
in class GATKTool
public void onTraversalStart()
GATKTool
onTraversalStart
in class GATKTool
public void traverse()
GATKTool
public java.lang.Object onTraversalSuccess()
GATKTool
onTraversalSuccess
in class GATKTool