All Classes and Interfaces (biojava-core 7.0.1 API)

Class

Description

AAindexFactory

Factory class to get Providers for substitution matrices that are provided by the AAINDEX database.

Title: ABITrace

The details of a Compound

AbstractCompoundSet<C extends Compound>

AbstractCompoundTranslator<F extends Compound,T extends Compound>

AbstractFeature<S extends AbstractSequence<C>,C extends Compound>

A feature is currently any descriptive item that can be associated with a sequence position(s) A feature has a type and a source which is currently a string to allow flexibility for the user Ideally well defined features should have a class to describe attributes of that feature

AbstractLocation

Base abstraction of a location which encodes for the majority of important features about a location such as the start, end and strand

AbstractNucleotideCompoundSet<C extends NucleotideCompound>

AbstractReference

AbstractSequence<C extends Compound>

The base class for DNA, RNA and Protein sequences.

AbstractSequence.AnnotationType

AccesionedLocation

A location which is bound to an AccessionID.

Accessioned

Indicates an entity is accessioned

AccessionID

Used in Sequences as the unique identifier.

AlignedSequence<S extends Sequence<C>,C extends Compound>

Defines a data structure for a Sequence within an alignment.

AlignedSequence.Step

Defines an alignment step in order to pass alignment information from an Aligner to a constructor.

AmbiguityDNACompoundSet

AmbiguityDNARNAHybridCompoundSet

Ambiguity set for hybrid DNA/RNA sequences.

AmbiguityRNACompoundSet

AminoAcidCompound

Used to describe an Amino Acid.

AminoAcidCompoundSet

Set of proteinogenic amino acids.

ArrayListProxySequenceReader<C extends Compound>

ArrayListSequenceReader<C extends Compound>

Stores a Sequence as a collection of compounds in an ArrayList

BasicSequence<C extends Compound>

Bare bones version of the Sequence object to be used sparingly.

BitSequenceReader<C extends Compound>

An implementation of the popular bit encodings.

BitSequenceReader.BitArrayWorker<C extends Compound>

The logic of working with a bit has been separated out into this class to help developers create the bit data structures without having to put the code into an intermediate format and to also use the format without the need to copy this code.

BlastHitBuilder

BlastHspBuilder

BlastResult

This class models a Blast/Blast plus result.

BlastResultBuilder

BlastTabularParser

BlastXMLParser

Re-designed by Paolo Pavan on the footprint of: org.biojava.nbio.genome.query.BlastXMLQuery by Scooter Willis You may want to find my contacts on Github and LinkedIn for code info or discuss major changes.

BufferedReaderBytesRead

Need to keep track of actual bytes read and take advantage of buffered reader performance.

CaseInsensitiveCompound

Attempts to wrap compounds so it is possible to view them in a case insensitive manner

CasePreservingProteinSequenceCreator

A sequence creator which preserves the case of its input string in the user collection of the returned ProteinSequence.

CDSComparator

CDSSequence

Represents a exon or coding sequence in a gene.

ChromosomeSequence

A ChromosomeSequence is a DNASequence but keeps track of geneSequences

ClasspathResource

This object represents a classpath resource on the local system.

CodonCompound

Define a codon

ComplementCompound

ComplementSequenceView<C extends ComplementCompound>

For a given sequence this class will create a view over the top of it and for every request the code will return the complement of the underlying base e.g.

Compound

CompoundNotFoundException

CompoundSet<C extends Compound>

CompoundTranslator<F extends Compound,T extends Compound>

ConcurrencyTools

Static utility to easily share a thread pool for concurrent/parallel/lazy execution.

CRC64Checksum

Utility class that calculates a CRC64 checksum on a stream of bytes.

DatabaseReferenceInterface

If a SequenceProxyReader implements this interface then that external source has a list of cross reference id(s)

DataSource

GenBank gi|gi-number|gb|accession|locus ENA Data Library gi|gi-number|emb|accession|locus DDBJ, DNA Database of Japan gi|gi-number|dbj|accession|locus NBRF PIR pir||entry Protein Research Foundation prf||name SWISS-PROT UNIPROT sp|accession|name Brookhaven Protein Data Bank (1) pdb|entry|chain Brookhaven Protein Data Bank (2) entry:chain|PDBID|CHAIN|SEQUENCE Patents pat|country|number GenInfo Backbone Id bbs|number General database identifier gnl|database|identifier NCBI Reference Sequence ref|accession|locus Local Sequence identifier lcl|identifier

DBReferenceInfo

If you have a uniprot ID then it is possible to get a collection of other id(s) that the protein is known by.

DefaultAAIndexProvider

The default provider for AAINDEX loads substitution matrices from the AAINDEX file in the resources directory

DemoSixFrameTranslation

Created by andreas on 8/10/15.

DNACompoundSet

DNASequence

This is class should model the attributes associated with a DNA sequence

DNASequence.DNAType

The type of DNA sequence

DNASequenceCreator

A helper class that allows different ways to read a string and create a DNA sequence.

DNAToRNATranslator

Performs the first stage of transcription by going from DNA to RNA.

Edit<C extends Compound>

Interface for carrying out edit operations on a Sequence.

Edit.AbstractEdit<C extends Compound>

Abstract class which defines all edit operations as a call to discover what 5' and 3' ends of an editing Sequence should be joined together with a target Sequence.

Edit.Delete<C extends Compound>

Implementation which allows for the deletion of bases from a Sequence

Edit.Insert<C extends Compound>

Edit implementation which allows us to insert a base at any position in a Sequence.

Edit.Substitute<C extends Compound>

Allows for the substitution of bases into an existing Sequence.

EmblId

This class contains the processed data of embl file Primary accession number Sequence version number Topology: 'circular' or 'linear' Molecule type Data class Taxonomic division Sequence length

EmblReader

This class should process the data of embl file

EmblRecord

this class contains the parsed data of embl file

EmblReference

This class contains the processed data of embl file that contains the referenceNumber, referenceComment, referencePosition referenceCrossReference, referenceGroup, referenceAuthor referenceTitle, referenceLocation

Equals

A set of helper methods which return true if the two parameters are equal to each other.

ExonComparator

Sort Exon where it is a little confusing if exons should always be ordered left to right where a negative stranded gene should go the other direction.

ExonSequence

A gene contains a collection of Exon sequences

FastaGeneWriter

A Gene sequence has a Positive or Negative Strand where we want to write out to a stream the 5 to 3 prime version.

FastaHeaderFormatInterface<S extends Sequence<?>,C extends Compound>

FastaReader<S extends Sequence<?>,C extends Compound>

Use FastaReaderHelper as an example of how to use this class where FastaReaderHelper should be the primary class used to read Fasta files

FastaReaderHelper

FastaSequenceParser

Used to parse a stream of a fasta file to get the sequence

FastaWriter<S extends Sequence<?>,C extends Compound>

The FastaWriter writes a collection of sequences to an outputStream.

FastaWriterHelper

The class that should be used to write out fasta file of a sequence collection

FeatureDbReferenceInfo<S extends AbstractSequence<C>,C extends Compound>

It is DBReferenceInfo which implements FeatureInterface.

FeatureInterface<S extends AbstractSequence<C>,C extends Compound>

Interface class to handle describing arbitrary features.

FeatureRetriever<C extends Compound>

If a SequenceProxyReader implements this interface then that external source has a list features

FeaturesKeyWordInterface

Models the keywords that are annotated for a protein sequence at Uniprot.

FileDownloadUtils

FileDownloadUtils.Hash

FileProxyDNASequenceCreator

This class is a good example of using the SequenceCreatorInterface where during parsing of the stream the sequence and the offset index are passed to create a Protein sequence that will be loaded in lazily.

FileProxyProteinSequenceCreator

FileProxyRNASequenceCreator

FlatFileCache

Provides a cache for storing multiple small files in memory.

FourBitSequenceReader<C extends Compound>

Four bit encoding of the bit formats.

FourBitSequenceReader.FourBitArrayWorker<C extends Compound>

A four bit per compound implementation of the bit array worker code.

Frame

Indicates a way of translating a sequence.

FuzzyPoint

Implementation for resolving fuzzy locations.

GenbankHeaderFormatInterface<S extends Sequence<?>,C extends Compound>

GenbankProxySequenceReader<C extends Compound>

GenbankReader<S extends AbstractSequence<C>,C extends Compound>

Use GenbankReaderHelper as an example of how to use this class where GenbankReaderHelper should be the primary class used to read Genbank files

GenbankReaderHelper

GenbankReference

For Genbank format file only.

GenbankSequenceParser<S extends AbstractSequence<C>,C extends Compound>

GenbankWriter<S extends Sequence<?>,C extends Compound>

GenbankWriterHelper

The class that should be used to write out genbank file of a sequence collection

GenericFastaHeaderFormat<S extends AbstractSequence<?>,C extends Compound>

We store the original header if the sequence is parsed from a fasta file and will use that exact sequence if we write out the sequences to a fasta file.

GenericFastaHeaderParser<S extends AbstractSequence<C>,C extends Compound>

The default fasta header parser where some headers are well defined based on the source database which allows us to set the source of the protein sequence and the identifier that can be used in future implementations to load features from external sources If the user has a custom header with local data then they can create their own implementation of a FastaHeaderParserInterface

GenericGenbankHeaderFormat<S extends AbstractSequence<C>,C extends Compound>

GenericGenbankHeaderParser<S extends AbstractSequence<C>,C extends Compound>

GenericInsdcHeaderFormat<S extends AbstractSequence<C>,C extends Compound>

GeneSequence

Hashcoder

Contains helper methods for generating a HashCode without having to resort to the commons lang hashcode builders.

Hit

This class models a search Hit.

Hsp<S extends Sequence<C>,C extends Compound>

This class models a search Hsp.

InputStreamProvider

A class that provides an InputStream from a File.

InsdcLocations

A collection of locations which are used whenever we work with INSDC; some of which could be deprecated (from INSDC's point of view) yet appear in records.

InsdcLocations.BondLocation

Used to represent bond locations equivalent to bond(7,8) or bond(7).

InsdcLocations.GroupLocation

Deprecated in INSDC yet still appears; equivalent to the order() directive except no 5' to 3' ordering is defined.

InsdcLocations.OneOfLocation

Deprecated in INSDC; refers to a set of locations of which one location could be valid e.g.

InsdcLocations.OrderLocation

Used to describe a 5' to 3' ordering but no firm assurance it is correct

InsdcParser

Parser for working with INSDC style locations.

IntronSequence

IOUtils

IOUtils.ReaderProcessor

Closure interface used when working with IOUtils#processReader(String).

IUPACParser

Available translations 1 - UNIVERSAL 2 - VERTEBRATE_MITOCHONDRIAL 3 - YEAST_MITOCHONDRIAL 4 - MOLD_MITOCHONDRIAL 5 - INVERTEBRATE_MITOCHONDRIAL 6 - CILIATE_NUCLEAR 9 - ECHINODERM_MITOCHONDRIAL 10 - EUPLOTID_NUCLEAR 11 - BACTERIAL 12 - ALTERNATIVE_YEAST_NUCLEAR 13 - ASCIDIAN_MITOCHONDRIAL 14 - FLATWORM_MITOCHONDRIAL 15 - BLEPHARISMA_MACRONUCLEAR 16 - 2CHLOROPHYCEAN_MITOCHONDRIAL 21 - TREMATODE_MITOCHONDRIAL 23 - SCENEDESMUS_MITOCHONDRIAL Taken from NCBI with slight modification and put into the classpath resource.

IUPACParser.IUPACTable

Holds the concept of a codon table from the IUPAC format

JoiningSequenceReader<C extends Compound>

This reader actually proxies onto multiple types of sequence in order to allow a number of sequence objects to act as if they are one sequence.

LightweightProfile<S extends Sequence<C>,C extends Compound>

Defines a minimal data structure for reading and writing a sequence alignment.

LightweightProfile.StringFormat

List of output formats.

Location

Sets of integers used to represent the location of features on sequence.

Location.Tools

Helper methods for use with the Location classes.

LocationHelper

Helper methods for use with the Location classes.

Messages

MultipleSequenceAlignment<S extends Sequence<C>,C extends Compound>

Implements a minimal data structure for reading and writing a sequence alignment.

MutableAlignedSequence<S extends Sequence<C>,C extends Compound>

Defines a mutable (editable) data structure for an AlignedSequence.

MutableProfile<S extends Sequence<C>,C extends Compound>

Defines a mutable (editable) data structure for a Profile.

MutableProfilePair<S extends Sequence<C>,C extends Compound>

Defines a mutable (editable) data structure for a ProfilePair.

MutableSequencePair<S extends Sequence<C>,C extends Compound>

Defines a mutable (editable) data structure for the results of pairwise sequence alignment.

NucleotideCompound

ParseFastaFileDemo

Created by andreas on 6/17/15.

ParserException

General abstraction of different parsing errors

PlainFastaHeaderParser<S extends AbstractSequence<C>,C extends Compound>

The plain fasta header takes everything in the header as a single entity.

Point

Holds a single point part of a location

Point.Resolver<T extends Point>

Used to resolve a position about a point

PrettyXMLWriter

Implementation of XMLWriter which emits nicely formatted documents to a PrintWriter.

Profile<S extends Sequence<C>,C extends Compound>

Defines a data structure for the results of sequence alignment.

Profile.StringFormat

List of output formats.

ProfilePair<S extends Sequence<C>,C extends Compound>

Defines a data structure for the results of the alignment of a pair of Profiles.

ProfileView<S extends Sequence<C>,C extends Compound>

Defines a data structure for a view of sequence alignment.

ProteinSequence

The representation of a ProteinSequence

ProteinSequenceCreator

Used to create a ProteinSequence from a String to allow for details about the location of the sequence etc.

ProxySequenceReader<C extends Compound>

Qualifier

QualityFeature<S extends AbstractSequence<C>,C extends Compound>

DNA Sequences produced by modern sequencers usually have quality informaion attached to them.

QuantityFeature<S extends AbstractSequence<C>,C extends Compound>

It is common to have a numerical value or values associated with a feature.

ReferenceInterface

Result

This class models a search result.

ResultFactory

ReversedSequenceView<C extends Compound>

For a given sequence this class will return the base at the reversed position i.e.

RNACompoundSet

RNASequence

RNASequence where RNACompoundSet are the allowed values

RNASequenceCreator

Used to create a RNA sequence

RnaSequenceView

Attempts to do on the fly translation of RNA by not requesting the compounds until asked.

RNAToAminoAcidTranslator

Takes a Sequence of NucleotideCompound which should represent an RNA sequence (RNASequence is good for this) and returns a list of Sequence which hold AminoAcidCompound.

ScaledSubstitutionMatrix

The biojava-alignment module represents substitution matrices with short values.

SearchIO

Sequence<C extends Compound>

Main interface for defining a collection of Compounds and accessing them using biological indexes

SequenceAsStringHelper<C extends Compound>

This is a common method that can be used across multiple storage/proxy implementations to handle Negative strand and other interesting elements of sequence data.

SequenceComparator

Used to sort sequences in ascending order of bioBegin property.

SequenceCreatorInterface<C extends Compound>

SequenceFileProxyLoader<C extends Compound>

This class represents the storage container of a sequence stored in a fasta file where the initial parsing of the file we store the offset and length of the sequence.

SequenceHeaderParserInterface<S extends Sequence<?>,C extends Compound>

SequenceLocation<S extends AbstractSequence<C>,C extends Compound>

A location in a sequence that keeps a reference to its parent sequence

SequenceMixin

Provides a set of static methods to be used as static imports when needed across multiple Sequence implementations but inheritance gets in the way.

SequenceMixin.SequenceIterator<C extends Compound>

A basic sequence iterator which iterates over the given Sequence by biological index.

SequenceOptimizationHints

A static class that provides optimization hints for memory or performance handling of sequence data.

SequenceOptimizationHints.SequenceCollection

SequenceOptimizationHints.SequenceUsage

SequencePair<S extends Sequence<C>,C extends Compound>

Defines a data structure for the results of pairwise sequence alignment.

SequenceParserInterface

SequenceProxyView<C extends Compound>

SequenceReader<C extends Compound>

SequenceTools

SequenceView<C extends Compound>

SimpleAlignedSequence<S extends Sequence<C>,C extends Compound>

Implements a data structure for a Sequence within an alignment.

SimpleLocation

Very basic implementation of the Location interface which defines a series of simple constructors.

SimplePoint

Basic implementation of the Point interface.

SimpleProfile<S extends Sequence<C>,C extends Compound>

Implements a data structure for the results of sequence alignment.

SimpleProfilePair<S extends Sequence<C>,C extends Compound>

Implements a data structure for the results of the alignment of a pair of Profiles.

SimpleSequencePair<S extends Sequence<C>,C extends Compound>

Implements a data structure for the results of pairwise sequence alignment.

SimpleSubstitutionMatrix<C extends Compound>

Implements a data structure which holds the score (penalty or bonus) given during alignment for the exchange of one Compound in a sequence for another.

SingleCompoundSequenceReader<C extends Compound>

An implementation of the SequenceReader interface which for every call will return only 1 compound (given to it during construction; a String is also valid but will require a CompoundSet).

SingleLinkageClusterer

An implementation of a single linkage clusterer See http://en.wikipedia.org/wiki/Single-linkage_clustering

SoftHashMap<K,V>

A in memory cache using soft references.

StartCodonSequence

Used to map the start codon feature on a gene

StopCodonSequence

Used to map the stop codon sequence on a gene

Strand

Provides a way of representing the strand of a sequence, location hit or feature.

StringManipulationHelper

A utility class for common String manipulation tasks.

StringProxySequenceReader<C extends Compound>

An example of a ProxySequenceReader that is created from a String.

SubstitutionMatrix<C extends Compound>

Defines a data structure which holds the score (penalty or bonus) given during alignment for the exchange of one Compound in a sequence for another.

SubstitutionMatrixHelper

Static utility to access substitution matrices that come bundled with BioJava.

Table

Provides a way of separating us from the specific IUPACParser.IUPACTable even though this is the only implementing class for the interface.

Table.CaseInsensitiveTriplet

Class used to hold three nucleotides together and allow for equality to be assessed in a case insensitive manner.

Table.Codon

Instance of a Codon which is 3 NucleotideCompounds, its corresponding AminoAcidCompound and if it is a start or stop codon.

TaxonomyID

A sequence can be associated with a species or Taxonomy ID

TextFeature<S extends AbstractSequence<C>,C extends Compound>

A implmentation of AbstractFeature

TranscriptionEngine

Used as a way of encapsulating the data structures required to parse DNA to a Protein sequence.

TranscriptionEngine.Builder

This class is the way to create a TranslationEngine.

TranscriptSequence

This is the sequence if you want to go from a gene sequence to a protein sequence.

TranslationException

Thrown from AbstractCompundTranslator

TwoBitSequenceReader<C extends NucleotideCompound>

Implementation of the 2bit encoding.

TwoBitSequenceReader.TwoBitArrayWorker<C extends NucleotideCompound>

Extension of the BitArrayWorker which provides the 2bit implementation code.

UncompressFile

Uncompresses a single tarred or zipped file, writing output to stdandard out

UncompressInputStream

This class decompresses an input stream containing data compressed with the unix "compress" utility (LZC, a LZW variant).

UniprotProxySequenceReader<C extends Compound>

Pass in a Uniprot ID and this ProxySequenceReader when passed to a ProteinSequence will get the sequence data and other data elements associated with the ProteinSequence by Uniprot.

WindowedSequence<C extends Compound>

A sliding window view of a sequence which does not implement any interfaces like Sequence because they do not fit how this works.

XMLHelper

Helper methods to simplify boilerplate XML parsing code for org.w3c.dom XML objects

XMLWriter

Simple interface for building XML documents.