org.bdgenomics.adam.algorithms.consensus
Provides a generator to extract consensuses from a known set of INDELs.
Provides a generator to extract consensuses from a known set of INDELs.
The previously called INDEL variants.
The number of bases to flank each known INDEL by. Default is 0 bases.
A consensus generator that looks at previously called INDELs.
Provides a generator to extract consensuses from aligned reads.
Provides a generator to extract consensuses from aligned reads.
A consensus generator that looks directly at aligned reads. Here, consensus sequences are extracted by substituting INDELs that are present in a single aligned read back into the reference sequence where they are aligned.
Provides a generator to extract consensuses by realigning reads.
Provides a generator to extract consensuses by realigning reads.
Match weight to use for Smith-Waterman.
Mismatch penalty to use for Smith-Waterman.
Insert penalty to use for Smith-Waterman.
Deletion penalty to use for Smith-Waterman.
A consensus generator that uses Smith-Waterman to realign reads to the reference sequence they overlap. INDELs that are present after this realignment stage are then used as targets for a second realignment phase.
Provides a generator to extract consensuses using several methods.
Provides a generator to extract consensuses using several methods.
A consensus generator that generates consensuses with several methods.
Singleton object for creating consensus generators.
Consensus generators are used in INDEL realignment to generate the consensus sequences that reads are realigned against. We have three consensus modes:
* From reads: This mode looks at the read alignments for evidence of INDELs. Any INDEL that shows up in a read alignment is extracted and used to generate a consensus sequence. * From reads with Smith-Waterman: This mode discards the original read alignments and uses Smith-Waterman to locally realign the read. If this realignment leads to a read being aligned with an insertion or deletion against the reference, we generate a consensus sequence for the INDEL. * From knowns: In this mode, we use a set of provided INDEL variants as the INDELs to generate consensus sequences for.
Additionally, we support a union operator, that takes the union of several consensus modes.