Structural and splice variant detection from transcriptome assembly
PAVFinder_transcriptome (PVT) is a Python package for identifying structural variants in transcriptome assemblies. In a nutshell, the algorithm infers variants from non-contiguous (split or gapped) alignments of contig sequences to the reference genome. With the aid of gene-model annotations, it classifies these events into diverse classes of variants, such as gene fusions, read-throughs, internal and partial tandem duplications, indels and novel splice variants.
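To make the idea of inferring a variant from a split alignment concrete, here is a minimal Python sketch. It is not PVT's actual code or API: the `Segment` type, the `classify_split` function and the three labels are hypothetical names invented for illustration, and the rules are deliberately simplified (strand, read-throughs and indels are ignored).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One aligned block of a contig on the reference (hypothetical type)."""
    chrom: str
    start: int  # 1-based reference start
    end: int    # 1-based reference end, inclusive
    gene: str   # gene overlapping this block, taken from the annotation


def classify_split(first: Segment, second: Segment) -> str:
    """Label the junction between two consecutive aligned blocks of one contig.

    Assumed, simplified rules (not PVT's real logic):
    - blocks in different genes        -> "fusion"
    - same gene, second block starts at or before the end of the
      first block on the reference    -> "duplication"
    - otherwise (same gene, a gap)     -> "splice_or_deletion"
    """
    if first.gene != second.gene:
        return "fusion"
    if first.chrom == second.chrom and second.start <= first.end:
        return "duplication"
    return "splice_or_deletion"


if __name__ == "__main__":
    # Coordinates below are made up for the example.
    left = Segment("chr22", 1000, 1200, "BCR")
    right = Segment("chr9", 5000, 5300, "ABL1")
    print(classify_split(left, right))
```

A real classifier would also need the annotated exon boundaries to tell a novel splice junction from a genomic deletion, which is where the gene-model annotation comes in.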
PVT is usually run after de novo assembly of RNA-seq reads and alignment of the resulting contigs to the reference genome. A pipeline called TAP (Transabyss-Alignment-PAVFinder), which bundles these 3 analysis steps, is therefore provided as a standalone application. TAP can also be run in targeted mode on selected genes, which requires a Bloom filter of the target gene sequences to be created beforehand. Whereas the full assembly of an RNA-seq library with over 100 million read pairs takes more than 24 hours, targeted assembly and analysis of a list of several hundred genes (e.g. COSMIC) can be completed within half an hour.
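The role of the Bloom filter in targeted mode can be sketched as follows. This is a toy Python illustration of the general technique, not the filter format or tooling that TAP uses: the `BloomFilter` class, the helper names, the k-mer size of 25 and the `min_hits` threshold are all assumptions. Reads sharing k-mers with the target genes are kept for assembly, and the rest are skipped, which is what makes a targeted run so much faster.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter over strings (illustrative, not TAP's format)."""

    def __init__(self, num_bits=1 << 20, num_hashes=4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos >> 3] |= 1 << (pos & 7)

    def __contains__(self, item):
        # May return a false positive, never a false negative.
        return all(self.bits[pos >> 3] & (1 << (pos & 7))
                   for pos in self._positions(item))


def kmers(seq, k):
    """Yield every k-mer of seq; yields nothing if seq is shorter than k."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))


def build_target_filter(target_seqs, k=25):
    """Load all k-mers of the target gene sequences into one filter."""
    bf = BloomFilter()
    for seq in target_seqs:
        for kmer in kmers(seq.upper(), k):
            bf.add(kmer)
    return bf


def read_hits_target(read, bf, k=25, min_hits=1):
    """Keep a read if at least min_hits of its k-mers are in the filter."""
    hits = sum(1 for kmer in kmers(read.upper(), k) if kmer in bf)
    return hits >= min_hits
```

This sketch ignores reverse complements, which a real read filter would have to handle, for example by storing a canonical form of each k-mer.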
Released Oct 18, 2016
First public release
- Get PAVFinder_transcriptome for Linux
- PAVFinder_transcriptome v0.3.1