SSAKE

SSAKE is a genomics application for assembling millions of very short DNA sequences.

Current release
SSAKE 3.8

Released May 03, 2011

v3.8+ is 30% faster than the previous release. Additional assembly control has been implemented (-w) that limits the generation of low depth of coverage contigs
More about this release…

Download file Get SSAKE for all platforms (2.2 MB)
ssake_v3-8-tar.gz

Project Description

The Short Sequence Assembly by K-mer search and 3' read Extension (SSAKE) is a genomics application for aggressively assembling millions of short nucleotide sequences by progressively searching for perfect 3'-most k-mers using a DNA prefix tree. SSAKE is designed to help leverage the information from short sequences reads by stringently clustering them into contigs that can be used to characterize novel sequencing targets.

*Best performance is achieved by quality-trimming your reads before assembly


Enjoy SSAKE responsibly!

 

Summary

SSAKE is written in PERL and runs on Linux. SSAKE cycles through short sequence reads stored in a hash table and progressively searches through a prefix tree for the longest possible identical overlap between any two sequences. The algorithm was used to assemble 25-36 bp sequence reads from viral, bacterial and fungal genomes and on forty millions 25-mers simulated using the whole-genome shotgun (WGS) sequence data from the Sargasso sea metagenomics project. Considering the number of sequences to assemble, SSAKE is robust and tractable.

 

Documentation

René L Warren, Granger G Sutton, Steven JM Jones, Robert A Holt. 2007 (epub 2006 Dec 8). Assembling millions of short DNA sequences using SSAKE. bioinformatics. 23:500-501.

License

Copyright (c) 2006-2011 Canada's Michael Smith Genome Science Centre. All rights reserved.

This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

 

Credits

René Warren, Granger Sutton, Steven Jones and Robert Holt