Programs to assign taxonomic info to loads of rRNA sequences




SSuMMo is a library of functions designed around iteratively using HMMER to assign sequences to taxa. Results are highly annotated trees showing the species / genus distribution within the sampled community.
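
To illustrate what an iterative assignment over a hierarchy of models could look like, here is a minimal sketch. It is not SSuMMo's actual code: the nested-dict taxonomy, the `assign_taxon` function and the `score` callback are all hypothetical names, and the toy motif-counting scorer merely stands in for a real HMMER search against each taxon's model. The sketch assumes that, at each level, the sequence follows the best-scoring child taxon.

```python
from typing import Callable, Dict, List


def assign_taxon(sequence: str,
                 taxonomy: Dict[str, dict],
                 score: Callable[[str, str], float]) -> List[str]:
    """Walk down a nested-dict taxonomy, following the best-scoring child.

    `score(sequence, taxon)` is a hypothetical stand-in for scoring the
    sequence against that taxon's HMM; it is not a SSuMMo or HMMER API.
    """
    path = []
    level = taxonomy
    while level:  # stop once the current taxon has no children
        best = max(level, key=lambda name: score(sequence, name))
        path.append(best)
        level = level[best]
    return path


# Toy data (assumed for illustration only): each taxon gets a short motif,
# and the score is how often that motif occurs in the sequence.
toy_taxonomy = {
    "Bacteria": {"Firmicutes": {}, "Proteobacteria": {}},
    "Archaea": {},
}
toy_motifs = {
    "Bacteria": "GGA",
    "Archaea": "CCT",
    "Firmicutes": "TTA",
    "Proteobacteria": "GCG",
}


def toy_score(sequence: str, taxon: str) -> float:
    """Count non-overlapping occurrences of the taxon's toy motif."""
    return float(sequence.count(toy_motifs[taxon]))


lineage = assign_taxon("GGAGGATTAGCGTTA", toy_taxonomy, toy_score)
print(lineage)
```

Only the models at one level of the hierarchy are scored per step, so the number of comparisons grows with the depth of the tree rather than with the total number of taxa.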

The source code includes programs to:

- build a hierarchical database of hidden Markov models;
- allocate sequences to recognised taxonomic names;
- analyse biological diversity, using Simpson, Shannon and other methods;
- visualise results as cladograms, with a distinct ability to easily cross-compare datasets;
- convert results to phyloXML format;
- build an HTML representation;
- plot rarefaction curves and calculate corresponding biodiversity indices.
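
For readers unfamiliar with the diversity measures named in the list above, the following is a small, self-contained sketch of the standard textbook formulas for the Shannon index, the Simpson index and the expected richness at one point on a rarefaction curve (Hurlbert's formula). The function names and the example counts are illustrative assumptions; this is not SSuMMo's own implementation, which may differ in detail.

```python
import math
from typing import Sequence


def shannon_index(counts: Sequence[int]) -> float:
    """Shannon index H' = -sum(p_i * ln(p_i)) over taxa with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def simpson_index(counts: Sequence[int]) -> float:
    """Simpson index D = sum(p_i ** 2); 1 - D is the Gini-Simpson diversity."""
    total = sum(counts)
    return sum((c / total) ** 2 for c in counts)


def rarefied_richness(counts: Sequence[int], n: int) -> float:
    """Expected number of taxa in a random subsample of n sequences.

    Uses Hurlbert's formula: sum over taxa of 1 - C(N - N_i, n) / C(N, n),
    where N is the total number of sequences and N_i the count for taxon i.
    """
    total = sum(counts)
    if not 0 <= n <= total:
        raise ValueError("subsample size must be between 0 and the total count")
    denom = math.comb(total, n)
    return sum(1 - math.comb(total - c, n) / denom for c in counts if c > 0)


# Hypothetical per-taxon sequence counts for one sample.
example_counts = [50, 50]
h = shannon_index(example_counts)
d = simpson_index(example_counts)
curve = [rarefied_richness(example_counts, n) for n in (1, 10, 100)]
```

Evaluating `rarefied_richness` over a range of subsample sizes gives the points of a rarefaction curve; `math.comb` requires Python 3.8 or later.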

Python source code is provided here, on Google Code. The prebuilt hierarchical database of HMMs, as well as an optimised SQL taxonomy database (used for deducing the rank of each taxon), can be downloaded from:-

For installation information, please refer to the README. For usage information, there is a wiki (above), and a preliminary User Manual has been added to the SVN trunk.
Last updated on May 6th, 2012

