Sequence selection

A rough tree of unaligned sequences

% mafft --retree 0 --treeout --reorder --6merpair --averagelinkage input

Newick tree
Reordered sequences

Size = 1000 sequences × 234-3409 sites
Distance measure = the number of shared 6mers
Clustering method = average linkage

MAFFT home:
    http://mafft.cbrc.jp/alignment/software/

MAYSCRIPT

Archaeopteryx home:
    https://sites.google.com/site/cmzmasek/home/software/archaeopteryx

Archaeopteryx references:
    Zmasek and Eddy (2001)
    Han and Zmasek (2009)

Job id =
Submitted at Sat Feb 16 23:19:05 +0900 2013
The results will be removed after 96 hours.