About
MAFFT is a multiple sequence alignment program for Unix-like operating systems.
It offers a range of multiple alignment methods, including
L-INS-i (accurate; for alignment of <∼200 sequences) and
FFT-NS-2 (fast; for alignment of <∼10,000 sequences).
Download and Installation
The latest version is 6.864 (2011/11/10).
Input Format
FASTA format.
Examples: example1 (LSU rRNA), example2 (protein).
The type of input sequences (amino acid or nucleotide) is automatically recognized.
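As a rough illustration of the input format, the sketch below writes a small FASTA file and counts its records. The sequences, the record names and the file name toy.fasta are made up for this example and are not one of the example files listed above.

```shell
# Write a toy protein FASTA file (made-up sequences, for illustration only).
toy_fasta="${TMPDIR:-/tmp}/toy.fasta"
cat > "$toy_fasta" <<'EOF'
>seq1
MKTAYIAKQR
>seq2
MKTAYLAKQR
>seq3
MKSAYIAKQK
EOF

# Each FASTA record starts with a '>' header line, so counting those
# lines gives the number of sequences in the file.
n_seqs=$(grep -c '^>' "$toy_fasta")
echo "$toy_fasta contains $n_seqs sequences"
```

A file like this can be passed to mafft as the input argument; no flag is needed to say that it holds amino acid sequences.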
Usage
% mafft [arguments] input > output
An alias for an accurate option (L-INS-i) for an alignment of up to ∼200 sequences × ∼2,000 sites:
% mafft-linsi input > output
A fast option (FFT-NS-2) for a larger sequence alignment:
% mafft input > output
If you are not sure which option to use:
% mafft --auto input > output
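The choice between the two commands above can be sketched as a small shell helper. This is only an illustration under stated assumptions: the function name suggest_mafft_cmd is hypothetical, the cutoff of 200 sequences is taken from the rough guideline above, sequence length is not checked, and this is not how --auto itself makes its decision.

```shell
# Hypothetical helper (not part of MAFFT): print a suggested command
# for a FASTA file, based only on the number of sequences it contains.
suggest_mafft_cmd() {
    input=$1
    # Refuse to guess if the file cannot be read.
    [ -r "$input" ] || return 1
    # Count FASTA records (header lines starting with '>').
    # grep -c exits non-zero when the count is 0, hence '|| true'.
    n=$(grep -c '^>' "$input" || true)
    if [ "$n" -lt 200 ]; then
        # Small input: the accurate L-INS-i alias.
        echo "mafft-linsi $input"
    else
        # Larger input: the default fast method (FFT-NS-2).
        echo "mafft $input"
    fi
}
```

Calling suggest_mafft_cmd with the input file prints one of the two command lines without running it. In practice, mafft --auto is the simpler way to let the program choose.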
References
- Katoh, Toh 2010 (Bioinformatics 26:1899-1900)
  Parallelization of the MAFFT multiple sequence alignment program.
  (describes the multithread version; Linux only)
- Katoh, Asimenos, Toh 2009 (Methods in Molecular Biology 537:39-64)
  Multiple Alignment of DNA Sequences with MAFFT. In: Bioinformatics for DNA Sequence Analysis, edited by D. Posada.
  (outlines DNA alignment methods and several tips including group-to-group alignment and rough clustering of a large number of sequences)
- Katoh, Toh 2008 (BMC Bioinformatics 9:212)
  Improved accuracy of multiple ncRNA alignment by incorporating structural information into a MAFFT-based framework.
  (describes RNA structural alignment methods)
- Katoh, Toh 2008 (Briefings in Bioinformatics 9:286-298)
  Recent developments in the MAFFT multiple sequence alignment program.
  (outlines version 6; Fast Breaking Paper in Thomson Reuters' ScienceWatch)
- Katoh, Toh 2007 (Bioinformatics 23:372-374) Errata
  PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences.
  (describes the PartTree algorithm)
- Katoh, Kuma, Toh, Miyata 2005 (Nucleic Acids Res. 33:511-518)
  MAFFT version 5: improvement in accuracy of multiple sequence alignment.
  (describes [ancestral versions of] the G-INS-i, L-INS-i and E-INS-i strategies)
- Katoh, Misawa, Kuma, Miyata 2002 (Nucleic Acids Res. 30:3059-3066)
  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.
  (describes the FFT-NS-1, FFT-NS-2 and FFT-NS-i strategies)
Contact
kazutaka.katoh@aist.go.jp
License
Copyright © 2011 Kazutaka Katoh