Introduction
DIAMOND is a sequence aligner for protein and translated DNA searches, designed for high performance analysis of big sequence data. The key features are:
- Pairwise alignment of proteins and translated DNA at 100x-10,000x speed of BLAST.
- Protein clustering of up to tens of billions of proteins
- Frameshift alignments for long read analysis.
- Low resource requirements and suitable for running on standard desktops or laptops.
- Various output formats, including BLAST pairwise, tabular and XML, as well as taxonomic classification.
Documentation
The online documentation is located at the GitHub Wiki.
Support
Diamond is actively supported and developed software. Please use the issue tracker for malfunctions and the GitHub discussions for questions, comments, feature requests, etc.
About
Since 2019, DIAMOND is developed by Benjamin Buchfink at the Drost lab, Max Planck Institute for Biology Tübingen. From 2018-2019, its development was supported by the German Federal Ministry for Economic Affairs and Energy through an EXIST grant. From 2016-2018, it was developed by Benjamin Buchfink as an independent researcher. From 2013-2015, the initial version was developed by Benjamin Buchfink at the Huson lab, University of Tübingen, Germany.
[📧Email] [Twitter] [Google Scholar] [Drost lab] [MPI-BIO]
When using the tool in published research, please cite:
- Buchfink B, Reuter K, Drost HG, "Sensitive protein alignments at tree-of-life scale using DIAMOND", Nature Methods 18, 366–368 (2021). doi:10.1038/s41592-021-01101-x
For sequence clustering:
- Buchfink B, Ashkenazy H, Reuter K, Kennedy JA, Drost HG, "Sensitive clustering of protein sequences at tree-of-life scale using DIAMOND DeepClust", bioRxiv 2023.01.24.525373; doi: https://doi.org/10.1101/2023.01.24.525373
Original publication to cite DIAMOND until v0.7.9:
- Buchfink B, Xie C, Huson DH, "Fast and sensitive protein alignment using DIAMOND", Nature Methods 12, 59-60 (2015). doi:10.1038/nmeth.3176