dRep
dRep is a Python program for rapidly comparing large numbers of genomes. dRep can also "de-replicate" a genome set by identifying groups of highly similar genomes and choosing the best representative genome from each group.
The manual, installation instructions, and API documentation are available at ReadTheDocs
Publication is available at ISMEJ
Open source pre-print publication is available at bioRxiv
Installation with pip
$ pip install drep
Quick start
Genome comparison:
$ dRep compare output_directory -g path/to/genomes/*.fasta
Genome de-replication:
$ dRep dereplicate output_directory -g path/to/genomes/*.fasta
Make sure dependencies are properly installed:
$ dRep check_dependencies
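To call the compare and dereplicate commands above from a Python script, one option is to invoke the command-line program through `subprocess`. The following is a minimal sketch, not part of dRep itself: the helper names `build_drep_command` and `run_drep` are hypothetical, and the only dRep arguments used are the ones shown in the quick start (`compare`, `dereplicate`, the output directory, and `-g`). Because no shell is involved, the wildcard is expanded with `glob` before the call.

```python
import glob
import subprocess


def build_drep_command(operation, output_directory, genome_pattern):
    """Build the argument list for a dRep run without executing it.

    Hypothetical helper: it only reproduces the quick-start command lines.
    """
    if operation not in ("compare", "dereplicate"):
        raise ValueError(f"unsupported operation: {operation}")
    # Expand the wildcard here, since subprocess does not use a shell.
    genomes = sorted(glob.glob(genome_pattern))
    if not genomes:
        raise FileNotFoundError(f"no genomes match {genome_pattern}")
    return ["dRep", operation, output_directory, "-g", *genomes]


def run_drep(operation, output_directory, genome_pattern):
    """Run dRep and raise CalledProcessError if it exits with an error."""
    command = build_drep_command(operation, output_directory, genome_pattern)
    subprocess.run(command, check=True)
```

For example, `run_drep("dereplicate", "output_directory", "path/to/genomes/*.fasta")` would run the same command as the de-replication line above, assuming `dRep` is on the PATH.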
Dependencies
Near Essential
- Mash - Makes primary clusters (v1.1.1 confirmed works)
- MUMmer - Performs default ANIm comparison method (v3.23 confirmed works)
Optional
- fastANI - A fast secondary clustering algorithm
- CheckM - Determines contamination and completeness of genomes (v1.0.7 confirmed works)
- gANI (aka ANIcalculator) - Performs gANI comparison method (v1.0 confirmed works)
- Prodigal - Used by both CheckM and gANI (v2.6.3 confirmed works)
- NSimScan - Only needed for goANI algorithm (open source version of gANI)
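As a quick complement to `dRep check_dependencies`, the sketch below reports which of the tools listed above can be found on the PATH. It does not replace dRep's own check and does not verify versions. The executable names in `ASSUMED_EXECUTABLES` are assumptions and may differ on your system (for example, MUMmer is looked up through its `nucmer` program), and `find_executables` is a hypothetical helper, not a dRep function.

```python
import shutil

# Assumed executable names for each dependency; adjust to match your install.
ASSUMED_EXECUTABLES = {
    "Mash": "mash",
    "MUMmer": "nucmer",
    "fastANI": "fastANI",
    "CheckM": "checkm",
    "gANI": "ANIcalculator",
    "Prodigal": "prodigal",
    "NSimScan": "nsimscan",
}


def find_executables(executables=ASSUMED_EXECUTABLES):
    """Map each tool name to its full path, or None if it is not on the PATH."""
    return {tool: shutil.which(exe) for tool, exe in executables.items()}


if __name__ == "__main__":
    for tool, path in find_executables().items():
        print(f"{tool}: {path or 'not found on PATH'}")
```

A tool reported as missing only matters if the comparison method you plan to use needs it; see the Near Essential and Optional groupings above.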