• Stars
    star
    285
  • Rank 145,115 (Top 3 %)
  • Language
    Python
  • License
    GNU General Publi...
  • Created almost 12 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Assess the quality of microbial genomes recovered from isolates, single cells, and metagenomes

CheckM

version status Bioconda Downloads BioConda Install

Installing and using CheckM

Please see the project home page for usage details and installation instructions: https://github.com/Ecogenomics/CheckM/wiki

We do not recommend installing CheckM from the master branch. This may be unstable. Please install an official release of CheckM or use pip.

Estimating quality of CPR genomes

Information about obtaining improved quality estimates for CPR (Patescibacteria) genomes can be found here: https://github.com/Ecogenomics/CheckM/wiki/Workflows#using-cpr-marker-set

Migration to Python 3

CheckM has been ported to Python 3 to accomodate Python 2 reaching end of life on January 1, 2020. CheckM >=1.1.0 requires Python 3. Python 2 will no longer be actively supported. Apologies for any issues this may cause.

Massive thanks to baudrly, Vini Salazar, and Asaf Peer for initial Python 2 to 3 porting.

Python 2 to 3 Validation

Porting of CheckM to Python 3 was validation on a set of 1,000 genomes randomly select from the GTDB R89 representative genomes. Results were compared to those generated with CheckM v1.0.18, the last Python 2 version of CheckM. Identical results were obtained for the 'lineage_wf', 'taxonomy_wf', and 'ssu_finder' methods across this set of test genomes. Other CheckM methods have been executed on a small set of 3 genomes to verify they run to completion under Python 3.

Removed Functionality

The following features have been removed from CheckM v1.1.x in order to simplify the code base and focus CheckM and support requests on critical functionality:

  • bin_qa_plot: non-critical, rarely used plot which does not scale to the large numbers of MAGs now being recovered
  • par_plot: non-critical plot and the same information is better presented in the reference distribution plots
  • cov_pca, tetra_pca: alternatives to these static plots exist in tools such as Anvi'o
  • len_plot: rarely used plot which is largely redundant with the len_hist and nx_plot plots
  • bin_union, bin_compare: feature rich alternative now exist such as DAS Tool and UniteM

Bug Reports

Please report bugs through the GitHub issues system.

Copyright ยฉ 2014 Donovan Parks, Connor Skennerton, Michael Imelfort. See LICENSE for further details.

More Repositories

1

GTDBTk

GTDB-Tk: a toolkit for assigning objective taxonomic classifications to bacterial and archaeal genomes.
Python
461
star
2

gtdb-species-clusters

Toolkit for establishing, updating, and validating GTDB species clusters
Python
20
star
3

BamM

Metagenomics-focused BAM file manipulation
C
15
star
4

GTDBNCBI

The GTDB provides the software infrastructure for working with a large collection of genomic resources. The major goal of this initiative is to provide a phylogenetically consistent taxonomy for archaea and bacteria.
Python
9
star
5

gtdb-itol-decorate

Creates iTOL files for tree decoration, given a set of GTDB genomes.
Python
4
star
6

Mannotator

Microbial annotation pipeline - in progress
Perl
4
star
7

FrankenQIIME

Feel the Franken
Python
4
star
8

basespace_project_downloader

Download project fastq files from BaseSpace
Python
3
star
9

gtdb.ecogenomic.org

https://gtdb.ecogenomic.org/
Vue
2
star
10

ScaffoldM

scaffoldm
Python
2
star
11

ace-guix

GNU Guix package definitions for ACE or otherwise
Scheme
2
star
12

mingle

Australian Centre for Ecogenomics' genome tree database: Taxonomically annotated trees from HMMs and BLAST
Python
2
star
13

api.gtdb.ecogenomic.org

https://api.gtdb.ecogenomic.org/
Python
2
star
14

ace-cluster-orchestrator

A template for distributing jobs across the ACE cluster.
Python
1
star
15

gtdb-migration-tk

Toolkit for updating the GTDB to the next release and test data
Python
1
star
16

gtdb-release-tk

Toolkit for updating the GTDB to the next release and generating data files for the GTDB website.
Python
1
star
17

slamM

slamM: a hybrid metagenomic assembly pipeline
Python
1
star
18

pfam_search

pfam_scan meets hmmsearch.
Perl
1
star
19

bioscripts

A collection of scripts written by ACE members
Shell
1
star
20

PhylogeneticM

Feed the tree...
Python
1
star
21

hatchet

Tools used to split the GTDB-Tk reference tree into smaller sub trees
Python
1
star