

Integrate DNA-seq and RNA-seq data to identify mutations that are associated with regulatory effects on gene expression.


RegTools

Tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context.

Features

  • Identify evidence for aberrant splicing in RNA reads near a list of variants.
  • Extract exon-exon junctions from an RNA-seq BAM file.
  • Annotate exon-exon junctions with information from a known transcriptome.
  • Annotate variants with splice-region annotations (the definition of this region is configurable).

Hardware requirements

RegTools requires only a standard computer with enough RAM to support the in-memory operations.

Software requirements

OS Requirements

This package is supported for macOS and Linux. The package has been tested on the following systems:

macOS: macOS 10.12 (Sierra), macOS 10.13 (High Sierra), macOS 10.14 (Mojave), macOS 10.15 (Catalina), macOS 11 (Big Sur), macOS 12 (Monterey)

Linux: Ubuntu 16.04, Ubuntu 18.04, Ubuntu 20.04

Installation

Clone and install regtools by running the following:

    git clone https://github.com/griffithlab/regtools
    cd regtools/
    mkdir build
    cd build/
    cmake ..
    make

Installation should take 1-5 minutes.

For convenience, we also maintain a Docker image, available at https://hub.docker.com/r/griffithlab/regtools/

Usage:

    regtools --help
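
If you drive RegTools from a script, it can help to check that the executable is reachable before running anything. The following is a minimal Python sketch, not part of RegTools itself. The helper names find_regtools and regtools_help are hypothetical, and the fallback location regtools/build/regtools is an assumption based on the build steps above; adjust it if your build places the binary elsewhere.

```python
import shutil
import subprocess
from pathlib import Path


def find_regtools(name="regtools", build_dir="regtools/build"):
    """Return a path to the executable, or None if it cannot be found.

    Looks on PATH first, then in build_dir (assumed build location).
    """
    found = shutil.which(name)
    if found:
        return found
    candidate = Path(build_dir) / name
    return str(candidate) if candidate.is_file() else None


def regtools_help(executable):
    """Run `<executable> --help` and return everything it printed."""
    result = subprocess.run(
        [executable, "--help"], capture_output=True, text=True
    )
    # Some tools print usage to stderr, so both streams are returned.
    return result.stdout + result.stderr


if __name__ == "__main__":
    exe = find_regtools()
    if exe is None:
        print("regtools not found on PATH or in regtools/build")
    else:
        print(regtools_help(exe))
```

The script only reports what it finds; it does not install or modify anything.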

To test your installation, you can use the test data included under test_data.

Here's an example command using that data along with the example output. This should run in under a minute.

    regtools cis-splice-effects identify -s RF -e 10 -i 10 test_data/HCC1395_chr22.vcf.gz test_data/HCC1395_tumor.bam test_data/chr22_with_ERCC92.fa test_data/chr22_with_ERCC92.gtf

    Variant 22	42129188	42129189	-1
    Variant region is 22:42128784-42130813

    chrom	start	end	name	score	strand	splice_site	acceptors_skipped	exons_skipped	donors_skipped	anchor	known_donor	known_acceptor	known_junction	gene_names	gene_ids	transcripts	variant_info
    position = 22:42125408-42125409
    position = 22:42130565-42130566
    22	42125407	42130567	JUNC00000001	4	+	GT-AG	0	0	0	D	1	0	0	NDUFA6-AS1	ENSG00000237037	ENST00000439129	22:42129188-42129189
    position = 22:42128881-42128882
    position = 22:42129670-42129671
    22	42128880	42129672	JUNC00000002	3	+	GT-AG	0	0	0	N	0	0	0	NA	NA	NA	22:42129188-42129189
    position = 22:42128944-42128945
    position = 22:42129031-42129032
    22	42128943	42129033	JUNC00000003	4	-	GT-GG	1	0	0	D	1	0	0	CYP2D6	ENSG00000100197	ENST00000360608,ENST00000389970,ENST00000488442	22:42129188-42129189
    position = 22:42129783-42129784
    position = 22:42143453-42143454
    22	42129782	42143455	JUNC00000004	2	+	GC-AG	9	8	9	N	0	0	0	NA	NA	NA	22:42129188-42129189
    position = 22:42130224-42130225
    position = 22:42130565-42130566
    22	42130223	42130567	JUNC00000005	2	+	GT-AG	0	0	0	N	0	0	0	NA	NA	NA	22:42129188-42129189
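
The output above mixes a tab-separated junction table with diagnostic lines (those starting with "Variant" or "position ="). If you want to post-process it, a small parser along these lines may be a useful starting point. This is a sketch that assumes the layout shown in the sample, with the column names taken from its header line; parse_junctions is a hypothetical helper and not part of RegTools.

```python
def parse_junctions(text):
    """Return one dict per junction row, keyed by the header's column names."""
    header = None
    rows = []
    for line in text.splitlines():
        # Skip blank lines and the diagnostic lines interleaved in the sample.
        if not line.strip() or line.startswith(("Variant", "position =")):
            continue
        fields = line.split("\t")
        if fields[0] == "chrom":
            header = fields  # the column-name line
            continue
        if header is None or len(fields) != len(header):
            continue  # not a junction row in the expected layout
        row = dict(zip(header, fields))
        for key in ("start", "end", "score"):
            row[key] = int(row[key])
        rows.append(row)
    return rows


# Two rows copied from the sample output, rebuilt with explicit tabs.
HEADER = (
    "chrom start end name score strand splice_site acceptors_skipped "
    "exons_skipped donors_skipped anchor known_donor known_acceptor "
    "known_junction gene_names gene_ids transcripts variant_info"
).split()
ROW_1 = (
    "22 42125407 42130567 JUNC00000001 4 + GT-AG 0 0 0 D 1 0 0 "
    "NDUFA6-AS1 ENSG00000237037 ENST00000439129 22:42129188-42129189"
).split()
ROW_2 = (
    "22 42128880 42129672 JUNC00000002 3 + GT-AG 0 0 0 N 0 0 0 "
    "NA NA NA 22:42129188-42129189"
).split()
SAMPLE = "\n".join([
    "Variant region is 22:42128784-42130813",
    "\t".join(HEADER),
    "position = 22:42125408-42125409",
    "\t".join(ROW_1),
    "position = 22:42128881-42128882",
    "\t".join(ROW_2),
])

junctions = parse_junctions(SAMPLE)
# Keep junctions whose score column is at least 4.
well_supported = [j["name"] for j in junctions if j["score"] >= 4]
print(well_supported)
```

Each row becomes a plain dictionary, so filtering on any column (here the score) needs no extra dependencies.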

Contribute

  • Issue Tracker: github.com/griffithlab/regtools/issues
  • Source Code: github.com/griffithlab/regtools

Support

If you have issues using the project, please let us know. We have a mailing list at [email protected] and a forum at https://groups.google.com/forum/#!forum/regtools. GitHub issues are another way to contact the project about potential bugs.

Documentation

The documentation for the project is hosted on Read the Docs.

If you would like to build the documentation locally, install mkdocs first; running pip install mkdocs --user should work on most machines. Then run mkdocs serve from within the regtools base directory.

Acknowledgements

RegTools uses several open-source libraries. We would like to thank the developers of htslib and bedtools. We would also like to thank Travis Abbott for useful comments and code.

License

The project is licensed under the MIT license.

Stable release with DOI

DOI
