InterVar

A bioinformatics software tool for clinical interpretation of genetic variants by the ACMG-AMP 2015 guidelines

SYNOPSIS

Intervar.py [options]

WHAT DOES IT DO

InterVar is a Python script that interprets the clinical significance of genetic variants.

PREREQUISITE

  1. You need to install Python >= 2.6.6.
  2. You need to install ANNOVAR version >= 2016-02-01.
  3. You need to download other files, such as mim2gene.txt, from OMIM.
  4. Please use updated files from OMIM (generated 2016-09 or later); outdated files will cause problems in InterVar.

OPTIONS

  • -h, --help
    show this help message and exit

  • --version
    show program's version number and exit

  • --config=config.ini
    Load your config file. The config file contains all options. If you use this option, you can ignore all the other options below.

  • -i INPUTFILE, --input=INPUTFILE
    input file of variants for analysis

  • --input_type=AVinput
    The input file type; it can be AVinput (ANNOVAR's format) or VCF

  • -o OUTPUTFILE, --output=OUTPUTFILE
    prefix of the output file (default: output)

  • -b BUILDVER, --buildver=BUILDVER
    version of the reference genome: hg18, hg19 (default)

  • -t intervardb, --database_intervar=intervardb
    The database location/dir for the InterVar dataset files

  • -s your_evidence_file, --evidence_file=your_evidence_file
    This option specifies a user-supplied evidence file for each variant.

    How to add your own evidence for each variant:

    Prepare your own evidence file as tab-delimited text, one variant per line, in this format:

     Chr Pos Ref_allele Alt_allele  evidence_list

     1 123456 A G PM1=1;BS2=1;BP3=0;PS5=1;grade_PM1=1

    The codes for additional evidence should be PS5/PM7/PP6/BS5/BP8.

    To upgrade or downgrade a criterion, use the form grade_PS1=2, where 1 means Strong, 2 means Moderate, and 3 means Supporting.

  • --table_annovar=./table_annovar.pl
    The ANNOVAR Perl script table_annovar.pl

  • --convert2annovar=./convert2annovar.pl
    The ANNOVAR Perl script convert2annovar.pl

  • --annotate_variation=./annotate_variation.pl
    The ANNOVAR Perl script annotate_variation.pl

  • -d humandb, --database_locat=humandb
    The database location/dir for the ANNOVAR annotation datasets
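
The evidence file format described under the -s option can be read with a few lines of Python. The following is a minimal sketch, not InterVar's own parser: the names `EvidenceRecord` and `parse_evidence_line` are hypothetical, the sketch assumes Python 3, and it splits on any whitespace (which also covers tabs) on the assumption that no field contains spaces.

```python
from typing import Dict, NamedTuple


class EvidenceRecord(NamedTuple):
    """One line of a user evidence file (hypothetical helper type)."""
    chrom: str
    pos: int
    ref: str
    alt: str
    evidence: Dict[str, int]  # e.g. {"PM1": 1, "BS2": 1}
    grades: Dict[str, int]    # e.g. {"PM1": 1}, taken from grade_PM1=1


def parse_evidence_line(line: str) -> EvidenceRecord:
    """Parse 'Chr Pos Ref_allele Alt_allele evidence_list' into a record."""
    fields = line.split()  # tabs are whitespace, so tab-delimited input works
    if len(fields) != 5:
        raise ValueError("expected 5 fields, got %d" % len(fields))
    chrom, pos, ref, alt, evidence_list = fields

    evidence = {}
    grades = {}
    for item in evidence_list.split(";"):
        item = item.strip()
        if not item:
            continue  # tolerate a trailing ';'
        key, _, value = item.partition("=")
        if key.startswith("grade_"):
            # grade_X=N: 1 for Strong, 2 for Moderate, 3 for Supporting
            grades[key[len("grade_"):]] = int(value)
        else:
            evidence[key] = int(value)
    return EvidenceRecord(chrom, int(pos), ref, alt, evidence, grades)


record = parse_evidence_line("1\t123456\tA\tG\tPM1=1;BS2=1;BP3=0;PS5=1;grade_PM1=1")
print(record.evidence)
print(record.grades)
```

Keeping the `grade_` entries in a separate dictionary makes it easy to tell which criteria were switched on or off and which had their strength changed.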

EXAMPLE

    ./InterVar.py -c config.ini  # Run the examples in config.ini
    ./InterVar.py  -b hg19 -i your_input  --input_type=VCF  -o your_output
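
When InterVar is called from another Python program, the second command above can be assembled as an argument list. This is a minimal sketch that uses only the options documented in this README; the helper names `build_intervar_command` and `run_intervar` are hypothetical, and the script path `./InterVar.py` is assumed to match your installation.

```python
import subprocess


def build_intervar_command(input_file, output_prefix, buildver="hg19",
                           input_type="VCF", script="./InterVar.py"):
    """Return the argument list for one InterVar run (hypothetical helper)."""
    return [
        script,
        "-b", buildver,                # reference genome version
        "-i", input_file,              # input file of variants
        "--input_type=" + input_type,  # AVinput or VCF
        "-o", output_prefix,           # prefix of the output file
    ]


def run_intervar(input_file, output_prefix, **kwargs):
    """Run InterVar and raise CalledProcessError on a non-zero exit code."""
    subprocess.check_call(build_intervar_command(input_file, output_prefix, **kwargs))


cmd = build_intervar_command("your_input", "your_output")
print(" ".join(cmd))
# prints: ./InterVar.py -b hg19 -i your_input --input_type=VCF -o your_output
```

Passing a list rather than a single shell string avoids quoting problems when file names contain spaces.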

HOW DOES IT WORK

InterVar takes either pre-annotated files, or unannotated input files in VCF format or ANNOVAR input format, where each line corresponds to one genetic variant; if the input files are unannotated, InterVar will call ANNOVAR to generate the necessary annotations. InterVar runs in two main steps: 1) automatic interpretation of 28 evidence codes; and 2) manual adjustment by the user to re-interpret the clinical significance. However, users can specify their own evidence codes and import them into InterVar with the argument "--evidence_file=your_evidence_file", so that a single step is sufficient to generate the final results. In the output, based on all 28 evidence codes that are either automatically generated or supplied by the user, each variant will be classified as "pathogenic", "likely pathogenic", "uncertain significance", "likely benign" or "benign" by the rules specified in the ACMG 2015 guidelines.
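
The final classification step can be illustrated with the combining rules from the ACMG 2015 guidelines (Richards et al., 2015). The sketch below is an illustration of those published rules, not InterVar's actual code: the function name `classify` and its arguments are hypothetical, and it takes the number of criteria met at each strength level (very strong, strong, moderate and supporting for pathogenic evidence; stand-alone, strong and supporting for benign evidence). It assumes Python 3 and, as a simplification, treats "1 criterion" in the guideline as "at least 1".

```python
def classify(pvs, ps, pm, pp, ba, bs, bp):
    """Combine counts of met criteria into one of five classes.

    pvs/ps/pm/pp: pathogenic very strong / strong / moderate / supporting
    ba/bs/bp:     benign stand-alone / strong / supporting
    """
    pathogenic = (
        (pvs >= 1 and (ps >= 1 or pm >= 2 or (pm >= 1 and pp >= 1) or pp >= 2))
        or ps >= 2
        or (ps >= 1 and (pm >= 3 or (pm >= 2 and pp >= 2) or (pm >= 1 and pp >= 4)))
    )
    likely_pathogenic = (
        (pvs >= 1 and pm >= 1)
        or (ps >= 1 and pm >= 1)
        or (ps >= 1 and pp >= 2)
        or pm >= 3
        or (pm >= 2 and pp >= 2)
        or (pm >= 1 and pp >= 4)
    )
    benign = ba >= 1 or bs >= 2
    likely_benign = (bs >= 1 and bp >= 1) or bp >= 2

    # Contradictory evidence on both sides is reported as uncertain.
    if (pathogenic or likely_pathogenic) and (benign or likely_benign):
        return "uncertain significance"
    if pathogenic:
        return "pathogenic"
    if likely_pathogenic:
        return "likely pathogenic"
    if benign:
        return "benign"
    if likely_benign:
        return "likely benign"
    return "uncertain significance"


print(classify(pvs=1, ps=1, pm=0, pp=0, ba=0, bs=0, bp=0))  # pathogenic
print(classify(pvs=0, ps=0, pm=3, pp=0, ba=0, bs=0, bp=0))  # likely pathogenic
print(classify(pvs=0, ps=0, pm=1, pp=0, ba=0, bs=0, bp=0))  # uncertain significance
```

A criterion whose strength has been changed with a `grade_` entry would be counted at its adjusted level before calling a function like this one.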

We also developed a web server of InterVar called wInterVar, which can be accessed at http://wintervar.wglab.org. The user can directly input their missense variants in wInterVar by chromosomal position, by dbSNP identifier, or by gene name with nucleic acid change information. The wInterVar server will provide full details on the variants, including all the evidence codes for the variants. The user then has the ability to manually adjust these evidence codes and resubmit to the server to perform re-interpretation. Since all evidence codes for all possible non-synonymous variants have been pre-computed by us, the execution of wInterVar is very fast, typically less than 1 second to obtain the results. However, the wInterVar server cannot process other types of variants (such as indels), and the user will need to use InterVar instead.

Web server

wInterVar: http://wintervar.wglab.org

LICENSE

InterVar is free for non-commercial use, without warranty. Users need to obtain licenses for third-party resources such as OMIM and ANNOVAR themselves. Please contact the authors for commercial use.

REFERENCE

Quan Li and Kai Wang. InterVar: Clinical interpretation of genetic variants by ACMG-AMP 2015 guideline. The American Journal of Human Genetics 100(2):267-280, 2017, http://dx.doi.org/10.1016/j.ajhg.2017.01.004

The ACMG 2015 guideline: Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genetics in Medicine: official journal of the American College of Medical Genetics 17, 405-424 (2015).
