NCBI-Hackathons (Archive) (@NCBI-Hackathons)

Top repositories

1

EDirectCookbook

159
star
2

TheHumanPangenome

A Strategy for Building and Using a Human Reference Pangenome
Jupyter Notebook
70
star
3

Community_Software_Tools_for_NGS

61
star
4

NovoGraph

NovoGraph: building whole genome graphs from long-read-based de novo assemblies
Perl
45
star
5

rnaseqview

RNA-seq Viewer Team at the NCBI-assisted Boston Genomics Hackathon
JavaScript
36
star
6

ConsensusML

Machine Learning to Detect Cancer Biomarkers from RNAseq Data
HTML
33
star
7

VirusDiscoveryProject

Software, architecture, and data index design for the 2018/2019 Virus Discovery Project
Jupyter Notebook
31
star
8

drugdisco

A high throughput automated drug discovery pipeline.
JavaScript
29
star
9

Pharmacogenomics_Prediction_Pipeline_P3

R
28
star
10

NCBIComputationalCookbook

Jupyter notebooks to more effectively leverage computational resources at NCBI.
Jupyter Notebook
28
star
11

ViruSpy

A pipeline for viral identification from metagenomic samples
Shell
26
star
12

SPeW

Automatic Packaging and Distribution of Bioinformatics Pipelines
Python
26
star
13

MR_BACOn

Mendelian Randomization with Biomarker Associations for Causality with Outcomes
R
23
star
14

Machine_Learning_Immunogenicity

This is a repo for the Machine Learning Immunogenicity Team in the August 2016 NCBI Hackathon
Jupyter Notebook
23
star
15

LabPype

Framework for Creating Pipeline Software
Python
21
star
16

MetagenomicAntibioticResistance

NastyBugs: a simple method for extracting antimicrobial resistance information from metagenomes
Shell
20
star
17

CTEligible

Use machine learning to find patterns of similar eligibility protocol criteria for clinical trials
PowerShell
18
star
18

RNA-Seq-in-the-Cloud

An Easy to Use Analysis System for All Human Public bulk RNAseq Data!
Jupyter Notebook
18
star
19

Design-of-ICD-9-to-10-conversion-function-for-the-R-package-icd

Develop a function to be incorporated into the R package 'icd' that will convert International Classification of Diseases codes from Ninth to Tenth revisions
R
18
star
20

RNA_mapping

Python
17
star
21

Network_SNPs

A framework for network analysis and display of SNPs
Python
17
star
22

SRA_Tinder

Find hot data sets in your area (of research)!
Jupyter Notebook
16
star
23

ncbi-cloud-tutorials

Tutorial content for NCBI cloud data and computing
Jupyter Notebook
16
star
24

Master_gff3_parser

Convert sequence IDs between UCSC/RefSeq/GenBank
Python
16
star
25

GoodDoc

A Template for Clear and Simple Documentation of Bioinformatics Code
15
star
26

seqacademy

Self-guided educational workshop for ChIP-Seq and RNA-Seq
HTML
14
star
27

ATACFlow

An ATAC-seq pipeline wrapped in NextFlow that can be run by Jupyter
Jupyter Notebook
14
star
28

svcompare

HTML
14
star
29

Structural_Variant_Comparison

SV
Python
14
star
30

Semantic-search-log-analysis-pipeline

Classify web visitor queries so you can chart, and respond to, trends in information seeking
JavaScript
14
star
31

PubRunner

Framework for running text mining tools on latest publications. Main page at:
JavaScript
14
star
32

deVoReaNN

A virtual reality environment for physically assembling deep learning models to solve data science problems.
C#
12
star
33

seqr

Java
12
star
34

HLAClustRView

R package specialized in HLA typing clustering and visualization based on specific similarity metrics
R
12
star
35

phenotypeXpression

Subclassification of disease states based on the intersection of literature and expression
Python
12
star
36

hackathon_v001_metagenomics

Metagenomics Pipeline Repository for January, 2015 NCBI/ADDS Hackathon at NIH
Shell
11
star
37

FlowBio

A fast, easy way to present complex bioinformatics pipelines to biologists
Shell
11
star
38

GeneExpressionAging

Gene expression viewer template
HTML
11
star
39

The-Broad-Institute-Single-Cell-RNA-Seq-Data-Set

Visualize cancer genomes with FAIR single-cell RNA-seq data
Python
11
star
40

Biological-structure-segmentation-in-microscopy-images-using-deep-learning

Jupyter Notebook
10
star
41

TCGA_dbGaP

Python
10
star
42

Virus_Detection_SRA

Perl
10
star
43

PrecisionMedicineToolkit

Search public databases for given genotypic information
Python
10
star
44

Bringing-the-Power-of-Synthetic-Data-Generation-to-the-Masses

We aim to make it easier for biomedical researchers to access and customize synthetic sequence data for the purpose of sharing and testing analysis methods as well as training and collaboration
Jupyter Notebook
10
star
45

SimpleGeneExpression

Programs to quantify expression of transcripts from public datasets
R
9
star
46

Metabolomics-Data-Portal

R Shiny application for the visualization and analysis of untargeted metabolomics datasets.
R
9
star
47

deSRA

An automated protocol to extract variation or expression from public NGS datasets
JavaScript
9
star
48

chervil

A detection algorithm for expression features that correspond to previous viral infection
Shell
9
star
49

Structural_Variants_CSHL

Perl
8
star
50

Kipoi-GWAS

Jupyter Notebook
8
star
51

VirusFriends

VirusFriends: discover viral sequences in the NCBI Sequence Read Archive!
Python
8
star
52

Got_Plasmid

Retrieve and visualize plasmid sequences from SRA and Next Generation Sequencing data.
R
8
star
53

Mutation_burden

Building a pipeline to assess effects of mutation burden
R
8
star
54

Epigenomics_CWL

SCREW: A Reproducible Workflow for Single-Cell Epigenomics
R
8
star
55

PhenVar

Python
8
star
56

ClusterDuck

Disease Clustering from Literature Based on Minimal Training Data
Python
7
star
57

AssesSV

HTML
7
star
58

Code_in_PubMed_Abstracts

Python
7
star
59

GeneHummus

An Automated Pipeline to Classify Gene Families based on Protein Domain Organization using Auxin Response Factors in Legumes as an Example
R
7
star
60

Clustering-autism-phenotypes-by-automated-text-analysis

A versatile tool to classify diseases using cluster analysis of published phenotypic data
Python
7
star
61

Hidden-Figures

A pipeline for inferring gender for acknowledged individuals in scientific literature on a massive scale
Jupyter Notebook
7
star
62

Cancer_Epitopes_CSHL

A pipeline to approximate the immunogenicity of peptides resulting from cancer mutations based on structure and other factors.
HTML
7
star
63

HASSL_Homogeneous_Analysis_of_SRA_rnaSequencing_Libraries

Python
7
star
64

PubCode

An app platform for CLI Apps that engage NCBI/NLM Data and Services
HTML
7
star
65

Virulence_Factor_Characterization

Virulence Factor Characterization in Metagenomes
Jupyter Notebook
7
star
66

OnlineAdapterDatabase

Linking publicly deposited data to sequencing adapters.
Python
7
star
67

CapNetProtStruct

Capsule Networks for improving protein secondary structure prediction accuracy
Python
7
star
68

NCBI_Jupyter

A variety of NCBI Computational Tools Distributed as Jupyter Notebooks
Jupyter Notebook
6
star
69

PSST

Polygenic SNP Search Tool
Python
6
star
70

EndoVir

Discovery of Novel Endogenous Viruses
Python
6
star
71

UPWARD

UPWARD: Uniting People Working Against Rare Diseases
PHP
6
star
72

clint

Linking clinical questions with fMRI research literature
Jupyter Notebook
6
star
73

Tumor_sim

Simulation of Tumor Genomes -- Initiated at the 2017 NYGC-NCBI Hackathon
Python
6
star
74

hackathon_v001_rnaseq

RNAseq Pipeline Repository for January, 2015 NCBI/ADDS Hackathon at NIH
Python
6
star
75

Run_an_NCBI-style_hackathon

Collaborative Computational Development, or, How to Run an NCBI-Style Hackathon
6
star
76

SRA2R

SRA2R, a package to import SRA data directly into R
HTML
6
star
77

TCRecePy

A Python tool that uses machine learning to identify cancer-targeting T cells
Python
6
star
78

Awesome_enhancer_promoter_dbs

A semi-curated list of enhancer and promoter databases. If you know of others, please put them in issues!
6
star
79

CakeCell

Segmenting cells (and other objects!) in microscopy images via neural networks.
Python
6
star
80

ContainerInception

Gerber: Generalized Easy Reproducible Bioinformatics Environment wRapper
Python
5
star
81

Metadata_categorization

A crowdsourcing/expert curation platform for metadata categorization.
Web Ontology Language
5
star
82

DiseaseCluster

Disease prediction based on transcriptomics clustering
HTML
5
star
83

PuRSSE

Pubmed Research Search String Extraction (PuRSSE)
Jupyter Notebook
5
star
84

Complex_Phenogeno

Mapping complex genotypes to phenotypic subclusters
Python
5
star
85

TriFECTA

The goal of this project is to use natural language processing to extract exclusion and inclusion criteria from free-form text fields to match patients with clinical trials.
Shell
5
star
86

RetroSpotter

A computational pipeline to find Human Endogenous Retroviruses in RNA Seq Data
Jupyter Notebook
5
star
87

LNCPEP

A machine learning approach to detect micropeptides from noncoding RNAs
Python
5
star
88

Visualizing_MeSH_Term_Interaction_Over_Time

A tool to visually browse co-occurrence of MeSH terms in PubMed
JavaScript
5
star
89

HistoloMaps

The fastest gigascale image annotation system in the world!
JavaScript
5
star
90

ScrubSV

A QC pipeline for SV calls based on coverage and SNP calls
R
5
star
91

Ultrafast_Mapping_CSHL

Python
5
star
92

TraIN

HTML
5
star
93

Graph_Extraction

JavaScript
5
star
94

BarcSeek

BarcSeek: A Flexible Barcode Partitioning Tool for Demultiplexing Genomic Sequencing Data
Python
5
star
95

NCBI_August_Hackathon_Push_Button_Genomics_Solution

Python
5
star
96

HAQmap

Push button solution for setting up an NCBI-style Hackathon
HTML
5
star
97

MutPredMerge

Consolidation of tools in the MutPred Suite to work with VCF files
Python
5
star
98

PyClonal

Jupyter Notebooks to analyze T-cell Receptor Sequencing
Jupyter Notebook
5
star
99

Viral-VDAP

Viral VDAP: a viral alignment, variant discovery, and annotation pipeline launched at the NCBI-Hackathon 2019
Python
5
star
100

ContamFilter

Implements NCBI Contamination Screen Publicly in CWL
Python
5
star