• This repository has been archived on 30/Jan/2023

Repository Details

Intro to ChIPseq using HPC

NOTE: The materials in this repository are no longer actively maintained. More recent content can be found at: https://hbctraining.github.io/Intro-to-ChIPseq-flipped/

OLD - Introduction to ChIP-seq using high performance computing

Audience: Biologists
Computational Skills: Beginner/Intermediate
Prerequisites: None
Duration: 3-day workshop (~19.5 hours of trainer-led time)

Description

This repository has teaching materials for a 3-day Introduction to ChIP-sequencing data analysis workshop. The workshop teaches basic computational skills that enable the effective use of a high-performance computing environment to implement a ChIP-seq data analysis workflow. It includes an introduction to shell (bash) and shell scripting. In addition to running the ChIP-seq workflow from FASTQ files to peak calls and nearest gene annotations, the workshop covers best practice guidelines for ChIP-seq experimental design, data organization/management, and quality control.

These materials were developed for a trainer-led workshop, but are also amenable to self-guided learning.

Learning Objectives

  1. Understand the necessity for, and use of, the command line interface (bash) and HPC for analyzing high-throughput sequencing data.
  2. Understand best practices for designing a ChIP-seq experiment and analyzing the resulting data.

Lessons

Click here for links to lessons and the suggested schedule

Dataset

Installation Requirements

Download the most recent versions of R and RStudio for your laptop:

NOTE: When installing the following packages, if you are asked to select (a/s/n) or (y/n), please select "a" or "y" as applicable.

(1) Install the below packages on your laptop from CRAN. You DO NOT have to go to the CRAN webpage; you can use the following function to install them:

install.packages("BiocManager")
install.packages("tidyverse")

Note that these package names are case sensitive!

(2) Install the below packages from Bioconductor. Load BiocManager, then run BiocManager's install() function once for each of the 8 packages listed below:

library(BiocManager)
install("insert_first_package_name_in_quotations")
install("insert_second_package_name_in_quotations")
# ... and so on for the remaining packages

Note that these package names are case sensitive!

ChIPQC
ChIPseeker
DiffBind
clusterProfiler
AnnotationDbi
TxDb.Hsapiens.UCSC.hg19.knownGene
EnsDb.Hsapiens.v75
org.Hs.eg.db

NOTE: The packages used for the annotations associated with genes (here we are using TxDb.Hsapiens.UCSC.hg19.knownGene and EnsDb.Hsapiens.v75) will change based on the organism (e.g. if studying mouse, you would need to install and load TxDb.Mmusculus.UCSC.mm10.knownGene). The list of different organism packages is given here.
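
As a minimal sketch of step (2), the repeated install() calls could be collapsed into a single call, since BiocManager::install() accepts a character vector of package names. The helper name not_installed() is hypothetical and not part of the workshop materials. The install is guarded by interactive() as an assumption, so that nothing is downloaded when the script is merely sourced.

```r
# Packages listed in step (2); names are case sensitive.
bioc_packages <- c(
  "ChIPQC", "ChIPseeker", "DiffBind", "clusterProfiler", "AnnotationDbi",
  "TxDb.Hsapiens.UCSC.hg19.knownGene", "EnsDb.Hsapiens.v75", "org.Hs.eg.db"
)

# Hypothetical helper: return the subset of `pkgs` that has no installed copy.
# system.file() returns "" when a package is not found in the library paths.
not_installed <- function(pkgs) {
  found <- vapply(pkgs, function(p) nzchar(system.file(package = p)), logical(1))
  pkgs[!found]
}

to_install <- not_installed(bioc_packages)

# Install only in an interactive session, and only if BiocManager is available.
if (interactive() && length(to_install) > 0 &&
    requireNamespace("BiocManager", quietly = TRUE)) {
  BiocManager::install(to_install)
}
```

Checking for missing packages first means that re-running the script skips anything already installed, which may save time on a slow connection.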

(3) Finally, please check that all the packages were installed successfully by loading them one at a time using the library() function.

library(tidyverse)
library(ChIPQC)
library(ChIPseeker)
library(DiffBind)
library(clusterProfiler)
library(AnnotationDbi)
library(TxDb.Hsapiens.UCSC.hg19.knownGene)
library(EnsDb.Hsapiens.v75)
library(org.Hs.eg.db)
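
The check in step (3) could also be scripted, as a rough sketch. The helper can_load() below is a hypothetical name, and it uses requireNamespace() rather than library(): this loads each package without attaching it, and returns FALSE instead of stopping when a package is missing. The check is wrapped in interactive() as an assumption, because loading the Bioconductor packages can take a while.

```r
# Hypothetical helper: try to load each package's namespace and report TRUE/FALSE.
can_load <- function(pkgs) {
  vapply(
    pkgs,
    function(p) suppressWarnings(requireNamespace(p, quietly = TRUE)),
    logical(1)
  )
}

# Packages loaded in step (3).
workshop_packages <- c(
  "tidyverse", "ChIPQC", "ChIPseeker", "DiffBind", "clusterProfiler",
  "AnnotationDbi", "TxDb.Hsapiens.UCSC.hg19.knownGene", "EnsDb.Hsapiens.v75"
)

if (interactive()) {
  status <- can_load(workshop_packages)
  failed <- names(status)[!status]
  if (length(failed) > 0) {
    message("Could not load: ", paste(failed, collapse = ", "))
  } else {
    message("All packages loaded successfully.")
  }
}
```

Any package reported as failing can then be loaded on its own with library() to see the full error message.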

(4) Once all packages have been loaded, run sessionInfo().

sessionInfo()
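
If the sessionInfo() report needs to be kept or shared, one option is to write it to a text file. This is only a sketch: the file name and the use of tempdir() are assumptions made for illustration, and the workshop itself only asks you to run sessionInfo().

```r
# Capture the printed sessionInfo() report as a character vector, one element per line.
session_lines <- capture.output(sessionInfo())

# Hypothetical output location; tempdir() keeps the sketch free of side effects
# in your working directory. Replace it with a project folder if preferred.
out_file <- file.path(tempdir(), "sessionInfo.txt")
writeLines(session_lines, out_file)

message("Session information written to: ", out_file)
```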

These materials have been developed by members of the teaching team at the Harvard Chan Bioinformatics Core (HBC). These are open access materials distributed under the terms of the Creative Commons Attribution license (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
