• Stars
    star
    231
  • Rank 173,434 (Top 4 %)
  • Language
    Jupyter Notebook
  • License
    Apache License 2.0
  • Created over 5 years ago
  • Updated 3 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

GCP for Bioinformatics Researchers

Google Cloud Platform (GCP) for Bioinformatics

This repository shows how to use Google Cloud Platform (GCP) public cloud services to scale sets of bioinformatics data analysis tasks. This Repo uses cloud best practices for GCP. All examples use genomic sample (input) data, tools and pipelines. Use cases included here as examples are called by any and all of the following terms:

  • genomic-scale data workflows or pipelines
  • bioinformatics primary, secondary or tertiary analysis
  • distributed cloud-based batch jobs

This content is intended for researchers - in particular, this guide is for those who are NEW to working with GCP. You have a number of options on how to use the materials provided in this course. A summary is shown below left.

This Repo includes content you can read, watch or run:

  • πŸ“— READ - one page of this Repo (MD page)
  • πŸ“Ί WATCH - linked YouTube screencasts
  • πŸ“™ RUN - Jupyter Notebook example
  • :octocat: TRY - linked GitHub Repos
  • πŸ“˜ EXPAND - linked (external) resources
  • πŸ” SCAN - search a list in this Repo

πŸ“Ί Click below to WATCH 'Lynn's Welcome Video' (4 min) on YouTube

Welcome to GCP for Bioinformatics


Why would I choose to use a public cloud vendor for bioinformatics?

⭐️ SAVE MONEY run (and pay for) scalable analysis jobs only when you need to run them
⭐️ SAVE TIME use vendor-managed infrastructure & best-practice patterns for fast repeatable research
πŸ“— READ the FAQ for GCP bioinformatics for this Repo
πŸ“• READ Nature article: "Cloud computing for genomic data analysis and collaboration"
πŸ“— READ the top 4 most common use cases for using the public cloud for bioinformatics researchers

Bioinformatics wanting more advanced GCP content?

If you would like to learn more advanced concepts (including script examples and patterns) about working with Google Cloud Platform, see my Repo gcp-essentials --> link


New to Bioinformatics?

If you are NEW to bioinformatics and have a computational background...

  • :octocat: REVIEW my bioinformatics concepts tools and terms
    • Designed for experienced cloud practioners who are NEW to Bioinformatics
    • The 'student notes repo' is named Team Teri - link to 'who is Teri?'
    • This Repo includes links to explanations of bioinformatics concepts, tools and platforms - link

Contibutions

We love contributions! See this short style guide when making pull requests to this repo.


More Repositories

1

learning-cloud

Courses, sample code, articles & screencasts - AWS, Azure, & GCP
Jupyter Notebook
453
star
2

learn-snowflakedb

Resources to work with SnowflakeDB
287
star
3

gcp-essentials

Sample code and notes for my GCP courses on LinkedIn Learning
Jupyter Notebook
235
star
4

learning-hadoop-and-spark

Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
HTML
181
star
5

Hello-AWS-Data-Services

AWS Data/MLServices sample code & notes for my LinkedIn Learning courses
Jupyter Notebook
177
star
6

learning-quantum

Study resources for learning quantum computing
Jupyter Notebook
141
star
7

aws-for-bioinformatics

AWS for Bioinformatics Researchers
Jupyter Notebook
114
star
8

lynnlangit

Lynn Langit profile
Julia
65
star
9

great-github-profiles

Companion Repo to LinkedIn Learning course 'Great GitHub Profiles'
HTML
60
star
10

TeamTeri

Bioinformatics on GCP, AWS or Azure
Shell
52
star
11

aws-cost-control

Companion Repository to Linked In Learning Course "AWS Cost Control"
46
star
12

gcp-ml

Google Cloud Platform Machine Learning Samples
Jupyter Notebook
40
star
13

Spark-Scala-EKS

Spark Scala docker container sample for AWS testing - EKS & S3
HCL
23
star
14

learning-data-mesh

Repo with resources for learning Data Mesh
15
star
15

serverless-architecture

Companion to my Linked In Learning 'Serverless Architecture' course
14
star
16

RedisLabsDemo

demo of using RedisLabs RedisCloud as a user caching store for a node.js app with SQL Azure
C#
13
star
17

learning-ethical-ai

Resources to learn how to implement ethical AI
Python
12
star
18

AdvancedPythonForBio

Work from the book 'Advanced Python for Biologists'
Jupyter Notebook
9
star
19

learning-alibaba-cloud

Companion Repo for LinkedIn Learning Course
TSQL
9
star
20

julia-linear-algebra

study notes and sample code for "Learning Linear Algebra with Julia"
Jupyter Notebook
8
star
21

AWS-Redshift-Matillion-Workshop

Scripts, Instructions and Materials for AWS Redshift and Matillion ETL workshop
Shell
8
star
22

Java-Refactoring-Workbook

Practing Using Excercises from 'Refactoring Workbook'
Java
7
star
23

sample-data

Small datasets and files in many formats, used for testing cloud SQL, NoSQL or Machine Learning Services
PowerShell
6
star
24

learning-codespaces

Index of content to learn to use GitHub Codespaces
4
star
25

learning-nosql

Companion repository to Linked In Learning course 'Cloud NoSQL for SQL Pros'
4
star
26

learning-github

Demo Repo for Learning GitHub
3
star
27

DnBBusinessVerificationAPISample

Sample code for YouTube demo of Dunn And Bradstreet Business Verification API in the Windows Azure Marketplace
C#
3
star
28

AWSDataWarehouse

Demo of AWS Redshift and partners
Shell
3
star
29

consulting

Lynn Langit
CSS
2
star
30

architects-who-code

Architects Who Code
Python
2
star
31

hello-cloud-run

Demo of easy button for CloudRun
Dockerfile
2
star
32

github-slideshow

A robot powered training repository πŸ€–
Ruby
2
star
33

learn-copilot-workspace

Demo Repo for Copilot Workspace
Java
2
star
34

Intro-to-Google-Cloud-Java-Code-Demos

Intro to Google Cloud for Developers YouTube screencast series - code demos
CSS
1
star
35

FizzBuzz-ML

sample of Fizz Buzz via machine learning model
Python
1
star
36

GCP-Big-Data-Setup

dev environment setup script
Shell
1
star
37

appengine-try-python-flask

Sample for GAE using Python
Python
1
star
38

blastn

Demo of blastn tool for bioinformatics
Jupyter Notebook
1
star
39

ballerina-testing

unit tests for Ballerina Langauge
Ballerina
1
star
40

docker-for-biologists

Resources for using docker for biologists
Dockerfile
1
star