• This repository has been archived on 28/Oct/2022
  • Stars
    star
    214
  • Rank 184,678 (Top 4 %)
  • Language
    Python
  • License
    Apache License 2.0
  • Created over 10 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Models and APIs for Genomic data. RETIRED 2018-01-24
Image

Schemas for the Data Working Group

Build Status Docs PyPi Release

The Global Alliance for Genomics and Health is an international coalition, formed to enable the sharing of genomic and clinical data.

REPOSITORY RETIREMENT NOTICE

The Genomics API was intended to act as a suite of integrated APIs each targeting a different aspect of exchanging genomic information between data providers and consumers. The Genomics API, together with the Reference Server and Compatibility test suite, was retired on January 24, 2018 and several of the sub-APIs are now being pursued under the auspices of new GA4GH Work Streams. We would like to thank those of you who have worked on this and look forward to ongoing contributions in the GA4GH Work Streams. You may still fork this repository if you wish to pursue developments. You may read the meeting minutes of the GA4GH Engineering Committee to learn more about the decision to retire the API. For additional questions or to get involved with ongoing technical work at GA4GH, please see the website at Global Alliance for Genomics and Health

Previous Readme Information continues below

Each area of genomics and health has a dedicated team working to define those standards.

Reads Task Team

The Reads Task Team is focused on standards for accessing genomic read data -- collections of primary data collected from sequencing machines.

The team will deliver:

  1. Data model. An abstract, mathematically complete and precise model of the data that is manipulated by the API. See the Proto directory for our in-progress work on defining v0.5 of the data model.
  2. API Specification. A human-readable document introducing and defining the API, accompanied by a formal specification. See the documentation page for the published v0.1 API.
  3. Reference Implementation. Open source working code demonstrating the API, ideally which can underpin real world working implementations.

Reference Variation Task Team

The Reference Variation Task Team is focused on standards for storing and accessing reference genome and variant data -- the results of analysis of primary data collected from sequencing machines.

Variant Annotation Task Team

The Variant Annotation Task Team is focused on developing standards for reporting variant annotation including results formats, ontologies and vocabularies for different classes of annotation so reporting is consistent and in a manner that facilitates benchmarking and evaluation.

File Formats Task Team

One small but essential part of this effort is the definition, standardisation, and improvement of basic file formats for sequence and variation data, and for associated infrastructure such as index formats.

These format specifications can be found in the samtools/hts-specs repository.

Metadata Task Team

The Metadata Task Team (MTT) concerns itself with data structures, attributes and values used to describe everything but the sequence. This includes metadata for individuals, samples, analyses, instrumentation a well as ontology representations for metadata. Naturally, the group interacts heavily with members of most other task teams and working groups.

The relevant place for finding out about accepted metadata formats is the *metadata-integration branch*

How to build

See INSTALL.rst for instructions on how to build the schemas and their documentation.

How to contribute changes

See the CONTRIBUTING.rst document.

License

See the LICENSE

More Repositories

1

benchmarking-tools

Repository for the GA4GH Benchmarking Team work developing standardized benchmarking methods for germline small variant calls
HTML
188
star
2

ga4gh-server

Reference implementation of the APIs defined in ga4gh-schemas. RETIRED 2018-01-24
Python
96
star
3

workflow-execution-service-schemas

The WES API is a standard way to run and manage portable workflows.
82
star
4

task-execution-schemas

81
star
5

vrs

Extensible specification for representing and uniquely identifying biological sequence variation
Jupyter Notebook
80
star
6

data-repository-service-schemas

A repository for the schemas used for the Data Repository Service.
60
star
7

vrs-python

GA4GH Variation Representation Python Implementation
Python
51
star
8

mme-apis

Documentation for the MatchmakerExchange APIs
HTML
34
star
9

tool-registry-service-schemas

APIs for discovering genomics tools, their metadata and their containers
Shell
30
star
10

data-security

This is the repository for the GA4GH Data Security Work Stream, containing AAI and DSIP.
Python
22
star
11

va-spec

An information model for representing variant annotations.
Jupyter Notebook
16
star
12

htsget

Python client for GA4GH htsget protocol
Python
15
star
13

wiki

a place for GA4GH-related wiki pages
15
star
14

refget

GA4GH Refget specifications docs
HTML
14
star
15

quality-control-wgs

Home for the GA4GH Quality Control of Whole Genome Sequencing metrics and reference implementations
Jupyter Notebook
12
star
16

pedigree

Repository for the family history/pedigree project
11
star
17

fasp-scripts

Jupyter Notebook
11
star
18

htsget-refserver

Reference server implementation for the GA4GH HTSget API standard.
Go
11
star
19

task-execution-server

Go
11
star
20

compliance

Compliance test suite for the APIs defined in ga4gh-schemas. RETIRED 2018-01-24
Java
10
star
21

metadata-team

9
star
22

cloud-interop-testing

Interoperable execution of workflows using GA4GH APIs
Python
9
star
23

TASC

TASC aids the harmonisation of aspects of GA4GH's various products that would otherwise prevent different products from being used together conveniently.
8
star
24

gh-openapi-docs

Package for building OpenAPI schema docs and UI for gh-pages
JavaScript
7
star
25

large-scale-genomics-wiki

Homepage for the Large Scale Genomics Work Stream
7
star
26

refget-compliance-suite

Repository for the Google Summer of Code for the Reference Retrieval API
Python
7
star
27

ga4gh-drs-client

Retrieve omics data and metadata from DRS-compliant web services
Python
6
star
28

ga4gh-client

Client implementation for accessing GA4GH APIs
Python
6
star
29

human-pangenome-project

Human Pangenome Project
5
star
30

Get-Started-with-GA4GH-APIs

ISMB 2022 tutorial on FASP and Starter Kit
Jupyter Notebook
5
star
31

ga4gh-registry

Implementation registry service for cataloguing implementations of GA4GH standards
Java
5
star
32

pedigree_family_history_terminology

Java
5
star
33

ga4gh-starter-kit-wes

Starter Kit server implementation of the GA4GH Workflow Execution Service (WES) specification
Java
5
star
34

gastore

Sample code to inform a discussion around content-addressable storage
Scala
5
star
35

tool-registry-validator

Used to validate APIs that implement https://github.com/ga4gh/tool-registry-schemas
Python
4
star
36

ga4gh-starter-kit-drs

Open, extensible server implementation of the GA4GH Data Repository Service (DRS) specification
Java
4
star
37

dwg-website

GA4GH Data Working Group website hosted at ga4gh.org
HTML
4
star
38

future-of-vcf

Repository for the GA4GH Future of VCF team.
3
star
39

g2p-team

GitHub Repo for the Genotype to Phenotype Task Team
3
star
40

ga4gh-common

common modules for GA4GH schemas, client and server, collected into one package
Python
3
star
41

ADA-M

3
star
42

tool-registry-reference-implementation

Clojure
3
star
43

swg-ssig

GA4GH Software Security Implementation Guide
2
star
44

fasp-clients

a place for client code from the FASP effort
Python
2
star
45

duri

2
star
46

hackathon2016

Repository to track DWG hackathon 2016 projects
2
star
47

ga4gh-consent-policy

2
star
48

ga4gh-testbed-ui

Web UI for the GA4GH Testbed Reporting service
JavaScript
2
star
49

hgvs-lib

C-based library for manipulating HGVS-formatted variants
C
2
star
50

approval-tracker

GitHub repository for tracking the progress of specification approval
2
star
51

refget-cloud

AWS serverless implementation of the refget service
Python
2
star
52

ga4gh-starter-kit-data-connect

Starter Kit server implementation of the GA4GH Data Connect specification
Java
2
star
53

refget-loader

Schedule the processing of ENA sequences and upload to S3 public dataset
Python
2
star
54

gks-common

Common classes and schemas used by all GKS specifications (ie. VR, VA, etc..)
Makefile
2
star
55

data-object-service-schemas

Intermediate repo to manage redirects to DRS repository (original home for DOS)
HTML
1
star
56

workflow-execution-server

1
star
57

pedigree-tools

JavaScript
1
star
58

Strategic-Refresh

1
star
59

build

A hub for working on the GA4GH Reference Server, Schemas, and Compliance projects.
Shell
1
star
60

ga4gh-crypt4gh

1
star
61

htsget-compliance

Compliance testing for htsget deployments
Python
1
star
62

vrs-hackathons

Project tracking for GA4GH Variation Representation Specification hackathons
Jupyter Notebook
1
star
63

experiments-metadata

1
star
64

ga4gh-testbed-api

GA4GH-wide testbed infrastructure for hosting testbed reports
Java
1
star
65

compliance-tests-ga4gh-tes

1
star
66

compliance-tests-ga4gh-wes

1
star
67

cloud-best-practices

Best practices (spec development, project/issue management, testing, CI, etc.) across all Cloud Work Stream standards
1
star
68

cloud-interop-ui

Manage Cloud interoperability test plugins and schedules through web interface
JavaScript
1
star
69

ga4gh-starter-kit-common

Common utils library for GA4GH Starter Kit web services
Java
1
star
70

product-process

1
star
71

tool-registry-server

1
star
72

cloud-conformance-testing

Test suites to validate implementations of GA4GH Cloud APIs
1
star
73

ga4gh-testbed-lib

Python library for creating GA4GH testbed reports according to a harmonized, cross-workstream schema
Python
1
star
74

cohort-rep-hackathon

Project tracking for GA4GH Computable Cohort Representation Hackathons
Jupyter Notebook
1
star
75

ga4gh-starter-kit-passport-ui

Central UI server connecting to other Passport-related microservices (ory hydra, ory kratos)
TypeScript
1
star
76

gks-starter-repo

A starting point to copy and use for teams wishing to quickly define their own schema extensions and profiles to the VA, Cat-VRS or VRS schemas.
Python
1
star
77

gks-validator

1
star
78

gk-pilot

A pilot implementation of GKS Work Stream specifications.
Makefile
1
star