AIRR Community Data Standards

Introduction

One of the core initiatives of the Adaptive Immune Receptor Repertoire (AIRR) Community [Breden_2017] is to develop and maintain standards regarding the content, format and exchange of data and metadata from AIRR sequencing (AIRR-seq) experiments, thereby contributing to the global endeavor to make scientific data FAIR (findable, accessible, interoperable, reusable). The current Standards developed from three initially separate initiatives (Minimal Information, Data Representation, API), which converged over time. Further development of the Standards is currently carried out by the AIRR Community's Standards Working Group (WG), which resulted from the fusion of the "Data Representation" and "Minimal Standards" WGs and parts of the Common Repository WG.

MiAIRR Minimal Information Standard

The Minimal Information about an Adaptive Immune Receptor Repertoire (MiAIRR) data standard defines around 60 metadata fields which should be provided to facilitate evaluation and re-use of the data set. These fields are grouped according to the various stages of the typical workflow (e.g., study, subject, sample, experimental processing, data processing, data annotation). MiAIRR was originally published in 2017 [Rubelt_2017] and has since been integrated into the AIRR Data Schema, which has now become the authoritative reference for MiAIRR. For further information see the MiAIRR section of the AIRR Standards documentation.

AIRR Standardized Data Representation

The standardized data representation was first published in 2018 [Vander_Heiden_2018]. The current version of the schema can be found in this repository as OpenAPI v2 and OpenAPI v3 definitions. The two versions are functionally identical, but users should be aware that support for OpenAPI v2 will be discontinued in the medium term.
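
As a rough illustration of the tabular side of the data representation, the sketch below parses a tiny tab-separated rearrangement table using only the Python standard library. The column names (`sequence_id`, `v_call`, `j_call`, `productive`, `junction_aa`) and the `T`/`F` boolean encoding are assumed here to match the Rearrangement schema. The sample rows and the `parse_rearrangements` helper are invented for illustration, and the schema definitions in this repository remain the authoritative source.

```python
import csv
import io

# Hypothetical two-row example; all values are invented for illustration.
EXAMPLE_TSV = (
    "sequence_id\tv_call\tj_call\tproductive\tjunction_aa\n"
    "seq1\tIGHV1-2*02\tIGHJ4*02\tT\tCARDYW\n"
    "seq2\tIGHV3-23*01\tIGHJ6*02\tF\t\n"
)


def parse_rearrangements(handle):
    """Yield one dict per row; empty strings become None, T/F flags become bool."""
    reader = csv.DictReader(handle, delimiter="\t")
    for row in reader:
        record = {key: (value if value != "" else None) for key, value in row.items()}
        # Assumed encoding: "T" for true, "F" for false, empty for unknown.
        record["productive"] = {"T": True, "F": False}.get(record.get("productive"))
        yield record


records = list(parse_rearrangements(io.StringIO(EXAMPLE_TSV)))
productive_ids = [r["sequence_id"] for r in records if r["productive"]]
print(productive_ids)
```

This sketch does no schema validation. For real data, a schema-aware reader that checks required fields and types is preferable to hand-written parsing.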

AIRR Data Commons API

The AIRR Data Commons (ADC) API [Christley_2020] provides programmatic access to query and download AIRR-seq data from ADC repositories.
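
To give a feel for what programmatic access might look like, the following sketch prepares (but does not send) a JSON query against a repertoire endpoint. The base URL `https://adc.example.org/airr/v1` is a placeholder, the helper names are hypothetical, and the filter layout (`filters` with `op` and `content`) and the example field `subject.species.id` are assumptions about the query format. Consult the ADC API documentation for the actual endpoints and operators.

```python
import json
import urllib.request

# Placeholder base URL; substitute the address of a real ADC repository.
BASE_URL = "https://adc.example.org/airr/v1"


def build_repertoire_query(field, value, op="="):
    """Return a JSON-serialisable filter body (assumed ADC query shape)."""
    return {"filters": {"op": op, "content": {"field": field, "value": value}}}


def build_request(endpoint, query):
    """Prepare, without sending, a POST request carrying the JSON query."""
    data = json.dumps(query).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/{endpoint}",
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Assumed example: select repertoires from human subjects.
query = build_repertoire_query("subject.species.id", "NCBITAXON:9606")
request = build_request("repertoire", query)
print(request.full_url)
print(json.dumps(query, indent=2))
```

Passing `request` to `urllib.request.urlopen` would submit the query. That step is left out here so the example runs without network access.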

Implementations

The AIRR Standards are supported by the following services:

Releases and Issues

Official releases can be found in the release section of the repository. These releases are also permanently archived on Zenodo, where they can be referenced via the archive's DOI [Zenodo_1185414].

Contributions

The AIRR Community is an open initiative and we welcome all contributions. Please review our development process before contributing changes.

References

[Breden_2017] Breden F et al. Reproducibility and Reuse of Adaptive Immune Receptor Repertoire Data. Front Immunol 8:1418 (2017) [PMID:29163494] [DOI:10.3389/fimmu.2017.01418]
[Christley_2020] Christley S et al. The ADC API: A Web API for the Programmatic Query of the AIRR Data Commons. Front Big Data (2020) [PMID:33693395] [DOI:10.3389/fdata.2020.00022]
[Lees_2020] Lees W et al. OGRDB: a reference database of inferred immune receptor genes. Nucleic Acids Res 48:D964 (2020) [PMID:31566225] [DOI:10.1093/nar/gkz822]
[Rubelt_2017] Rubelt F et al. AIRR Community Recommendations for Sharing Immune Repertoire Sequencing Data. Nat Immunol 18:1274 (2017) [PMID:29144493] [DOI:10.1038/ni.3873]
[Vander_Heiden_2018] Vander Heiden JA et al. AIRR Community Standardized Representations for Annotated Immune Repertoires. Front Immunol 9:2206 (2018) [PMID:30323809] [DOI:10.3389/fimmu.2018.02206]
[Zenodo_1185414] Release archive of the AIRR Standards repository. (2018-2022) [DOI:10.5281/zenodo.1185414]

Copyright & License

Copyright 2015 - 2022 by the AIRR Community and contributors. This work is licensed under a Creative Commons Attribution 4.0 International License.
