
GO Dataset Metadata

Metadata on gene association datasets available from the GOC, organized by contributor. This metadata drives the new GO association file publishing pipeline.

Soon the contents of this page will be driven by this job

See GO Infrastructure Plan USC 2016 for details

See also geneontology#207

Pipeline configuration: https://github.com/geneontology/go-site/tree/master/pipeline

Editing the metadata

Anyone can suggest edits to any of the YAML files in this repo. In general, the files will be edited either by GO Central or by a representative from each producer. Edits are made through pull requests, which a member of GO Central evaluates and merges if they are appropriate and the Travis tests pass.

Schema

See ../datasets.schema.yaml

Any edit is checked against this schema
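
As a rough illustration of what such a check involves, the sketch below tests a parsed metadata record against a small set of required fields. It is a simplified stand-in, not the actual validation used by the tests: the field names (`id`, `label`, `datasets`), the `REQUIRED_FIELDS` table, and the `check_metadata` helper are all assumptions made for this example, and the authoritative rules are the ones in ../datasets.schema.yaml.

```python
# Hypothetical, simplified stand-in for a schema: required keys and expected types.
# The real rules live in ../datasets.schema.yaml; these field names are assumptions.
REQUIRED_FIELDS = {"id": str, "label": str, "datasets": list}


def check_metadata(record):
    """Return a list of problems found in a parsed record; empty means it passes."""
    problems = []
    for key, expected_type in REQUIRED_FIELDS.items():
        if key not in record:
            problems.append(f"missing required field: {key}")
        elif not isinstance(record[key], expected_type):
            problems.append(f"field {key} should be of type {expected_type.__name__}")
    return problems


# Records as they might look after parsing a contributor's YAML file
# (illustrative values only, not taken from the repository).
good_record = {"id": "example_group", "label": "Example Group", "datasets": []}
bad_record = {"id": "example_group", "datasets": "not-a-list"}

for name, record in [("good_record", good_record), ("bad_record", bad_record)]:
    issues = check_metadata(record)
    print(name, "OK" if not issues else issues)
```

A check of this kind can run locally before a pull request is opened, so that schema problems surface before the automated tests do.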

Jenkins Jobs

NEO

This metadata drives the Noctua Entity Ontology (NEO), which in turn drives selection in Noctua.

Future

Ultimately, the metadata here will drive pages such as: http://geneontology.org/page/download-annotations

It will also drive loading of GOlr and the GO graphstore.

See geneontology#207 for more details.

For current plans on GO data architecture see: https://docs.google.com/presentation/d/1TmkHm_gq085_P7jWQCFdo2Y8NTsHGMCSNKAyXCZAsho/edit#slide=id.p4