These repositories represent accumulated tools and knowledge that enable labs to get up and running with cloud computing, mostly using cancer genome analysis as the use case. Some of the contents include:
- How to get your lab signed up for Google Cloud access through WUSTL IT
- A lecture and slides on Getting started on cloud with details specific to WUSTL.
- End-to-end genomic workflows for variant calling, rnaseq analyses, epigenomics, and more.
- Guidance for running these pipelines (or others) on Google Cloud, including complete walkthrough examples of using it to run an immunogenomics pipeline from data you've already put on the cloud or from data stored on the local compute1 cluster
- Guides to running workflows on the local compute cluster, using GMS (example) for workflow orchestration or Cromwell
- Annotation files pre-loaded on the cloud that are needed for many of these pipelines, as well as detailed instructions for creating your own
- Links to useful resources for analysis, cloud computing, or running workflows on other providers such as Terra or DNAnexus