Getting GWAS Data Ready for tranSMART

This paragraph describes the steps needed to prepare GWAS data for loading into tranSMART. All scripts and templates available in this repository assume the use of tranSMART-batch as the loading tool.

An overview of the GWAS preprocessing process is shown below:

Details on individual parts of the process are found in the following paragraphs.

Lifting GWAS data to target dbSNP version

Preprocessing lifted GWAS data

Once the GWAS data has been lifted to a target dbSNP version (defined by the tranSMART adminstrators), the next step is to map and curate the data. The figure below illustrates the process:

The R script responsible for the preprocessing and mapping of the lifted GWAS data is called PREPROCESS_GWAS.R. This script requires a text file (like shown in the figure below) with information on how the GWAS data should be mapped to the tranSMART data model for GWAS data, and the actual lifted GWAS data.

Extract Metadata on GWAS Data

This step is usually the most laborous of the entire process of loading GWAS data into tranSMART as it requires reading and extracting essential information on how the GWAS data was generated. The primary source of information is most likely the peer-reviewed journal papers accompanying the GWAS data.

Generate GWAS Metadata

Before loading GWAS data, it is necessary to generate some meaningful metadata to support the GWAS data. This metadata will be visible in the tranSMART interface alongside with the data. In order to ease the process of providing meaningful metadata, a GWAS metadata template has been created (can be found here: https://github.com/Lundbeck-Biometrics/tranSMART-GWAS/tree/master/GWAS_METADATA_TEMPLATE). The column "Field Value" is the only column that needs to be updated with data on the study.

Loading GWAS Data into tranSMART

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
GWAS_METADATA_TEMPLATE		GWAS_METADATA_TEMPLATE
PREPROCESSING		PREPROCESSING
GWAS_mapping.jpg		GWAS_mapping.jpg
GWAS_metadata_generation.jpg		GWAS_metadata_generation.jpg
GWAS_preprocessing_data.jpg		GWAS_preprocessing_data.jpg
GWAS_preprocessing_overview.jpg		GWAS_preprocessing_overview.jpg
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Getting GWAS Data Ready for tranSMART

Lifting GWAS data to target dbSNP version

Preprocessing lifted GWAS data

Extract Metadata on GWAS Data

Generate GWAS Metadata

Loading GWAS Data into tranSMART

Making a gwas.params file

tranSMART Batch Commands

About

Releases

Packages

Languages

Lundbeck-Biometrics/tranSMART-GWAS

Folders and files

Latest commit

History

Repository files navigation

Getting GWAS Data Ready for tranSMART

Lifting GWAS data to target dbSNP version

Preprocessing lifted GWAS data

Extract Metadata on GWAS Data

Generate GWAS Metadata

Loading GWAS Data into tranSMART

Making a gwas.params file

tranSMART Batch Commands

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages