BetaVAEMultImpute

As missing values are frequently present in genomic data, practical methods to handle missing data are necessary for downstream analyses that require complete datasets. In this work, we describe the use of a deep learning framework based on the variational autoencoder (VAE) to impute missing values using multiple imputation in transcriptomic data.

Gene expression data is version 2 of the adjusted pan-cancer gene expression data obtained from Synapse: https://www.synapse.org/#!Synapse:syn4976369.2. Examples of preprocessing the raw data and creating missing value simulations can be found in ./preprocess.

Build your environments

conda env create --file conda_env/vae_imp_tf2.yaml
conda env create --file conda_env/lasso.yaml

Set up input files

You must have nextflow installed prior to use.

Set parameters in nextflow.config and example_config_VAE.json file

Imputation

This pipeline is written in nextflow to allow parallel computing across all imputation strategies.

Run pipeline via nextflow

nextflow run main.nf

View results in the output directory specified in the nextflow.config file.

Name		Name	Last commit message	Last commit date
Latest commit History 403 Commits
bin		bin
conda_env		conda_env
conf		conf
cross_validation		cross_validation
data		data
experiments		experiments
figure_scripts		figure_scripts
modules		modules
preprocess		preprocess
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
VAE_config.json		VAE_config.json
appendix.pdf		appendix.pdf
betaVAE.py		betaVAE.py
impute_missing.py		impute_missing.py
main.nf		main.nf
nextflow.config		nextflow.config
requirements.txt		requirements.txt
train_VAE.py		train_VAE.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BetaVAEMultImpute

About

Releases

Packages

Contributors 2

Languages

License

roskamsh/BetaVAEMultImpute

Folders and files

Latest commit

History

Repository files navigation

BetaVAEMultImpute

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages