dsSynthetic
is a DataSHIELD server side package for generating synthetic data.
Please also look at the corresponding client side package dsSyntheticClient
https://github.com/tombisho/dsSyntheticClient
or
https://github.com/neelsoumya/dsSyntheticClient
-
Install R Studio and the development environment as described below:
-
Install the virtual machines as described below:
-
Install dsBase and dsSynthetic on Opal server in the Virtual Machine (type tombisho/dsSynthetic and main in the textboxes)
See the link below on how to install a package in Opal
https://opaldoc.obiba.org/en/latest/web-user-guide/administration/datashield.html#add-package
-
Follow the bookdown
https://tombisho.github.io/synthetic_bookdown/
which uses the Opal demo server
https://opal-sandbox.mrc-epid.cam.ac.uk/
See bookdown below for executable code and a tutorial
https://tombisho.github.io/synthetic_bookdown/
which uses the Opal demo server which has all server side packages installed
https://opal-sandbox.mrc-epid.cam.ac.uk/
Thanks to the DataSHIELD team for providing the plaform on which these functions are based.
Thanks to OBiBa and Epigeny for the Opal data warehouse which we use to run DataSHIELD
Tom R.P. Bishop and Soumya Banerjee
If you like or use this work, please cite the following manuscript
Banerjee S, Bishop TRP. dsSynthetic: Synthetic data generation for the DataSHIELD federated analysis system. BMC Res. Notes. 2022;15 (1) :230
https://bmcresnotes.biomedcentral.com/articles/10.1186/s13104-022-06111-2
or
dsSynthetic: Synthetic data generation for the DataSHIELD federated analysis system