Skip to content
Travis Haussler edited this page Jul 8, 2024 · 26 revisions

Welcome to the OrchidTM Synthetic Hospital Discharge Data Project Wiki!

For a quick start, see the README and Dev Guide.

Mission Statement

Our project leverages SyntheaTM, an open-source tool developed by the MITRE Corporation, to create synthetic hospital discharge data. SyntheaTM uses research-based models to generate rich medical histories for synthetic patients. We extract the hospital visits and create datasets that match the format of administrative data available to healthcare organizations. This synthetic data allows students and researchers to explore patient records without privacy concerns and develop analyses for hospitals to run on their own real data. Our goal is to make it easier for hospitals, public health officials, and researchers to collaborate and gain insights from administrative hospital data, while keeping patient information private.

Studies

For each study we generate data to support we generally need to generate and compile the data in a custom format with specific goals for the data. Here are the details for each study, including what the data goals are, and how to run the study: How to use Studies

Program information

Basic run:

python3 run_synthea.py
Note - this uses default output format of California PDD
Example output using this: Example Outputs