From da059065684e09697c3ab897a97705c59e8c2022 Mon Sep 17 00:00:00 2001 From: verena <9377970+vpchung@users.noreply.github.com> Date: Mon, 6 Nov 2023 23:41:27 +0000 Subject: [PATCH 1/5] fix headlines; increase char limit to 120 --- .../src/main/resources/db/challenges.csv | 348 +++++++++--------- 1 file changed, 174 insertions(+), 174 deletions(-) diff --git a/apps/openchallenges/challenge-service/src/main/resources/db/challenges.csv b/apps/openchallenges/challenge-service/src/main/resources/db/challenges.csv index 3cea75ade0..0daec5e80b 100644 --- a/apps/openchallenges/challenge-service/src/main/resources/db/challenges.csv +++ b/apps/openchallenges/challenge-service/src/main/resources/db/challenges.csv @@ -1,60 +1,60 @@ "id","slug","name","headline","description","avatar_url","website_url","status","difficulty","platform","doi","start_date","end_date","created_at","updated_at" -"1","network-topology-and-parameter-inference","Network Topology and Parameter Inference","Optimize methods to estimate biology model parameters for Network Topology a...","Participants are asked to develop and/or apply optimization methods, including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models.","","https://www.synapse.org/#!Synapse:syn2821735","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:18:09","2023-11-02 16:29:14" -"2","breast-cancer-prognosis","Breast Cancer Prognosis","Predict breast cancer survival from clinical and genomic data for Breast Can...","The goal of the breast cancer prognosis Challenge is to assess the accuracy of computational models designed to predict breast cancer survival, based on clinical information about the patient's tumor as well as genome-wide molecular profiling data including gene expression and copy number 
profiles.","","https://www.synapse.org/#!Synapse:syn2813426","completed","intermediate","1","","2012-07-12","2012-10-15","2023-11-01 22:18:13","2023-11-01 23:44:04" -"3","phil-bowen-als-prediction-prize4life","Phil Bowen ALS Prediction Prize4Life","Seeking treatment to halt ALS's fatal loss of motor function for Phil Bowen ...","Amyotrophic Lateral Sclerosis (ALS)-also known as Lou Gehrig's disease (in the US) or Motor Neurone disease (outside the US)-is a fatal neurological disease causing death of the nerve cells in the brain and spinal cord which control voluntary muscle movements. This leaves patients struggling with a progressive loss of motor function while leaving cognitive functions intact. Symptoms usually do not manifest until the age of 50 but can start earlier. At any given time, approximately five out of every 100,000 people worldwide suffer from ALS, though there would be a higher prevalence if the disease did not progress so rapidly, leading to the death of the patient. There are no known risk factors for developing ALS other than having a family member who has a hereditary form of the disease, which accounts for about 5-10% of ALS patients. There is also no known cure for ALS. The only FDA-approved drug for the disease is Riluzole, which has been shown to prolong the life span of someone w...","","https://www.synapse.org/#!Synapse:syn2826267","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:09:02","2023-11-01 20:40:42" -"4","drug-sensitivity-and-drug-synergy-prediction","Drug Sensitivity and Drug Synergy Prediction","Revolutionizing Cancer Therapeutics: Predicting Drug Sensitivity in Human Ce...","Development of new cancer therapeutics currently requires a long and protracted process of experimentation and testing. Human cancer cell lines represent a good model to help identify associations between molecular subtypes, pathways, and drug response. 
In recent years there have been several efforts to generate genomic profiles of collections of cell lines and to determine their response to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based either on the unperturbed genetic potential of a cancer cell, or by using perturbation data to incorporate knowledge of actual cell response. Making predictions from either of these data profiles will be beneficial in identifying single and combinatorial chemotherapeutic response in patients. To that end, the present challenge seeks computational methods, derived from the molecular profiling of cell lines both in a static state and in response to perturbation of ...","","https://www.synapse.org/#!Synapse:syn2785778","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:08:36","2023-11-02 18:25:20" -"5","niehs-ncats-unc-toxicogenetics","NIEHS-NCATS-UNC Toxicogenetics","Predicting cytotoxicity from genomic and chemical data for NIEHS-NCATS-UNC T...","This challenge is designed to build predictive models of cytotoxicity as mediated by exposure to environmental toxicants and drugs. To approach this question, we will provide a dataset containing cytotoxicity estimates as measured in lymphoblastoid cell lines derived from 884 individuals following in vitro exposure to 156 chemical compounds. In subchallenge 1, participants will be asked to model interindividual variability in cytotoxicity based on genomic profiles in order to predict cytotoxicity in unknown individuals. 
In subchallenge 2, participants will be asked to predict population-level parameters of cytotoxicity across chemicals based on structural attributes of compounds in order to predict median cytotoxicity and mean variance in toxicity for unknown compounds.","","https://www.synapse.org/#!Synapse:syn1761567","completed","intermediate","1","","2013-06-10","2013-09-15","2023-11-01 22:08:45","2023-11-01 22:06:01" -"6","whole-cell-parameter-estimation","Whole-Cell Parameter Estimation","Seeking innovative parameter estimation methods for large models for Whole-C...","The goal of this challenge is to explore and compare innovative approaches to parameter estimation of large, heterogeneous computational models. Participants are encouraged to develop and/or apply optimization methods, including the selection of the most informative experiments. The organizers encourage participants to form teams to collaboratively solve the challenge.","","https://www.synapse.org/#!Synapse:syn1876068","completed","intermediate","1","","2013-06-10","2013-09-23","2023-06-23 00:00:00","2023-11-01 22:06:23" -"7","hpn-dream-breast-cancer-network-inference","HPN-DREAM Breast Cancer Network Inference","Inferring causal signaling networks in breast cancer for HPN-DREAM Breast Ca...","The overall goal of the Heritage-DREAM breast cancer network inference challenge is to quickly and effectively advance our ability to infer causal signaling networks and predict protein phosphorylation dynamics in cancer. We provide extensive training data from experiments on four breast cancer cell lines stimulated with various ligands. 
The data comprise protein abundance time-courses under inhibitor perturbations.","","https://www.synapse.org/#!Synapse:syn1720047","completed","intermediate","1","","2013-06-10","2013-09-16","2023-06-23 00:00:00","2023-11-01 22:06:31" -"8","rheumatoid-arthritis-responder","Rheumatoid Arthritis Responder","Unlocking Anti-TNF Response Predictors: A Crowdsourced Breakthrough in RA Th...","The goal of this project is to use a crowd-based competition framework to develop a validated molecular predictor of anti-TNF response in RA. There is an increasing need for predictors of response to therapy in inflammatory disease driven by the observation that most clinically defined diseases show variable response and the growing availability of alternative therapies. Anti-TNF drugs in Rheumatoid Arthritis represent a prototypical example of this opportunity. A number of studies have tried, over the past decade, to develop a robust predictor of response. We believe the time is right to try a different approach to developing such a biomarker with a crowd-sourced collaborative competition. This is based on DREAM and Sage Bionetworks' experience with running competitions and the availability of new unpublished large-scale data relating to RA treatment response.THIS CHALLENGE RAN FROM FEBRUARY TO OCTOBER 2014 AND IS NOW CLOSED.","","https://www.synapse.org/#!Synapse:syn1734172","completed","intermediate","1","","2014-02-10","2014-06-04","2023-06-23 00:00:00","2023-11-01 22:06:54" -"9","icgc-tcga-dream-mutation-calling","ICGC-TCGA DREAM Mutation Calling","Crowdsourcing Challenge Seeks to Improve Cancer Mutation Detection for ICGC-...","The ICGC-TCGA DREAM Genomic Mutation Calling Challenge (herein, The Challenge) is an international effort to improve standard methods for identifying cancer-associated mutations and rearrangements in whole-genome sequencing (WGS) data. 
Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn312572","completed","intermediate","1","","2013-12-14","2016-04-22","2023-06-23 00:00:00","2023-10-14 05:38:15" -"10","acute-myeloid-leukemia-outcome-prediction","Acute Myeloid Leukemia Outcome Prediction","Uncover drivers of AML using clinical and proteomic data for Acute Myeloid L...","The AML Outcome Prediction Challenge provides a unique opportunity to access and interpret a rich dataset for AML patients that includes clinical covariates, select gene mutation status and proteomic data. Capitalizing on a unique AML reverse phase protein array (RPPA) dataset obtained at M.D. Anderson Cancer Center that captures 271 measurements for each patient, participants of the DREAM 9 Challenge will help uncover what drives AML. Outcomes of this Challenge have the potential to be used immediately to tailor therapies for newly diagnosed leukemia patients and to accelerate the development of new drugs for leukemia.","","https://www.synapse.org/#!Synapse:syn2455683","completed","intermediate","1","","2014-06-02","2014-09-15","2023-06-23 00:00:00","2023-10-14 05:38:16" -"11","broad-dream-gene-essentiality-prediction","Broad-DREAM Gene Essentiality Prediction","Crowdsourcing Models to Predict Cancer Cell Gene Dependencies for Broad-DREA...","The goal of this project is to use a crowd-based competition to develop predictive models that can infer gene dependency scores in cancer cells (genes that are essential to cancer cell viability when suppressed) using features of those cell lines. 
An additional goal is to find a small set of biomarkers (gene expression, copy number, and mutation features) that can best predict a single gene or set of genes.","","https://www.synapse.org/#!Synapse:syn2384331","completed","intermediate","1","","2014-06-02","2014-09-29","2023-06-23 00:00:00","2023-10-14 05:38:16" -"12","alzheimers-disease-big-data","Alzheimer's Disease Big Data","Seeking Accurate Predictive Biomarkers for Alzheimer's Diagnosis for Alzheim...","The goal of the Alzheimer's Disease Big Data DREAM Challenge #1 (AD#1) was to apply an open science approach to rapidly identify accurate predictive AD biomarkers that can be used by the scientific, industrial and regulatory communities to improve AD diagnosis and treatment. AD#1 will be the first in a series of AD Data Challenges to leverage genetics and brain imaging in combination with cognitive assessments, biomarkers and demographic information from cohorts ranging from cognitively normal to mild cognitively impaired to individuals with AD.","","https://www.synapse.org/#!Synapse:syn2290704","completed","intermediate","1","","2014-06-02","2014-10-17","2023-06-23 00:00:00","2023-10-14 05:38:17" -"13","olfaction-prediction","Olfaction Prediction","Predicting smell from molecule features for Olfaction Prediction","The goal of the DREAM Olfaction Prediction Challenge is to find models that can predict how a molecule smells from its physical and chemical features. A model that allows us to predict a smell from a molecule will provide fundamental insights into how odor chemicals are transformed into a smell percept in the brain. Further, being able to predict how a chemical smells will greatly accelerate the design of new molecules to be used as fragrances. 
Currently, fragrance chemists synthesize many molecules to obtain a new ingredient, but most of these will not have the desired qualities.","","https://www.synapse.org/#!Synapse:syn2811262","completed","intermediate","1","","2015-01-15","2015-05-01","2023-11-01 22:11:08","2023-10-14 05:38:17" -"14","prostate-cancer","Prostate Cancer","Predict survival of docetaxel treatment in mCRPC patients for Prostate Cancer","This challenge will attempt to improve the prediction of survival and toxicity of docetaxel treatment in patients with metastatic castration-resistant prostate cancer (mCRPC). The primary benefit of this Challenge will be to establish new quantitative benchmarks for prognostic modeling in mCRPC, with a potential impact for clinical decision making and ultimately understanding the mechanism of disease progression. Participating teams will be asked to submit predictive models based on clinical variables from the comparator arms of four phase III clinical trials with over 2,000 mCRPC patients treated with first-line docetaxel. The comparator arm of a clinical trial represents the patients that receive a treatment that is considered to be effective. This arm of the clinical trial is used to evaluate the effectiveness of the new therapy being tested.","","https://www.synapse.org/#!Synapse:syn2813558","completed","intermediate","1","","2015-03-16","2015-07-27","2023-06-23 00:00:00","2023-10-14 05:38:18" -"15","als-stratification-prize4life","ALS Stratification Prize4Life","Advancing ALS Treatment: Predicting Disease Progression and Survival with Data.","As illustrated by the overview figure below, (a) Challenge Data includes data from ALS clinical trials and ALS registries. ALS clinical trials consist of patients from clinical trials available open access on the PRO-ACT database and patients from 6 clinical trials not yet added into the database. Data from ALS registries was collected from patients in national ALS registries. 
(b) Data is divided into three subsets-training data provided to solvers in full, leaderboard, and validation data that is available only to the organizers and is reserved for the scoring of the challenge. (c) The goal of this challenge is then to predict the Clinical Targets, i.e. the disease progression as ALSFRS slope as well as survival. (d) For Building the Models, participants create two algorithms-one that selects features and one that predicts outcomes. To perform predictions, data from a given patient is fed into the selector . The selector selects 6 features and a cluster/model ID (3), e.g. from a...","","https://www.synapse.org/#!Synapse:syn2873386","completed","intermediate","1","","2015-06-22","2015-10-04","2023-06-23 00:00:00","2023-10-14 05:38:19" -"16","astrazeneca-sanger-drug-combination-prediction","AstraZeneca-Sanger Drug Combination Prediction","Predict effective drug combinations using genomic data for AstraZeneca-Sange...","To accelerate the understanding of drug synergy, AstraZeneca has partnered with the European Bioinformatic Institute, the Sanger Institute, Sage Bionetworks, and the distributed DREAM community to launch the AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge. This Challenge is designed to explore fundamental traits that underlie effective combination treatments and synergistic drug behavior using baseline genomic data, i.e. data collected pretreatment. As the basis of the Challenge, AstraZeneca is releasing ~11.5k experimentally tested drug combinations measuring cell viability over 118 drugs and 85 cancer cell lines (primarily colon, lung, and breast), and monotherapy drug response data for each drug and cell line. 
Moreover, in coordination with the Genomics of Drug Sensitivity in Cancer and COSMIC teams at the Sanger Institute, genomic data including gene expression, mutations (whole exome), copy-number alterations, and methylation data will be released into the publ...","","https://www.synapse.org/#!Synapse:syn4231880","completed","intermediate","1","","2015-09-03","2016-03-14","2023-06-23 00:00:00","2023-10-14 05:38:19" -"17","smc-dna-meta","SMC-DNA Meta","Seeking Most Accurate Somatic Mutation Detection Pipeline for SMC-DNA Meta","The goal of this Challenge is to identify the most accurate meta-pipeline for somatic mutation detection, and establish the state-of-the-art. The algorithms in this Challenge must use as input mutations predicted by one or more variant callers and output mutation calls associated with cancer. An additional goal is to highlight the complementarity of the calling algorithms and help understand their individual advantages/deficiencies.","","https://www.synapse.org/#!Synapse:syn4588939","completed","intermediate","1","","2015-08-17","2016-04-10","2023-06-23 00:00:00","2023-10-14 05:38:20" -"18","smc-het","SMC-Het","Crowdsourcing Challenge to Improve Tumor Subclonal Reconstruction for SMC-Het","The ICGC-TCGA DREAM Somatic Mutation Calling-Tumour Heterogeneity Challenge (SMC-Het) is an international effort to improve standard methods for subclonal reconstruction-to quantify and genotype each individual cell population present within a tumor. 
Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn2813581","completed","intermediate","1","","2015-11-16","2016-06-30","2023-11-01 22:21:29","2023-10-14 05:38:21" -"19","respiratory-viral","Respiratory Viral","Unraveling Viral Susceptibility: Early Predictors of Respiratory Infection a...","Respiratory viruses are highly infectious and cause acute illness in millions of people every year. However, there is wide variation in the physiologic response to exposure at the individual level. Some people that are exposed to virus are able to completely avoid infection. Others contract virus but are able to fight it off without exhibiting any symptoms of illness such as coughing, sneezing, sore throat or fever. It is not well understood what characteristics may protect individuals from respiratory viral infection. These individual responses are likely influenced by multiple processes including both the basal state of the human host upon exposure and the dynamics of host immune response in the early hours immediately following exposure. Many of these processes play out in the peripheral blood through activation and recruitment of circulating immune cells. 
Global gene expression patterns measured in peripheral blood at the time of symptom onset-several days after viral exposure...","","https://www.synapse.org/#!Synapse:syn5647810","completed","intermediate","1","","2016-05-16","2016-09-28","2023-06-23 00:00:00","2023-10-14 05:38:21" -"20","disease-module-identification","Disease Module Identification","Crowdsourcing challenge to find disease modules in genomic networks for Dise...","The Disease Module Identification DREAM Challenge is an open community effort to systematically assess module identification methods on a panel of state-of-the-art genomic networks and leverage the “wisdom of crowds” to discover novel modules and pathways underlying complex diseases.","","https://www.synapse.org/#!Synapse:syn6156761","completed","intermediate","1","https://doi.org/10.1038/s41592-019-0509-5","2016-06-24","2016-10-01","2023-11-01 22:21:32","2023-10-16 21:17:48" -"21","encode","ENCODE","Predict transcription factor binding sites from limited data for ENCODE","Transcription factors (TFs) are regulatory proteins that bind specific DNA sequence patterns (motifs) in the genome and affect transcription rates of target genes. Binding sites of TFs differ across cell types and experimental conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is an experimental method that is commonly used to obtain the genome-wide binding profile of a TF of interest in a specific cell type/condition. However, profiling the binding landscape of every TF in every cell type/condition is infeasible due to constraints on cost, material and effort. 
Hence, accurate computational prediction of in vivo TF binding sites is critical to complement experimental results.","","https://www.synapse.org/#!Synapse:syn6131484","completed","intermediate","1","","2016-07-07","2017-01-11","2023-11-01 22:21:32","2023-10-14 05:38:26" +"1","network-topology-and-parameter-inference","Network Topology and Parameter Inference","Optimize methods to estimate biology model parameters","Participants are asked to develop and/or apply optimization methods, including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models.","","https://www.synapse.org/#!Synapse:syn2821735","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:18:09","2023-11-02 16:29:14" +"2","breast-cancer-prognosis","Breast Cancer Prognosis","Predict breast cancer survival from clinical and genomic data for Breast Cancer Prognosis","The goal of the breast cancer prognosis Challenge is to assess the accuracy of computational models designed to predict breast cancer survival, based on clinical information about the patient's tumor as well as genome-wide molecular profiling data including gene expression and copy number profiles.","","https://www.synapse.org/#!Synapse:syn2813426","completed","intermediate","1","","2012-07-12","2012-10-15","2023-11-01 22:18:13","2023-11-01 23:44:04" +"3","phil-bowen-als-prediction-prize4life","Phil Bowen ALS Prediction Prize4Life","Seeking treatment to halt ALS's fatal loss of motor function","Amyotrophic Lateral Sclerosis (ALS)-also known as Lou Gehrig's disease (in the US) or Motor Neurone disease (outside the US)-is a fatal neurological disease causing death of the nerve cells in the brain and spinal cord which control voluntary muscle movements. This leaves patients struggling with a progressive loss of motor function while leaving cognitive functions intact. 
Symptoms usually do not manifest until the age of 50 but can start earlier. At any given time, approximately five out of every 100,000 people worldwide suffer from ALS, though there would be a higher prevalence if the disease did not progress so rapidly, leading to the death of the patient. There are no known risk factors for developing ALS other than having a family member who has a hereditary form of the disease, which accounts for about 5-10% of ALS patients. There is also no known cure for ALS. The only FDA-approved drug for the disease is Riluzole, which has been shown to prolong the life span of someone w...","","https://www.synapse.org/#!Synapse:syn2826267","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:09:02","2023-11-01 20:40:42" +"4","drug-sensitivity-and-drug-synergy-prediction","Drug Sensitivity and Drug Synergy Prediction","Revolutionizing Cancer Therapeutics: Predicting Drug Sensitivity in Human Cell Lines","Development of new cancer therapeutics currently requires a long and protracted process of experimentation and testing. Human cancer cell lines represent a good model to help identify associations between molecular subtypes, pathways, and drug response. In recent years there have been several efforts to generate genomic profiles of collections of cell lines and to determine their response to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based either on the unperturbed genetic potential of a cancer cell, or by using perturbation data to incorporate knowledge of actual cell response. Making predictions from either of these data profiles will be beneficial in identifying single and combinatorial chemotherapeutic response in patients. 
To that end, the present challenge seeks computational methods, derived from the molecular profiling of cell lines both in a static state and in response to perturbation of ...","","https://www.synapse.org/#!Synapse:syn2785778","completed","intermediate","1","","2012-06-01","2012-10-01","2023-11-01 22:08:36","2023-11-02 18:25:20" +"5","niehs-ncats-unc-toxicogenetics","NIEHS-NCATS-UNC Toxicogenetics","Predicting cytotoxicity from genomic and chemical data","This challenge is designed to build predictive models of cytotoxicity as mediated by exposure to environmental toxicants and drugs. To approach this question, we will provide a dataset containing cytotoxicity estimates as measured in lymphoblastoid cell lines derived from 884 individuals following in vitro exposure to 156 chemical compounds. In subchallenge 1, participants will be asked to model interindividual variability in cytotoxicity based on genomic profiles in order to predict cytotoxicity in unknown individuals. In subchallenge 2, participants will be asked to predict population-level parameters of cytotoxicity across chemicals based on structural attributes of compounds in order to predict median cytotoxicity and mean variance in toxicity for unknown compounds.","","https://www.synapse.org/#!Synapse:syn1761567","completed","intermediate","1","","2013-06-10","2013-09-15","2023-11-01 22:08:45","2023-11-01 22:06:01" +"6","whole-cell-parameter-estimation","Whole-Cell Parameter Estimation","Seeking innovative parameter estimation methods for large models","The goal of this challenge is to explore and compare innovative approaches to parameter estimation of large, heterogeneous computational models. Participants are encouraged to develop and/or apply optimization methods, including the selection of the most informative experiments. 
The organizers encourage participants to form teams to collaboratively solve the challenge.","","https://www.synapse.org/#!Synapse:syn1876068","completed","intermediate","1","","2013-06-10","2013-09-23","2023-06-23 00:00:00","2023-11-01 22:06:23" +"7","hpn-dream-breast-cancer-network-inference","HPN-DREAM Breast Cancer Network Inference","Inferring causal signaling networks in breast cancer","The overall goal of the Heritage-DREAM breast cancer network inference challenge is to quickly and effectively advance our ability to infer causal signaling networks and predict protein phosphorylation dynamics in cancer. We provide extensive training data from experiments on four breast cancer cell lines stimulated with various ligands. The data comprise protein abundance time-courses under inhibitor perturbations.","","https://www.synapse.org/#!Synapse:syn1720047","completed","intermediate","1","","2013-06-10","2013-09-16","2023-06-23 00:00:00","2023-11-01 22:06:31" +"8","rheumatoid-arthritis-responder","Rheumatoid Arthritis Responder","Unlocking Anti-TNF Response Predictors: A Crowdsourced Breakthrough in RA Therapy.","The goal of this project is to use a crowd-based competition framework to develop a validated molecular predictor of anti-TNF response in RA. There is an increasing need for predictors of response to therapy in inflammatory disease driven by the observation that most clinically defined diseases show variable response and the growing availability of alternative therapies. Anti-TNF drugs in Rheumatoid Arthritis represent a prototypical example of this opportunity. A number of studies have tried, over the past decade, to develop a robust predictor of response. We believe the time is right to try a different approach to developing such a biomarker with a crowd-sourced collaborative competition. 
This is based on DREAM and Sage Bionetworks' experience with running competitions and the availability of new unpublished large-scale data relating to RA treatment response.THIS CHALLENGE RAN FROM FEBRUARY TO OCTOBER 2014 AND IS NOW CLOSED.","","https://www.synapse.org/#!Synapse:syn1734172","completed","intermediate","1","","2014-02-10","2014-06-04","2023-06-23 00:00:00","2023-11-01 22:06:54" +"9","icgc-tcga-dream-mutation-calling","ICGC-TCGA DREAM Mutation Calling","Crowdsourcing challenge to improve cancer mutation detection","The ICGC-TCGA DREAM Genomic Mutation Calling Challenge (herein, The Challenge) is an international effort to improve standard methods for identifying cancer-associated mutations and rearrangements in whole-genome sequencing (WGS) data. Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn312572","completed","intermediate","1","","2013-12-14","2016-04-22","2023-06-23 00:00:00","2023-10-14 05:38:15" +"10","acute-myeloid-leukemia-outcome-prediction","Acute Myeloid Leukemia Outcome Prediction","Uncover drivers of AML using clinical and proteomic data","The AML Outcome Prediction Challenge provides a unique opportunity to access and interpret a rich dataset for AML patients that includes clinical covariates, select gene mutation status and proteomic data. Capitalizing on a unique AML reverse phase protein array (RPPA) dataset obtained at M.D. Anderson Cancer Center that captures 271 measurements for each patient, participants of the DREAM 9 Challenge will help uncover what drives AML. 
Outcomes of this Challenge have the potential to be used immediately to tailor therapies for newly diagnosed leukemia patients and to accelerate the development of new drugs for leukemia.","","https://www.synapse.org/#!Synapse:syn2455683","completed","intermediate","1","","2014-06-02","2014-09-15","2023-06-23 00:00:00","2023-10-14 05:38:16" +"11","broad-dream-gene-essentiality-prediction","Broad-DREAM Gene Essentiality Prediction","Crowdsourcing models to predict cancer cell gene dependencies","The goal of this project is to use a crowd-based competition to develop predictive models that can infer gene dependency scores in cancer cells (genes that are essential to cancer cell viability when suppressed) using features of those cell lines. An additional goal is to find a small set of biomarkers (gene expression, copy number, and mutation features) that can best predict a single gene or set of genes.","","https://www.synapse.org/#!Synapse:syn2384331","completed","intermediate","1","","2014-06-02","2014-09-29","2023-06-23 00:00:00","2023-10-14 05:38:16" +"12","alzheimers-disease-big-data","Alzheimer's Disease Big Data","Seeking accurate predictive biomarkers","The goal of the Alzheimer's Disease Big Data DREAM Challenge #1 (AD#1) was to apply an open science approach to rapidly identify accurate predictive AD biomarkers that can be used by the scientific, industrial and regulatory communities to improve AD diagnosis and treatment. 
AD#1 will be the first in a series of AD Data Challenges to leverage genetics and brain imaging in combination with cognitive assessments, biomarkers and demographic information from cohorts ranging from cognitively normal to mild cognitively impaired to individuals with AD.","","https://www.synapse.org/#!Synapse:syn2290704","completed","intermediate","1","","2014-06-02","2014-10-17","2023-06-23 00:00:00","2023-10-14 05:38:17" +"13","olfaction-prediction","Olfaction Prediction","Predicting smell from molecule features","The goal of the DREAM Olfaction Prediction Challenge is to find models that can predict how a molecule smells from its physical and chemical features. A model that allows us to predict a smell from a molecule will provide fundamental insights into how odor chemicals are transformed into a smell percept in the brain. Further, being able to predict how a chemical smells will greatly accelerate the design of new molecules to be used as fragrances. Currently, fragrance chemists synthesize many molecules to obtain a new ingredient, but most of these will not have the desired qualities.","","https://www.synapse.org/#!Synapse:syn2811262","completed","intermediate","1","","2015-01-15","2015-05-01","2023-11-01 22:11:08","2023-10-14 05:38:17" +"14","prostate-cancer","Prostate Cancer","Predict survival of docetaxel treatment in mCRPC patients","This challenge will attempt to improve the prediction of survival and toxicity of docetaxel treatment in patients with metastatic castration-resistant prostate cancer (mCRPC). The primary benefit of this Challenge will be to establish new quantitative benchmarks for prognostic modeling in mCRPC, with a potential impact for clinical decision making and ultimately understanding the mechanism of disease progression. 
Participating teams will be asked to submit predictive models based on clinical variables from the comparator arms of four phase III clinical trials with over 2,000 mCRPC patients treated with first-line docetaxel. The comparator arm of a clinical trial represents the patients that receive a treatment that is considered to be effective. This arm of the clinical trial is used to evaluate the effectiveness of the new therapy being tested.","","https://www.synapse.org/#!Synapse:syn2813558","completed","intermediate","1","","2015-03-16","2015-07-27","2023-06-23 00:00:00","2023-10-14 05:38:18" +"15","als-stratification-prize4life","ALS Stratification Prize4Life","Advancing ALS Treatment: Predicting Disease Progression and Survival with Data","As illustrated by the overview figure below, (a) Challenge Data includes data from ALS clinical trials and ALS registries. ALS clinical trials consist of patients from clinical trials available open access on the PRO-ACT database and patients from 6 clinical trials not yet added into the database. Data from ALS registries was collected from patients in national ALS registries. (b) Data is divided into three subsets-training data provided to solvers in full, leaderboard, and validation data that is available only to the organizers and is reserved for the scoring of the challenge. (c) The goal of this challenge is then to predict the Clinical Targets, i.e. the disease progression as ALSFRS slope as well as survival. (d) For Building the Models, participants create two algorithms-one that selects features and one that predicts outcomes. To perform predictions, data from a given patient is fed into the selector . The selector selects 6 features and a cluster/model ID (3), e.g. 
from a...","","https://www.synapse.org/#!Synapse:syn2873386","completed","intermediate","1","","2015-06-22","2015-10-04","2023-06-23 00:00:00","2023-10-14 05:38:19" +"16","astrazeneca-sanger-drug-combination-prediction","AstraZeneca-Sanger Drug Combination Prediction","Predict effective drug combinations using genomic data","To accelerate the understanding of drug synergy, AstraZeneca has partnered with the European Bioinformatic Institute, the Sanger Institute, Sage Bionetworks, and the distributed DREAM community to launch the AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge. This Challenge is designed to explore fundamental traits that underlie effective combination treatments and synergistic drug behavior using baseline genomic data, i.e. data collected pretreatment. As the basis of the Challenge, AstraZeneca is releasing ~11.5k experimentally tested drug combinations measuring cell viability over 118 drugs and 85 cancer cell lines (primarily colon, lung, and breast), and monotherapy drug response data for each drug and cell line. Moreover, in coordination with the Genomics of Drug Sensitivity in Cancer and COSMIC teams at the Sanger Institute, genomic data including gene expression, mutations (whole exome), copy-number alterations, and methylation data will be released into the publ...","","https://www.synapse.org/#!Synapse:syn4231880","completed","intermediate","1","","2015-09-03","2016-03-14","2023-06-23 00:00:00","2023-10-14 05:38:19" +"17","smc-dna-meta","SMC-DNA Meta","Seeking most accurate somatic mutation detection pipeline","The goal of this Challenge is to identify the most accurate meta-pipeline for somatic mutation detection, and establish the state-of-the-art. The algorithms in this Challenge must use as input mutations predicted by one or more variant callers and output mutation calls associated with cancer. 
An additional goal is to highlight the complementarity of the calling algorithms and help understand their individual advantages/deficiencies.","","https://www.synapse.org/#!Synapse:syn4588939","completed","intermediate","1","","2015-08-17","2016-04-10","2023-06-23 00:00:00","2023-10-14 05:38:20" +"18","smc-het","SMC-Het","Crowdsourcing challenge to improve tumor subclonal reconstruction","The ICGC-TCGA DREAM Somatic Mutation Calling-Tumour Heterogeneity Challenge (SMC-Het) is an international effort to improve standard methods for subclonal reconstruction-to quantify and genotype each individual cell population present within a tumor. Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn2813581","completed","intermediate","1","","2015-11-16","2016-06-30","2023-11-01 22:21:29","2023-10-14 05:38:21" +"19","respiratory-viral","Respiratory Viral","Unraveling Viral Susceptibility: Early Predictors of Respiratory Infection and Contagiousness","Respiratory viruses are highly infectious and cause acute illness in millions of people every year. However, there is wide variation in the physiologic response to exposure at the individual level. Some people that are exposed to virus are able to completely avoid infection. Others contract virus but are able to fight it off without exhibiting any symptoms of illness such as coughing, sneezing, sore throat or fever. It is not well understood what characteristics may protect individuals from respiratory viral infection. These individual responses are likely influenced by multiple processes including both the basal state of the human host upon exposure and the dynamics of host immune response in the early hours immediately following exposure. 
Many of these processes play out in the peripheral blood through activation and recruitment of circulating immune cells. Global gene expression patterns measured in peripheral blood at the time of symptom onset-several days after viral exposure...","","https://www.synapse.org/#!Synapse:syn5647810","completed","intermediate","1","","2016-05-16","2016-09-28","2023-06-23 00:00:00","2023-10-14 05:38:21" +"20","disease-module-identification","Disease Module Identification","Crowdsourcing challenge to find disease modules in genomic networks","The Disease Module Identification DREAM Challenge is an open community effort to systematically assess module identification methods on a panel of state-of-the-art genomic networks and leverage the “wisdom of crowds” to discover novel modules and pathways underlying complex diseases.","","https://www.synapse.org/#!Synapse:syn6156761","completed","intermediate","1","https://doi.org/10.1038/s41592-019-0509-5","2016-06-24","2016-10-01","2023-11-01 22:21:32","2023-10-16 21:17:48" +"21","encode","ENCODE","Predict transcription factor binding sites from limited data","Transcription factors (TFs) are regulatory proteins that bind specific DNA sequence patterns (motifs) in the genome and affect transcription rates of target genes. Binding sites of TFs differ across cell types and experimental conditions. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is an experimental method that is commonly used to obtain the genome-wide binding profile of a TF of interest in a specific cell type/condition. However, profiling the binding landscape of every TF in every cell type/condition is infeasible due to constraints on cost, material and effort. 
Hence, accurate computational prediction of in vivo TF binding sites is critical to complement experimental results.","","https://www.synapse.org/#!Synapse:syn6131484","completed","intermediate","1","","2016-07-07","2017-01-11","2023-11-01 22:21:32","2023-10-14 05:38:26" "22","idea","Idea","Fostering Collaborative Solutions in Health: The DREAM Idea Challenge","The DREAM Idea Challenge is designed to collaboratively shape and enable the solution of a question fundamental to improving human health. In the process, all proposals and their evaluation will be made publicly available for the explicit purpose of connecting modelers and experimentalists who want to address the same question. This Wall of Models will enable new collaborations, and help turn every good modeling idea into a success story. It will further serve as a basis for new DREAM challenges.","","https://www.synapse.org/#!Synapse:syn5659209","completed","advanced","1","","2016-06-15","2017-04-30","2023-06-23 00:00:00","2023-10-14 05:38:26" -"23","smc-rna","SMC-RNA","Crowdsourcing Challenge Seeks to Improve Cancer Mutation Detection from RNA ...","The ICGC-TCGA DREAM Somatic Mutation Calling-RNA Challenge (SMC-RNA) is an international effort to improve standard methods for identifying cancer-associated rearrangements in RNA sequencing (RNA-seq) data. 
Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn2813589","completed","intermediate","1","","2016-06-29","2017-05-02","2023-06-23 00:00:00","2023-10-14 05:38:29" -"24","digital-mammography-dream-challenge","Digital Mammography DREAM Challenge","Improve mammography prediction to detect breast cancer early for Digital Mam...","The Digital Mammography DREAM Challenge will attempt to improve the predictive accuracy of digital mammography for the early detection of breast cancer. The primary benefit of this Challenge will be to establish new quantitative tools-machine learning, deep learning or other-that can help decrease the recall rate of screening mammography, with a potential impact on shifting the balance of routine breast cancer screening towards more benefit and less harm. Participating teams will be asked to submit predictive models based on over 640,000 de-identified digital mammography images from over 86000 subjects, with corresponding clinical variables.","","https://www.synapse.org/#!Synapse:syn4224222","completed","advanced","1","https://doi.org/10.1001/jamanetworkopen.2020.0265","2016-11-18","2017-05-16","2023-06-23 00:00:00","2023-10-14 05:38:29" -"25","multiple-myeloma","Multiple Myeloma","Develop precise risk model for myeloma patients for Multiple Myeloma","Multiple myeloma (MM) is a cancer of the plasma cells in the bone marrow, with about 25,000 newly diagnosed patients per year in the United States alone. The disease's clinical course depends on a complex interplay of clinical traits and molecular characteristics of the plasma cells.1 Since risk-adapted therapy is becoming standard of care, there is an urgent need for a precise risk stratification model to assist in therapeutic decision-making and research. 
While progress has been made, there remains a significant opportunity to improve patient stratification to optimize treatment and to develop new therapies for high-risk patients. A DREAM Challenge represents a chance not only to integrate available data and analytical approaches to tackle this important problem, but also provides the ability to benchmark potential methods to identify those with the greatest potential to yield patient care benefits in the future.","","https://www.synapse.org/#!Synapse:syn6187098","completed","intermediate","1","","2017-06-30","2017-11-08","2023-06-23 00:00:00","2023-10-14 05:38:31" -"26","ga4gh-dream-workflow-execution","GA4GH-DREAM Workflow Execution","Develop technologies to enable distributed genomic data analysis for GA4GH-D...","The highly distributed and disparate nature of genomic and clinical data generated around the world presents an enormous challenge for those scientists who wish to integrate and analyze these data. The sheer volume of data often exceeds the capacity for storage at any one site and prohibits the efficient transfer between sites. To address this challenge, researchers must bring their computation to the data. Numerous groups are now developing technologies and best practice methodologies for running portable and reproducible genomic analysis pipelines as well as tools and APIs for discovering genomic analysis resources. 
Software development, deployment, and sharing efforts in these groups commonly rely on the use of modular workflow pipelines and virtualization based on Docker containers and related tools.","","https://www.synapse.org/#!Synapse:syn8507133","completed","intermediate","1","","2017-07-21","2017-12-31","2023-06-23 00:00:00","2023-10-14 05:38:31" -"27","parkinsons-disease-digital-biomarker","Parkinson's Disease Digital Biomarker","Benchmarking methods to develop Parkinson's digital signatures from sensor d...","The Parkinson's Disease Digital Biomarker DREAM Challenge is a first of it's kind challenge, designed to benchmark methods for the processing of sensor data for development of digital signatures reflective of Parkinson's Disease. Participants will be provided with raw sensor (accelerometer, gyroscope, and magnetometer) time series data recorded during the performance of pre-specified motor tasks, and will be asked to extract data features which are predictive of PD pathology. In contrast to traditional DREAM challenges, this one will focus on feature extraction rather than predictive modeling, and submissions will be evaluated based on their ability to predict disease phenotype using an array of standard machine learning algorithms.","","https://www.synapse.org/#!Synapse:syn8717496","completed","intermediate","1","","2017-07-06","2017-11-10","2023-06-23 00:00:00","2023-10-14 05:38:32" -"28","nci-cptac-proteogenomics","NCI-CPTAC Proteogenomics","Develop tools to extract insights from cancer proteomics data for NCI-CPTAC ...","Cancer is driven by aberrations in the genome [1,2], and these alterations manifest themselves largely in the changes in the structure and abundance of proteins, the main functional gene products. Hence, characterization and analyses of alterations in the proteome has the promise to shed light into cancer development and may improve development of both biomarkers and therapeutics. 
Measuring the proteome is very challenging, but recent rapid technology developments in mass spectrometry are enabling deep proteomics analysis [3]. Multiple initiatives have been launched to take advantage of this development to characterize the proteome of tumours, such as the Clinical Proteomic Tumor Analysis Consortium (CPTAC). These efforts hold the promise to revolutionize cancer research, but this will only be possible if the community develops computational tools powerful enough to extract the most information from the proteome, and to understand the association between genome, transcriptome and ...","","https://www.synapse.org/#!Synapse:syn8228304","completed","intermediate","1","","2017-06-26","2017-11-20","2023-11-01 22:21:37","2023-10-14 05:38:33" -"29","multi-targeting-drug","Multi-Targeting Drug","Seeking Generalizable Methods to Predict Multi-Target Compound Binding for M...","The objective of this challenge is to incentivize development of methods for predicting compounds that bind to multiple targets. In particular, methods that are generalizable to multiple prediction problems are sought. To achieve this, participants will be asked to predict 2 separate compounds, each having specific targets to which they should bind, and a list of anti-targets to avoid. Participants should use the same methods to produce answers for questions 1 and 2.","","https://www.synapse.org/#!Synapse:syn8404040","completed","intermediate","1","","2017-10-05","2018-02-26","2023-06-23 00:00:00","2023-10-14 05:38:33" -"30","single-cell-transcriptomics","Single Cell Transcriptomics","Reconstructing Cell Locations in Drosophila Embryo from Transcripts for Sing...","In this Challenge on Single-Cell Transcriptomics, participants will reconstruct the location of single cells in the Drosophila embryo using single-cell transcriptomic data. 
Data will be made available in late August and participating challenge teams can work on the data and submit their results previous to the DREAM Conference. The best performers will be announced at the DREAM conference on Dec 8.","","https://www.synapse.org/#!Synapse:syn15665609","completed","intermediate","1","","2018-09-04","2018-11-21","2023-06-23 00:00:00","2023-10-14 05:38:34" -"31","idg-drug-kinase-binding","IDG Drug-Kinase Binding","Challenge seeks machine learning for drug-kinase binding prediction for IDG ...","This IDG-DREAM Drug-Kinase Binding Prediction Challenge seeks to evaluate the power of statistical and machine learning models as a systematic and cost-effective means for catalyzing compound-target interaction mapping efforts by prioritizing most potent interactions for further experimental evaluation. The Challenge will focus on kinase inhibitors, due to their clinical importance [2], and will be implemented in a screening-based, pre-competitive drug discovery project in collaboration with theIlluminating the Druggable Genome (IDG) Kinase-focused Data and Resource Generation Center, consortium, with the aim to establish kinome-wide target profiles of small-molecule agents, with the goal of extending the druggability of the human kinome space.","","https://www.synapse.org/#!Synapse:syn15667962","completed","intermediate","1","","2018-10-01","2019-04-18","2023-06-23 00:00:00","2023-10-14 05:38:35" +"23","smc-rna","SMC-RNA","Crowdsourcing challenge to improve cancer mutation detection from RNA Data","The ICGC-TCGA DREAM Somatic Mutation Calling-RNA Challenge (SMC-RNA) is an international effort to improve standard methods for identifying cancer-associated rearrangements in RNA sequencing (RNA-seq) data. 
Leaders of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA) cancer genomics projects are joining with Sage Bionetworks and IBM-DREAM to initiate this innovative open crowd-sourced Challenge [1-3].","","https://www.synapse.org/#!Synapse:syn2813589","completed","intermediate","1","","2016-06-29","2017-05-02","2023-06-23 00:00:00","2023-10-14 05:38:29" +"24","digital-mammography-dream-challenge","Digital Mammography DREAM Challenge","Improve mammography prediction to detect breast cancer early","The Digital Mammography DREAM Challenge will attempt to improve the predictive accuracy of digital mammography for the early detection of breast cancer. The primary benefit of this Challenge will be to establish new quantitative tools-machine learning, deep learning or other-that can help decrease the recall rate of screening mammography, with a potential impact on shifting the balance of routine breast cancer screening towards more benefit and less harm. Participating teams will be asked to submit predictive models based on over 640,000 de-identified digital mammography images from over 86000 subjects, with corresponding clinical variables.","","https://www.synapse.org/#!Synapse:syn4224222","completed","advanced","1","https://doi.org/10.1001/jamanetworkopen.2020.0265","2016-11-18","2017-05-16","2023-06-23 00:00:00","2023-10-14 05:38:29" +"25","multiple-myeloma","Multiple Myeloma","Develop precise risk model for myeloma patients","Multiple myeloma (MM) is a cancer of the plasma cells in the bone marrow, with about 25,000 newly diagnosed patients per year in the United States alone. The disease's clinical course depends on a complex interplay of clinical traits and molecular characteristics of the plasma cells.1 Since risk-adapted therapy is becoming standard of care, there is an urgent need for a precise risk stratification model to assist in therapeutic decision-making and research. 
While progress has been made, there remains a significant opportunity to improve patient stratification to optimize treatment and to develop new therapies for high-risk patients. A DREAM Challenge represents a chance not only to integrate available data and analytical approaches to tackle this important problem, but also provides the ability to benchmark potential methods to identify those with the greatest potential to yield patient care benefits in the future.","","https://www.synapse.org/#!Synapse:syn6187098","completed","intermediate","1","","2017-06-30","2017-11-08","2023-06-23 00:00:00","2023-10-14 05:38:31" +"26","ga4gh-dream-workflow-execution","GA4GH-DREAM Workflow Execution","Develop technologies to enable distributed genomic data analysis","The highly distributed and disparate nature of genomic and clinical data generated around the world presents an enormous challenge for those scientists who wish to integrate and analyze these data. The sheer volume of data often exceeds the capacity for storage at any one site and prohibits the efficient transfer between sites. To address this challenge, researchers must bring their computation to the data. Numerous groups are now developing technologies and best practice methodologies for running portable and reproducible genomic analysis pipelines as well as tools and APIs for discovering genomic analysis resources. 
Software development, deployment, and sharing efforts in these groups commonly rely on the use of modular workflow pipelines and virtualization based on Docker containers and related tools.","","https://www.synapse.org/#!Synapse:syn8507133","completed","intermediate","1","","2017-07-21","2017-12-31","2023-06-23 00:00:00","2023-10-14 05:38:31" +"27","parkinsons-disease-digital-biomarker","Parkinson's Disease Digital Biomarker","Benchmarking methods to develop Parkinson's digital signatures from sensor data for Parkinson's Disease","The Parkinson's Disease Digital Biomarker DREAM Challenge is a first of it's kind challenge, designed to benchmark methods for the processing of sensor data for development of digital signatures reflective of Parkinson's Disease. Participants will be provided with raw sensor (accelerometer, gyroscope, and magnetometer) time series data recorded during the performance of pre-specified motor tasks, and will be asked to extract data features which are predictive of PD pathology. In contrast to traditional DREAM challenges, this one will focus on feature extraction rather than predictive modeling, and submissions will be evaluated based on their ability to predict disease phenotype using an array of standard machine learning algorithms.","","https://www.synapse.org/#!Synapse:syn8717496","completed","intermediate","1","","2017-07-06","2017-11-10","2023-06-23 00:00:00","2023-10-14 05:38:32" +"28","nci-cptac-proteogenomics","NCI-CPTAC Proteogenomics","Develop tools to extract insights from cancer proteomics data","Cancer is driven by aberrations in the genome [1,2], and these alterations manifest themselves largely in the changes in the structure and abundance of proteins, the main functional gene products. Hence, characterization and analyses of alterations in the proteome has the promise to shed light into cancer development and may improve development of both biomarkers and therapeutics. 
Measuring the proteome is very challenging, but recent rapid technology developments in mass spectrometry are enabling deep proteomics analysis [3]. Multiple initiatives have been launched to take advantage of this development to characterize the proteome of tumours, such as the Clinical Proteomic Tumor Analysis Consortium (CPTAC). These efforts hold the promise to revolutionize cancer research, but this will only be possible if the community develops computational tools powerful enough to extract the most information from the proteome, and to understand the association between genome, transcriptome and ...","","https://www.synapse.org/#!Synapse:syn8228304","completed","intermediate","1","","2017-06-26","2017-11-20","2023-11-01 22:21:37","2023-10-14 05:38:33" +"29","multi-targeting-drug","Multi-Targeting Drug","Seeking generalizable methods to predict multi-target compound binding","The objective of this challenge is to incentivize development of methods for predicting compounds that bind to multiple targets. In particular, methods that are generalizable to multiple prediction problems are sought. To achieve this, participants will be asked to predict 2 separate compounds, each having specific targets to which they should bind, and a list of anti-targets to avoid. Participants should use the same methods to produce answers for questions 1 and 2.","","https://www.synapse.org/#!Synapse:syn8404040","completed","intermediate","1","","2017-10-05","2018-02-26","2023-06-23 00:00:00","2023-10-14 05:38:33" +"30","single-cell-transcriptomics","Single Cell Transcriptomics","Reconstructing cell locations in Drosophila embryo from transcripts","In this Challenge on Single-Cell Transcriptomics, participants will reconstruct the location of single cells in the Drosophila embryo using single-cell transcriptomic data. Data will be made available in late August and participating challenge teams can work on the data and submit their results previous to the DREAM Conference. 
The best performers will be announced at the DREAM conference on Dec 8.","","https://www.synapse.org/#!Synapse:syn15665609","completed","intermediate","1","","2018-09-04","2018-11-21","2023-06-23 00:00:00","2023-10-14 05:38:34" +"31","idg-drug-kinase-binding","IDG Drug-Kinase Binding","Challenge seeks machine learning for drug-kinase binding prediction for IDG Drug-Kinase Binding","This IDG-DREAM Drug-Kinase Binding Prediction Challenge seeks to evaluate the power of statistical and machine learning models as a systematic and cost-effective means for catalyzing compound-target interaction mapping efforts by prioritizing most potent interactions for further experimental evaluation. The Challenge will focus on kinase inhibitors, due to their clinical importance [2], and will be implemented in a screening-based, pre-competitive drug discovery project in collaboration with theIlluminating the Druggable Genome (IDG) Kinase-focused Data and Resource Generation Center, consortium, with the aim to establish kinome-wide target profiles of small-molecule agents, with the goal of extending the druggability of the human kinome space.","","https://www.synapse.org/#!Synapse:syn15667962","completed","intermediate","1","","2018-10-01","2019-04-18","2023-06-23 00:00:00","2023-10-14 05:38:35" "32","malaria","Malaria","Predict malaria drug resistance from parasite gene expression for Malaria","The Malaria DREAM Challenge is open to anyone interested in contributing to the development of computational models that address important problems in advancing the fight against malaria. The overall goal of the first Malaria DREAM Challenge is to predict Artemisinin (Art) drug resistance level of a test set of malaria parasites using their in vitro transcription data and a training set consisting of published in vivo and unpublished in vitrotranscriptomes. 
The in vivodataset consists of ~1000 transcription samples from various geographic locations covering a wide range of life cycles and resistance levels, with other accompanying data such as patient age, geographic location, Art combination therapy used, etc [Mok et al (2015) Science]. The in vitro transcription dataset consists of 55 isolates, with transcription collected at two timepoints (6 and 24 hours post-invasion), in the absence or presence of an Art perturbation, for two biological replicates using a custom microarray a...","","https://www.synapse.org/#!Synapse:syn16924919","completed","intermediate","1","","2019-04-30","2019-08-15","2023-06-23 00:00:00","2023-10-14 05:38:35" -"33","preterm-birth-prediction-transcriptomics","Preterm Birth Prediction - Transcriptomics","Developing Accurate, Inexpensive Molecular Clock to Determine Gestational Ag...","A basic need in pregnancy care is to establish gestational age, and inaccurate estimates may lead to unnecessary interventions and sub-optimal patient management. Current approaches to establish gestational age rely on patient's recollection of her last menstrual period and/or ultrasound, with the latter being not only costly but also less accurate if not performed during the first trimester of pregnancy. Therefore development of an inexpensive and accurate molecular clock of pregnancy would be of benefit to patients and health care systems. Participants in sub-challenge 1 (Prediction of gestational age) will be given whole blood gene topic_3170 collected from pregnant women to develop prediction models for the gestational age at blood draw. Another challenge in obstetrics, in both low and high-income countries, is identification and treatment of women at risk of developing the ‘great obstetrical syndromes‘. 
Of these, preterm birth (PTB), defined as giving birth prior to completio...","","https://www.synapse.org/#!Synapse:syn18380862","completed","good_for_beginners","1","","2019-05-04","2019-12-05","2023-06-23 00:00:00","2023-10-14 05:38:36" -"34","single-cell-signaling-in-breast-cancer","Single-Cell Signaling in Breast Cancer","Exploring heterogeneous signaling in single cancer cells for Single-Cell Sig...","Signaling underlines nearly every cellular event. Individual cells, even if genetically identical, respond to perturbation in different ways. This underscores the relevance of cellular heterogeneity, in particular in how cells respond to drugs. This is of high relevance since the fact that a subset of cells do not respond (or only weakly) to drugs can render this drug an ineffective treatment. In spite of its relevance to many diseases, comprehensive studies on the heterogeneous signaling in single cells are still lacking. We have generated the, to our knowledge, currently largest single cell signaling dataset on a panel of 67 well-characterized breast cancer cell lines by mass cytometry (3'015 conditions, ~80 mio single cells, 38 markers; Bandura et al. 2009; Bendall et al., 2011; Bodenmiller et al., 2012; Lun et al., 2017; Lun et al., 2019). These cell lines are, among others, also characterized at the genomic, transcriptomic, and proteomic level (Marcotte et al., 2016). 
We ask ...","","https://www.synapse.org/#!Synapse:syn20366914","completed","intermediate","1","","2018-08-20","2019-11-15","2023-06-23 00:00:00","2023-10-14 05:38:37" -"35","ehr-dream-challenge-patient-mortality-prediction","EHR DREAM Challenge: Patient Mortality Prediction","New tools to reconstruct cell lineages from CRISPR mutations for EHR DREAM C...","The recent advent of new CRISPR-based molecular tools allows the reconstruction of cell lineages based on the phylogenetical analysis of DNA mutations induced by CRISPR during development and promises to solve the lineage of complex model organisms at single-cell resolution (see image from McKenna et al Science 2016). To date, however, no lineage reconstruction algorithms have been rigorously examined for their performance/robustness across diverse molecular tools, datasets, and number of cells/size of lineage trees. It also remains unclear whether new Machine-Learning algorithms that go beyond the classical ones developed for reconstructing phylogenetic trees, could consistently reconstruct cell lineages to a high degree of accuracy. The challenge-a partnership between The Allen Institute and DREAM-will comprise 3 subchallenges that consist of reconstructing cell lineage trees of different sizes and nature. 
In subchallenge 1, participants will be given experimental molecular data...","","https://www.synapse.org/#!Synapse:syn18405991","completed","intermediate","1","https://doi.org/10.1093/jamia/ocad159","2019-09-09","2020-01-23","2023-06-23 00:00:00","2023-11-02 18:25:23" -"36","allen-institute-cell-lineage-reconstruction","Allen Institute Cell Lineage Reconstruction","New tools enable reconstructing complex cell lineages at single-cell resolut...","The recent advent of new CRISPR-based molecular tools allows the reconstruction of cell lineages based on the phylogenetical analysis of DNA mutations induced by CRISPR during development and promises to solve the lineage of complex model organisms at single-cell resolution. To date, however, no lineage reconstruction algorithms have been rigorously examined for their performance/robustness across diverse molecular tools, datasets, and number of cells/size of lineage trees. It also remains unclear whether new Machine-Learning algorithms that go beyond the classical ones developed for reconstructing phylogenetic trees, could consistently reconstruct cell lineages to a high degree of accuracy. The challenge-a partnership between The Allen Institute and DREAM-will comprise 3 subchallenges that consist of reconstructing cell lineage trees of different sizes and nature. In subchallenge 1, participants will be given experimental molecular data to reconstruct in vitro cell lineages of l...","","https://www.synapse.org/#!Synapse:syn20692755","completed","intermediate","1","","2019-10-15","2020-02-06","2023-06-23 00:00:00","2023-11-02 18:25:24" -"37","tumor-deconvolution","Tumor Deconvolution","Assess computational methods to deconvolve bulk tumor data into immune compo...","The extent of stromal and immune cell infiltration within solid tumors has prognostic and predictive significance. 
Unfortunately, expression profiling of tumors has, until very recently, largely been undertaken using bulk techniques (e.g., microarray and RNA-seq). Unlike single-cell methods (e.g., single-cell RNA-seq, FACS, mass cytometry, or immunohistochemistry), bulk approaches average expression across all cells (cancer, stromal, and immune) within the sample and, hence, do not directly quantitate tumor infiltration. This information can be recovered by computational tumor deconvolution methods, which would thus allow interrogation of immune subpopulations across the large collection of public bulk topic_3170sets. The goal of this Challenge is to evaluate the ability of computational methods to deconvolve bulk topic_3170, reflecting a mixture of cell types, into individual immune components. Methods will be assessed based on in vitro and in silico admixtures specifically gener...","","https://www.synapse.org/#!Synapse:syn15589870","completed","intermediate","1","","2019-06-26","2020-04-30","2023-06-23 00:00:00","2023-10-14 05:38:39" -"38","ctd2-pancancer-drug-activity","CTD2 Pancancer Drug Activity","Benchmark algorithms predicting drug targets from gene data for CTD2 Pancanc...","Over the last two years, the Columbia CTD2 Center developed PANACEA (Pancancer Analysis of Chemical Entity Activity), a comprehensive repertoire of dose response curves and molecular profiles representative of cellular responses to drug perturbations. PANACEA covers a broad spectrum of cellular contexts representative of poor outcome malignancies, including rare ones such as GIST sarcoma and gastroenteropancreatic neuroendocrine tumors (GEP-NETs). PANACEA is uniquely suited to support DREAM Challenges related to the elucidation of drug mechanism of action (MOA), drug sensitivity, and drug synergy. 
The goal of the CTD2 Pancancer Drug Activity DREAM Challenge is to foster the development and benchmarking of algorithms to predict targets of chemotherapeutic compounds from post-treatment transcriptional data.","","https://www.synapse.org/#!Synapse:syn20968331","completed","good_for_beginners","1","","2019-12-02","2020-02-13","2023-06-23 00:00:00","2023-10-20 23:11:10" -"39","ctd2-beataml","CTD2 BeatAML","Seeking New Drug Targets for Precision AML Treatment for CTD2 BeatAML","In the era of precision medicine, AML patients have few therapeutic options, with “7 + 3” induction chemotherapy having been the standard for decades (Bertoli et al. 2017). While several agents targeting the myeloid marker CD33 or alterations in FLT3 or IDH2 have demonstrated efficacy in patients (Wei and Tiong 2017), responses are uncertain in some populations (Castaigne et al. 2012) and relapse remains prevalent (Stone et al. 2017). These drugs highlight both the promise of targeted therapies in AML and the urgent need for additional treatment options that are tailored to more refined patient subpopulations in order to achieve durable responses. The BeatAML initiative was launched as a comprehensive study of the relationship between molecular alterations and ex-vivo drug sensitivity in patients with AML. One of the primary goals of this multi-center study was to develop a discovery cohort that could yield new drug target hypotheses and predictive biomarkers of therapeutic respon...","","https://www.synapse.org/#!Synapse:syn20940518","completed","good_for_beginners","1","","2019-12-19","2020-04-28","2023-06-23 00:00:00","2023-10-14 05:38:42" -"40","metadata-automation","Metadata Automation","Semi-Automating Metadata Annotation for Enhanced Data Sharing in Cancer Research","The Cancer Research Data Commons (CRDC) will collate data across diverse groups of cancer researchers, each collecting biomedical data in different formats. 
This means the data must be retrospectively harmonized and transformed to enable this data to be submitted. In addition, to be findable by the broader scientific community, coherent information (metadata) is necessary about the data fields and values. Coherent metadata annotation of the data fields and their values can enable computational data transformation, query, and analysis. Creation of this type of descriptive metadata can require biomedical expertise to determine the best annotations and thus is a time-consuming and manual task which is both an obstacle and a bottleneck in data sharing and submissions. Goal-Using structured biomedical data files, challenge participants will develop tools to semi-automate annotation of metadata fields and values, using available research data annotations (e.g. caDSR CDEs) as well as es...","","https://www.synapse.org/#!Synapse:syn18065891","completed","intermediate","1","","2020-01-14","2020-06-02","2023-06-23 00:00:00","2023-10-14 05:38:42" -"41","automated-scoring-of-radiographic-joint-damage","Automated Scoring of Radiographic Joint Damage","Develop automated method to quantify rheumatoid arthritis joint damage for A...","The purpose of the RA2-DREAM Challenge is to develop an automated method to quickly and accurately quantify the degree of joint damage associated with rheumatoid arthritis (RA). Based on radiographs of the hands and feet, a novel, automated scoring method could be applied broadly for patient care and research. We challenge participants to develop algorithms to automatically assess joint space narrowing and erosions using a large set of existing radiographs with damage scores generated by visual assessment of images by trained readers using standard protocols. 
The end result will be a generalizable, publicly available, automated method to generate accurate, reproducible and unbiased RA damage scores to replace the current tedious, expensive, and non-scalable method of scoring by human visual inspection.","","https://www.synapse.org/#!Synapse:syn20545111","completed","intermediate","1","","2019-11-04","2020-05-21","2023-06-23 00:00:00","2023-10-18 00:38:55" -"42","beat-pd","BEAT-PD","Develop mobile sensors to remotely monitor Parkinson's disease for BEAT-PD","Recent advances in mobile health have demonstrated great potential to leverage sensor-based technologies for quantitative, remote monitoring of health and disease-particularly for diseases affecting motor function such as Parkinson's disease. Such approaches have been rolled out using research-grade wearable sensors and, increasingly, through the use of smartphones and consumer wearables, such as smart watches and fitness trackers. These devices not only provide the ability to measure much more detailed disease phenotypes but also provide the ability to follow patients longitudinally with much higher frequency than is possible through clinical exams. However, the conversion of sensor-based data streams into digital biomarkers is complex and no methodological standards have yet evolved to guide this process. Parkinson's disease (PD) is a neurodegenerative disease that primarily affects the motor system but also exhibits other symptoms. 
Typical motor symptoms of the disease include...","","https://www.synapse.org/#!Synapse:syn20825169","completed","intermediate","1","","2020-01-13","2020-05-13","2023-06-23 00:00:00","2023-10-14 05:38:45" -"43","ctd2-pancancer-chemosensitivity","CTD2 Pancancer Chemosensitivity","Predict drug sensitivity from cell line gene expression for CTD2 Pancancer C...","Over the last two years, the Columbia CTD2 Center developed PANACEA (Pancancer Analysis of Chemical Entity Activity), a comprehensive repertoire of dose response curves and molecular profiles representative of cellular responses to drug perturbations. PANACEA covers a broad spectrum of cellular contexts representative of poor outcome malignancies, including rare ones such as GIST sarcoma and gastroenteropancreatic neuroendocrine tumors (GEP-NETs). PANACEA is uniquely suited to support DREAM Challenges related to the elucidation of drug mechanism of action (MOA), drug sensitivity, and drug synergy. The goal of this Challenge is to foster development and benchmarking of algorithms to predict the sensitivity, as measured by the area under the dose-response curve, of a cell line to a compound based on the baseline transcriptional profiles of the cell line. The drug perturbational RNAseq profiles of 11 cell lines for 30 chosen compounds will be provided to challenge participants, with...","","https://www.synapse.org/#!Synapse:syn21763589","completed","good_for_beginners","1","","2020-04-28","2020-07-27","2023-06-23 00:00:00","2023-10-14 05:38:45" -"44","ehr-dream-challenge-covid-19","EHR DREAM Challenge: COVID-19","Develop tools to predict COVID-19 risk without sharing data for EHR DREAM Ch...","The rapid rise of COVID-19 has challenged healthcare globally. The underlying risks and outcomes of infection are still incompletely characterized even as the world surpasses 4 million infections. 
Due to the importance and emergent need for better understanding of the condition and the development of patient specific clinical risk scores and early warning tools, we have developed a platform to support testing analytic and machine learning hypotheses on clinical data without data sharing as a platform to rapidly discover and implement approaches for care. We have previously applied this approach in the successful EHR DREAM Challenge focusing on Patient Mortality Prediction with UW Medicine. We have the goal of incorporating machine learning and predictive algorithms into clinical care and COVID-19 is an important and highly urgent challenge. In our first iteration, we will facilitate understanding risk factors that lead to a positive test utilizing electronic health recorded dat...","","https://www.synapse.org/#!Synapse:syn21849255","completed","intermediate","1","https://doi.org/10.1001/jamanetworkopen.2021.24946","2020-04-30","2021-07-01","2023-06-23 00:00:00","2023-11-01 14:57:29" +"33","preterm-birth-prediction-transcriptomics","Preterm Birth Prediction - Transcriptomics","Developing accurate, inexpensive molecular clock to determine gestational age for preterm birth prediction","A basic need in pregnancy care is to establish gestational age, and inaccurate estimates may lead to unnecessary interventions and sub-optimal patient management. Current approaches to establish gestational age rely on patient's recollection of her last menstrual period and/or ultrasound, with the latter being not only costly but also less accurate if not performed during the first trimester of pregnancy. Therefore development of an inexpensive and accurate molecular clock of pregnancy would be of benefit to patients and health care systems. Participants in sub-challenge 1 (Prediction of gestational age) will be given whole blood gene topic_3170 collected from pregnant women to develop prediction models for the gestational age at blood draw. 
Another challenge in obstetrics, in both low and high-income countries, is identification and treatment of women at risk of developing the ‘great obstetrical syndromes‘. Of these, preterm birth (PTB), defined as giving birth prior to completio...","","https://www.synapse.org/#!Synapse:syn18380862","completed","good_for_beginners","1","","2019-05-04","2019-12-05","2023-06-23 00:00:00","2023-10-14 05:38:36" +"34","single-cell-signaling-in-breast-cancer","Single-Cell Signaling in Breast Cancer","Exploring heterogeneous signaling in single cancer cells","Signaling underlines nearly every cellular event. Individual cells, even if genetically identical, respond to perturbation in different ways. This underscores the relevance of cellular heterogeneity, in particular in how cells respond to drugs. This is of high relevance since the fact that a subset of cells do not respond (or only weakly) to drugs can render this drug an ineffective treatment. In spite of its relevance to many diseases, comprehensive studies on the heterogeneous signaling in single cells are still lacking. We have generated the, to our knowledge, currently largest single cell signaling dataset on a panel of 67 well-characterized breast cancer cell lines by mass cytometry (3'015 conditions, ~80 mio single cells, 38 markers; Bandura et al. 2009; Bendall et al., 2011; Bodenmiller et al., 2012; Lun et al., 2017; Lun et al., 2019). These cell lines are, among others, also characterized at the genomic, transcriptomic, and proteomic level (Marcotte et al., 2016). 
We ask ...","","https://www.synapse.org/#!Synapse:syn20366914","completed","intermediate","1","","2018-08-20","2019-11-15","2023-06-23 00:00:00","2023-10-14 05:38:37" +"35","ehr-dream-challenge-patient-mortality-prediction","EHR DREAM Challenge: Patient Mortality Prediction","New tools to reconstruct cell lineages from CRISPR mutations","The recent advent of new CRISPR-based molecular tools allows the reconstruction of cell lineages based on the phylogenetical analysis of DNA mutations induced by CRISPR during development and promises to solve the lineage of complex model organisms at single-cell resolution (see image from McKenna et al Science 2016). To date, however, no lineage reconstruction algorithms have been rigorously examined for their performance/robustness across diverse molecular tools, datasets, and number of cells/size of lineage trees. It also remains unclear whether new Machine-Learning algorithms that go beyond the classical ones developed for reconstructing phylogenetic trees, could consistently reconstruct cell lineages to a high degree of accuracy. The challenge-a partnership between The Allen Institute and DREAM-will comprise 3 subchallenges that consist of reconstructing cell lineage trees of different sizes and nature. In subchallenge 1, participants will be given experimental molecular data...","","https://www.synapse.org/#!Synapse:syn18405991","completed","intermediate","1","https://doi.org/10.1093/jamia/ocad159","2019-09-09","2020-01-23","2023-06-23 00:00:00","2023-11-02 18:25:23" +"36","allen-institute-cell-lineage-reconstruction","Allen Institute Cell Lineage Reconstruction","New tools enable reconstructing complex cell lineages at single-cell resolution","The recent advent of new CRISPR-based molecular tools allows the reconstruction of cell lineages based on the phylogenetical analysis of DNA mutations induced by CRISPR during development and promises to solve the lineage of complex model organisms at single-cell resolution. 
To date, however, no lineage reconstruction algorithms have been rigorously examined for their performance/robustness across diverse molecular tools, datasets, and number of cells/size of lineage trees. It also remains unclear whether new Machine-Learning algorithms that go beyond the classical ones developed for reconstructing phylogenetic trees, could consistently reconstruct cell lineages to a high degree of accuracy. The challenge-a partnership between The Allen Institute and DREAM-will comprise 3 subchallenges that consist of reconstructing cell lineage trees of different sizes and nature. In subchallenge 1, participants will be given experimental molecular data to reconstruct in vitro cell lineages of l...","","https://www.synapse.org/#!Synapse:syn20692755","completed","intermediate","1","","2019-10-15","2020-02-06","2023-06-23 00:00:00","2023-11-02 18:25:24" +"37","tumor-deconvolution","Tumor Deconvolution","Assess computational methods to deconvolve bulk tumor data into immune components","The extent of stromal and immune cell infiltration within solid tumors has prognostic and predictive significance. Unfortunately, expression profiling of tumors has, until very recently, largely been undertaken using bulk techniques (e.g., microarray and RNA-seq). Unlike single-cell methods (e.g., single-cell RNA-seq, FACS, mass cytometry, or immunohistochemistry), bulk approaches average expression across all cells (cancer, stromal, and immune) within the sample and, hence, do not directly quantitate tumor infiltration. This information can be recovered by computational tumor deconvolution methods, which would thus allow interrogation of immune subpopulations across the large collection of public bulk topic_3170sets. The goal of this Challenge is to evaluate the ability of computational methods to deconvolve bulk topic_3170, reflecting a mixture of cell types, into individual immune components. 
Methods will be assessed based on in vitro and in silico admixtures specifically gener...","","https://www.synapse.org/#!Synapse:syn15589870","completed","intermediate","1","","2019-06-26","2020-04-30","2023-06-23 00:00:00","2023-10-14 05:38:39" +"38","ctd2-pancancer-drug-activity","CTD2 Pancancer Drug Activity","Benchmark algorithms predicting drug targets from gene data","Over the last two years, the Columbia CTD2 Center developed PANACEA (Pancancer Analysis of Chemical Entity Activity), a comprehensive repertoire of dose response curves and molecular profiles representative of cellular responses to drug perturbations. PANACEA covers a broad spectrum of cellular contexts representative of poor outcome malignancies, including rare ones such as GIST sarcoma and gastroenteropancreatic neuroendocrine tumors (GEP-NETs). PANACEA is uniquely suited to support DREAM Challenges related to the elucidation of drug mechanism of action (MOA), drug sensitivity, and drug synergy. The goal of the CTD2 Pancancer Drug Activity DREAM Challenge is to foster the development and benchmarking of algorithms to predict targets of chemotherapeutic compounds from post-treatment transcriptional data.","","https://www.synapse.org/#!Synapse:syn20968331","completed","good_for_beginners","1","","2019-12-02","2020-02-13","2023-06-23 00:00:00","2023-10-20 23:11:10" +"39","ctd2-beataml","CTD2 BeatAML","Seeking new drug targets for precision AML treatment","In the era of precision medicine, AML patients have few therapeutic options, with “7 + 3” induction chemotherapy having been the standard for decades (Bertoli et al. 2017). While several agents targeting the myeloid marker CD33 or alterations in FLT3 or IDH2 have demonstrated efficacy in patients (Wei and Tiong 2017), responses are uncertain in some populations (Castaigne et al. 2012) and relapse remains prevalent (Stone et al. 2017). 
These drugs highlight both the promise of targeted therapies in AML and the urgent need for additional treatment options that are tailored to more refined patient subpopulations in order to achieve durable responses. The BeatAML initiative was launched as a comprehensive study of the relationship between molecular alterations and ex-vivo drug sensitivity in patients with AML. One of the primary goals of this multi-center study was to develop a discovery cohort that could yield new drug target hypotheses and predictive biomarkers of therapeutic respon...","","https://www.synapse.org/#!Synapse:syn20940518","completed","good_for_beginners","1","","2019-12-19","2020-04-28","2023-06-23 00:00:00","2023-10-14 05:38:42" +"40","metadata-automation","Metadata Automation","Semi-automating metadata annotation for enhanced data sharing in cancer research","The Cancer Research Data Commons (CRDC) will collate data across diverse groups of cancer researchers, each collecting biomedical data in different formats. This means the data must be retrospectively harmonized and transformed to enable this data to be submitted. In addition, to be findable by the broader scientific community, coherent information (metadata) is necessary about the data fields and values. Coherent metadata annotation of the data fields and their values can enable computational data transformation, query, and analysis. Creation of this type of descriptive metadata can require biomedical expertise to determine the best annotations and thus is a time-consuming and manual task which is both an obstacle and a bottleneck in data sharing and submissions. Goal-Using structured biomedical data files, challenge participants will develop tools to semi-automate annotation of metadata fields and values, using available research data annotations (e.g. 
caDSR CDEs) as well as es...","","https://www.synapse.org/#!Synapse:syn18065891","completed","intermediate","1","","2020-01-14","2020-06-02","2023-06-23 00:00:00","2023-10-14 05:38:42" +"41","automated-scoring-of-radiographic-joint-damage","Automated Scoring of Radiographic Joint Damage","Develop automated method to quantify rheumatoid arthritis joint damage","The purpose of the RA2-DREAM Challenge is to develop an automated method to quickly and accurately quantify the degree of joint damage associated with rheumatoid arthritis (RA). Based on radiographs of the hands and feet, a novel, automated scoring method could be applied broadly for patient care and research. We challenge participants to develop algorithms to automatically assess joint space narrowing and erosions using a large set of existing radiographs with damage scores generated by visual assessment of images by trained readers using standard protocols. The end result will be a generalizable, publicly available, automated method to generate accurate, reproducible and unbiased RA damage scores to replace the current tedious, expensive, and non-scalable method of scoring by human visual inspection.","","https://www.synapse.org/#!Synapse:syn20545111","completed","intermediate","1","","2019-11-04","2020-05-21","2023-06-23 00:00:00","2023-10-18 00:38:55" +"42","beat-pd","BEAT-PD","Develop mobile sensors to remotely monitor Parkinson's disease","Recent advances in mobile health have demonstrated great potential to leverage sensor-based technologies for quantitative, remote monitoring of health and disease-particularly for diseases affecting motor function such as Parkinson's disease. Such approaches have been rolled out using research-grade wearable sensors and, increasingly, through the use of smartphones and consumer wearables, such as smart watches and fitness trackers. 
These devices not only provide the ability to measure much more detailed disease phenotypes but also provide the ability to follow patients longitudinally with much higher frequency than is possible through clinical exams. However, the conversion of sensor-based data streams into digital biomarkers is complex and no methodological standards have yet evolved to guide this process. Parkinson's disease (PD) is a neurodegenerative disease that primarily affects the motor system but also exhibits other symptoms. Typical motor symptoms of the disease include...","","https://www.synapse.org/#!Synapse:syn20825169","completed","intermediate","1","","2020-01-13","2020-05-13","2023-06-23 00:00:00","2023-10-14 05:38:45" +"43","ctd2-pancancer-chemosensitivity","CTD2 Pancancer Chemosensitivity","Predict drug sensitivity from cell line gene expression","Over the last two years, the Columbia CTD2 Center developed PANACEA (Pancancer Analysis of Chemical Entity Activity), a comprehensive repertoire of dose response curves and molecular profiles representative of cellular responses to drug perturbations. PANACEA covers a broad spectrum of cellular contexts representative of poor outcome malignancies, including rare ones such as GIST sarcoma and gastroenteropancreatic neuroendocrine tumors (GEP-NETs). PANACEA is uniquely suited to support DREAM Challenges related to the elucidation of drug mechanism of action (MOA), drug sensitivity, and drug synergy. The goal of this Challenge is to foster development and benchmarking of algorithms to predict the sensitivity, as measured by the area under the dose-response curve, of a cell line to a compound based on the baseline transcriptional profiles of the cell line. 
The drug perturbational RNAseq profiles of 11 cell lines for 30 chosen compounds will be provided to challenge participants, with...","","https://www.synapse.org/#!Synapse:syn21763589","completed","good_for_beginners","1","","2020-04-28","2020-07-27","2023-06-23 00:00:00","2023-10-14 05:38:45" +"44","ehr-dream-challenge-covid-19","EHR DREAM Challenge: COVID-19","Develop tools to predict COVID-19 risk without sharing data","The rapid rise of COVID-19 has challenged healthcare globally. The underlying risks and outcomes of infection are still incompletely characterized even as the world surpasses 4 million infections. Due to the importance and emergent need for better understanding of the condition and the development of patient specific clinical risk scores and early warning tools, we have developed a platform to support testing analytic and machine learning hypotheses on clinical data without data sharing as a platform to rapidly discover and implement approaches for care. We have previously applied this approach in the successful EHR DREAM Challenge focusing on Patient Mortality Prediction with UW Medicine. We have the goal of incorporating machine learning and predictive algorithms into clinical care and COVID-19 is an important and highly urgent challenge. 
In our first iteration, we will facilitate understanding risk factors that lead to a positive test utilizing electronic health recorded dat...","","https://www.synapse.org/#!Synapse:syn21849255","completed","intermediate","1","https://doi.org/10.1001/jamanetworkopen.2021.24946","2020-04-30","2021-07-01","2023-06-23 00:00:00","2023-11-01 14:57:29" "45","anti-pd1-response-prediction","Anti-PD1 Response Prediction","Predicting lung cancer response to immuno-oncology therapy","While durable responses and prolonged survival have been demonstrated in some lung cancer patients treated with immuno-oncology (I-O) anti-PD-1 therapy, there remains a need to improve the ability to predict which patients are more likely to receive benefit from treatment with I-O. The goal of this challenge is to leverage clinical and biomarker data to develop predictive models of response to I-O therapy in lung cancer and ultimately gain insights that may facilitate potential novel monotherapies or combinations with I-O.","","https://www.synapse.org/#!Synapse:syn18404605","completed","intermediate","1","","2020-11-17","2021-02-25","2023-06-23 00:00:00","2023-11-02 18:25:16" "46","brats-2021-challenge","BraTS 2021 Challenge","Developing ML methods to analyze brain tumor MRI scans","Glioblastoma, and diffuse astrocytic glioma with molecular features of glioblastoma (WHO Grade 4 astrocytoma), are the most common and aggressive malignant primary tumor of the central nervous system in adults, with extreme intrinsic heterogeneity in appearance, shape, and histology. Glioblastoma patients have very poor prognosis, and the current standard of care treatment comprises surgery, followed by radiotherapy and chemotherapy. 
The International Brain Tumor Segmentation (BraTS) Challenges —which have been running since 2012— assess state-of-the-art machine learning (ML) methods used for brain tumor image analysis in mpMRI scans.","","https://www.synapse.org/#!Synapse:syn25829067","completed","advanced","1","","2021-07-07","2021-10-15","2023-06-23 00:00:00","2023-10-14 05:38:48" "47","cancer-data-registry-nlp","Cancer Data Registry NLP","Predicting lung cancer response to immuno-oncology therapy","A critical bottleneck in translational and clinical research is access to large volumes of high-quality clinical data. While structured data exist in medical EHR systems, a large portion of patient information including patient status, treatments, and outcomes is contained in unstructured text fields. Research in Natural Language Processing (NLP) aims to unlock this hidden and often inaccessible information. However, numerous challenges exist in developing and evaluating NLP methods, much of it centered on having “gold-standard” metrics for evaluation, and access to data that may contain personal health information (PHI). This DREAM Challenge will focus on the development and evaluation of of NLP algorithms that can improve clinical trial matching and recruitment.","","https://www.synapse.org/#!Synapse:syn18361217","upcoming","intermediate","1","","\N","\N","2023-06-23 00:00:00","2023-10-14 05:38:49" "48","barda-community-challenge-pediatric-covid-19-data-challenge","BARDA Community Challenge - Pediatric COVID-19 Data Challenge","Models to predict severe COVID-19 in children sought","While most children with COVID-19 are asymptomatic or have mild symptoms, healthcare providers have difficulty determining which among their pediatric patients will progress to moderate or severe COVID-19 early in the progression. Some of these patients develop multisystem inflammatory syndrome in children (MIS-C), a life-threatening inflammation of organs and tissues. 
Methods to distinguish children at risk for severe COVID-19 complications, including conditions such as MIS-C, are needed for earlier interventions to improve pediatric patient outcomes. Multiple HHS divisions are coming together for a data challenge competition that will leverage de-identified electronic health record data to develop, train and validate computational models that can predict severe COVID-19 complications in children, equipping healthcare providers with the information and tools they need to identify pediatric patients at risk.","","https://www.synapse.org/#!Synapse:syn25875374/wiki/611225","completed","intermediate","1","","2021-08-19","2021-12-17","2023-06-23 00:00:00","2023-10-14 05:38:50" -"49","brats-continuous-evaluation","BraTS Continuous Evaluation","Seeking Innovations To Improve Brain Tumor Diagnosis And Treatment","Brain tumors are among the deadliest types of cancer. Specifically, glioblastoma, and diffuse astrocytic glioma with molecular features of glioblastoma (WHO Grade 4 astrocytoma), are the most common and aggressive malignant primary tumor of the central nervous system in adults, with extreme intrinsic heterogeneity in appearance, shape, and histology, with a median survival of approximately 15 months. Brain tumors in general are challenging to diagnose, hard to treat and inherently resistant to conventional therapy because of the challenges in delivering drugs to the brain, as well as the inherent high heterogeneity of these tumors in their radiographic, morphologic, and molecular landscapes. Years of extensive research to improve diagnosis, characterization, and treatment have decreased mortality rates in the U.S by 7% over the past 30 years. 
Although modest, these research innovations have not translated to improvements in survival for adults and children in low-and middle-income...","","https://www.synapse.org/brats_ce","completed","advanced","1","","2022-01-01","\N","2023-06-23 00:00:00","2023-10-14 05:38:51" +"49","brats-continuous-evaluation","BraTS Continuous Evaluation","Seeking innovations to improve brain tumor diagnosis and treatment","Brain tumors are among the deadliest types of cancer. Specifically, glioblastoma, and diffuse astrocytic glioma with molecular features of glioblastoma (WHO Grade 4 astrocytoma), are the most common and aggressive malignant primary tumor of the central nervous system in adults, with extreme intrinsic heterogeneity in appearance, shape, and histology, with a median survival of approximately 15 months. Brain tumors in general are challenging to diagnose, hard to treat and inherently resistant to conventional therapy because of the challenges in delivering drugs to the brain, as well as the inherent high heterogeneity of these tumors in their radiographic, morphologic, and molecular landscapes. Years of extensive research to improve diagnosis, characterization, and treatment have decreased mortality rates in the U.S by 7% over the past 30 years. Although modest, these research innovations have not translated to improvements in survival for adults and children in low-and middle-income...","","https://www.synapse.org/brats_ce","completed","advanced","1","","2022-01-01","\N","2023-06-23 00:00:00","2023-10-14 05:38:51" "50","fets-2022","FeTS 2022","Federated Learning Challenge 2022: Advancing Brain Tumor Segmentation Algorithms","FeTS 2022 focuses on benchmarking methods for federated learning (FL), and particularly i) weight aggregation methods for federated training, and ii) algorithmic generalizability on out-of-sample data based on federated evaluation. 
In line with its last instance (FeTS 2021-the 1st FL challenge ever organized), FeTS 2022 targets the task of brain tumor segmentation and builds upon i) the centralized dataset of >8,000 clinically-acquired multi-institutional MRI scans (from the RSNA-ASNR-MICCAI BraTS 2021 challenge) with their real-world partitioning, and ii) the collaborative network of remote independent institutions included in a real-world federation. Participants are welcome to compete in either of the two challenge tasks- Task 1 (“Federated Training”) seeks effective weight aggregation methods for the creation of a consensus model given a pre-defined segmentation algorithm for training, while also (optionally) accounting for network outages. Task 2 (“Federated Evaluation”) see...","","https://www.synapse.org/#!Synapse:syn28546456/wiki/617093","completed","advanced","1","","2022-04-08","2022-08-15","2023-06-23 00:00:00","2023-10-18 00:36:14" "51","random-promotor","Random Promotor","Deciphering Gene Regulation: Training Models to Predict Gene Expression Patterns","Decoding how gene expression is regulated is critical to understanding disease. Regulatory DNA is decoded by the cell in a process termed “cis-regulatory logic”, where proteins called Transcription Factors (TFs) bind to specific DNA sequences within the genome and work together to produce as output a level of gene expression for downstream adjacent genes. This process is exceedingly complex to model as a large number of parameters is needed to fully describe the process (see Rationale, de Boer et al. 2020; Zeitingler J. 2020). Understanding the cis-regulatory logic of the human genome is an important goal and would provide insight into the origins of many diseases. However, learning models from human data is challenging due to limitations in the diversity of sequences present within the human genome (e.g. 
extensive repetitive DNA), the vast number of cell types that differ in how they interpret regulatory DNA, limited reporter assay data, and substantial technical biases present ...","","https://www.synapse.org/#!Synapse:syn28469146/wiki/617075","completed","intermediate","1","","2022-05-02","2022-08-07","2023-06-23 00:00:00","2023-10-14 05:38:53" -"52","preterm-birth-prediction-microbiome","Preterm Birth Prediction - Microbiome","Seeking Innovations To Improve Brain Tumor Diagnosis And Treatment","Globally, about 11% of infants every year are born preterm, defined as birth prior to 37 weeks of gestation, totaling nearly 15 million births.(5) In addition to the emotional and financial toll on families, preterm births have higher rates of neonatal death, nearly 1 million deaths each year, and long-term health consequences for some children. Infants born preterm are at risk for a variety of adverse outcomes, such as respiratory illnesses, cerebral palsy, infections, and blindness, with infants born very preterm (i.e., before 32 weeks) at increased risk of these conditions.(6) The ability to accurately predict which women are at a higher risk for preterm birth would help healthcare providers to treat in a timely manner those at higher risk of delivering preterm. Currently available treatments for pregnant women at risk of preterm delivery include corticosteroids for fetal maturation and magnesium sulfate provided prior to 32 weeks to prevent cerebral palsy.(7) There are several...","","https://www.synapse.org/#!Synapse:syn26133770/wiki/612541","completed","advanced","1","","2022-07-19","2022-09-16","2023-06-23 00:00:00","2023-10-14 05:38:54" -"53","finrisk","FINRISK - Heart Failure and Microbiome","FINRISK - Heart Failure and Microbiome: (No headline provided)","Cardiovascular diseases are the leading cause of death both in men and women worldwide. 
Heart failure (HF) is the most common form of heart disease, characterised by the heart's inability to pump a sufficient supply of blood to meet the needs of the body. The lifetime risk of developing HF is roughly 20%, yet, it remains difficult to diagnose due to its and a lack of agreement of diagnostic criteria. As the diagnosis of HF is dependent on ascertainment of clinical histories and appropriate screening of symptomatic individuals, identifying those at risk of HF is essential. This DREAM challenge focuses on the prediction of HF using a combination of gut microbiome and clinical variables. This challenge is designed to predict incident risk for heart failure in a large human population study of Finnish adults, FINRISK 2002 (Borodulin et al., 2018). The FINRISK study has been conducted in Finland to investigate the risk factors for cardiovascular disease every 5 years since 1972. A rand...","","https://www.synapse.org/#!Synapse:syn27130803/wiki/616705","completed","advanced","1","","2022-09-20","2023-01-30","2023-06-23 00:00:00","2023-10-16 21:19:55" +"52","preterm-birth-prediction-microbiome","Preterm Birth Prediction - Microbiome","Seeking innovations to improve brain tumor diagnosis and treatment","Globally, about 11% of infants every year are born preterm, defined as birth prior to 37 weeks of gestation, totaling nearly 15 million births.(5) In addition to the emotional and financial toll on families, preterm births have higher rates of neonatal death, nearly 1 million deaths each year, and long-term health consequences for some children. 
Infants born preterm are at risk for a variety of adverse outcomes, such as respiratory illnesses, cerebral palsy, infections, and blindness, with infants born very preterm (i.e., before 32 weeks) at increased risk of these conditions.(6) The ability to accurately predict which women are at a higher risk for preterm birth would help healthcare providers to treat in a timely manner those at higher risk of delivering preterm. Currently available treatments for pregnant women at risk of preterm delivery include corticosteroids for fetal maturation and magnesium sulfate provided prior to 32 weeks to prevent cerebral palsy.(7) There are several...","","https://www.synapse.org/#!Synapse:syn26133770/wiki/612541","completed","advanced","1","","2022-07-19","2022-09-16","2023-06-23 00:00:00","2023-10-14 05:38:54" +"53","finrisk","FINRISK - Heart Failure and Microbiome","Predict incident risk for heart failure in a large human population study of Finnish adults","Cardiovascular diseases are the leading cause of death both in men and women worldwide. Heart failure (HF) is the most common form of heart disease, characterised by the heart's inability to pump a sufficient supply of blood to meet the needs of the body. The lifetime risk of developing HF is roughly 20%, yet, it remains difficult to diagnose due to its and a lack of agreement of diagnostic criteria. As the diagnosis of HF is dependent on ascertainment of clinical histories and appropriate screening of symptomatic individuals, identifying those at risk of HF is essential. This DREAM challenge focuses on the prediction of HF using a combination of gut microbiome and clinical variables. This challenge is designed to predict incident risk for heart failure in a large human population study of Finnish adults, FINRISK 2002 (Borodulin et al., 2018). The FINRISK study has been conducted in Finland to investigate the risk factors for cardiovascular disease every 5 years since 1972. 
A rand...","","https://www.synapse.org/#!Synapse:syn27130803/wiki/616705","completed","advanced","1","","2022-09-20","2023-01-30","2023-06-23 00:00:00","2023-10-16 21:19:55" "54","scrna-seq-and-scatac-seq-data-analysis","scRNA-seq and scATAC-seq Data Analysis","Assess computational methods for scRNA-seq and scATAC-seq analysis","Understanding transcriptional regulation at individual cell resolution is fundamental to understanding complex biological systems such as tissues and organs. Emerging high-throughput sequencing technologies now allow for transcript quantification and chromatin accessibility at the single cell level. These technologies present unique challenges due to inherent data sparsity. Proper signal correction is key to accurate gene expression quantification via scRNA-seq, which propagates into downstream analyses such as differential gene expression analysis and cell-type identification. In the even more sparse scATAC-seq data, the correct identification of informative features is key to assessing cell heterogeneity at the chromatin level. The aims of this challenge will be two-fold- 1) To evaluate computational methods for signal correction and peak identification in scRNA-seq and scATAC-seq, respectively; 2) To assess the impact of these methods on downstream analysis","","https://www.synapse.org/#!Synapse:syn26720920/wiki/615338","completed","advanced","1","","2022-11-29","2023-02-08","2023-06-23 00:00:00","2023-10-14 05:38:56" -"55","cough-diagnostic-algorithm-for-tuberculosis","COugh Diagnostic Algorithm for Tuberculosis","Assess computational methods for scRNA-seq and scATAC-seq analysis","Tuberculosis (TB), a communicable disease caused by Mycobacterium tuberculosis, is a major cause of ill health and one of the leading causes of death worldwide. Until the COVID-19 pandemic, TB was the leading cause of death from a single infectious agent, ranking even above HIV/AIDS. 
In 2020, an estimated 9.9 million people fell ill with TB and 1.3 million died of TB worldwide. However, approximately 40% of people with TB were not diagnosed or reported to public health authorities because of challenges in accessing health facilities or failure to be tested or treated when they do. The development of low-cost, non-invasive digital screening tools may improve some of the gaps in diagnosis. As cough is a common symptom of TB, it has the potential to be used as a biomarker for diagnosis of disease. Several previous studies have demonstrated the potential for cough sounds to be used to screen for TB[1-3], though these were typically done in small samples or limited settings. Further de...","","https://www.synapse.org/#!Synapse:syn31472953/wiki/617828","completed","advanced","1","","2022-10-16","2023-02-13","2023-06-23 00:00:00","2023-10-14 05:38:57" -"56","nih-long-covid-computational-challenge","NIH Long COVID Computational Challenge","Understanding Prevalence and Outcomes of Post-COVID Syndrome","The overall prevalence of post-acute sequelae of SARS-CoV-2 (PASC) is currently unknown, but there is growing evidence that more than half of COVID-19 survivors experience at least one symptom of PASC/Long COVID at six months after recovery of the acute illness. Reports also reflect an underlying heterogeneity of symptoms, multi-organ involvement, and persistence of PASC/Long COVID in some patients. Research is ongoing to understand prevalence, duration, and clinical outcomes of PASC/Long COVID. Symptoms of fatigue, cognitive impairment, shortness of breath, and cardiac damage, among others, have been observed in patients who had only mild initial disease. 
The breadth and complexity of data created in today's health care encounters require advanced analytics to extract meaning from longitudinal data on symptoms, laboratory results, images, functional tests, genomics, mobile health/wearable devices, written notes, electronic health records (EHR), and other relevant data types. Adva...","","https://www.synapse.org/#!Synapse:syn33576900/wiki/618451","completed","intermediate","1","","2022-08-25","2022-12-15","2023-06-23 00:00:00","2023-10-18 00:39:03" +"55","cough-diagnostic-algorithm-for-tuberculosis","COugh Diagnostic Algorithm for Tuberculosis","Predict TB status using features extracted from audio of elicited coughs","Tuberculosis (TB), a communicable disease caused by Mycobacterium tuberculosis, is a major cause of ill health and one of the leading causes of death worldwide. Until the COVID-19 pandemic, TB was the leading cause of death from a single infectious agent, ranking even above HIV/AIDS. In 2020, an estimated 9.9 million people fell ill with TB and 1.3 million died of TB worldwide. However, approximately 40% of people with TB were not diagnosed or reported to public health authorities because of challenges in accessing health facilities or failure to be tested or treated when they do. The development of low-cost, non-invasive digital screening tools may improve some of the gaps in diagnosis. As cough is a common symptom of TB, it has the potential to be used as a biomarker for diagnosis of disease. Several previous studies have demonstrated the potential for cough sounds to be used to screen for TB[1-3], though these were typically done in small samples or limited settings. 
Further de...","","https://www.synapse.org/#!Synapse:syn31472953/wiki/617828","completed","advanced","1","","2022-10-16","2023-02-13","2023-06-23 00:00:00","2023-10-14 05:38:57" +"56","nih-long-covid-computational-challenge","NIH Long COVID Computational Challenge","Understanding prevalence and outcomes of post-COVID syndrome","The overall prevalence of post-acute sequelae of SARS-CoV-2 (PASC) is currently unknown, but there is growing evidence that more than half of COVID-19 survivors experience at least one symptom of PASC/Long COVID at six months after recovery of the acute illness. Reports also reflect an underlying heterogeneity of symptoms, multi-organ involvement, and persistence of PASC/Long COVID in some patients. Research is ongoing to understand prevalence, duration, and clinical outcomes of PASC/Long COVID. Symptoms of fatigue, cognitive impairment, shortness of breath, and cardiac damage, among others, have been observed in patients who had only mild initial disease. The breadth and complexity of data created in today's health care encounters require advanced analytics to extract meaning from longitudinal data on symptoms, laboratory results, images, functional tests, genomics, mobile health/wearable devices, written notes, electronic health records (EHR), and other relevant data types. 
Adva...","","https://www.synapse.org/#!Synapse:syn33576900/wiki/618451","completed","intermediate","1","","2022-08-25","2022-12-15","2023-06-23 00:00:00","2023-10-18 00:39:03" "57","bridge2ai","Bridge2AI","What makes a good color palette?","What makes a good color palette?","","","upcoming","good_for_beginners","1","","\N","\N","2023-06-23 00:00:00","2023-10-14 05:38:58" "58","rare-x-open-data-science","RARE-X Open Data Science","Unlocking rare disease mysteries through open science collaboration","The Xcelerate RARE-A Rare Disease Open Science Data Challenge is bringing together researchers and data scientists in a collaborative and competitive environment to make the best use of patient-provided data to solve big unknowns in healthcare. The Challenge will launch to researchers in late May 2023, focused on rare pediatric neurodevelopmental diseases.","","https://www.synapse.org/#!Synapse:syn51198355/wiki/621435","completed","intermediate","1","","2023-05-17","2023-08-16","2023-06-23 00:00:00","2023-10-14 05:38:59" "59","cagi5-regulation-saturation","CAGI5: Regulation saturation","Predicting effects of variants in disease-linked enhancers and promoters","17,500 single nucleotide variants (SNVs) in 5 human disease associated enhancers (including IRF4, IRF6, MYC, SORT1) and 9 promoters (including TERT, LDLR, F9, HBG1) were assessed in a saturation mutagenesis massively parallel reporter assay. Promoters were cloned into a plasmid upstream of a tagged reporter construct, and reporter expression was measured relative to the plasmid DNA to determine the impact of promoter variants. Enhancers were placed upstream of a minimal promoter and assayed similarly. 
The challenge is to predict the functional effects of these variants in the regulatory regions as measured from the reporter expression.","","https://genomeinterpretation.org/cagi5-regulation-saturation.html","completed","intermediate","14","","2018-01-04","2018-05-03","2023-06-23 00:00:00","2023-11-01 23:42:37" @@ -65,42 +65,42 @@ "64","cagi5-annotate-all-missense","CAGI5: Annotate all nonsynonymous variants","Annotate all nonsynonymous variants","dbNSFP describes 810,848,49 possible protein-altering variants in the human genome. The challenge is to predict the functional effect of every such variant. For the vast majority of these missense variants, the functional impact is not currently known, but experimental and clinical evidence are accruing rapidly. Rather than drawing upon a single discrete dataset as typical with CAGI, predictions will be assessed by comparing with experimental or clinical annotations made available after the prediction submission date, on an ongoing basis. if predictors assent, predictions will also incorporated into dbNSFP.","","https://genomeinterpretation.org/cagi5-annotate-all-missense.html","completed","intermediate","14","","2017-11-30","2018-05-09","2023-06-23 00:00:00","2023-10-14 05:39:04" "65","cagi5-gaa","CAGI5: GAA","Predict enzyme activity of GAA mutants in Pompe disease","Acid alpha-glucosidase (GAA) is a lysosomal alpha-glucosidase. Some mutations in GAA cause a rare disorder, Pompe disease, (Glycogen Storage Disease II). Rare GAA missense variants found in a human population sample have been assayed for enzymatic activity in transfected cell lysates. The assessment of this challenge will include evaluations that recognize novelty of approach. 
The challenge is to predict the fractional enzyme activity of each mutant protein compared to the wild-type enzyme.","","https://genomeinterpretation.org/cagi5-gaa.html","completed","intermediate","14","","2017-11-09","2018-04-25","2023-06-23 00:00:00","2023-10-14 05:39:04" "66","cagi5-chek2","CAGI5: CHEK2","Estimate CHEK2 gene variant probabilities in Latino breast cancer cases","Variants in the CHEK2 gene are associated with breast cancer. This challenge includes CHEK2 gene variants from approximately 1200 Latino breast cancer cases and 1200 ethnically matched controls. This challenge is to estimate the probability of each gene variant occurring in an individual from the cancer affected cohort.","","https://genomeinterpretation.org/cagi5-chek2.html","completed","intermediate","14","","2017-12-20","2018-04-24","2023-06-23 00:00:00","2023-10-14 05:39:07" -"67","cagi5-enigma","CAGI5: ENIGMA","Predicting cancer risk from BRCA1/2 gene variants","Breast cancer is the most prevalent cancer among women worldwide. The association between germline mutations in the BRCA1 and BRCA2 genes and the development of cancer has been well established. The most common high-risk mutations associated with breast cancer are those in the autosomal dominant breast cancer genes 1 and 2 (BRCA1 and BRCA2). Mutations in these genes are found in 1-3% of breast cancer cases. The challenge is to predict which variants are associated with increased risk for breast cancer.","","https://genomeinterpretation.org/cagi5-enigma.html","completed","intermediate","14","","2017-12-20","2018-05-01","2023-06-23 00:00:00","2023-10-14 05:39:08" -"68","cagi5-mapsy","CAGI5: MaPSy","Predicting the Impact of Genetic Variants on Splicing Mechanisms","The Massively Parallel Splicing Assay (MaPSy) approach was used to screen 797 reported exonic disease mutations using a mini-gene system, assaying both in vivo via transfection in tissue culture, and in vitro via incubation in cell nuclear extract. 
The challenge is to predict the degree to which a given variant causes changes in splicing.","","https://genomeinterpretation.org/cagi5-mapsy.html","completed","intermediate","14","","2017-11-29","2018-05-07","2023-06-23 00:00:00","2023-10-14 05:39:08" +"67","cagi5-enigma","CAGI5: ENIGMA","Predict cancer risk from BRCA1/2 gene variants","Breast cancer is the most prevalent cancer among women worldwide. The association between germline mutations in the BRCA1 and BRCA2 genes and the development of cancer has been well established. The most common high-risk mutations associated with breast cancer are those in the autosomal dominant breast cancer genes 1 and 2 (BRCA1 and BRCA2). Mutations in these genes are found in 1-3% of breast cancer cases. The challenge is to predict which variants are associated with increased risk for breast cancer.","","https://genomeinterpretation.org/cagi5-enigma.html","completed","intermediate","14","","2017-12-20","2018-05-01","2023-06-23 00:00:00","2023-10-14 05:39:08" +"68","cagi5-mapsy","CAGI5: MaPSy","Predict the Impact of Genetic Variants on Splicing Mechanisms","The Massively Parallel Splicing Assay (MaPSy) approach was used to screen 797 reported exonic disease mutations using a mini-gene system, assaying both in vivo via transfection in tissue culture, and in vitro via incubation in cell nuclear extract. The challenge is to predict the degree to which a given variant causes changes in splicing.","","https://genomeinterpretation.org/cagi5-mapsy.html","completed","intermediate","14","","2017-11-29","2018-05-07","2023-06-23 00:00:00","2023-10-14 05:39:08" "69","cagi5-vex-seq","CAGI5: Vex-seq","Predict splicing changes from variants in globin gene","A barcoding approach called Variant exon sequencing (Vex-seq) was applied to assess effect of 2,059 natural single nucleotide variants and short indels on splicing of a globin mini-gene construct transfected into HepG2 cells. 
This is reported as ΔΨ (delta PSI, or Percent Spliced In), between the variant Ψand the reference Ψ. The challenge is to predict ΔΨ for each variant.","","https://genomeinterpretation.org/cagi5-vex-seq.html","completed","intermediate","14","","2017-12-14","2018-05-02","2023-06-23 00:00:00","2023-10-16 17:51:58" "70","cagi5-sickkids5","CAGI5: SickKids clinical genomes","Predict genetic disorders from 30 child genomes and phenotypes.","This challenge involves 30 children with suspected genetic disorders who were referred for clinical genome sequencing. Predictors are given the 30 genome sequences, and are also provided with the phenotypic descriptions as shared with the diagnostic laboratory. The challenge is to predict what class of disease is associated with each genome, and which genome corresponds to which clinical description. Predictors may additionally identify the diagnostic variant(s) underlying the predictions, and identify predictive secondary variants conferring high risk of other diseases whose phenotypes are not reported in the clinical descriptions.","","https://genomeinterpretation.org/cagi5-sickkids5.html","completed","intermediate","14","","2017-12-22","2018-04-26","2023-06-23 00:00:00","2023-10-14 05:39:10" "71","cagi5-intellectual-disability","CAGI5: ID Panel","Predict phenotypes and variants from gene panel sequences","The challenge presented here is to use computational methods to predict a patient's clinical phenotype and the causal variant(s) based on analysis of their gene panel sequence data. Sequence data for 74 genes associated with intellectual disability (ID) and/or Autism spectrum disorders (ASD) from a cohort of 150 patients with a range of neurodevelopmental presentations (ID, autism, epilepsy, etc..) have been made available for this challenge. 
For each patient, predictors must report the causative variants and which of seven phenotypes are present.","","https://genomeinterpretation.org/cagi5-intellectual-disability.html","completed","intermediate","14","","2017-12-22","2018-04-30","2023-06-23 00:00:00","2023-10-18 15:28:06" -"72","cagi5-clotting-disease","CAGI5: Clotting disease exomes","Predicting venous thromboembolism risk in African Americans","African Americans have a higher incidence of developing venous thromboembolisms (VTE), which includes deep vein thrombosis (DVT) and pulmonary embolism (PE), than people of European ancestry. Participants are provided with exome data and clinical covariates for a cohort of African Americans who have been prescribed Warfarin either because they had experienced a VTE event or had been diagnosed with atrial fibrillation (which predisposes to clotting). The challenge is to distinguish between these conditions. At present, in contrast to European ancestry, there are no genetic methods for anticipating which African Americans are most at risk of a venous thromboembolism, and the results of this challenge may contribute to the development of such tools.","","https://genomeinterpretation.org/cagi5-clotting-disease.html","completed","intermediate","14","","2017-11-23","2018-04-28","2023-06-23 00:00:00","2023-10-18 15:30:55" +"72","cagi5-clotting-disease","CAGI5: Clotting disease exomes","Predict venous thromboembolism risk in African Americans","African Americans have a higher incidence of developing venous thromboembolisms (VTE), which includes deep vein thrombosis (DVT) and pulmonary embolism (PE), than people of European ancestry. Participants are provided with exome data and clinical covariates for a cohort of African Americans who have been prescribed Warfarin either because they had experienced a VTE event or had been diagnosed with atrial fibrillation (which predisposes to clotting). The challenge is to distinguish between these conditions. 
At present, in contrast to European ancestry, there are no genetic methods for anticipating which African Americans are most at risk of a venous thromboembolism, and the results of this challenge may contribute to the development of such tools.","","https://genomeinterpretation.org/cagi5-clotting-disease.html","completed","intermediate","14","","2017-11-23","2018-04-28","2023-06-23 00:00:00","2023-10-18 15:30:55" "73","cagi6-sickkids","CAGI6: SickKids clinical genomes and transcriptomes","Identify genes causing rare diseases using transcriptomics","This challenge involves data from 79 children who were referred to The Hospital for Sick Children's (SickKids) Genome Clinic for genome sequencing because of suspected but undiagnosed genetic disorders. Research subjects are consented for sharing of their sequence data and phenotype information with researchers working to understand the molecular causes of rare disease. When a candidate disease variant believed to be related to the phenotype is identified, the variant is adjudicated and confirmed in a clinical setting. In this challenge, transcriptomic and phenotype data from a subset of the “solved” (diagnosed) and “unsolved” SickKids patients will be provided, along with corresponding genomic sequence data. The challenge is to use a transcriptome-driven approach to identify the gene(s) and molecular mechanisms underlying the phenotypic descriptions in each case. For the unsolved cases, prioritized variants from the participating teams will be examined to see if additional diagno...","","https://genomeinterpretation.org/cagi6-sickkids.html","completed","intermediate","1","","2021-08-04","2021-12-31","2023-06-23 00:00:00","2023-11-02 18:02:23" -"74","cagi6-cam","CAGI6: CaM","Predicting the Impact of Point Mutations on Calmodulin Stability","Calmodulin (CaM) is a ubiquitous calcium (Ca2+) sensor protein interacting with more than 200 molecular partners, thereby regulating a variety of biological processes. 
Missense point mutations in the genes encoding CaM have been associated with ventricular tachycardia and sudden cardiac death. A library encompassing up to 17 point mutations was assessed by far-UV circular dichroism (CD) by measuring melting temperature (Tm) and percentage of unfolding (%unfold) upon thermal denaturation at pH and salt concentration that mimic the physiological conditions. The challenge is to predict: the Tm and %unfold values for isolated CaM variants under Ca2+-saturating conditions (Ca2+-CaM) and in the Ca2+-free (apo) state; whether the point mutation stabilizes or destabilizes the protein (based on Tm and %unfold).","","https://genomeinterpretation.org/cagi6-cam.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-18 15:32:37" -"75","cami-ii","CAMI II","Assembling and Classifying Microbial Genomes in Complex Samples","CAMI II offers several challenges-an assembly, a genome binning, a taxonomic binning and a taxonomic profiling challenge, on several multi-sample data sets from different environments, including long and short read data. This includes a marine data set and a high-strain diversity data set, with a third data set to follow later. A pathogen detection challenge on a clinical sample is also provided.","","https://www.microbiome-cosi.org/cami/cami/cami2","completed","intermediate","3","","2019-01-14","2021-01-31","2023-06-23 00:00:00","2023-10-17 23:15:00" -"76","camda18-metasub-forensics","CAMDA18-MetaSUB Forensics","Building a metagenomic map of mass-transit systems globally","The MetaSUB International Consortium is building a longitudinal metagenomic map of mass-transit systems and other public spaces across the globe. 
The consortium maintains a strategic partnership with CAMDA and this year provides data from global City Sampling Days for the first-ever multi-city forensic analyses.","","http://camda2018.bioinf.jku.at/doku.php/contest_dataset#metasub_forensics_challenge","completed","intermediate","14","","\N","\N","2023-06-23 00:00:00","2023-11-01 20:37:34" -"77","camda18-cmap-drug-safety","CAMDA18-CMap Drug Safety","Predicting drug toxicity using cell-based gene expression data","Attrition in drug discovery and development due to safety / toxicity issues remains a significant concern, and there are strong efforts to identify and mitigate risk as early as possible. Drug-induced liver injury (DILI) is one of the primary problems in drug development and regulatory clearance due to the poor performance of existing preclinical models. There is a pressing need to evaluate alternative methods for predicting DILI, with great hopes being placed in modern approaches from statistics and machine learning applied to genome scale profiling data. A critical question thus is if we can better integrate, understand, and exploit information from cell-based screens like the Broad Institute Connectivity Map (CMap, Science 313, Nature Reviews Cancer 7). This CAMDA challenge focuses on understanding or predicting drug induced liver injury in humans from cell-based screens, specifically the CMap gene expression responses of two different cancer cell lines (MCF7 and PC3) to 276 d...","","http://camda2018.bioinf.jku.at/doku.php/contest_dataset#cmap_drug_safety_challenge","completed","intermediate","14","","\N","\N","2023-06-23 00:00:00","2023-11-01 20:37:35" +"74","cagi6-cam","CAGI6: CaM","Predict the impact of point mutations on calmodulin stability","Calmodulin (CaM) is a ubiquitous calcium (Ca2+) sensor protein interacting with more than 200 molecular partners, thereby regulating a variety of biological processes. 
Missense point mutations in the genes encoding CaM have been associated with ventricular tachycardia and sudden cardiac death. A library encompassing up to 17 point mutations was assessed by far-UV circular dichroism (CD) by measuring melting temperature (Tm) and percentage of unfolding (%unfold) upon thermal denaturation at pH and salt concentration that mimic the physiological conditions. The challenge is to predict: the Tm and %unfold values for isolated CaM variants under Ca2+-saturating conditions (Ca2+-CaM) and in the Ca2+-free (apo) state; whether the point mutation stabilizes or destabilizes the protein (based on Tm and %unfold).","","https://genomeinterpretation.org/cagi6-cam.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-18 15:32:37" +"75","cami-ii","CAMI II","Assemble and classify microbial genomes in complex samples","CAMI II offers several challenges-an assembly, a genome binning, a taxonomic binning and a taxonomic profiling challenge, on several multi-sample data sets from different environments, including long and short read data. This includes a marine data set and a high-strain diversity data set, with a third data set to follow later. A pathogen detection challenge on a clinical sample is also provided.","","https://www.microbiome-cosi.org/cami/cami/cami2","completed","intermediate","3","","2019-01-14","2021-01-31","2023-06-23 00:00:00","2023-10-17 23:15:00" +"76","camda18-metasub-forensics","CAMDA18-MetaSUB Forensics","Build a metagenomic map of mass-transit systems globally","The MetaSUB International Consortium is building a longitudinal metagenomic map of mass-transit systems and other public spaces across the globe. 
The consortium maintains a strategic partnership with CAMDA and this year provides data from global City Sampling Days for the first-ever multi-city forensic analyses.","","http://camda2018.bioinf.jku.at/doku.php/contest_dataset#metasub_forensics_challenge","completed","intermediate","14","","\N","\N","2023-06-23 00:00:00","2023-11-01 20:37:34" +"77","camda18-cmap-drug-safety","CAMDA18-CMap Drug Safety","Predict drug toxicity using cell-based gene expression data","Attrition in drug discovery and development due to safety / toxicity issues remains a significant concern, and there are strong efforts to identify and mitigate risk as early as possible. Drug-induced liver injury (DILI) is one of the primary problems in drug development and regulatory clearance due to the poor performance of existing preclinical models. There is a pressing need to evaluate alternative methods for predicting DILI, with great hopes being placed in modern approaches from statistics and machine learning applied to genome scale profiling data. A critical question thus is if we can better integrate, understand, and exploit information from cell-based screens like the Broad Institute Connectivity Map (CMap, Science 313, Nature Reviews Cancer 7). This CAMDA challenge focuses on understanding or predicting drug induced liver injury in humans from cell-based screens, specifically the CMap gene expression responses of two different cancer cell lines (MCF7 and PC3) to 276 d...","","http://camda2018.bioinf.jku.at/doku.php/contest_dataset#cmap_drug_safety_challenge","completed","intermediate","14","","\N","\N","2023-06-23 00:00:00","2023-11-01 20:37:35" "78","camda18-cancer-data-integration","CAMDA18-Cancer Data Integration","Unify data integration approaches for breast cancer and neuroblastoma","Examine the power of data integration in a real-world clinical settings. Many approaches work well on some data-sets yet not on others. 
We here challenge you to demonstrate a unified single approach to data-integration that matches or outperforms the current state of the art on two different diseases, breast cancer and neuroblastoma. Breast cancer affects about 3 million women every year (McGuire et al, Cancers 7), and this number is growing fast, especially in developed countries. Can you improve on the large Metabric study (Curtis et al., Nature 486, and Dream Challenge, Margolin et al, Sci Transl Med 5)? The cohort is biologically heterogeneous with all five distinct PAM50 breast cancer subtypes represented. Matched profiles for microarray and copy number data as well as clinical information (survival times, multiple prognostic markers, therapy data) are available for about 2,000 patients. Neuroblastoma is the most common extracranial solid tumor in children. The base study com...","","http://camda2018.bioinf.jku.at/doku.php/contest_dataset#cancer_data_integration_challenge","completed","intermediate","14","","\N","\N","2023-06-23 00:00:00","2023-11-01 20:37:36" -"79","cafa-4","CAFA 4","Assessing algorithms for predicting protein function","The goal of the Critical Assessment of Functional Annotation(CAFA) challenge is to evaluate automated protein function prediction algorithms in the task of predicting Gene Ontology and Human Phenotype Ontology terms for a given set of protein sequences. For the GO-based predictions, the evaluation will be carried out for the Molecular Function Ontology, Biological Process Ontology and Cellular Component Ontology. 
Participants develop protein function prediction algorithms using training protein sequence data and submit their predictions on target protein sequence data.","","https://www.biofunctionprediction.org/cafa/","completed","intermediate","1","","2019-10-21","2020-02-12","2023-06-23 00:00:00","2023-10-14 05:39:20" +"79","cafa-4","CAFA 4","Assess algorithms for predicting protein function","The goal of the Critical Assessment of Functional Annotation(CAFA) challenge is to evaluate automated protein function prediction algorithms in the task of predicting Gene Ontology and Human Phenotype Ontology terms for a given set of protein sequences. For the GO-based predictions, the evaluation will be carried out for the Molecular Function Ontology, Biological Process Ontology and Cellular Component Ontology. Participants develop protein function prediction algorithms using training protein sequence data and submit their predictions on target protein sequence data.","","https://www.biofunctionprediction.org/cafa/","completed","intermediate","1","","2019-10-21","2020-02-12","2023-06-23 00:00:00","2023-10-14 05:39:20" "80","casp13","CASP13","CASP assesses protein structure prediction methods","CASP (Critical Assessment of Structure Prediction) is a community wide experiment to determine and advance the state of the art in modeling protein structure from amino acid sequence. Every two years, participants are invited to submit models for a set of proteins for which the experimental structures are not yet public. Independent assessors then compare the models with experiment. Assessments and results are published in a special issue of the journal PROTEINS. 
In the most recent CASP round, CASP12, nearly 100 groups from around the world submitted more than 50,000 models on 82 modeling targets","","https://predictioncenter.org/casp13/index.cgi","completed","intermediate","14","","2018-04-18","2018-08-20","2023-06-23 00:00:00","2023-10-17 22:52:29" -"81","casp14","CASP14","Assessing progress in protein structure prediction","CASP (Critical Assessment of Structure Prediction) is a community wide experiment to determine and advance the state of the art in modeling protein structure from amino acid sequence. Every two years, participants are invited to submit models for a set of proteins for which the experimental structures are not yet public. Independent assessors then compare the models with experiment. Assessments and results are published in a special issue of the journal PROTEINS. In the most recent CASP round, CASP14, nearly 100 groups from around the world submitted more than 67,000 models on 90 modeling targets.","","https://predictioncenter.org/casp14/index.cgi","completed","intermediate","14","","2020-05-04","2020-09-07","2023-06-23 00:00:00","2023-10-17 22:47:26" -"82","cfsan-pathogen-detection","CFSAN Pathogen Detection","Rapidly Identify Food Sources of Outbreaks","In the U.S. alone, one in six individuals, an estimated 48 million people, fall prey to foodborne illness, resulting in 128,000 hospitalizations and 3,000 deaths per year. Economic burdens are estimated cumulatively at $152 billion dollars annually, including $39 billion due to contamination of fresh and processed produce. One longstanding problem is the ability to rapidly identify the food-source associated with the outbreak being investigated. The faster an outbreak is identified and the increased certainty that a given source (e.g., papayas from Mexico) and patients are linked, the faster the outbreak can be stopped, limiting morbidity and mortality. 
In the last few years, the application of next-generation sequencing (NGS) technology for whole genome sequencing (WGS) of foodborne pathogens has revolutionized food pathogen outbreak surveillance. WGS of foodborne pathogens enables high-resolution identification of pathogens isolated from food or environmental samples. These pat...","","https://precision.fda.gov/challenges/2","completed","intermediate","6","","2018-02-15","2018-04-26","2023-06-23 00:00:00","2023-10-14 05:39:23" -"83","cdrh-biothreat","CDRH Biothreat","Identifying infectious diseases from clinical samples using sequencing techn...","Many infectious diseases have similar signs and symptoms, making it challenging for healthcare providers to identify the disease-causing agent. Clinical samples are often tested by multiple test methods to help reveal the microbe that is causing the infectious disease. The results of these test methods can help healthcare professionals determine the best treatment for patients. Today, High-Throughput Sequencing (HTS) or Next Generation Sequencing (NGS) technology has the capability, as a single test, to accomplish what might have required several different tests in the past. NGS technology may allow the diagnosis of infections without prior knowledge of disease(s) cause. NGS technology can potentially reveal the presence of all microorganisms in a patient sample. Using infectious disease NGS (ID-NGS) technology, each microbial pathogen may be identified by its unique genomic fingerprint. 
The vision of ID-NGS technology is to further improve patient care by delivering diagnostics ...","","https://precision.fda.gov/challenges/3","completed","intermediate","6","","2018-08-03","2018-10-18","2023-06-23 00:00:00","2023-10-14 05:39:24" -"84","multi-omics-enabled-sample-mislabeling-correction","Multi-omics Enabled Sample Mislabeling Correction","Multi-omics Enabled Sample Mislabeling Correction: (No headline provided)","In biomedical research, sample mislabeling (accidental swapping of patient samples) or data mislabeling (accidental swapping of patient omics data) has been a long-standing problem that contributes to irreproducible results and invalid conclusions. These problems are particularly prevalent in large scale multi-omics studies, in which multiple different omics experiments are carried out at different time periods and/or in different labs. Human errors could arise during sample transferring, sample tracking, large-scale data generation, and data sharing/management. Thus, there is a pressing need to identify and correct sample and data mislabeling events to ensure the right data for the right patient. 
Simultaneous use of multiple types of omics platforms to characterize a large set of biological samples, as utilized in The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC) projects, has been demonstrated as a powerful approach to understanding the ...","","https://precision.fda.gov/challenges/4","completed","intermediate","6","https://doi.org/10.1038/s41591-018-0180-x","2018-09-24","2018-12-19","2023-06-23 00:00:00","2023-10-14 05:39:25" -"85","biocompute-object-app-a-thon","BioCompute Object App-a-thon","Seeking Standards for Reproducible Bioinformatics Analysis","Like scientific laboratory experiments, bioinformatics analysis results and interpretation are faced with reproducibility challenges due to the variability in multiple computational parameters, including input format, prerequisites, platform dependencies, and more. Even small changes in these computational parameters may have a large impact on the results and carry big implications for their scientific validity. Because there are currently no standardized schemas for reporting computational scientific workflows and parameters together with their results, the ways in which these workflows are communicated is highly variable, incomplete, and difficult or impossible to reproduce. 
The US Food and Drug Administration (FDA) High Performance Virtual Environment (HIVE) group and George Washington University (GW) have partnered to establish a framework for community-based standards development and harmonization of high-throughput sequencing (HTS) computations and data formats based arou...","","https://precision.fda.gov/challenges/7/","completed","intermediate","6","https://doi.org/10.1101/2020.11.02.365528","2019-05-14","2019-10-18","2023-06-23 00:00:00","2023-10-14 05:39:25" +"81","casp14","CASP14","Assess progress in protein structure prediction","CASP (Critical Assessment of Structure Prediction) is a community wide experiment to determine and advance the state of the art in modeling protein structure from amino acid sequence. Every two years, participants are invited to submit models for a set of proteins for which the experimental structures are not yet public. Independent assessors then compare the models with experiment. Assessments and results are published in a special issue of the journal PROTEINS. In the most recent CASP round, CASP14, nearly 100 groups from around the world submitted more than 67,000 models on 90 modeling targets.","","https://predictioncenter.org/casp14/index.cgi","completed","intermediate","14","","2020-05-04","2020-09-07","2023-06-23 00:00:00","2023-10-17 22:47:26" +"82","cfsan-pathogen-detection","CFSAN Pathogen Detection","Rapidly identify food sources of outbreaks","In the U.S. alone, one in six individuals, an estimated 48 million people, fall prey to foodborne illness, resulting in 128,000 hospitalizations and 3,000 deaths per year. Economic burdens are estimated cumulatively at $152 billion dollars annually, including $39 billion due to contamination of fresh and processed produce. One longstanding problem is the ability to rapidly identify the food-source associated with the outbreak being investigated. 
The faster an outbreak is identified and the increased certainty that a given source (e.g., papayas from Mexico) and patients are linked, the faster the outbreak can be stopped, limiting morbidity and mortality. In the last few years, the application of next-generation sequencing (NGS) technology for whole genome sequencing (WGS) of foodborne pathogens has revolutionized food pathogen outbreak surveillance. WGS of foodborne pathogens enables high-resolution identification of pathogens isolated from food or environmental samples. These pat...","","https://precision.fda.gov/challenges/2","completed","intermediate","6","","2018-02-15","2018-04-26","2023-06-23 00:00:00","2023-10-14 05:39:23" +"83","cdrh-biothreat","CDRH Biothreat","Identify infectious diseases from clinical samples using sequencing technology.","Many infectious diseases have similar signs and symptoms, making it challenging for healthcare providers to identify the disease-causing agent. Clinical samples are often tested by multiple test methods to help reveal the microbe that is causing the infectious disease. The results of these test methods can help healthcare professionals determine the best treatment for patients. Today, High-Throughput Sequencing (HTS) or Next Generation Sequencing (NGS) technology has the capability, as a single test, to accomplish what might have required several different tests in the past. NGS technology may allow the diagnosis of infections without prior knowledge of disease(s) cause. NGS technology can potentially reveal the presence of all microorganisms in a patient sample. Using infectious disease NGS (ID-NGS) technology, each microbial pathogen may be identified by its unique genomic fingerprint. 
The vision of ID-NGS technology is to further improve patient care by delivering diagnostics ...","","https://precision.fda.gov/challenges/3","completed","intermediate","6","","2018-08-03","2018-10-18","2023-06-23 00:00:00","2023-10-14 05:39:24" +"84","multi-omics-enabled-sample-mislabeling-correction","Multi-omics Enabled Sample Mislabeling Correction","Identify and correct sample and data mislabeling events to ensure the right data for the right patient","In biomedical research, sample mislabeling (accidental swapping of patient samples) or data mislabeling (accidental swapping of patient omics data) has been a long-standing problem that contributes to irreproducible results and invalid conclusions. These problems are particularly prevalent in large scale multi-omics studies, in which multiple different omics experiments are carried out at different time periods and/or in different labs. Human errors could arise during sample transferring, sample tracking, large-scale data generation, and data sharing/management. Thus, there is a pressing need to identify and correct sample and data mislabeling events to ensure the right data for the right patient. 
Simultaneous use of multiple types of omics platforms to characterize a large set of biological samples, as utilized in The Cancer Genome Atlas (TCGA) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC) projects, has been demonstrated as a powerful approach to understanding the ...","","https://precision.fda.gov/challenges/4","completed","intermediate","6","https://doi.org/10.1038/s41591-018-0180-x","2018-09-24","2018-12-19","2023-06-23 00:00:00","2023-10-14 05:39:25" +"85","biocompute-object-app-a-thon","BioCompute Object App-a-thon","Seeking standards for reproducible bioinformatics analysis","Like scientific laboratory experiments, bioinformatics analysis results and interpretation are faced with reproducibility challenges due to the variability in multiple computational parameters, including input format, prerequisites, platform dependencies, and more. Even small changes in these computational parameters may have a large impact on the results and carry big implications for their scientific validity. Because there are currently no standardized schemas for reporting computational scientific workflows and parameters together with their results, the ways in which these workflows are communicated is highly variable, incomplete, and difficult or impossible to reproduce. 
The US Food and Drug Administration (FDA) High Performance Virtual Environment (HIVE) group and George Washington University (GW) have partnered to establish a framework for community-based standards development and harmonization of high-throughput sequencing (HTS) computations and data formats based arou...","","https://precision.fda.gov/challenges/7/","completed","intermediate","6","https://doi.org/10.1101/2020.11.02.365528","2019-05-14","2019-10-18","2023-06-23 00:00:00","2023-10-14 05:39:25" "86","brain-cancer-predictive-modeling-and-biomarker-discovery","Brain Cancer Predictive Modeling and Biomarker Discovery","Seeking novel biomarkers to advance precision medicine for brain tumors","An estimated 86,970 new cases of primary brain and other central nervous system tumors are expected to be diagnosed in the US in 2019. Brain tumors comprise a particularly deadly subset of all cancers due to limited treatment options and the high cost of care. Only a few prognostic and predictive markers have been successfully implemented in the clinic so far for gliomas, the most common malignant brain tumor type. These markers include MGMT promoter methylation in high-grade astrocytomas, co-deletion of 1p/19q in oligodendrogliomas, and mutations in IDH1 or IDH2 genes (Staedtke et al. 2016). There remains significant potential for identifying new clinical biomarkers in gliomas. Clinical investigators at Georgetown University are seeking to advance precision medicine techniques for the prognosis and treatment of brain tumors through the identification of novel multi-omics biomarkers. 
In support of this goal, precisionFDA and the Georgetown Lombardi Comprehensive Cancer Center and ...","","https://precision.fda.gov/challenges/8/","completed","advanced","6","","2019-11-01","2020-02-14","2023-06-23 00:00:00","2023-10-14 05:39:25" -"87","gaining-new-insights-by-detecting-adverse-event-anomalies","Gaining New Insights by Detecting Adverse Event Anomalies","Seeking Algorithms to Detect Adverse Events in FDA Data","The Food and Drug Administration (FDA) calls on the public to develop computational algorithms for automatic detection of adverse event anomalies using publicly available data.","","https://precision.fda.gov/challenges/9/","completed","intermediate","6","","2020-01-17","2020-05-18","2023-06-23 00:00:00","2023-10-14 05:39:27" +"87","gaining-new-insights-by-detecting-adverse-event-anomalies","Gaining New Insights by Detecting Adverse Event Anomalies","Seeking algorithms to detect adverse events in FDA data","The Food and Drug Administration (FDA) calls on the public to develop computational algorithms for automatic detection of adverse event anomalies using publicly available data.","","https://precision.fda.gov/challenges/9/","completed","intermediate","6","","2020-01-17","2020-05-18","2023-06-23 00:00:00","2023-10-14 05:39:27" "88","calling-variants-in-difficult-to-map-regions","Calling Variants in Difficult-to-Map Regions","Precision Benchmarking: Evaluating Variant Calling in Complex Genomic Regions","This challenge calls on the public to assess variant calling pipeline performance on a common frame of reference, with a focus on benchmarking in difficult-to-map regions, segmental duplications, and the Major Histocompatibility Complex (MHC).","","https://precision.fda.gov/challenges/10/","completed","intermediate","6","https://doi.org/10.1016/j.xgen.2022.100129","2020-05-01","2020-06-15","2023-06-23 00:00:00","2023-10-14 05:39:28" "89","vha-innovation-ecosystem-and-covid-19-risk-factor-modeling","VHA Innovation Ecosystem and COVID-19 Risk 
Factor Modeling","AI for COVID-19: Predicting Health Outcomes in the Veteran Population","The novel coronavirus disease 2019 (COVID-19) is a respiratory disease caused by a new type of coronavirus, known as “severe acute respiratory syndrome coronavirus 2,” or SARS-CoV-2. On March 11, 2020, the World Health Organization (WHO) declared the outbreak a global pandemic. As of Monday, June 1, the Johns Hopkins University COVID-19 dashboard reports over 6.21 million total confirmed cases worldwide, including over 1.79 million cases in the United States. Although most people have mild to moderate symptoms, the disease can cause severe medical complications leading to death in some people. The Centers for Disease Control and Prevention (CDC) have identified several groups at elevated risk for severe illness, including people 65 years and older, individuals living in nursing homes or long term care facilities, and those with serious underlying medical conditions, such as severe obesity, diabetes, chronic lung disease or moderate to severe asthma, chronic kidney or liver disease...","","https://precision.fda.gov/challenges/11/","completed","intermediate","6","","2020-06-02","2020-07-03","2023-06-23 00:00:00","2023-10-14 05:39:28" -"90","covid-19-precision-immunology-app-a-thon","COVID-19 Precision Immunology App-a-thon","Seeking insights on COVID-19 pathophysiology to enable effective strategies.","The novel coronavirus disease 2019 (COVID-19), a respiratory disease caused by a new type of coronavirus, known as “severe acute respiratory syndrome coronavirus 2” or SARS-CoV-2, was declared a global pandemic by the World Health Organization on March 11, 2020. To date, the Johns Hopkins University COVID-19 dashboard reports over 62 million confirmed cases worldwide, with a wide range of disease severity from asymptomatic to deaths (over 1.46 million). 
To effectively combat the widespread transmission of COVID-19 infection and save lives especially of those vulnerable individuals, it is imperative to better understand its pathophysiology to enable effective diagnosis, prognosis and treatment strategies using rapidly shared data.","","https://precision.fda.gov/challenges/12/","completed","intermediate","6","","2020-11-30","2021-01-29","2023-06-23 00:00:00","2023-10-14 05:39:29" -"91","smarter-food-safety-low-cost-tech-enabled-traceability","Smarter Food Safety Low Cost Tech-Enabled Traceability","Seeking Affordable Tech Solutions for Food Traceability","The motivation is tapping into new technologies and integrating data streams will help to advance the widespread, consistent implementation of traceability systems across the food industry. However, the affordability of such technologies, particularly for smaller companies, can be a barrier to implementing tech-enabled traceability systems. FDA's New Era of Smarter Food Safety initiative strives to work with stakeholders to explore low-cost or no-cost options so that our approaches are inclusive of and viable for human and animal food operations of all sizes. Democratizing the benefits of digitizing data will allow the entire food system to move more rapidly towards digital traceability systems. 
The primary goal is to encourage stakeholders, including technology providers, public health advocates, entrepreneurs, and innovators from all disciplines and around the world, to develop traceability hardware, software, or data analytics platforms that are low-cost or no-cost to the en...","","https://precision.fda.gov/challenges/13","completed","intermediate","6","","2021-06-01","2021-07-30","2023-06-23 00:00:00","2023-10-17 23:05:49" -"92","tumor-mutational-burden-tmb-challenge-phase-1","Tumor Mutational Burden (TMB) Challenge Phase 1","Standardizing Tumor Mutational Burden (TMB) Calculation in Cancer Research","Tumor mutational burden (TMB) is generally defined as the number of mutations detected in a patient's tumor sample per megabase of DNA sequenced. However different algorithms use different methods for calculating TMB. Mutations in genes in tumor cells may lead to the creation of neoantigens, which have the potential to activate an immune system response against the tumor, and the likelihood of an immune system response may increase with the number of mutations. Thus, TMB is a biomarker for some immunotherapy drugs, called immune checkpoint inhibitors, such as those that target the PD-1 and PD-L1 pathways (Chan et al., 2019). An outstanding problem is the lack of standardization for TMB calculation and reporting between different assays. To address this problem, the Friends of Cancer Research convened a working group of industry and regulatory stakeholders to develop guidance and tools for TMB harmonization. 
Results from the first phase of this effort were presented at AACR 2020 (...","","https://precision.fda.gov/challenges/17","completed","advanced","6","","2021-06-21","2021-09-13","2023-06-23 00:00:00","2023-11-02 18:28:46" -"93","kits21","Kidney and Kidney Tumor Segmentation","Contest Seeks Best Kidney Tumor Segmentation System","The 2021 Kidney and Kidney Tumor Segmentation challenge (abbreviated KiTS21) is a competition in which teams compete to develop the best system for automatic semantic segmentation of renal tumors and surrounding anatomy. Kidney cancer is one of the most common malignancies in adults around the world, and its incidence is thought to be increasing [1]. Fortunately, most kidney tumors are discovered early while they're still localized and operable. However, there are important questions concerning management of localized kidney tumors that remain unanswered [2], and metastatic renal cancer remains almost uniformly fatal [3]. Kidney tumors are notorious for their conspicuous appearance in computed tomography (CT) imaging, and this has enabled important work by radiologists and surgeons to study the relationship between tumor size, shape, and appearance and its prospects for treatment [4,5,6]. It's laborious work, however, and it relies on assessments that are often subjective and impr...","","https://kits21.kits-challenge.org/","completed","advanced","5","","2021-08-23","2021-09-17","2023-06-23 00:00:00","2023-10-16 18:30:07" +"90","covid-19-precision-immunology-app-a-thon","COVID-19 Precision Immunology App-a-thon","Seeking insights on COVID-19 pathophysiology to enable effective strategies","The novel coronavirus disease 2019 (COVID-19), a respiratory disease caused by a new type of coronavirus, known as “severe acute respiratory syndrome coronavirus 2” or SARS-CoV-2, was declared a global pandemic by the World Health Organization on March 11, 2020. 
To date, the Johns Hopkins University COVID-19 dashboard reports over 62 million confirmed cases worldwide, with a wide range of disease severity from asymptomatic to deaths (over 1.46 million). To effectively combat the widespread transmission of COVID-19 infection and save lives especially of those vulnerable individuals, it is imperative to better understand its pathophysiology to enable effective diagnosis, prognosis and treatment strategies using rapidly shared data.","","https://precision.fda.gov/challenges/12/","completed","intermediate","6","","2020-11-30","2021-01-29","2023-06-23 00:00:00","2023-10-14 05:39:29" +"91","smarter-food-safety-low-cost-tech-enabled-traceability","Smarter Food Safety Low Cost Tech-Enabled Traceability","Seeking affordable tech solutions for food traceability","The motivation is tapping into new technologies and integrating data streams will help to advance the widespread, consistent implementation of traceability systems across the food industry. However, the affordability of such technologies, particularly for smaller companies, can be a barrier to implementing tech-enabled traceability systems. FDA's New Era of Smarter Food Safety initiative strives to work with stakeholders to explore low-cost or no-cost options so that our approaches are inclusive of and viable for human and animal food operations of all sizes. Democratizing the benefits of digitizing data will allow the entire food system to move more rapidly towards digital traceability systems. 
The primary goal is to encourage stakeholders, including technology providers, public health advocates, entrepreneurs, and innovators from all disciplines and around the world, to develop traceability hardware, software, or data analytics platforms that are low-cost or no-cost to the en...","","https://precision.fda.gov/challenges/13","completed","intermediate","6","","2021-06-01","2021-07-30","2023-06-23 00:00:00","2023-10-17 23:05:49" +"92","tumor-mutational-burden-tmb-challenge-phase-1","Tumor Mutational Burden (TMB) Challenge Phase 1","Standardize tumor mutational burden (TMB) calculation in cancer research","Tumor mutational burden (TMB) is generally defined as the number of mutations detected in a patient's tumor sample per megabase of DNA sequenced. However different algorithms use different methods for calculating TMB. Mutations in genes in tumor cells may lead to the creation of neoantigens, which have the potential to activate an immune system response against the tumor, and the likelihood of an immune system response may increase with the number of mutations. Thus, TMB is a biomarker for some immunotherapy drugs, called immune checkpoint inhibitors, such as those that target the PD-1 and PD-L1 pathways (Chan et al., 2019). An outstanding problem is the lack of standardization for TMB calculation and reporting between different assays. To address this problem, the Friends of Cancer Research convened a working group of industry and regulatory stakeholders to develop guidance and tools for TMB harmonization. 
Results from the first phase of this effort were presented at AACR 2020 (...","","https://precision.fda.gov/challenges/17","completed","advanced","6","","2021-06-21","2021-09-13","2023-06-23 00:00:00","2023-11-02 18:28:46" +"93","kits21","Kidney and Kidney Tumor Segmentation","Contest seeks best kidney tumor segmentation system","The 2021 Kidney and Kidney Tumor Segmentation challenge (abbreviated KiTS21) is a competition in which teams compete to develop the best system for automatic semantic segmentation of renal tumors and surrounding anatomy. Kidney cancer is one of the most common malignancies in adults around the world, and its incidence is thought to be increasing [1]. Fortunately, most kidney tumors are discovered early while they're still localized and operable. However, there are important questions concerning management of localized kidney tumors that remain unanswered [2], and metastatic renal cancer remains almost uniformly fatal [3]. Kidney tumors are notorious for their conspicuous appearance in computed tomography (CT) imaging, and this has enabled important work by radiologists and surgeons to study the relationship between tumor size, shape, and appearance and its prospects for treatment [4,5,6]. It's laborious work, however, and it relies on assessments that are often subjective and impr...","","https://kits21.kits-challenge.org/","completed","advanced","5","","2021-08-23","2021-09-17","2023-06-23 00:00:00","2023-10-16 18:30:07" "94","realnoisemri","Real Noise MRI","Developing fast MRI techniques without fully sampled data","In recent years, there is a growing focus on the application of fast magnetic resonance imaging (MRI) based on prior knowledge. In the 1980s and 2000s the community used either purely mathematical models such as the partial Fourier transform or solutions derived through advanced engineering such as parallel imaging to speed up MRI acquisition. 
Since the mid-2000's, compressed sensing and artificial intelligence have been employed to speed up MRI acquisition. These newer methods rely on under sampling the data acquired in Fourier (aka k-) space and then interpolating or augmenting k-space data based on training data content. One of the underlying problems for the development of fast imaging techniques, that just as in e.g. [1], it is common to use a fully sampled image as ground truth and then under sample it in k-space in order to simulate under sampled data. The problem with this approach is that in cases were the under sampled data is corrupted, through e.g. motion, this under s...","","https://realnoisemri.grand-challenge.org/","completed","intermediate","5","","2021-09-21","2021-12-06","2023-06-23 00:00:00","2023-10-14 05:39:33" "95","deep-generative-model-challenge-for-da-in-surgery","Deep Generative Model Challenge for DA in Surgery","Challenge aims to adapt algorithms from simulation to mitral valve surgery","Mitral regurgitation (MR) is the second most frequent indication for valve surgery in Europe and may occur for organic or functional causes [1]. Mitral valve repair, although considerably more difficult, is prefered over mitral valve replacement, since the native tissue of the valve is preserved. It is a complex on-pump heart surgery, often conducted only by a handful of surgeons in high-volume centers. Minimally invasive procedures, which are performed with endoscopic video recordings, became more and more popular in recent years. However, data availability and data privacy concerns are still an issue for the development of automatic scene analysis algorithms. The AdaptOR challenge aims to address these issues by formulating a domain adaptation problem from simulation to surgery. We provide a smaller number of datasets from real surgeries, and a larger number of annotated recordings of training and planning sessions from a physical mitral valve simulator. 
The goal is to reduce th...","","https://adaptor2021.github.io/","completed","intermediate","1","","2021-04-01","2021-07-16","2023-06-23 00:00:00","2023-10-14 05:39:34" -"96","aimdatathon","AIM Datathon 2020","AIM Datathon 2020: ""Join the AI in Medicine (AIM) Datathon 2020""","Join the AI in Medicine ( AIM ) Datathon 2020","","https://www.kaggle.com/competitions/aimdatathon","completed","intermediate","8","","2020-11-09","2020-11-22","2023-06-23 00:00:00","2023-11-02 18:28:42" -"97","opc-recurrence","Oropharynx Cancer (OPC) Radiomics Challenge :: Local Recurrence Prediction","Determine from CT data whether a tumor will be controlled by definitive radi...","Determine from CT data whether a tumor will be controlled by definitive radiation therapy.","","https://www.kaggle.com/competitions/opc-recurrence","completed","intermediate","8","","2016-07-26","2016-09-12","2023-06-23 00:00:00","2023-10-16 18:10:11" -"98","oropharynx-radiomics-hpv","Oropharynx Cancer (OPC) Radiomics Challenge :: Human Papilloma Virus (HPV) Status Prediction","Human Papilloma Virus (HPV) Status Prediction: ""Predict from CT data the HPV...","Predict from CT data the HPV phenotype of oropharynx tumors; compare to ground-truth results previously obtained by p16 or HPV testing.","","https://www.kaggle.com/competitions/oropharynx-radiomics-hpv","completed","intermediate","8","","2016-07-26","2016-09-12","2023-06-23 00:00:00","2023-10-16 18:10:15" -"99","data-science-bowl-2017","Data Science Bowl 2017","""Can you improve lung cancer detection?""","Can you improve lung cancer detection?","","https://www.kaggle.com/competitions/data-science-bowl-2017","completed","intermediate","8","","2017-01-12","2017-04-12","2023-06-23 00:00:00","2023-10-14 05:39:38" -"100","predict-impact-of-air-quality-on-death-rates","Predict impact of air quality on mortality rates","""Predict CVD and cancer caused mortality rates in England using air quality ...","Predict CVD and cancer caused mortality rates in England using 
air quality data available from Copernicus Atmosphere Monitoring Service","","https://www.kaggle.com/competitions/predict-impact-of-air-quality-on-death-rates","completed","intermediate","8","","2017-02-13","2017-05-05","2023-06-23 00:00:00","2023-10-14 05:39:38" -"101","intel-mobileodt-cervical-cancer-screening","Intel & MobileODT Cervical Cancer Screening","""Which cancer treatment will be most effective?""","Which cancer treatment will be most effective?","","https://www.kaggle.com/competitions/intel-mobileodt-cervical-cancer-screening","completed","intermediate","8","","2017-03-15","2017-06-21","2023-06-23 00:00:00","2023-10-14 05:39:39" -"102","msk-redefining-cancer-treatment","Personalized Medicine-Redefining Cancer Treatment","Predict the effect of Genetic Variants to enable Personalized Medicine","Predict the effect of Genetic Variants to enable Personalized Medicine","","https://www.kaggle.com/competitions/msk-redefining-cancer-treatment","completed","intermediate","8","","2017-06-26","2017-10-02","2023-06-23 00:00:00","2023-11-02 18:32:51" +"96","aimdatathon","AIM Datathon 2020","Join the AI in Medicine (AIM) Datathon 2020","Join the AI in Medicine ( AIM ) Datathon 2020","","https://www.kaggle.com/competitions/aimdatathon","completed","intermediate","8","","2020-11-09","2020-11-22","2023-06-23 00:00:00","2023-11-02 18:28:42" +"97","opc-recurrence","Oropharynx Cancer (OPC) Radiomics Challenge :: Local Recurrence Prediction","Determine from CT data whether a tumor will be controlled by definitive radiation therapy","Determine from CT data whether a tumor will be controlled by definitive radiation therapy.","","https://www.kaggle.com/competitions/opc-recurrence","completed","intermediate","8","","2016-07-26","2016-09-12","2023-06-23 00:00:00","2023-10-16 18:10:11" +"98","oropharynx-radiomics-hpv","Oropharynx Cancer (OPC) Radiomics Challenge :: Human Papilloma Virus (HPV) Status Prediction","Predict from CT data the HPV phenotype of oropharynx tumors; compare 
to ground truth data","Predict from CT data the HPV phenotype of oropharynx tumors; compare to ground-truth results previously obtained by p16 or HPV testing.","","https://www.kaggle.com/competitions/oropharynx-radiomics-hpv","completed","intermediate","8","","2016-07-26","2016-09-12","2023-06-23 00:00:00","2023-10-16 18:10:15" +"99","data-science-bowl-2017","Data Science Bowl 2017","Can you improve lung cancer detection?","Can you improve lung cancer detection?","","https://www.kaggle.com/competitions/data-science-bowl-2017","completed","intermediate","8","","2017-01-12","2017-04-12","2023-06-23 00:00:00","2023-10-14 05:39:38" +"100","predict-impact-of-air-quality-on-death-rates","Predict impact of air quality on mortality rates","Predict CVD and cancer caused mortality rates in England using air quality data","Predict CVD and cancer caused mortality rates in England using air quality data available from Copernicus Atmosphere Monitoring Service","","https://www.kaggle.com/competitions/predict-impact-of-air-quality-on-death-rates","completed","intermediate","8","","2017-02-13","2017-05-05","2023-06-23 00:00:00","2023-10-14 05:39:38" +"101","intel-mobileodt-cervical-cancer-screening","Intel & MobileODT Cervical Cancer Screening","Which cancer treatment will be most effective?","Which cancer treatment will be most effective?","","https://www.kaggle.com/competitions/intel-mobileodt-cervical-cancer-screening","completed","intermediate","8","","2017-03-15","2017-06-21","2023-06-23 00:00:00","2023-10-14 05:39:39" +"102","msk-redefining-cancer-treatment","Personalized Medicine-Redefining Cancer Treatment","Predict the effect of genetic variants to enable personalized medicine","Predict the effect of Genetic Variants to enable Personalized Medicine","","https://www.kaggle.com/competitions/msk-redefining-cancer-treatment","completed","intermediate","8","","2017-06-26","2017-10-02","2023-06-23 00:00:00","2023-11-02 18:32:51" "103","mubravo","Predicting Cancer 
Diagnosis","Bravo's machine learning competition!","Bravo's machine learning competition!","","https://www.kaggle.com/competitions/mubravo","completed","intermediate","8","","2018-07-31","2018-08-13","2023-06-23 00:00:00","2023-10-14 05:39:41" "104","histopathologic-cancer-detection","Histopathologic Cancer Detection","Identify metastatic tissue in histopathologic scans of lymph node sections","Identify metastatic tissue in histopathologic scans of lymph node sections","","https://www.kaggle.com/competitions/histopathologic-cancer-detection","completed","intermediate","8","","2018-11-16","2019-03-30","2023-06-23 00:00:00","2023-10-14 05:39:41" "105","tjml1920-decision-trees","TJML 2019-20 Breast Cancer Detection Competition","Use a decision tree to identify malignant breast cancer tumors","Use a decision tree to identify malignant breast cancer tumors","","https://www.kaggle.com/competitions/tjml1920-decision-trees","completed","intermediate","8","","2019-09-22","2019-10-16","2023-06-23 00:00:00","2023-10-14 05:39:42" @@ -131,9 +131,9 @@ "130","pro-lig-aff-task1-pearsonr","Structure-free protein-ligand affinity prediction - Task 1 Ranking","Developing new AI models for drug discovery (Task-1 ranking)","Developing new AI models for drug discovery (Task-1 ranking)","","https://www.kaggle.com/competitions/pro-lig-aff-task1-pearsonr","completed","intermediate","8","","2021-12-08","2021-12-31","2023-06-23 00:00:00","2023-10-16 18:13:26" "131","pro-lig-aff-task2-pearsonr","Structure-free protein-ligand affinity prediction - Task 2 Ranking","Developing new AI models for drug discovery (Task-2 ranking)","Developing new AI models for drug discovery (Task-2 ranking)","","https://www.kaggle.com/competitions/pro-lig-aff-task2-pearsonr","completed","intermediate","8","","2021-12-08","2021-12-31","2023-06-23 00:00:00","2023-11-02 18:41:41" "132","pro-lig-aff-task3-spearmanr","Structure-free protein-ligand affinity prediction - Task 3 Ranking","Developing new AI models for drug 
discovery (Task-3 ranking)","Developing new AI models for drug discovery (Task-3 ranking)","","https://www.kaggle.com/competitions/pro-lig-aff-task3-spearmanr","completed","intermediate","8","","2021-12-08","2021-12-31","2023-06-23 00:00:00","2023-10-16 18:13:32" -"133","hhp","Heritage Health Prize","Identify patients who will be admitted to a hospital within the next year us...","Identify patients who will be admitted to a hospital within the next year using historical claims data. (Enter by 06-59-59 UTC Oct 4 2012)","","https://www.kaggle.com/competitions/hhp","completed","intermediate","8","","2011-04-04","2013-04-04","2023-06-23 00:00:00","2023-10-14 05:40:00" -"134","pf2012","Practice Fusion Analyze This! 2012 - Prediction Challenge","Delve into Electronic Health Records: Propose Innovative Predictive Modeling...","Start digging into electronic health records and submit your ideas for the most promising, impactful or interesting predictive modeling competitions","","https://www.kaggle.com/competitions/pf2012","completed","intermediate","8","","2012-06-07","2012-06-30","2023-06-23 00:00:00","2023-10-16 18:14:24" -"135","pf2012-at","Practice Fusion Analyze This! 2012 - Open Challenge","Delve into Electronic Health Records: Propose Innovative Predictive Modeling...","Start digging into electronic health records and submit your creative, insightful, and visually striking analyses.","","https://www.kaggle.com/competitions/pf2012-at","completed","intermediate","8","","2012-06-07","2012-09-10","2023-06-23 00:00:00","2023-10-16 18:14:26" +"133","hhp","Heritage Health Prize","Identify patients who will be admitted to a hospital within the next year using claims data","Identify patients who will be admitted to a hospital within the next year using historical claims data. 
(Enter by 06-59-59 UTC Oct 4 2012)","","https://www.kaggle.com/competitions/hhp","completed","intermediate","8","","2011-04-04","2013-04-04","2023-06-23 00:00:00","2023-10-14 05:40:00" +"134","pf2012","Practice Fusion Analyze This! 2012 - Prediction Challenge","Delve into Electronic Health Records: Propose Innovative Predictive Modeling Challenges","Start digging into electronic health records and submit your ideas for the most promising, impactful or interesting predictive modeling competitions","","https://www.kaggle.com/competitions/pf2012","completed","intermediate","8","","2012-06-07","2012-06-30","2023-06-23 00:00:00","2023-10-16 18:14:24" +"135","pf2012-at","Practice Fusion Analyze This! 2012 - Open Challenge","Delve into Electronic Health Records: Propose Innovative Predictive Modeling Challenges","Start digging into electronic health records and submit your creative, insightful, and visually striking analyses.","","https://www.kaggle.com/competitions/pf2012-at","completed","intermediate","8","","2012-06-07","2012-09-10","2023-06-23 00:00:00","2023-10-16 18:14:26" "136","seizure-detection","UPenn and Mayo Clinic's Seizure Detection Challenge","Detect seizures in intracranial EEG recordings","Detect seizures in intracranial EEG recordings","","https://www.kaggle.com/competitions/seizure-detection","completed","intermediate","8","","2014-05-19","2014-08-19","2023-06-23 00:00:00","2023-10-14 05:40:02" "137","seizure-prediction","American Epilepsy Society Seizure Prediction Challenge","Predict seizures in intracranial EEG recordings","Predict seizures in intracranial EEG recordings","","https://www.kaggle.com/competitions/seizure-prediction","completed","intermediate","8","","2014-08-25","2014-11-17","2023-06-23 00:00:00","2023-10-14 05:40:03" "138","deephealth-1","Deep Health - alcohol","Find Correlations and patterns with Medical data","Find Correlations and patterns with Medical 
data","","https://www.kaggle.com/competitions/deephealth-1","completed","intermediate","8","","2017-02-13","2017-02-19","2023-06-23 00:00:00","2023-10-16 18:14:48" @@ -146,79 +146,79 @@ "145","biocreative-vii-text-mining-drug-and-chemical-protein-interactions-drugprot","BioCreative VII: Text mining drug and chemical-protein interactions (DrugProt)","Develop systems to extract drug-gene relations from text","With the rapid accumulation of biomedical literature, it is getting increasingly challenging to exploit efficiently drug-related information described in the scientific literature. One of the most relevant aspects of drugs and chemical compounds are their relationships with certain biomedical entities, in particular genes and proteins. The aim of the DrugProt track (similar to the previous CHEMPROT task of BioCreative VI) is to promote the development and evaluation of systems that are able to automatically detect in relations between chemical compounds/drug and genes/proteins. There are a range of different types of drug-gene/protein interactions, and their systematic extraction and characterization is essential to analyze, predict and explore key biomedical properties underlying high impact biomedical applications. These application scenarios include use cases related to drug discovery, drug repurposing, drug design, metabolic engineering, modeling drug response, pharmacogenet...","","https://biocreative.bioinformatics.udel.edu/tasks/biocreative-vii/track-1/","completed","intermediate","14","","2021-06-15","2021-09-22","2023-06-23 00:00:00","2023-11-01 20:37:37" "146","extended-literature-ai-for-drug-induced-liver-injury","Extended Literature AI for Drug Induced Liver Injury","Develop ML tools to analyze drug texts for liver injury data","Unexpected Drug-Induced Liver Injury (DILI) still is one of the main killers of promising novel drug candidates. 
It is a clinically significant disease that can lead to severe outcomes such as acute liver failure and even death. It remains one of the primary liabilities in drug development and regulatory clearance due to the limited performance of mandated preclinical models even today. The free text of scientific publications is still the main medium carrying DILI results from clinical practice or experimental studies. The textual data still has to be analysed manually. This process, however, is tedious and prone to human mistakes or omissions, as results are very rarely available in a standardized form or organized form. There is thus great hope that modern techniques from machine learning or natural language processing could provide powerful tools to better process and derive the underlying knowledge within free form texts. The pressing need to faster process potential drug can...","","http://camda2022.bioinf.jku.at/contest_dataset#extended_literature_ai_for_drug_induced_liver_injury","completed","intermediate","14","","\N","2022-05-20","2023-06-23 00:00:00","2023-11-01 20:37:38" "147","anti-microbial-resistance-forensics","Anti-Microbial Resistance Forensics","Classifying Bacteriophages to Understand Microbial Evolution","Bacteriophages, being the re-occuring mystery in the history of science are believed to be they key for understanding of microbial evolution and the transfer of AMR genes. Recent studies show that there is a significant correlation between occurence of Phages and AMR genes, indicating that they are indeed taking part in the spread of them. While taking part in AMR dissemination the phages are also considered as the potential alternative to antibiotics. In such contradictory world there is a huge potential as well as urgent need for precise classification, description and analysis of capabilities. 
Due to pandemic of SARS-CoV-2, advance in phylogenetic algorithms and k-mer based methods have been extremely rapid and those improvements are witing to be adapted to different branches of life sciences.","","http://camda2022.bioinf.jku.at/contest_dataset#anti-microbial_resistance_forensics","completed","intermediate","14","","\N","2022-05-20","2023-06-23 00:00:00","2023-10-14 05:40:14" -"148","disease-maps-to-modelling-covid-19","Disease Maps to Modelling COVID-19","Disease Maps COVID-19 Challenge: Enhancing Drug Repurposing with Omic Data","The Disease Maps to modeling COVID-19 Challenge provides highly detailed expert-curated molecular mechanistic maps for COVID-19. Combine them with available omic data to expand the current biological knowledge on COVID-19 mechanism of infection and downstream consequences. The main topic for this year's challenge is drug repurposing with the possibility of Real World Data based validation of the most promising candidates suggested.","","http://camda2022.bioinf.jku.at/contest_dataset#disease_maps_to_modelling_covid-19","completed","intermediate","14","","\N","2022-05-20","2023-06-23 00:00:00","2023-10-14 05:40:15" -"149","crowdsourced-evaluation-of-inchi-based-tautomer-identification","Crowdsourced Evaluation of InChI-based Tautomer Identification","Crowdsourced Evaluation of InChI-Based Tautomer Identification Challenge","This challenge focuses on the International Chemical Identifier (InChI), which was developed and is maintained under the auspices of the International Union of Pure and Applied Chemistry (IUPAC) and the InChI Trust. The InChI Trust, the IUPAC Working Group on Tautomers, and the U.S. Food and Drug Administration (FDA) call on the scientific community dealing with chemical repositories/data sets and analytics of compounds to test the recently modified InChI algorithm, which was designed for advanced recognition of tautomers. 
Participants will evaluate this algorithm against real chemical samples in this Crowdsourced Evaluation of InChI-based Tautomer Identification.","","https://precision.fda.gov/challenges/29","completed","intermediate","6","","2022-11-01","2023-03-01","2023-06-23 00:00:00","2023-10-14 05:40:15" -"150","nctr-indel-calling-from-oncopanel-sequencing-challenge-phase-2","NCTR Indel Calling from Oncopanel Sequencing Challenge Phase 2","NCTR Indel Calling from Oncopanel Sequencing Data Challenge","The high value of clinically actionable information obtained by oncopanel sequencing makes it a crucial tool for precision oncology[1,2]. With the surge in availability of oncopanels, it is critical to ensure that they have been thoroughly tested and are properly used. FDA has initiated the Sequencing Quality Control phase II (SEQC2) project[3] to develop standard analysis protocols and quality control metrics for fit-for-purpose use of Next Generation Sequencing (NGS) data including oncopanel sequencing to inform regulatory science research and precision medicine. The Oncopanel Sequencing Working Group of FDA-led SEQC2 has developed a reference sample[4] suitable for benchmarking oncopanels and comprehensively assessed the analytical performance of several oncopanels[1,2]. The genomic deoxyribonucleic acid (gDNA) reference sample was derived from ten Universal Human Reference RNA (UHRR, Agilent Technologies, Inc) cell-lines and made publicly available by Agilent. Substantial gen...","","https://precision.fda.gov/challenges/22","completed","intermediate","6","","2022-07-11","2022-07-26","2023-06-23 00:00:00","2023-10-17 23:18:17" -"151","nctr-indel-calling-from-oncopanel-sequencing-data-challenge-phase-1","NCTR Indel Calling from Oncopanel Sequencing Data Challenge Phase 1","NCTR Indel Calling from Oncopanel Sequencing Data Challenge","The high value of clinically actionable information obtained by oncopanel sequencing makes it a crucial tool for precision oncology[1,2]. 
With the surge in availability of oncopanels, it is critical to ensure that they have been thoroughly tested and are properly used. FDA has initiated the Sequencing Quality Control phase II (SEQC2) project[3] to develop standard analysis protocols and quality control metrics for fit-for-purpose use of Next Generation Sequencing (NGS) data including oncopanel sequencing to inform regulatory science research and precision medicine. The Oncopanel Sequencing Working Group of FDA-led SEQC2 has developed a reference sample[4] suitable for benchmarking oncopanels and comprehensively assessed the analytical performance of several oncopanels[1,2]. The genomic deoxyribonucleic acid (gDNA) reference sample was derived from ten Universal Human Reference RNA (UHRR, Agilent Technologies, Inc) cell-lines and made publicly available by Agilent. Substantial gen...","","https://precision.fda.gov/challenges/21","completed","intermediate","6","","2022-05-02","2022-07-08","2023-06-23 00:00:00","2023-10-17 23:18:21" -"152","vha-innovation-ecosystem-and-precisionfda-covid-19-risk-factor-modeling-challenge-phase-2","VHA Innovation Ecosystem and precisionFDA COVID-19 Risk Factor Modeling Challenge Phase 2","The focus of Phase 2 was to validate the top performing models on two additi...","The novel coronavirus disease 2019 (COVID-19) is a respiratory disease caused by a new type of coronavirus, known as “severe acute respiratory syndrome coronavirus 2,” or SARS-CoV-2. On March 11, 2020, the World Health Organization (WHO) declared the outbreak a global pandemic. As of January 22nd, 2022, the Johns Hopkins University COVID-19 dashboard reports over 338 million total confirmed cases worldwide. Although most people have mild to moderate symptoms, the disease can cause severe medical complications leading to death in some people. 
The Centers for Disease Control and Prevention (CDC) have identified several risk factors for severe COVID-19 illness, including people 65 years and older, individuals living in nursing homes or long-term care facilities, and those with serious underlying medical conditions. The Veteran population has a higher prevalence of several of the known risk factors for severe COVID-19 illness, such as advanced age, heart disease, and diabetes. Identif...","","https://precision.fda.gov/challenges/20","completed","intermediate","6","","2021-04-14","2022-01-28","2023-06-23 00:00:00","2023-10-14 05:40:19" -"153","tumor-mutational-burden-tmb-challenge-phase-2","Tumor Mutational Burden (TMB) Challenge Phase 2","The goal of the Friends of Cancer Research and precisionFDA Tumor Mutational...","Tumor mutational burden (TMB) is generally defined as the number of mutations detected in a patient's tumor sample per megabase of DNA sequenced. However different algorithms use different methods for calculating TMB. Mutations in genes in tumor cells may lead to the creation of neoantigens, which have the potential to activate an immune system response against the tumor, and the likelihood of an immune system response may increase with the number of mutations. Thus, TMB is a biomarker for some immunotherapy drugs, called immune checkpoint inhibitors, such as those that target the PD-1 and PD-L1 pathways (Chan et al., 2019). An outstanding problem is the lack of standardization for TMB calculation and reporting between different assays. To address this problem, the Friends of Cancer Research convened a working group of industry and regulatory stakeholders to develop guidance and tools for TMB harmonization. 
Results from the first phase of this effort were presented at AACR 2020 (s...","","https://precision.fda.gov/challenges/18","completed","intermediate","6","","2021-07-19","2021-09-12","2023-06-23 00:00:00","2023-10-14 05:40:20" +"148","disease-maps-to-modelling-covid-19","Disease Maps to Modelling COVID-19","Use the COVID-19 disease map to suggest drug candidates for repurposing that could be tested using the RWD dataset","The Disease Maps to modeling COVID-19 Challenge provides highly detailed expert-curated molecular mechanistic maps for COVID-19. Combine them with available omic data to expand the current biological knowledge on COVID-19 mechanism of infection and downstream consequences. The main topic for this year's challenge is drug repurposing with the possibility of Real World Data based validation of the most promising candidates suggested.","","http://camda2022.bioinf.jku.at/contest_dataset#disease_maps_to_modelling_covid-19","completed","intermediate","14","","\N","2022-05-20","2023-06-23 00:00:00","2023-10-14 05:40:15" +"149","crowdsourced-evaluation-of-inchi-based-tautomer-identification","Crowdsourced Evaluation of InChI-based Tautomer Identification","Calling on scientists from industry, government, and academia to test a modified InChI algorithm","This challenge focuses on the International Chemical Identifier (InChI), which was developed and is maintained under the auspices of the International Union of Pure and Applied Chemistry (IUPAC) and the InChI Trust. The InChI Trust, the IUPAC Working Group on Tautomers, and the U.S. Food and Drug Administration (FDA) call on the scientific community dealing with chemical repositories/data sets and analytics of compounds to test the recently modified InChI algorithm, which was designed for advanced recognition of tautomers. 
Participants will evaluate this algorithm against real chemical samples in this Crowdsourced Evaluation of InChI-based Tautomer Identification.","","https://precision.fda.gov/challenges/29","completed","intermediate","6","","2022-11-01","2023-03-01","2023-06-23 00:00:00","2023-10-14 05:40:15" +"150","nctr-indel-calling-from-oncopanel-sequencing-challenge-phase-2","NCTR Indel Calling from Oncopanel Sequencing Challenge Phase 2","Validate indel calling pipelines with frozen-in parameters on a third oncopanel sequencing dataset (Oncopanel X)","The high value of clinically actionable information obtained by oncopanel sequencing makes it a crucial tool for precision oncology[1,2]. With the surge in availability of oncopanels, it is critical to ensure that they have been thoroughly tested and are properly used. FDA has initiated the Sequencing Quality Control phase II (SEQC2) project[3] to develop standard analysis protocols and quality control metrics for fit-for-purpose use of Next Generation Sequencing (NGS) data including oncopanel sequencing to inform regulatory science research and precision medicine. The Oncopanel Sequencing Working Group of FDA-led SEQC2 has developed a reference sample[4] suitable for benchmarking oncopanels and comprehensively assessed the analytical performance of several oncopanels[1,2]. The genomic deoxyribonucleic acid (gDNA) reference sample was derived from ten Universal Human Reference RNA (UHRR, Agilent Technologies, Inc) cell-lines and made publicly available by Agilent. 
Substantial gen...","","https://precision.fda.gov/challenges/22","completed","intermediate","6","","2022-07-11","2022-07-26","2023-06-23 00:00:00","2023-10-17 23:18:17" +"151","nctr-indel-calling-from-oncopanel-sequencing-data-challenge-phase-1","NCTR Indel Calling from Oncopanel Sequencing Data Challenge Phase 1","Develop, validate, and benchmark indel calling pipelines to identify indels in oncopanel sequencing datasets","The high value of clinically actionable information obtained by oncopanel sequencing makes it a crucial tool for precision oncology[1,2]. With the surge in availability of oncopanels, it is critical to ensure that they have been thoroughly tested and are properly used. FDA has initiated the Sequencing Quality Control phase II (SEQC2) project[3] to develop standard analysis protocols and quality control metrics for fit-for-purpose use of Next Generation Sequencing (NGS) data including oncopanel sequencing to inform regulatory science research and precision medicine. The Oncopanel Sequencing Working Group of FDA-led SEQC2 has developed a reference sample[4] suitable for benchmarking oncopanels and comprehensively assessed the analytical performance of several oncopanels[1,2]. The genomic deoxyribonucleic acid (gDNA) reference sample was derived from ten Universal Human Reference RNA (UHRR, Agilent Technologies, Inc) cell-lines and made publicly available by Agilent. 
Substantial gen...","","https://precision.fda.gov/challenges/21","completed","intermediate","6","","2022-05-02","2022-07-08","2023-06-23 00:00:00","2023-10-17 23:18:21" +"152","vha-innovation-ecosystem-and-precisionfda-covid-19-risk-factor-modeling-challenge-phase-2","VHA Innovation Ecosystem and precisionFDA COVID-19 Risk Factor Modeling Challenge Phase 2","The focus of Phase 2 was to validate the top performing models on two additional VA sites' data.","The novel coronavirus disease 2019 (COVID-19) is a respiratory disease caused by a new type of coronavirus, known as “severe acute respiratory syndrome coronavirus 2,” or SARS-CoV-2. On March 11, 2020, the World Health Organization (WHO) declared the outbreak a global pandemic. As of January 22nd, 2022, the Johns Hopkins University COVID-19 dashboard reports over 338 million total confirmed cases worldwide. Although most people have mild to moderate symptoms, the disease can cause severe medical complications leading to death in some people. The Centers for Disease Control and Prevention (CDC) have identified several risk factors for severe COVID-19 illness, including people 65 years and older, individuals living in nursing homes or long-term care facilities, and those with serious underlying medical conditions. The Veteran population has a higher prevalence of several of the known risk factors for severe COVID-19 illness, such as advanced age, heart disease, and diabetes. Identif...","","https://precision.fda.gov/challenges/20","completed","intermediate","6","","2021-04-14","2022-01-28","2023-06-23 00:00:00","2023-10-14 05:40:19" +"153","tumor-mutational-burden-tmb-challenge-phase-2","Tumor Mutational Burden (TMB) Challenge Phase 2","Phase 2 of the challenge which focuses on evaluating various computational pipelines for TMB estimation","Tumor mutational burden (TMB) is generally defined as the number of mutations detected in a patient's tumor sample per megabase of DNA sequenced. 
However different algorithms use different methods for calculating TMB. Mutations in genes in tumor cells may lead to the creation of neoantigens, which have the potential to activate an immune system response against the tumor, and the likelihood of an immune system response may increase with the number of mutations. Thus, TMB is a biomarker for some immunotherapy drugs, called immune checkpoint inhibitors, such as those that target the PD-1 and PD-L1 pathways (Chan et al., 2019). An outstanding problem is the lack of standardization for TMB calculation and reporting between different assays. To address this problem, the Friends of Cancer Research convened a working group of industry and regulatory stakeholders to develop guidance and tools for TMB harmonization. Results from the first phase of this effort were presented at AACR 2020 (s...","","https://precision.fda.gov/challenges/18","completed","intermediate","6","","2021-07-19","2021-09-12","2023-06-23 00:00:00","2023-10-14 05:40:20" "154","predicting-gene-expression-using-millions-of-random-promoter-sequences","Predicting Gene Expression Using Millions of Random Promoter Sequences","Decoding gene expression regulation to understand disease.","Decoding how gene expression is regulated is critical to understanding disease. Regulatory DNA is decoded by the cell in a process termed “cis-regulatory logic”, where proteins called Transcription Factors (TFs) bind to specific DNA sequences within the genome and work together to produce as output a level of gene expression for downstream adjacent genes. This process is exceedingly complex to model as a large number of parameters is needed to fully describe the process (see Rationale, de Boer et al. 2020; Zeitingler J. 2020). Understanding the cis-regulatory logic of the human genome is an important goal and would provide insight into the origins of many diseases. 
However, learning models from human data is challenging due to limitations in the diversity of sequences present within the human genome (e.g. extensive repetitive DNA), the vast number of cell types that differ in how they interpret regulatory DNA, limited reporter assay data, and substantial technical biases present i...","","https://www.synapse.org/#!Synapse:syn28469146/wiki/617075","completed","intermediate","1","","2022-06-15","2022-08-07","2023-06-23 00:00:00","2023-10-14 05:40:21" "155","brats-2023","BraTS 2023","Benchmarking brain tumor segmentation with expanded dataset.","The International Brain Tumor Segmentation (BraTS) challenge. BraTS, since 2012, has focused on the generation of a benchmarking environment and dataset for the delineation of adult brain gliomas. The focus of this year’s challenge remains the generation of a common benchmarking environment, but its dataset is substantially expanded to ~4,500 cases towards addressing additional i) populations (e.g., sub-Saharan Africa patients), ii) tumors (e.g., meningioma), iii) clinical concerns (e.g., missing data), and iv) technical considerations (e.g., augmentations). Specifically, the focus of BraTS 2023 is to identify the current state-of-the-art algorithms for addressing (Task 1) the same adult glioma population as in the RSNA-ANSR-MICCAI BraTS challenge, as well as (Task 2) the underserved sub-Saharan African brain glioma patient population, (Task 3) intracranial meningioma, (Task 4) brain metastasis, (Task 5) pediatric brain tumor patients, (Task 6) global & local missing data, (Task 7...","","https://www.synapse.org/brats","completed","advanced","1","","2023-06-01","2023-08-25","2023-06-23 00:00:00","2023-10-26 23:20:21" -"156","cagi7","CAGI7","The seventh round of CAGI.","There have been six editions of CAGI experiments, held between 2010 and 2022. 
The seventh round of CAGI is planned to take place over the Summer of 2024.","","https://genomeinterpretation.org/challenges.html","upcoming","intermediate","1","","\N","\N","2023-08-04 21:47:38","2023-10-14 05:40:32" -"157","casp15","CASP15","Establish the state-of-art in modeling proteins and protein complexes.","CASP14 (2020) saw an enormous jump in the accuracy of single protein and domain models such that many are competitive with experiment. That advance is largely the result of the successful application of deep learning methods, particularly by the AlphaFold and, since that CASP, RosettaFold. As a consequence, computed protein structures are becoming much more widely used in a broadening range of applications. CASP has responded to this new landscape with a revised set of modeling categories. Some old categories have been dropped (refinement, contact prediction, and aspects of model accuracy estimation) and new ones have been added (RNA structures, protein ligand complexes, protein ensembles, and accuracy estimation for protein complexes). We are also strengthening our interactions with our partners CAPRI and CAMEO. We hope that these changes will maximize the insight that CASP15 provides, particularly in new applications of deep learning.","","https://predictioncenter.org/casp15/index.cgi","completed","intermediate","14","","2022-04-18","\N","2023-08-04 21:52:12","2023-09-28 23:09:59" -"158","synthrad2023","SynthRAD2023","Synthesizing computed tomography for radiotherapy.","This challenge aims to provide the first platform offering public data evaluation metrics to compare the latest developments in sCT generation methods. The accepted challenge design approved by MICCAI can be found at https://doi.org/10.5281/zenodo.7746019. 
A type 2 challenge will be run, where the participant needs to submit their algorithm packaged in a docker both for validation and test.","","https://synthrad2023.grand-challenge.org/","completed","intermediate","5","","2023-04-01","2023-08-22","2023-08-04 21:54:31","2023-10-26 23:20:24" -"159","synthetic-data-for-instrument-segmentation-in-surgery-syn-iss","Synthetic Data for Instrument Segmentation in Surgery (Syn-ISS)","Challenging Machine Learning in Surgical Instrument Segmentation with Synthe...","A common limitation noted by the surgical data science community is the size of datasets and the resources needed to generate training data at scale for building reliable and high-performing machine learning models. Beyond unsupervised and self-supervised approaches another solution within the broader machine learning community has been a growing volume of literature in the use of synthetic data (simulation) for training algorithms than can be applied to real world data. Synthetic data has multiple benefits like free groundtruth at large scale, possibility to collect larger sample of rare events, include anatomical variations, etc. A first step towards proving the validity of using synthetic data for real world applications is to demonstrate the feasibility within the simulation world itself. Our proposed challenge is to train machine learning methods for instrument segmentation using synthetic datasets and test their performance on synthetic datasets. That is, the challenge parti...","","https://www.synapse.org/#!Synapse:syn50908388/wiki/620516","completed","intermediate","1","","2023-07-19","2023-09-07","2023-08-04 23:49:44","2023-10-26 23:20:28" -"160","pitvis","PitVis","Surgical workflow and instrument recognition in endonasal surgery.","The pituitary gland, found just off the base of the brain, is commonly known as “the master gland”, performing essential functions required for sustaining human life. 
Clinically relevant tumours that have grown on the pituitary gland have an estimated prevalence of 1 in 1000 of the population, and if left untreated can be life-limiting. The “gold standard” treatment is endoscopic pituitary surgery, where the tumour is directly removed by entering through a nostril. This surgery is particularly challenging due to the small working space which limits both vision and instrument manoeuvrability and thus can lead to poor surgical technique causing adverse outcomes for the patient. Computer-assisted intervention can help overcome these challenges by providing guidance for senior surgeons and operative staff during surgery, and for junior surgeons during training.","","https://www.synapse.org/#!Synapse:syn51232283/wiki/","completed","intermediate","1","","2023-06-29","2023-09-10","2023-08-04 23:58:01","2023-10-26 23:20:30" -"161","mvseg2023","MVSEG2023","Automatically segment mitral valve leaflets from single frame 3D trans-esoph...","Mitral valve (MV) disease is a common pathologic problem occurring in approximately 2 % of the general population but climbing to 10 % in those over the age of 75. The preferred intervention for mitral regurgitation is valve repair, due to superior patient outcomes compared to those following valve replacement. Mitral valve interventions are technically challenging due to the functional and anatomical complexity of mitral pathologies. Repair must be tailored to the patient-specific anatomy and pathology, which requires considerable expert training and experience. Automatic segmentation of the mitral valve leaflets from 3D transesophageal echocardiography (TEE) may play an important role in treatment planning, as well as physical and computational modelling of patient-specific valve pathologies and potential repair approaches. 
This may have important implications in the drive towards personalized care and has the potential to impact clinical outcomes for those undergoing mitral val...","","https://www.synapse.org/#!Synapse:syn51186045/wiki/621356","completed","intermediate","1","","2023-05-29","2023-08-07","2023-08-05 0-04-36","2023-09-28 23:12:19" -"162","crossmoda23","crossMoDA23","This challenge proposes is the third edition of the first medical imaging be...","Domain Adaptation (DA) has recently raised strong interest in the medical imaging community. By encouraging algorithms to be robust to unseen situations or different input data domains, Domain Adaptation improves the applicability of machine learning approaches to various clinical settings. While a large variety of DA techniques has been proposed, most of these techniques have been validated either on private datasets or on small publicly available datasets. Moreover, these datasets mostly address single-class problems. To tackle these limitations, the crossMoDA challenge introduced the first large and multi-class dataset for unsupervised cross-modality Domain Adaptation. From an application perspective, crossMoDA focuses on MRI segmentation for Vestibular Schwannoma. Compared to the previous crossMoDA instance, which made use of multi-institutional data acquired in controlled conditions for radiosurgery planning and focused on a 2 class segmentation task (tumour and cochlea), the...","","https://www.synapse.org/#!Synapse:syn51236108/wiki/621615","completed","intermediate","1","","2023-04-15","2023-07-10","2023-08-05 0-13-23","2023-10-12 18:10:18" -"163","icr-identify-age-related-conditions","ICR - Identifying Age-Related Conditions","Use Machine Learning to detect conditions with measurements of anonymous cha...","The goal of this competition is to predict if a person has any of three medical conditions. 
You are being asked to predict if the person has one or more of any of the three medical conditions (Class 1), or none of the three medical conditions (Class 0). You will create a model trained on measurements of health characteristics. To determine if someone has these medical conditions requires a long and intrusive process to collect information from patients. With predictive models, we can shorten this process and keep patient details private by collecting key characteristics relative to the conditions, then encoding these characteristics.","","https://www.kaggle.com/competitions/icr-identify-age-related-conditions","completed","intermediate","8","","2023-05-11","2023-08-10","2023-08-05 0-32-01","2023-10-12 18:15:08" -"164","cafa-5-protein-function-prediction","CAFA 5: Protein Function Prediction","Predict the biological function of a protein.","The goal of this competition is to predict the function of a set of proteins. You will develop a model trained on the amino-acid sequences of the proteins and on other data. Your work will help ​​researchers better understand the function of proteins, which is important for discovering how cells, tissues, and organs work. This may also aid in the development of new drugs and therapies for various diseases.","","https://www.kaggle.com/competitions/cafa-5-protein-function-prediction","completed","intermediate","8","","2023-04-18","2023-08-21","2023-08-05 5-18-40","2023-10-19 00:13:14" -"165","rsna-2023-abdominal-trauma-detection","RSNA 2023 Abdominal Trauma Detection","Detect and classify traumatic abdominal injuries.","Traumatic injury is the most common cause of death in the first four decades of life and a major public health problem around the world. There are estimated to be more than 5 million annual deaths worldwide from traumatic injury. 
Prompt and accurate diagnosis of traumatic injuries is crucial for initiating appropriate and timely interventions, which can significantly improve patient outcomes and survival rates. Computed tomography (CT) has become an indispensable tool in evaluating patients with suspected abdominal injuries due to its ability to provide detailed cross-sectional images of the abdomen. Interpreting CT scans for abdominal trauma, however, can be a complex and time-consuming task, especially when multiple injuries or areas of subtle active bleeding are present. This challenge seeks to harness the power of artificial intelligence and machine learning to assist medical professionals in rapidly and precisely detecting injuries and grading their severity. The development...","","https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection","completed","intermediate","8","","2023-07-26","2023-10-13","2023-08-05 5-24-09","2023-09-28 23:14:12" -"166","hubmap-hacking-the-human-vasculature","HuBMAP: Hacking the Human Vasculature","Segment instances of microvascular structures from healthy human kidney tiss...","The goal of this competition is to segment instances of microvascular structures, including capillaries, arterioles, and venules. You'll create a model trained on 2D PAS-stained histology images from healthy human kidney tissue slides. 
Your help in automating the segmentation of microvasculature structures will improve researchers' understanding of how the blood vessels are arranged in human tissues.","","https://www.kaggle.com/competitions/hubmap-hacking-the-human-vasculature","completed","intermediate","8","","2023-05-22","2023-07-31","2023-08-05 5-31-12","2023-10-12 18:15:00" -"167","amp-parkinsons-disease-progression-prediction","AMP(R)-Parkinson's Disease Progression Prediction","Use protein and peptide data measurements from Parkinson's Disease patients ...","The goal of this competition is to predict MDS-UPDR scores, which measure progression in patients with Parkinson's disease. The Movement Disorder Society-Sponsored Revision of the Unified Parkinson's Disease Rating Scale (MDS-UPDRS) is a comprehensive assessment of both motor and non-motor symptoms associated with Parkinson's. You will develop a model trained on data of protein and peptide levels over time in subjects with Parkinson’s disease versus normal age-matched control subjects. Your work could help provide important breakthrough information about which molecules change as Parkinson’s disease progresses.","","https://www.kaggle.com/competitions/amp-parkinsons-disease-progression-prediction","completed","intermediate","8","","2023-02-16","2023-05-18","2023-08-05 5-37-12","2023-10-10 19:52:34" -"168","open-problems-multimodal","Open Problems -Multimodal Single-Cell Integration","Predict how DNA, RNA & protein measurements co-vary in single cells.","The goal of this competition is to predict how DNA, RNA, and protein measurements co-vary in single cells as bone marrow stem cells develop into more mature blood cells. You will develop a model trained on a subset of 300,000-cell time course dataset of CD34+ hematopoietic stem and progenitor cells (HSPC) from four human donors at five time points generated for this competition by Cellarity, a cell-centric drug creation company. 
In the test set, taken from an unseen later time point in the dataset, competitors will be provided with one modality and be tasked with predicting a paired modality measured in the same cell. The added challenge of this competition is that the test data will be from a later time point than any time point in the training data. Your work will help accelerate innovation in methods of mapping genetic information across layers of cellular state. If we can predict one modality from another, we may expand our understanding of the rules governing these complex re...","","https://www.kaggle.com/competitions/open-problems-multimodal","completed","intermediate","8","","2022-08-15","2022-11-15","2023-08-05 5-43-25","2023-10-10 19:52:41" -"169","multi-atlas-labeling-beyond-the-cranial-vault","Multi-Atlas Labeling Beyond the Cranial Vault","Innovative Multi-Atlas Labeling for Soft Tissue Segmentation on Clinical CT","Multi-atlas labeling has proven to be an effective paradigm for creating segmentation algorithms from training data. These approaches have been extraordinarily successful for brain and cranial structures (e.g., our prior MICCAI workshops-MLSF’11, MAL’12, SATA’13). After the original challenges closed, the data continue to drive scientific innovation; 144 groups have registered for the 2012 challenge (brain only) and 115 groups for the 2013 challenge (brain/heart/canine leg). However, innovation in application outside of the head and to soft tissues has been more limited. This workshop will provide a snapshot of the current progress in the field through extended discussions and provide researchers an opportunity to characterize their methods on a newly created and released standardized dataset of abdominal anatomy on clinically acquired CT. The datasets will be freely available both during and after the challenge. 
We have two separate new challenges-abdomen and cervix on routinely ...","","https://www.synapse.org/#!Synapse:syn3193805/wiki/89480","active","intermediate","1","","2015-04-15","\N","2023-08-07 20:21:22","2023-10-10 19:52:39" -"170","hubmap-organ-segmentation","HuBMAP + HPA: Hacking the Human Body","Segment multi-organ functional tissue units.","In this competition, you’ll identify and segment functional tissue units (FTUs) across five human organs. You'll build your model using a dataset of tissue section images, with the best submissions segmenting FTUs as accurately as possible. If successful, you'll help accelerate the world’s understanding of the relationships between cell and tissue organization. With a better idea of the relationship of cells, researchers will have more insight into the function of cells that impact human health. Further, the Human Reference Atlas constructed by HuBMAP will be freely available for use by researchers and pharmaceutical companies alike, potentially improving and prolonging human life.","","https://www.kaggle.com/competitions/hubmap-organ-segmentation","completed","intermediate","8","","2022-06-22","2022-09-22","2023-08-08 16:30:22","2023-11-02 18:44:27" -"171","hubmap-kidney-segmentation","HuBMAP: Hacking the Kidney","Identify glomeruli in human kidney tissue images.","This competition, “Hacking the Kidney, starts by mapping the human kidney at single cell resolution. Your challenge is to detect functional tissue units (FTUs) across different tissue preparation pipelines. An FTU is defined as a “three-dimensional block of cells centered around a capillary, such that each cell in this block is within diffusion distance from any other cell in the same block” ([de Bono, 2013](https://www.ncbi.nlm.nih.gov/pubmed/24103658)). The goal of this competition is the implementation of a successful and robust glomeruli FTU detector. 
You will also have the opportunity to present your findings to a panel of judges for additional consideration. Successful submissions will construct the tools, resources, and cell atlases needed to determine how the relationships between cells can affect the health of an individual. Advancements in HuBMAP will accelerate the world’s understanding of the relationships between cell and tissue organization and function and human health.","","https://www.kaggle.com/competitions/hubmap-kidney-segmentation","completed","intermediate","8","","2020-11-16","2021-05-10","2023-08-08 17:31:46","2023-10-12 18:14:16" -"172","ventilator-pressure-prediction","Google Brain: Ventilator Pressure Prediction","Simulate a ventilator connected to a sedated patient's lung.","In this competition, you’ll simulate a ventilator connected to a sedated patient's lung. The best submissions will take lung attributes compliance and resistance into account. If successful, you'll help overcome the cost barrier of developing new methods for controlling mechanical ventilators. This will pave the way for algorithms that adapt to patients and reduce the burden on clinicians during these novel times and beyond. As a result, ventilator treatments may become more widely available to help patients breathe.","","https://www.kaggle.com/competitions/ventilator-pressure-prediction","completed","intermediate","8","","2021-09-22","2021-11-03","2023-08-08 17:53:33","2023-11-02 18:44:22" -"173","stanford-covid-vaccine","OpenVaccine - COVID-19 mRNA Vaccine Degradation Prediction","Urgent need to bring the COVID-19 vaccine to mass production.","In this competition, we are looking to leverage the data science expertise of the Kaggle community to develop models and design rules for RNA degradation. 
Your model will predict likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position. We will then score your models on a second generation of RNA sequences that have just been devised by Eterna players for COVID-19 mRNA vaccines. These final test sequences are currently being synthesized and experimentally characterized at Stanford University in parallel to your modeling efforts--Nature will score your models!","","https://www.kaggle.com/competitions/stanford-covid-vaccine","completed","intermediate","8","","2020-09-10","2020-10-06","2023-08-08 18:06:17","2023-10-12 18:14:27" -"174","openvaccine","OpenVaccine","A research initiative aimed at developing innovative design principles for R...","mRNA vaccines are a relatively new technology that have come into the limelight with the onset of COVID-19. They were the first COVID-19 vaccines to start clinical trials (initially formulated in a matter of days) and the first to be approved and distributed. mRNA vaccines have the potential to transform immunization, being significantly faster to formulate and produce, cheaper, and more effective-including against mutant strains. However, there is one key bottleneck to their widespread viability and our ability to immunize the entire world-poor refrigerator stability in prefilled syringes. The OpenVaccine challenge aims to allow a worldwide community of game players to create an enhanced vaccine to be injected into millions of people. The challenge-design an mRNA that codes for the same amino acid sequence of the spike protein, but is 2x-10x+ more stable. 
Through a number of academic partnerships and the launch of a Kaggle machine learning challenge to create best-in-class algori...","","https://eternagame.org/challenges/10845741","completed","intermediate","13","https://doi.org/10.1038/s41467-022-28776-w","\N","2021-12-12","2023-08-08 18:22:49","2023-09-28 23:17:02" -"175","opentb","OpenTB","We aim to gain fundamental insights into the ribosome's RNA sequence-folding.","OpenTB used a recently reported gene signature for active tuberculosis based on three RNAs in the blood. This signature could form the basis for a fast, color-based test for TB, similar to an over-the-counter pregnancy test. What was needed was a sensor that could detect the concentrations of three RNAs, carry out the needed calculation, and report the result by binding another molecule. Over four rounds, players designed RNA sensors that can do the math on these 3 genes. Through experimental feedback, they honed their skills and techniques, which resulted in the creation of multiple designs that have been shown to be successful. These findings are being prepared to be published, and future work will be done to develop diagnostic devices integrating these designs","","https://eternagame.org/challenges/10845742","completed","intermediate","13","","2016-05-04","2018-04-15","2023-08-08 18:43:09","2023-09-28 23:17:09" -"176","opencrispr","OpenCRISPR","Can you improve the algorithm that classifies drugs based on their biologica...","CRISPR gene editing is a RNA-based method that can target essentially any gene in a living organism for genetic changes. Since its first demonstration, CRISPR has been revolutionizing biology and promises to change how we tackle numerous human diseases from malaria to cancer. Stanford's Center for Personal Dynamic Regulomes and UC Berkeley's Innovative Genomics Institute have challenged Eterna players to solve a remaining hurdle in making this technology safe for use. 
Scientists want the power to turn on and off CRISPR on demand with small molecules. This is almost a perfect match to the small-molecule switches that the Eterna community has worked on. In fact, the MS2 RNA hairpin often used in Eterna is routinely used to recruit new functionality to CRISPR complexes through other molecules tethered to the MS2 protein. The puzzles began with OpenCRISPR Controls, looking for solutions to lock in or lock out the MS2 RNA hairpin within a special loop in the CRISPR RNA. We hope the res...","","https://eternagame.org/challenges/10845743","completed","intermediate","13","https://doi.org/10.1021/acssynbio.9b00142","2017-08-26","\N","2023-08-08 18:43:14","2023-10-10 19:57:07" -"177","openknot","OpenKnot","CellSignal - Disentangling biological signal from experimental noise in cell...","RNA pseudoknots have significant biological importance in various processes. They participate in gene regulation by influencing translation initiation or termination in mRNA molecules. Pseudoknots also play a role in programmed ribosomal frameshifting, leading to the production of different protein products from a single mRNA. RNA viruses, including SARS-CoV-2 and Dengue virus, utilize pseudoknots to regulate their replication and control the synthesis of viral proteins. Additionally, certain RNA molecules with pseudoknot structures exhibit enzymatic activity, acting as ribozymes and catalyzing biochemical reactions. These functions highlight the crucial role of RNA pseudoknots in gene expression, proteomic diversity, viral replication, and enzymatic processes. Several unanswered scientific questions surround RNA pseudoknots. One key area of inquiry is understanding the folding pathways of pseudoknots and how they form from linear RNA sequences. 
Elucidating the structural dynamics...","","https://eternagame.org/challenges/11843006","active","intermediate","13","","2022-06-17","\N","2023-08-08 18:43:22","2023-10-10 19:52:53" -"178","openaso","OpenASO","Event detection from wearable sensor data.","The DNA genome is the blueprint for building and operating cells, but this information must be decoded into RNA molecules to be useful. Transcription is the process of decoding DNA genomic information into RNA, resulting in RNA transcripts. Genes are specific sequences of DNA that contain information to produce a specific RNA transcript. The fate of most mRNA molecules in the cell is to be translated by ribosomes into protein molecules. However, mRNA splicing is a crucial step that occurs between the formation of an RNA transcript and protein translation. This step is essential because genes contain non-protein coding introns and protein-coding exons. Splicing removes introns and joins exons to produce a mature mRNA molecule that can be decoded into the correct protein molecule. When the splicing process is corrupted due to genetic mutations, the resulting RNA can become toxic, leading to the synthesis of non-functional proteins or no protein at all, causing various human diseases...","","https://eternagame.org/challenges/11546273","active","intermediate","13","","2023-02-20","\N","2023-08-08 18:43:25","2023-10-10 19:52:57" -"179","openribosome","OpenRibosome","AI competition seeks cancer diagnosis and treatment solutions.","Our modern world has many challenges-challenges like climate change, increasing waste production, and human health. Imagine-we could replace petrochemistry with biology, single-use plastics with selectively degradable polymers, broad chemotherapeutics with targeted medicines for fighting specific cancer cells, and complex health equipment with point-of-care diagnostics. These innovations and many more can empower us to confront the challenges affecting humanity, our world, and beyond. 
But how do we actually create these smart materials and medicines? Is it possible to do so by repurposing one of Nature's molecular machines? We think we can. The answer? Customized ribosomes. In Nature, ribosomes are the catalysts for protein assembly. And proteins are more or less similar, chemically, to the smart materials and medicines we want to synthesize. If we could modify ribosomes to build polymers with diverse components-beyond the canonical amino acids us","","https://eternagame.org/challenges/11043833","active","intermediate","13","https://doi.org/10.1038/s41467-023-35827-3","2019-01-31","\N","2023-08-08 18:43:27","2023-10-10 19:53:01" -"180","lish-moa","Mechanisms of Action (MoA) Prediction","Segmenting Cerebral Arteries from 3D Angiography Images.","Can you improve the algorithm that classifies drugs based on their biological activity?","","https://www.kaggle.com/competitions/lish-moa","completed","intermediate","8","","2020-09-03","2020-11-30","2023-08-08 19:09:31","2023-09-28 23:18:04" -"181","recursion-cellular-image-classification","Recursion Cellular Image Classification","Challenge compares Circle of Willis classification methods.","This competition will have you disentangling experimental noise from real biological signals. Your entry will classify images of cells under one of 1,108 different genetic perturbations. You can help eliminate the noise introduced by technical execution and environmental variation between experiments. If successful, you could dramatically improve the industry’s ability to model cellular images according to their relevant biology. 
In turn, applying AI could greatly decrease the cost of treatments, and ensure these treatments get to patients faster.","","https://www.kaggle.com/competitions/recursion-cellular-image-classification","completed","intermediate","8","","2019-06-27","2019-09-26","2023-08-08 19:38:42","2023-10-10 19:53:05" -"182","tlvmc-parkinsons-freezing-gait-prediction","Parkinson's Freezing of Gait Prediction","The US Food and Drug Administration (FDA) calls on stakeholders, including t...","The goal of this competition is to detect freezing of gait (FOG), a debilitating symptom that afflicts many people with Parkinson’s disease. You will develop a machine learning model trained on data collected from a wearable 3D lower back sensor. Your work will help researchers better understand when and why FOG episodes occur. This will improve the ability of medical professionals to optimally evaluate, monitor, and ultimately, prevent FOG events.","","https://www.kaggle.com/competitions/tlvmc-parkinsons-freezing-gait-prediction","completed","intermediate","8","","2023-03-09","2023-06-08","2023-08-08 19:47:54","2023-10-10 19:53:08" -"183","chaimeleon","CHAIMELEON Open Challenges","The Veterans Health Administration Innovation Ecosystem, the Digital Health ...","The CHAIMELEON Open Challenges is a competition designed to train and refine AI models to answer clinical questions about five types of cancer-prostate, lung, breast, colon, and rectal. Participants are challenged to collaborate and develop innovative AI-powered solutions that can significantly impact cancer diagnosis, management, and treatment. They will be evaluated considering a balance between the performance of their AI algorithms to predict different clinical endpoints such as disease staging, treatment response or progression free survival and their trustworthiness. The challenges are open to the whole scientific and tech community interested in AI. 
They are a unique opportunity to showcase how AI can be used to advance medical research and improve patient outcomes within the CHAIMELEON project.","","https://chaimeleon.grand-challenge.org/","active","intermediate","5","","2023-11-03","2023-12-31","2023-08-09 17:13:09","2023-11-02 15:33:27" -"184","topcow23","Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA","Predicting High Risk Breast Cancer - a Nightingale OS & AHLI data challenge.","The aim of the challenge is to extract the CoW angio-architecture from 3D angiographic imaging by segmentation of the vessel components. There are two sub-tasks-binary segmentation of CoW vessels, and multi-class CoW anatomical segmentation. We release a new dataset of joint-modalities, CTA and MRA of the same patient cohort, both with annotations of the anatomy of CoW. Our challenge has two tracks for the same segmentation task, namely CTA track and MRA track. We made use of the clinical information from both modalities during our annotation. 
And participants can pick whichever modality they want, both CTA and MRA, and choose to tackle the task for either modality.","","https://topcow23.grand-challenge.org/","completed","intermediate","5","","2023-08-20","2023-09-25","2023-08-09 17:16:22","2023-09-28 23:24:41" -"185","circle-of-willis-intracranial-artery-classification-and-quantification-challenge-2023","Circle of Willis Intracranial Artery Classification and Quantification Challenge 2023","Predicting High Risk Breast Cancer - a Nightingale OS & AHLI data challenge.","The purpose of this challenge is to compare automatic methods for classification of the circle of Willis (CoW) configuration and quantification of the CoW major artery diameters and bifurcation angles.","","https://crown.isi.uu.nl/","completed","intermediate","14","","2023-05-01","2023-08-15","2023-08-09 22:13:24","2023-09-28 23:24:54" -"186","making-sense-of-electronic-health-record-ehr-race-and-ethnicity-data","Making Sense of Electronic Health Record (EHR) Race and Ethnicity Data","Predicting the connectivity and properties of in-silico networks.","The urgency of the coronavirus disease 2019 (COVID-19) pandemic has heightened interest in the use of real-world data (RWD) to obtain timely information about patients and populations and has focused attention on EHRs. The pandemic has also heightened awareness of long-standing racial and ethnic health disparities along a continuum from underlying social determinants of health, exposure to risk, access to insurance and care, quality of care, and responses to treatments. This highlighted the potential that EHRs can be used to describe and contribute to our understanding of racial and ethnic health disparities and their solutions. 
The OMB Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity provides minimum standards for maintaining, collecting, and presenting data on race and ethnicity for all Federal reporting purposes, and defines the two separate constructs of race and ethnicity.","","https://precision.fda.gov/challenges/30","completed","intermediate","6","","2023-05-31","2023-06-23","2023-08-10 18:28:06","2023-10-10 19:53:12" -"187","the-veterans-cardiac-health-and-ai-model-predictions-v-champs","The Veterans Cardiac Health and AI Model Predictions (V-CHAMPS)","The goal of the in silico challenges is the reverse engineering of gene netw...","To better understand the risk and protective factors in the Veteran population, the VHA IE and its collaborating partners are calling upon the public to develop AI/ML models to predict cardiovascular health outcomes, including readmission and mortality, using synthetically generated Veteran health records. The Challenge consists of two Phases-Phase 1 is focused on synthetic data. In this Phase of the Challenge, AI/ML models will be developed by Challenge participants and trained and tested on the synthetic data sets provided to them, with a view towards predicting outcome variables for Veterans who have been diagnosed with chronic heart failure (please note that in Phase 1, the data is synthetic Veteran health records). Phase 2 will focus on validating and further exploring the limits of the AI/ML models. During this Phase, high-performing AI/ML models from Phase 1 will be brought into the VA system and validated on the real-world Veterans health data within the VHA. 
These models...","","https://precision.fda.gov/challenges/31","completed","intermediate","6","","2023-05-25","2023-08-02","2023-08-10 21:41:10","2023-09-28 23:25:45" -"188","predicting-high-risk-breast-cancer-phase-1","Predicting High Risk Breast Cancer - Phase 1","The goal of the in silico network challenge is to reverse engineer gene regu...","Every year, 40 million women get a mammogram; some go on to have an invasive biopsy to better examine a concerning area. Underneath these routine tests lies a deep—and disturbing—mystery. Since the 1990s, we have found far more ‘cancers’, which has in turn prompted vastly more surgical procedures and chemotherapy. But death rates from metastatic breast cancer have hardly changed. When a pathologist looks at a biopsy slide, she is looking for known signs of cancer-tubules, cells with atypical looking nuclei, evidence of rapid cell division. These features, first identified in 1928, still underlie critical decisions today-which women must receive urgent treatment with surgery and chemotherapy? And which can be prescribed “watchful waiting”, sparing them invasive procedures for cancers that would not harm them? There is already evidence that algorithms can predict which cancers will metastasize and harm patients on the basis of the biopsy image. Fascinatingly, these algorithms also h...","","https://app.nightingalescience.org/contests/3jmp2y128nxd","completed","intermediate","15","","2022-06-01","2023-01-12","2023-08-22 17:07:00","2023-10-12 17:55:10" -"189","predicting-high-risk-breast-cancer-phase-2","Predicting High Risk Breast Cancer - Phase 2","The goal of this Network Inference Challenge is to reverse engineer gene reg...","Every year, 40 million women get a mammogram; some go on to have an invasive biopsy to better examine a concerning area. Underneath these routine tests lies a deep—and disturbing—mystery. 
Since the 1990s, we have found far more ‘cancers’, which has in turn prompted vastly more surgical procedures and chemotherapy. But death rates from metastatic breast cancer have hardly changed. When a pathologist looks at a biopsy slide, she is looking for known signs of cancer-tubules, cells with atypical looking nuclei, evidence of rapid cell division. These features, first identified in 1928, still underlie critical decisions today-which women must receive urgent treatment with surgery and chemotherapy? And which can be prescribed “watchful waiting”, sparing them invasive procedures for cancers that would not harm them? There is already evidence that algorithms can predict which cancers will metastasize and harm patients on the basis of the biopsy image. Fascinatingly, these algorithms also...","","https://app.nightingalescience.org/contests/vd8g98zv9w0p","completed","intermediate","15","","2023-02-03","2023-05-13","2023-08-22 17:07:01","2023-10-12 17:55:08" -"190","dream-2-in-silico-network-inference","DREAM 2 - In Silico Network Inference","Identify dates in clinical notes.","Three in-silico networks were created and endowed with a dynamics that simulate biological interactions. The challenge consists of predicting the connectivity and some of the properties of one or more of these three networks.","","https://www.synapse.org/#!Synapse:syn2825394/wiki/71150","completed","intermediate","1","","2007-03-25","\N","2023-08-24 18:54:05","2023-10-12 17:55:03" -"191","dream-3-in-silico-network-challenge","DREAM 3 - In Silico Network Challenge","Identify person names in clinical notes.","The goal of the in silico challenges is the reverse engineering of gene networks from steady state and time series data. 
Participants are challenged to predict the directed unsigned network topology from the given in silico generated gene topic_3170sets.","","https://www.synapse.org/#!Synapse:syn2853594/wiki/71567","completed","intermediate","1","https://doi.org/10.1089/cmb.2008.09TT","2008-06-09","\N","2023-08-25 16:43:41","2023-10-12 17:55:02" -"192","dream-4-in-silico-network-challenge","DREAM 4 - In Silico Network Challenge","Identify location information in clinical notes.","The goal of the in silico network challenge is to reverse engineer gene regulation networks from simulated steady-state and time-series data. Participants are challenged to infer the network structure from the given in silico gene topic_3170sets. Optionally, participants may also predict the response of the networks to a set of novel perturbations that were not included in the provided datasets.","","https://www.synapse.org/#!Synapse:syn3049712/wiki/74628","completed","intermediate","1","https://doi.org/10.1073/pnas.0913357107","2009-06-09","\N","2023-08-25 16:43:42","2023-10-12 17:55:00" -"193","dream-5-network-inference-challenge","DREAM 5 - Network Inference Challenge","Identify contact information in clinical notes.","The goal of this Network Inference Challenge is to reverse engineer gene regulatory networks from gene topic_3170sets. Participants are given four microarray compendia and are challenged to infer the structure of the underlying transcriptional regulatory networks. Three of the four compendia were obtained from microorganisms, some of which are pathogens of clinical relevance. The fourth compendium is based on an in-silico (i.e., simulated) network. Each compendium consists of hundreds of microarray experiments, which include a wide range of genetic, drug, and environmental perturbations (or in the in-silico network case, simulations thereof). 
Network predictions will be evaluated on a subset of known interactions for each organism, or on the known network for the in-silico case.","","https://www.synapse.org/#!Synapse:syn2787209/wiki/70349","completed","intermediate","1","https://doi.org/10.1038/nmeth.2016","2010-06-09","2010-10-31","2023-08-25 16:43:43","2023-10-12 17:54:57" -"194","nlp-sandbox-date-annotation","NLP Sandbox Date Annotation","Identify identifiers in clinical notes.","An NLP Sandbox Date Annotator takes as input a clinical note and outputs a list of predicted date annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-08-25 16:45:22","2023-09-28 23:59:02" -"195","nlp-sandbox-person-name-annotation","NLP Sandbox Person Name Annotation","Predict BCL6 transcriptomic targets from biological data.","An NLP Sandbox Person Name Annotator takes as input a clinical note and outputs a list of predicted person name annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:20","2023-09-28 23:59:20" -"196","nlp-sandbox-location-annotation","NLP Sandbox Location Annotation","Predict a protein-protein interaction network of 47 proteins.","An NLP Sandbox Location Annotator takes as input a clinical note and outputs a list of predicted location annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:21","2023-09-28 23:59:21" -"197","nlp-sandbox-contact-annotation","NLP Sandbox Contact Annotation","Reconstruct genome-scale networks from microarray data.","An NLP Sandbox contact annotator takes as input a clinical note and outputs a list of 
predicted contact annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:22","2023-09-28 23:59:21" -"198","nlp-sandbox-id-annotation","NLP Sandbox ID Annotation","Inferring five-gene networks from synthetic data.","An NLP Sandbox ID annotator takes as input a clinical note and outputs a list of predicted ID annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:22","2023-09-28 23:59:22" -"199","dream-2-bcl6-transcriptomic-target-prediction","DREAM 2 - BCL6 Transcriptomic Target Prediction","Inferring signaling cascade dynamics from flow cytometry data.","A number of potential transcriptional targets of BCL6, a gene that encodes for a transcription factor active in B cells, have been identified with ChIP-on-chip data and functionally validated by perturbing the BCL6 pathway with CD40 and anti-IgM, and by over-expressing exogenous BCL6 in Ramos cell. We subselected a number of targets found in this way (the gold standard positive set), and added a number decoys (genes that have no evidence of being BCL6 targets, named the gold standard negative set), compiling a list of 200 genes in total. 
Given this list of 200 genes, the challenge consists of identifying which ones are the true targets and which ones are the decoys, using an independent panel of gene topic_3170.","","https://www.synapse.org/#!Synapse:syn3034857/wiki/","completed","intermediate","1","https://doi.org/10.1073/pnas.0437996100","2007-04-19","\N","2023-09-12 21:26:22","2023-10-12 17:53:55" -"200","dream-2-protein-protein-interaction-network-inference","DREAM 2 - Protein-Protein Interaction Network Inference","Predicting gene expression from gene datasets.","For many pairs of bait and prey genes, yeast protein-protein interactions were tested in an unbiased fashion using a high saturation, high-stringency variant of the yeast two-hybrid (Y2H) method. A high confidence subset of gene pairs that were found to interact in at least three repetitions of the experiment but that hadn’t been reported in the literature was extracted. There were 47 yeast genes involved in these pairs. Including self interactions, there are a total of 47*48/2 possible pairs of genes that can be formed with these 47 genes. As mentioned above some of these gene pairs were seen to consistently interact in at least three repetitions of the Y2H experiments-these gene pairs form the gold standard positive set. A second set among these gene pairs were seen never to interact in repeated experiments and were not reported as interacting in the literature; we call this the gold standard negative set. 
Finally in a third set of gene pairs, which we shall call the undecided s...","","https://www.synapse.org/#!Synapse:syn2825374/wiki/","completed","intermediate","1","https://doi.org/10.1126/science.1158684","2007-05-24","\N","2023-09-12 21:26:28","2023-10-12 17:54:00" -"201","dream-2-genome-scale-network-inference","DREAM 2 - Genome-Scale Network Inference","Cell-type specific high-throughput experimental data.","A panel of single-channel microarrays was collected for a particular microorganism, including some already published and some in-print data. The data was appropriately normalized (to the logarithmic scale). The challenge consists of reconstructing a genome-scale transcriptional network for this organism. The accuracy of network inference will be judged using chromatin precipitation and otherwise experimentally verified Transcription Factor (TF)-target interactions.","","https://www.synapse.org/#!Synapse:syn3034894/wiki/74418","completed","intermediate","1","https://doi.org/10.1371/journal.pbio.0050008","2007-06-05","2007-10-31","2023-09-12 21:26:34","2023-10-12 17:54:03" -"202","dream-2-synthetic-five-gene-network-inference","DREAM 2 - Synthetic Five-Gene Network Inference","Predict missing protein concentrations from a large corpus of measurements.","A synthetic-biology network consisting of 5 interacting genes was created and transfected to an in-vivo model organism. 
The challenge consists of predicting the connectivity of the five-gene network from in-vivo measurements.","","https://www.synapse.org/#!Synapse:syn3034869/wiki/74411","completed","intermediate","1","https://doi.org/10.1016/j.cell.2009.01.055","2007-06-20","2007-10-31","2023-09-12 21:26:56","2023-10-12 17:54:05" -"203","dream-3-signaling-cascade-identification","DREAM 3 - Signaling Cascade Identification","Predict binding specificity of peptide-antibody interactions.","The concentration of four intracellular proteins or phospho-proteins (X1, X2, X3 and X4) participating in a signaling cascade were measured in about 104 cells by antibody staining and flow cytometry. The idea of this challenge is to explore what key aspects of the dynamics and topology of interactions of a signaling cascade can be inferred from incomplete flow cytometry data.","","https://www.synapse.org/#!Synapse:syn3033068/wiki/74362","completed","intermediate","1","","2008-06-01","2008-10-31","2023-09-12 21:27:04","2023-10-12 17:54:08" -"204","dream-3-gene-expression-prediction","DREAM 3 - Gene Expression Prediction","Predict binding intensities for transcription factors from motifs.","Gene expression time course data is provided for four different strains of yeast (S. Cerevisiae), after perturbation of the cells. The challenge is to predict the rank order of induction/repression of a small subset of genes (the prediction targets) in one of the four strains, given complete data for three of the strains, and data for all genes except the prediction targets in the other strain. 
You are also allowed to use any information that is in the public domain and are expected to be forthcoming about what information was used.","","https://www.synapse.org/#!Synapse:syn3033083/wiki/74369","completed","intermediate","1","","2008-06-01","2008-10-31","2023-09-12 21:27:12","2023-10-12 17:54:10" -"205","dream-4-predictive-signaling-network-modelling","DREAM 4 - Predictive Signaling Network Modelling","Predict the binding specificity of peptide-antibody interactions.","This challenge explores the extent to which our current knowledge of signaling pathways, collected from a variety of cell types, agrees with cell-type specific high-throughput experimental data. Specifically, we ask the challenge participants to create a cell-type specific model of signal transduction using the measured activity levels of signaling proteins in HepG2 cell lines. The model, which can leverage prior information encoded in a generic signaling pathway provided in the challenge, should be biologically interpretable as a network, and capable of predicting the outcome of new experiments.","","https://www.synapse.org/#!Synapse:syn2825304/wiki/71129","completed","intermediate","1","","2009-03-09","\N","2023-09-12 21:27:14","2023-10-12 17:54:30" -"206","dream-3-signaling-response-prediction","DREAM 3 - Signaling Response Prediction","Predict gene expression levels from promoter sequences in eukaryotes.","Approximately 10,000 intracellular measurements (fluorescence signals proportional to the concentrations of phosphorylated proteins) and extracellular measurements (concentrations of cytokines released in response to cell stimulation) were acquired in human normal hepatocytes and the hepatocellular carcinoma cell line HepG2 cells. 
The datasets consist of measurements of 17 phospho-proteins (at 0 min, 30 min, and 3 hrs) and 20 cytokines (at 0 min, 3 hrs, and 24 hrs) in two cell types (normal and cancer) after perturbations to the pathway induced by the combinatorial treatment of 7 stimuli and 7 selective inhibitors.","","https://www.synapse.org/#!Synapse:syn2825325/wiki/","completed","intermediate","1","https://doi.org/10.1126%2Fscisignal.2002212","2009-03-09","\N","2023-09-12 21:27:20","2023-10-12 17:54:33" -"207","dream-4-peptide-recognition-domain-prd-specificity-prediction","DREAM 4 - Peptide Recognition Domain (PRD) Specificity Prediction","Predict disease phenotypes and infer gene networks from systems genetics data.","Many important protein-protein interactions are mediated by peptide recognition domains (PRD), which bind short linear sequence motifs in other proteins. For example, SH3 domains typically recognize proline-rich motifs, PDZ domains recognize hydrophobic C-terminal tails, and kinases recognize short sequence regions around a phosphorylatable residue (Pawson, 2003). Given the sequence of the domains, the challenge consists of predicting a position weight matrix (PWM) that describes the specificity profile of each of the given domains to their target peptides. Any publicly accessible peptide specificity information available for the domain may be used.","","https://www.synapse.org/#!Synapse:syn2925957/wiki/72976","completed","intermediate","1","","2009-06-01","2009-10-31","2023-09-12 21:27:35","2023-10-12 17:54:35" -"208","dream-5-transcription-factor-dna-motif-recognition-challenge","DREAM 5 - Transcription-Factor, DNA-Motif Recognition Challenge","Challenge to estimate model parameters.","Transcription factors (TFs) control the expression of genes through sequence-specific interactions with genomic DNA. Different TFs bind preferentially to different sequences, with the majority recognizing short (6-12 base), degenerate ‘motifs’. 
Modeling the sequence specificities of TFs is a central problem in understanding the function and evolution of the genome, because many types of genomic analyses involve scanning for potential TF binding sites. Models of TF binding specificity are also important for understanding the function and evolution of the TFs themselves. The challenge consists of predicting the signal intensities for the remaining TFs.","","https://www.synapse.org/#!Synapse:syn2887863/wiki/72185","completed","intermediate","1","https://doi.org/10.1038/nbt.2486","2011-06-01","2011-09-30","2023-09-12 21:27:41","2023-10-12 17:54:36" -"209","dream-5-epitope-antibody-recognition-ear-challenge","DREAM 5 - Epitope-Antibody Recognition (EAR) Challenge","The goal of this challenge is to diagnose Acute Myeloid Leukemia from patien...","Humoral immune responses are mediated through antibodies. About 1010 to 1012 different antigen binding sites called paratopes are generated by genomic recombination. These antibodies are capable to bind to a variety of structures ranging from small molecules to protein complexes, including any posttranslational modification thereof. When studying protein-antibody interactions, two types of epitopes (the region paratopes interact with) are to be distinguished from each other-i) conformational and ii) linear epitopes. All potential linear epitopes of a protein can be represented by short peptides derived from the primary amino acid sequence. These peptides can be synthesized and arrayed on solid supports, e.g. glass slides (see Lorenz et al., 2009 [1]). 
By incubating these peptide arrays with antibody mixtures such as human serum or plasma, peptides can be determined that interact with antibodies in a specific fashion.","","https://www.synapse.org/#!Synapse:syn2820433/wiki/71017","completed","intermediate","1","","2010-06-09","\N","2023-09-12 21:27:44","2023-10-12 17:54:39" -"210","dream-gene-expression-prediction-challenge","DREAM Gene Expression Prediction Challenge","Assess accuracy of mRNA-seq alternative splicing reconstruction.","The level by which genes are transcribed is determined in large part by the DNA sequence upstream to the gene, known as the promoter region. Although widely studied, we are still far from a quantitative and predictive understanding of how transcriptional regulation is encoded in gene promoters. One obstacle in the field is obtaining accurate measurements of transcription derived by different promoters. To address this, an experimental system was designed to measure the transcription derived by different promoters, all of which are inserted into the same genomic location upstream to a reporter gene -a yellow florescence protein gene (YFP). The challenge consists of the prediction of the promoter activity given a promoter sequence and a specific experimental condition. To study a set of promoters that share many elements of the regulatory program, and thus are suitable for computational learning, the data pertains to promoters of most of the ribosomal protein genes (RP) of yeast (S....","","https://www.synapse.org/#!Synapse:syn2820426/wiki/71010","completed","intermediate","1","","2010-07-09","\N","2023-09-12 21:28:00","2023-10-19 23:32:10" -"211","dream-5-systems-genetics-challenge","DREAM 5 - Systems Genetics Challenge","A machine learning contest for gene network inference from single-cell pertu...","The central goal of systems biology is to gain a predictive, system-level understanding of biological networks. 
This can be done, for example, by inferring causal networks from observations on a perturbed biological system. An ideal experimental design for causal inference is randomized, multifactorial perturbation. The recognition that the genetic variation in a segregating population represents randomized, multifactorial perturbations (Jansen and Nap (2001), Jansen (2003)) gave rise to Systems Genetics (SG), where a segregating or genetically randomized population is genotyped for many DNA variants, and profiled for phenotypes of interest (e.g. disease phenotypes), gene expression, and potentially other ‘omics’ variables (protein expression, metabolomics, DNA methylation, etc.; Figure 1. Figure 1 was taken from Jansen and Nap (2001)). In this challenge we explore the use of Systems Genetics data for elucidating causal network models among genes, i.e. Gene Networks (DREAM5 SYSGEN...","","https://www.synapse.org/#!Synapse:syn2820440/wiki/","completed","intermediate","1","","2010-07-09","\N","2023-09-12 21:28:10","2023-10-12 17:54:42" -"212","dream-6-estimation-of-model-parameters-challenge","DREAM 6 - Estimation of Model Parameters Challenge","The challenge related to computational geometry and topology for ICLR 2022.","Given the complete model structures (including expressions for the kinetic rate laws) for three gene regulatory networks, participants are asked to develop and/or apply optimization methods, including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models.","","https://www.synapse.org/#!Synapse:syn2841366/wiki/71372","completed","intermediate","1","","2011-06-01","2011-10-31","2023-09-12 21:28:12","2023-10-12 17:54:45" -"213","dream-6-flowcap2-molecular-classification-of-acute-myeloid-leukemia-challenge","DREAM 6 - FlowCAP2 Molecular Classification of Acute Myeloid Leukemia Challenge","Automating Identification of Cell Populations in Flow Cytometry Data","Flow 
cytometry (FCM) has been widely used by immunologists and cancer biologists for more than 30 years as a biomedical research tool to distinguish different cell types in mixed populations, based on the expression of cellular markers. It has also become a widely used diagnostic tool for clinicians to identify abnormal cell populations associated with disease. In the last decade, advances in instrumentation and reagent technologies have enabled simultaneous single-cell measurement of tens of surface and intracellular markers, as well as tens of signaling molecules, positioning FCM to play an even bigger role in medicine and systems biology [1,2]. However, the rapid expansion of FCM applications has outpaced the functionality of traditional analysis tools used to interpret FCM data such that scientists are faced with the daunting prospect of manually identifying interesting cell populations in 20 dimensional data from a collection of millions of cells. For these reasons a reliable...","","https://www.synapse.org/#!Synapse:syn2887788/wiki/72178","completed","intermediate","1","https://doi.org/10.1038/nmeth.2365","2011-06-01","2011-09-30","2023-09-12 21:28:19","2023-10-12 17:54:47" +"156","cagi7","CAGI7","The seventh round of CAGI","There have been six editions of CAGI experiments, held between 2010 and 2022. The seventh round of CAGI is planned to take place over the Summer of 2024.","","https://genomeinterpretation.org/challenges.html","upcoming","intermediate","1","","\N","\N","2023-08-04 21:47:38","2023-10-14 05:40:32" +"157","casp15","CASP15","Establish the state-of-art in modeling proteins and protein complexes","CASP14 (2020) saw an enormous jump in the accuracy of single protein and domain models such that many are competitive with experiment. That advance is largely the result of the successful application of deep learning methods, particularly by the AlphaFold and, since that CASP, RosettaFold. 
As a consequence, computed protein structures are becoming much more widely used in a broadening range of applications. CASP has responded to this new landscape with a revised set of modeling categories. Some old categories have been dropped (refinement, contact prediction, and aspects of model accuracy estimation) and new ones have been added (RNA structures, protein ligand complexes, protein ensembles, and accuracy estimation for protein complexes). We are also strengthening our interactions with our partners CAPRI and CAMEO. We hope that these changes will maximize the insight that CASP15 provides, particularly in new applications of deep learning.","","https://predictioncenter.org/casp15/index.cgi","completed","intermediate","14","","2022-04-18","\N","2023-08-04 21:52:12","2023-09-28 23:09:59" +"158","synthrad2023","SynthRAD2023","Synthesizing computed tomography for radiotherapy","This challenge aims to provide the first platform offering public data evaluation metrics to compare the latest developments in sCT generation methods. The accepted challenge design approved by MICCAI can be found at https://doi.org/10.5281/zenodo.7746019. A type 2 challenge will be run, where the participant needs to submit their algorithm packaged in a docker both for validation and test.","","https://synthrad2023.grand-challenge.org/","completed","intermediate","5","","2023-04-01","2023-08-22","2023-08-04 21:54:31","2023-10-26 23:20:24" +"159","synthetic-data-for-instrument-segmentation-in-surgery-syn-iss","Synthetic Data for Instrument Segmentation in Surgery (Syn-ISS)","Challenging machine learning in surgical instrument segmentation with synthetic data","A common limitation noted by the surgical data science community is the size of datasets and the resources needed to generate training data at scale for building reliable and high-performing machine learning models. 
Beyond unsupervised and self-supervised approaches another solution within the broader machine learning community has been a growing volume of literature in the use of synthetic data (simulation) for training algorithms that can be applied to real world data. Synthetic data has multiple benefits like free groundtruth at large scale, possibility to collect larger sample of rare events, include anatomical variations, etc. A first step towards proving the validity of using synthetic data for real world applications is to demonstrate the feasibility within the simulation world itself. Our proposed challenge is to train machine learning methods for instrument segmentation using synthetic datasets and test their performance on synthetic datasets. That is, the challenge parti...","","https://www.synapse.org/#!Synapse:syn50908388/wiki/620516","completed","intermediate","1","","2023-07-19","2023-09-07","2023-08-04 23:49:44","2023-10-26 23:20:28" +"160","pitvis","PitVis","Surgical workflow and instrument recognition in endonasal surgery","The pituitary gland, found just off the base of the brain, is commonly known as “the master gland”, performing essential functions required for sustaining human life. Clinically relevant tumours that have grown on the pituitary gland have an estimated prevalence of 1 in 1000 of the population, and if left untreated can be life-limiting. The “gold standard” treatment is endoscopic pituitary surgery, where the tumour is directly removed by entering through a nostril. This surgery is particularly challenging due to the small working space which limits both vision and instrument manoeuvrability and thus can lead to poor surgical technique causing adverse outcomes for the patient. 
Computer-assisted intervention can help overcome these challenges by providing guidance for senior surgeons and operative staff during surgery, and for junior surgeons during training.","","https://www.synapse.org/#!Synapse:syn51232283/wiki/","completed","intermediate","1","","2023-06-29","2023-09-10","2023-08-04 23:58:01","2023-10-26 23:20:30" +"161","mvseg2023","MVSEG2023","Automatically segment mitral valve leaflets from single frame 3D trans-esophageal echocardiography","Mitral valve (MV) disease is a common pathologic problem occurring in approximately 2 % of the general population but climbing to 10 % in those over the age of 75. The preferred intervention for mitral regurgitation is valve repair, due to superior patient outcomes compared to those following valve replacement. Mitral valve interventions are technically challenging due to the functional and anatomical complexity of mitral pathologies. Repair must be tailored to the patient-specific anatomy and pathology, which requires considerable expert training and experience. Automatic segmentation of the mitral valve leaflets from 3D transesophageal echocardiography (TEE) may play an important role in treatment planning, as well as physical and computational modelling of patient-specific valve pathologies and potential repair approaches. This may have important implications in the drive towards personalized care and has the potential to impact clinical outcomes for those undergoing mitral val...","","https://www.synapse.org/#!Synapse:syn51186045/wiki/621356","completed","intermediate","1","","2023-05-29","2023-08-07","2023-08-05 0-04-36","2023-09-28 23:12:19" +"162","crossmoda23","crossMoDA23","Third edition of the first medical imaging benchmark for unsupervised domain adaptation (DA) approaches","Domain Adaptation (DA) has recently raised strong interest in the medical imaging community. 
By encouraging algorithms to be robust to unseen situations or different input data domains, Domain Adaptation improves the applicability of machine learning approaches to various clinical settings. While a large variety of DA techniques has been proposed, most of these techniques have been validated either on private datasets or on small publicly available datasets. Moreover, these datasets mostly address single-class problems. To tackle these limitations, the crossMoDA challenge introduced the first large and multi-class dataset for unsupervised cross-modality Domain Adaptation. From an application perspective, crossMoDA focuses on MRI segmentation for Vestibular Schwannoma. Compared to the previous crossMoDA instance, which made use of multi-institutional data acquired in controlled conditions for radiosurgery planning and focused on a 2 class segmentation task (tumour and cochlea), the...","","https://www.synapse.org/#!Synapse:syn51236108/wiki/621615","completed","intermediate","1","","2023-04-15","2023-07-10","2023-08-05 0-13-23","2023-10-12 18:10:18" +"163","icr-identify-age-related-conditions","ICR - Identifying Age-Related Conditions","Use Machine Learning to detect conditions with measurements of anonymous characteristics of a subject","The goal of this competition is to predict if a person has any of three medical conditions. You are being asked to predict if the person has one or more of any of the three medical conditions (Class 1), or none of the three medical conditions (Class 0). You will create a model trained on measurements of health characteristics. To determine if someone has these medical conditions requires a long and intrusive process to collect information from patients. 
With predictive models, we can shorten this process and keep patient details private by collecting key characteristics relative to the conditions, then encoding these characteristics.","","https://www.kaggle.com/competitions/icr-identify-age-related-conditions","completed","intermediate","8","","2023-05-11","2023-08-10","2023-08-05 0-32-01","2023-10-12 18:15:08" +"164","cafa-5-protein-function-prediction","CAFA 5: Protein Function Prediction","Predict the biological function of a protein","The goal of this competition is to predict the function of a set of proteins. You will develop a model trained on the amino-acid sequences of the proteins and on other data. Your work will help researchers better understand the function of proteins, which is important for discovering how cells, tissues, and organs work. This may also aid in the development of new drugs and therapies for various diseases.","","https://www.kaggle.com/competitions/cafa-5-protein-function-prediction","completed","intermediate","8","","2023-04-18","2023-08-21","2023-08-05 5-18-40","2023-10-19 00:13:14" +"165","rsna-2023-abdominal-trauma-detection","RSNA 2023 Abdominal Trauma Detection","Detect and classify traumatic abdominal injuries","Traumatic injury is the most common cause of death in the first four decades of life and a major public health problem around the world. There are estimated to be more than 5 million annual deaths worldwide from traumatic injury. Prompt and accurate diagnosis of traumatic injuries is crucial for initiating appropriate and timely interventions, which can significantly improve patient outcomes and survival rates. Computed tomography (CT) has become an indispensable tool in evaluating patients with suspected abdominal injuries due to its ability to provide detailed cross-sectional images of the abdomen. 
Interpreting CT scans for abdominal trauma, however, can be a complex and time-consuming task, especially when multiple injuries or areas of subtle active bleeding are present. This challenge seeks to harness the power of artificial intelligence and machine learning to assist medical professionals in rapidly and precisely detecting injuries and grading their severity. The development...","","https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection","completed","intermediate","8","","2023-07-26","2023-10-13","2023-08-05 5-24-09","2023-09-28 23:14:12" +"166","hubmap-hacking-the-human-vasculature","HuBMAP: Hacking the Human Vasculature","Segment instances of microvascular structures from healthy human kidney tissue images","The goal of this competition is to segment instances of microvascular structures, including capillaries, arterioles, and venules. You'll create a model trained on 2D PAS-stained histology images from healthy human kidney tissue slides. Your help in automating the segmentation of microvasculature structures will improve researchers' understanding of how the blood vessels are arranged in human tissues.","","https://www.kaggle.com/competitions/hubmap-hacking-the-human-vasculature","completed","intermediate","8","","2023-05-22","2023-07-31","2023-08-05 5-31-12","2023-10-12 18:15:00" +"167","amp-parkinsons-disease-progression-prediction","AMP(R)-Parkinson's Disease Progression Prediction","Use data from Parkinson's Disease patients to predict clinical and molecular progression of the disease","The goal of this competition is to predict MDS-UPDRS scores, which measure progression in patients with Parkinson's disease. The Movement Disorder Society-Sponsored Revision of the Unified Parkinson's Disease Rating Scale (MDS-UPDRS) is a comprehensive assessment of both motor and non-motor symptoms associated with Parkinson's. 
You will develop a model trained on data of protein and peptide levels over time in subjects with Parkinson’s disease versus normal age-matched control subjects. Your work could help provide important breakthrough information about which molecules change as Parkinson’s disease progresses.","","https://www.kaggle.com/competitions/amp-parkinsons-disease-progression-prediction","completed","intermediate","8","","2023-02-16","2023-05-18","2023-08-05 5-37-12","2023-10-10 19:52:34" +"168","open-problems-multimodal","Open Problems - Multimodal Single-Cell Integration","Predict how DNA, RNA & protein measurements co-vary in single cells","The goal of this competition is to predict how DNA, RNA, and protein measurements co-vary in single cells as bone marrow stem cells develop into more mature blood cells. You will develop a model trained on a subset of 300,000-cell time course dataset of CD34+ hematopoietic stem and progenitor cells (HSPC) from four human donors at five time points generated for this competition by Cellarity, a cell-centric drug creation company. In the test set, taken from an unseen later time point in the dataset, competitors will be provided with one modality and be tasked with predicting a paired modality measured in the same cell. The added challenge of this competition is that the test data will be from a later time point than any time point in the training data. Your work will help accelerate innovation in methods of mapping genetic information across layers of cellular state. 
If we can predict one modality from another, we may expand our understanding of the rules governing these complex re...","","https://www.kaggle.com/competitions/open-problems-multimodal","completed","intermediate","8","","2022-08-15","2022-11-15","2023-08-05 5-43-25","2023-10-10 19:52:41" +"169","multi-atlas-labeling-beyond-the-cranial-vault","Multi-Atlas Labeling Beyond the Cranial Vault","Innovative multi-atlas labeling for soft tissue segmentation on clinical CT","Multi-atlas labeling has proven to be an effective paradigm for creating segmentation algorithms from training data. These approaches have been extraordinarily successful for brain and cranial structures (e.g., our prior MICCAI workshops-MLSF’11, MAL’12, SATA’13). After the original challenges closed, the data continue to drive scientific innovation; 144 groups have registered for the 2012 challenge (brain only) and 115 groups for the 2013 challenge (brain/heart/canine leg). However, innovation in application outside of the head and to soft tissues has been more limited. This workshop will provide a snapshot of the current progress in the field through extended discussions and provide researchers an opportunity to characterize their methods on a newly created and released standardized dataset of abdominal anatomy on clinically acquired CT. The datasets will be freely available both during and after the challenge. We have two separate new challenges-abdomen and cervix on routinely ...","","https://www.synapse.org/#!Synapse:syn3193805/wiki/89480","active","intermediate","1","","2015-04-15","\N","2023-08-07 20:21:22","2023-10-10 19:52:39" +"170","hubmap-organ-segmentation","HuBMAP + HPA: Hacking the Human Body","Segment multi-organ functional tissue units","In this competition, you’ll identify and segment functional tissue units (FTUs) across five human organs. You'll build your model using a dataset of tissue section images, with the best submissions segmenting FTUs as accurately as possible. 
If successful, you'll help accelerate the world’s understanding of the relationships between cell and tissue organization. With a better idea of the relationship of cells, researchers will have more insight into the function of cells that impact human health. Further, the Human Reference Atlas constructed by HuBMAP will be freely available for use by researchers and pharmaceutical companies alike, potentially improving and prolonging human life.","","https://www.kaggle.com/competitions/hubmap-organ-segmentation","completed","intermediate","8","","2022-06-22","2022-09-22","2023-08-08 16:30:22","2023-11-02 18:44:27" +"171","hubmap-kidney-segmentation","HuBMAP: Hacking the Kidney","Identify glomeruli in human kidney tissue images","This competition, “Hacking the Kidney,” starts by mapping the human kidney at single cell resolution. Your challenge is to detect functional tissue units (FTUs) across different tissue preparation pipelines. An FTU is defined as a “three-dimensional block of cells centered around a capillary, such that each cell in this block is within diffusion distance from any other cell in the same block” ([de Bono, 2013](https://www.ncbi.nlm.nih.gov/pubmed/24103658)). The goal of this competition is the implementation of a successful and robust glomeruli FTU detector. You will also have the opportunity to present your findings to a panel of judges for additional consideration. Successful submissions will construct the tools, resources, and cell atlases needed to determine how the relationships between cells can affect the health of an individual. 
Advancements in HuBMAP will accelerate the world’s understanding of the relationships between cell and tissue organization and function and human health.","","https://www.kaggle.com/competitions/hubmap-kidney-segmentation","completed","intermediate","8","","2020-11-16","2021-05-10","2023-08-08 17:31:46","2023-10-12 18:14:16" +"172","ventilator-pressure-prediction","Google Brain: Ventilator Pressure Prediction","Simulate a ventilator connected to a sedated patient's lung","In this competition, you’ll simulate a ventilator connected to a sedated patient's lung. The best submissions will take lung attributes compliance and resistance into account. If successful, you'll help overcome the cost barrier of developing new methods for controlling mechanical ventilators. This will pave the way for algorithms that adapt to patients and reduce the burden on clinicians during these novel times and beyond. As a result, ventilator treatments may become more widely available to help patients breathe.","","https://www.kaggle.com/competitions/ventilator-pressure-prediction","completed","intermediate","8","","2021-09-22","2021-11-03","2023-08-08 17:53:33","2023-11-02 18:44:22" +"173","stanford-covid-vaccine","OpenVaccine - COVID-19 mRNA Vaccine Degradation Prediction","Urgent need to bring the COVID-19 vaccine to mass production","In this competition, we are looking to leverage the data science expertise of the Kaggle community to develop models and design rules for RNA degradation. Your model will predict likely degradation rates at each base of an RNA molecule, trained on a subset of an Eterna dataset comprising over 3000 RNA molecules (which span a panoply of sequences and structures) and their degradation rates at each position. We will then score your models on a second generation of RNA sequences that have just been devised by Eterna players for COVID-19 mRNA vaccines. 
These final test sequences are currently being synthesized and experimentally characterized at Stanford University in parallel to your modeling efforts--Nature will score your models!","","https://www.kaggle.com/competitions/stanford-covid-vaccine","completed","intermediate","8","","2020-09-10","2020-10-06","2023-08-08 18:06:17","2023-10-12 18:14:27" +"174","openvaccine","OpenVaccine","To develop mRNA vaccines stable enough to be deployed to everyone in the world, and not just a privileged few","mRNA vaccines are a relatively new technology that have come into the limelight with the onset of COVID-19. They were the first COVID-19 vaccines to start clinical trials (initially formulated in a matter of days) and the first to be approved and distributed. mRNA vaccines have the potential to transform immunization, being significantly faster to formulate and produce, cheaper, and more effective-including against mutant strains. However, there is one key bottleneck to their widespread viability and our ability to immunize the entire world-poor refrigerator stability in prefilled syringes. The OpenVaccine challenge aims to allow a worldwide community of game players to create an enhanced vaccine to be injected into millions of people. The challenge-design an mRNA that codes for the same amino acid sequence of the spike protein, but is 2x-10x+ more stable. Through a number of academic partnerships and the launch of a Kaggle machine learning challenge to create best-in-class algori...","","https://eternagame.org/challenges/10845741","completed","intermediate","13","https://doi.org/10.1038/s41467-022-28776-w","\N","2021-12-12","2023-08-08 18:22:49","2023-09-28 23:17:02" +"175","opentb","OpenTB","What if we could use RNA to detect a gene sequence found to be present only in people with active TB?","OpenTB used a recently reported gene signature for active tuberculosis based on three RNAs in the blood. 
This signature could form the basis for a fast, color-based test for TB, similar to an over-the-counter pregnancy test. What was needed was a sensor that could detect the concentrations of three RNAs, carry out the needed calculation, and report the result by binding another molecule. Over four rounds, players designed RNA sensors that can do the math on these 3 genes. Through experimental feedback, they honed their skills and techniques, which resulted in the creation of multiple designs that have been shown to be successful. These findings are being prepared for publication, and future work will be done to develop diagnostic devices integrating these designs.","","https://eternagame.org/challenges/10845742","completed","intermediate","13","","2016-05-04","2018-04-15","2023-08-08 18:43:09","2023-09-28 23:17:09" +"176","opencrispr","OpenCRISPR","A project to discover design patterns for guide RNAs to make gene editing more precisely controllable.","CRISPR gene editing is an RNA-based method that can target essentially any gene in a living organism for genetic changes. Since its first demonstration, CRISPR has been revolutionizing biology and promises to change how we tackle numerous human diseases from malaria to cancer. Stanford's Center for Personal Dynamic Regulomes and UC Berkeley's Innovative Genomics Institute have challenged Eterna players to solve a remaining hurdle in making this technology safe for use. Scientists want the power to turn on and off CRISPR on demand with small molecules. This is almost a perfect match to the small-molecule switches that the Eterna community has worked on. In fact, the MS2 RNA hairpin often used in Eterna is routinely used to recruit new functionality to CRISPR complexes through other molecules tethered to the MS2 protein. The puzzles began with OpenCRISPR Controls, looking for solutions to lock in or lock out the MS2 RNA hairpin within a special loop in the CRISPR RNA. 
We hope the res...","","https://eternagame.org/challenges/10845743","completed","intermediate","13","https://doi.org/10.1021/acssynbio.9b00142","2017-08-26","\N","2023-08-08 18:43:14","2023-10-10 19:57:07" +"177","openknot","OpenKnot","Build a diverse library of RNAs that form pseudoknot structures when tested experimentally","RNA pseudoknots have significant biological importance in various processes. They participate in gene regulation by influencing translation initiation or termination in mRNA molecules. Pseudoknots also play a role in programmed ribosomal frameshifting, leading to the production of different protein products from a single mRNA. RNA viruses, including SARS-CoV-2 and Dengue virus, utilize pseudoknots to regulate their replication and control the synthesis of viral proteins. Additionally, certain RNA molecules with pseudoknot structures exhibit enzymatic activity, acting as ribozymes and catalyzing biochemical reactions. These functions highlight the crucial role of RNA pseudoknots in gene expression, proteomic diversity, viral replication, and enzymatic processes. Several unanswered scientific questions surround RNA pseudoknots. One key area of inquiry is understanding the folding pathways of pseudoknots and how they form from linear RNA sequences. Elucidating the structural dynamics...","","https://eternagame.org/challenges/11843006","active","intermediate","13","","2022-06-17","\N","2023-08-08 18:43:22","2023-10-10 19:52:53" +"178","openaso","OpenASO","Design principles for RNA-based therapeutics that target and neutralize the harmful effects of toxic RNAs","The DNA genome is the blueprint for building and operating cells, but this information must be decoded into RNA molecules to be useful. Transcription is the process of decoding DNA genomic information into RNA, resulting in RNA transcripts. Genes are specific sequences of DNA that contain information to produce a specific RNA transcript. 
The fate of most mRNA molecules in the cell is to be translated by ribosomes into protein molecules. However, mRNA splicing is a crucial step that occurs between the formation of an RNA transcript and protein translation. This step is essential because genes contain non-protein coding introns and protein-coding exons. Splicing removes introns and joins exons to produce a mature mRNA molecule that can be decoded into the correct protein molecule. When the splicing process is corrupted due to genetic mutations, the resulting RNA can become toxic, leading to the synthesis of non-functional proteins or no protein at all, causing various human diseases...","","https://eternagame.org/challenges/11546273","active","intermediate","13","","2023-02-20","\N","2023-08-08 18:43:25","2023-10-10 19:52:57" +"179","openribosome","OpenRibosome","Learn and change the ribosome's RNAs so that the ribosome can make new materials and medicines","Our modern world has many challenges-challenges like climate change, increasing waste production, and human health. Imagine-we could replace petrochemistry with biology, single-use plastics with selectively degradable polymers, broad chemotherapeutics with targeted medicines for fighting specific cancer cells, and complex health equipment with point-of-care diagnostics. These innovations and many more can empower us to confront the challenges affecting humanity, our world, and beyond. But how do we actually create these smart materials and medicines? Is it possible to do so by repurposing one of Nature's molecular machines? We think we can. The answer? Customized ribosomes. In Nature, ribosomes are the catalysts for protein assembly. And proteins are more or less similar, chemically, to the smart materials and medicines we want to synthesize. 
If we could modify ribosomes to build polymers with diverse components-beyond the canonical amino acids us","","https://eternagame.org/challenges/11043833","active","intermediate","13","https://doi.org/10.1038/s41467-023-35827-3","2019-01-31","\N","2023-08-08 18:43:27","2023-10-10 19:53:01" +"180","lish-moa","Mechanisms of Action (MoA) Prediction","Can you improve the algorithm that classifies drugs based on their biological activity?","Can you improve the algorithm that classifies drugs based on their biological activity?","","https://www.kaggle.com/competitions/lish-moa","completed","intermediate","8","","2020-09-03","2020-11-30","2023-08-08 19:09:31","2023-09-28 23:18:04" +"181","recursion-cellular-image-classification","Recursion Cellular Image Classification","CellSignal-Disentangling biological signal from experimental noise in cellular images","This competition will have you disentangling experimental noise from real biological signals. Your entry will classify images of cells under one of 1,108 different genetic perturbations. You can help eliminate the noise introduced by technical execution and environmental variation between experiments. If successful, you could dramatically improve the industry’s ability to model cellular images according to their relevant biology. In turn, applying AI could greatly decrease the cost of treatments, and ensure these treatments get to patients faster.","","https://www.kaggle.com/competitions/recursion-cellular-image-classification","completed","intermediate","8","","2019-06-27","2019-09-26","2023-08-08 19:38:42","2023-10-10 19:53:05" +"182","tlvmc-parkinsons-freezing-gait-prediction","Parkinson's Freezing of Gait Prediction","Event detection from wearable sensor data","The goal of this competition is to detect freezing of gait (FOG), a debilitating symptom that afflicts many people with Parkinson’s disease. You will develop a machine learning model trained on data collected from a wearable 3D lower back sensor. 
Your work will help researchers better understand when and why FOG episodes occur. This will improve the ability of medical professionals to optimally evaluate, monitor, and ultimately, prevent FOG events.","","https://www.kaggle.com/competitions/tlvmc-parkinsons-freezing-gait-prediction","completed","intermediate","8","","2023-03-09","2023-06-08","2023-08-08 19:47:54","2023-10-10 19:53:08" +"183","chaimeleon","CHAIMELEON Open Challenges","Train and refine AI models to answer clinical questions about five types of cancer","The CHAIMELEON Open Challenges is a competition designed to train and refine AI models to answer clinical questions about five types of cancer-prostate, lung, breast, colon, and rectal. Participants are challenged to collaborate and develop innovative AI-powered solutions that can significantly impact cancer diagnosis, management, and treatment. They will be evaluated considering a balance between the performance of their AI algorithms to predict different clinical endpoints such as disease staging, treatment response or progression free survival and their trustworthiness. The challenges are open to the whole scientific and tech community interested in AI. They are a unique opportunity to showcase how AI can be used to advance medical research and improve patient outcomes within the CHAIMELEON project.","","https://chaimeleon.grand-challenge.org/","active","intermediate","5","","2023-11-03","2023-12-31","2023-08-09 17:13:09","2023-11-02 15:33:27" +"184","topcow23","Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA","Extract the CoW angio-architecture from 3D angiographic imaging by segmentation of the vessel components","The aim of the challenge is to extract the CoW angio-architecture from 3D angiographic imaging by segmentation of the vessel components. There are two sub-tasks-binary segmentation of CoW vessels, and multi-class CoW anatomical segmentation. 
We release a new dataset of joint-modalities, CTA and MRA of the same patient cohort, both with annotations of the anatomy of CoW. Our challenge has two tracks for the same segmentation task, namely CTA track and MRA track. We made use of the clinical information from both modalities during our annotation. And participants can pick whichever modality they want, both CTA and MRA, and choose to tackle the task for either modality.","","https://topcow23.grand-challenge.org/","completed","intermediate","5","","2023-08-20","2023-09-25","2023-08-09 17:16:22","2023-09-28 23:24:41" +"185","circle-of-willis-intracranial-artery-classification-and-quantification-challenge-2023","Circle of Willis Intracranial Artery Classification and Quantification Challenge 2023","Classify the circle of Willis (CoW) configuration and quantification","The purpose of this challenge is to compare automatic methods for classification of the circle of Willis (CoW) configuration and quantification of the CoW major artery diameters and bifurcation angles.","","https://crown.isi.uu.nl/","completed","intermediate","14","","2023-05-01","2023-08-15","2023-08-09 22:13:24","2023-09-28 23:24:54" +"186","making-sense-of-electronic-health-record-ehr-race-and-ethnicity-data","Making Sense of Electronic Health Record (EHR) Race and Ethnicity Data","FDA calls on stakeholders to make sense of electronic health record race and ethnicity data","The urgency of the coronavirus disease 2019 (COVID-19) pandemic has heightened interest in the use of real-world data (RWD) to obtain timely information about patients and populations and has focused attention on EHRs. The pandemic has also heightened awareness of long-standing racial and ethnic health disparities along a continuum from underlying social determinants of health, exposure to risk, access to insurance and care, quality of care, and responses to treatments. 
This highlighted the potential that EHRs can be used to describe and contribute to our understanding of racial and ethnic health disparities and their solutions. The OMB Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity provides minimum standards for maintaining, collecting, and presenting data on race and ethnicity for all Federal reporting purposes, and defines the two separate constructs of race and ethnicity.","","https://precision.fda.gov/challenges/30","completed","intermediate","6","","2023-05-31","2023-06-23","2023-08-10 18:28:06","2023-10-10 19:53:12" +"187","the-veterans-cardiac-health-and-ai-model-predictions-v-champs","The Veterans Cardiac Health and AI Model Predictions (V-CHAMPS)","Develop AI/ML models to predict cardiovascular health related outcomes in Veterans","To better understand the risk and protective factors in the Veteran population, the VHA IE and its collaborating partners are calling upon the public to develop AI/ML models to predict cardiovascular health outcomes, including readmission and mortality, using synthetically generated Veteran health records. The Challenge consists of two Phases-Phase 1 is focused on synthetic data. In this Phase of the Challenge, AI/ML models will be developed by Challenge participants and trained and tested on the synthetic data sets provided to them, with a view towards predicting outcome variables for Veterans who have been diagnosed with chronic heart failure (please note that in Phase 1, the data is synthetic Veteran health records). Phase 2 will focus on validating and further exploring the limits of the AI/ML models. During this Phase, high-performing AI/ML models from Phase 1 will be brought into the VA system and validated on the real-world Veterans health data within the VHA. 
These models...","","https://precision.fda.gov/challenges/31","completed","intermediate","6","","2023-05-25","2023-08-02","2023-08-10 21:41:10","2023-09-28 23:25:45" +"188","predicting-high-risk-breast-cancer-phase-1","Predicting High Risk Breast Cancer - Phase 1","Predicting High Risk Breast Cancer-a Nightingale OS & AHLI data challenge","Every year, 40 million women get a mammogram; some go on to have an invasive biopsy to better examine a concerning area. Underneath these routine tests lies a deep—and disturbing—mystery. Since the 1990s, we have found far more ‘cancers’, which has in turn prompted vastly more surgical procedures and chemotherapy. But death rates from metastatic breast cancer have hardly changed. When a pathologist looks at a biopsy slide, she is looking for known signs of cancer-tubules, cells with atypical looking nuclei, evidence of rapid cell division. These features, first identified in 1928, still underlie critical decisions today-which women must receive urgent treatment with surgery and chemotherapy? And which can be prescribed “watchful waiting”, sparing them invasive procedures for cancers that would not harm them? There is already evidence that algorithms can predict which cancers will metastasize and harm patients on the basis of the biopsy image. Fascinatingly, these algorithms also h...","","https://app.nightingalescience.org/contests/3jmp2y128nxd","completed","intermediate","15","","2022-06-01","2023-01-12","2023-08-22 17:07:00","2023-10-12 17:55:10" +"189","predicting-high-risk-breast-cancer-phase-2","Predicting High Risk Breast Cancer - Phase 2","Predicting High Risk Breast Cancer-a Nightingale OS & AHLI data challenge","Every year, 40 million women get a mammogram; some go on to have an invasive biopsy to better examine a concerning area. Underneath these routine tests lies a deep—and disturbing—mystery. 
Since the 1990s, we have found far more ‘cancers’, which has in turn prompted vastly more surgical procedures and chemotherapy. But death rates from metastatic breast cancer have hardly changed. When a pathologist looks at a biopsy slide, she is looking for known signs of cancer-tubules, cells with atypical looking nuclei, evidence of rapid cell division. These features, first identified in 1928, still underlie critical decisions today-which women must receive urgent treatment with surgery and chemotherapy? And which can be prescribed “watchful waiting”, sparing them invasive procedures for cancers that would not harm them? There is already evidence that algorithms can predict which cancers will metastasize and harm patients on the basis of the biopsy image. Fascinatingly, these algorithms also...","","https://app.nightingalescience.org/contests/vd8g98zv9w0p","completed","intermediate","15","","2023-02-03","2023-05-13","2023-08-22 17:07:01","2023-10-12 17:55:08" +"190","dream-2-in-silico-network-inference","DREAM 2 - In Silico Network Inference","Predict the connectivity and properties of in-silico networks","Three in-silico networks were created and endowed with a dynamics that simulate biological interactions. The challenge consists of predicting the connectivity and some of the properties of one or more of these three networks.","","https://www.synapse.org/#!Synapse:syn2825394/wiki/71150","completed","intermediate","1","","2007-03-25","\N","2023-08-24 18:54:05","2023-10-12 17:55:03" +"191","dream-3-in-silico-network-challenge","DREAM 3 - In Silico Network Challenge","The goal of the in silico challenges is the reverse engineering of gene networks from biological data","The goal of the in silico challenges is the reverse engineering of gene networks from steady state and time series data. 
Participants are challenged to predict the directed unsigned network topology from the given in silico generated gene expression datasets.","","https://www.synapse.org/#!Synapse:syn2853594/wiki/71567","completed","intermediate","1","https://doi.org/10.1089/cmb.2008.09TT","2008-06-09","\N","2023-08-25 16:43:41","2023-10-12 17:55:02" +"192","dream-4-in-silico-network-challenge","DREAM 4 - In Silico Network Challenge","The goal of the in silico network challenge is to reverse engineer gene regulatory networks","The goal of the in silico network challenge is to reverse engineer gene regulation networks from simulated steady-state and time-series data. Participants are challenged to infer the network structure from the given in silico gene expression datasets. Optionally, participants may also predict the response of the networks to a set of novel perturbations that were not included in the provided datasets.","","https://www.synapse.org/#!Synapse:syn3049712/wiki/74628","completed","intermediate","1","https://doi.org/10.1073/pnas.0913357107","2009-06-09","\N","2023-08-25 16:43:42","2023-10-12 17:55:00" +"193","dream-5-network-inference-challenge","DREAM 5 - Network Inference Challenge","The goal of this Network Inference Challenge is to reverse engineer gene regulatory networks","The goal of this Network Inference Challenge is to reverse engineer gene regulatory networks from gene expression datasets. Participants are given four microarray compendia and are challenged to infer the structure of the underlying transcriptional regulatory networks. Three of the four compendia were obtained from microorganisms, some of which are pathogens of clinical relevance. The fourth compendium is based on an in-silico (i.e., simulated) network. Each compendium consists of hundreds of microarray experiments, which include a wide range of genetic, drug, and environmental perturbations (or in the in-silico network case, simulations thereof). 
Network predictions will be evaluated on a subset of known interactions for each organism, or on the known network for the in-silico case.","","https://www.synapse.org/#!Synapse:syn2787209/wiki/70349","completed","intermediate","1","https://doi.org/10.1038/nmeth.2016","2010-06-09","2010-10-31","2023-08-25 16:43:43","2023-10-12 17:54:57" +"194","nlp-sandbox-date-annotation","NLP Sandbox Date Annotation","Identify dates in clinical notes","An NLP Sandbox Date Annotator takes as input a clinical note and outputs a list of predicted date annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-08-25 16:45:22","2023-09-28 23:59:02" +"195","nlp-sandbox-person-name-annotation","NLP Sandbox Person Name Annotation","Identify person names in clinical notes","An NLP Sandbox Person Name Annotator takes as input a clinical note and outputs a list of predicted person name annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:20","2023-09-28 23:59:20" +"196","nlp-sandbox-location-annotation","NLP Sandbox Location Annotation","Identify location information in clinical notes.","An NLP Sandbox Location Annotator takes as input a clinical note and outputs a list of predicted location annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:21","2023-09-28 23:59:21" +"197","nlp-sandbox-contact-annotation","NLP Sandbox Contact Annotation","Identify contact information in clinical notes.","An NLP Sandbox contact annotator takes as input a clinical note and outputs a list of predicted contact annotations found in the clinical 
note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:22","2023-09-28 23:59:21" +"198","nlp-sandbox-id-annotation","NLP Sandbox ID Annotation","Identify identifiers in clinical notes.","An NLP Sandbox ID annotator takes as input a clinical note and outputs a list of predicted ID annotations found in the clinical note.","","https://www.synapse.org/#!Synapse:syn22277123/wiki/609134","completed","intermediate","1","https://doi.org/10.7303/syn22277123","2021-06-04","2023-09-01","2023-09-08 16:44:22","2023-09-28 23:59:22" +"199","dream-2-bcl6-transcriptomic-target-prediction","DREAM 2 - BCL6 Transcriptomic Target Prediction","Predict BCL6 transcriptomic targets from biological data.","A number of potential transcriptional targets of BCL6, a gene that encodes for a transcription factor active in B cells, have been identified with ChIP-on-chip data and functionally validated by perturbing the BCL6 pathway with CD40 and anti-IgM, and by over-expressing exogenous BCL6 in Ramos cell. We subselected a number of targets found in this way (the gold standard positive set), and added a number decoys (genes that have no evidence of being BCL6 targets, named the gold standard negative set), compiling a list of 200 genes in total. 
Given this list of 200 genes, the challenge consists of identifying which ones are the true targets and which ones are the decoys, using an independent panel of gene topic_3170.","","https://www.synapse.org/#!Synapse:syn3034857/wiki/","completed","intermediate","1","https://doi.org/10.1073/pnas.0437996100","2007-04-19","\N","2023-09-12 21:26:22","2023-10-12 17:53:55" +"200","dream-2-protein-protein-interaction-network-inference","DREAM 2 - Protein-Protein Interaction Network Inference","Predict a protein-protein interaction network of 47 proteins.","For many pairs of bait and prey genes, yeast protein-protein interactions were tested in an unbiased fashion using a high saturation, high-stringency variant of the yeast two-hybrid (Y2H) method. A high confidence subset of gene pairs that were found to interact in at least three repetitions of the experiment but that hadn’t been reported in the literature was extracted. There were 47 yeast genes involved in these pairs. Including self interactions, there are a total of 47*48/2 possible pairs of genes that can be formed with these 47 genes. As mentioned above some of these gene pairs were seen to consistently interact in at least three repetitions of the Y2H experiments-these gene pairs form the gold standard positive set. A second set among these gene pairs were seen never to interact in repeated experiments and were not reported as interacting in the literature; we call this the gold standard negative set. 
Finally in a third set of gene pairs, which we shall call the undecided s...","","https://www.synapse.org/#!Synapse:syn2825374/wiki/","completed","intermediate","1","https://doi.org/10.1126/science.1158684","2007-05-24","\N","2023-09-12 21:26:28","2023-10-12 17:54:00" +"201","dream-2-genome-scale-network-inference","DREAM 2 - Genome-Scale Network Inference","Reconstruct genome-scale networks from microarray data.","A panel of single-channel microarrays was collected for a particular microorganism, including some already published and some in-print data. The data was appropriately normalized (to the logarithmic scale). The challenge consists of reconstructing a genome-scale transcriptional network for this organism. The accuracy of network inference will be judged using chromatin precipitation and otherwise experimentally verified Transcription Factor (TF)-target interactions.","","https://www.synapse.org/#!Synapse:syn3034894/wiki/74418","completed","intermediate","1","https://doi.org/10.1371/journal.pbio.0050008","2007-06-05","2007-10-31","2023-09-12 21:26:34","2023-10-12 17:54:03" +"202","dream-2-synthetic-five-gene-network-inference","DREAM 2 - Synthetic Five-Gene Network Inference","Inferring five-gene networks from synthetic data.","A synthetic-biology network consisting of 5 interacting genes was created and transfected to an in-vivo model organism. 
The challenge consists of predicting the connectivity of the five-gene network from in-vivo measurements.","","https://www.synapse.org/#!Synapse:syn3034869/wiki/74411","completed","intermediate","1","https://doi.org/10.1016/j.cell.2009.01.055","2007-06-20","2007-10-31","2023-09-12 21:26:56","2023-10-12 17:54:05" +"203","dream-3-signaling-cascade-identification","DREAM 3 - Signaling Cascade Identification","Inferring signaling cascade dynamics from flow cytometry data.","The concentration of four intracellular proteins or phospho-proteins (X1, X2, X3 and X4) participating in a signaling cascade were measured in about 104 cells by antibody staining and flow cytometry. The idea of this challenge is to explore what key aspects of the dynamics and topology of interactions of a signaling cascade can be inferred from incomplete flow cytometry data.","","https://www.synapse.org/#!Synapse:syn3033068/wiki/74362","completed","intermediate","1","","2008-06-01","2008-10-31","2023-09-12 21:27:04","2023-10-12 17:54:08" +"204","dream-3-gene-expression-prediction","DREAM 3 - Gene Expression Prediction","Predicting gene expression from gene datasets.","Gene expression time course data is provided for four different strains of yeast (S. Cerevisiae), after perturbation of the cells. The challenge is to predict the rank order of induction/repression of a small subset of genes (the prediction targets) in one of the four strains, given complete data for three of the strains, and data for all genes except the prediction targets in the other strain. 
You are also allowed to use any information that is in the public domain and are expected to be forthcoming about what information was used.","","https://www.synapse.org/#!Synapse:syn3033083/wiki/74369","completed","intermediate","1","","2008-06-01","2008-10-31","2023-09-12 21:27:12","2023-10-12 17:54:10" +"205","dream-4-predictive-signaling-network-modelling","DREAM 4 - Predictive Signaling Network Modelling","Cell-type specific high-throughput experimental data.","This challenge explores the extent to which our current knowledge of signaling pathways, collected from a variety of cell types, agrees with cell-type specific high-throughput experimental data. Specifically, we ask the challenge participants to create a cell-type specific model of signal transduction using the measured activity levels of signaling proteins in HepG2 cell lines. The model, which can leverage prior information encoded in a generic signaling pathway provided in the challenge, should be biologically interpretable as a network, and capable of predicting the outcome of new experiments.","","https://www.synapse.org/#!Synapse:syn2825304/wiki/71129","completed","intermediate","1","","2009-03-09","\N","2023-09-12 21:27:14","2023-10-12 17:54:30" +"206","dream-3-signaling-response-prediction","DREAM 3 - Signaling Response Prediction","Predict missing protein concentrations from a large corpus of measurements.","Approximately 10,000 intracellular measurements (fluorescence signals proportional to the concentrations of phosphorylated proteins) and extracellular measurements (concentrations of cytokines released in response to cell stimulation) were acquired in human normal hepatocytes and the hepatocellular carcinoma cell line HepG2 cells. 
The datasets consist of measurements of 17 phospho-proteins (at 0 min, 30 min, and 3 hrs) and 20 cytokines (at 0 min, 3 hrs, and 24 hrs) in two cell types (normal and cancer) after perturbations to the pathway induced by the combinatorial treatment of 7 stimuli and 7 selective inhibitors.","","https://www.synapse.org/#!Synapse:syn2825325/wiki/","completed","intermediate","1","https://doi.org/10.1126%2Fscisignal.2002212","2009-03-09","\N","2023-09-12 21:27:20","2023-10-12 17:54:33" +"207","dream-4-peptide-recognition-domain-prd-specificity-prediction","DREAM 4 - Peptide Recognition Domain (PRD) Specificity Prediction","Predict binding specificity of peptide-antibody interactions.","Many important protein-protein interactions are mediated by peptide recognition domains (PRD), which bind short linear sequence motifs in other proteins. For example, SH3 domains typically recognize proline-rich motifs, PDZ domains recognize hydrophobic C-terminal tails, and kinases recognize short sequence regions around a phosphorylatable residue (Pawson, 2003). Given the sequence of the domains, the challenge consists of predicting a position weight matrix (PWM) that describes the specificity profile of each of the given domains to their target peptides. Any publicly accessible peptide specificity information available for the domain may be used.","","https://www.synapse.org/#!Synapse:syn2925957/wiki/72976","completed","intermediate","1","","2009-06-01","2009-10-31","2023-09-12 21:27:35","2023-10-12 17:54:35" +"208","dream-5-transcription-factor-dna-motif-recognition-challenge","DREAM 5 - Transcription-Factor, DNA-Motif Recognition Challenge","Predict binding intensities for transcription factors from motifs.","Transcription factors (TFs) control the expression of genes through sequence-specific interactions with genomic DNA. Different TFs bind preferentially to different sequences, with the majority recognizing short (6-12 base), degenerate ‘motifs’. 
Modeling the sequence specificities of TFs is a central problem in understanding the function and evolution of the genome, because many types of genomic analyses involve scanning for potential TF binding sites. Models of TF binding specificity are also important for understanding the function and evolution of the TFs themselves. The challenge consists of predicting the signal intensities for the remaining TFs.","","https://www.synapse.org/#!Synapse:syn2887863/wiki/72185","completed","intermediate","1","https://doi.org/10.1038/nbt.2486","2011-06-01","2011-09-30","2023-09-12 21:27:41","2023-10-12 17:54:36" +"209","dream-5-epitope-antibody-recognition-ear-challenge","DREAM 5 - Epitope-Antibody Recognition (EAR) Challenge","Predict the binding specificity of peptide-antibody interactions.","Humoral immune responses are mediated through antibodies. About 1010 to 1012 different antigen binding sites called paratopes are generated by genomic recombination. These antibodies are capable to bind to a variety of structures ranging from small molecules to protein complexes, including any posttranslational modification thereof. When studying protein-antibody interactions, two types of epitopes (the region paratopes interact with) are to be distinguished from each other-i) conformational and ii) linear epitopes. All potential linear epitopes of a protein can be represented by short peptides derived from the primary amino acid sequence. These peptides can be synthesized and arrayed on solid supports, e.g. glass slides (see Lorenz et al., 2009 [1]). 
By incubating these peptide arrays with antibody mixtures such as human serum or plasma, peptides can be determined that interact with antibodies in a specific fashion.","","https://www.synapse.org/#!Synapse:syn2820433/wiki/71017","completed","intermediate","1","","2010-06-09","\N","2023-09-12 21:27:44","2023-10-12 17:54:39" +"210","dream-gene-expression-prediction-challenge","DREAM Gene Expression Prediction Challenge","Predict gene expression levels from promoter sequences in eukaryotes.","The level by which genes are transcribed is determined in large part by the DNA sequence upstream to the gene, known as the promoter region. Although widely studied, we are still far from a quantitative and predictive understanding of how transcriptional regulation is encoded in gene promoters. One obstacle in the field is obtaining accurate measurements of transcription derived by different promoters. To address this, an experimental system was designed to measure the transcription derived by different promoters, all of which are inserted into the same genomic location upstream to a reporter gene -a yellow florescence protein gene (YFP). The challenge consists of the prediction of the promoter activity given a promoter sequence and a specific experimental condition. To study a set of promoters that share many elements of the regulatory program, and thus are suitable for computational learning, the data pertains to promoters of most of the ribosomal protein genes (RP) of yeast (S....","","https://www.synapse.org/#!Synapse:syn2820426/wiki/71010","completed","intermediate","1","","2010-07-09","\N","2023-09-12 21:28:00","2023-10-19 23:32:10" +"211","dream-5-systems-genetics-challenge","DREAM 5 - Systems Genetics Challenge","Predict disease phenotypes and infer gene networks from systems genetics data.","The central goal of systems biology is to gain a predictive, system-level understanding of biological networks. 
This can be done, for example, by inferring causal networks from observations on a perturbed biological system. An ideal experimental design for causal inference is randomized, multifactorial perturbation. The recognition that the genetic variation in a segregating population represents randomized, multifactorial perturbations (Jansen and Nap (2001), Jansen (2003)) gave rise to Systems Genetics (SG), where a segregating or genetically randomized population is genotyped for many DNA variants, and profiled for phenotypes of interest (e.g. disease phenotypes), gene expression, and potentially other ‘omics’ variables (protein expression, metabolomics, DNA methylation, etc.; Figure 1. Figure 1 was taken from Jansen and Nap (2001)). In this challenge we explore the use of Systems Genetics data for elucidating causal network models among genes, i.e. Gene Networks (DREAM5 SYSGEN...","","https://www.synapse.org/#!Synapse:syn2820440/wiki/","completed","intermediate","1","","2010-07-09","\N","2023-09-12 21:28:10","2023-10-12 17:54:42" +"212","dream-6-estimation-of-model-parameters-challenge","DREAM 6 - Estimation of Model Parameters Challenge","Challenge to estimate model parameters.","Given the complete model structures (including expressions for the kinetic rate laws) for three gene regulatory networks, participants are asked to develop and/or apply optimization methods, including the selection of the most informative experiments, to accurately estimate parameters and predict outcomes of perturbations in Systems Biology models.","","https://www.synapse.org/#!Synapse:syn2841366/wiki/71372","completed","intermediate","1","","2011-06-01","2011-10-31","2023-09-12 21:28:12","2023-10-12 17:54:45" +"213","dream-6-flowcap2-molecular-classification-of-acute-myeloid-leukemia-challenge","DREAM 6 - FlowCAP2 Molecular Classification of Acute Myeloid Leukemia Challenge","The goal of this challenge is to diagnose Acute Myeloid Leukemia from patient data using flow cytometry.","Flow 
cytometry (FCM) has been widely used by immunologists and cancer biologists for more than 30 years as a biomedical research tool to distinguish different cell types in mixed populations, based on the expression of cellular markers. It has also become a widely used diagnostic tool for clinicians to identify abnormal cell populations associated with disease. In the last decade, advances in instrumentation and reagent technologies have enabled simultaneous single-cell measurement of tens of surface and intracellular markers, as well as tens of signaling molecules, positioning FCM to play an even bigger role in medicine and systems biology [1,2]. However, the rapid expansion of FCM applications has outpaced the functionality of traditional analysis tools used to interpret FCM data such that scientists are faced with the daunting prospect of manually identifying interesting cell populations in 20 dimensional data from a collection of millions of cells. For these reasons a reliable...","","https://www.synapse.org/#!Synapse:syn2887788/wiki/72178","completed","intermediate","1","https://doi.org/10.1038/nmeth.2365","2011-06-01","2011-09-30","2023-09-12 21:28:19","2023-10-12 17:54:47" "214","dream-6-alternative-splicing-challenge","DREAM 6 - Alternative Splicing Challenge","Compare mRNA-seq methods on primate and rhino transcripts","The goal of the mRNA-seq alternative splicing challenge is to assess the accuracy of the reconstruction of alternatively spliced mRNA transcripts from Illumina short-read mRNA-seq. Reconstructed transcripts will be scored against Pacific Biosciences long-read mRNA-seq. 
The ensuing analysis of the transcriptomes from mandrill and rhinoceros fibroblasts and their derived induced pluripotent stem cells (iPSC), as well as the transcriptome for human Embrionic Stem Cells (hESC) is an opportunity to discover novel biology as well as investigate species-bias of different methods.","","https://www.synapse.org/#!Synapse:syn2817724/wiki/","completed","intermediate","1","","2011-08-09","\N","2023-09-12 21:28:25","2023-10-12 17:54:50" -"215","causalbench-challenge","CausalBench Challenge","Deriving gene-gene networks to improve causal disease insights","Mapping gene-gene interactions in cellular systems is a fundamental step in early-stage drug discovery that helps generate hypotheses on what molecular mechanisms may effectively be targeted by potential future medicines. In the CausalBench Challenge, we invite the machine-learning community to advance the state-of-the-art in deriving gene-gene networks from large-scale real-world perturbational single-cell datasets to improve our ability to glean causal insights into disease-relevant biology.","","https://www.gsk.ai/causalbench-challenge/","completed","intermediate","16","https://doi.org/10.48550/arXiv.2308.15395","2023-03-01","2023-04-21","2023-09-12 21:28:25","2023-10-19 23:32:34" -"216","iclr-computational-geometry-and-topology-challenge-2022","ICLR Computational Geometry & Topology Challenge 2022","Fostering Geometric Learning: Crowdsourced Algorithms for Reproducible Deep ...","The purpose of this challenge is to foster reproducible research in geometric (deep) learning, by crowdsourcing the open-source implementation of learning algorithms on manifolds. 
Participants are asked to contribute code for a published/unpublished algorithm, following Scikit-Learn/Geomstats' or pytorch's APIs and computational primitives, benchmark it, and demonstrate its use in real-world scenarios.","","https://github.com/geomstats/challenge-iclr-2022","completed","intermediate","14","","\N","2022-04-04","2023-09-13 16:54:06","2023-10-19 23:28:44" +"215","causalbench-challenge","CausalBench Challenge","A machine learning contest for gene network inference from single-cell perturbation data","Mapping gene-gene interactions in cellular systems is a fundamental step in early-stage drug discovery that helps generate hypotheses on what molecular mechanisms may effectively be targeted by potential future medicines. In the CausalBench Challenge, we invite the machine-learning community to advance the state-of-the-art in deriving gene-gene networks from large-scale real-world perturbational single-cell datasets to improve our ability to glean causal insights into disease-relevant biology.","","https://www.gsk.ai/causalbench-challenge/","completed","intermediate","16","https://doi.org/10.48550/arXiv.2308.15395","2023-03-01","2023-04-21","2023-09-12 21:28:25","2023-10-19 23:32:34" +"216","iclr-computational-geometry-and-topology-challenge-2022","ICLR Computational Geometry & Topology Challenge 2022","Advancing computational geometry and topology with Python","The purpose of this challenge is to foster reproducible research in geometric (deep) learning, by crowdsourcing the open-source implementation of learning algorithms on manifolds. 
Participants are asked to contribute code for a published/unpublished algorithm, following Scikit-Learn/Geomstats' or pytorch's APIs and computational primitives, benchmark it, and demonstrate its use in real-world scenarios.","","https://github.com/geomstats/challenge-iclr-2022","completed","intermediate","14","","\N","2022-04-04","2023-09-13 16:54:06","2023-10-19 23:28:44" "217","iclr-computational-geometry-and-topology-challenge-2021","ICLR Computational Geometry & Topology Challenge 2021","Advancing computational geometry and topology with Python","The purpose of this challenge is to push forward the fields of computational differential geometry and topology, by creating the best data analysis, computational method, or numerical experiment relying on state-of-the-art geometric and topological Python packages.","","https://github.com/geomstats/challenge-iclr-2021","completed","intermediate","14","https://doi.org/10.48550/arXiv.2108.09810","\N","2021-05-02","2023-09-13 17:02:12","2023-10-19 23:28:44" "218","genedisco-challenge","GeneDisco Challenge","Exploring experimental design with active learning for genetics","The GeneDisco challenge is a machine learning community challenge for evaluating batch active learning algorithms for exploring the vast experimental design space in genetic perturbation experiments. Genetic perturbation experiments, using for example CRISPR technologies to perturb the genome, are a vital component of early-stage drug discovery, including target discovery and target validation. 
The GeneDisco challenge is organized in conjunction with the Machine Learning for Drug Discovery workshop at ICLR-22.","","https://www.gsk.ai/genedisco-challenge/","completed","intermediate","16","https://doi.org/10.48550/arXiv.2110.11875","2022-01-31","2022-03-31","2023-09-13 17:20:30","2023-10-19 23:32:43" "219","hidden-treasures-warm-up","Hidden Treasures: Warm Up","Assess genome sequencing software accuracy with unknown variants","In the context of human genome sequencing, software pipelines typically involve a wide range of processing elements, including aligning sequencing reads to a reference genome and subsequently identifying variants (differences). One way of assessing the performance of such pipelines is by using well-characterized datasets such as Genome in a Bottle’s NA12878. However, because the existing NGS reference datasets are very limited and have been widely used to train/develop software pipelines, benchmarking of pipeline performance would ideally be done on samples with unknown variants. This challenge will provide a unique opportunity for participants to investigate the accuracy of their pipelines by testing the ability to find in silico injected variants in FASTQ files from exome sequencing of reference cell lines. It will be a warm up for the community ahead of a more difficult in silico challenge to come in the fall. 
This challenge will provide users with a FASTQ file of a NA12878 se...","","https://precision.fda.gov/challenges/1","completed","intermediate","6","","2017-07-17","2017-09-13","2023-09-13 23:31:39","2023-10-12 17:55:23" -"220","data-management-and-graph-extraction-for-large-models-in-the-biomedical-space","Data management and graph extraction for large models in the biomedical space","CMU Libraries & DNAnexus Data Management Hackathon: Advancing Biomedical Kno...","This fall, CMU Libraries is hosting a hackathon in partnership with DNAnexus on the topic of data management and graph extraction for large models in the biomedical space. The hackathon will be held in person at CMU, October 19-21, 2023. The hackathon is a collaborative, rather than competitive, event, with each team working on a dedicated part of the problem. The teams will be focused on the following topics-1) Knowledge graph-based validation for variant (genomic) assertions; 2) Continuous monitoring for RLHF and flexible infrastructure for layering assertions with rollback; 3) Flexible tokenization of complex data types; 4) Assertion tracking in large models; 5) Column headers for data harmonization. The outputs are often published as preprints or on the F1000 hackathon channel. Contact Ben Busby (bbusby@dnanexus.com) with any questions about the hackathon or serving as a team lead.","","https://library.cmu.edu/about/news/2023-08/hackathon-2023","completed","intermediate","14","","2023-10-19","2023-10-21","2023-09-13 23:32:59","2023-09-27 21:08:26" +"220","data-management-and-graph-extraction-for-large-models-in-the-biomedical-space","Data management and graph extraction for large models in the biomedical space","CMU Libraries & DNAnexus Data Management Hackathon: Advancing Biomedical Knowledge Graphs","This fall, CMU Libraries is hosting a hackathon in partnership with DNAnexus on the topic of data management and graph extraction for large models in the biomedical space. 
The hackathon will be held in person at CMU, October 19-21, 2023. The hackathon is a collaborative, rather than competitive, event, with each team working on a dedicated part of the problem. The teams will be focused on the following topics-1) Knowledge graph-based validation for variant (genomic) assertions; 2) Continuous monitoring for RLHF and flexible infrastructure for layering assertions with rollback; 3) Flexible tokenization of complex data types; 4) Assertion tracking in large models; 5) Column headers for data harmonization. The outputs are often published as preprints or on the F1000 hackathon channel. Contact Ben Busby (bbusby@dnanexus.com) with any questions about the hackathon or serving as a team lead.","","https://library.cmu.edu/about/news/2023-08/hackathon-2023","completed","intermediate","14","","2023-10-19","2023-10-21","2023-09-13 23:32:59","2023-09-27 21:08:26" "221","cagi2-asthma-twins","CAGI2: Asthma discordant monozygotic twins","Identify genetic differences between asthmatic and healthy twins","The dataset includes whole genomes of 8 pairs of discordant monozygotic twins (randomly numbered from 1 to 16) that is, in each pair identical twins one has asthma and one does not. In addition, RNA sequencing data for each individual is provided. One of the twins in each pair suffers from asthma while the other twin is healthy.","","https://genomeinterpretation.org/cagi2-asthma-twins.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-12 18:11:42" "222","cagi4-bipolar","CAGI4: Bipolar disorder","Predicting bipolar disorder from exome data","Bipolar disorder (BD) is a serious mental illness characterized by recurrent episodes of manias and depression, which are syndromes of abnormal mood, thinking and behavior. It affects 1.0-4.5% of the population [1], and it is among the major causes of disability worldwide. 
This challenge involved the prediction of which of a set of individuals have been diagnosed with bipolar disorder, given exome data. 500 of the 1000 exome samples were provided for training.","","https://genomeinterpretation.org/cagi4-bipolar.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-09-28 18:25:17" "223","cagi3-brca","CAGI3: BRCA1 & BRCA2","Assess hereditary cancer risk via BRCA gene analysis","In normal cells, the BRCA1 and BRCA2 genes are involved in homologous recombination for double strand break repair and ensure the stability of a cell's genetic material. Mutations in these genes have been linked to development of breast and ovarian cancer. Myriad Genetics created the BRACAnalysis test in order to assess a woman’s risk of developing hereditary breast or ovarian cancer based on detection of mutations in the BRCA1 and BRCA2 genes. This test has become the standard of care in identification of individuals with hereditary breast and ovarian cancer (HBOC) syndrome. It is based on proprietary methods.","","https://genomeinterpretation.org/cagi3-brca.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:19:48","2023-10-19 23:32:48" @@ -248,35 +248,35 @@ "247","cagi2-rad50","CAGI2: RAD50","Assessing RAD50 variants for breast cancer risk","RAD50 is a candidate intermediate-risk breast cancer susceptibility gene. The RAD50 data provided for CAGI challenge include a list of potentially interesting sequence variants observed from sequencing RAD50 gene in about 1,400 breast cancer cases and 1,200 ethnically matched controls. 
Variants in the list were observed between 1 and 20 times.","","https://genomeinterpretation.org/cagi2-rad50.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-19 23:33:11" "248","cagi2-risksnps","CAGI2: riskSNPs","Exploring molecular mechanisms linking SNPs to disease risk","The goal of this experiment is to explore current understanding of the molecular level mechanisms underlying associations between SNPs and disease risk, incorporating expertise in each of the known mechanism areas, and as far as possible assigning possible mechanisms for each association locus. The correct mechanisms are unknown, so there can be no ranking of accuracy-that is not the point of the experiment. Rather, the goal is to ascertain which mechanisms appear most relevant, how confidently they can be assigned, and what fraction of loci can currently be assigned plausible mechanisms.","","https://genomeinterpretation.org/cagi2-risksnps.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-19 23:33:11" "249","cagi3-risksnps","CAGI3: riskSNPs","Exploring molecular mechanisms linking SNPs to disease risk","The goal of this experiment is to explore current understanding of the molecular level mechanisms underlying associations between SNPs and disease risk, incorporating expertise in each of the known mechanism areas, and as far as possible assigning possible mechanisms for each association locus. The correct mechanisms are unknown, so there can be no ranking of accuracy-that is not the point of the experiment. 
Rather, the goal is to ascertain which mechanisms appear most relevant, how confidently they can be assigned, and what fraction of loci can currently be assigned plausible mechanisms.","","https://genomeinterpretation.org/cagi3-risksnps.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:19:48","2023-10-19 23:33:13" -"250","cagi2-nav1-5","CAGI2: SCN5A","Predictors are asked to submit predictions on the effect of the mutants on t...","The cardiac action potential (AP) is the sum of a number of distinct ionic currents. It can be divided into five phases (phase 0‐4). From pacemaker cells of the SA node the initial depolarizing wave front will spread throughout the cardiomyocytes via gap junctions. If the depolarization is sufficient voltage‐dependent sodium channels (Nav1.5) are activated and allow Na+ influx. This results in a further depolarization of the membrane which will lead to opening of even more Nav channels. This positive feedback mechanism is seen as the rapid upstroke in the initial phase (phase 0) of the action potential. Nav1.5 is encoded by SCN5A and mutations in this gene have been associated with various diseases such as Atrial fibrillation, Long QT syndrome, Cardiac Conduction Defect, Sick Sinus Disease, and Brugada Syndrome (BrS).","","https://genomeinterpretation.org/cagi2-nav1.5.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-16 18:32:16" -"251","cagi2-mr-1","CAGI2: Shewanella oneidensis strain MR-1","Shewanella oneidensis strain MR-1 (formerly known as S. 
putrefaciens) is a m...","Predictors are asked to submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition (stressor).","","https://genomeinterpretation.org/cagi2-mr-1.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:55","2023-10-16 18:32:21" -"252","cagi3-mr-1","CAGI3: Shewanella oneidensis strain MR-1","Shewanella oneidensis strain MR-1 (formerly known as S. putrefaciens) is a m...","Predictors are asked to submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition (stressor).","","https://genomeinterpretation.org/cagi3-mr-1.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:20:01","2023-10-16 18:18:07" -"253","cagi4-sickkids","CAGI4: SickKids","The challenge presented here is to use computational methods to match each g...","Realizing the promise of precision medicine will require developing methods for interpreting genome sequence data to infer individuals’ phenotypic traits and predispositions to disease. This challenge involves 25 children with suspected genetic disorders who were referred for clinical genome sequencing. Predictors are given their genome sequences and their clinical phenotypic descriptions, as provided to the diagnostic laboratory, and asked to predict which genome corresponds to which clinical description. Additionally, identify the diagnostic variants underlying the predictions. 
Optionally, identify predictive secondary variants conferring high risk of other diseases whose phenotypes are not reported in the clinical descriptions.","","https://genomeinterpretation.org/cagi4-sickkids.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-10-06 20:48:13" -"254","cagi4-sumo-ligase","CAGI4: SUMO ligase","Participants are asked to submit predictions of the effect of the variants o...","SUMO ligase identifies target proteins and covalently attaches SUMO to them, thereby modulating the functions of hundreds of proteins including proteins implicated in cancer, neurodegeneration, and other diseases. A large library of missense mutations in human SUMO ligase has been assessed for competitive growth in a high-throughput yeast-based complementation assay. The challenge is to predict the effect of mutations on function, as measured by the change in fractional representation of each mutant SUMO ligase clone, relative to wild-type clones, in a competitive yeast growth assay.","","https://genomeinterpretation.org/cagi4-sumo-ligase.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-10-19 23:31:57" -"255","cagi3-splicing","CAGI3: TP53 splicing","With the provided data, determine which disease-causing mutations in the TP5...","The function of exonic splicing regulatory elements can be undermined by DNA sequence variation and in some cases can contribute to pathogenesis. Thousands of disease-causing mutations disrupt exonic splicing regulatory elements. These data suggest that >25 percent of missense mutations may impact pre-mRNA splicing rather than mRNA translation. Using minigene constructs derived from a fragment of the TP53 gene, we have experimentally determined if each mutation influences splicing fidelity in HEK293T cells. We hope that CAGI participants will be able to predict the outcome of our experiments. 
A long-term goal will be the computational prioritization of disease-causing mutations prior to experimental validation. This contribution is expected to have major impacts in understanding the pathogenic basis of disease-causing mutations.","","https://genomeinterpretation.org/cagi3-splicing.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:19:48","2023-10-10 19:48:10" -"256","cagi4-warfarin","CAGI4: Warfarin exomes","With the provided exome data and clinical covariates, predict the therapeuti...","With over 33 million prescriptions in 2011, warfarin is the most commonly used anticoagulant for preventing thromboembolic events. Warfarin has a twenty-fold inter-individual dose variability and a narrow therapeutic index, and it is responsible for a third of adverse drug event hospitalizations in older Americans [2]. Alternatives to warfarin, such as direct thrombin inhibitors and factor Xa inhibitors, are now available. However, these are more expensive, irreversible, and may cause a higher rate of acute coronary events compared to warfarin [3,4]. Thus, warfarin remains a mainstay of anticoagulant therapy, and better methods of dosing warfarin will lead to fewer adverse events due to overcoagulation.","","https://genomeinterpretation.org/cagi4-warfarin.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-09-28 21:19:03" -"257","cagi6-calmodulin","CAGI6: Calmodulin","Participants were asked to submit predictions for the competitive growth sco...","Calmodulin (CaM) is a ubiquitous calcium (Ca2+) sensor protein interacting with more than 200 molecular partners, thereby regulating a variety of biological processes. Missense point mutations in the genes encoding CaM have been associated with ventricular tachycardia and sudden cardiac death. 
A library encompassing up to 17 point mutations was assessed by far-UV circular dichroism (CD) by measuring melting temperature (Tm) and percentage of unfolding (%unfold) upon thermal denaturation at pH and salt concentration that mimic the physiological conditions. The challenge is to predict- the Tm and %unfold values for isolated CaM variants under Ca2+-saturating conditions (Ca2+-CaM) and in the Ca2+-free (apo) state; whether the point mutation stabilizes or destabilizes the protein (based on Tm and %unfold).","","https://genomeinterpretation.org/cagi6-cam.html","completed","intermediate","1","","\N","2021-12-31","2023-09-28 18:19:48","2023-10-19 23:33:19" -"258","cagi2-splicing","CAGI2: splicing","Predictors are asked to compare exons from wild type and disease-associated ...","Accurate precursor mRNA (pre-mRNA) splicing is required for the expression of protein coding genes from the human genome. In this process, intervening sequences (introns) are removed from pre-mRNA and coding/regulatory sequences (exons) are ligated together generating a mature mRNA. A large ribonucleoprotein machine called the spliceosome assembles de novo upon every nascent intron and catalyzes the chemical steps of splicing.","","https://genomeinterpretation.org/cagi2-splicing.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-18 15:32:55" -"259","cagi6-lc-arsa","CAGI6: ARSA","Predicting the effect of naturally occurring missense mutations on enzymatic...","Metachromatic Leukodystrophy (MLD) is an autosomal recessive, lysosomal-storage disease caused by mutations in Arylsulfatase A (ARSA) and toxic accumulation of sulfatide substrate. Genome sequencing has revealed hundreds of protein-altering, ARSA missense variants, but the functional effect of most variants remains unknown. ARSA enzyme activity using a high-throughput cellular assay was measured for a large set of variants of known significance and variants of unknown significance. 
The challenge is to predict the fractional enzyme activity of each mutant protein compared to the wildtype protein.","","https://genomeinterpretation.org/cagi6-lc-arsa.html","completed","intermediate","1","","\N","2022-11-16","2023-09-28 18:20:23","2023-10-12 18:11:51" -"260","predict-hits-for-the-wdr-domain-of-lrrk2","CACHE1: Predict Hits for The WDR Domain of LRRK2","Finding ligands targeting the central cavity of the WD-40 repeat (WDR) domai...","The first CACHE Challenge target is LRRK2, the most commonly mutated gene in familial Parkinson's Disease. Participants are asked to find hits for the WD40 repeat (WDR) domain of LRRK2. Read more under Details below.","","https://cache-challenge.org/challenges/predict-hits-for-the-wdr-domain-of-lrrk2","completed","intermediate","17","","2021-12-01","2022-01-31","2023-09-27 19:01:55","2023-11-01 03:58:21" -"261","finding-ligands-targeting-the-conserved-rna-binding-site-of-sars-cov-2-nsp13","CACHE2: Finding Ligands Targeting The Conserved RNA Binding Site of SARS-CoV-2 NSP13","Finding ligands targeting the conserved RNA binding site of SARS-CoV-2 NSP13...","Predicted compounds will be procured and tested at CACHE using both enzymatic and binding assays","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-conserved-rna-binding-site-of-sars-cov-2-nsp13","completed","intermediate","17","","2022-06-22","2022-09-04","2023-09-27 19:02:43","2023-11-01 03:58:00" -"262","finding-ligands-targeting-the-macrodomain-of-sars-cov-2-nsp3","CACHE3: Finding ligands targeting the macrodomain of SARS-CoV-2 Nsp3","Studying the macrodomain of Severe acute respiratory syndrome coronavirus 2 ...","To predict ligands that bind to the ADPr site of SARS-CoV-2 Nsp3 macrodomain (Mac1).","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-macrodomain-of-sars-cov-2-nsp3","completed","intermediate","17","","2022-11-02","2023-01-01","2023-09-27 19:03:13","2023-10-16 19:01:19" 
-"263","finding-ligands-targeting-the-tkb-domain-of-cblb","CACHE4: Finding ligands targeting the TKB domain of CBLB","Investigating the TKB domain of CBLB, a protein involved in cancer and immun...","Predict compounds that bind to the closed conformation of the CBLB TKB domain with novel chemical templates and KD below 30 micromolar.","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-tkb-domain-of-cblb","completed","intermediate","17","","2023-03-09","2023-05-09","2023-09-27 19:03:14","2023-10-16 19:01:22" -"264","rare-disease-ai-hackathon","Rare Disease AI Hackathon","Researchers and medical experts are invited to collaborate on our patient ca...","Bring AI and medical experts together to build open source models for rare diseases. Create zero-barrier access to rare disease expertise for patients, researchers and physicians. Use AI to Uncover novel links between rare diseases. Establish validation methods for medical AI models. Jumpstart an open source community for rare disease AI models. Launch models for Beta testing on Hypophosphatasia.ai and EhlersDanlos.ai.","","https://www.rarediseaseaihackathon.org/","active","intermediate","14","","2023-09-30","2024-01-15","2023-09-27 19:10:40","2023-10-24 15:56:45" -"265","cometh-benchmark","COMETH Benchmark","Quantify tumor heterogeneity—how many cell types are present and in which pr...","Successful treatment of cancer is still a challenge and this is partly due to a wide heterogeneity of cancer composition across patient population. Unfortunately, accounting for such heterogeneity is very difficult. Clinical evaluation of tumor heterogeneity often requires the expertise of anatomical pathologists and radiologists.This benchmark is dedicated to the quantification of intra-tumor heterogeneity using appropriate statistical methods on cancer omics data.In particular, it focuses on estimating cell types and proportion in biological samples based on methylation and methylome data sets. 
The goal is to explore various statistical methods for source separation/deconvolution analysis (Non-negative Matrix Factorization, Surrogate Variable Analysis, Principal component Analysis, Latent Factor Models, ...) using both RNA-seq and methylome data.","","https://www.codabench.org/competitions/218/","completed","intermediate","10","","2020-06-14","2020-12-29","2023-09-28 23:25:52","2023-10-10 19:47:14" -"266","the-miccai-2014-machine-learning-challenge","The MICCAI 2014 Machine Learning Challenge","Predicting Binary and Continuous Phenotypes from Structural Brain MRI Data i...","Machine learning tools have been increasingly applied to structural brain magnetic resonance imaging (MRI) scans, largely for developing models to predict clinical phenotypes at the individual level. Despite significant methodological developments and novel application domains, there has been little effort to conduct benchmark studies with standardized datasets, which researchers can use to validate new tools, and more importantly conduct an objective comparison with state-of-the-art algorithms. The MICCAI 2014 Machine Learning Challenge (MLC) will take a significant step in this direction, where we will employ four separate, carefully compiled, and curated large-scale (each N > 70) structural brain MRI datasets with accompanying clinically relevant phenotypes. Our goal is to provide a snapshot of the current state of the art in the field of neuroimage-based prediction, and attract machine-learning practitioners to the MICCAI community and the field of medical image computing in g...","","https://competitions.codalab.org/competitions/1471","completed","intermediate","9","","2014-04-16","2014-06-14","2023-09-28 23:36:12","2023-10-19 23:31:50" -"267","cagi6-annotate-all-missense","CAGI6: Annotate All Missense","Predictors are asked to predict the functional effect of every coding single...","dbNSFP currently describes 81,782,923 possible protein-altering variants in the human genome. 
The challenge is to predict the functional effect of every such variant. For the vast majority of these missense and nonsense variants, the functional impact is not currently known, but experimental and clinical evidence is accruing rapidly. Rather than drawing upon a single discrete dataset as typical with CAGI, predictions will be assessed by comparing with experimental or clinical annotations made available after the prediction submission date, on an ongoing basis. If predictors assent, predictions will also be incorporated into dbNSFP.","","https://genomeinterpretation.org/cagi6-annotate-all-missense.html","completed","intermediate","1","","2021-06-01","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:13:42" -"268","cagi6-hmbs","CAGI6: HMBS","Participants are asked to submit predictions of the fitness score for each o...","Hydroxymethylbilane synthase (HMBS), also known as porphobilinogen deaminase (PBGD) or uroporphyrinogen I synthase, is an enzyme involved in heme production. In humans, variants that affect HMBS function result in acute intermittent porphyria (AIP), an autosomal dominant genetic disorder caused by a build-up of porphobilinogen in the cytoplasm. A large library of HMBS missense variants was assessed with respect to their effects on protein function using a high-throughput yeast complementation assay. The challenge is to predict the functional effects of these variants.","","https://genomeinterpretation.org/cagi6-hmbs.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:05" -"269","cagi6-id-panel","CAGI6: Intellectual Disability Panel","In this challenge, predictors are asked to analyze the sequence data for the...","The objective in this challenge is to predict a patient's clinical phenotype and the causal variant(s) based on their gene panel sequences. 
Sequence data for 74 genes from a cohort of 500 patients with a range of neurodevelopmental presentations (intellectual disability, autistic spectrum disorder, epilepsy, microcephaly, macrocephaly, hypotonia, ataxia) has been made available for this challenge. Additional data from 150 patients from the same clinical study is available for training and validation.","","https://genomeinterpretation.org/cagi6-id-panel.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:09" -"270","cagi6-mapk1","CAGI6: MAPK1","For each variant, participants are asked to predict the ΔΔG(H2O) value for t...","MAPK1 (ERK2) is active as serine/threonine kinase in the Ras-Raf-MEK-ERK signal transduction cascade that regulates cell proliferation, transcription, differentiation, and cell cycle progression. MAPK1 is activated by phosphorylation which occurs with strict specificity by MEK1/2 on Thr185 and Tyr187, and may also act as a transcriptional repressor independent of its kinase activity. A library of eleven missense variants selected from the COSMIC database was assessed by near and far-UV circular dichroism and intrinsic fluorescence spectra to determine thermodynamic stability at different concentrations of denaturant. These data were used to calculate a ΔΔGH20 value; i.e., the difference in unfolding free energy ΔGH20 between each variant and the wildtype protein, both in phosphorylated and unphosphorylated forms. 
The challenge is to predict these two ΔΔGH20 values and the catalytic efficiency (kcat/km)mut/(kcat/km)wt, as determined by a fluorescence assay, of the phosphorylated fo...","","https://genomeinterpretation.org/cagi6-mapk1.html","completed","intermediate","1","","2021-07-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:13" -"271","cagi6-mapk3","CAGI6: MAPK3","For each variant, participants are asked to predict the ΔΔG(H2O) value for t...","MAPK3 (ERK1) is active as serine/threonine kinase in the Ras-Raf-MEK-ERK signal transduction cascade that regulates cell proliferation, transcription, differentiation, and cell cycle progression. MAPK3 is activated by phosphorylation which occurs with strict specificity by MEK1/2 on Thr202 and Tyr204, and may also act as a transcriptional repressor independent of its kinase activity. A library of twelve missense variants selected from the COSMIC database was assessed by near and far-UV circular dichroism and intrinsic fluorescence spectra to determine thermodynamic stability at different concentrations of denaturant. These data were used to calculate a ΔΔGH20 value; i.e., the difference in unfolding free energy ΔGH20 between each variant and the wildtype protein, both in phosphorylated and unphosphorylated forms. The challenge is to predict these two ΔΔGH20 values and the catalytic efficiency (kcat/km)mut/(kcat/km)wt, as determined by a fluorescence assay, of the phosphorylated fo...","","https://genomeinterpretation.org/cagi6-mapk3.html","completed","intermediate","1","","2021-08-04","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:15" -"272","cagi6-mthfr","CAGI6: MTHFR","Participants are asked to submit predictions of the fitness score for each m...","Methylenetetrahydrofolate reductase (MTHFR) catalyzes the production of 5-methyltetrahydrofolate, which is needed for conversion of homocysteine to methionine. 
Humans with variants affecting MTHFR function present with a wide range of phenotypes, including homocystinuria, homocysteinemia, developmental delay, severe mental retardation, psychiatric disturbances, and late-onset neurodegenerative disorders. A further complication to interpretation of variants in this gene is a common variant, Ala222Val, carried by a large fraction of the human population. A large library of MTHFR missense variants was assessed with respect to their effects on protein function using a high-throughput yeast complementation assay. The challenge is to predict the functional effects of these variants in two different settings- for the wildtype protein, and for the protein with the common Ala222Val variant.","","https://genomeinterpretation.org/cagi6-mthfr.html","completed","intermediate","1","","2021-05-03","2021-06-30","2023-06-23 00:00:00","2023-10-12 18:12:18" -"273","cagi6-prs","CAGI6: Polygenic Risk Scores","Participants will be expected to provide a fully trained prediction model th...","Polygenic risk scores (PRS) have potential clinical utility for risk surveillance, prevention and personalized medicine. Participants will be provided with datasets of four real phenotypes (Type 2 Diabetes, Breast Cancer, Inflammatory Bowel Disease and Coronary Artery Disease) and of thirty simulated phenotypes representing a range of genetic architectures of common polygenic diseases. The challenge is to predict the disease outcomes of individuals in held-out validation cohorts.","","https://genomeinterpretation.org/cagi6-prs.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:23" -"274","cagi6-rgp","CAGI6: Rare Genomes Project","The prediction challenge involves approximately 30 families. The prediction ...","The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing for rare disease diagnosis and gene discovery. 
The study is led by genomics experts and clinicians at the Broad Institute of MIT and Harvard. Research subjects are consented for genomic sequencing and the sharing of their sequence and phenotype information with researchers working to understand the molecular causes of rare disease. When a candidate disease variant believed to be related to the phenotype is identified, the variant is confirmed with Sanger sequencing in a clinical setting and returned to the participant via his or her local physician. In this challenge, whole genome sequence data and phenotype data from a subset of the solved and unsolved RGP families will be provided. Participants in the challenge will try to identify the causative variant(s) in each case. For the unsolved cases, prioritized variants from the participating teams will be examined to see if ad...","","https://genomeinterpretation.org/cagi6-rgp.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:27" -"275","cagi6-invitae","CAGI6: Sherloc clinical classification","Over 122,000 coding (missense, silent, frameshift, stop gained, in-frame cod...","Invitae is a genetic testing company that publishes their variant interpretations to ClinVar. In this challenge, over 122,000 previously uncharacterized variants are provided, spanning the range of effects seen in the clinic. Following the close of this challenge, Invitae will submit their interpretations for these variants to ClinVar. 
Predictors are asked to interpret the pathogenicity of these variants, and the clinical utility of predictions will be assessed across multiple categories by Invitae.","","https://genomeinterpretation.org/cagi6-invitae.html","completed","intermediate","1","","2021-07-08","2021-12-01","2023-06-23 00:00:00","2023-10-12 18:12:31" -"276","cagi6-splicing-vus","CAGI6: Splicing VUS","Predict whether the experimentally validated variants of unknown significanc...","Variants causing aberrant splicing have been implicated in a range of common and rare disorders, including retinitis pigmentosa, autism spectrum disorder, amyotrophic lateral sclerosis, and a variety of cancers. However, such variants are frequently overlooked by diagnostic sequencing pipelines, leading to missed diagnoses for patients. Clinically ascertained variants of unknown significance underwent whole-blood based RT-PCR to test for impact on splicing. The challenge is to predict which of the tested variants disrupt splicing.","","https://genomeinterpretation.org/cagi6-splicing-vus.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:34" -"277","cagi6-stk11","CAGI6: STK11","Participants are asked to submit predictions on the impact of the variants l...","Serine/Threonine Kinase 11 (STK11) is considered a master kinase that functions as a tumor suppressor and nutrient sensor within a heterotrimeric complex with pseudo-kinase STRAD-alpha and structural protein MO25. Germline variants resulting in loss of STK11 define Peutz-Jaghers Syndrome, an autosomal dominant cancer predisposition syndrome marked by gastrointestinal hamartomas and freckling of the oral mucosa. Somatic loss of function variants, both nonsense and missense, occur in 15-30% of non-small cell lung adenocarcinomas, where they correlate clinically with insensitivity to anti-PD1 monoclonal antibody therapy. 
The challenge is to predict the impact on STK11 function for each missense variant in relation to wildtype STK11.","","https://genomeinterpretation.org/cagi6-stk11.html","completed","intermediate","1","","2021-06-08","2021-09-01","2023-06-23 00:00:00","2023-10-12 18:12:38" -"278","qbi-hackathon","QBI hackathon","A 48-hour event connecting the Bay Area developer community with scientists ...","The QBI hackathon is a 48-hour event connecting the vibrant Bay Area developer community with the scientists from UCSF, UCB and UCSC, during which we work together on the cutting edge biomedical problems. Advances in computer vision, AI, and machine learning have enabled computers to pick out cat videos, recognize people’s faces from photos, play video games and drive cars. More recently, application of deep neural nets to protein structure prediction completely revolutionized the field. We look forward to seeing how far we can push science ahead when we apply these latest algorithms to biomedically relevant light microscopy, electron microscopy, and proteomics data. If you love FFTs, transformers, language models, topological data processing, or simply writing code, this is your chance to apply your skills to make an impact on global healthcare. 
Beyond the actual event, we hope to establish a better connection between talented developers and scientists in the Bay Area, so that we...","","https://www.eventbrite.com/e/qbi-hackathon-2023-tickets-633794304827?aff=oddtdtcreator","completed","intermediate","14","","2023-11-04","2023-11-05","2023-10-06 21:22:51","2023-10-26 23:23:36" -"279","niddk-central-repository-data-centric-challenge","NIDDK Central Repository Data-Centric Challenge","Enhancing NIDDK datasets for future Artificial Intelligence (AI) application...","The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository (https://repository.niddk.nih.gov/home/) is conducting a Data Centric Challenge aimed at augmenting existing Repository data for future secondary research including data-driven discovery by artificial intelligence (AI) researchers. The NIDDK Central Repository (NIDDK-CR) program strives to increase the utilization and impact of the resources under its guardianship. However, lack of standardization and consistent metadata within and across studies limit the ability of secondary researchers to easily combine datasets from related studies to generate new insights using data science methods. In the fall of 2021, the NIDDK-CR began implementing approaches to augment data quality to improve AI-readiness by making research data FAIR (findable, accessible, interoperable, and reusable) via a small pilot project utilizing Natural Language Processing (NLP) to tag study variables. In 2022, the NIDD...","","https://www.challenge.gov/?challenge=niddk-central-repository-data-centric-challenge","completed","intermediate","14","","2023-09-20","2023-11-03","2023-10-18 16:58:17","2023-10-18 20:52:49" -"280","stanford-ribonanza-rna-folding","Stanford Ribonanza RNA Folding","Pioneering RNA Science: A Path to Programmable Medicine and Scientific Break...","Ribonucleic acid (RNA) is essential for most biological functions. 
A better understanding of how to manipulate RNA could help usher in an age of programmable medicine, including first cures for pancreatic cancer and Alzheimer’s disease as well as much-needed antibiotics and new biotechnology approaches for climate change. But first, researchers must better understand each RNA molecule's structure, an ideal problem for data science.","","https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding","active","intermediate","8","","2023-08-23","2023-11-24","2023-10-23 20:58:06","2023-11-02 18:01:38" -"281","uls23","Universal Lesion Segmentation '23 Challenge","Revolutionizing Lesion Segmentation: Advancements, Challenges, and a Univers...","Significant advancements have been made in AI-based automatic segmentation models for tumours. Medical challenges focusing on e.g. liver, kidney, or lung tumours have resulted in large performance improvements for segmenting these types of lesions. However, in clinical practice there is a need for versatile and robust models capable of quickly segmenting the many possible lesions types in the thorax-abdomen area. Developing a Universal Lesion Segmentation (ULS) model that can handle this diversity of lesions types requires a well-curated and varied dataset. Whilst there has been previous work on ULS [6-8], most research in this field has made extensive use of a single partially annotated dataset [9], containing only the long- and short-axis diameters on a single axial slice. Furthermore, a test set containing 3D segmentation masks used during evaluation on this dataset by previous publications is not publicly available. 
For these reasons we are excited to host the ULS23 Challenge...","","https://uls23.grand-challenge.org/","active","intermediate","5","","2023-10-29","2024-03-17","2023-11-02 15:35:22","2023-11-02 18:02:48" +"250","cagi2-nav1-5","CAGI2: SCN5A","Predict on the effect of the mutants on the SCN5A involved in cardiac electrophysiology.","The cardiac action potential (AP) is the sum of a number of distinct ionic currents. It can be divided into five phases (phase 0‐4). From pacemaker cells of the SA node the initial depolarizing wave front will spread throughout the cardiomyocytes via gap junctions. If the depolarization is sufficient voltage‐dependent sodium channels (Nav1.5) are activated and allow Na+ influx. This results in a further depolarization of the membrane which will lead to opening of even more Nav channels. This positive feedback mechanism is seen as the rapid upstroke in the initial phase (phase 0) of the action potential. Nav1.5 is encoded by SCN5A and mutations in this gene have been associated with various diseases such as Atrial fibrillation, Long QT syndrome, Cardiac Conduction Defect, Sick Sinus Disease, and Brugada Syndrome (BrS).","","https://genomeinterpretation.org/cagi2-nav1.5.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-16 18:32:16" +"251","cagi2-mr-1","CAGI2: Shewanella oneidensis strain MR-1","Submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition","Predictors are asked to submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition (stressor).","","https://genomeinterpretation.org/cagi2-mr-1.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:55","2023-10-16 18:32:21" +"252","cagi3-mr-1","CAGI3: Shewanella oneidensis strain MR-1","Submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition","Predictors 
are asked to submit predictions on how insertions in the given gene of MR-1 affect the fitness of that gene in a given condition (stressor).","","https://genomeinterpretation.org/cagi3-mr-1.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:20:01","2023-10-16 18:18:07" +"253","cagi4-sickkids","CAGI4: SickKids","Use computational methods to match each genome sequence to the clinical descriptions and phenotypes of pediatric cases.","Realizing the promise of precision medicine will require developing methods for interpreting genome sequence data to infer individuals’ phenotypic traits and predispositions to disease. This challenge involves 25 children with suspected genetic disorders who were referred for clinical genome sequencing. Predictors are given their genome sequences and their clinical phenotypic descriptions, as provided to the diagnostic laboratory, and asked to predict which genome corresponds to which clinical description. Additionally, identify the diagnostic variants underlying the predictions. Optionally, identify predictive secondary variants conferring high risk of other diseases whose phenotypes are not reported in the clinical descriptions.","","https://genomeinterpretation.org/cagi4-sickkids.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-10-06 20:48:13" +"254","cagi4-sumo-ligase","CAGI4: SUMO ligase","Submit predictions of the effect of the variants on the Small Ubiquitin-like Modifier (SUMO) ligase protein.","SUMO ligase identifies target proteins and covalently attaches SUMO to them, thereby modulating the functions of hundreds of proteins including proteins implicated in cancer, neurodegeneration, and other diseases. A large library of missense mutations in human SUMO ligase has been assessed for competitive growth in a high-throughput yeast-based complementation assay. 
The challenge is to predict the effect of mutations on function, as measured by the change in fractional representation of each mutant SUMO ligase clone, relative to wild-type clones, in a competitive yeast growth assay.","","https://genomeinterpretation.org/cagi4-sumo-ligase.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-10-19 23:31:57" +"255","cagi3-splicing","CAGI3: TP53 splicing","Determine which TP53 mutations lead to aberrant splicing and potentially contribute to cancer.","The function of exonic splicing regulatory elements can be undermined by DNA sequence variation and in some cases can contribute to pathogenesis. Thousands of disease-causing mutations disrupt exonic splicing regulatory elements. These data suggest that >25 percent of missense mutations may impact pre-mRNA splicing rather than mRNA translation. Using minigene constructs derived from a fragment of the TP53 gene, we have experimentally determined if each mutation influences splicing fidelity in HEK293T cells. We hope that CAGI participants will be able to predict the outcome of our experiments. A long-term goal will be the computational prioritization of disease-causing mutations prior to experimental validation. This contribution is expected to have major impacts in understanding the pathogenic basis of disease-causing mutations.","","https://genomeinterpretation.org/cagi3-splicing.html","completed","intermediate","14","","\N","2013-04-25","2023-09-28 18:19:48","2023-10-10 19:48:10" +"256","cagi4-warfarin","CAGI4: Warfarin exomes","Predict the therapeutic doses of warfarin for individual patients to reduce adverse events.","With over 33 million prescriptions in 2011, warfarin is the most commonly used anticoagulant for preventing thromboembolic events. Warfarin has a twenty-fold inter-individual dose variability and a narrow therapeutic index, and it is responsible for a third of adverse drug event hospitalizations in older Americans [2]. 
Alternatives to warfarin, such as direct thrombin inhibitors and factor Xa inhibitors, are now available. However, these are more expensive, irreversible, and may cause a higher rate of acute coronary events compared to warfarin [3,4]. Thus, warfarin remains a mainstay of anticoagulant therapy, and better methods of dosing warfarin will lead to fewer adverse events due to overcoagulation.","","https://genomeinterpretation.org/cagi4-warfarin.html","completed","intermediate","14","","\N","2016-04-04","2023-09-28 18:19:48","2023-09-28 21:19:03" +"257","cagi6-calmodulin","CAGI6: Calmodulin","Participants were asked to submit predictions for the competitive growth score of different Calmodulin variants.","Calmodulin (CaM) is a ubiquitous calcium (Ca2+) sensor protein interacting with more than 200 molecular partners, thereby regulating a variety of biological processes. Missense point mutations in the genes encoding CaM have been associated with ventricular tachycardia and sudden cardiac death. A library encompassing up to 17 point mutations was assessed by far-UV circular dichroism (CD) by measuring melting temperature (Tm) and percentage of unfolding (%unfold) upon thermal denaturation at pH and salt concentration that mimic the physiological conditions. The challenge is to predict- the Tm and %unfold values for isolated CaM variants under Ca2+-saturating conditions (Ca2+-CaM) and in the Ca2+-free (apo) state; whether the point mutation stabilizes or destabilizes the protein (based on Tm and %unfold).","","https://genomeinterpretation.org/cagi6-cam.html","completed","intermediate","1","","\N","2021-12-31","2023-09-28 18:19:48","2023-10-19 23:33:19" +"258","cagi2-splicing","CAGI2: splicing","Compare exons to understand the mechanisms underlying pre-mRNA splicing errors.","Accurate precursor mRNA (pre-mRNA) splicing is required for the expression of protein coding genes from the human genome. 
In this process, intervening sequences (introns) are removed from pre-mRNA and coding/regulatory sequences (exons) are ligated together generating a mature mRNA. A large ribonucleoprotein machine called the spliceosome assembles de novo upon every nascent intron and catalyzes the chemical steps of splicing.","","https://genomeinterpretation.org/cagi2-splicing.html","completed","intermediate","14","","\N","2011-10-06","2023-09-28 18:19:48","2023-10-18 15:32:55" +"259","cagi6-lc-arsa","CAGI6: ARSA","Predict the effect of naturally occurring missense mutations on enzymatic activity","Metachromatic Leukodystrophy (MLD) is an autosomal recessive, lysosomal-storage disease caused by mutations in Arylsulfatase A (ARSA) and toxic accumulation of sulfatide substrate. Genome sequencing has revealed hundreds of protein-altering, ARSA missense variants, but the functional effect of most variants remains unknown. ARSA enzyme activity using a high-throughput cellular assay was measured for a large set of variants of known significance and variants of unknown significance. The challenge is to predict the fractional enzyme activity of each mutant protein compared to the wildtype protein.","","https://genomeinterpretation.org/cagi6-lc-arsa.html","completed","intermediate","1","","\N","2022-11-16","2023-09-28 18:20:23","2023-10-12 18:11:51" +"260","predict-hits-for-the-wdr-domain-of-lrrk2","CACHE1: Predict Hits for The WDR Domain of LRRK2","Finding ligands targeting the central cavity of the WDR domain of LRRK2, a protein associated with Parkinson's disease.","The first CACHE Challenge target is LRRK2, the most commonly mutated gene in familial Parkinson's Disease. Participants are asked to find hits for the WD40 repeat (WDR) domain of LRRK2. 
Read more under Details below.","","https://cache-challenge.org/challenges/predict-hits-for-the-wdr-domain-of-lrrk2","completed","intermediate","17","","2021-12-01","2022-01-31","2023-09-27 19:01:55","2023-11-01 03:58:21" +"261","finding-ligands-targeting-the-conserved-rna-binding-site-of-sars-cov-2-nsp13","CACHE2: Finding Ligands Targeting The Conserved RNA Binding Site of SARS-CoV-2 NSP13","Finding ligands targeting the conserved RNA binding site of SARS-CoV-2 NSP13 for potential antiviral drug development.","Predicted compounds will be procured and tested at CACHE using both enzymatic and binding assays","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-conserved-rna-binding-site-of-sars-cov-2-nsp13","completed","intermediate","17","","2022-06-22","2022-09-04","2023-09-27 19:02:43","2023-11-01 03:58:00" +"262","finding-ligands-targeting-the-macrodomain-of-sars-cov-2-nsp3","CACHE3: Finding ligands targeting the macrodomain of SARS-CoV-2 Nsp3","Study the macrodomain of SARS-CoV-2 Nsp3 for potential therapeutic applications.","To predict ligands that bind to the ADPr site of SARS-CoV-2 Nsp3 macrodomain (Mac1).","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-macrodomain-of-sars-cov-2-nsp3","completed","intermediate","17","","2022-11-02","2023-01-01","2023-09-27 19:03:13","2023-10-16 19:01:19" +"263","finding-ligands-targeting-the-tkb-domain-of-cblb","CACHE4: Finding ligands targeting the TKB domain of CBLB","Investigate the TKB domain of CBLB to discover novel compounds for treatment.","Predict compounds that bind to the closed conformation of the CBLB TKB domain with novel chemical templates and KD below 30 micromolar.","","https://cache-challenge.org/challenges/finding-ligands-targeting-the-tkb-domain-of-cblb","completed","intermediate","17","","2023-03-09","2023-05-09","2023-09-27 19:03:14","2023-10-16 19:01:22" +"264","rare-disease-ai-hackathon","Rare Disease AI Hackathon","Advance rare disease diagnosis using 
artificial intelligence (AI) models","Bring AI and medical experts together to build open source models for rare diseases. Create zero-barrier access to rare disease expertise for patients, researchers and physicians. Use AI to uncover novel links between rare diseases. Establish validation methods for medical AI models. Jumpstart an open source community for rare disease AI models. Launch models for Beta testing on Hypophosphatasia.ai and EhlersDanlos.ai.","","https://www.rarediseaseaihackathon.org/","active","intermediate","14","","2023-09-30","2024-01-15","2023-09-27 19:10:40","2023-10-24 15:56:45" +"265","cometh-benchmark","COMETH Benchmark","Quantify tumor heterogeneity—how many cell types are present and in which proportions within cancer samples.","Successful treatment of cancer is still a challenge and this is partly due to a wide heterogeneity of cancer composition across patient population. Unfortunately, accounting for such heterogeneity is very difficult. Clinical evaluation of tumor heterogeneity often requires the expertise of anatomical pathologists and radiologists. This benchmark is dedicated to the quantification of intra-tumor heterogeneity using appropriate statistical methods on cancer omics data. In particular, it focuses on estimating cell types and proportion in biological samples based on methylation and methylome data sets. The goal is to explore various statistical methods for source separation/deconvolution analysis (Non-negative Matrix Factorization, Surrogate Variable Analysis, Principal Component Analysis, Latent Factor Models, ...) 
using both RNA-seq and methylome data.","","https://www.codabench.org/competitions/218/","completed","intermediate","10","","2020-06-14","2020-12-29","2023-09-28 23:25:52","2023-10-10 19:47:14" +"266","the-miccai-2014-machine-learning-challenge","The MICCAI 2014 Machine Learning Challenge","Predict binary and continuous phenotypes from Structural Brain MRI Data in a benchmark study","Machine learning tools have been increasingly applied to structural brain magnetic resonance imaging (MRI) scans, largely for developing models to predict clinical phenotypes at the individual level. Despite significant methodological developments and novel application domains, there has been little effort to conduct benchmark studies with standardized datasets, which researchers can use to validate new tools, and more importantly conduct an objective comparison with state-of-the-art algorithms. The MICCAI 2014 Machine Learning Challenge (MLC) will take a significant step in this direction, where we will employ four separate, carefully compiled, and curated large-scale (each N > 70) structural brain MRI datasets with accompanying clinically relevant phenotypes. Our goal is to provide a snapshot of the current state of the art in the field of neuroimage-based prediction, and attract machine-learning practitioners to the MICCAI community and the field of medical image computing in g...","","https://competitions.codalab.org/competitions/1471","completed","intermediate","9","","2014-04-16","2014-06-14","2023-09-28 23:36:12","2023-10-19 23:31:50" +"267","cagi6-annotate-all-missense","CAGI6: Annotate All Missense","Predict the functional effect of every coding single nucleotide variant (SNV) in the human genome.","dbNSFP currently describes 81,782,923 possible protein-altering variants in the human genome. The challenge is to predict the functional effect of every such variant. 
For the vast majority of these missense and nonsense variants, the functional impact is not currently known, but experimental and clinical evidence is accruing rapidly. Rather than drawing upon a single discrete dataset as typical with CAGI, predictions will be assessed by comparing with experimental or clinical annotations made available after the prediction submission date, on an ongoing basis. If predictors assent, predictions will also be incorporated into dbNSFP.","","https://genomeinterpretation.org/cagi6-annotate-all-missense.html","completed","intermediate","1","","2021-06-01","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:13:42" +"268","cagi6-hmbs","CAGI6: HMBS","Submit predictions of the fitness score for each of the variants in the HMBS gene","Hydroxymethylbilane synthase (HMBS), also known as porphobilinogen deaminase (PBGD) or uroporphyrinogen I synthase, is an enzyme involved in heme production. In humans, variants that affect HMBS function result in acute intermittent porphyria (AIP), an autosomal dominant genetic disorder caused by a build-up of porphobilinogen in the cytoplasm. A large library of HMBS missense variants was assessed with respect to their effects on protein function using a high-throughput yeast complementation assay. The challenge is to predict the functional effects of these variants.","","https://genomeinterpretation.org/cagi6-hmbs.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:05" +"269","cagi6-id-panel","CAGI6: Intellectual Disability Panel","Analyze the sequence data for the Intellectual Disability Panel to identify causative variants","The objective in this challenge is to predict a patient's clinical phenotype and the causal variant(s) based on their gene panel sequences. 
Sequence data for 74 genes from a cohort of 500 patients with a range of neurodevelopmental presentations (intellectual disability, autistic spectrum disorder, epilepsy, microcephaly, macrocephaly, hypotonia, ataxia) has been made available for this challenge. Additional data from 150 patients from the same clinical study is available for training and validation.","","https://genomeinterpretation.org/cagi6-id-panel.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:09" +"270","cagi6-mapk1","CAGI6: MAPK1","Predict the ΔΔG(H2O) value for the MAPK1 protein, related to stability and catalytic efficiency","MAPK1 (ERK2) is active as serine/threonine kinase in the Ras-Raf-MEK-ERK signal transduction cascade that regulates cell proliferation, transcription, differentiation, and cell cycle progression. MAPK1 is activated by phosphorylation which occurs with strict specificity by MEK1/2 on Thr185 and Tyr187, and may also act as a transcriptional repressor independent of its kinase activity. A library of eleven missense variants selected from the COSMIC database was assessed by near and far-UV circular dichroism and intrinsic fluorescence spectra to determine thermodynamic stability at different concentrations of denaturant. These data were used to calculate a ΔΔGH20 value; i.e., the difference in unfolding free energy ΔGH20 between each variant and the wildtype protein, both in phosphorylated and unphosphorylated forms. 
The challenge is to predict these two ΔΔGH20 values and the catalytic efficiency (kcat/km)mut/(kcat/km)wt, as determined by a fluorescence assay, of the phosphorylated fo...","","https://genomeinterpretation.org/cagi6-mapk1.html","completed","intermediate","1","","2021-07-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:13" +"271","cagi6-mapk3","CAGI6: MAPK3","Predict the ΔΔG(H2O) value for the MAPK3 protein, related to stability and catalytic efficiency","MAPK3 (ERK1) is active as serine/threonine kinase in the Ras-Raf-MEK-ERK signal transduction cascade that regulates cell proliferation, transcription, differentiation, and cell cycle progression. MAPK3 is activated by phosphorylation which occurs with strict specificity by MEK1/2 on Thr202 and Tyr204, and may also act as a transcriptional repressor independent of its kinase activity. A library of twelve missense variants selected from the COSMIC database was assessed by near and far-UV circular dichroism and intrinsic fluorescence spectra to determine thermodynamic stability at different concentrations of denaturant. These data were used to calculate a ΔΔGH20 value; i.e., the difference in unfolding free energy ΔGH20 between each variant and the wildtype protein, both in phosphorylated and unphosphorylated forms. The challenge is to predict these two ΔΔGH20 values and the catalytic efficiency (kcat/km)mut/(kcat/km)wt, as determined by a fluorescence assay, of the phosphorylated fo...","","https://genomeinterpretation.org/cagi6-mapk3.html","completed","intermediate","1","","2021-08-04","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:15" +"272","cagi6-mthfr","CAGI6: MTHFR","Submit predictions of the fitness score for each missense variant in the MTHFR gene","Methylenetetrahydrofolate reductase (MTHFR) catalyzes the production of 5-methyltetrahydrofolate, which is needed for conversion of homocysteine to methionine. 
Humans with variants affecting MTHFR function present with a wide range of phenotypes, including homocystinuria, homocysteinemia, developmental delay, severe mental retardation, psychiatric disturbances, and late-onset neurodegenerative disorders. A further complication to interpretation of variants in this gene is a common variant, Ala222Val, carried by a large fraction of the human population. A large library of MTHFR missense variants was assessed with respect to their effects on protein function using a high-throughput yeast complementation assay. The challenge is to predict the functional effects of these variants in two different settings- for the wildtype protein, and for the protein with the common Ala222Val variant.","","https://genomeinterpretation.org/cagi6-mthfr.html","completed","intermediate","1","","2021-05-03","2021-06-30","2023-06-23 00:00:00","2023-10-12 18:12:18" +"273","cagi6-prs","CAGI6: Polygenic Risk Scores","Provide a fully trained prediction model that estimates polygenic risk scores (PRS) for complex diseases","Polygenic risk scores (PRS) have potential clinical utility for risk surveillance, prevention and personalized medicine. Participants will be provided with datasets of four real phenotypes (Type 2 Diabetes, Breast Cancer, Inflammatory Bowel Disease and Coronary Artery Disease) and of thirty simulated phenotypes representing a range of genetic architectures of common polygenic diseases. The challenge is to predict the disease outcomes of individuals in held-out validation cohorts.","","https://genomeinterpretation.org/cagi6-prs.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:23" +"274","cagi6-rgp","CAGI6: Rare Genomes Project","Identify causative variants in rare disease genomes for diagnosis","The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing for rare disease diagnosis and gene discovery. 
The study is led by genomics experts and clinicians at the Broad Institute of MIT and Harvard. Research subjects are consented for genomic sequencing and the sharing of their sequence and phenotype information with researchers working to understand the molecular causes of rare disease. When a candidate disease variant believed to be related to the phenotype is identified, the variant is confirmed with Sanger sequencing in a clinical setting and returned to the participant via his or her local physician. In this challenge, whole genome sequence data and phenotype data from a subset of the solved and unsolved RGP families will be provided. Participants in the challenge will try to identify the causative variant(s) in each case. For the unsolved cases, prioritized variants from the participating teams will be examined to see if ad...","","https://genomeinterpretation.org/cagi6-rgp.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:27" +"275","cagi6-invitae","CAGI6: Sherloc clinical classification","Over 122,000 coding variants will be predicted for clinical utility assessment and submission to ClinVar","Invitae is a genetic testing company that publishes their variant interpretations to ClinVar. In this challenge, over 122,000 previously uncharacterized variants are provided, spanning the range of effects seen in the clinic. Following the close of this challenge, Invitae will submit their interpretations for these variants to ClinVar. 
Predictors are asked to interpret the pathogenicity of these variants, and the clinical utility of predictions will be assessed across multiple categories by Invitae.","","https://genomeinterpretation.org/cagi6-invitae.html","completed","intermediate","1","","2021-07-08","2021-12-01","2023-06-23 00:00:00","2023-10-12 18:12:31" +"276","cagi6-splicing-vus","CAGI6: Splicing VUS","Predict whether the experimentally validated VUS disrupt splicing and contribute to genetic disorders","Variants causing aberrant splicing have been implicated in a range of common and rare disorders, including retinitis pigmentosa, autism spectrum disorder, amyotrophic lateral sclerosis, and a variety of cancers. However, such variants are frequently overlooked by diagnostic sequencing pipelines, leading to missed diagnoses for patients. Clinically ascertained variants of unknown significance underwent whole-blood based RT-PCR to test for impact on splicing. The challenge is to predict which of the tested variants disrupt splicing.","","https://genomeinterpretation.org/cagi6-splicing-vus.html","completed","intermediate","1","","2021-06-08","2021-10-11","2023-06-23 00:00:00","2023-10-12 18:12:34" +"277","cagi6-stk11","CAGI6: STK11","Submit predictions on the impact of the variants located in the STK11 gene associated with Peutz-Jeghers syndrome","Serine/Threonine Kinase 11 (STK11) is considered a master kinase that functions as a tumor suppressor and nutrient sensor within a heterotrimeric complex with pseudo-kinase STRAD-alpha and structural protein MO25. Germline variants resulting in loss of STK11 define Peutz-Jeghers Syndrome, an autosomal dominant cancer predisposition syndrome marked by gastrointestinal hamartomas and freckling of the oral mucosa. Somatic loss of function variants, both nonsense and missense, occur in 15-30% of non-small cell lung adenocarcinomas, where they correlate clinically with insensitivity to anti-PD1 monoclonal antibody therapy. 
The challenge is to predict the impact on STK11 function for each missense variant in relation to wildtype STK11.","","https://genomeinterpretation.org/cagi6-stk11.html","completed","intermediate","1","","2021-06-08","2021-09-01","2023-06-23 00:00:00","2023-10-12 18:12:38" +"278","qbi-hackathon","QBI hackathon","A 48-hour event connecting the Bay Area developer community with scientists to advance biomedical research","The QBI hackathon is a 48-hour event connecting the vibrant Bay Area developer community with the scientists from UCSF, UCB and UCSC, during which we work together on the cutting edge biomedical problems. Advances in computer vision, AI, and machine learning have enabled computers to pick out cat videos, recognize people’s faces from photos, play video games and drive cars. More recently, application of deep neural nets to protein structure prediction completely revolutionized the field. We look forward to seeing how far we can push science ahead when we apply these latest algorithms to biomedically relevant light microscopy, electron microscopy, and proteomics data. If you love FFTs, transformers, language models, topological data processing, or simply writing code, this is your chance to apply your skills to make an impact on global healthcare. 
Beyond the actual event, we hope to establish a better connection between talented developers and scientists in the Bay Area, so that we...","","https://www.eventbrite.com/e/qbi-hackathon-2023-tickets-633794304827?aff=oddtdtcreator","completed","intermediate","14","","2023-11-04","2023-11-05","2023-10-06 21:22:51","2023-10-26 23:23:36" +"279","niddk-central-repository-data-centric-challenge","NIDDK Central Repository Data-Centric Challenge","Enhance NIDDK datasets for future Artificial Intelligence (AI) applications","The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository (https://repository.niddk.nih.gov/home/) is conducting a Data Centric Challenge aimed at augmenting existing Repository data for future secondary research including data-driven discovery by artificial intelligence (AI) researchers. The NIDDK Central Repository (NIDDK-CR) program strives to increase the utilization and impact of the resources under its guardianship. However, lack of standardization and consistent metadata within and across studies limit the ability of secondary researchers to easily combine datasets from related studies to generate new insights using data science methods. In the fall of 2021, the NIDDK-CR began implementing approaches to augment data quality to improve AI-readiness by making research data FAIR (findable, accessible, interoperable, and reusable) via a small pilot project utilizing Natural Language Processing (NLP) to tag study variables. In 2022, the NIDD...","","https://www.challenge.gov/?challenge=niddk-central-repository-data-centric-challenge","completed","intermediate","14","","2023-09-20","2023-11-03","2023-10-18 16:58:17","2023-10-18 20:52:49" +"280","stanford-ribonanza-rna-folding","Stanford Ribonanza RNA Folding","Pioneering RNA Science: A Path to Programmable Medicine and Scientific Breakthroughs","Ribonucleic acid (RNA) is essential for most biological functions. 
A better understanding of how to manipulate RNA could help usher in an age of programmable medicine, including first cures for pancreatic cancer and Alzheimer’s disease as well as much-needed antibiotics and new biotechnology approaches for climate change. But first, researchers must better understand each RNA molecule's structure, an ideal problem for data science.","","https://www.kaggle.com/competitions/stanford-ribonanza-rna-folding","active","intermediate","8","","2023-08-23","2023-11-24","2023-10-23 20:58:06","2023-11-02 18:01:38" +"281","uls23","Universal Lesion Segmentation '23 Challenge","Revolutionizing Lesion Segmentation: Advancements, Challenges, and a Universal Solution Emerges","Significant advancements have been made in AI-based automatic segmentation models for tumours. Medical challenges focusing on e.g. liver, kidney, or lung tumours have resulted in large performance improvements for segmenting these types of lesions. However, in clinical practice there is a need for versatile and robust models capable of quickly segmenting the many possible lesions types in the thorax-abdomen area. Developing a Universal Lesion Segmentation (ULS) model that can handle this diversity of lesions types requires a well-curated and varied dataset. Whilst there has been previous work on ULS [6-8], most research in this field has made extensive use of a single partially annotated dataset [9], containing only the long- and short-axis diameters on a single axial slice. Furthermore, a test set containing 3D segmentation masks used during evaluation on this dataset by previous publications is not publicly available. 
For these reasons we are excited to host the ULS23 Challenge...","","https://uls23.grand-challenge.org/","active","intermediate","5","","2023-10-29","2024-03-17","2023-11-02 15:35:22","2023-11-02 18:02:48" From eb233ffd73510b8fc172e560cd59a770069fee0b Mon Sep 17 00:00:00 2001 From: verena <9377970+vpchung@users.noreply.github.com> Date: Mon, 6 Nov 2023 23:43:19 +0000 Subject: [PATCH 2/5] increase `headline` varchar limit in table creation --- .../src/main/resources/db/migration/V1.0.0__create_tables.sql | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/apps/openchallenges/challenge-service/src/main/resources/db/migration/V1.0.0__create_tables.sql b/apps/openchallenges/challenge-service/src/main/resources/db/migration/V1.0.0__create_tables.sql index fb88dc8ce3..1742c6984a 100644 --- a/apps/openchallenges/challenge-service/src/main/resources/db/migration/V1.0.0__create_tables.sql +++ b/apps/openchallenges/challenge-service/src/main/resources/db/migration/V1.0.0__create_tables.sql @@ -31,7 +31,7 @@ CREATE TABLE `challenge` `id` bigint(20) NOT NULL AUTO_INCREMENT, `slug` varchar(255) NOT NULL, `name` varchar(255) DEFAULT NULL, - `headline` varchar(80), + `headline` varchar(120), `description` varchar(1000) NOT NULL, `avatar_url` varchar(255), `website_url` varchar(255) NOT NULL, From 8e97fa6b40eca3b8d24621b2f1901b344816a7f1 Mon Sep 17 00:00:00 2001 From: verena <9377970+vpchung@users.noreply.github.com> Date: Mon, 6 Nov 2023 23:43:56 +0000 Subject: [PATCH 3/5] update API specs and docs --- .../openchallenges/api-description/build/challenge.openapi.yaml | 2 +- libs/openchallenges/api-description/build/openapi.yaml | 2 +- .../src/components/schemas/ChallengeHeadline.yaml | 2 +- 3 files changed, 3 insertions(+), 3 deletions(-) diff --git a/libs/openchallenges/api-description/build/challenge.openapi.yaml b/libs/openchallenges/api-description/build/challenge.openapi.yaml index 47e9d26e0e..e7c5613eca 100644 --- 
a/libs/openchallenges/api-description/build/challenge.openapi.yaml +++ b/libs/openchallenges/api-description/build/challenge.openapi.yaml @@ -384,7 +384,7 @@ components: description: The headline of the challenge. type: string minLength: 0 - maxLength: 80 + maxLength: 120 example: Example challenge headline ChallengeDescription: description: The description of the challenge. diff --git a/libs/openchallenges/api-description/build/openapi.yaml b/libs/openchallenges/api-description/build/openapi.yaml index e8700d7be9..51f63bbb1c 100644 --- a/libs/openchallenges/api-description/build/openapi.yaml +++ b/libs/openchallenges/api-description/build/openapi.yaml @@ -536,7 +536,7 @@ components: description: The headline of the challenge. type: string minLength: 0 - maxLength: 80 + maxLength: 120 example: Example challenge headline ChallengeDescription: description: The description of the challenge. diff --git a/libs/openchallenges/api-description/src/components/schemas/ChallengeHeadline.yaml b/libs/openchallenges/api-description/src/components/schemas/ChallengeHeadline.yaml index b7c9acf6cb..3f90605cea 100644 --- a/libs/openchallenges/api-description/src/components/schemas/ChallengeHeadline.yaml +++ b/libs/openchallenges/api-description/src/components/schemas/ChallengeHeadline.yaml @@ -1,5 +1,5 @@ description: The headline of the challenge. 
type: string minLength: 0 -maxLength: 80 +maxLength: 120 example: Example challenge headline From 9afa82a870331e93684b16cb54a783a9d8829a34 Mon Sep 17 00:00:00 2001 From: verena <9377970+vpchung@users.noreply.github.com> Date: Tue, 7 Nov 2023 00:47:44 +0000 Subject: [PATCH 4/5] clamp headline at 3 lines instead of 2 --- libs/openchallenges/styles/src/lib/_general.scss | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/libs/openchallenges/styles/src/lib/_general.scss b/libs/openchallenges/styles/src/lib/_general.scss index 581ced70f1..655092d8f4 100644 --- a/libs/openchallenges/styles/src/lib/_general.scss +++ b/libs/openchallenges/styles/src/lib/_general.scss @@ -281,7 +281,6 @@ table { } .card-banner { width: 100%; - height: 130px; display: flex; align-items: flex-start; } @@ -297,7 +296,7 @@ table { @include line-clamp(2); } .mat-caption { - @include line-clamp(2); + @include line-clamp(3); } .card-body { width: 100%; From d6ca0203af0963dd872e385d58d9511231591a9f Mon Sep 17 00:00:00 2001 From: verena <9377970+vpchung@users.noreply.github.com> Date: Tue, 7 Nov 2023 00:48:15 +0000 Subject: [PATCH 5/5] decrease height of banner on card --- .../assets/src/assets/images/banner-default.svg | 2 +- .../ui/src/lib/challenge-card/challenge-card.component.scss | 4 ++-- 2 files changed, 3 insertions(+), 3 deletions(-) diff --git a/libs/openchallenges/assets/src/assets/images/banner-default.svg b/libs/openchallenges/assets/src/assets/images/banner-default.svg index 88370e229f..1921255e69 100644 --- a/libs/openchallenges/assets/src/assets/images/banner-default.svg +++ b/libs/openchallenges/assets/src/assets/images/banner-default.svg @@ -1,4 +1,4 @@ - + diff --git a/libs/openchallenges/ui/src/lib/challenge-card/challenge-card.component.scss b/libs/openchallenges/ui/src/lib/challenge-card/challenge-card.component.scss index 6a43e1a748..967d00de6a 100644 --- a/libs/openchallenges/ui/src/lib/challenge-card/challenge-card.component.scss +++ 
b/libs/openchallenges/ui/src/lib/challenge-card/challenge-card.component.scss @@ -16,7 +16,7 @@ min-height: 190px; } .card-banner { - height: 100%; + height: 57px; } .card-status { div:first-child { @@ -36,5 +36,5 @@ display: flex; align-items: center; flex: 1; - @include general.line-clamp(2); + @include general.line-clamp(1); }