CONVOCALS: a CONVOlutional neural network to predict symptoms and major secondary CArdiovascuLar events based on high-resolution scanned histological Slides.
Francesco Cisternino5*, Yipei Song1*, Gert Jan de Borst2, Joost Mekke2, Barend Mol2, Dominique P.V. de Kleijn2, Gerard Pasterkamp3, Aryan Vink4, Sander W. van der Laan3*, Clint L. Miller1*, Craig Glastonbury5*. * Authors contributed equally.
1) Center for Public Health Genomics, Department of Public Health Sciences, Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA. 2) Department of Vascular Surgery, Division Surgical Specialties, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 3) Central Diagnostics Laboratory, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 4) Department of Pathology, Division Laboratories, Pharmacy, and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands. 5) Human Technopole, Viale Rita Levi-Montalcini, 1, 20157, Milano, Italy.
Despite tremendous medical progress, cardiovascular diseases (CVD) are still topping global charts of morbidity and mortality. Atherosclerosis is the major underlying cause of CVD and results in atherosclerotic plaque formation. The extent and type of atherosclerosis is manually assessed through histological analysis, and histological characteristics are linked to major acute cardiovascular events (MACE). However, conventional means of assessing plaque characteristics suffer major limitations directly impacting their predictive power. CONVOCALS will use a machine learning technique, convolutional neural network (CNN), to develop an internal representation of the 2-dimensional plaque images, allowing the model to learn position and scale in variant structures in the data. A CNN is a subset of deep learning which has established as a powerful class of models for image recognition problems such as analysis of x-ray medical images. The aim of CONVOCALS is to build a CNN to process high-resolution images from scanned histological slides of plaques in order to predict MACE.
We will use data from the Athero-Express Biobank Study (AE){target="_blank"} comprising ±2,500 carotid endarterectomy patients of whom extensive clinical data (demographic, lifestyle, laboratory, medical history, and medication) as well as plaques are collected. At two Dutch tertiary referral centers patients are included that underwent endarterectomy; details of the study design were described before{target="_blank"}. Briefly, blood and plaque material were obtained during endarterectomy and stored at -80 ℃. Only carotid endarterectomy (CEA) patients were included in the present study. All patients provided informed consent and the study was approved by the medical ethics committee.
All plaques are histological assessed using 9 different standardized protocols for CD34, CD66b, CD68, SMA, elastin, hematoxylin, picro-sirius red, fibrin, glycophorin C and scanned at high-resolution into .ndpi
or .TIF
whole-slide images (WSI). For CONVOCALS we will use all the available data, i.e. ± 22,500 images and clinical data, to build a CNN using advanced computer algorithms as implemented in Python and classify patients based on 1) symptoms, and 2) MACE.
You can load this project in RStudio by opening the file called 'CONVOCALS.Rproj'.
File | Description | Usage |
---|---|---|
README.md | Description of project | Human editable |
CONVOCALS.Rproj | Project file | Loads project |
LICENSE | User permissions | Read only |
.worcs | WORCS metadata YAML | Read only |
renv.lock | Reproducible R environment | Read only |
images | Images used in readme, etc | Human editable |
scripts | Script to process data | Human editable |
This project uses the Workflow for Open Reproducible Code in Science (WORCS) to ensure transparency and reproducibility. The workflow is designed to meet the principles of Open Science throughout a research project.
To learn how WORCS helps researchers meet the TOP-guidelines and FAIR principles, read the preprint at https://osf.io/zcvbs/
- To get started with
worcs
, see the setup vignette - For detailed information about the steps of the WORCS workflow, see the workflow vignette
Please refer to the vignette on reproducing a WORCS project for step by step advice.
Do you have burning questions or do you want to discuss usage with other users? Do you want to report an issue? Or do you have an idea for improvement or adding new features to our method and tool? Please use the Issues tab.
Using our EntropyMasker
method? Please cite our work:
An automatic entropy method to efficiently mask histology whole-slide images
Yipei Song, Francesco Cisternino, Joost Mekke, Gert Jan de Borst, Dominique P.V. de Kleijn, Gerard Pasterkamp, Aryan Vink, Craig Glastonbury, Sander W. van der Laan, Clint L. Miller.
medRxiv 2022.09.01.22279487; doi: https://doi.org/10.1101/2022.09.01.22279487
The whole-slide images used in this project are available through a DataverseNL repository. There are restrictions on use by commercial parties, and on sharing openly based on (inter)national laws, regulations and the written informed consent. Therefore these data (and additional clinical data) are only available upon discussion and signing a Data Sharing Agreement (see Terms of Access) and within a specially designed UMC Utrecht provided environment.
We are thankful for the support of the Netherlands CardioVascular Research Initiative of the Netherlands Heart Foundation (CVON 2011/B019 and CVON 2017-20: Generating the best evidence-based pharmaceutical targets for atherosclerosis [GENIUS I&II]), the ERA-CVD program 'druggable-MI-targets' (grant number: 01KL1802), and the Leducq Fondation 'PlaqOmics'.
Funding for this research was provided by National Institutes of Health (NIH) grant nos. R00HL125912 and R01HL14823 (to Clint L. Miller), and a Leducq Foundation Transatlantic Network of Excellence ('PlaqOmics') grant no. 18CVD02 (to Dr. Clint L. Miller and Dr. Sander W. van der Laan), and EU H2020 TO_AITION grant no. 848146 (to Dr. Sander W. van der Laan).
Dr. Sander W. van der Laan has received Roche funding for unrelated work.
Dr Craig A. Glastonbury has stock options in BenevolentAI and is a paid consultant for BenevolentAI, unrelated to this work.
Plaque samples are derived from arterial endarterectomies as part of the Athero-Express Biobank Study which is an ongoing study in the UMC Utrecht. We would like to thank all the (former) employees involved in the Athero-Express Biobank Study of the Departments of Surgery of the St. Antonius Hospital Nieuwegein and University Medical Center Utrecht for their continuing work. In particular we would like to thank (in no particular order) Marijke Linschoten, Arjan Samani, Petra H. Homoed-van der Kraak, Tim Bezemer, Tim van de Kerkhof, Joyce Vrijenhoek, Evelyn Velema, Ben van Middelaar, Sander Reukema, Robin Reijers, Joëlle van Bennekom, and Bas Nelissen. Lastly, we would like to thank all participants of the Athero-Express Biobank Study; without you these studies would not be possible.
The framework was based on the WORCS
package.
Version: v1.0.1
Last update: 2022-10-04
Written by: Francesco Cisternino; Craig Glastonbury; Sander W. van der Laan; Clint L. Miller; Yipei Song.
Description: CONVOCALS repository: classification of atherosclerotic histological whole-slide images
Minimum requirements: R version 3.4.3 (2017-06-30) -- 'Single Candle', Mac OS X El Capitan
**MoSCoW To-Do List**
The things we Must, Should, Could, and Would have given the time we have.
_M_
_S_
_C_
_W_
Changes log
* v1.0.1 Updates and re-organization.
* v1.0.0 Initial version.
Copyright (c) 2022 Francesco Cisternino | Craig Glastonbury | Sander W. van der Laan | Clint L. Miller | Yipei Song
This is a human-readable summary of (and not a substitute for) the license.
You are free to share, copy and redistribute the material in any medium or format. The licencor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- NonCommercial — You may not use the material for commercial purposes.
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation.
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.