Abstract
Standard clinical interpretation of myocardial perfusion imaging (MPI) has proven prognostic value for predicting major adverse cardiovascular events (MACE). However, personalizing predictions to a specific event type and time interval is more challenging. We demonstrate an explainable deep learning model that predicts the time-specific risk separately for all-cause death, acute coronary syndrome (ACS), and revascularization directly from MPI and 15 clinical features. We train and test the model internally using 10-fold hold-out cross-validation (n = 20,418) and externally validate it at three separate sites (n = 13,988) with MACE follow-up for a median of 3.1 years (interquartile range [IQR]: 1.6, 3.6). We evaluate the model using the cumulative dynamic area under the receiver operating characteristic curve (cAUC). The best model performance in the external cohort is observed for short-term prediction – in the first six months after the scan, mean cAUC for ACS and all-cause death reaches 0.76 (95% confidence interval [CI]: 0.75, 0.77) and 0.78 (95% CI: 0.78, 0.79), respectively. The model outperforms conventional perfusion abnormality measures at all time points for the prediction of death in both internal and external validations, with the improvement increasing gradually over time. Individualized patient explanations are visualized using waterfall plots, which highlight the degree and direction of each feature's contribution. This approach allows the derivation of individual event probability as a function of time, as well as patient- and event-specific risk explanations that may help draw attention to modifiable risk factors. Such a method could help present post-scan risk assessments to the patient and foster shared decision-making.
Introduction
Through recent advances, artificial intelligence (AI) has established an important new paradigm in medical image analysis, potentially enhancing prognostic applications from all cardiovascular imaging modalities. However, there are challenges in how risks are conveyed to both physicians and patients to facilitate the best and most appropriate preventative strategies. A single metric of all risks combined is perhaps less useful than a diverse map of individual risks, their timeline, and influencing factors1.
Myocardial perfusion imaging (MPI) is a well-established technique for diagnosing coronary artery disease. Although the primary purpose of MPI is the assessment of flow-limiting coronary artery disease, it is often used for risk stratification2. Prognostic risk assessment has been based on the distribution and burden of ischemia detected, usually combined with a composite score of clinical risk factors or inferred from an expert clinical impression. While this has proven to be a generally successful model from a statistical viewpoint, it is arguably relatively crude at the individual patient level. In the status quo, although a patient may be informed that they are at high risk for an adverse event, they are left with little information about what type of event, or within what timeframe, can be anticipated, and this can be more unnerving than productive. While researchers have successfully used AI to facilitate, quantify, and automate several aspects of the conventional imaging workflow for diagnosing disease3, the efforts applied to prognostic interpretation lack time-specific or event-specific prediction4. To date, proposed predictive AI models for cardiovascular image interpretation do not differentiate between the possible adverse events and oversimplify the predicted risk to a single numeric value despite the richness and depth of the source data5. Compounding this issue is the significant heterogeneity of the definitions of composite end-points, such as major adverse cardiovascular events (MACE), in prior clinical studies6.
To date, no methods have been established to predict time-dependent risks of specific event types (such as death or myocardial infarction) from a single model after cardiovascular imaging. In this study, we aimed to create a deep learning model capable of predicting patient- and event-specific risk over time directly from combined cardiac perfusion image and clinical data. We also describe methods for visual explanation of these predicted risks over time, which can be presented to physicians and patients. This could be applied to patient care by presenting patient-individualized survival curves for specific events and explaining the contribution of risk factors to event risk, potentially leading to greater patient engagement and tailoring of therapy for the prevention of adverse events. An overview of the study is presented in Fig. 1.
Results
Population characteristics
The training and internal testing cohort included 20,401 patients followed up for MACE for a median of 4.4 years (interquartile range [IQR]: 3.4, 5.7). All-cause death was observed in 1,396 patients (6.8%) and the median time to death was 2.3 years (IQR: 1.1, 3.7). ACS was observed in 657 patients (3.2%) and the median time to that event was 1.6 years (IQR: 0.6, 3.0). Revascularization was observed in 1,485 cases (7.3%) and the median time to revascularization was 0.6 years (IQR: 0.1, 2.3). A summary of the clinical characteristics of the derivation cohort is shown in Table 1.
The external testing set included 13,988 patients followed up for MACE for a median of 3.1 years (IQR: 1.6, 3.6). All-cause death was observed in 683 patients (5%) and occurred after a median of 1.5 years (IQR: 0.6, 2.5) from the scan. ACS was observed in 361 patients (2.5%) after a median of 1.3 years from the scan (IQR: 0.5, 2.3), and 918 patients (6.6%) underwent revascularization after a median of 0.1 years from baseline imaging (IQR: 0.03, 1.3). A summary of the clinical characteristics of the external cohort is shown in Table 2.
Internal testing
We present the cumulative dynamic area under the receiver operating characteristic curve (cAUC) for the prediction of any event, as well as each of the separate events, in Fig. 2. The best performance for ACS and all-cause death was observed for short-term prediction – in the first six months after the scan, mean cAUC for ACS and all-cause death reached 0.78 (95% confidence interval [CI]: 0.77, 0.79) and 0.86 (95% CI: 0.85, 0.87), respectively. For revascularization, the initially high cAUC declined after the first year but reached its peak in long-term observation – mean cAUC in the fifth year of follow-up was 0.84 (95% CI: 0.82, 0.85). While the perfusion abnormality measure maintained a high cAUC over short-term observation for the prediction of ACS, revascularization, or any event, its cAUC decreased over time faster than that of our model. The model had superior cAUC to perfusion abnormality for death prediction at all time points, as shown in Table 3 (internal testing set) and Table 4 (external testing set).
Time-dependent concordances for the prediction of each separate event are shown in Supplementary Table 1. The concordances were higher for all-cause death than for ACS or revascularization.
External testing
Similar to internal testing, the model preserved a relatively constant cAUC within 3 years from the baseline scan, while for perfusion abnormality the cAUC declined gradually over time (Fig. 3). For each of the events, the best performance was observed for short-term prediction – in the first six months after the scan, mean cAUC for ACS and all-cause death reached 0.76 (95% CI: 0.75, 0.76) and 0.78 (95% CI: 0.78, 0.79), respectively. Our model outperformed perfusion abnormality at all time points for the prediction of death. Time-dependent concordance for the prediction of each event in the external set is presented in Supplementary Table 1.
The time-to-event model outperformed both perfusion abnormality and the clinical-only model in the external testing set at 1 and 3 years from scan. The area under the receiver operating characteristic curve (AUC) for any MACE at 1 year from the scan was 0.74 (95% CI: 0.73, 0.76) for the time-to-event model, 0.71 (95% CI: 0.70, 0.73) for perfusion abnormality, and 0.68 (95% CI: 0.66, 0.69) for the clinical-only model. At 3 years from scan, the AUC for any MACE was 0.73 (95% CI: 0.72, 0.74) for the time-to-event model, 0.69 (95% CI: 0.67, 0.70) for perfusion abnormality, and 0.69 (95% CI: 0.68, 0.71) for the clinical-only model. The time-to-event model performed better than perfusion abnormality at 1 and 3 years from scan in the prediction of death or ACS, and better than the clinical-only model in the prediction of revascularization. A detailed comparison of AUC values for all events for the time-to-event model compared with perfusion abnormality and the clinical-only model is shown in Table 3. Receiver operating characteristic (ROC) plots for the prediction of death, ACS, and revascularization at 1, 3, and 5 years in the external testing sets using the time-to-event model are shown in Supplementary Fig. 1. A sensitivity analysis in which revascularization events within 180 days from the scan were removed showed no significant effect on the prediction of the other events or the composite MACE outcome, but decreased performance for the prediction of revascularization (Supplementary Table 2). The time-to-event model also outperformed the multivariable Cox regression model in the prediction of all types of events (Supplementary Table 3).
Individual prediction and explanation
Examples of individualized predictions for four patients who experienced different types of outcomes during follow-up are shown in Fig. 4. The prediction of our model is presented as three cumulative incidence functions – one for each type of event. The individual prediction plot is accompanied by a waterfall plot (Fig. 5) – an explanation of the highest predicted risk that highlights how the polar maps and clinical features contribute to the overall risk. The waterfall plot visualizes both the extent of influence (length of the arrow) and its direction (increasing the risk of the event – red arrow pointing to the right; decreasing the risk – blue arrow pointing to the left). In the presented case, a 41-year-old female with heart failure with moderately reduced ejection fraction and moderate perfusion deficits is identified as having a high risk of death. The explanation of the prediction identifies the elevated resting heart rate as one of the factors contributing most to the elevated risk. Simulating a reduction of the resting heart rate to 70/min shows that optimal guideline-directed management could lower the average predicted risk of death by 36%. Inference using our model took less than 12 milliseconds per patient case on an Apple MacBook Pro laptop computer.
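For illustration, such a counterfactual simulation can be sketched as follows. This is a minimal sketch rather than the exact implementation used in the study; it assumes a trained network that returns the 3 × 131 grid of event- and time-specific probabilities described in Methods, and the index of the resting heart rate feature is hypothetical.

```python
import torch

def simulate_heart_rate_reduction(model, polar_maps, clinical, hr_index, target_hr=70.0):
    """Hypothetical helper: relative reduction in the predicted cumulative risk
    of death after setting the resting heart rate feature to `target_hr`."""
    with torch.no_grad():
        baseline = model(polar_maps, clinical)[:, 0, :].sum(dim=1)        # event index 0 = death (assumption)
        modified_clinical = clinical.clone()
        modified_clinical[:, hr_index] = target_hr                        # counterfactual heart rate
        modified = model(polar_maps, modified_clinical)[:, 0, :].sum(dim=1)
    return float(1.0 - modified.mean() / baseline.mean())
```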
Discussion
Leveraging a large cardiac imaging registry, we developed a deep learning approach for individual risk computation that allows time-dependent and event-specific predictions jointly from clinical and cardiac imaging data. We obtain time- and event-specific risk estimates and provide visually intuitive graphs for individual risk explanations. The model provides risk estimates over time, separately for all-cause death, ACS, and late revascularization, with easy-to-understand patient-level explanations. We evaluated our model in a large, multi-site external dataset as well as with internal 10-fold cross-validation. Good performance in the external testing set points to the ability of our model to generalize to unseen real-life data from new centers. The model relies on the combined predictive potential of the clinical features, stress test data, and direct image analysis, similarly to the way clinicians try to integrate all available information to provide the most accurate study interpretation. Moreover, this approach also leverages time-to-event data to provide more robust risk estimation over time, which could potentially be applied to a broad range of AI tasks.
Previous prognostic studies estimated risk jointly using composite adverse events and without the use of time-to-event data5. A high risk of death in the next year is a very different scenario than a risk of hospital admission or revascularization over 10 years – but our current presentation and assessment of data lack this granularity. Recently, non-linear AI survival models have demonstrated practical implementations in healthcare7,8,9 with state-of-the-art performance comparable to or better than that of traditional Cox proportional hazards models10. Examples of such models include precision genomic prognostication in patients receiving cancer treatment11, prediction of oral cancer survival12, and prediction of the progression of potentially malignant disorders to cancer13. A large-scale, multi-site study investigated the use of deep neural networks trained on full electronic health record data to predict multiple medical events14. This is, however, to our knowledge, the first study to evaluate prediction at multiple time points of multiple events in a large multi-site registry of cardiovascular imaging data that also explicitly takes advantage of time-to-event data during model training.
Patient-level explanation may be crucial for the clinical adoption of AI in medical imaging15,16, but this approach has not previously been applied as a joint explanation of direct imaging data and clinical variables for individualized risks of specific events. Such explanations may point to abnormalities in the imaging data as well as to clinical features that drive the increased risk for a given adverse event, potentially allowing for a more comprehensive assessment of the patient's condition. Event-specific predictions within a single model can be presented at the time of review of imaging and may enable physicians to practice precision medicine, with individually tailored treatments and preventive measures. For instance, a prediction of a high risk of all-cause death could encourage more frequent follow-up visits and additional diagnostic tests, while a high risk of ACS and revascularization could indicate that the patient is a candidate for revascularization or needs intensification of medical therapy.
In addition to informing the physician about the rationale behind model predictions, the visualization of factors contributing to increased risk of adverse events might serve as a powerful tool in shared decision-making after the exam, utilizing all available information17. When discussed with the patient, special focus might be given to modifiable risk factors such as high BMI18, hypertension19, diabetes, and dyslipidemia20, leading to optimal, goal-directed medical therapy of these risk factors. That could be a starting point for a discussion on how these factors can be targeted through lifestyle modifications and medications. Such an approach could be an important step towards patient empowerment and could improve adherence to physicians' recommendations. However, it is important to acknowledge the limitations of SHapley Additive exPlanations (SHAP)-derived feature importance21, in particular that it does not imply causal relations between the input features and the outcome. For this reason, the waterfall plots (Fig. 5) generated from SHAP values should be considered an illustrative tool and interpreted with caution.
Interestingly, in the external testing set, we found that the perfusion abnormality variable had lower performance than the clinical-only time-to-event model in predicting all-cause mortality at any time point. This confirms that clinical features, such as age and medical comorbidities, are important determinants of all-cause mortality and have previously been shown to influence the “warranty period” of normal perfusion on MPI22. Additionally, myocardial perfusion may change in response to anti-anginal therapies and thus would not be expected to be an accurate predictor of long-term hard outcomes23. However, unsurprisingly, the revascularization prediction performance was similar for the perfusion abnormality and the full time-to-event model, and higher than for the clinical-only time-to-event model. This is expected because physicians may rely on perfusion information when making revascularization decisions24,25, which could lead to overestimation of its prediction performance for revascularization.
Our study has several limitations. First, we assessed only all-cause mortality and could, therefore, not differentiate between cardiac and non-cardiac deaths. We separately considered the major cardiovascular events of ACS and revascularization; however, other possible events such as atrial fibrillation, worsening of heart failure, or sub-classification of ACS (presence of ST-elevation) were not available for analysis due to the multi-site, retrospective nature of the imaging registry. In our model, the risk of death and ACS is estimated independently from the risk of revascularization, allowing for event-specific assessment of the patient's prognosis, but it should be noted that an increased risk of ACS and revascularization would lead physicians to consider the same preventive strategies. Furthermore, while the dynamic cAUC of the standard quantitative perfusion analysis decreases over time for revascularization prediction, our model maintains a higher cAUC for this prediction in the long term. The performance of our model could be further improved by utilizing data from other imaging modalities26. For instance, computed tomography attenuation correction scans could be used to automatically calculate the calcium score, which could be included in the time-to-event model27. Finally, the usefulness of the time-to-event predictions has not been evaluated in prospective studies. This is understandable given the novelty of the proposed methods. Further investigation is needed to assess whether the additional temporal dimension of the model's prediction and its ability to differentiate the risk of specific events can improve physicians' workflows and lead to better clinical decisions.
The proposed deep learning model, using cardiac perfusion images and clinical data with time-to-event-specific outcomes, provides a robust prediction of the risk of all-cause death, ACS, and revascularization. The model significantly improved the prediction of all-cause death and the composite MACE outcome, while also improving the prediction of ACS in the external testing population. By presenting the individualized, patient-specific post-scan risk assessment over time in a manner that is intuitive for clinicians and patients, our approach can potentially help better address patient risk and guide management tailored to the patient's individual risk profile.
Methods
Patient populations
For the training and internal validation, we included 20,418 scans from five international centers participating in the prospective, multi-site Registry of Fast Myocardial Perfusion Imaging with Next generation SPECT (REFINE SPECT)28. We included all consecutive patients who underwent clinically indicated SPECT MPI from 2009 to 2014. We excluded 17 patients without gated studies, leaving a total of 20,401 patients.
Definition of events
Patients were followed for MACE, defined as all-cause death, myocardial infarction, unstable angina, or revascularization (surgical or percutaneous). Non-fatal myocardial infarction was defined as hospitalization for cardiac chest pain or an anginal equivalent with positive cardiac biomarkers29. Unstable angina was defined as recent-onset or escalating cardiac chest pain with negative cardiac biomarkers. All outcomes were adjudicated by experienced cardiologists after considering all available clinical data. We chose three outcomes as events of interest: death; acute coronary syndrome (ACS), defined as either non-fatal myocardial infarction or admission for unstable angina; and revascularization (with percutaneous coronary intervention or coronary artery bypass grafting). For each patient, only the first occurring event was considered; therefore, each patient had either one of the three events or no events. If a patient presented with ACS and had revascularization on the same day, that event was considered an ACS. If a patient had either ACS or revascularization and died on the same day, that event was considered a death. For area under the receiver operating characteristic curve (AUC) analyses, events that occurred up to a given time point were considered positive, and patients whose event occurred after the specified time point were considered event-free.
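For illustration, the labelling rule used for the time-point AUC analysis can be expressed as a short sketch (the numeric event coding below is a toy assumption):

```python
import numpy as np

def labels_at_timepoint(event_type, time_to_event, target_event, horizon_years):
    """Toy coding: event_type 0 = no event, 1 = death, 2 = ACS, 3 = revascularization.
    A patient is positive if the target event occurred on or before the time point;
    otherwise the patient is treated as event-free for that analysis."""
    event_type = np.asarray(event_type)
    time_to_event = np.asarray(time_to_event, dtype=float)
    return ((event_type == target_event) & (time_to_event <= horizon_years)).astype(int)

# Example: 1-year labels for ACS (target_event = 2) -> array([0, 1, 0, 0])
labels = labels_at_timepoint([0, 2, 2, 1], [5.0, 0.4, 2.1, 0.9], target_event=2, horizon_years=1.0)
```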
External cohort
The external testing population included an additional 13,988 patients who underwent clinically indicated SPECT MPI with MACE follow-up at three separate external centers: Oklahoma Heart Hospital (n = 6034), University of Calgary Hospital (n = 2985), and Yale New Haven Hospital (n = 4969). All outcomes were adjudicated using the same criteria as in the training cohort.
Image collection
Patients were imaged with either a DSPECT (Spectrum-Dynamics, Caesarea, Israel), GE Discovery NM 530c, or NM/CT570c (GE Healthcare, Haifa, Israel) camera system. Patients underwent either symptom-limited exercise testing or pharmacologic stress. Additional details regarding imaging protocols and acquisition have been previously described28.
After anonymization, all images were transferred to Cedars-Sinai Medical Center, where quality control was performed by experienced core laboratory technologists without knowledge of the clinical data. Left ventricular (LV) myocardial contours were computed and verified by an experienced nuclear medicine technologist using standard clinical software30. Polar maps of the LV, representing a compressed form of the images, were automatically generated. Five polar maps were derived for each patient: perfusion, motion, thickening, cardiac phase, and amplitude. Clinical data and images from the external centers were de-identified and transferred to Cedars-Sinai. This study complies with the Declaration of Helsinki.
Ethical approvals
The institutional review boards at Cedars-Sinai and the participating sites approved the collection of data for the registry: Cedars-Sinai Institutional Review Board, PeaceHealth System Institutional Review Board (Oregon Heart and Vascular Institute), Ottawa Health Science Network Research Ethics Board, Partners Human Research Committee (Brigham and Women's Hospital), Assuta Medical Centers Ethics Committee, Western Institutional Review Board (Oklahoma Heart Hospital Research Foundation), Conjoint Health Research Ethics Board of the University of Calgary, and Yale University Institutional Review Board. Informed, written consent was obtained from the subjects (or their legally authorized representative) at Cedars-Sinai Medical Center and Brigham and Women's Hospital. In the remaining sites, a waiver of consent was granted by the respective local institutional review boards.
Clinical features
Clinical and stress test results were collected according to the protocol of the REFINE SPECT registry. All the clinical features used in the model are listed in Supplementary Table 4.
Design and training of the event-specific deep learning network
We employed a deep-learning-based approach capable of learning the distribution of event ‘hitting times’ directly from data. We extended the DeepHit architecture and associated loss function31 and implemented it with the PyTorch framework32. To allow the network to process images, we added convolutional layers capable of directly interrogating perfusion, motion, wall thickening, and phase polar maps and combining the imaging data with 15 clinical features that were chosen based on our previous work on the minimum set of variables for machine learning cardiovascular event prediction33. The network consists of two main parts:
1. A convolutional part that processes the 28 × 36 × 5 input of five normalized polar maps and consists of two convolution blocks, each with 3 × 3 convolution kernels and batch normalization, dropout, and Leaky Rectified Linear Unit (ReLU) layers, which were added to prevent overfitting.
2. A fully connected part, in which the output of the convolutional part is passed to a fully connected layer with 512 nodes and the 15 clinical features are passed to a separate fully connected layer with 32 nodes. The outputs of these layers are concatenated and passed to the DeepHit network described by Lee et al.31, with 256 nodes in a single shared layer and 256 nodes in each of the three event-specific layers.
The output of the model is a 2-D array of shape 3 × 131, representing the probabilities of each of the events occurring at time 0 and at every 30 days up to the maximum follow-up time. We used the loss functions proposed by Lee et al.31 with the modification by Kvamme et al.32.
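A minimal PyTorch sketch of the architecture described above is shown below. It is illustrative only: hyperparameters that are not stated in the text (convolution channel counts, dropout rate) are assumptions, and the DeepHit loss functions31,32 are not reproduced here.

```python
import torch
import torch.nn as nn

class TimeToEventNet(nn.Module):
    """Illustrative sketch; unspecified hyperparameters are assumptions."""

    def __init__(self, n_clinical=15, n_events=3, n_time_bins=131):
        super().__init__()
        # Convolutional branch: two blocks of 3x3 convolution, batch normalization,
        # dropout, and Leaky ReLU applied to the 5-channel 28x36 polar-map input.
        self.cnn = nn.Sequential(
            nn.Conv2d(5, 32, kernel_size=3, padding=1),     # channel count assumed
            nn.BatchNorm2d(32), nn.Dropout2d(0.2), nn.LeakyReLU(),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),    # channel count assumed
            nn.BatchNorm2d(64), nn.Dropout2d(0.2), nn.LeakyReLU(),
            nn.Flatten(),
            nn.Linear(64 * 28 * 36, 512), nn.LeakyReLU(),   # 512-node image layer
        )
        # Clinical branch: 15 features passed through a 32-node fully connected layer.
        self.clinical = nn.Sequential(nn.Linear(n_clinical, 32), nn.LeakyReLU())
        # DeepHit-style head: a single 256-node shared layer and one 256-node
        # event-specific branch per outcome.
        self.shared = nn.Sequential(nn.Linear(512 + 32, 256), nn.LeakyReLU())
        self.event_heads = nn.ModuleList(
            nn.Sequential(nn.Linear(256, 256), nn.LeakyReLU(), nn.Linear(256, n_time_bins))
            for _ in range(n_events)
        )

    def forward(self, polar_maps, clinical):
        x = torch.cat([self.cnn(polar_maps), self.clinical(clinical)], dim=1)
        x = self.shared(x)
        logits = torch.stack([head(x) for head in self.event_heads], dim=1)
        # A joint softmax over all event/time bins (as in DeepHit) yields the
        # 3 x 131 grid of event- and time-specific probabilities per patient.
        probs = torch.softmax(logits.flatten(start_dim=1), dim=1)
        return probs.view_as(logits)

model = TimeToEventNet()
probabilities = model(torch.zeros(2, 5, 28, 36), torch.zeros(2, 15))   # shape: (2, 3, 131)
```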
Missing values (Supplementary Table 5) were imputed using mean values (or mode values for categorical features) from the training set. This method was previously shown to perform similarly to other data imputation techniques34.
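A minimal sketch of this imputation scheme, using toy data and the scikit-learn SimpleImputer (the actual implementation may differ), is:

```python
import pandas as pd
from sklearn.impute import SimpleImputer

# Toy data: continuous features are imputed with the training-set mean,
# categorical features with the training-set mode (most frequent value).
train = pd.DataFrame({"age": [64.0, None, 71.0], "diabetes": [1, 0, None]})
test = pd.DataFrame({"age": [None, 58.0], "diabetes": [1, None]})

mean_imputer = SimpleImputer(strategy="mean").fit(train[["age"]])
mode_imputer = SimpleImputer(strategy="most_frequent").fit(train[["diabetes"]])

test[["age"]] = mean_imputer.transform(test[["age"]])            # uses the training mean
test[["diabetes"]] = mode_imputer.transform(test[["diabetes"]])  # uses the training mode
```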
An important aspect of the architecture is its ability to handle multiple competing events35 and to generate predictions separately for each of them. The resulting architecture is shown in Supplementary Fig. 2. The model generates predictions in the form of a 2-D array of monthly event probabilities for the multiple events.
Additional analyses
To evaluate the usefulness of combining clinical and imaging data in a model, we trained and tested a separate model that used clinical features only. This model utilizes the same architecture as the time-to-event model, but without image input. Additionally, we performed a sensitivity analysis in the external testing population to investigate the effect of removing cases with revascularization events within 180 days from the MPI. We compared the AUC for the prediction of revascularization, death, ACS, and MACE in the external dataset at 1 year and 3 years from scan with and without removing the early revascularization cases.
Comparison with Cox regression model
For comparison, we created a multivariable Cox regression model that used all clinical features utilized by the time-to-event model and stress total perfusion deficit (perfusion abnormality). This model was trained in the internal set and evaluated in the external testing set. We compared AUC for the prediction of death, ACS, revascularization, and MACE at 1 and 3 years from scan.
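For reference, a minimal sketch of such a Cox model on toy data, using the scikit-survival implementation (the actual feature set and software may differ), is:

```python
import numpy as np
from sksurv.linear_model import CoxPHSurvivalAnalysis
from sksurv.util import Surv

# Toy covariates standing in for clinical features plus stress total perfusion deficit.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([
    rng.normal(65, 10, n),      # age (years)
    rng.integers(0, 2, n),      # diabetes (0/1)
    rng.uniform(0, 20, n),      # stress total perfusion deficit (%)
])
y = Surv.from_arrays(event=rng.random(n) < 0.3,       # event indicator (toy)
                     time=rng.exponential(4.0, n))    # years to event / censoring (toy)

cox = CoxPHSurvivalAnalysis().fit(X, y)               # fit in the training set
risk_scores = cox.predict(X)                          # higher score = higher predicted hazard
```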
Internal training and testing routine
The model was trained and tested in a 10-fold repeated hold-out regimen. The development set was randomly divided into 10 samples (folds) with the same fraction of each MACE event (stratified split). Then, 10 separate models were trained, each using 9 of the 10 folds for training and the remaining fold for testing. Within the 90% of data used for training, 20% of cases were randomly selected for model hyperparameter optimization in each fold. There was no overlap of training data with testing data at any point. Testing results from each of the 10 folds and 10 models were concatenated for a robust assessment of the overall performance in unseen data.
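The splitting scheme can be sketched as follows (toy labels; model training and tuning are only indicated in comments):

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, train_test_split

rng = np.random.default_rng(0)
event_type = rng.integers(0, 4, size=1000)   # toy labels: 0 = no event, 1-3 = event types

skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
for train_idx, test_idx in skf.split(np.zeros((len(event_type), 1)), event_type):
    # Hold out 20% of the training portion for hyperparameter optimization.
    fit_idx, tune_idx = train_test_split(
        train_idx, test_size=0.2, stratify=event_type[train_idx], random_state=0)
    # Train one model on fit_idx, tune it on tune_idx, and collect predictions
    # for test_idx; predictions are concatenated across the 10 folds.
```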
External testing
The generalizability of the approach to data from new medical centers was evaluated in an external testing regimen. For a robust estimation of external performance, each of the 10 models generated in the 10-fold cross-validation of the REFINE SPECT cohort was evaluated in a separate external cohort from three new centers. Performance was then evaluated separately for the 10 sets of predictions and presented as the mean with 95% confidence intervals after bootstrapping.
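The bootstrap confidence intervals can be illustrated with a short, hypothetical helper applicable to any metric computed on the external predictions:

```python
import numpy as np

def bootstrap_ci(metric_fn, y_true, y_score, n_boot=100, seed=0):
    """Hypothetical helper: percentile 95% confidence interval of a performance
    metric, from bootstrap resamples of patients (sampling with replacement)."""
    rng = np.random.default_rng(seed)
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    estimates = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y_true), size=len(y_true))
        estimates.append(metric_fn(y_true[idx], y_score[idx]))
    return np.percentile(estimates, [2.5, 97.5])
```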
Patient-specific explanations
We provide explanations of the individualized predictions made by the algorithm. This approach allows for the identification of important patient-specific features driving the prediction and provides a feature importance ranking for each patient, separately for each of the three outcomes. The individual explanation of the predicted probability of each target event was achieved through the generation of SHapley Additive exPlanations (SHAP) values36. To provide a meaningful explanation, we summed the SHAP values for all pixels in each image input and presented them alongside the importance of the clinical features in the form of waterfall plots. For each of the top contributing features, the waterfall plot visualizes how strongly the given feature increases or decreases the risk of the specific event in a specific patient.
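An illustrative sketch of this explanation step is shown below. It is not the exact pipeline used in the study: it assumes the network from the architecture sketch above, reduces the output to a single scalar (1-year death risk) before explanation, uses SHAP's gradient-based explainer, and the feature names and reference inputs are placeholders.

```python
import numpy as np
import shap
import torch

class DeathRiskAtOneYear(torch.nn.Module):
    """Wrapper that reduces the 3 x 131 output grid to one scalar to explain:
    cumulative death probability within 12 months (event index 0 = death, assumption)."""
    def __init__(self, net):
        super().__init__()
        self.net = net

    def forward(self, polar_maps, clinical):
        probs = self.net(polar_maps, clinical)            # shape: [batch, 3 events, 131 bins]
        return probs[:, 0, :13].sum(dim=1, keepdim=True)

wrapped = DeathRiskAtOneYear(model)                       # `model` from the architecture sketch
background = [torch.zeros(64, 5, 28, 36), torch.zeros(64, 15)]     # placeholder reference cases
explainer = shap.GradientExplainer(wrapped, background)

polar_maps, clinical = torch.zeros(1, 5, 28, 36), torch.zeros(1, 15)   # one toy patient
sv_maps, sv_clin = explainer.shap_values([polar_maps, clinical])       # return structure varies by shap version

values = np.concatenate([[np.asarray(sv_maps).sum()],     # image contribution pooled over all pixels
                         np.ravel(sv_clin)])
names = ["Polar maps (all pixels)"] + [f"clinical feature {i + 1}" for i in range(15)]
base_value = float(wrapped(*background).mean())           # approximate expected risk in the reference set

shap.plots.waterfall(
    shap.Explanation(values=values, base_values=base_value,
                     data=np.zeros_like(values), feature_names=names))
```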
Statistical analysis
Continuous variables were expressed as medians with interquartile ranges (IQR). Two-sided Kruskal-Wallis tests were used to compare differences in median values. Categorical variables were compared using Fisher's exact test. A p value <0.05 was considered statistically significant. Univariable comparisons and summary statistics were computed using R 4.1.2 and RStudio. Details on the software packages and versions used are given in Supplementary Table 6.
The predictive performance of the model was evaluated using the time-dependent concordance index37, which extends the concordance index to time-dependent predictions, and the cumulative dynamic area under the receiver operating characteristic (ROC) curve (cAUC)38, as implemented in the scikit-survival Python package. This measure reflects the probability that, given two randomly chosen patients, one having experienced the event before time T and the other after time T, the prognostic marker will rank them correctly. We used plots of the cAUC values as a function of time from scan to visualize the ability to capture temporal changes in the risk of adverse events. The 95% confidence limits for the cAUC curves were established using bootstrapping (100 samples with replacement). We compared the cAUC of our model's output with that of the clinical-only model and with the extent of perfusion abnormality (stress total perfusion deficit)30, an established, clinically used quantitative MPI variable. In previous studies, the total perfusion deficit measure demonstrated efficient risk stratification39 and identification of patients who may benefit from early revascularization24.
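For reference, a minimal sketch of the cAUC computation with scikit-survival on toy data is:

```python
import numpy as np
from sksurv.metrics import cumulative_dynamic_auc
from sksurv.util import Surv

rng = np.random.default_rng(0)
time = rng.uniform(0.1, 5.0, size=500)                    # years of follow-up (toy)
event = rng.random(500) < 0.3                             # event indicator (toy)
risk = time.max() - time + rng.normal(0, 1, 500)          # toy risk score (higher = riskier)

y_train = Surv.from_arrays(event=event[:400], time=time[:400])
y_test = Surv.from_arrays(event=event[400:], time=time[400:])
eval_times = np.arange(0.5, 4.5, 0.5)                     # evaluation grid within follow-up

cauc, mean_cauc = cumulative_dynamic_auc(y_train, y_test, risk[400:], eval_times)
```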
Additionally, we evaluated the model using the area under the ROC curve (AUC) at three time points in the internal testing set (1, 3, and 5 years from scan), separately for all-cause death, ACS, and revascularization; AUCs were compared at each time point and for each event using DeLong's test.
Compliance with recommendations for machine-learning-related research
This study was designed and conducted following the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD)40 checklist that is included as Supplementary Table 7.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
De-identified data supporting this study may be shared based on a reasonable written request to the corresponding author. Access to de-identified data will require a Data Access Agreement and IRB clearance, which will be considered by the institutions that provided the data for this research.
Code availability
The source code will be shared using a Creative Commons NC-ND 4.0 international license upon reasonable written request to the corresponding author and requires a research use agreement. The list of software packages for the deep learning model and for statistical analyses and their versions is provided in Supplementary Table 6.
References
Oren, O., Gersh, B. J. & Bhatt, D. L. Artificial intelligence in medical imaging: switching from radiographic pathological data to clinically meaningful endpoints. Lancet Digital Health 2, e486–e488 (2020).
Knuuti, J. et al. 2019 ESC Guidelines for the diagnosis and management of chronic coronary syndromes: The Task Force for the diagnosis and management of chronic coronary syndromes of the European Society of Cardiology (ESC). Eur. Heart J. 41, 407–477 (2020).
Otaki, Y. et al. Clinical Deployment of Explainable Artificial Intelligence for Diagnosis of Coronary Artery Disease. JACC Cardiovasc Imaging In press. (2021).
Knott, K. D. et al. The Prognostic Significance of Quantitative Myocardial Perfusion: An Artificial Intelligence–Based Approach Using Perfusion Mapping. Circulation 141, 1282–1291 (2020).
Oikonomou, E. K. et al. A novel machine learning-derived radiotranscriptomic signature of perivascular fat improves cardiac risk prediction using coronary CT angiography. Eur. Heart J. 40, 3529–3543 (2019).
Kip, K. E., Hollabaugh, K., Marroquin, O. C. & Williams, D. O. The Problem With Composite End Points in Cardiovascular Studies. The Story of Major Adverse Cardiac Events and Percutaneous Coronary Intervention. J. Am. Coll. Cardiol. 51, 701–707 (2008).
Mobadersany, P. et al. Predicting cancer outcomes from histology and genomics using convolutional networks. Proc. Natl. Acad. Sci. 115, E2970–E2979 (2018).
Nagpal, C., Li, X. & Dubrawski, A. Deep Survival Machines: Fully Parametric Survival Regression and Representation Learning for Censored Data With Competing Risks. IEEE J. Biomed. Health Inf. 25, 3163–3175 (2021).
Katzman, J. L. et al. DeepSurv: Personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med. Res. Methodol. 18, 1–12 (2018).
Cox, D. R. Regression Models and Life-Tables. J. R. Stat. Soc. Ser. B (Methodol.) 34, 187–220 (1972).
Yousefi, S. et al. Predicting clinical outcomes from large scale cancer genomic profiles with deep survival models. Sci. Rep. 7, 11707 (2017).
Kim, D. W. et al. Deep learning-based survival prediction of oral cancer patients. Sci. Rep. 9, 6994–6994 (2019).
Adeoye, J. et al. Deep Learning Predicts the Malignant-Transformation-Free Survival of Oral Potentially Malignant Disorders. Cancers 13, (2021).
Rajkomar, A. et al. Scalable and accurate deep learning with electronic health records. npj Digital Med. 1, 1–10 (2018).
Hu, L.-H. et al. Prognostically safe stress-only single-photon emission computed tomography myocardial perfusion imaging guided by machine learning: report from REFINE SPECT. Eur. Heart J. - Cardiovascular Imaging 1, 1–10 (2020).
Khurshid, S. et al. ECG-Based Deep Learning and Clinical Risk Factors to Predict Atrial Fibrillation. Circulation 145, 122–133 (2022).
Elwyn, G. et al. Shared decision making: a model for clinical practice. J. Gen. Intern Med 27, 1361–1367 (2012).
Neeland, I. J., McGuire, D. K. & Sattar, N. Cardiovascular Outcomes Trials for Weight Loss Interventions: Another Tool for Cardiovascular Prevention? Circulation 144, 1359–1361 (2021).
Whelton Paul, K. et al. 2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults. J. Am. Coll. Cardiol. 71, e127–e248 (2018).
Grundy, S. M. et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA Guideline on the Management of Blood Cholesterol: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation 139, e1082–e1143 (2019).
Kumar, I. E., Venkatasubramanian, S., Scheidegger, C. & Friedler, S. Problems with Shapley-value-based explanations as feature importance measures. In International Conference on Machine Learning, 5491–5500 (PMLR, 2020).
Romero-Farina, G. et al. Warranty periods for normal myocardial perfusion stress SPECT. J. Nucl. Cardiol. 22, 44–54 (2015).
Zoghbi, G. J., Dorfman, T. A. & Iskandrian, A. E. The Effects of Medications on Myocardial Perfusion. J. Am. Coll. Cardiol. 52, 401–416 (2008).
Azadani, P. N. et al. Impact of Early Revascularization on Major Adverse Cardiovascular Events in Relation to Automatically Quantified Ischemia. JACC: Cardiovascular Imaging 14, 644–653 (2021).
Rozanski, A. et al. Benefit of Early Revascularization Based on Inducible Ischemia and Left Ventricular Ejection Fraction. J. Am. Coll. Cardiol. 80, 202–215 (2022).
Nudi, F., Schillaci, O., Biondi-Zoccai, G. & Iskandrian, A. E. Hybrid cardiac imaging for clinical decision-making. 1 edn, (Springer Nature, 2022).
Miller, R. J. H. et al. Deep learning coronary artery calcium scores from SPECT/CT attenuation maps improves prediction of major adverse cardiac events. J. Nucl. Med. 122, 264423 (2022).
Slomka, P. J. et al. Rationale and design of the REgistry of Fast Myocardial Perfusion Imaging with NExt generation SPECT (REFINE SPECT). J. Nucl. Cardiol. 27, 1010–1021 (2020).
Gulati, M. et al. 2021 AHA/ACC/ASE/CHEST/SAEM/SCCT/SCMR Guideline for the Evaluation and Diagnosis of Chest Pain: A Report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation 144, e368–e454 (2021).
Slomka, P. J. et al. Automated quantification of myocardial perfusion SPECT using simplified normal limits. J. Nucl. Cardiol. 12, 66–77 (2005).
Lee, C., Zame, W. R., Yoon, J. & Van Der Schaar, M. DeepHit: A deep learning approach to survival analysis with competing risks. 32nd AAAI Conference on Artificial Intelligence, AAAI 2018, 2314–2321 (2018).
Kvamme, H., Borgan, O. & Scheel, I. Time-to-event prediction with neural networks and cox regression. J. Mach. Learn. Res. 20, 1–30 (2019).
Rios, R. et al. Determining a minimum set of variables for machine learning cardiovascular event prediction: results from REFINE SPECT registry. Cardiovasc Res. 118, 2152–2164 (2021).
Rios, R. et al. Handling missing values in machine learning to predict patient-specific risk of adverse cardiac events: Insights from REFINE SPECT registry. Computers Biol. Med. 145, 105449 (2022).
Austin, P. C., Lee, D. S. & Fine, J. P. Introduction to the Analysis of Survival Data in the Presence of Competing Risks. Circulation 133, 601–609 (2016).
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems 30 (eds Guyon, I. et al.) (Curran Associates, Inc., 2017).
Uno, H., Cai, T., Pencina, M. J., D’Agostino, R. B. & Wei, L. J. On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Stat. Med 30, 1105–1117 (2011).
Lambert, J. & Chevret, S. Summary measure of discrimination in survival models based on cumulative/dynamic time-dependent ROC curves. Stat. Methods Med. Res. 25, 2088–2102 (2014).
Otaki, Y. et al. 5-Year Prognostic Value of Quantitative Versus Visual MPI in Subtle Perfusion Defects. JACC: Cardiovascular Imaging 13, 774–785 (2020).
Moons, K. G. M. et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann. Intern. Med. 162, W1–W73 (2015).
Acknowledgements
This research was supported in part by grants R01-HL089765 and R35-HL161195 from the National Heart, Lung, and Blood Institute/ National Institutes of Health (NHLBI/NIH) (PI: Piotr Slomka). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author information
Authors and Affiliations
Contributions
K.P. and P.S. conceptualized and designed the study. K.P., A.S., and A.S. developed and tested the deep learning model. A.S. analyzed the data. P.S. designed and organized the multi-site data registry. K.P. performed the investigation, validation, formal analysis, and visualization. K.P. and A.S. have directly accessed and verified the underlying data reported in the manuscript. K.P. drafted the manuscript, which was edited and critically reviewed by A.S., A.S., M.T.H., R.M., J.L., M.M., J.K., T.S., A.J.E., M.B.F., T.D.R., P.K., A.S., E.M., T.B., S.D., M.D.C., D.B., D.D., and P.S. Cedars-Sinai Medical Center was the coordinating center for the study and the core lab for all analyses. All authors read and approved the final manuscript and had final responsibility for the decision to submit for publication.
Corresponding author
Ethics declarations
Competing interests
Drs. Berman and Slomka participate in software royalties for QPS software at Cedars-Sinai Medical Center. Dr. Slomka has received research grant support from Siemens Medical Systems and has received consulting honoraria from Synektik, SA. Dr. Berman has served as a consultant for GE Healthcare. Dr. Robert Miller has served as a consultant for Pfizer. Dr. Einstein has served as a consultant to W. L. Gore & Associates. Dr. Di Carli has received research grant support from Spectrum Dynamics and consulting honoraria from Sanofi and GE Healthcare. Dr. Ruddy has received research grant support from GE Healthcare. Dr. Einstein's institution has received research support from GE Healthcare, International Atomic Energy Agency, Eidos Therapeutics, Roche Medical Systems, Pfizer, Attralus, and W. L. Gore & Associates. Dr. Robert Miller's institution has received research support from Pfizer. The remaining authors have nothing to disclose.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pieszko, K., Shanbhag, A.D., Singh, A. et al. Time and event-specific deep learning for personalized risk assessment after cardiac perfusion imaging. npj Digit. Med. 6, 78 (2023). https://doi.org/10.1038/s41746-023-00806-x