Application of interpretable machine learning algorithms to predict distant metastasis in ovarian clear cell carcinoma.
Abstract | BACKGROUND: METHODS: Utilizing the Surveillance, Epidemiology, and End Results (SEER) database, we identified patients diagnosed with OCCC between 2004 and 2015. The most influential factors were selected through the application of Gaussian Naive Bayes (GNB) and Adaboost machine learning algorithms, employing a Venn test for further refinement. Subsequently, six machine learning (ML) techniques, namely XGBoost, LightGBM, Random Forest (RF), Adaptive Boosting (Adaboost), Support Vector Machine (SVM), and Multilayer Perceptron (MLP), were employed to construct predictive models for distant metastasis. Shapley Additive Interpretation (SHAP) analysis facilitated a visual interpretation for individual patient. Model validity was assessed using accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and the area under the receiver operating characteristic curve (AUC). RESULTS: In the realm of predicting distant metastasis, the Random Forest (RF) model outperformed the other five machine learning algorithms. The RF model demonstrated accuracy, sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and AUC (95% CI) values of 0.792 (0.762-0.823), 0.904 (0.835-0.973), 0.759 (0.731-0.787), 0.221 (0.186-0.256), 0.974 (0.967-0.982), 0.353 (0.306-0.399), and 0.834 (0.696-0.967), respectively, surpassing the performance of other models. Additionally, the calibration curve's Brier Score (95%) for the RF model reached the minimum value of 0.06256 (0.05753-0.06759). SHAP analysis provided independent explanations, reaffirming the critical clinical factors associated with the risk of metastasis in OCCC patients. CONCLUSIONS: This study successfully established a precise predictive model for OCCC patient metastasis using machine learning techniques, offering valuable support to clinicians in making informed clinical decisions.
|
Authors | Qin-Hua Guo, Feng-Chun Xie, Fang-Min Zhong, Wen Wen, Xue-Ru Zhang, Xia-Jing Yu, Xin-Lu Wang, Bo Huang, Li-Ping Li, Xiao-Zhong Wang |
Journal | Cancer medicine
(Cancer Med)
Vol. 13
Issue 7
Pg. e7161
(Apr 2024)
ISSN: 2045-7634 [Electronic] United States |
PMID | 38613173
(Publication Type: Journal Article)
|
Copyright | © 2024 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. |
Topics |
- Female
- Humans
- Bayes Theorem
- Algorithms
- Carcinoma, Ovarian Epithelial
- Adenocarcinoma, Clear Cell
- Machine Learning
- Ovarian Neoplasms
|
|
Join CureHunter, for free Research Interface BASIC access!
Take advantage of free CureHunter research engine access to explore the best drug and treatment options for any disease.
Find out why thousands of doctors, pharma researchers and patient activists
around the world use CureHunter every day.
Realize the full power of the drug-disease research graph!
|