Abstract
This study integrates Kaplan–Meier survival analysis with the Stochastic and Augmented Interpretable Health Analytics (SAIHA) framework to model long-term survival in pancreatic cancer, a malignancy characterized by late diagnosis, rapid progression, and poor prognosis. The Kaplan–Meier estimator was first employed to nonparametrically characterize empirical survival probabilities across the observed follow-up period, capturing censoring patterns and short-term mortality dynamics without imposing distributional assumptions. This step provided a transparent baseline representation of survival up to approximately four years post-diagnosis, where empirical data density remains sufficient for reliable estimation. To address the limitations of traditional Kaplan–Meier analysis in extrapolating beyond observed follow-up, the SAIHA framework was then applied using a Weibull survival model to propagate uncertainty, incorporate population heterogeneity, and generate probabilistic survival projections into the long-term horizon. The Weibull distribution was selected for its flexibility in modeling monotonic hazard functions commonly observed in aggressive cancers and for its interpretability within clinical contexts. Parameter uncertainty was explicitly modeled to reflect variability in disease progression and treatment response across patients. The combined model predicts a pronounced decline in survival beyond year four, with the most likely five-year survival probability estimated near 3% and a median six-year survival approaching 1.5%. These projections align with known epidemiological patterns of pancreatic cancer and underscore the persistent lethality of the disease despite advances in therapy. Importantly, the SAIHA framework provides full survival distributions rather than point estimates, enabling clinicians and researchers to assess uncertainty bounds and tail risks associated with long-term outcomes. 
Overall, the integrated Kaplan–Meier–SAIHA approach extends classical survival analysis by combining empirical rigor with stochastic, distribution-aware forecasting. This methodology offers a robust and interpretable framework for high-risk clinical prediction, supporting more informed decision-making in oncology research, population health modeling, and precision medicine applications.
Keywords
Prediction Analytics, Kaplan–Meier Method, Weibull Distribution, SAIHA Framework
1. Introduction
The Kaplan–Meier estimator calculates survival by multiplying the successive conditional probabilities of survival across time intervals [1]. For each interval, the survival probability is computed as the number of individuals who survive the interval divided by the number of individuals at risk at the beginning of that interval. Patients who die, withdraw, or are lost to follow-up are no longer considered “at risk”; such cases are treated as censored, meaning they are excluded from the denominator but included in the analysis up to the point of censoring.
The estimator divides the observation period into intervals based on the distinct times at which events occur. It is particularly valuable in medical research for analyzing patient survival, time to disease recurrence, and treatment outcomes. By relying only on observed data, the Kaplan–Meier method provides a nonparametric estimate of the survival distribution without requiring additional assumptions about the underlying hazard function.
At each event time, the probability of surviving that interval is calculated, and these probabilities are multiplied (a cumulative product) across intervals to generate the overall survival function. Because it properly accounts for censoring, the method incorporates information from patients who remain alive or are lost to follow-up before the study ends.
The graphical output of the Kaplan–Meier estimator is the Kaplan–Meier survival curve, which plots estimated survival probability over time. This curve is a step function that decreases at each observed event time and remains flat between events. It is one of the most widely used tools for visualizing time-to-event outcomes in clinical and epidemiological research.
The estimator is defined as

S(t) = ∏_{tᵢ ≤ t} (1 − dᵢ / nᵢ)

where:
dᵢ = number of deaths at time tᵢ
nᵢ = number of patients at risk just before tᵢ
Comparison of survival curves across groups—such as treatment versus control—is typically performed using the log-rank test, a nonparametric statistical test designed to assess whether differences between survival curves are statistically significant. Because of its simplicity and interpretability, the Kaplan–Meier method is widely used in clinical trials, epidemiological studies, engineering reliability analysis, economics, and social sciences.
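The log-rank computation can be sketched in a few lines of Python. The event times below are hypothetical, and for brevity the sketch assumes every subject experiences the event (no censoring); a full implementation would also track censored subjects in the risk sets.

```python
from collections import Counter

def logrank_statistic(times_a, times_b):
    """Log-rank chi-square statistic for two groups of event times
    (simplified: every subject is assumed to experience the event)."""
    deaths_a, deaths_b = Counter(times_a), Counter(times_b)
    n_a, n_b = len(times_a), len(times_b)       # currently at risk
    observed_a, expected_a, variance = 0.0, 0.0, 0.0
    for t in sorted(set(times_a) | set(times_b)):
        d_a, d_b = deaths_a[t], deaths_b[t]
        d, n = d_a + d_b, n_a + n_b
        observed_a += d_a
        expected_a += d * n_a / n               # expected deaths in group A
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
        n_a -= d_a                              # remove deaths from risk sets
        n_b -= d_b
    return (observed_a - expected_a) ** 2 / variance

# Hypothetical event times (years) for a treated and a control group:
chi2 = logrank_statistic([2, 3, 3, 4, 5], [1, 1, 2, 2, 3])
print(round(chi2, 3))   # chi-square on 1 degree of freedom, ≈ 4.36 here
```

The statistic is compared against a chi-square distribution with one degree of freedom; values above 3.84 indicate a difference significant at the 0.05 level.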
However, the method has several important limitations. First, it assumes independent censoring, meaning the reason a subject is censored must be unrelated to their risk of experiencing the event. Second, KM analysis does not incorporate covariates such as age, sex, comorbidities, or treatment type. For these reasons, multivariable approaches, most commonly the Cox proportional hazards (PH) model, are widely used to account for individual-level risk factors [3].
In the Cox proportional hazards model, the goal is not to model the actual distribution of survival times or to predict how long a patient will live. Instead, the model focuses on relative risk, estimating how the instantaneous likelihood of experiencing the event differs between individuals based on their covariates. The Cox model estimates the logarithm of the hazard, not survival time. As a result, the model cannot determine how many months the event is delayed or accelerated. Time-based predictions require alternative frameworks—most notably the Accelerated Failure Time (AFT) model, which estimates survival time directly rather than the hazard [5]. The AFT model has been proposed in [5] as an alternative to the Cox proportional hazards model. However, its parametric form requires specifying a particular distribution for event times, which is often difficult to determine in real-world studies and can limit its practical applicability.
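To make the contrast concrete, the following is a minimal, purely illustrative AFT sketch in which covariates act multiplicatively on survival time itself, so the output is in time units rather than hazard ratios. The baseline median and the coefficients in `betas` are assumed values, not estimates from any study.

```python
import math

# Accelerated failure time (AFT) sketch:  log T = mu + beta.x + error,
# so each covariate multiplies survival time by exp(beta).
# Baseline median and coefficients below are assumed, for illustration only.
baseline_median_months = 12.0
betas = {"metastatic": -0.9, "resected": 0.6}   # hypothetical log-time effects

def median_survival(covariates):
    """Median survival time (months) under the assumed AFT coefficients."""
    shift = sum(betas[name] * value for name, value in covariates.items())
    return baseline_median_months * math.exp(shift)

# A metastatic, unresected patient versus a non-metastatic, resected one:
print(round(median_survival({"metastatic": 1, "resected": 0}), 1))  # 12*e^-0.9
print(round(median_survival({"metastatic": 0, "resected": 1}), 1))  # 12*e^0.6
```

Unlike a hazard ratio, these outputs are directly interpretable as months of survival, which is precisely the kind of prediction the Cox model cannot produce.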
In clinical medicine, understanding time-to-event outcomes such as time to death, readmission, or postoperative complications provides critical insight for healthcare decision-making, quality improvement, and resource allocation. A survival analysis allows healthcare leaders to evaluate not only whether an event occurs, but when it occurs, offering richer and more actionable information than simple event counts [6].
While the Kaplan–Meier estimator provides a nonparametric, descriptive summary of the survival function, it treats the study population as a single homogeneous group and cannot adjust for confounding or prognostic variables. The Cox proportional hazards model extends survival analysis by relating the hazard (instantaneous event rate) to multiple explanatory variables, such as age, disease stage, biomarkers, or treatment modality. This semi-parametric model produces adjusted hazard ratios, quantifies the independent influence of each covariate, and enables individualized survival predictions.
Whereas KM analysis is ideal for visualizing unadjusted survival and comparing groups at the descriptive level, the Cox model offers deeper inferential and predictive power. It assumes that hazard ratios remain constant over time (the proportional hazards assumption) and, when this assumption holds, provides a robust framework for evaluating risk factors in clinical and managerial epidemiology. Because the Kaplan–Meier method cannot extrapolate future survival or adjust for covariates, the Cox PH model is indispensable when patient-level variability must be incorporated into survival predictions.
The model assumes that the hazard function for an individual i with covariate vector xᵢ = (xᵢ₁, …, xᵢₚ) is given by:

hᵢ(t) = h₀(t) · exp(β₁xᵢ₁ + β₂xᵢ₂ + … + βₚxᵢₚ) (3)

where h₀(t) is the baseline hazard function (the hazard for an individual with all covariates equal to 0), and each βⱼ represents the log-relative hazard for the corresponding covariate and is called the covariate coefficient.
One can see from (3) that the Cox method does not yield estimates of survival time. It only predicts the hazard of one individual relative to another. The baseline hazard h₀(t) describes a patient (or subject) with baseline characteristics (all covariates = 0, or the reference category); the corresponding baseline survival function gives the probability that such a patient survives beyond time t.
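The relative nature of equation (3) can be illustrated in a few lines: when two patients are compared, the baseline hazard h₀(t) cancels, leaving only a hazard ratio. The coefficient values in `betas` are assumed for illustration.

```python
import math

# Cox model output is a hazard ratio between two covariate profiles:
#   h_i(t) / h_j(t) = exp(beta . (x_i - x_j))
# The baseline hazard h0(t) cancels, so no survival *time* is produced.
# Coefficients below are assumed for illustration (per year of age;
# metastasis yes/no).
betas = [0.03, 0.8]

def hazard_ratio(x_i, x_j):
    """Relative hazard of patient i versus patient j (h0 cancels out)."""
    return math.exp(sum(b * (a - c) for b, a, c in zip(betas, x_i, x_j)))

# A 70-year-old with metastasis versus a 60-year-old without:
hr = hazard_ratio([70, 1], [60, 0])
print(round(hr, 2))   # exp(0.03*10 + 0.8) = exp(1.1) ≈ 3.0
```

The result says only that the first patient's instantaneous risk is about three times higher; it says nothing about how many months either patient will survive.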
Major covariates of pancreatic cancer are given in Table 1.
Table 1. Major covariates for pancreatic cancer.
Demographic Factors: age at diagnosis (years); sex (male/female); race/ethnicity; body mass index (BMI).
Disease Characteristics: tumor stage (I–IV, or localized vs. regional vs. metastatic); tumor size (cm); lymph node involvement (positive/negative); presence of metastasis (yes/no); tumor grade (well, moderately, or poorly differentiated); CA19-9 biomarker level (continuous or categorical); genetic mutations (e.g., KRAS, TP53, CDKN2A).
Clinical Status and Comorbidities: performance status (ECOG score); comorbidity score (Charlson Comorbidity Index); diabetes status; smoking history; alcohol use.
Traditional statistical approaches, including the Kaplan–Meier estimator and Cox proportional hazards model, have long served as the foundation of survival analysis. While these methods provide valuable insights, they rely on relatively simple quantitative structures and assume proportional hazards or limited functional forms. Deep learning offers a powerful extension to these classical models by capturing complex, nonlinear relationships within high-dimensional clinical data [8].
Deep learning models such as deep neural networks (DNNs), convolutional neural networks (CNNs), and recurrent neural networks (RNNs) have been adapted to handle censored survival data through specially designed loss functions, including variants of the Cox partial likelihood. These architectures are capable of integrating heterogeneous data sources (structured variables, medical images, genomic profiles, and longitudinal electronic health record (EHR) data) within a unified modeling framework. By learning intricate patterns that are inaccessible to traditional models, deep learning enables more accurate estimation of risk and survival probabilities [9].
One prominent approach, DeepSurv, extends the Cox model by replacing its linear predictor with a deep neural network that learns nonlinear covariate effects. Other models, such as DeepHit, bypass proportional hazards assumptions by directly approximating the joint distribution of time and event type, allowing for flexible multi-event prediction. Long short-term memory (LSTM) networks further expand survival analysis by modeling temporal trajectories in laboratory values, vital signs, and treatment patterns. These temporal models capture patient-specific dynamics and provide individualized survival predictions that adapt to evolving clinical conditions [10].
Despite their advantages, deep learning models in survival analysis face notable challenges. They require large, high-quality training datasets and careful handling of censoring to avoid biased predictions. They are also less interpretable than traditional survival models, which complicates their clinical adoption. Although techniques such as SHAP values, feature-attribution heatmaps, and attention mechanisms improve transparency, interpretability remains a key limitation in real-world healthcare applications.
Deep learning nevertheless represents a transformative step in survival prediction by bridging classical epidemiological methods with modern computational tools. By leveraging nonlinear modeling and multimodal data integration, these methods offer enhanced precision in estimating patient prognosis and supporting personalized treatment decisions. However, in many clinical domains—especially in rare or highly lethal diseases—available survival datasets are typically short and small, which severely limits the reliability and applicability of deep learning–based approaches.
To address these limitations, this paper builds on classical survival analysis (the Kaplan–Meier estimator for empirical survival characterization and the Weibull model for parametric hazard modeling) and extends both through the novel SAIHA framework. SAIHA is a stochastic and augmented approach specifically designed for small-sample survival prediction. By combining Kaplan–Meier–based empirical estimation with Weibull-based parametric extrapolation and probabilistic data augmentation, SAIHA enables robust, interpretable, accurate, and distribution-aware survival forecasting even under severe data scarcity.
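The extrapolation step of such a framework can be caricatured as Monte Carlo propagation of Weibull parameter uncertainty. The sketch below is purely illustrative, not SAIHA's actual algorithm: the Gaussian uncertainty around an assumed shape of 1.8 and scale of 2.6 years is hypothetical, not fitted values from this study.

```python
import math
import random
import statistics

random.seed(42)

def survival_draws(t, n=10_000):
    """Distribution of S(t) = exp(-(t/scale)**shape) under assumed
    (hypothetical) Gaussian uncertainty in the Weibull parameters."""
    draws = []
    for _ in range(n):
        shape = random.gauss(1.8, 0.15)   # assumed uncertainty in shape
        scale = random.gauss(2.6, 0.20)   # assumed uncertainty in scale (years)
        draws.append(math.exp(-((t / scale) ** shape)))
    return draws

# Full distribution of 5-year survival, rather than a single point estimate:
five_year = survival_draws(5.0)
print(round(statistics.median(five_year), 3))
```

The output is a distribution of plausible five-year survival probabilities, from which medians, credible bounds, and tail risks can be read off, which is the distribution-aware behavior described above.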
2. Pancreatic Cancer: Survival Prediction
Pancreatic cancer remains among the most lethal malignancies due to its late-stage diagnosis. A delay in diagnosis is often attributed to early symptoms being frequently overlooked or misdiagnosed because of their subtle and vague nature. Clinical manifestations usually appear late in disease progression; symptoms such as abdominal pain, unintended weight loss, jaundice, anorexia, and generalized weakness may develop [10].
The cancer is classified into four stages based on tumor size and spread:
Stage I: Confined to the pancreas.
Stage II: Spread to nearby tissues or lymph nodes.
Stage III: Involves major blood vessels or extensive lymph node involvement.
Stage IV: Metastasized to distant organs like the liver or lungs.
Pancreatic cancer survival rates vary significantly by stage, underscoring the disease’s aggressive progression and the frequent occurrence of late-stage detection. According to the American Cancer Society, the relative five-year survival rate is approximately 44% for stage I, 15% for stage II, 7% for stage III, and just 3% for stage IV—emphasizing the critical importance of early diagnosis. In 2025, an estimated 67,440 individuals in the United States will be newly diagnosed with pancreatic cancer, and approximately 51,980 are projected to experience disease-related mortality. Despite modest improvements in overall survival—from 7% to 13% over the past decade—pancreatic cancer remains a pressing public health concern [14].
Early detection of cancer significantly improves patient survival. However, certain malignancies—most notably pancreatic cancer—are difficult to diagnose at an early stage due to subtle symptoms and rapid disease progression. This paper reviews state-of-the-art techniques in cancer survival prediction and discusses how these methods can be applied to estimate overall survival in patients with pancreatic ductal adenocarcinoma (PDAC).
Given the complexity and sheer volume of contemporary clinical data, recent research emphasizes the value of machine learning (ML) approaches such as support vector machines and convolutional neural networks. Current studies report that PDAC survival rates remain exceptionally low, with approximately 41.7% survival at one year, 8.7% at three years, and only 1.9% at five years. Notably, some analyses indicate that stage at diagnosis shows limited correlation with overall survival, underscoring the heterogeneous and aggressive nature of the disease.
The integration of ML algorithms has the potential to deepen our understanding of cancer progression and enhance prognostic accuracy. To be adopted in routine clinical practice, however, these methods require rigorous validation to ensure reliability and generalizability. Ultimately, ML-based techniques aim to support classification, prediction, and risk estimation, thereby enabling more accurate assessments of patient status, guiding surgical and therapeutic decisions, optimizing resource allocation, and improving individualized treatment planning [15].
Pancreatic ductal adenocarcinoma is the most common type of pancreatic cancer, accounting for more than 90% of all cases. It originates in the cells lining the pancreatic ducts, which carry digestive enzymes from the pancreas to the small intestine.
Key Points about PDAC Patients:
Very aggressive cancer: PDAC progresses quickly and is often diagnosed at a late stage.
Poor survival rates: Five-year survival is typically around 2–10%, depending on stage and treatment.
Symptoms appear late: Abdominal pain, weight loss, jaundice, or digestive issues usually occur only when the disease is advanced.
Treatment complexity: Surgery, chemotherapy (e.g., FOLFIRINOX), and radiation may be used, but options depend heavily on stage and tumor location.
So, a PDAC patient simply refers to any patient who has been clinically diagnosed with pancreatic ductal adenocarcinoma.
Accurate survival predictions play a central role in clinical oncology, as they provide essential information on a patient’s condition, guide surgical decision-making, support the optimal allocation of medical resources, and enable the design of individualized treatment plans. They also assist clinicians in selecting appropriate pharmacological therapies and improving overall patient management. Contemporary predictive models draw on a diverse range of features, including genomic and proteomic data, clinical variables, and histopathological images. However, the reliability of these predictions depends heavily on rigorous experimental design, the use of validated data sources, and careful analytical and statistical assessment. Weaknesses in any of these areas can directly compromise the accuracy of clinical decision-making [16, 17].
Machine learning (ML) methods have been increasingly employed to classify PDAC patients and estimate individualized survival times. These approaches can stratify patients into low-, medium-, and high-risk categories and capture complex patterns underlying disease progression and treatment response. By identifying informative features within large and heterogeneous datasets, ML tools provide capabilities that exceed traditional analytic approaches and are particularly valuable for modeling highly aggressive cancers such as PDAC [18].
The expansion of big data in healthcare introduces significant cognitive and operational burdens for clinicians, increasing the likelihood of diagnostic errors—especially when interpreting large volumes of imaging data. ML and deep learning systems can process such datasets more efficiently and with lower error rates, identifying subtle imaging patterns that may escape human detection. These systems can support clinicians by highlighting suspicious regions, identifying potential malignancies, and offering preliminary diagnostic insights in circumstances where specialists are unavailable. Nevertheless, such systems are intended to enhance, not replace, clinical expertise, and their deployment must be accompanied by appropriate safeguards and validation procedures [19].
Predictive modeling techniques, including traditional multivariate regression, machine learning, and deep learning (DL), offer promising approaches for estimating PDAC survival outcomes. To achieve optimal predictive performance, these models require careful tuning, robust validation, and meticulous feature engineering.
As illustrated in Figure 3, survival prediction typically follows a structured workflow beginning with data acquisition, imaging analysis, and segmentation of MRI or CT scans into meaningful pixel classes. Subsequent feature extraction provides the basis for risk classification and survival estimation. Recent advances in deep learning have substantially improved both feature extraction and classification performance, making DL-based systems increasingly prominent in PDAC survival prediction research.
3. Data Description
Data for this study were obtained from an epidemiological investigation of pancreatic cancer conducted in Brazil between 2000 and 2019. Pancreatic cancer represents a growing and serious public health challenge in Brazil, characterized by high lethality, late diagnosis, and marked regional disparities in incidence, mortality, and access to specialized care. Recent analyses indicate that incidence rates have been rising steadily, particularly in economically developed regions, while significant inequities persist between patients treated in the public healthcare system and those receiving care in private or specialized centers. The increasing burden of pancreatic cancer among older adults is especially concerning in a rapidly aging country such as Brazil, highlighting the need for expanded diagnostic capacity, equitable resource allocation, and evidence-based health policies. Notably, the greatest increases in incidence were observed in states with lower Socio-Demographic Index (SDI) scores, suggesting considerable inequalities in access to diagnostic services, cancer registries, and timely treatment [20].
The cohort examined in this study consisted of 326 patients diagnosed with pancreatic cancer during the study period. The broader regional analysis documented a significant increase in age-standardized incidence and mortality across all three countries included in the comparative study, although the magnitude and annual growth rates varied. In Brazil, incidence rose from 5.33 per 100,000 inhabitants (95% Uncertainty Interval [UI]: 5.06–5.51) to 6.16 per 100,000 (95% UI: 5.68–6.53).
While absolute rates were lower in China and in Brazilian states with the lowest SDI levels such as Pará and Maranhão, these areas experienced the highest annual increases, reinforcing concerns regarding underdiagnosis, delayed detection, and systemic inequities.
No significant differences in incidence or mortality were observed between males and females. Mortality was substantially higher among individuals aged 70 years and older, with death rates three to four times greater than those among individuals aged 50 to 69, underscoring the profound influence of age on disease outcomes [20].
Survival analysis using the Kaplan–Meier estimator revealed a steep decline in patient survival over the follow-up period, consistent with the highly aggressive nature of pancreatic cancer. At one year, approximately 89% of patients remained alive, decreasing to 72.8% by the second year and 47.8% by the third. By the fourth year, survival dropped sharply, with only 11.2% of the cohort still alive. This trajectory underscores the poor long-term prognosis associated with pancreatic cancer and the urgent need for earlier detection, more effective therapeutic strategies, and improved health system capacity for disease management.
Table 2. The Kaplan–Meier survival data in years 1–4.
Year | Deaths | Censored |
1 | 36 | 15 |
2 | 50 | 9 |
3 | 74 | 22 |
4 | 92 | 20 |
Using the Kaplan–Meier algorithm, the results are as follows:
Table 3. Kaplan–Meier survival probabilities, years 1 to 4.
Year | Deaths | Censored | At Risk | Survival probability | Cumulative survival |
1 | 36 | 15 | 326 | 0.889571 | 0.889571 |
2 | 50 | 9 | 275 | 0.818182 | 0.727830 |
3 | 74 | 22 | 216 | 0.657407 | 0.478481 |
4 | 92 | 20 | 120 | 0.233333 | 0.111646 |
Year 1: As shown in Table 3, 326 patients were at risk at the beginning. During the first year, 36 died and 15 were censored (lost to follow-up or withdrew). The survival probability for that year was 0.8896, meaning that 88.96% of those at risk survived the first year. Because this is the first period, cumulative survival is also 0.8896.
Year 2: Now only 275 patients remain at risk after accounting for deaths and censoring from Year 1. Fifty patients died and 9 were censored. The conditional survival within this second year was 0.8182 (≈ 82% of those still alive at the start of the year survived to the end). The cumulative survival probability after two years becomes 0.7278, indicating that about 73% of the original cohort remained alive after two years.
Year 3: Among 216 individuals at risk, 74 died and 22 were censored.
The conditional survival for the third year was 0.6574, and the cumulative survival dropped to 0.4785, meaning only 47.85% of the original 326 patients were alive at three years.
Year 4: Of the 120 remaining at risk, 92 died and 20 were censored. The conditional survival was very low (0.2333), and the cumulative survival probability fell sharply to 0.1116, showing that only about 11% of the original cohort survived to year 4.
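The year-by-year arithmetic above can be reproduced directly from the counts in Table 2 with a short script:

```python
# Reproduce the Kaplan–Meier cumulative survival in Table 3 from the
# death/censoring counts in Table 2 (326 patients at baseline).
counts = [(36, 15), (50, 9), (74, 22), (92, 20)]  # (deaths, censored) per year

at_risk, cumulative = 326, 1.0
for year, (deaths, censored) in enumerate(counts, start=1):
    conditional = (at_risk - deaths) / at_risk    # P(survive year | at risk)
    cumulative *= conditional                     # product over intervals
    print(year, at_risk, round(conditional, 6), round(cumulative, 6))
    at_risk -= deaths + censored                  # censored leave the risk set

# The final row matches Table 3: conditional 0.233333, cumulative 0.111646.
```

Note how the censored patients reduce the next year's risk set without counting as deaths, which is exactly how the estimator incorporates incomplete follow-up.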
The Kaplan–Meier curve depicts the survival rate from year 1 to year 4, extracted from the Brazilian data.
4. Deep Learning Analysis: LSTM
Deep learning is a branch of artificial intelligence (AI) and machine learning that focuses on building computational models inspired by the structure and function of the human brain. These models, known as artificial neural networks, learn patterns from large amounts of data by adjusting connections between layers of neurons.
Unlike traditional statistical models, which rely heavily on handcrafted features, deep learning automatically discovers complex relationships within the data, making it especially powerful for tasks that involve vision, speech, language, and unstructured information.
The strength of deep learning arises from its multi-layered architecture, where each layer extracts increasingly abstract features. For example, in image analysis, the first layer may detect edges, the next shapes, and deeper layers recognize entire objects. This hierarchical learning process enables deep learning systems to achieve extraordinary accuracy in fields such as radiology, genomics, autonomous vehicles, fraud detection, and natural language processing.
One of the key advantages of deep learning is its ability to learn directly from raw data. This reduces the need for manual feature engineering and allows algorithms to uncover patterns that humans may not even realize exist. However, this power comes with challenges. Deep learning models often require large datasets, significant computational resources, and careful tuning of hyperparameters. They can also behave like “black boxes,” making it difficult to interpret how decisions are made—a critical concern in healthcare, justice systems, and other high-stakes domains [21].
Despite these challenges, deep learning continues to transform modern technology. Breakthroughs such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformers have revolutionized image recognition, speech translation, and large-scale language processing. Deep learning’s ability to handle complexity makes it a cornerstone of today’s AI revolution and a driving force behind future innovations.
In essence, deep learning represents the natural evolution of machine intelligence: systems that learn by example, improve with experience, and push the boundaries of what machines can understand and achieve.
Long Short-Term Memory (LSTM) networks are a specialized form of recurrent neural networks (RNNs) designed to capture long-range dependencies in sequential data. They overcome the limitations of classical RNNs, specifically the vanishing and exploding gradient problems, through a unique architecture built around a cell state and a set of gates [22]. An LSTM unit consists of:
1) Cell state
2) Hidden state
3) Input gate
4) Forget gate
5) Output gate
6) Candidate memory
All gates are computed using learned weight matrices and nonlinear activation functions.
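As a concrete illustration, the gate equations above can be sketched in a few lines of NumPy. This is a minimal toy cell under standard LSTM definitions, not the authors' implementation; all variable names and sizes are illustrative:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM time step. W, U, b hold the stacked learned weights for the
    input (i), forget (f), output (o) gates and the candidate memory (g)."""
    z = W @ x + U @ h_prev + b            # pre-activations, shape (4*H,)
    H = h_prev.shape[0]
    i = sigmoid(z[0:H])                   # input gate
    f = sigmoid(z[H:2*H])                 # forget gate
    o = sigmoid(z[2*H:3*H])               # output gate
    g = np.tanh(z[3*H:4*H])               # candidate memory
    c = f * c_prev + i * g                # new cell state
    h = o * np.tanh(c)                    # new hidden state
    return h, c

rng = np.random.default_rng(0)
D, H = 3, 4                               # toy input and hidden sizes
W = rng.standard_normal((4 * H, D)) * 0.1
U = rng.standard_normal((4 * H, H)) * 0.1
b = np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for x in rng.standard_normal((5, D)):     # a length-5 toy sequence
    h, c = lstm_step(x, h, c, W, U, b)
```

Because the output gate and tanh both saturate, the hidden state stays bounded, which is part of how the architecture controls gradient flow.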
Figure 2 presents the prediction generated by the deep-learning model (LSTM). The model produced a negative survival probability for year 5, which has no clinical or statistical interpretation.
This figure illustrates the Kaplan–Meier survival estimates for years 1–4 alongside a 5-year prediction generated by an LSTM neural network. Because the dataset consists of only four time points, the LSTM model lacks sufficient information to generalize and extrapolate outside the valid probability range (0–1), producing a negative survival estimate at year 5. This output illustrates the constraints of applying deep learning to extremely limited data.
This result occurs because the dataset is extremely small (only four annual observations), which provides insufficient information for a deep-learning model to learn meaningful temporal patterns. LSTM models require large, patient-level longitudinal datasets to generate stable and valid predictions [22]. When trained on highly limited data, the model may extrapolate outside the valid probability range (0–1), resulting in nonsensical negative values. Therefore, this output should be interpreted solely as an illustration of the modeling procedure, not as a realistic survival estimate. The accuracy of LSTM models can be largely improved using PM Generative AI.
5. The Weibull Method
The Weibull method is one of the most widely used parametric approaches in survival analysis, reliability engineering, and medical prognosis. The Weibull model provides a flexible, quantitative framework for describing how the probability of failure, death, or event occurrence changes over time. Its success comes from a rare combination of computational simplicity, biological interpretability, and strong predictive power.
At the heart of the Weibull method is the survival function:

S(t) = exp(−(λt)^k)    (4)

The corresponding hazard function is:

h(t) = kλ(λt)^(k−1)
where:
λ is the scale parameter, controlling how fast events occur, and
k is the shape parameter, controlling how the hazard changes over time.
This simple two-parameter structure allows the Weibull curve to model a wide range of real-world behaviors. When k<1, the hazard decreases over time (early high-risk situations); when k=1, the hazard is constant (equivalent to the exponential model); and when k>1, the hazard increases over time, which is exactly what is observed in many chronic diseases such as cancer and cardiovascular conditions.
In the Weibull survival model, the shape parameter k plays a central role in determining how risk evolves over time. When k>1, it indicates that the hazard rate is increasing with time, meaning that the longer a patient survives, the higher the instantaneous risk of experiencing the event (such as death) becomes. This behavior is expressed through the Weibull hazard function, which rises as a power of time when k>1.
From a clinical perspective, k>1 is especially important because it reflects progressive diseases and aging-related processes, where risk naturally accelerates over time. Many chronic conditions—such as cancer, heart failure, and neurodegenerative disorders—exhibit this pattern: early in the disease course, risk may be moderate, but as biological damage accumulates, vulnerability increases and outcomes worsen more rapidly.
Geometrically, k>1 produces a survival curve with a downward convex shape, meaning survival declines slowly at first and then drops more steeply as time passes. This accelerating decline captures the real-world dynamic of worsening health and increasing frailty. Unlike constant-risk models, the Weibull with k>1 acknowledges that time itself becomes a driver of risk.
Figure 3 shows several Weibull curves with different parameters k and λ.
In summary, k>1 represents an accelerating hazard, a steepening survival decline, and a biologically realistic model of progressive risk. This is precisely why the Weibull distribution is so powerful in medical prognosis and health prediction. It can represent not just whether risk exists, but how that risk grows over time.
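The three hazard regimes discussed above can be checked numerically. The sketch below assumes the rate-style parameterization S(t) = exp(−(λt)^k), which is consistent with the fitted values reported later in the paper; it is an illustration, not the authors' code:

```python
import numpy as np

def weibull_surv(t, k, lam):
    """S(t) = exp(-(lam*t)**k): Weibull survival function."""
    return np.exp(-(lam * t) ** k)

def weibull_hazard(t, k, lam):
    """h(t) = k*lam*(lam*t)**(k-1): rises with t when k>1, falls when k<1."""
    return k * lam * (lam * t) ** (k - 1)

t = np.array([0.5, 1.0, 2.0, 4.0])
h_rising  = weibull_hazard(t, k=2.0, lam=0.5)   # k>1: accelerating risk
h_flat    = weibull_hazard(t, k=1.0, lam=0.5)   # k=1: constant hazard (exponential)
h_falling = weibull_hazard(t, k=0.5, lam=0.5)   # k<1: early high-risk ("infant mortality")
```

Evaluating these three hazard curves side by side makes the geometric distinction concrete: only the k>1 curve grows with time, matching the accelerating decline described for progressive diseases.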
The main reason the Weibull curve is one of the best tools for prediction is its ability to extrapolate beyond observed data using a biologically meaningful hazard structure. Unlike the Kaplan–Meier estimator, which is purely empirical and only valid where data exist, the Weibull model assumes an underlying continuous survival process. Once its parameters are estimated, it can generate stable predictions into the future, even when follow-up data are sparse.
This makes the Weibull method especially valuable when:
1) Long-term follow-up is limited,
2) The sample size is small,
3) Or future survival must be projected for planning, policy, or clinical decision-making.
In the Weibull survival model, the shape parameter k determines how risk changes over time. When k<1, the model describes a situation in which the hazard rate decreases with time. This means that the risk of experiencing the event (such as death, equipment failure, or disease progression) is highest at the beginning and then declines as time passes. Mathematically, this behavior follows directly from the Weibull hazard function, which is a decreasing function of time when k<1.
From a clinical and epidemiological perspective, k<1 represents early high-risk phenomena. This pattern is often seen in acute conditions, post-surgical complications, neonatal mortality, or early drug-treatment toxicity. Patients who survive the initial high-risk period tend to become progressively more stable, leading to a declining hazard over time. In reliability engineering, this same pattern is known as “infant mortality” failure, where defective units fail early while the remaining population becomes increasingly robust.
Geometrically, k<1 produces a survival curve with a steep initial drop followed by a gradual flattening. Survival decreases rapidly at first and then levels off, reflecting the fact that early losses remove the most vulnerable individuals. This shape is fundamentally different from the accelerating decline seen when k>1.
In summary, k<1 characterizes a decelerating hazard, an early high-risk phase, and a progressively stabilizing survival pattern. It is especially valuable for modeling acute medical events, early treatment effects, and short-term failure processes where risk is front-loaded rather than cumulative.
Because the model is continuous and smooth, the predicted survival curve avoids the artificial stair-step behavior of Kaplan–Meier and instead provides a realistic approximation of the true biological survival process. Another major advantage of the Weibull method is its interpretability. Each parameter has a clear meaning:
1) The shape parameter k tells us whether risk is increasing, decreasing, or constant over time.
2) The scale parameter λ tells us how quickly failures or deaths occur.
This direct interpretability is critically important in medicine and public health, where models must be understood, trusted, and justified, not merely optimized for numerical accuracy. A clinician can look at a Weibull fit and immediately understand whether disease risk accelerates or stabilizes over time.
Compared with other parametric survival models, the Weibull model offers a unique balance:
1) More flexible than the exponential model,
2) More stable and interpretable than the log-normal,
3) Less sensitive to estimation error than many multi-parameter models.
In practice, the Weibull model often provides the best compromise between goodness-of-fit, stability, and predictive reliability. This is why it is widely used in cancer survival studies, device failure analysis, pharmacovigilance, and health-services research. In data science, the Weibull method plays a central role in:
1) Survival-based machine learning,
2) Deep learning survival hybrids,
3) Synthetic data generation and augmentation,
4) Extrapolation of Kaplan–Meier curves for decision support.
When combined with empirical survival estimates, the Weibull curve serves as a bridge between empirical data and predictive modeling, allowing researchers to move from “what has been observed” to “what is most likely to occur.”
The Weibull method remains one of the most powerful and reliable tools in survival analysis because it unites theory, flexibility, interpretability, and predictive strength in a single mathematical framework. Its ability to model increasing, decreasing, or constant risk makes it universally applicable across medicine, engineering, and population health. Most importantly, the Weibull curve does not merely describe past data—it provides a scientifically grounded way to predict the future, which is the ultimate goal of quantitative healthcare and epidemiology.
To find the best-fit Weibull function, we can take the natural log twice of (4):

ln(−ln S(t)) = k ln t + k ln λ

This represents a straight line in ln t. Using linear regression, we calculate the intercept and the slope of the best-fit function that approximates the KM curve (Figure 4).
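A minimal sketch of this double-log regression follows. It assumes the parameterization S(t) = exp(−(λt)^k); the four synthetic points below are placed exactly on a Weibull curve for illustration and are not the study's KM data:

```python
import numpy as np

def fit_weibull_loglog(times, surv):
    """Fit S(t) = exp(-(lam*t)**k) via the linearization
    ln(-ln S) = k*ln(t) + k*ln(lam), i.e. ordinary linear regression."""
    x = np.log(times)
    y = np.log(-np.log(surv))
    k, b = np.polyfit(x, y, 1)        # slope = k, intercept = k*ln(lam)
    lam = np.exp(b / k)
    return k, lam

t = np.array([1.0, 2.0, 3.0, 4.0])
s = np.exp(-(0.33 * t) ** 2.08)       # illustrative points on a Weibull curve
k, lam = fit_weibull_loglog(t, s)     # recovers k ≈ 2.08, lam ≈ 0.33
```

In practice the KM step estimates at each year would replace the synthetic `s`, and the regression residuals indicate how well a single Weibull curve describes the observed survival.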
The results of the Weibull optimization are presented in the following table:
Table 4. Results of Weibull optimization.
| Parameters k and λ | k = 2.08 and λ = 0.33 |
| Predicted survival rate at year 5 | 5.7% |
| Predicted survival rate at year 6 | 1.5% |
Because k>1 (and well above 1), this tells us:
1) The hazard increases with time (accelerating risk).
2) The survival curve bends steeply downward, which is exactly what we see in the predictions between years 4–6.
The fitted Weibull model (shape k=2.08, scale λ=0.33) predicts a steep decline in survival for pancreatic cancer, with survival falling from 16.6% at year 4 to 5.7% at year 5 and 1.5% at year 6, reflecting an accelerating hazard consistent with the highly lethal nature of this disease.
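These predictions can be reproduced directly from the fitted parameters, assuming the parameterization S(t) = exp(−(λt)^k); small differences from the reported figures arise from rounding k and λ to two decimals:

```python
import math

k, lam = 2.08, 0.33                       # fitted Weibull parameters (Table 4)

def surv(t):
    """S(t) = exp(-(lam*t)**k) with the fitted parameters."""
    return math.exp(-(lam * t) ** k)

# Close to the reported 16.6%, 5.7%, 1.5% (rounding of k and lam explains the gap).
s4, s5, s6 = surv(4), surv(5), surv(6)
```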
6. Stochastic Continuation of Weibull Method
To construct a stochastic continuation of the Weibull model, we employ the SAIHA (Stochastic and Augmented Interpretable Health Analytics) framework developed by de Melo (2025), in which the deterministic Weibull hazard is extended by introducing a random risk multiplier to capture population heterogeneity and uncertainty. While the standard Weibull model yields a single parametric survival curve, SAIHA produces a distribution of survival trajectories by propagating stochastic variability through the Weibull baseline.
SAIHA represents a modern and principled approach to predictive modeling in healthcare. Its central premise is that patients are not identical. This heterogeneity is captured through a risk multiplier R, defined as a random variable that represents unobserved clinical variability and augmented uncertainty. For each virtual patient j, a realization Rj is drawn from a specified distribution fR(r), allowing patient-specific stochastic scaling of the baseline hazard. Let us consider an example of a patient with pancreatic cancer. Each virtual patient j is assigned an individual-specific hazard:

h_j(t) = R_j h_0(t)

The patient-specific risk multiplier is defined as

R_j = exp(βᵀ X_j)

and h_0(t) is the baseline Weibull hazard. Then the individual survival function becomes:

S_j(t) = exp(−R_j H_0(t))

with H_0(t) = ∫_0^t h_0(u) du the cumulative baseline hazard. For the Weibull baseline, H_0(t) = (λt)^k, and the stochastic Weibull formula becomes:

S_j(t) = exp(−R_j (λt)^k)
Instead of just one curve, SAIHA constructs a large, augmented population of size N:
1) Draw R_j ~ f_R(r) for j = 1, …, N.
2) For each synthetic individual j, define survival at time t:

S_j(t) = exp(−R_j (λt)^k)    (10)

To compute the patient-specific risk multiplier R_j, both the covariate vector X_j and the corresponding regression coefficients β are required; the latter can be estimated from the study data via survival regression or obtained from published hazard ratios when sample size is limited.
This produces a distribution of survival probabilities at any fixed time t:

{ S_j(t) : j = 1, …, N }    (11)
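The augmentation loop above can be sketched in a few lines. The paper does not pin down f_R here, so the lognormal choice with median 1 and sigma = 0.5 below is an assumption of this sketch, not the study's specification:

```python
import numpy as np

rng = np.random.default_rng(42)
k, lam, N = 2.08, 0.33, 100_000           # fitted Weibull parameters, population size

# Assumed f_R: lognormal with median 1 (sigma = 0.5 is illustrative only).
R = rng.lognormal(mean=0.0, sigma=0.5, size=N)

def saiha_surv(t):
    """S_j(t) = exp(-R_j * (lam*t)**k): one survival value per virtual patient."""
    return np.exp(-R * (lam * t) ** k)

s5 = saiha_surv(5.0)                      # distribution of year-5 survival values
median5 = np.median(s5)                   # near the deterministic estimate when median(R) = 1
```

Inspecting the histogram, mode, median, and tails of `s5` is exactly the distribution-aware analysis SAIHA enables; with a median-1 multiplier the median survival tracks the deterministic Weibull value, while heavier-tailed choices of f_R shift the modal estimate downward.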
Let us now examine the predicted values for years 5 and 6. The distribution (11) can be presented as a histogram whose maximum corresponds to the predicted value (Figure 5):

S(5) ≈ 0.03 = 3%
This figure illustrates that when the histogram of predicted survival values peaks at 0.03, the most likely (modal) predicted survival probability at Year 5 is approximately 3%.
This is the maximum a posteriori (MAP) estimate of survival under the augmented model. In other words, among all simulated futures, 3% survival at Year 5 occurs most frequently. This reflects strong late-stage hazard acceleration and is fully consistent with the biological reality of pancreatic cancer, which has:
1) Poor long-term prognosis
2) Rapid progression after Year 3–4
3) Very low 5-year survival rates in most cohorts
The augmented survival distribution is sharply concentrated near 3%, indicating that the most probable 5-year survival for pancreatic cancer in this cohort is approximately 3%.
Figure 6 shows the histogram corresponding to year 6. The median (1.59%) aligns closely with the earlier deterministic Weibull prediction (~1.5%), confirming model coherence. The mean (2.81%) exceeds the median, indicating a right-skewed survival distribution driven by a small subset of lower-risk synthetic individuals. The long right tail reflects population heterogeneity, which is precisely what SAIHA is designed to capture. The dominant mass near very low survival confirms the extreme lethality of pancreatic cancer by Year 6.
This figure illustrates that when the histogram of predicted survival values peaks at 0.015, the most likely (modal) predicted survival probability at Year 6 is approximately 1.5%.
7. Discussion
A central finding of this study is the systematic discrepancy between the deterministic Weibull prediction and the stochastic SAIHA–Weibull prediction at Year 5. While the classical Weibull model extrapolated from the empirical Kaplan–Meier estimator yields a 5-year survival estimate of approximately 5.7%, the stochastic SAIHA continuation produces a most likely (modal) survival near 3%. This gap is not a statistical artifact but a clinically meaningful consequence of incorporating population heterogeneity and uncertainty into the survival process.
The deterministic Weibull model assumes a single, fixed hazard trajectory that represents an “average” patient. Under this framework, all individuals are implicitly treated as identical once the parameters k and λ are estimated. In contrast, the SAIHA formulation introduces a random risk multiplier R that transforms the Weibull hazard into a distribution of patient-specific hazards. As a result, instead of producing one survival value at Year 5, SAIHA generates a full probability distribution of survival outcomes. The modal value of this distribution reflects the most probable survival scenario across a heterogeneous population rather than the survival of an idealized average patient.
Clinically, this discrepancy is especially important for pancreatic cancer, a disease characterized by extreme biological heterogeneity and rapidly accelerating late-stage hazard. The deterministic Weibull estimate of 5.7% implicitly reflects an “average-risk” patient, whereas the SAIHA modal estimate of 3% reflects the dominant survival regime actually experienced by the majority of patients. In this setting, the deterministic model can appear optimistically biased, while the stochastic model provides a more conservative and clinically realistic assessment of long-term survival. This distinction becomes critical in high-stakes clinical decision-making, treatment counseling, and health-system planning.
Importantly, the observed discrepancy also highlights a broader limitation of purely parametric survival extrapolation under small or short follow-up datasets. When only a few early time points are available, the deterministic Weibull fit is highly sensitive to parameter estimation error and implicitly ignores unmeasured prognostic factors. By contrast, the SAIHA framework explicitly propagates parameter uncertainty and latent heterogeneity forward in time, yielding survival predictions that are distribution-aware rather than point-estimate driven. The resulting lower most-likely survival at Year 5 should therefore be interpreted not as model pessimism, but as a statistically principled correction for hidden risk.
Finally, the deterministic–stochastic discrepancy underscores the complementary role of classical and augmented survival modeling. The Weibull model remains essential for structural extrapolation, interpretability, and analytical tractability. However, SAIHA extends this structure into a probabilistic forecasting framework that better reflects real-world clinical variability. In practice, reporting both the deterministic Weibull estimate and the stochastic SAIHA distribution provides a dual perspective on prognosis: the former representing the average-risk trajectory, and the latter representing the most probable and uncertainty-adjusted survival scenario.
8. Conclusion
This study demonstrates how classical survival analysis can be strengthened through a stochastic extension that remains interpretable and robust under data scarcity. By combining the empirical foundation of the Kaplan–Meier estimator with the parametric structure of the Weibull model, and extending both through the proposed SAIHA framework, we transform a single deterministic survival curve into a full probabilistic distribution of survival outcomes.
Applied to pancreatic cancer, SAIHA reveals the profound late-stage lethality of the disease and quantifies survival uncertainty in a distribution-aware manner. While the pure Weibull model provides an average 5-year survival estimate, the stochastic SAIHA continuation identifies the most likely survival regime and explicitly captures heterogeneity, yielding more conservative and clinically realistic risk projections. This distinction is critical for high-risk diseases where reliance on a single curve can lead to misleadingly optimistic conclusions.
Importantly, SAIHA remains effective even when full covariate information is unavailable, as unmeasured heterogeneity is absorbed through a stochastic risk multiplier. This makes the framework particularly well-suited for small datasets, rare diseases, and registry-level survival studies where deep learning methods are often infeasible.
In summary, SAIHA provides a principled, interpretable, and distribution-aware extension of classical survival modeling. By unifying Kaplan–Meier estimation, Weibull parametric extrapolation, and stochastic augmentation, it offers a powerful new paradigm for survival prediction and clinical decision support under uncertainty.
Abbreviations
PH | Proportional Hazards |
AFT | Accelerated Failure Time |
DNNs | Deep Neural Networks |
CNNs | Convolutional Neural Networks |
RNNs | Recurrent Neural Networks |
EHR | Electronic Health Record |
KM | Kaplan–Meier |
PDAC | Pancreatic Ductal Adenocarcinoma |
LSTM | Long Short-Term Memory |
ML | Machine Learning |
SDI | Socio-Demographic Index |
SAIHA | Stochastic and Augmented Interpretable Health Analytics |
UI | Uncertainty Interval |
AI | Artificial Intelligence |
Acknowledgments
The authors thank Dr. Marie St. Rose for valuable comments and encouragement.
Conflicts of Interest
The authors declare no conflicts of interest.
References
[1] Kaplan, E. L., & Meier, P. (1958). Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53(282), 457–481. https://doi.org/10.2307/2281868
[2] Schober, P., & Vetter, T. R. (2018). Survival analysis and interpretation of time-to-event data. National Library of Medicine. https://pmc.ncbi.nlm.nih.gov/articles/PMC6110618/
[3] Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological), 34(2), 187–220.
[4] Schober, P., & Vetter, T. R. (2018). Survival analysis and interpretation of time-to-event data: The tortoise and the hare. Anesth Analg, 127(3), 792–798. https://doi.org/10.1213/ANE.0000000000003653
[5] Pang, M., Platt, R. W., Schuster, T., & Abrahamowicz, M. (2021). Spline-based accelerated failure time model. Stat Med, 40(2), 481–497. https://doi.org/10.1002/sim.8786
[6] Fleming, S. T. (2020). Managerial Epidemiology: Cases and Concepts (4th ed.). Health Administration Press; Harrell, F. E. (2015). Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis (2nd ed.). Springer.
[7] Arcuri, L. J., Souza Santos, F. P., Perini, G. F., & Hamerschlak, N. (2024). Fine and Gray or Cox model? Blood Adv, 8(6), 1420–1421. https://doi.org/10.1182/bloodadvances.2023012157
[8] She, Y., Jin, Z., Wu, J., Deng, J., Zhang, L., Su, H., Jiang, G., Liu, H., Xie, D., Cao, N., Ren, Y., & Chen, C. (2020). Development and validation of a deep learning model for non-small cell lung cancer survival. JAMA Netw Open, 3(6), e205842. https://doi.org/10.1001/jamanetworkopen.2020.5842
[9] Steingrimsson, J. A., & Morrison, S. (2020). Deep learning for survival outcomes. Stat Med, 39(17), 2339–2349. https://doi.org/10.1002/sim.8542
[10] Lee, C., Yoon, J., & van der Schaar, M. (2020). Dynamic-DeepHit: A deep learning approach for dynamic survival analysis with competing risks based on longitudinal data. IEEE Trans Biomed Eng, 67(1), 122–133. https://doi.org/10.1109/TBME.2019.2909027
[11] De Melo, P. (2025). Prediction modeling: Basic metabolic panel. Advances in Bioscience and Biotechnology, 16, 360–378. https://doi.org/10.4236/abb.2025.169024
[12] National Cancer Institute. (2023). Pancreatic cancer treatment (PDQ®)–Patient version. https://www.cancer.gov/types/pancreatic/patient/pancreatic-treatment-pdq
[13] American Cancer Society. (2025). Survival rates for pancreatic cancer. https://www.cancer.org/cancer/types/pancreatic-cancer/detection-diagnosis-staging/survival-rates.html
[14] Siegel, R. L., Miller, K. D., Wagle, N. S., & Jemal, A. (2024). Cancer statistics, 2024. CA: A Cancer Journal for Clinicians, 74(1), 12–43. https://doi.org/10.3322/caac.21820
[15] Bakasa, W., & Viriri, S. (2021). Pancreatic cancer survival prediction: A survey of the state-of-the-art. Comput Math Methods Med, 2021, 1188414. https://doi.org/10.1155/2021/1188414
[16] Osareh, A., & Shadgar, B. (2010). Machine learning techniques to diagnose breast cancer. 2010 5th International Symposium on Health Informatics and Bioinformatics, Ankara, Turkey, pp. 114–120.
[17] Safiyari, A., & Javidan, R. (2017). Predicting lung cancer survivability using ensemble learning methods. 2017 Intelligent Systems Conference (IntelliSys), London, UK, pp. 684–688.
[18] Osman, M. H. (2019). Pancreatic cancer survival prediction using machine learning and comparing its performance with TNM staging system and prognostic nomograms. AACR Annual Meeting 2019, Atlanta, GA.
[19] Song, W., Miao, D.-L., & Chen, L. (2018). Nomogram for predicting survival in patients with pancreatic cancer. OncoTargets and Therapy, 11, 539–545. https://doi.org/10.2147/OTT.S154599
[20] Chaves, D. O., Bastos, A. C., Almeida, A. M., Guerra, M. R., Teixeira, M. T. B., Melo, A. P. S., & Passos, V. M. A. (2022). The increasing burden of pancreatic cancer in Brazil from 2000 to 2019: Estimates from the Global Burden of Disease Study 2019. Rev Soc Bras Med Trop, 55(suppl 1), e0271. https://doi.org/10.1590/0037-8682-0271-2021
[21] De Melo, P. (2024). Public Health Informatics and Technology. Library of Congress, Washington, DC.
[22] Lanjewar, M. G., Panchbhai, K. G., & Patle, L. B. (2024). Fusion of transfer learning models with LSTM for detection of breast cancer using ultrasound images. Computers in Biology and Medicine, 169, Article 107914.
[23] de Melo, P., & St. Rose, M. (2025). Accurate classification of diabetes via PM Generative AI. Advances in Bioscience and Biotechnology, 16, 379–409. https://doi.org/10.4236/abb.2025.169025
[24] Ying, G. S., & Heitjan, D. F. (2008). Weibull prediction of event times in clinical trials. Pharm Stat, 7(2), 107–120. https://doi.org/10.1002/pst.271
Cite This Article
@article{10.11648/j.crj.20251304.14,
author = {Philip de Melo and Michele DiLella and Tameka Holman and Shakira McElveen},
title = {Accurate Prediction of Survival Based on Kaplan–Meier Analytics},
journal = {Cancer Research Journal},
volume = {13},
number = {4},
pages = {173-185},
doi = {10.11648/j.crj.20251304.14},
url = {https://doi.org/10.11648/j.crj.20251304.14},
eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.crj.20251304.14},
year = {2025}
}