Accurate suicide risk prediction in clinical practice is hindered by stringent privacy regulations, fragmented data ownership, and pronounced heterogeneity across healthcare institutions in patient demographics, symptom severity, and social determinants of health. To address these challenges, we propose a federated learning (FL) framework for binary suicide-risk stratifi-cation (high-risk vs. lower-risk) that enables collaborative model training across hospitals without sharing raw patient data. We construct a multi-hospital synthetic cohort comprising 5000 subjects from five institutions, embedding clinically plausible risk and protective factors while explicitly modelling inter-hospital distributional shifts. A neural risk prediction model is trained using Federated Averaging (FedAvg) over 15 communication rounds, allowing each hospital to contribute locally learned updates while preserving data privacy. The proposed FL approach achieves a final global accuracy of 0.942 and a global AUC-ROC of 0.9568, closely matching centralized training performance (0.945 accuracy; 0.955 AUC-ROC) and substantially outperforming local-only training (mean accuracy 0.930; mean AUC-ROC 0.8962). Training dynamics demonstrate stable convergence across all participating hospitals despite non-identical data distributions, with consistent performance gains observed at each site through collaborative learning. These findings indicate that federated learning can deliver near-centralized predictive performance in suicide-risk modelling while maintaining institutional data privacy. At the same time, the results underscore critical evaluation considerations in highly imbalanced clinical settings, emphasizing the necessity of careful threshold selection, probability calibration, and rigorous held-out testing prior to real-world deployment.
Cite this paper
Filippis, R. D. and Foysal, A. A. (2026). Federated Learning for Suicide Risk Prediction across Heterogeneous Hospitals Using Privacy-Preserving Synthetic Data
. Open Access Library Journal, 13, e14921. doi: http://dx.doi.org/10.4236/oalib.1114921.
Oaten, A., Jordan, A., Chandler, A. and Marzetti, H. (2022) Suicide Prevention as Biopolitical Surveillance: A Critical Analysis of UK Suicide Prevention Policies. Critical Social Policy, 43, 654-675. https://doi.org/10.1177/02610183221142544
Sandman, L. and Liliemark, J. (2023) Should Severity Assessments in Healthcare Priority Setting Be Risk- and Time-Sensitive? Health Care Analysis, 31, 169-185. https://doi.org/10.1007/s10728-023-00460-0
David Rudd, M., Bryan, C.J., Jobes, D.A., Feuerstein, S. and Conley, D. (2022) A Standard Proto-col for the Clinical Management of Suicidal Thoughts and Behavior: Implications for the Suicide Prevention Narrative. Frontiers in Psychiatry, 13, Article 929305. https://doi.org/10.3389/fpsyt.2022.929305
Torok, M., Han, J., Baker, S., Werner-Seidler, A., Wong, I., Larsen, M.E. and Christensen, H. (2020) Suicide Prevention Using Self-Guided Digital Interventions: A Systematic Review and Meta-Analysis of Randomised Controlled Trials. The Lancet Digital Health, 2, e25-e36. https://doi.org/10.1016/S2589-7500(19)30199-2
Kirtley, O.J., van Mens, K., Hoogendoorn, M., Kapur, N. and de Beurs, D. (2022) Trans-lating Promise into Practice: A Review of Machine Learning in Suicide Research and Prevention. The Lancet Psychiatry, 9, 243-252. https://doi.org/10.1016/s2215-0366(21)00254-6
Ehtemam, H., Sadeghi Esfahlani, S., Sanaei, A., Ghaemi, M.M., Hajesmaeel-Gohari, S., Rahimi-sadegh, R., et al. (2024) Role of Machine Learning Algorithms in Suicide Risk Prediction: A Systematic Review-Meta Analysis of Clinical Studies. BMC Medical Informatics and Decision Making, 24, Article No. 138. https://doi.org/10.1186/s12911-024-02524-0
Su, C., Aseltine, R., Doshi, R., Chen, K., Rogers, S.C. and Wang, F. (2020) Machine Learning for Suicide Risk Prediction in Children and Adolescents with Electronic Health Records. Translational Psychiatry, 10, Article No. 413. https://doi.org/10.1038/s41398-020-01100-0
Richardson, P.G., San Miguel, J.F., Moreau, P., Hajek, R., Dimopoulos, M.A., Laubach, J.P., et al. (2018) Interpreting Clinical Trial Data in Multiple Myeloma: Translating Findings to the Real-World Setting. Blood Cancer Journal, 8, Article No. 109. https://doi.org/10.1038/s41408-018-0141-0
Gilron, I., Blyth, F. and Smith, B.H. (2019) Translating Clinical Trials into Improved Real-World Man-agement of Pain: Convergence of Translational, Population-Based, and Primary Care Research. Pain, 161, 36-42. https://doi.org/10.1097/j.pain.0000000000001684
Chatterjee, S., Din-darian, A. and Rengaraju, U. (2025) Preserving Data Assessment, Privacy in Mental Healthcare: Ensuring Authenticity, Confidentiality, and Security in Data Integration from Diverse Source. In: Chatterjee, S., Dindarian, A. and Rengaraju, U., Eds., Revolutionizing Youth Mental Health with Ethical AI, Apress, 185-253. https://doi.org/10.1007/979-8-8688-1186-9_6
Zhang, H., Mao, Y., Lin, Y. and Zhang, D. (2025) E-mental Health in the Age of AI: Data Safety, Privacy Regulations and Recommendations. Alpha Psychiatry, 26, Article No. 44279. https://doi.org/10.31083/ap44279
Hazeem, H. and AlBurshaid, E. (2024) Fragmented Data Landscape and Data Asymmetries in the Real Estate Industry. In: Jreisat, A. and Mili, M., Eds., Blockchain in Real Estate, Springer, 179-205. https://doi.org/10.1007/978-981-99-8533-3_10
Murphy, N.C., Burke, N., Breathnach, F.M., Burke, G., McAuliffe, F.M., Morrison, J.J., et al. (2020) Inter-Hospital Comparison of Cesarean Delivery Rates Should Not Be Considered to Reflect Quality of Care without Consideration of Patient Hetero-geneity: An Observational Study. European Journal of Obstetrics & Gynecology and Reproductive Biology, 250, 112-116. https://doi.org/10.1016/j.ejogrb.2020.05.003
Fassett, K.T., Wolcott, M.D., Harpe, S.E. and McLaughlin, J.E. (2022) Considerations for Writing and Including Demographic Variables in Education Research. Currents in Pharmacy Teaching and Learning, 14, 1068-1078. https://doi.org/10.1016/j.cptl.2022.07.022
Long, G., Shen, T., Jiang, J. and Blumenstein, M. (2024) Du-al-Personalizing Adapter for federated Foundation Models. Advances in Neural Information Processing Systems, 37, 39409-39433. https://doi.org/10.52202/079017-1245
Aggarwal, M., Khullar, V. and Goyal, N. (2024) A Comprehensive Review of Federated Learning: Methods, Applications, and Challenges in Privacy-Preserving Collaborative Model Train-ing. In: Singh, J., Goyal, S.B., Kaushal, R.K., Kumar, N. and Sehra, S.S., Eds., Ap-plied Data Science and Smart Systems, CRC Press, 570-575. https://doi.org/10.1201/9781003471059-73
Hasan, M.T. and Kudapa, S.P. (2021) Data Privacy-Aware Machine Learning and Federated Learning: A Framework for Data Security. American Journal of Interdisciplinary Studies, 2, 1-34. https://doi.org/10.63125/vj1hem03
Chowdhury, T.K. and Kudapa, S.P. (2024) Federated Learning Models for Privacy-Preserving Data Sharing and Secure Analytics in Healthcare Industry. International Journal of Business and Economics Insights, 4, 91-133. https://doi.org/10.63125/c2dzn006
Lyu, L., Yu, J., Nandakumar, K., Li, Y., Ma, X., Jin, J., et al. (2020) Towards Fair and Privacy-Preserving Federated Deep Models. IEEE Transactions on Parallel and Distributed Systems, 31, 2524-2541. https://doi.org/10.1109/tpds.2020.2996273
Bokhari, M.U., Yadav, G. and Zeyauddin, M. (2024) Exploring Ensemble-Based Approaches for Granular Suicide Risk Assessment: A Comprehensive Framework in Therapeutic Informatics. International Journal of Information Technology. https://doi.org/10.1007/s41870-024-02060-0
Tran, T., Phung, D., Luo, W., Harvey, R., Berk, M. and Venkatesh, S. (2013) An Integrated Framework for Suicide Risk Prediction. Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, 11-14 August 2013, 1410-1418. https://doi.org/10.1145/2487575.2488196
Dormont, B. and Milcent, C. (2005) How to Regulate Heterogeneous Hospitals? Journal of Economics & Management Strategy, 14, 591-621. https://doi.org/10.1111/j.1530-9134.2005.00075.x
Giuseppi, A., Tor-re, L.D., Menegatti, D., Priscoli, F.D., Pietrabissa, A. and Poli, C. (2022) An Adaptive Model Averaging Procedure for Federated Learning (AdaFed). Journal of Advances in Information Technology, 13, 539-548. https://doi.org/10.12720/jait.13.6.539-548
Manoj, T., Makkithaya, K. and Narendra, V.G. (2022) A Federated Learning-Based Crop Yield Prediction for Agricultural Production Risk Management. 2022 IEEE Delhi Section Confer-ence (DELCON), New Delhi, 11-13 February 2022, 1-7. https://doi.org/10.1109/delcon54057.2022.9752836
Brewer, B.B., Carley, K.M., Benham-Hutchins, M., Effken, J.A. and Reminga, J. (2020) Exploring the Stability of Communica-tion Network Metrics in a Dynamic Nursing Context. Social Networks, 61, 11-19. https://doi.org/10.1016/j.socnet.2019.08.003
Han, D., Kolli, K.K., Gransar, H., Lee, J.H., Choi, S., Chun, E.J., et al. (2020) Machine Learning Based Risk Prediction Model for Asymptomatic Individuals Who Underwent Coronary Artery Calcium Score: Comparison with Traditional Risk Prediction Approaches. Journal of Cardiovascular Computed Tomography, 14, 168-176. https://doi.org/10.1016/j.jcct.2019.09.005
Lyssenko, V., Jonsson, A., Almgren, P., Pulizzi, N., Isomaa, B., Tuomi, T., et al. (2008) Clinical Risk Factors, DNA Variants, and the Development of Type 2 Diabetes. New England Journal of Medicine, 359, 2220-2232. https://doi.org/10.1056/nejmoa0801869
Özyürekoğlu, T., McCabe, S.J., Goldsmith, L.J. and LaJoie, A.S. (2006) The Minimal Clinically Important Differ-ence of the Carpal Tunnel Syndrome Symptom Severity Scale. The Journal of Hand Surgery, 31, 733-738. https://doi.org/10.1016/j.jhsa.2006.01.012
Anestis, M.D. (2016) Prior Sui-cide Attempts Are Less Common in Suicide Decedents Who Died by Firearms Relative to Those Who Died by Other Means. Journal of Affective Disorders, 189, 106-109. https://doi.org/10.1016/j.jad.2015.09.007
Rutter, M. and Sandberg, S. (1992) Psychosocial Stressors: Concepts, Causes and Effects. European Child & Adolescent Psychiatry, 1, 3-13. https://doi.org/10.1007/bf02084429
Kuhle, S., Maguire, B., Zhang, H., Hamilton, D., Allen, A.C., Joseph, K.S., et al. (2018) Comparison of Logistic Re-gression with Machine Learning Methods for the Prediction of Fetal Growth Abnormalities: A Retrospective Cohort Study. BMC Pregnancy and Childbirth, 18, Article No. 333. https://doi.org/10.1186/s12884-018-1971-2
Park, S. and Byun, J. (2021) A Study of Predictive Models for Early Outcomes of Post-Prostatectomy Incontinence: Machine Learning Approach vs. Logistic Re-gression Analysis Approach. Applied Sciences, 11, Article 6225. https://doi.org/10.3390/app11136225
Amini, P., Ahmadinia, H., Poorolajal, J. and Amiri, M.M. (2016) Evaluating the High Risk Groups for Suicide: A Comparison of Logistic Regression, Support Vector Machine, Decision Tree and Artificial Neural Network. Iranian Journal of Public Health, 45, 1179-1187.
Tran, A.T., Zeevi, T. and Payabvash, S. (2025) Strategies to Improve the Robustness and Generalizability of Deep Learning Segmentation and Classification in Neuroimaging. BioMe-dInformatics, 5, Article 20. https://doi.org/10.3390/biomedinformatics5020020
Liu, J. and Cui, P. (2025) Data Heterogeneity Modeling for Trustworthy Machine Learning. Pro-ceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2, Toronto, 3-7 August 2025, 6086-6095. https://doi.org/10.1145/3711896.3736560
Paschali, M., Conjeti, S., Navarro, F. and Navab, N. (2018) Generalizability Vs. Robustness: Investigating Medical Imaging Networks Using Adversarial Examples. In: Frangi, A., Schnabel, J., Davatzikos, C., Alberola-López, C. and Fichtinger, G., Eds., Medical Image Computing and Computer Assisted Intervention—MICCAI 2018, Springer In-ternational Publishing, 493-501. https://doi.org/10.1007/978-3-030-00928-1_56
Kazmierski, M., Welch, M., Kim, S., McIntosh, C., Rey-McIntyre, K., Huang, S.H., et al. (2023) Mul-ti-Institutional Prognostic Modeling in Head and Neck Cancer: Evaluating Impact and Generalizability of Deep Learning and Radiomics. Cancer Research Commu-nications, 3, 1140-1151. https://doi.org/10.1158/2767-9764.crc-22-0152
Jayne, M.E.A. and Dipboye, R.L. (2004) Leveraging Diversity to Improve Business Performance: Research Findings and Recommendations for Organizations. Human Resource Management, 43, 409-424. https://doi.org/10.1002/hrm.20033
Anthonysamy, P., Rashid, A. and Chitchyan, R. (2017) Privacy Requirements: Present & Future. 2017 IEEE/ACM 39th International Conference on Software Engineering: Software Engineering in Society Track (ICSE-SEIS), Bueons Aires, 20-28 May 2017, 13-22. https://doi.org/10.1109/icse-seis.2017.3
Alhassan, I., Sammon, D. and Daly, M. (2016) Data Governance Activities: An Analysis of the Literature. Journal of Decision Systems, 25, 64-75. https://doi.org/10.1080/12460125.2016.1187397
van Panhuis, W.G., Paul, P., Emerson, C., Grefenstette, J., Wilder, R., Herbst, A.J., et al. (2014) A Systematic Review of Barriers to Data Sharing in Public Health. BMC Public Health, 14, Article No. 1144. https://doi.org/10.1186/1471-2458-14-1144
Fassnacht, M., Benz, C., Heinz, D., Leimstoll, J. and Satzger, G. (2023) Barriers to Data Sharing among Private Sector Organizations. Proceedings of the Annual Hawaii International Conference on System Sciences, Maui, 3-6 January 2023, 3695-3704. https://doi.org/10.24251/hicss.2023.453
Bensusan, H., and Kalousis, A. (2001) Estimating the Predictive Accuracy of a Classifier. In: European Confer-ence on Machine Learning, Springer Berlin Heidelberg, 25-36. https://doi.org/10.1007/3-540-44795-4_3
Naiseh, M., Al-Thani, D., Jiang, N. and Ali, R. (2023) How the Different Explanation Classes Impact Trust Calibration: The Case of Clinical Decision Support Systems. International Journal of Human-Computer Studies, 169, Article ID: 102941. https://doi.org/10.1016/j.ijhcs.2022.102941
Alsoud, D., Sabino, J., Fer-rante, M., Verstockt, B. and Vermeire, S. (2025) Calibration, Clinical Utility, and Specificity of Clinical Decision Support Tools in Inflammatory Bowel Disease. Clinical Gastroenterology and Hepatology, 23, 1216-1227.e14. https://doi.org/10.1016/j.cgh.2024.09.020
Goe, L., Holdheide, L. and Miller, T. (2011) A Practical Guide to Designing Comprehensive Teacher Evalu-ation Systems: A Tool to Assist in the Development of Teacher Evaluation Sys-tems. National Comprehensive Center for Teacher Quality.
Wind, A., Oberst, S., Westerhuis, W., Blaauwgeers, H., Sæter, G., de Paoli, P., et al. (2023) Evaluating Comprehensive Cancer Networks; a Review of Standards and Evalu-ation Methods for Care Networks to Inform a Comparison with the OECI Com-prehensive Cancer Network Standards. Acta Oncologica, 62, 15-24. https://doi.org/10.1080/0284186x.2023.2170275
Linden, A. (2006) Measuring Diagnostic and Predictive Accuracy in Disease Management: An Introduction to Receiver Operating Characteristic (ROC) Analysis. Journal of Evaluation in Clinical Practice, 12, 132-139. https://doi.org/10.1111/j.1365-2753.2005.00598.x
Yang, S. and Ber-dine, G. (2017) The Receiver Operating Characteristic (ROC) Curve. The Southwest Respiratory and Critical Care Chronicles, 5, 34-36. https://doi.org/10.12746/swrccc.v5i19.391
Miller, M., Hafner, M., Sontag, E., Davidsohn, N., Subramanian, S., et al. (2012) Modular Design of Ar-tificial Tissue Homeostasis: Robust Control through Synthetic Cellular Hetero-geneity. PLOS Computational Biology, 8, e1002579. https://doi.org/10.1371/journal.pcbi.1002579
Doudchenko, N., Khosra-vi, K., Pouget-Abadie, J., Lahaie, S., Lubin, M., Mirrokni, V. and Spiess, J. (2021) Synthetic Design: An Optimization Approach to Experimental Design with Syn-thetic Controls. Advances in Neural In-formation Processing Systems, 34, 8691-8701.
Zhou, M., Li, J., Basu, R. and Ferreira, J. (2022) Creating Spa-tially-Detailed Heterogeneous Synthetic Populations for Agent-Based Microsim-ulation. Computers, Environment and Urban Systems, 91, Article ID: 101717. https://doi.org/10.1016/j.compenvurbsys.2021.101717
Byrnes, J.E.K. and Dee, L.E. (2025) Causal Inference with Observational Data and Unobserved Confounding Varia-bles. Ecology Letters, 28, e70023. https://doi.org/10.1111/ele.70023
Tchetgen Tchetgen, E. (2013) The Control Outcome Calibration Approach for Causal Inference with Unobserved Confounding. American Journal of Epidemiology, 179, 633-640. https://doi.org/10.1093/aje/kwt303
Bovens, A.M.P.M., van Baak, M.A., Vrencken, J.G.P.M., Wijnen, J.A.G. and Verstappen, F.T.J. (1990) Variability and Reliability of Joint Measurements. The American Journal of Sports Medicine, 18, 58-63. https://doi.org/10.1177/036354659001800110
Medina, L.S. and Zurakowski, D. (2003) Measurement Variability and Confidence Intervals in Medicine: Why Should Radiologists Care? Radiology, 226, 297-301. https://doi.org/10.1148/radiol.2262011537
Blonde, L., Khunti, K., Har-ris, S.B., Meizinger, C. and Skolnik, N.S. (2018) Interpretation and Impact of Re-al-World Clinical Data for the Practicing Clinician. Advances in Therapy, 35, 1763-1774. https://doi.org/10.1007/s12325-018-0805-y
Chen, J., Ho, M., Lee, K., Song, Y., Fang, Y., Goldstein, B.A., et al. (2021) The Current Land-scape in Biostatistics of Real-World Data and Evidence: Clinical Study Design and Analysis. Statistics in Biopharmaceutical Research, 15, 29-42. https://doi.org/10.1080/19466315.2021.1883474
Ziemer, R. (1967) Error Probabilities Due to Additive Combinations of Gaussian and Impulsive Noise. IEEE Transactions on Communication Technology, 15, 471-474. https://doi.org/10.1109/tcom.1967.1089608
Le Montagner, Y., Ange-lini, E.D. and Olivo-Marin, J. (2014) An Unbiased Risk Estimator for Image De-noising in the Presence of Mixed Poisson-Gaussian Noise. IEEE Transactions on Image Processing, 23, 1255-1268. https://doi.org/10.1109/tip.2014.2300821
Li, N., Kolmanovsky, I. and Girard, A. (2021) An Analytical Safe Approximation to Joint Chance-Constrained Programming with Additive Gaussian Noises. IEEE Trans-actions on Automatic Control, 66, 5490-5497. https://doi.org/10.1109/tac.2021.3051000
Arpit, D., Zhou, Y.B., Kota, B. and Govindaraju, V. (2016) Normalization Propagation: A Parametric Tech-nique for Removing Internal Covariate Shift in Deep Networks. International Conference on Machine Learning 2016, New York, 19-24 June 2016, 1168-1176.
Awais, M., Bin Iqbal, M.T. and Bae, S. (2021) Revisiting In-ternal Covariate Shift for Batch Normalization. IEEE Transactions on Neural Networks and Learning Systems, 32, 5082-5092. https://doi.org/10.1109/tnnls.2020.3026784
Kiliçarslan, S. and Celik, M. (2021) Rsigelu: A Nonlinear Activation Function for Deep Neural Networks. Expert Systems with Applications, 174, Article ID: 114805. https://doi.org/10.1016/j.eswa.2021.114805
Banerjee, C., Mukherjee, T. and Pasiliao, E. (2020) The Multi-Phase ReLU Activation Function. Proceed-ings of the 2020 ACM Southeast Conference, Tampa, 2-4 April 2020, 239-242. https://doi.org/10.1145/3374135.3385313
Rahman, S., Jiang, L.Y., Ga-briel, S., Aphinyanaphongs, Y., Oermann, E.K. and Chunara, R. (2024) Gener-alization in Healthcare AI: Evaluation of a Clinical Large Language Model. arXiv: 2402.10965.
Zhang, Z.L. and Sabuncu, M.R. (2018) Generalized Cross Entropy Loss for Training Deep Neural Networks with Noisy Labels. Proceedings of the 32nd International Conference on Neural Information Processing Sys-tems, Montréal, 3-8 December 2018, 8792-8802.
Wu, Y., Du, K., Wang, X. and Min, F. (2024) Misclassification-Guided Loss under the Weighted Cross-Entropy Loss Framework. Knowledge and Information Systems, 66, 4685-4720. https://doi.org/10.1007/s10115-024-02123-5
Ahmed, A., Sun, G., Saadeldin, M., Bilal, A., Li, Y., Osman, M., et al. (2025) Efficient Melanoma De-tection Using Pixel Intensity-Based Masking and Intensity-Weighted Binary Cross-Entropy. International Journal of Imaging Systems and Technology, 35, e70179. https://doi.org/10.1002/ima.70179
Rachakonda, S., Moorthy, S., Jain, A., Bukharev, A., Bucur, A., Manni, F., et al. (2023) Privacy Enhancing and Scalable Federated Learning to Accelerate AI Implementation in Cross-Silo and IoMT Environments. IEEE Journal of Biomedical and Health Informatics, 27, 744-755. https://doi.org/10.1109/jbhi.2022.3185418
Albarqouni, S., Andreux, M., Avestimehr, S., Ayed, S., Bellet, A., Cyffers, E., et al. (2022) Flamby: Datasets and Benchmarks for Cross-Silo Federated Learning in Realistic Healthcare Settings. Advances in Neural Information Processing Systems 35, New Orleans, 28 November-9 December 2022, 5315-5334. https://doi.org/10.52202/068431-0384
Pillutla, K., Kakade, S.M. and Harchaoui, Z. (2022) Robust Aggregation for Federated Learning. IEEE Transactions on Signal Processing, 70, 1142-1154. https://doi.org/10.1109/tsp.2022.3153135
Khan, N., Nisar, S., Khan, M.A., Ur Rehman, Y.A., Noor, F. and Barb, G. (2025) Optimizing Federated Learning with Aggregation Strategies: A Comprehensive Survey. IEEE Open Journal of the Computer Society, 6, 1227-1247. https://doi.org/10.1109/ojcs.2025.3590102
Volinsky, C.T., Madigan, D., Raftery, A.E. and Kronmal, R.A. (1997) Bayesian Model Averaging in Propor-tional Hazard Models: Assessing the Risk of a Stroke. Journal of the Royal Statis-tical Society Series C: Applied Statistics, 46, 433-448. https://doi.org/10.1111/1467-9876.00082
Kelley, J.G. and Simmons, B.A. (2019) Introduction: The Power of Global Performance Indicators. Interna-tional Organization, 73, 491-510. https://doi.org/10.1017/s0020818319000146
Ruby, U. and Yendapalli, V. (2020) Binary Cross Entropy with Deep Learning Technique for Image Clas-sification. International Journal of Advanced Trends in Computer Science and Engineering, 9, 5393-5397. https://doi.org/10.30534/ijatcse/2020/175942020
Tang, X., Guo, S. and Guo, J. (2022) Personalized Federated Learning with Contextualized Generalization. Proceed-ings of the Thirty-First International Joint Conference on Artificial Intelligence, Vienna, 23-29 July 2022, 2241-2247. https://doi.org/10.24963/ijcai.2022/311
Marulli, F., Verde, L., Marrone, S., Barone, R. and De Biase, M.S. (2021) Evaluating Efficiency and Effectiveness of Federated Learning Approaches in Knowledge Extraction Tasks. 2021 Inter-national Joint Conference on Neural Networks (IJCNN), Shenzhen, 18-22 July 2021, 1-6. https://doi.org/10.1109/ijcnn52387.2021.9533946
Ali, W., Kumar, R., Deng, Z., Wang, Y. and Shao, J. (2021) A Federated Learning Ap-proach for Privacy Protection in Context-Aware Recommender Systems. The Computer Journal, 64, 1016-1027. https://doi.org/10.1093/comjnl/bxab025
Elbir, A.M., Coleri, S. and Mishra, K.V. (2021) Hybrid Federated and Centralized Learning. 2021 29th European Signal Processing Conference (EUSIPCO), Dublin, 23-27 August 2021, 1541-1545. https://doi.org/10.23919/eusipco54536.2021.9616120
Li, C., Li, C., Zhao, Y., Zhang, B. and Li, C. (2021) An Efficient Multi-Model Training Algorithm for Federated Learning. 2021 IEEE Global Communications Conference (GLOBECOM), Madrid, 7-11 December 2021, 1-6. https://doi.org/10.1109/globecom46510.2021.9685230
Jiménez-Valverde, A. (2014) Thresh-old-Dependence as a Desirable Attribute for Discrimination Assessment: Impli-cations for the Evaluation of Species Distribution Models. Biodiversity and Con-servation, 23, 369-385. https://doi.org/10.1007/s10531-013-0606-1
Abdulrazaq, M. (2023) Rare-Event Prediction in Imbalanced Data: A Unified Evaluation and Optimiza-tion Framework for High-Risk Systems. Communication In Physical Sciences, 9, 968-979.
Iturbe-Araya, J. and Rifà-Pous, H. (2025) Hyperparameter Op-timization and Evaluation Metrics for Unsupervised Anomaly-Based Cyberattack Detection in Imbalanced Smart Home Datasets. Journal of Network and Systems Management, 33, Article No. 99. https://doi.org/10.1007/s10922-025-09973-6
Pepe, M.S., Longton, G. and Janes, H. (2009) Estimation and Comparison of Receiver Operating Char-acteristic Curves. The Stata Journal: Promoting communications on statistics and Stata, 9, 1-16. https://doi.org/10.1177/1536867x0900900101
Movahedi, F., Padman, R. and Antaki, J.F. (2023) Limitations of Receiver Operating Characteristic Curve on Imbalanced Data: Assist Device Mortality Risk Scores. The Journal of Thoracic and Cardiovascular Surgery, 165, 1433-1442.e2. https://doi.org/10.1016/j.jtcvs.2021.07.041
Grau, J., Grosse, I. and Keilwagen, J. (2015) PRROC: Computing and Visualizing Precision-Recall and Receiver Operating Characteristic Curves in R. Bioinformatics, 31, 2595-2597. https://doi.org/10.1093/bioinformatics/btv153
Obuchowski, N.A. and Bul-len, J.A. (2018) Receiver Operating Characteristic (ROC) Curves: Review of Methods with Applications in Diagnostic Medicine. Physics in Medicine & Biolo-gy, 63, 07TR01. https://doi.org/10.1088/1361-6560/aab4b1
Van der Maat, E. (2021) Simplified Complexity: Analytical Strategies for Conflict Event Research. Conflict Management and Peace Science, 38, 87-108. https://doi.org/10.1177/0738894218771077
Davis, J. and Goadrich, M. (2006) The Relationship between Precision-Recall and ROC Curves. Proceed-ings of the 23rd International Conference on Machine Learning—ICML’06, Pittsburgh, 25-29 June 2006, 233-240. https://doi.org/10.1145/1143844.1143874
Doumpos, M. and Pasiour-as, F. (2005) Developing and Testing Models for Replicating Credit Ratings: A Multicriteria Approach. Computational Economics, 25, 327-341. https://doi.org/10.1007/s10614-005-6412-4
Jin Huang, and Ling, C.X. (2005) Using AUC and Accuracy in Evaluating Learning Algorithms. IEEE Transactions on Knowledge and Data Engineering, 17, 299-310. https://doi.org/10.1109/tkde.2005.50
Carrington, A.M., Manuel, D.G., Fieguth, P.W., Ramsay, T., Osmani, V., Wernly, B., et al. (2023) Deep ROC Anal-ysis and AUC as Balanced Average Accuracy, for Improved Classifier Selection, Audit and Explanation. IEEE Transactions on Pattern Analysis and Machine In-telligence, 45, 329-341. https://doi.org/10.1109/tpami.2022.3145392
Kovacs, I., Iosub, A., Țopa, M., Buzo, A. and Pelz, G. (2019) A Gradient-Based Sensitivity Analysis Method for Complex Systems. 2019 IEEE 25th International Symposium for Design and Technology in Electronic Packaging (SIITME), Cluj-Napoca, 23-26 October 2019, 333-338. https://doi.org/10.1109/siitme47687.2019.8990871
Lin, N. and Ensel, W.M. (1989) Life Stress and Health: Stressors and Resources. American Socio-logical Review, 54, 382-399. https://doi.org/10.2307/2095612
Hovenkamp-Hermelink, J.H.M., van der Veen, D.C., Oude Voshaar, R.C., Batelaan, N.M., Penninx, B.W.J.H., Jeronimus, B.F., et al. (2019) Anxiety Sensitivity, Its Stability and Longitudinal Association with Severity of Anxiety Symptoms. Scientific Reports, 9, Article No. 4314. https://doi.org/10.1038/s41598-019-39931-7
Pavlou, M., Ambler, G., Seaman, S.R., Guttmann, O., Elliott, P., King, M., et al. (2015) How to Develop a More Accurate Risk Prediction Model When There Are Few Events. BMJ, 351, h3868. https://doi.org/10.1136/bmj.h3868
Hamon, R., Junklewitz, H. and Sanchez, I. (2020) Robustness and Explainability of Artificial Intelligence. Publications Office of the European Union 207, No. 40.
Battula, J., Jillelamudi, V.A., Sammeta, C.K. and Amilpur, S. (2025) Exploring Cancer Ge-nomics with Graph Convolutional Networks: A Comparative Explainability Study with Integrated Gradients and SHAP. BIO Web of Conferences, 163, Article ID: 01003. https://doi.org/10.1051/bioconf/202516301003
Gupta, S., Kumar, S., Chang, K., Lu, C., Singh, P. and Kalpathy-Cramer, J. (2023) Collaborative Privacy-Preserving Ap-proaches for Distributed Deep Learning Using Multi-Institutional Data. Radi-oGraphics, 43, e220107. https://doi.org/10.1148/rg.220107
Monika, T., Kishor Kumar Reddy, C., Puttanapura, J. and Doss, S. (2025) Optimizing Neural Disorder Treatment through Federated Learning and Multi-Institutional Data Collaboration. In: Kishor Kumar Reddy, C. and Nag, A., Eds., Federated Learn-ing for Neural Disorders in Healthcare 6.0, CRC Press, 120-157.
Guo, P., Wang, P., Zhou, J., Jiang, S. and Patel, V.M. (2021) Multi-Institutional Collabora-tions for Improving Deep Learning-Based Magnetic Resonance Image Recon-struction Using Federated Learning. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, 20-25 June 2021, 2423-2432. https://doi.org/10.1109/cvpr46437.2021.00245
Marwaha, J.S., Downing, M., Halamka, J., Abernethy, A., Franklin, J.B., Anderson, B., et al. (2024) Mobilizing Data during a Crisis: Building Rapid Evidence Pipelines Using Multi-Institutional Real World Data. Healthcare, 12, Article ID: 100738. https://doi.org/10.1016/j.hjdsi.2024.100738