An AI Enabled Federated Learning Architecture for Secure Large Scale Predictive Analytics across Finance and Healthcare
DOI:
https://doi.org/10.15662/IJARCST.2025.0805024Keywords:
Federated Learning, predictive analytics, privacy preservation, finance, healthcare, distributed machine learning, secure aggregation, differential privacy, encrypted communication, regulatory complianceAbstract
The increasing volume and sensitivity of data in finance and healthcare have created an urgent need for predictive analytics that preserves privacy, supports cross‑institution collaboration, and scales to large‑scale distributed environments. Traditional centralized machine learning approaches require pooling data into a central repository, raising significant privacy, regulatory, and security concerns. Federated Learning (FL) addresses these challenges by enabling model training across distributed data sources without sharing raw data, thereby enhancing privacy and compliance. This paper proposes a comprehensive AI‑Enabled Federated Learning Architecture tailored for secure large‑scale predictive analytics across finance and healthcare domains. The architecture integrates local model training at institutional edges, secure aggregation through encrypted communication, and a centralized orchestration layer for global model updates. Core design features include robust authentication and authorization, differential privacy, homomorphic encryption options, and audit‑aware governance suitable for regulatory requirements like GDPR, HIPAA, and financial data regulations. We evaluate the architecture using real and synthetic datasets from both domains, focusing on predictive tasks such as credit risk assessment and disease outcome prediction. The results demonstrate that the federated framework achieves predictive performance comparable to centralized models while significantly improving privacy preservation, lowering data transfer risk, and enhancing compliance. The paper concludes with observations on scalability, limitations, and avenues for future work.References
1. Rieke, N., Hancox, J., Li, W., Milletari, F., Roth, H. R., Albarqouni, S., ... & Cardoso, M. J. (2020). The future of digital health with federated learning. NPJ Digital Medicine, 3(119).
2. Adari, V. K. (2024). The Path to Seamless Healthcare Data Exchange: Analysis of Two Leading Interoperability Initiatives. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(6), 11472-11480.
3. Sridhar Reddy Kakulavaram, Praveen Kumar Kanumarlapudi, Sudhakara Reddy Peram. (2024). Performance Metrics and Defect Rate Prediction Using Gaussian Process Regression and Multilayer Perceptron. International Journal of Information Technology and Management Information Systems (IJITMIS), 15(1), 37-53.
4. Panwar, P., Shabaz, M., Nazir, S., Keshta, I., Rizwan, A., & Sugumar, R. (2023). Generic edge computing system for optimization and computation offloading of unmanned aerial vehicle. Computers and Electrical Engineering, 109, 108779.
5. Joyce, S., Pasumarthi, A., & Anbalagan, B. (2025). SECURITY OF SAP SYSTEMS IN AZURE: ENHANCING SECURITY POSTURE OF SAP WORKLOADS ON AZURE–A COMPREHENSIVE REVIEW OF AZURENATIVE TOOLS AND PRACTICES.||.
6. Kusumba, S. (2024). Data Integration: Unifying Financial Data for Deeper Insight. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(1), 9939-9946.
7. Meka, S. (2022). Streamlining Financial Operations: Developing Multi-Interface Contract Transfer Systems for Efficiency and Security. International Journal of Computer Technology and Electronics Communication, 5(2), 4821-4829.
8. Li, T., Sahu, A. K., Talwalkar, A., & Smith, V. (2020). Federated learning: Challenges, methods, and future directions. IEEE Signal Processing Magazine, 37(3), 50–60.
9. Poornima, G., & Anand, L. (2024, May). Novel AI Multimodal Approach for Combating Against Pulmonary Carcinoma. In 2024 5th International Conference for Emerging Technology (INCET) (pp. 1-6). IEEE.
10. Sen, S., Kurni, M., Krishnamaneni, R., & Murthy, A. (2024, December). Improved Bi-directional Long Short-Term Memory for Heart Disease Diagnosis using Statistical and Entropy Feature Set. In 2024 9th International Conference on Communication and Electronics Systems (ICCES) (pp. 1331-1337). IEEE.
11. Kumar, R. K. (2024). Real-time GenAI neural LDDR optimization on secure Apache–SAP HANA cloud for clinical and risk intelligence. IJEETR, 8737–8743. https://doi.org/10.15662/IJEETR.2024.0605006
12. Sivaraju, P. S. (2024). Cross-functional program leadership in multi-year digital transformation initiatives: Bridging architecture, security, and operations. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 7(6), 11374-11380.
13. Bonawitz, K., Ivanov, V., Kreuter, B., Marcedone, A., McMahan, H. B., Patel, S., ... & Seth, K. (2017). Practical secure aggregation for privacy-preserving machine learning. CCS.
14. Rodrigues, G. N., Mir, M. N. H., Bhuiyan, M. S. M., Rafi, M. D. A. L., Hoque, A. M., Maua, J., & Mridha, M. F. (2025). NLP-driven customer segmentation: A comprehensive review of methods and applications in personalized marketing. Data Science and Management.
15. Vasugi, T. (2022). AI-Enabled Cloud Architecture for Banking ERP Systems with Intelligent Data Storage and Automation using SAP. International Journal of Engineering & Extended Technologies Research (IJEETR), 4(1), 4319-4325.
16. Nagarajan, G. (2024). Cloud-Integrated AI Models for Enhanced Financial Compliance and Audit Automation in SAP with Secure Firewall Protection. International Journal of Advanced Research in Computer Science & Technology (IJARCST), 7(1), 9692-9699.
17. Uddandarao, D. P. (2024). Improving Employment Survey Estimates in Data-ScarceRegions Using Dynamic Bayesian Hierarchical Models: Addressing Measurement Challenges in Developing Countries. Panamerican Mathematical Journal, 34(4), 2024.
18. Tamizharasi, S., Rubini, P., Saravana Kumar, S., & Arockiam, D. Adapting federated learning-based AI models to dynamic cyberthreats in pervasive IoT environments.
19. Archana, R., & Anand, L. (2025). Residual u-net with Self-Attention based deep convolutional adaptive capsule network for liver cancer segmentation and classification. Biomedical Signal Processing and Control, 105, 107665.
20. Vijayaboopathy, V., & Dhanorkar, T. (2021). LLM-Powered Declarative Blueprint Synthesis for Enterprise Back-End Workflows. American Journal of Autonomous Systems and Robotics Engineering, 1, 617-655.
21. Sheller, M. J., Reina, G. A., Edwards, B., Martin, J., & Bakas, S. (2019). Multi-institutional deep learning modeling without sharing patient data: A feasibility study on brain tumor segmentation. MICCAI.
22. Christadoss, J., & Panda, M. R. (2025). Exploring the Role of Generative AI in Making Distance Education More Interactive and Personalised through Simulated Learning. Futurity Proceedings, (4), 114-127.
23. Sudharsanam, S. R., Venkatachalam, D., & Paul, D. (2022). Securing AI/ML Operations in Multi-Cloud Environments: Best Practices for Data Privacy, Model Integrity, and Regulatory Compliance. Journal of Science & Technology, 3(4), 52–87.
24. Gujjala, Praveen Kumar Reddy. (2024). Optimizing ETL Pipelines with Delta Lake and Medallion Architecture: A Scalable Approach for Large-Scale Data. International Journal For Multidisciplinary Research. 6. 10.36948/ijfmr.2024.v06i06.55445.
25. Chukkala, R. (2025, April). The Convergence of CCAI, Chatbots, and RCS Messaging: Redefining Business Communication in the AI Era. In International Conference of Global Innovations and Solutions (pp. 194-213). Cham: Springer Nature Switzerland.
26. Chandra Sekhar Oleti. (2022). Serverless Intelligence: Securing J2ee-Based Federated Learning Pipelines on AWS. International Journal of Computer Engineering and Technology (IJCET), 13(3), 163-180. https://iaeme.com/MasterAdmin/Journal_uploa ds/IJCET/VOLUME_13_ISSUE_3/IJCET_13_03 _017.pdf
27. Bansal, R., Chandra, R., & Lulla, K. (2025). Understanding and Mitigating Strategies for Large Language Model (LLMs) Hallucinations in HR Chatbots. International Journal of Computational and Experimental Science and Engineering, 11(3).
28. Vimal Raja, G. (2021). Mining Customer Sentiments from Financial Feedback and Reviews using Data Mining Algorithms. International Journal of Innovative Research in Computer and Communication Engineering, 9(12), 14705-14710.
29. Kagalkar, A., Kabade, S., Chaudhri, B., & Sharma, A. (2023). AI-Driven Automation for Death Claim Processing In Pension Systems: Enhancing Accuracy and Reducing Cycle Time. International Journal of Artificial Intelligence, Data Science, and Machine Learning, 4(4), 105-110.
30. Malarkodi, K. P., Sugumar, R., Baswaraj, D., Hasan, A., & Kousalya, A. (2023, March). Cyber Physical Systems: Security Technologies, Application and Defense. In 2023 9th International Conference on Advanced Computing and Communication Systems (ICACCS) (Vol. 1, pp. 2536-2546). IEEE.
31. Hasan, S., Zerine, I., Islam, M. M., Hossain, A., Rahman, K. A., & Doha, Z. (2023). Predictive Modeling of US Stock Market Trends Using Hybrid Deep Learning and Economic Indicators to Strengthen National Financial Resilience. Journal of Economics, Finance and Accounting Studies, 5(3), 223-235.
32. Adari, V. K. (2024). APIs and open banking: Driving interoperability in the financial sector. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 7(2), 2015–2024.
33. Kiran, A., & Kumar, S. A methodology and an empirical analysis to determine the most suitable synthetic data generator. IEEE Access 12, 12209–12228 (2024).
34. Koh, C. W. H. B. (2025). AI-Based Cybersecurity and Fraud Analytics for Healthcare Data Integration in Cloud Banking Ecosystems. International Journal of Engineering & Extended Technologies Research (IJEETR), 7(6), 11021-11028.
35. Bonawitz, K., et al. (2019). Towards federated learning at scale: System design. MLSys.


