Next-Generation FinTech Cloud Framework: Databricks and Azure-Based AI with Gradient Boosting and LLM Integration for SAP-Driven Open Banking and Quality Assurance
DOI:
https://doi.org/10.15662/IJARCST.2025.0806807Keywords:
FinTech cloud framework, Databricks, Azure, gradient boosting, large-language models, open banking, SAP integration, quality assurance, anomaly detection, machine learningAbstract
In the evolving FinTech landscape, open banking and regulatory-driven financial innovation demand cloud-native, AI-powered infrastructures that integrate enterprise systems such as SAP S/4HANA and support large-scale data analytics and machine learning. This paper proposes a next-generation FinTech cloud framework that leverages Databricks on Microsoft Azure combined with a hybrid modelling strategy using gradient boosting and large-language-model (LLM) integration to deliver real-time risk scoring, fraud detection, transaction analytics and quality assurance for an open banking ecosystem. The architecture integrates SAP-driven core banking and back-office workflows, open banking APIs, Databricks data lake and machine-learning pipelines, and Azure services for orchestration and deployment. Using gradient boosting for structured transaction data and LLMs for unstructured text (e.g., chat logs, compliance documentation), the framework enables enhanced anomaly detection, real-time alerts and automated remediation workflows embedded into the SAP ecosystem. A pilot implementation across a mid-sized bank’s open-API platform demonstrates measurable improvements: model accuracy for fraud detection increased by ~17% over baseline, end-to-end time from anomaly detection to remediation reduced by ~35%, and QA defect rate in data exchange pipelines decreased by ~28%. The results indicate that combining cloud-native data/AI platforms with enterprise systems and mixed-modelling approaches can materially enhance FinTech operational resilience and quality assurance. The paper discusses limitations (data governance, model interpretability, integration complexity) and outlines future research directions for federated learning, multi-tenant banking domains and regulatory audit-automation.
References
1. Armbrust M., Fox A., Griffith R., Joseph A. D., Katz R., Konwinski A., Lee G., Patterson D., Rabkin A., Stoica I., Zaharia M., “A view of cloud computing”, Communications of the ACM, vol. 53, no. 4, pp. 50-58, Apr. 2010.
2. Marston S., Li Z., Bandyopadhyay S., Zhang J., Ghalsasi A., “Cloud computing – The business perspective”, Decision Support Systems, vol. 51, no. 1, pp. 176-189, 2011.
3. Arner D. W., Barberis J., Buckley R. P., “FinTech, RegTech and the reconceptualisation of financial regulation”, Fordham Journal of Corporate & Financial Law, vol. 23, no. 1, pp. 31-103, 2017.
4. Puschmann T., “FinTech”, Business & Information Systems Engineering, vol. 59, no. 1, pp. 69-76, 2017.
5. Gai K., Qiu M., Sun X., “A survey on FinTech”, Journal of Network and Computer Applications, vol. 103, pp. 262-273, 2018.
6. Liu Q., Chan K. C., Chimhundu R., “Fintech research: systematic mapping, classification, and future directions”, Financial Innovation, vol. 10, article 24, Jan. 2024.
7. Reddy, B. T. K., & Sugumar, R. (2025, June). Effective forest fire detection by UAV image using Resnet 50 compared over Google Net. In AIP Conference Proceedings (Vol. 3267, No. 1, p. 020274). AIP Publishing LLC.
8. Shashank, P. S. R. B., Anand, L., & Pitchai, R. (2024, December). MobileViT: A Hybrid Deep Learning Model for Efficient Brain Tumor Detection and Segmentation. In 2024 International Conference on Progressive Innovations in Intelligent Systems and Data Science (ICPIDS) (pp. 157-161). IEEE.
9. Binu, C. T., Kumar, S. S., Rubini, P., & Sudhakar, K. (2024). Enhancing Cloud Security through Machine Learning-Based Threat Prevention and Monitoring: The Development and Evaluation of the PBPM Framework. https://www.researchgate.net/profile/Binu-C-T/publication/383037713_Enhancing_Cloud_Security_through_Machine_Learning-Based_Threat_Prevention_and_Monitoring_The_Development_and_Evaluation_of_the_PBPM_Framework/links/66b99cfb299c327096c1774a/Enhancing-Cloud-Security-through-Machine-Learning-Based-Threat-Prevention-and-Monitoring-The-Development-and-Evaluation-of-the-PBPM-Framework.pdf
10. Adari, V. K. (2024). APIs and open banking: Driving interoperability in the financial sector. International Journal of Research in Computer Applications and Information Technology (IJRCAIT), 7(2), 2015–2024.
11. Manda, P. (2024). THE ROLE OF MACHINE LEARNING IN AUTOMATING COMPLEX DATABASE MIGRATION WORKFLOWS. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(3), 10451-10459.
12. Sridhar Kakulavaram. (2022). Life Insurance Customer Prediction and Sustainbility Analysis Using Machine Learning Techniques. International Journal of Intelligent Systems and Applications in Engineering, 10(3s), 390 –.Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7649
13. Amuda, K. K., Kumbum, P. K., Adari, V. K., Chunduru, V. K., & Gonepally, S. (2024). Evaluation of crime rate prediction using machine learning and deep learning for GRA method. Data Analytics and Artificial Intelligence, 4 (3).
14. Kandula, N. Machine Learning Techniques in Fracture Mechanics a Comparative Study of Linear Regression, Random Forest, and Ada Boost Model.
15. Kesavan, E. (2024). Big Data Analytics: Tools, Technologies, and Real-World Applications–A Review. International Journal of Innovations in Science, Engineering And Management, 120-126.https://ijisem.com/journal/index.php/ijisem/article/view/315/280
16. HV, M. S., & Kumar, S. S. (2024). Fusion Based Depression Detection through Artificial Intelligence using Electroencephalogram (EEG). Fusion: Practice & Applications, 14(2).
17. Raju, L. H. V., & Sugumar, R. (2025, June). Improving jaccard and dice during cancerous skin segmentation with UNet approach compared to SegNet. In AIP Conference Proceedings (Vol. 3267, No. 1, p. 020271). AIP Publishing LLC.
18. Poornima, G., & Anand, L. (2024, April). Effective strategies and techniques used for pulmonary carcinoma survival analysis. In 2024 1st International Conference on Trends in Engineering Systems and Technologies (ICTEST) (pp. 1-6). IEEE.
19. Bussu, V. R. R. Leveraging AI with Databricks and Azure Data Lake Storage. https://pdfs.semanticscholar.org/cef5/9d7415eb5be2bcb1602b81c6c1acbd7e5cdf.pdf
20. Lin, T. (2024). The role of generative AI in proactive incident management: Transforming infrastructure operations. International Journal of Innovative Research in Science, Engineering and Technology, 13(12), Article — . https://doi.org/10.15680/IJIRSET.2024.1312014
21. Pachyappan, R., Kotapati, V. B. R., & Shanmugam, L. (2024). TicketGenesis: LLM-Driven Compliance Evidence Extraction and Auto-Assignment Engine. Los Angeles Journal of Intelligent Systems and Pattern Recognition, 4, 325-366.
22. Sivaraju, P. S., & Mani, R. (2024). Private Cloud Database Consolidation in Financial Services: A Comprehensive Case Study on APAC Financial Industry Migration and Modernization Initiatives. International Journal of Research Publications in Engineering, Technology and Management (IJRPETM), 7(3), 10472-10490.
23. Gorle, S., Christadoss, J., & Sethuraman, S. (2025). Explainable Gradient-Boosting Classifier for SQL Query Performance Anomaly Detection. American Journal of Cognitive Computing and AI Systems, 9, 54-87.
24. Kühn W., “ERP systems and open banking – integration challenges for financial institutions”, Journal of Banking & Finance Technology, vol. 2, no. 2, pp. 112-125, 2019.
25. Chen P., “A Review of FinTech Research in the Context of Digital Innovation”, Scientific Journal of Technology, vol. 6, no. 7, pp. 130-140, 2024.
26. Dharika K., Yamini K., “Cloud Computing in Fintech: Opportunities and Challenges”, European Journal of Advances in Engineering and Technology, vol. 10, no. 5, 2023.


