Big Data Analytics in Digital Banking Fraud Detection Technologies and Methods
Main Article Content
Abstract
Digital banking fraud has escalated dramatically with the proliferation of online financial services, causing billions in annual losses and threatening the stability of global economic systems. This paper examines the Big Data Analytics (BDA) technologies and methods for real-time fraud detection in digital banking environments. The paper discusses the evolution from traditional rule-based systems to modern distributed computing frameworks, analysing how Apache Hadoop and Spark enable the processing of massive transaction volumes with varying trade-offs between latency and accuracy. Key machine learning approaches are covered, including supervised methods, unsupervised methods, and hybrid architectures that combine both paradigms. The paper identifies critical implementation challenges across technical dimensions, operational aspects, and regulatory requirements. Emerging trends explored include federated learning for privacy-preserving model training, blockchain integration for cross-institutional fraud detection, and edge computing for ultra-low latency inference. The analysis shows that while individual studies report improvements in detection, challenges remain in real-world validation, model interpretability, and cross-institutional generalizability. The paper concludes with practical recommendations for implementing hybrid streaming-batch architectures, embedding explainability mechanisms, and adopting privacy-preserving techniques. This paper provides insight for researchers and practitioners to understand the current capabilities, limitations, and future trends of BDA in enhancing fraud detection in increasingly complex digital banking ecosystems.
Manuscript received: 9 Jun 2025 | Revised: 6 Aug 2025 | Accepted: 24 Sep 2025 | Published: 31 Mar 2026
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
References
J. Gee and M. Button, “The Financial Cost of Fraud 2018: The Latest Data from Around the World”, Crowe Clark Whitehill, 2018.
URL: https://www.crowe.com/uk/croweuk/insights/financial-cost-of-fraud-2018 (Accessed 6 August 2025)
ACFE, “Occupational Fraud 2024: A Report to the Nations,” Association of Certified Fraud Examiners (ACFE), 2024.
URL:https://legacy.acfe.com/report-to-the-nations/2024/ (Accessed 6 August 2025)
Kaspersky, “Financial Cyberthreats Report 2024,” SecureList by Kaspersky, 2024.
URL:https://securelist.com/financial-threat-report-2024/115966/ (Accessed 6 August 2025)
UK Finance, “Annual Fraud Report 2025,” UK Finance Limited trading, 2025.
URL:https://www.ukfinance.org.uk/policy-and-guidance/reports-and-publications/annual-fraud-report-2025 (Accessed 8 August 2025)
A. Ali, S.A. Razak, S.H. Othman, T.A.E. Eisa, A. Al-Dhaqm, M. Nasser, T. Elhassan, H. Elshafie and A. Saif, "Financial Fraud Detection Based on Machine Learning: A Systematic Literature Review," Applied Sciences, vol. 12, no. 19, pp. 9637, 2022.
DOI: https://doi.org/10.3390/app12199637
M.U. Hassan, M.H. Rehmani and J. Chen, "Anomaly Detection in Blockchain Networks: A Comprehensive Survey," IEEE Communications Surveys & Tutorials, vol. 25, no. 1, pp. 289-318, 2023.
DOI: https://doi.org/10.1109/COMST.2022.3205643
C.P. Chen and C. Zhang, "Data-intensive applications, challenges, techniques and technologies: A survey on Big Data," Information Sciences, vol. 275, pp. 314-347, 2014.
DOI: https://doi.org/10.1016/j.ins.2014.01.015
R. Vinayakumar, M. Alazab, K.P. Soman, P. Poornachandran, A. Al-Nemrat and S. Venkatraman, "Deep Learning Approach for Intelligent Intrusion Detection System," IEEE Access, vol. 7, pp. 41525-41550, 2019.
DOI: https://doi.org/10.1109/ACCESS.2019.2895334
Y. Sahin and E. Duman, "Detecting credit card fraud by ANN and logistic regression," 2011 International Symposium on Innovations in Intelligent Systems and Applications, pp. 315-319, 2011.
DOI: https://doi.org/10.1109/INISTA.2011.5946108
F. Almalki and M. Masud, "Financial Fraud Detection Using Explainable AI and Stacking Ensemble Methods," arXiv, 2025.
DOI: https://doi.org/10.48550/arXiv.2505.10050
R. P., S.V. E., C. Anilkumar, P. Thilakaveni and U. Moorthy, "Big Data Analytics and Implementation Challenges of Machine Learning in Big data," Applied and Computational Engineering, vol. 2, no. 1, pp. 484-489, 2023.
DOI: https://doi.org/10.54254/2755-2721/2/20220584
A. Madhavi and T. Sivaramireddy, "Real-Time Credit Card Fraud Detection Using Spark Framework," Algorithms for Intelligent Systems, pp. 287-298, 2021.
DOI: https://doi.org/10.1007/978-981-33-4046-6_28
S. Fritz-Morgenthal, B. Hein and J. Papenbrock, "Financial Risk Management and Explainable, Trustworthy, Responsible AI," Frontiers in Artificial Intelligence, vol. 5, 2022.
DOI: https://doi.org/10.3389/frai.2022.779799
J. Dean and S. Ghemawat, "MapReduce," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008.
DOI: https://doi.org/10.1145/1327452.1327492
K. Shvachko, H. Kuang, S. Radia and R. Chansler, "The Hadoop Distributed File System," 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp. 1-10, 2010.
DOI: https://doi.org/10.1109/MSST.2010.5496972
T. White, “Hadoop: the definitive guide, Fourth edition,” O’Reilly, 2015.
A. Ghoting, R. Krishnamurthy, E. Pednault, B. Reinwald, V. Sindhwani, S. Tatikonda, Y. Tian and S. Vaithyanathan, "SystemML: Declarative machine learning on MapReduce," 2011 IEEE 27th International Conference on Data Engineering, 2011.
DOI: https://doi.org/10.1109/ICDE.2011.5767930
M. Zareapoor and P. Shamsolmoali, "Application of Credit Card Fraud Detection: Based on Bagging Ensemble Classifier," Procedia Computer Science, vol. 48, pp. 679-685, 2015.
DOI: https://doi.org/10.1016/j.procs.2015.04.201
N. Malini and M. Pushpa, "Analysis on credit card fraud identification techniques based on KNN and outlier detection," 2017 Third International Conference on Advances in Electrical, Electronics, Information, Communication and Bio-Informatics (AEEICB), pp. 255-258, 2017.
DOI: https://doi.org/10.1109/AEEICB.2017.7972424
S. Panigrahi, A. Kundu, S. Sural and A. Majumdar, "Credit card fraud detection: A fusion approach using Dempster–Shafer theory and Bayesian learning," Information Fusion, vol. 10, no. 4, pp. 354-363, 2009.
DOI: https://doi.org/10.1016/j.inffus.2008.04.001
Suman and D. Kumar, “Performance Analysis of Various Credit Card Fraud Detection Approaches: A Review,” International Journal of Advanced Research in Science and Engineering, vol. 5, no. 9, pp. 120–126, 2016.
URL:https://www.ijarse.com/images/fullpdf/1473254973_506ijarse.pdf
M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker and I. Stoica, “Spark: cluster computing with working sets,” Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, p. 10, 2010.
URL: https://dl.acm.org/doi/10.5555/1863103.1863113
M. Zaharia, T. Das, H. Li, T. Hunter, S. Shenker and I. Stoica, "Discretized streams," Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, pp. 423-438, 2013.
DOI: https://doi.org/10.1145/2517349.2522737
K. Dabas, A. Dubey, and "Real-Time Fraud Detection using Apache Kafka, Apache Spark, and PySpark MLlib," International Journal Of Scientific Research In Engineering And Management, vol. 9, no. 05, pp. 1-9, 2025.
DOI: https://doi.org/10.55041/IJSREM47661
X. Meng, J. Bradley, B. Yavuz, E. Sparks, S. Venkataraman, D. Liu, J. Freeman, D. Tsai, M. Amde, S. Owen, D. Xin, R. Xin, M.J. Franklin, R. Zadeh, M. Zaharia and A. Talwalkar, "MLlib: Machine Learning in Apache Spark," arXiv, 2015.
DOI: https://doi.org/10.48550/arXiv.1505.06807
T. Chen and C. Guestrin, "XGBoost," Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 785-794, 2016.
DOI: https://doi.org/10.1145/2939672.2939785
M. Almansoori and M. Telek, "Anomaly Detection using combination of Autoencoder and Isolation Forest," 1st Workshop on Intelligent Infocommunication Networks, Systems and Services, pp. 25-30, 2023.
DOI: https://doi.org/10.3311/WINS2023-005
R.D. Baruah and P. Angelov, "DEC: Dynamically Evolving Clustering and Its Application to Structure Identification of Evolving Fuzzy Models," IEEE Transactions on Cybernetics, vol. 44, no. 9, pp. 1619-1631, 2014.
DOI: https://doi.org/10.1109/TCYB.2013.2291234
R.D. Baruah and P. Angelov, "Evolving local means method for clustering of streaming data," 2012 IEEE International Conference on Fuzzy Systems, pp. 1-8, 2012.
DOI: https://doi.org/10.1109/FUZZ-IEEE.2012.6251366
L.H. Aros, L.X.B. Molano, F. Gutierrez-Portela, J.J.M. Hernandez and M.S.R. Barrero, "Financial fraud detection through the application of machine learning techniques: a literature review," Humanities and Social Sciences Communications, vol. 11, no. 1, 2024.
DOI: https://doi.org/10.1057/s41599-024-03606-0
P. Sundaravadivel, R.A. Isaac, D. Elangovan, D. KrishnaRaj, V.V.L. Rahul and R. Raja, "Optimizing credit card fraud detection with random forests and SMOTE," Scientific Reports, vol. 15, no. 1, 2025.
DOI: https://doi.org/10.1038/s41598-025-00873-y
L. Breiman, "Random Forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.
DOI: https://doi.org/10.1023/A:1010933404324
N.S. Halvaiee and M.K. Akbari, "A novel model for credit card fraud detection using Artificial Immune Systems," Applied Soft Computing, vol. 24, pp. 40-49, 2014.
DOI: https://doi.org/10.1016/j.asoc.2014.06.042
M. Ester, H. -P. Kriegel, and X. Xu, “A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise,” Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231, 1996,.
URL: https://cdn.aaai.org/KDD/1996/KDD96-037.pdf
N. Carneiro, G. Figueira and M. Costa, "A data mining based system for credit-card fraud detection in e-tail," Decision Support Systems, vol. 95, pp. 91-101, 2017.
DOI: https://doi.org/10.1016/j.dss.2017.01.002
J. Pei, J. Han, B. Mortazavi-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal and M. Hsu, "Mining sequential patterns by pattern-growth: the PrefixSpan approach," IEEE Transactions on Knowledge and Data Engineering, vol. 16, no. 11, pp. 1424-1440, 2004.
DOI: https://doi.org/10.1109/TKDE.2004.77
B. Krawczyk and M. Wozniak, "Diversity measures for one-class classifier ensembles," Neurocomputing, vol. 126, pp. 36-44, 2014.
DOI: https://doi.org/10.1016/j.neucom.2013.01.053
T. Kohonen, "The self-organizing map," Neurocomputing, vol. 21, no. 1-3, pp. 1-6, 1998.
DOI: https://doi.org/10.1016/S0925-2312(98)00030-7
R.A. Leite, T. Gschwandtner, S. Miksch, S. Kriglstein, M. Pohl, E. Gstrein and J. Kuntner, "EVA: Visual Analytics to Identify Fraudulent Events," IEEE Transactions on Visualization and Computer Graphics, vol. 24, no. 1, pp. 330-339, 2018.
DOI: https://doi.org/10.1109/TVCG.2017.2744758
S. Kaisler, F. Armour, J.A. Espinosa and W. Money, "Big Data: Issues and Challenges Moving Forward," 2013 46th Hawaii International Conference on System Sciences, 2013.
DOI: https://doi.org/10.1109/HICSS.2013.645
P. Jain, M. Gyanchandani and N. Khare, "Big data privacy: a technological perspective and review," Journal of Big Data, vol. 3, no. 1, 2016.
DOI: https://doi.org/10.1186/s40537-016-0059-y
U. Sivarajah, M.M. Kamal, Z. Irani and V. Weerakkody, "Critical analysis of Big Data challenges and analytical methods," Journal of Business Research, vol. 70, pp. 263-286, 2017.
DOI: https://doi.org/10.1016/j.jbusres.2016.08.001
E. Hormozi, M.K. Akbari, H. Hormozi and M.S. Javan, "Accuracy evaluation of a credit card fraud detection system on Hadoop MapReduce," The 5th Conference on Information and Knowledge Technology, pp. 35-39, 2013.
DOI: https://doi.org/10.1109/IKT.2013.6620034
J. Liu, S. Zhang and H. Fan, "A two-stage hybrid credit risk prediction model based on XGBoost and graph-based deep neural network," Expert Systems with Applications, vol. 195, pp. 116624, 2022.
DOI: https://doi.org/10.1016/j.eswa.2022.116624
T.T. Nguyen, H. Tahir, M. Abdelrazek and A. Babar, "Deep Learning Methods for Credit Card Fraud Detection," arXiv, 2020.
DOI: https://doi.org/10.48550/arXiv.2012.03754
I.D. Mienye and T.G. Swart, “A Hybrid Deep Learning Approach with Generative Adversarial Network for Credit Card Fraud Detection,” Technologies, vol. 12, no. 10, p. 186, 2024.