Fraud Prediction in Online Financial Transactions with a Combination of SMOTE and Ensemble Classifier

Juvinal Ximenes Guterres
Delfim Da Silva
Jacinto Defatima Sales
Abrao Freitas
Nuno da Costa

Abstract

Detecting fraudulent transactions remains a major challenge in digital financial systems because fraudulent records are vastly outnumbered by legitimate ones. This study develops a classification model that identifies fraudulent transactions with high sensitivity to the minority (fraud) class while keeping performance stable enough for operational deployment. Preprocessing consists of outlier removal, feature normalization, and stratified data partitioning. To address class imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) is applied to generate synthetic samples for the minority class. Multiple machine learning algorithms are evaluated, including Random Forest, Decision Tree, Bagging, Gradient Boosting, Logistic Regression, Neural Network, K-Nearest Neighbors, and Support Vector Machine. Model performance is assessed using Precision, Recall, F1-Score, AUC, and G-Mean. The proposed approach achieves an AUC of 0.89 and a G-Mean of 0.81, indicating that it is effective for operational fraud detection and for reducing classification errors.
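The two less familiar ingredients named in the abstract, SMOTE and G-Mean, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `smote` and `g_mean`, the parameters `n_new`, `k`, and `seed`, and the toy data are all hypothetical, and G-Mean is assumed to follow the usual binary definition (the geometric mean of sensitivity and specificity). In common practice, oversampling of this kind is applied to the training partition only, so that evaluation data contain no synthetic records.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Sketch of SMOTE: create n_new synthetic minority points.

    Each synthetic point lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of base, excluding base itself
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # relative position along base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def g_mean(y_true, y_pred):
    """Geometric mean of sensitivity and specificity (label 1 = fraud)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    return math.sqrt(sensitivity * specificity)


# Toy usage with made-up two-feature fraud records (illustrative only).
fraud = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
extra = smote(fraud, n_new=6, k=2)
score = g_mean([1, 1, 0, 0], [1, 0, 0, 0])
```

Because G-Mean multiplies the two per-class rates, a model that labels every transaction as legitimate scores zero, however high its overall accuracy. That property is what makes the metric a reasonable companion to AUC on heavily imbalanced data.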
