Optimizing Fraud Detection in Indonesia via Rare-Event Logit Approach: A Simulation Study on Large-Scale
DOI:
https://doi.org/10.61255/jeemba.v4i1.1309Keywords:
Fraud Detection, Rare-Event Logit, Class Imbalance, Simulation Study, Financial RiskAbstract
Purpose: This study examines the use of the Rare-Event Logit approach to improve fraud detection under conditions of extreme class imbalance. The topic is important because fraud cases usually represent only a very small proportion of total financial transactions, which may reduce the accuracy of conventional classification models.
Design/methodology/approach: This study uses a simulation-based quantitative design to evaluate fraud detection performance in large-scale imbalanced data settings. The analysis compares standard logistic regression and Rare-Event Logit with bias-corrected estimation, including Firth’s penalized likelihood approach. Model performance is assessed using the Area Under the Precision-Recall Curve and F1-Score.
Findings/Results: The results show that standard logit and Rare-Event Logit perform similarly under moderate imbalance conditions. However, Rare-Event Logit provides a stronger theoretical advantage in handling rare-event bias and stabilizing parameter estimation as data sparsity increases. This indicates that bias-corrected probabilistic models are more suitable for fraud detection in highly imbalanced environments.
Originality/Value: This study highlights the value of Rare-Event Logit as an alternative approach for fraud detection in rare-event settings. The findings imply that financial institutions can improve fraud risk identification by adopting bias-corrected models that are more robust to class imbalance.
Abstract views: 16
,
PDF downloads: 10
Downloads
References
Abdallah, A., Maarof, M. A., & Zainal, A. (2016). Fraud detection system: A survey. Journal of Network and Computer Applications, 68, 90–113. https://doi.org/10.1016/j.jnca.2016.04.007
Awoyemi, J. O., Adetunmbi, A. O., & Oluwadare, S. A. (2017). Credit card fraud detection using machine learning techniques: A comparative analysis. 2017 International Conference on Computing Networking and Informatics (ICCNI), 1–7. https://doi.org/10.1109/ICCNI.2017.8123782
Bauder, R. A., Khoshgoftaar, T. M., & Hasanin, T. (2018). Empirical comparison of data sampling methods for highly imbalanced medicare fraud detection. IEEE 19th International Conference on Information Reuse and Integration (IRI), 1–8.
Carcillo, F., Le Borgne, Y. A., Caelen, O., & Bontempi, G. (2018). Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics, 5(4), 285–300. https://doi.org/10.1007/s41060-018-0116-z
Dal Pozzolo, A., Boracchi, G., Caelen, O., Alippi, C., & Bontempi, G. (2018). Credit card fraud detection: a realistic modeling and a novel evaluation strategy. IEEE Transactions on Neural Networks and Learning Systems, 29(8), 3784–3797. https://doi.org/10.1109/TNNLS.2017.2736643
Dornadula, V. N., & Geetha, S. (2019). Credit card fraud detection using machine learning algorithms. Procedia Computer Science, 165, 631–641. https://doi.org/10.1016/j.procs.2020.01.057
Fernández, A., Garcia, S., Galar, M., Prati, R. C., Krawczyk, B., & Herrera, F. (2018). Learning from imbalanced data sets. Springer. https://doi.org/10.1007/978-3-319-98074-4
Fiore, U., De Santis, A., Perla, F., Zanetti, P., & Palmieri, F. (2019). Using generative adversarial networks for improving classification effectiveness in credit card fraud detection. Information Sciences, 479, 448–455. https://doi.org/10.1016/j.ins.2017.12.030
Heinze, G., & Puhr, R. (2020). Bias-reduced and separation-proof logistic regression with small or sparse data. Statistics in Medicine, 39(16), 2187–2200.
Hilal, W., Gadsden, S. A., & Yawney, J. (2022). Financial fraud: a review of anomaly detection techniques and recent advances. Expert Systems with Applications, 193, 116429. https://doi.org/10.1016/j.eswa.2021.116429
Ileberi, E., Sun, Y., & Wang, Z. (2022). A machine learning based credit card fraud detection using the GA algorithm for feature selection. Journal of Big Data, 9(1), 24. https://doi.org/10.1186/s40537-022-00573-8
Lebichot, B., Braun, F., Caelen, O., & Bontempi, G. (2020). A graph-based imputed-weighted framework for credit card fraud detection. Engineering Applications of Artificial Intelligence, 87, 103254.
Lucas, Y., Portier, P. E., Laporte, L., He-Guelton, L., Caelen, O., & others. (2020). Towards automated feature engineering for credit card fraud detection using multi-perspective HMMs. Future Generation Computer Systems, 102, 393–402. https://doi.org/10.1016/j.future.2019.08.029
Makki, S., Assaghir, Z., Taher, Y., Haque, R., Hacid, M. S., & Zeineddine, H. (2019). An experimental study with imbalanced classification approaches for credit card fraud detection. IEEE Access, 7, 93010–93022. https://doi.org/10.1109/ACCESS.2019.2927266
Mansour, R. I., & Cribari-Neto, F. (2020). Bias correction in the rare event logistic regression model. Journal of Statistical Computation and Simulation, 90(18), 3326–3345.
Otoritas Jasa Keuangan (OJK). (2024). Laporan Strategi Anti-Fraud Perbankan Indonesia: Tantangan Era Digital.
Puhr, R., Heinze, G., Nold, M., Lusa, L., & Geroldinger, A. (2017). Firth’s logistic regression with rare events: accurate effect estimates and predictions. Statistics in Medicine, 36(14), 2302–2317. https://doi.org/10.1002/sim.7273
Wang, Y., & Xu, W. (2018). Leveraging deep learning with LDA-based text analytics to detect automobile insurance fraud. Decision Support Systems, 105, 87–95. https://doi.org/10.1016/j.dss.2017.11.001
Zhu, T., Lin, Y., & Tie, Y. (2021). A novel credit card fraud detection method based on an improved semi-supervised learning. Applied Soft Computing, 111, 107693.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Agung Tri Utomo, Abdul Rahman

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
















Email: fadhila.della@gmail.com, andika.isma@unm.ac.id