Optimizing Fraud Detection in Indonesia via Rare-Event Logit Approach: A Simulation Study on Large-Scale

Authors

  • Agung Tri Utomo Universitas Negeri Makassar, Indonesia
  • Abdul Rahman Universitas Negeri Makassar, Indonesia

DOI:

https://doi.org/10.61255/jeemba.v4i1.1309

Keywords:

Fraud Detection, Rare-Event Logit, Class Imbalance, Simulation Study, Financial Risk

Abstract

Purpose: This study examines the use of the Rare-Event Logit approach to improve fraud detection under conditions of extreme class imbalance. The topic is important because fraud cases usually represent only a very small proportion of total financial transactions, which may reduce the accuracy of conventional classification models.

Design/methodology/approach: This study uses a simulation-based quantitative design to evaluate fraud detection performance in large-scale imbalanced data settings. The analysis compares standard logistic regression and Rare-Event Logit with bias-corrected estimation, including Firth’s penalized likelihood approach. Model performance is assessed using the Area Under the Precision-Recall Curve and F1-Score.

Findings/Results: The results show that standard logit and Rare-Event Logit perform similarly under moderate imbalance conditions. However, Rare-Event Logit provides a stronger theoretical advantage in handling rare-event bias and stabilizing parameter estimation as data sparsity increases. This indicates that bias-corrected probabilistic models are more suitable for fraud detection in highly imbalanced environments.

Originality/Value: This study highlights the value of Rare-Event Logit as an alternative approach for fraud detection in rare-event settings. The findings imply that financial institutions can improve fraud risk identification by adopting bias-corrected models that are more robust to class imbalance.

Abstract views: 16 , PDF downloads: 10

Downloads

Download data is not yet available.

References

Abdallah, A., Maarof, M. A., & Zainal, A. (2016). Fraud detection system: A survey. Journal of Network and Computer Applications, 68, 90–113. https://doi.org/10.1016/j.jnca.2016.04.007

Awoyemi, J. O., Adetunmbi, A. O., & Oluwadare, S. A. (2017). Credit card fraud detection using machine learning techniques: A comparative analysis. 2017 International Conference on Computing Networking and Informatics (ICCNI), 1–7. https://doi.org/10.1109/ICCNI.2017.8123782

Bauder, R. A., Khoshgoftaar, T. M., & Hasanin, T. (2018). Empirical comparison of data sampling methods for highly imbalanced medicare fraud detection. IEEE 19th International Conference on Information Reuse and Integration (IRI), 1–8.

Carcillo, F., Le Borgne, Y. A., Caelen, O., & Bontempi, G. (2018). Streaming active learning strategies for real-life credit card fraud detection: assessment and visualization. International Journal of Data Science and Analytics, 5(4), 285–300. https://doi.org/10.1007/s41060-018-0116-z

Dal Pozzolo, A., Boracchi, G., Caelen, O., Alippi, C., & Bontempi, G. (2018). Credit card fraud detection: a realistic modeling and a novel evaluation strategy. IEEE Transactions on Neural Networks and Learning Systems, 29(8), 3784–3797. https://doi.org/10.1109/TNNLS.2017.2736643

Dornadula, V. N., & Geetha, S. (2019). Credit card fraud detection using machine learning algorithms. Procedia Computer Science, 165, 631–641. https://doi.org/10.1016/j.procs.2020.01.057

Fernández, A., Garcia, S., Galar, M., Prati, R. C., Krawczyk, B., & Herrera, F. (2018). Learning from imbalanced data sets. Springer. https://doi.org/10.1007/978-3-319-98074-4

Fiore, U., De Santis, A., Perla, F., Zanetti, P., & Palmieri, F. (2019). Using generative adversarial networks for improving classification effectiveness in credit card fraud detection. Information Sciences, 479, 448–455. https://doi.org/10.1016/j.ins.2017.12.030

Heinze, G., & Puhr, R. (2020). Bias-reduced and separation-proof logistic regression with small or sparse data. Statistics in Medicine, 39(16), 2187–2200.

Hilal, W., Gadsden, S. A., & Yawney, J. (2022). Financial fraud: a review of anomaly detection techniques and recent advances. Expert Systems with Applications, 193, 116429. https://doi.org/10.1016/j.eswa.2021.116429

Ileberi, E., Sun, Y., & Wang, Z. (2022). A machine learning based credit card fraud detection using the GA algorithm for feature selection. Journal of Big Data, 9(1), 24. https://doi.org/10.1186/s40537-022-00573-8

Lebichot, B., Braun, F., Caelen, O., & Bontempi, G. (2020). A graph-based imputed-weighted framework for credit card fraud detection. Engineering Applications of Artificial Intelligence, 87, 103254.

Lucas, Y., Portier, P. E., Laporte, L., He-Guelton, L., Caelen, O., & others. (2020). Towards automated feature engineering for credit card fraud detection using multi-perspective HMMs. Future Generation Computer Systems, 102, 393–402. https://doi.org/10.1016/j.future.2019.08.029

Makki, S., Assaghir, Z., Taher, Y., Haque, R., Hacid, M. S., & Zeineddine, H. (2019). An experimental study with imbalanced classification approaches for credit card fraud detection. IEEE Access, 7, 93010–93022. https://doi.org/10.1109/ACCESS.2019.2927266

Mansour, R. I., & Cribari-Neto, F. (2020). Bias correction in the rare event logistic regression model. Journal of Statistical Computation and Simulation, 90(18), 3326–3345.

Otoritas Jasa Keuangan (OJK). (2024). Laporan Strategi Anti-Fraud Perbankan Indonesia: Tantangan Era Digital.

Puhr, R., Heinze, G., Nold, M., Lusa, L., & Geroldinger, A. (2017). Firth’s logistic regression with rare events: accurate effect estimates and predictions. Statistics in Medicine, 36(14), 2302–2317. https://doi.org/10.1002/sim.7273

Wang, Y., & Xu, W. (2018). Leveraging deep learning with LDA-based text analytics to detect automobile insurance fraud. Decision Support Systems, 105, 87–95. https://doi.org/10.1016/j.dss.2017.11.001

Zhu, T., Lin, Y., & Tie, Y. (2021). A novel credit card fraud detection method based on an improved semi-supervised learning. Applied Soft Computing, 111, 107693.

Downloads

Published

2026-01-30

How to Cite

Utomo, A. T., & Rahman, A. (2026). Optimizing Fraud Detection in Indonesia via Rare-Event Logit Approach: A Simulation Study on Large-Scale . Journal of Economics, Entrepreneurship, Management Business and Accounting, 4(1), 35–42. https://doi.org/10.61255/jeemba.v4i1.1309