A Comparative Evaluation of Back Translation and Easy Data Augmentation for Indonesian Automatic Short Answer Scoring

Authors

DOI:

https://doi.org/10.61255/pisces.v4i1.1294

Keywords:

Automatic Short Answer Scoring, Back Translation, Easy Data Augmentation, IndoBERT, BiLSTM, FastText

Abstract

Automatic Short Answer Scoring (AES) is a Natural Language Processing (NLP) application designed to automatically assess short-answer responses. One of the primary challenges in developing AES systems is the limited size and diversity of available datasets, which can adversely affect a model’s generalization capability. Previous studies have demonstrated that Easy Data Augmentation (EDA) based on IndoBERT-generated synonyms can improve model performance on the UKARA dataset; however, this approach remains limited because the augmentation process is performed at the word level. This study aims to compare the effectiveness of Back Translation and IndoBERT-based Synonym EDA for Indonesian AES systems using the UKARA dataset. To ensure a fair comparison, the dataset, preprocessing procedures, FastText-based text representation, BiLSTM architecture, and evaluation methods were kept consistent across experiments, allowing performance differences to be attributed solely to the augmentation techniques. The experiments were conducted using both Non-K-Fold Evaluation and 3-Fold Cross-Validation scenarios. The results indicate that Back Translation outperformed IndoBERT-based Synonym EDA in most experimental settings, achieving the highest accuracy of 89.00% on Dataset A. Furthermore, the findings suggest that the quality and semantic diversity of the generated data have a greater impact on model performance than merely increasing the amount of training data. Therefore, Back Translation can serve as an effective alternative for enhancing dataset quality and improving the performance of Indonesian AES systems.

Keywords: Automatic Short Answer Scoring, Back Translation, Easy Data Augmentation, IndoBERT, BiLSTM, FastText.

Abstract views: 0 , PDF downloads: 0

Downloads

Download data is not yet available.

References

Ariyus, D., Manongga, D., & Sembiring, I. (2024). Enhancing Sentiment Analysis of Indonesian Tourism Video Content Commentary on TikTok: A FastText and Bi-LSTM Approach. Engineering, Technology & Applied Science Research, 14(6), 18020-18028. https://doi.org/10.48084/etasr.8859

Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. (2021). Understanding deep learning (still) requires rethinking generalization. Communications of the ACM, 64(3), 107-115.

Fadilah, N., & Priyanta, S. (2022). Automatic Essay Scoring Using Data Augmentation in Bahasa Indonesia. IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 16(4), 401-410.

Herwanto, G. B., Sari, Y., Prastowo, B. N., Bustoni, I. A., & Hidayatulloh, I. (2018). UKARA: A fast and simple automatic short answer scoring system for Bahasa Indonesia. ICEAP 2019, 2, 48-53.

Nurhaliza, S., & Al Fatta, H. (2024). Analisis Sentimen Ulasan Aplikasi KAI Access Menggunakan Metode Naive Bayes Classifier dan Support Vector Machine dengan Teknik Preprocessing. Jurnal Teknoinfo, 18(1), 145-156.

Purwoko, R. A., Utami, E., & Luthfi, E. T. (2023). Klasifikasi Teks Aduan Publik Menggunakan Kombinasi TF-IDF dan Bidirectional Long Short-Term Memory (BiLSTM). Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 7(4), 812-820.

Rahma, I. A., & Suadaa, L. H. (2023). Penerapan Text Augmentation untuk Mengatasi Data yang Tidak Seimbang pada Klasifikasi Teks Berbahasa Indonesia. Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), 10(6), 1329-1340.

Sani, D. A., & Sarwani, M. Z. (2022). Koreksi Jawaban Esai Berdasarkan Persamaan Makna Menggunakan Fasttext dan Algoritma Backpropagation. Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), 11(2), 92–111.

Sennrich, R., Haddow, B., & Birch, A. (2016). Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709.

Siahaan, M., Sitanggang, F., & Girsang, A. S. (2023). Metode Komparatif Teknik Augmentasi Data pada Klasifikasi Teks Sentimen Berbahasa Indonesia Menggunakan Dataset Terbatas. Jurnal Media Informatika Budidarma, 7(2), 743-752.

Tanaka, E. A., Anderies, A., & Chowanda, A. (2024). Evaluating Back Translation and Misspelling Correction Utilization on Indonesian AES. In 2024 5th International Conference on Artificial Intelligence and Data Sciences (AiDAS) (pp. 1-6). IEEE.

Wei, J., & Zou, K. (2019). EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks. arXiv preprint arXiv:1901.11196.

Downloads

Published

2026-03-01

How to Cite

Fadilah, N., Ahmad, K. A., & Pratama, M. I. (2026). A Comparative Evaluation of Back Translation and Easy Data Augmentation for Indonesian Automatic Short Answer Scoring. Progressive Information, Security, Computer, and Embedded System, 4(1), 25–36. https://doi.org/10.61255/pisces.v4i1.1294

Issue

Section

Articles