A Comparative Evaluation of Back Translation and Easy Data Augmentation for Indonesian Automatic Short Answer Scoring
DOI:
https://doi.org/10.61255/pisces.v4i1.1294Keywords:
Automatic Short Answer Scoring, Back Translation, Easy Data Augmentation, IndoBERT, BiLSTM, FastTextAbstract
Automatic Short Answer Scoring (AES) is a Natural Language Processing (NLP) application designed to automatically assess short-answer responses. One of the primary challenges in developing AES systems is the limited size and diversity of available datasets, which can adversely affect a model’s generalization capability. Previous studies have demonstrated that Easy Data Augmentation (EDA) based on IndoBERT-generated synonyms can improve model performance on the UKARA dataset; however, this approach remains limited because the augmentation process is performed at the word level. This study aims to compare the effectiveness of Back Translation and IndoBERT-based Synonym EDA for Indonesian AES systems using the UKARA dataset. To ensure a fair comparison, the dataset, preprocessing procedures, FastText-based text representation, BiLSTM architecture, and evaluation methods were kept consistent across experiments, allowing performance differences to be attributed solely to the augmentation techniques. The experiments were conducted using both Non-K-Fold Evaluation and 3-Fold Cross-Validation scenarios. The results indicate that Back Translation outperformed IndoBERT-based Synonym EDA in most experimental settings, achieving the highest accuracy of 89.00% on Dataset A. Furthermore, the findings suggest that the quality and semantic diversity of the generated data have a greater impact on model performance than merely increasing the amount of training data. Therefore, Back Translation can serve as an effective alternative for enhancing dataset quality and improving the performance of Indonesian AES systems.
Keywords: Automatic Short Answer Scoring, Back Translation, Easy Data Augmentation, IndoBERT, BiLSTM, FastText.
Abstract views: 0
,
PDF downloads: 0
Downloads
References
Ariyus, D., Manongga, D., & Sembiring, I. (2024). Enhancing Sentiment Analysis of Indonesian Tourism Video Content Commentary on TikTok: A FastText and Bi-LSTM Approach. Engineering, Technology & Applied Science Research, 14(6), 18020-18028. https://doi.org/10.48084/etasr.8859
Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. (2021). Understanding deep learning (still) requires rethinking generalization. Communications of the ACM, 64(3), 107-115.
Fadilah, N., & Priyanta, S. (2022). Automatic Essay Scoring Using Data Augmentation in Bahasa Indonesia. IJCCS (Indonesian Journal of Computing and Cybernetics Systems), 16(4), 401-410.
Herwanto, G. B., Sari, Y., Prastowo, B. N., Bustoni, I. A., & Hidayatulloh, I. (2018). UKARA: A fast and simple automatic short answer scoring system for Bahasa Indonesia. ICEAP 2019, 2, 48-53.
Nurhaliza, S., & Al Fatta, H. (2024). Analisis Sentimen Ulasan Aplikasi KAI Access Menggunakan Metode Naive Bayes Classifier dan Support Vector Machine dengan Teknik Preprocessing. Jurnal Teknoinfo, 18(1), 145-156.
Purwoko, R. A., Utami, E., & Luthfi, E. T. (2023). Klasifikasi Teks Aduan Publik Menggunakan Kombinasi TF-IDF dan Bidirectional Long Short-Term Memory (BiLSTM). Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), 7(4), 812-820.
Rahma, I. A., & Suadaa, L. H. (2023). Penerapan Text Augmentation untuk Mengatasi Data yang Tidak Seimbang pada Klasifikasi Teks Berbahasa Indonesia. Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), 10(6), 1329-1340.
Sani, D. A., & Sarwani, M. Z. (2022). Koreksi Jawaban Esai Berdasarkan Persamaan Makna Menggunakan Fasttext dan Algoritma Backpropagation. Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), 11(2), 92–111.
Sennrich, R., Haddow, B., & Birch, A. (2016). Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709.
Siahaan, M., Sitanggang, F., & Girsang, A. S. (2023). Metode Komparatif Teknik Augmentasi Data pada Klasifikasi Teks Sentimen Berbahasa Indonesia Menggunakan Dataset Terbatas. Jurnal Media Informatika Budidarma, 7(2), 743-752.
Tanaka, E. A., Anderies, A., & Chowanda, A. (2024). Evaluating Back Translation and Misspelling Correction Utilization on Indonesian AES. In 2024 5th International Conference on Artificial Intelligence and Data Sciences (AiDAS) (pp. 1-6). IEEE.
Wei, J., & Zou, K. (2019). EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks. arXiv preprint arXiv:1901.11196.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Nur Fadilah, Khawaritzmi Abdallah Ahmad, Muh. Isbar Pratama

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.






Email: