Optimizing Students' Language Skills Through a Multimodal Learning Model in Indonesian Language Learning in Elementary Schools: A Systematic Literature Review

Authors

  • Budi Febriyanto Febriyanto Universitas Pendidikan Indonesia
  • Dadang Sunendar Universitas Pendidikan Indonesia
  • Bachrudin Musthafa Universitas Pendidikan Indonesia
  • Yuliawati Universitas Pendidikan Indonesia
  • Agus Rofi'i Universitas Majalengka

DOI:

https://doi.org/10.61255/jupiter.v4i1.873

Keywords:

Elementary School, Indonesian Language Learning, Language Skills, Multimodal Learning, Systematic Literature Review

Abstract

Background: Multimodal learning has gained increasing attention in language education because it enables learners to construct meaning through text, visuals, audio, gesture, space, and social interaction. However, the literature remains fragmented, and no integrated model has been clearly established for Indonesian language learning in elementary schools. Purpose: This study analyses the conceptual and pedagogical characteristics and design components of multimodal learning, its influence on students' language-skill development, and the conceptual, methodological, and assessment gaps in the literature. Methods: This study used a Systematic Literature Review (SLR) design. Articles indexed in Scopus were selected through the PRISMA flow. The search identified 277 records, and 44 reports were included in the final analysis. Findings: The integration of text, visuals, audio, gestures, social interaction, and meaning-making activities within structured instructional designs characterises multimodal learning. Across the reviewed studies, it tends to support reading, writing, speaking, listening, vocabulary development, and communicative competence. However, its effectiveness varies depending on instructional design, teacher readiness, student characteristics, and classroom context. The literature also remains conceptually, methodologically, and contextually fragmented, especially regarding Indonesian language learning in elementary schools. Research implications: The findings provide a conceptual foundation for developing Indonesian language instruction that is more contextual, participatory, and supportive of integrated language-skill development. They also offer guidance for designing more coherent instructional models, implementation strategies, and assessment systems for elementary school settings. Conclusion: Multimodal learning should be understood not merely as media variation, but as a design of meaning and learning experience. Future research needs to test integrated multimodal models directly in Indonesian elementary school language classrooms. Originality: This study systematically maps the conceptual foundations, pedagogical patterns, empirical trends, and research gaps in multimodal learning as a basis for developing Indonesian language-learning models in elementary schools. The review highlights that the existing literature remains fragmented and has yet to produce many fully integrated models for this context.

Abstract views: 6 , PDF downloads: 1

Downloads

Download data is not yet available.

References

Alsubaie, M. A. (2022). Distance education and the social literacy of elementary school students during the Covid-19 pandemic. Heliyon, 8(7), e09811. https://doi.org/10.1016/j.heliyon.2022.e09811

Ayetiran, E. F., & Özgöbek, Ö. (2024a). A Review of Deep Learning Techniques for Multimodal Fake News and Harmful Languages Detection. IEEE Access, 12, 76133–76153. https://doi.org/10.1109/ACCESS.2024.3406258

Ayetiran, E. F., & Özgöbek, Ö. (2024b). An inter-modal attention-based deep learning framework using unified modality for multimodal fake news, hate speech and offensive language detection. Information Systems, 123, 102378. https://doi.org/10.1016/j.is.2024.102378

Bai, Y., & Lei, S. (2025). Cross-language dissemination of Chinese classical literature using multimodal deep learning and artificial intelligence. Scientific Reports, 15(1), 21648. https://doi.org/10.1038/s41598-025-05921-1

Boaventura, D., Neves, A. T., Santos, J., Pereira, P. C., Luís, C., Monteiro, A., Cartaxana, A., Hawkins, S. J., Caldeira, M. F., & Ponces De Carvalho, A. (2021). Promoting Ocean Literacy in Elementary School Students Through Investigation Activities and Citizen Science. Frontiers in Marine Science, 8, 675278. https://doi.org/10.3389/fmars.2021.675278

Carter, H., & Abbott, J. (2024). Literacy Teachers in the Making: A Look at Teacher Candidates’ Experiences as they Tutor Elementary Students. Literacy Research and Instruction, 63(1), 79–101. https://doi.org/10.1080/19388071.2023.2167676

Chung, K., Kim, S., Jang, Y., Choi, S., & Kim, H. (2024). Developing an AI literacy diagnostic tool for elementary school students. Education and Information Technologies, 30, 1013–1044. https://doi.org/10.1007/s10639-024-13097-w

Condie, C., & Pomerantz, F. (2020). Elementary students’ literacy opportunities in an age of accountability and standards: Implications for teacher educators. Teaching and Teacher Education, 92, 103058. https://doi.org/10.1016/j.tate.2020.103058

Dahl-Leonard, K., Hall, C., & Peacott, D. (2024). A meta-analysis of technology-delivered literacy instruction for elementary students. Educational Technology Research and Development, 72(3), 1507–1538. https://doi.org/10.1007/s11423-024-10354-0

Ding, A.-C. E., Glazewski, K., & Pawan, F. (2022). Language teachers and multimodal instructional reflections during video-based online learning tasks. Technology, Pedagogy and Education, 31(3), 293–312. https://doi.org/10.1080/1475939X.2022.2030790

Engman, M. M. (2021). A worksheet, a whiteboard, a teacher-learner: Leveraging materials and colonial language frames for multimodal indigenous language learning. Classroom Discourse, 12(1–2), 75–100. https://doi.org/10.1080/19463014.2020.1856696

Farías, M., & Véliz, L. (2016). Efl Students’ Metaphorical Conceptualizations Of Language Learning. Trabalhos Em Linguística Aplicada, 55(3), 833–850. https://doi.org/10.1590/010318135146185751

Garcia, M. (2026). Multilingual language learning in a multimodal metaverse: A multidimensional study of communicative, affective, and cognitive development. Innovation in Language Learning and Teaching, 1–27. https://doi.org/10.1080/17501229.2026.2621262

Gladys, A., & Vetriselvi, V. (2024). Sentiment analysis on a low-resource language dataset using multimodal representation learning and cross-lingual transfer learning. Applied Soft Computing, 157, 111553. https://doi.org/10.1016/j.asoc.2024.111553

Goo, M., Myers, D., Maurer, A. L., & Serwetz, R. (2020). Effects of Using an iPad to Teach Early Literacy Skills to Elementary Students With Intellectual Disability. Intellectual and Developmental Disabilities, 58(1), 34–48. https://doi.org/10.1352/1934-9556-58.1.34

Hadad, S., Watted, A., & Blau, I. (2023). Cultural background in digital literacy of elementary and middle school students: Self‐appraisal versus actual performance. Journal of Computer Assisted Learning, 39(5), 1591–1606. https://doi.org/10.1111/jcal.12820

Hagerman, M. S., Cotnam-Kappel, M., Turner, J.-A., & Hughes, J. M. (2022). Literacies in the Making: Exploring elementary students’ digital-physical meaning-making practices while crafting musical instruments from recycled materials. Technology, Pedagogy and Education, 31(1), 63–84. https://doi.org/10.1080/1475939X.2021.1997794

Heo, Y., Kang, S., & Seo, J. (2023). Natural-Language-Driven Multimodal Representation Learning for Audio-Visual Scene-Aware Dialog System. Sensors, 23(18), 7875. https://doi.org/10.3390/s23187875

Hermes, M. R., Engman, M. M., Meixi, & McKenzie, J. (2023). Relationality and Ojibwemowin in Forest Walks: Learning from Multimodal Interaction about Land and Language. Cognition and Instruction, 41(1), 1–31. https://doi.org/10.1080/07370008.2022.2059482

Hong, J., & Kim, K. (2025). Impact of AIoT education program on digital and AI literacy of elementary school students. Education and Information Technologies, 30(1), 107–130. https://doi.org/10.1007/s10639-024-12758-0

Huang, Y., Xu, W., Sukjairungwattana, P., & Yu, Z. (2024). Learners’ continuance intention in multimodal language learning education: An innovative multiple linear regression model. Heliyon, 10(6), e28104. https://doi.org/10.1016/j.heliyon.2024.e28104

Jensen, M. T., Solheim, O. J., & Olsen, E. (2025). Leader support in relation to teacher self-efficacy, classroom emotional climate and students’ literacy skills in elementary school. Scandinavian Journal of Educational Research, 69(4), 729–742. https://doi.org/10.1080/00313831.2024.2348451

Li, L., Bai, X., Xu, J., Wang, D., & Jiang, T. (2025). Multimodal learning audio-visual detection for obtaining object-level sound sources in Japanese-language teaching room. Scientific Reports, 15(1), 16632. https://doi.org/10.1038/s41598-025-00588-0

Li, W., Yu, J., Zhang, Z., & Liu, X. (2022). Dual Coding or Cognitive Load? Exploring the Effect of Multimodal Input on English as a Foreign Language Learners’ Vocabulary Learning. Frontiers in Psychology, 13, 834706. https://doi.org/10.3389/fpsyg.2022.834706

Lin, J., Zhang, H., & Lin, X. (2022). Prosodic Transfer in English Literacy Skills among Chinese Elementary-Age Students: Controlling for Non-Verbal Intelligence. Journal of Intelligence, 10(4), 114. https://doi.org/10.3390/jintelligence10040114

Maijala, M. (2023). Multimodal postcards to future selves: Exploring pre-service language teachers’ process of transformative learning during one-year teacher education programme. Innovation in Language Learning and Teaching, 17(1), 72–87. https://doi.org/10.1080/17501229.2021.1919683

Malone, J., Hui, B., Pandža, N., & Tytko, T. (2025). Eye Movements, Item Modality, and Multimodal Second Language Vocabulary Learning: Processing and Outcomes. Language Learning, lang.70007. https://doi.org/10.1111/lang.70007

Melo-Pfeifer, S., & Chik, A. (2022). Multimodal linguistic biographies of prospective foreign language teachers in Germany reconstructing beliefs about languages and multilingual language learning in initial teacher education. International Journal of Multilingualism, 19(4), 499–522. https://doi.org/10.1080/14790718.2020.1753748

Meneses, A., Uccelli, P., & Valeri, L. (2023). Teacher Talk and Literacy Gains in Chilean Elementary Students: Teacher Participation, Lexical Diversity, and Instructional Non-present Talk. Linguistics and Education, 73, 101145. https://doi.org/10.1016/j.linged.2022.101145

Nguyen-Thi, M.-H., Tran, K.-X., & Giang, T.-V. (2025). Exploring the emotional experience in learning Chinese as a second language of students from the multimodal affective perspective: A case study in Vietnam. Acta Psychologica, 260, 105575. https://doi.org/10.1016/j.actpsy.2025.105575

Pellicer-Sánchez, A. (2022). Multimodal reading and second language learning. ITL - International Journal of Applied Linguistics, 173(1), 2–17. https://doi.org/10.1075/itl.21039.pel

Rahmanu, I. W. E. D., & Molnár, G. (2024). Multimodal immersion in English language learning in higher education: A systematic review. Heliyon, 10(19), e38357. https://doi.org/10.1016/j.heliyon.2024.e38357

Ramezanali, N., Uchihara, T., & Faez, F. (2021). Efficacy of Multimodal Glossing on Second Language Vocabulary Learning: A Meta‐analysis. TESOL Quarterly, 55(1), 105–133. https://doi.org/10.1002/tesq.579

Scott, J.-A. (2020). (Re)directing a university storytelling troupe for at-risk elementary students for course credit: A story of embodied empathy, literacy, and personal transformation. Text and Performance Quarterly, 40(2), 170–186. https://doi.org/10.1080/10462937.2019.1691742

Sheng, H., Shen, X., Du, H., & Yu, X. (2026). Mobile Auslan: A multimodal dialogue-centered sign language learning system. Computer Vision and Image Understanding, 265, 104646. https://doi.org/10.1016/j.cviu.2026.104646

Tong, P., & An, I. S. (2026). Synaesthesia in digital multimodal composing: The case of a mobile-assisted task for learning Chinese as a foreign language. Computer Assisted Language Learning, 1–40. https://doi.org/10.1080/09588221.2025.2605538

Umino, T. (2023). Using multimodal language learning histories to understand learning experiences and beliefs of second language learners in Japan. The Modern Language Journal, 107(1), 308–327. https://doi.org/10.1111/modl.12828

Wahyudi, L. (2024). Watase Uake: Research Collaboration Tools. Retrieved from https://www.watase.web.id. https://www.watase.web.id

Yang, Y., Yang, Y.-Q., Ren, G., & Yu, B.-G. (2025). Hierarchically trusted evidential fusion method with consistency learning for multimodal language understanding. Knowledge-Based Systems, 312, 113164. https://doi.org/10.1016/j.knosys.2025.113164

Yoon, J., Choi, G., & Choi, C. (2023). Multimedia analysis of robustly optimized multimodal transformer based on vision and language co-learning. Information Fusion, 100, 101922. https://doi.org/10.1016/j.inffus.2023.101922

Yue, M., Jong, M. S.-Y., Dai, Y., & Lau, W. W.-F. (2025). Students as AI literate designers: A pedagogical framework for learning and teaching AI literacy in elementary education. Journal of Research on Technology in Education, 1–22. https://doi.org/10.1080/15391523.2025.2449942

Zhou, M., Steinberg, S., Stiso, C., Danish, J. A., & Craig, K. (2024). Using network visualizations to engage elementary students in locally relevant data literacy. Information and Learning Sciences, 125(3/4), 209–231. https://doi.org/10.1108/ILS-06-2023-0069

Downloads

Published

2026-03-31

How to Cite

Febriyanto, B. F., Sunendar, D., Musthafa, B., Yuliawati, Y., & Rofi'i, A. (2026). Optimizing Students’ Language Skills Through a Multimodal Learning Model in Indonesian Language Learning in Elementary Schools: A Systematic Literature Review. Jurnal Pendidikan Terapan, 4(1), 61–77. https://doi.org/10.61255/jupiter.v4i1.873

Issue

Section

Articles