تحليل بنود الأسئلة مهارات اللغة العربية في المدرسة الابتدائية الإسلامية طريق الهدى مالانج: دراسة حالة
Analysis of Arabic Language Skills Test Items at Al-Huda Islamic Elementary School Malang: A Case Study

https://doi.org/10.35719/arkhas.v5i2.2366

Authors
Putri Damara Chaniago, UIN Maulana Malik Ibrahim Malang, Indonesia
Ziana walida S, UIN Maulana Malik Ibrahim Malang, Indonesia
Yessita Amanda Putri, UIN Maulana Malik Ibrahim Malang, Indonesia
Nur Qamari, UIN Maulana Malik Ibrahim Malang, Indonesia

Keywords
Learning evaluation, test items, Arabic language skills, test validity, case study

Abstract
Evaluation is the process of assigning a value to an object according to defined criteria. This study aims to analyze the quality of the Arabic language skills test items at SD Islam MI Thoriqul Huda Malang and to assess how well they conform to the principles of constructing language skills test items. The study uses a qualitative approach with a case study design. Primary data consist of the Arabic language skills test documents; secondary data consist of teacher interviews and other supporting documents. Data were collected through documentation and interviews and analyzed using the Miles and Huberman model (data reduction, data display, and conclusion drawing). The results show that the test items cover the four main language skills: listening, speaking, reading, and writing. The items take several forms, including multiple-choice questions, essay questions, and performance tasks. The teacher compiled the questions with reference to the learning objectives and the students' abilities, drawing on a question bank. Daily assessments were carried out separately for each skill, while summative assessments tended to combine all skills into one test that focused more on reading and writing.
Measured against assessment principles, the test items were generally valid and relevant; however, the evaluation of speaking and listening skills fell short because of limited resources. The study recommends refining the evaluation tools, clarifying instructions, and aligning test formats so that all language skills are measured adequately.

How to Cite
تحليل بنود الأسئلة مهارات اللغة العربية في المدرسة الابتدائية الإسلامية طريق الهدى مالانج: دراسة حالة: Analysis of Arabic Language Skills Test Items at Al-Huda Islamic Elementary School Malang: A Case Study. (2025). Journal of Arabic Language Teaching, 5(2), 337-354. https://doi.org/10.35719/arkhas.v5i2.2366

Submitted
2025-12-27

Vol. 5 No. 2 (2025): ARKHAS ~ Journal of Arabic Language Teaching

Section
Articles

Copyright (c) 2025 Putri Damara Chaniago, Ziana walida S, Yessita Amanda Putri, Nur Qamari

This work is licensed under a Creative Commons Attribution 4.0 International License.