A comparative evaluation of machine translation vs. human translation for legal texts: a case study of translations between English to Arabic
Journal cover Comparative Legilinguistics, volume 63, year 2025
PDF

Keywords

neural machine translation
machine translation
legal translation
ChatGPT
Google Translate
human translation
Arabic-English translation

How to Cite

Abdelaal, N., & Al Sawi, I. (2025). A comparative evaluation of machine translation vs. human translation for legal texts: a case study of translations between English to Arabic . Comparative Legilinguistics, 63, 186–223. https://doi.org/10.14746/cl.2025.63.1

Abstract

This study examines the comparative accuracy and fluency of Neural Machine Translations (NMTs) and Language Model-based translations (LLMs), represented by ChatGPT and Google Translate (GT), in legal contexts. Texts from Farahaty’s "Arabic-English-Arabic Legal Translation," sourced from primary texts cited in the book and translated by renowned scholars such as Hatim, Shunnaq, Buckley, and Farahaty, were used as benchmarks for human translation (HT). Sixteen diverse texts encompassing various legal discourse subgenres were selected for analysis, with all Arabic in-text examples transliterated using the Library of Congress (LOC) system. Qualitative analysis was conducted to assess the extent to which NMTs and LLMs match HT in accuracy and fluency. The study also investigates the similarities and differences between ChatGPT and GT in their translation outputs. Findings highlight HT’s superiority in producing precise, stylistically appropriate translations, compared to the challenges faced by NMTs and LLMs in capturing legal terminology and subtle linguistic nuances. Despite variations, both ChatGPT and GT demonstrate efficiency and context sensitivity, suggesting their potential as valuable tools when coupled with human post-editing. The study concludes by advocating for a hybrid approach that leverages the strengths of automated translation systems and human expertise to enhance cross-linguistic legal communication.

https://doi.org/10.14746/cl.2025.63.1
PDF

References

Abdelaal, N. M., & Alazzawie, A. (2020). Machine translation: The case of Arabic-English translation of news texts. Theory & Practice in Language Studies, 10(4), 415-424.

Al Sawi I., & Allam, R. (2024). Exploring challenges in audiovisual translation: A comparative analysis of human- and AI-generated Arabic subtitles in Birdman. PLOS ONE 19(10): e0311020.

Al Sawi, I., & Alaa, A. (2024). Navigating the impact: a study of editors’ and proofreaders’ perceptions of AI tools in editing and proofreading. Discov Artif Intell 4, 23 (2024).

Alcaraz, E., & Hughes, B. (2002). Legal translation explained. Routledge.

Badawi, E., Carter, M., & Gully, A. (2004). Modern Written Arabic: A Comprehensive Grammar. Routledge.

Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. International Conference on Learning Representations. https://arxiv.org/abs/1409.0473

Baniata, L. H., Ampomah, I. K., & Park, S. (2021). A transformer-based neural machine translation model for Arabic dialects that utilizes subword units. Sensors, 21(19),

Bhatia, V. K. (1993). Analyzing genre: Language use in professional settings. Longman.

Borowski, M. (2015). Language and its influence on how we understand reality. SKASE Journal of Theoretical Linguistics, 12(2), 70-91.

Crystal, D., & Davy, D. (1969). Investigating English style. Longman.

El-Farahaty, H. (2015). Arabic-English-Arabic legal translation: Issues and strategies. Routledge.

Farzindar, A., & Lapalme, G. (2009). Machine translation of legal information and its evaluation. In Advances in Artificial Intelligence: 22nd Canadian Conference on Artificial Intelligence, Canadian AI 2009, Kelowna, Canada, May 25-27, 2009 Proceedings 22 (pp. 64-73). Springer.

Hatim, B., & Mason, I. (1997). The translator as communicator. Routledge.

Hendy, A., Abdelrehim, M., Sharaf, A., Raunak, V., Gabr, M., Matsushita, H., Jin Kim, Y., Afify, M., & Awadalla, H. H. (2023). How good are GPT models at machine translation? A comprehensive evaluation. CoRR. https://arxiv.org/abs/2302.09210

Jiang, Z., Lv, Q., & Zhang, Z. (2023). Distinguishing translations by human, NMT, and ChatGPT: A linguistic and statistical approach. arXiv preprint arXiv:2312.10750.

Karpinska, M., & Iyyer, M. (2023). Large language models effectively leverage document-level context for literary translation, but critical errors persist. arXiv preprint arXiv:2304.03245.

Killman, J. (2014). Vocabulary accuracy of statistical machine translation in the legal context. In Proceedings of the 11th Conference of the Association for Machine Translation in the Americas (pp. 85-98).

Killman, J. (2017). Applicability of EU multilingual resources: A case study of the translation into English of legal vocabulary in the judicial context of Spain. Babel, 63(6), 861-889.

Killman, J. (2023). Machine translation and legal terminology: Data-driven approaches to contextual accuracy. In Ł. Biel & H. J. Kockaert (Eds.), Handbook of terminology: Volume 3. Legal terminology (pp. 485-510). John Benjamins Publishing.

Kit, C., & Wong, T. M. (2008). Comparative evaluation of online machine translation systems with legal texts. Law Library Journal, 100(2), 221-240.

Kovács, T. (2022). Human and machine translation: A comparative analysis of neural machine and human-translated EN-HU and HU-EN legal texts. Porta Lingua, 1, 49-57.

Mileto, F. (2019). Post-editing and legal translation. H2D | Revista de Humanidades Digitais, 1(1), 35-46.

Sutskever, I., Vinyals, O., & Le, Q. (2014). Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems (NIPS 2014).

Tiersma, P. M. (1999). Legal language. University of Chicago Press.

Varga, M., & Grlić, T. (2023). Assessing machine translation from English to German: A case study.

Vigier-Moreno, F. J., & Pérez-Macías, L. (2022). Assessing neural machine translation of court documents: A case study on the translation of a Spanish remand order into English. Revista de Llengua i Dret, 78, 89-102.

Wilks, Y. (2008). Machine translation: Its scope and limits. Springer Science & Business Media.

Williams, C. (2005). Tradition and change in legal English: Verbal constructions in prescriptive texts. Peter Lang.

Wu, Y., Schuster, M., Chen, Z., Le, Q. V., Norouzi, M., Macherey, W., Krikun, M., Cao, Y., Gap, Q., Macherey, K., Klingner, J., Shah, A., Johnson, M., Liu, X., Kaiser, Ł., Gouws, S., Kato, Y., Kudo, T., Kazawa, H., Stevens, K., Kurian, G., Patil, N., Wang, W., Young, C., Smith, J., Riesa, J., Rudnick, A., Vinyals, O., Corrado, G., Hughes, M., & Dean, J. (2016). Google’s neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144.