AI and the BoLC: Streamlining legal translation
PDF

Keywords

corpus-based translation
legal translation
legal corpora
AI
machine translation

How to Cite

Giampieri, P. (2024). AI and the BoLC: Streamlining legal translation. Comparative Legilinguistics, 58, 67–90. https://doi.org/10.14746/cl.2024.58.1

Abstract

Artificial Intelligence (AI) applied in the legal field has gained considerable ground in the recent years and is used in many fields, amongst which the legal one is of no exception. This paper wishes to explore the quality of the translation (from English into Italian) of an arbitration clause performed by the ChatGPT chatbot. To do so, the automatically generated target text is post-edited by consulting the BoLC (Bononia Legal Corpus) and the web as corpus in the gov.it Italian governmental domain. General and legal dictionaries are also used. The paper findings report some inaccuracies at word level which are easily tackled by corpus consultation. In view of the results obtained, however, complete reliance upon AI-driven solutions is not recommendable at the time being. Conversely, the use of ad hoc corpora and of targeted web searches are the most feasible and reliable solutions, although time-consuming. 

https://doi.org/10.14746/cl.2024.58.1
PDF

References

Abate, Salvatore C. 1998. Il documento legale anglosassone. Milan: Hoepli.

Berezowski, Łukasz J. 2021. Translating Latin legalese: between dynamic equivalence and formal correspondence. A comparative study case of Latin legal terms and maxims in English, Italian and Polish language. In: Traduttologia e Traduzioni Vol II, ed. Joanna Ciesielka, 35–51. http://hdl.handle.net/11089/39976 (accessed May 14, 2024). DOI: https://doi.org/10.18778/8220-676-0.04

Bhatia, Vijay K. 2010. Textbook on Legal Language and Legal Writing. New Delhi: Universal Law Publishing Co.

Brooks, Chay, Cristian Gherhes, and Tim Vorley. 2020. Artificial intelligence in the legal sector: Pressures and challenges of transformation. Cambridge Journal of Regions, Economy and Society, 13(1): 135–152. DOI: https://doi.org/10.1093/cjres/rsz026

Chalkidis, Ilias, Manos Fergadiotis, and Ion Androutsopoulos. 2021. MultiEURLEX – A Multi-lingual and Multi-label Legal Document Classification Dataset for Zero-shot Cross-lingual Transfer. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), on-line and Punta Cana, Dominical Republic. https://aclanthology.org/2021.emnlp-main.559 (accessed May 14, 2024). DOI: https://doi.org/10.18653/v1/2021.emnlp-main.559

Coulthard, Malcolm, and Alison Johnson. 2007. An Introduction to Forensic Linguistics. Language in Evidence. London/New York: Routledge.

Emsley, Robin. 2023. ChatGPT: these are not hallucinations – they’re fabrications and falsifications. Schizophrenia, 9(52): 1–2. DOI: https://doi.org/10.1038/s41537-023-00379-4

Giampieri, Patrizia. 2023a. Is machine translation reliable in the legal field? A corpus-based critical comparative analysis for teaching ESP at tertiary level. ESP Today, 11(1): 119–137. DOI: https://doi.org/10.18485/esptoday.2023.11.1.6

Giampieri, Patrizia. 2023b. Legal Machine Translation Explained: MT in legal contexts. Newcastle Upon Tyne: Cambridge Scholars Publishing.

Giampieri, Patrizia. 2024. The Use of Comparable Corpora on (General) Terms and Conditions as a Pedagogical Tool in Translation Training between English and Italian. PhD dissertation. University of Malta. https://www.um.edu.mt/library/oar/handle/123456789/119427 (accessed May 14, 2024).

Gigi, Gisa M., Rajesh, Nikhitha, Sania, Ann, Mohan, Theresa, and Thekkanath Biju A. 2023. AI Contract: Future and Scope in Legal System. International Journal of Advance and Applied Research, 10(6): 237–240.

Guerreiro, Nuno M., Elena Voita, and André F.T.Martins. 2023. Looking for a needle in a haystack: A comprehensive study of hallucinations in neural machine translation. Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, May 2023. (accessed May 14, 2024). DOI: https://doi.org/10.18653/v1/2023.eacl-main.75

Jacometti, Valentina, and Barbara Pozzo. 2018. Traduttologia e linguaggio giuridico. Milan: Wolters Kluwer.

Koehn, Philip. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. In: The Tenth Machine Translation Summit Proceedings of Conference, ed. John Hutchins, 79–86. International Association for Machine Translation. https://www.pure.ed.ac.uk/ws/portalfiles/portal/26315407/MTS_2005_Koehn.pdf (accessed May 14, 2024).

Ljubešić, Nikola, and Tomaž Erjavec. 2018. JRC EU DGT Translation Memory Parsebank DGT-UD 1.0. Slovenian language resource repository CLARIN.SI. http://hdl.handle.net/11356/1197 (accessed May 14, 2024).

McCarthy, John, Minsky, Martin, Shannon, Claude E., and Rochester, Nathanielì. 1955. “A proposal for the Dartmouth summer research project on artificial intelligence”. AI Magazine, 27(4): 12.

Moneus, Mohammed A. and Yousef Sahari. 2024. Artificial intelligence and human translation: A contrastive study based on legal texts. Heliyon, 10(6): 1–14. DOI: https://doi.org/10.1016/j.heliyon.2024.e28106

Nunes Vieira, Lucas, O’Hagan, Minako, and O’Sullivan, Carol. 2021. Understanding the societal impacts of machine translation: a critical review of the literature on medical and legal use cases. Information, Communication & Society, 24(11): 1515–1532. DOI: https://doi.org/10.1080/1369118X.2020.1776370

Pontrandolfo, Gianluca. 2019. Corpus methods in legal translation studies. In: Research methods in legal translation and interpreting, eds. Łucja Biel, Jan Engberg, Rosario Martín Ruano, and Vilelmini Sosoni, 13–28. London: Routledge. DOI: https://doi.org/10.4324/9781351031226-2

Rizzo, Camino Rea, and Maria J.M. Pérez. 2012. Structure and design of the British Law Report Corpus (BLRC): a legal corpus of judicial decisions from the UK. Journal of English Studies, 10: 131–145. DOI: https://doi.org/10.18172/jes.184

Rossini, Favretti, Rema, Fabio Tamburini, and Edoardo Martelli. 2007. Words from Bononia Legal Corpus. In: Text Corpora and Multilingual Lexicography, ed. Wolfgang Tuebert, 11–30. Amsterdam: John Benjamins. DOI: https://doi.org/10.1075/bct.8.03ros

Steinberger, Ralf, Bruno Pouliquen, Anna Widiger, Camelia Ignat, Tomaž Erjavec, Dan Tufis, and Dániel Varga. 2006. The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006). Genoa, Italy, 24–26 May 2006. http://www.lrec-conf.org/proceedings/lrec2006/pdf/340_pdf.pdf (accessed May 14, 2024).

Thomson Reuters Institute. 2023a. ChatGPT and Generative AI within Law Firms. https://www.thomsonreuters.com/en-us/posts/wp-content/uploads/sites/20/2023/04/2023-Chat-GPT-Generative-AI-in-Law-Firms.pdf (accessed May 14, 2024).

Thomson Reuters Institute. 2023b. Stepping into the future: How generative AI will help lawyers improve legal service delivery. https://legal.thomsonreuters.com/content/dam/ewp-m/documents/legal/en/pdf/reports/lawyers-harnessing-power-of-ai.pdf (accessed May 14, 2024).

Tiedemann, Jörg. 2012. Parallel Data, Tools and Interfaces in OPUS. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’2012), European Language Resources Association, Istanbul, 2214–2218. http://www.lrec-conf.org/proceedings/lrec2012/pdf/463_Paper.pdf (accessed May 14, 2024).

Tiersma, Peter. 1999. Legal language. Chicago, IL: The University of Chicago Press.

Tiersma, Peter and Lawrence M. Solan. 2012. The Oxford Handbook of Language and Law. New York: Oxford University Press.

Vigier Moreno, Francisco. J. 2016. Teaching the Use of ad hoc Corpora. Language and Law / Linguagem e Direito, 3(1): 100–119. https://ojs.letras.up.pt/index.php/LLLD/article/view/1658 (accessed May 14, 2024).

Vigier Moreno, Francisco J., and María d. M. Sánchez Ramos. 2017. Using Parallel Corpora to Study the Translation of Legal System-Bound Terms: The Case of Names of English and Spanish Courts. In: Computational and Corpus-Based Phraseology EUROPHRAS 2017, Lecture Notes in Computer Science, ed. Ruslan Mitkov, 206–273. Cham: Springer. DOI: https://doi.org/10.1007/978-3-319-69805-2_19

Williams, Christopher. 2023. The Impact of Plain Language on Legal English in the United Kingdom. London/New York: Routledge. DOI: https://doi.org/10.4324/9781003025009

ChatGPT: https://chat.openai.com (accessed May 14, 2024).

Hoepli dictionary: https://www.dizionari.repubblica.it (accessed May 14, 2024).

Garner, Bryan A. 2004. Black’s Law Dictionary. Eighth edition. Eagan: Thomson West.