TY - JOUR AU - Krimpas, Panagiotis AU - Valavani, Christina PY - 2023/01/09 Y2 - 2024/03/29 TI - Attention mechanism and skip-gram embedded phrases: short and long-distance dependency n-grams for legal corpora JF - Comparative Legilinguistics JA - cl VL - 52 IS - SE - Articles DO - 10.14746/cl.52.2022.14 UR - https://pressto.amu.edu.pl/index.php/cl/article/view/35296 SP - 318-350 AB - <p>This article examines common translation errors that occur in the translation of legal texts. In particular, it focuses on how German texts containing legal terminology are rendered into Modern Greek by the Google translation machine. Our case study is the Google-assisted translation of the original (German) version of the Constitution of the Federal Republic of Germany into Modern Greek. A training method is proposed for phrase extraction based on the occurrence frequency, which goes through the Skip-gram algorithm to be then integrated into the Self Attention Mechanism proposed by Vaswani et al. (2017) in order to minimise human effort and contribute to the development of a robust machine translation system for multi-word legal terms and special phrases. This Neural Machine Translation approach aims at developing vectorised phrases from large corpora and process them for translation. The research direction is to increase the in-domain training data set and enrich the vector dimension with more information for legal concepts (domain specific features).</p> ER -