An Enhancement Method for Japanese-English Automated Translation

Main Article Content

Bartholomäus Wloka
Werner Winiwarter

Abstrakt

We present a method for improving existing statistical machine translation methods using a knowledge base compiled from a bilingual corpus as well as sequence alignment and pattern matching techniques from the area of machine learning and bioinformatics. An alignment algorithm identifies similar sentences, which are then used to construct a better word order for the translation. Our preliminary test results indicate a significant improvement of the translation quality.
 

Downloads

Download data is not yet available.

Article Details

Jak cytować
Wloka, B., & Winiwarter, W. (2010). An Enhancement Method for Japanese-English Automated Translation. Investigationes Linguisticae, 21, 99-118. https://doi.org/10.14746/il.2010.21.7
Dział
Artykuły

Bibliografia

  1. Christian Boitet, Herve Blanchon, Mark Seligman, Valerie Bellynck. Evolution of MT with the Web: In: Proceedings of the Conference ”Machine Translation 25 Years On”: Cranfield, England (2009)
  2. Francis Bond et al.: Enhancing the Japanese WordNet. In: Proceedings of the 7th Workshop on Asian Language Resources, in conjunction with ACL-IJCNLP. (2009)
  3. Eric Brill: Transformation-based error-driven learning, natural language processing: a case study in part-of-speech tagging: Comput. Linguist, MIT Press, 21, 4, 543–565, (1995)
  4. Peter Brown et al.: A statistical approach to machine translation, Comput. Linguist. MIT Press 16, 2, 79–85 (1990)
  5. Chris Callison-Burch, Miles Osborne: Re-evaluating the role of BLEU in machine translation research. In: Proceedings of the Conference EACL, pp. 249–256.(2006)
  6. Michael Carl, Andy Way, Walter Daelemans: Recent Advances in Example-Based Machine Translation: Comput. Linguist. MIT Press, 30, 4, 516–520 (2004)
  7. Django Project http://www.djangoproject.com
  8. G. Doddington: Automatic evaluation of machine translation quality using n-gram cooccurrency statistics. In: Proceedings of the ARPAWorkshop of Human Language Technology. (2002)
  9. Excite Japan, Translation www.excite.co.jp/world/english
  10. M. Finke, P. Geutner, H. Hild, T. Kemp, K. Ries, M. Westphal: The Karlsruhe-Verbmobil Speech Recognition Engine. Acoustics, Speech,, Signal Processing: IEEE Computer Society 1, 83 (1997)
  11. Catherine Havasi and Rob Speer and Jason Alonso: ConceptNet 3:a Flexible, Multilingual Semantic Network for Common Sense Knowledge. In: Proceedings of Recent Advances in Natural Language Processing. (2007)
  12. Hieu Hoang, Alexandra Birch, Chris Callison-burch, Richard Zens, Rwth Aachen, Alexandra Constantin, Marcello Federico, Nicola Bertoldi, Chris Dyer, Brooke Cowan, Wade Shen, Christine Moran, Ondrej Bojar: Moses: Open source toolkit for statistical machine translation, pp. 177–180. Association for Computational Linguistics (2007)
  13. W. John Hutchins, Harold L. Somers: An Introduction to Machine Translation: Academic Press (1992)
  14. Andreas Karwath, Kristian Kersting: Relational Sequence Alignments, Logos. In: Inductive Logic Programming 16th International Conference, pp. 290–304. Springer (2007)
  15. Andreas Karwath, Kristian Kersting: Relational Sequence Alignments. In: Proceedings of the 4th International Workshop on Mining, Learning with Graphs (MLG’06) (2006)
  16. Andreas Karwath, Kristian Kersting, Niels Landwehr: Boosting Relational Sequence Alignments. In: Proceedings of the 8th IEEE International Conference on Data Mining (2008)
  17. Kristian Kersting, Luc De Raedt, Bernd Gutman, Andreas Karwath, Niels Landwehr:
  18. Probabilistic Inductive Logic Programming: Relational Sequence Learning: Springer Berlin/Heidelberg (2008)
  19. Taku Kudo, Yuji Matsumoto: Japanese Dependency Analysis using Cascaded Chunking: In: CoNLL 2002: Proceedings of the 6th Conference on Natural Language Learning 2002 (COLING 2002 Post-Conference Workshops), pp. 63-69 (2002)
  20. H. Liu: An end-to-end natural language processor with common sense: MIT Media Lab (2004)
  21. H. Liu, P. Singh: ConceptNet — A Practical Commonsense Reasoning Tool-Kit: BT Technology Journal, Kluwer Academic Publishers 22, 4, 211–226 (2004)
  22. S. Makino, M. Tsutsui: A Dictionary of Basic Japanese Grammar: The Japan Times (1986)
  23. Yuji Matsumoto, Akira Kitauchi, Tatsuo Yamashitaka Hirano, Hiroshi Matsuda, Kazuma Takaoka, Masayuki Asahara: Japanese Morphological Analysis System ChaSen version 2.2.1: Users Manual (2000)
  24. Y.M. McClain: Handbook of Modern Japanese Grammar: The Hokuseido Press (1981)
  25. Teruko Mitamura, Nyberg Eric: Hierarchical Lexical Structure, Interpretive Mapping in Machine Translation. In: Proceedings of the 14th Conference on Computational Linguistics, pp. 1254–1258. Association for Computational Linguistics (1992)
  26. Makato Nagao: A framework of a mechanical translation between Japanese, English by analogy principle. In: Proceedings of the international NATO symposium on Artificial, human intelligence, pp. 173–180 Elsevier North-Holland, Inc. (1984)
  27. S.B. Needleman, C.D. Wunsch: A general method applicable to the search for similarities in the amino acid sequence of two proteins: Journal of Molecular Biology 48, 2, 443-453 (1970)
  28. Franz Josef Och, Hermann Ney: A Systematic Comparison of Various Statistical Alignment Models: Computational Linguistics, 1, 29, 19–51, (2003)
  29. Nicholas Ostler: The Jungle Is Neutral – Newcomer Languages Face New Media. In: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, University Politecnica de Catalunya Barcelona Spain (2009)
  30. Kishore Papineni, Salim Roukos, Todd Ward, Wei-Jing Zhu: BLEU: a method for automatic evaluation of machine translation. In: ACL ’02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics
  31. T.F. Smith, M.S. Waterman: Identification of common molecular subsequences: Journal of Molecular Biology, 147, 195–197 (1981)
  32. Yasushito Tanaka: Compilation of a multilingual parallel corpus. In: Proceedings of the PACLING 2001, pp. 265–268. Kyushu, Japan (2001)
  33. Masao Utiyama, Hitoshi Isahara: Reliable measures for aligning Japanese-English news articles, sentences. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, pp. 72–79. Association for Computational Linguistics (2003)
  34. Frank Wilcoxon: Individual Comparisons by Ranking Methods: Biometrics Bulletin, 1, 6, 80–83 (1945)
  35. Yorick Wilks: Machine Translation: Its Scope and Limits. Springer (2008)
  36. Werner Winiwarter: WILLIE – a Web Interface for a Language Learning, Instruction Environment. In: Proceedings of the 6th International Conference on Web-based Learning: Springer (2008)
  37. WernerWiniwarter: WETCAT – Web-Enabled Translation Using Corpus-Based Acquisition of Transfer Rules. In: Proceedings of the Third IEEE International Conference on Innovations in Information Technology: Dubai, United Arab Emirates (2006)
  38. Bartholom¨aus Wloka: Enhancing Japanese-English Machine Translation – A Hybrid Approach: University of Freiburg (2009)
  39. Bartholom¨aus Wloka, Werner Winiwarter Project Web-site https://wloka.dac.univie.ac.at/project/