研究生: |
蕭孟勤 Meng-Chin Hsiao |
---|---|
論文名稱: |
利用依存關係之詞彙翻譯 Word Translation Disambiguation via Dependency |
指導教授: |
張俊盛
Jason S. Chang |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊系統與應用研究所 Institute of Information Systems and Applications |
論文出版年: | 2007 |
畢業學年度: | 95 |
語文別: | 中文 |
論文頁數: | 74 |
中文關鍵詞: | 翻譯選擇 、統計式機器翻譯 、平行語料庫 、決策表 、依存關係 |
外文關鍵詞: | translation selection, statistical machine translation, parallel corpus, decision list, dependency |
相關次數: | 點閱:4 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出了一個利用依存關係解決詞彙翻譯的新方法。我們的方法包含了訓練階段及測試階段。在訓練階段,取得與實詞具依存關係的搭配字,並在這些依存關係的條件下,學習分辨翻譯歧義的決策表(decision list)。在測試階段,對於句子中每個實詞檢查跟其有依存關係的搭配字。在測試階段,比對決策表,給予這些字一個正確翻譯。
我們實際撰寫了程式,並利用香港新聞及香港立法會議記錄作為訓練資料。在實驗中我們用了五種不同的方法去處理測試資料並透過一個自動的擬似BLEU的評估方法去比較實驗結果。由實驗結果顯示,依存關係的確可以顯著的幫助詞彙翻譯,而實驗也證實某些依存關係是比其他的依存關係更具影響力的。
We introduce a new method for automatically disambiguation of word translations by using dependency relationships. In our approach, we learn the relationships between translations and dependency relationships from a parallel corpus.
The method consists of a training stage and a runtime stage. During the training stage, the system automatically learns a translation decision list based on source sentences and its dependency relationships. At runtime, for each content word in the given sentence, we give a most appropriate Chinese translation relevant to the context of the given sentence according to the decision list.
We also describe the implementation of the proposed method using bilingual Hong Kong news and Hong Kong Hansard corpus. In the experiment, we use five different ways to translate content words in the test data and evaluate the results based an automatic BLEU-like evaluation methodology. Experimental results indicate that dependency relations can obviously help us to disambiguate word translations and some kinds of dependency are more effective than others.
Bengt Altenberg and Sylviane Grange. 2001. “The grammatical and lexical patterning of make in native and non-native student writing”. Applied Linguisics, 22(2), 173-194.
Clara Cabezas and Philip Resnik. 2005. “Using WSD Techniques for Lexical Selection in Statistical Machine Translation”. July http://handle.dtic.mil/100.2/ADA453538
Marine Carpuat and Dekai Wu. 2005. “Word Sense Disambiguation vs. Statistical Machine Translation”. In 43th Annual Meeting of the Association for Computation Linguistics (ACL 2005).
Dagan, Ido, Alon Itai, and Ulrike Schwall. 1991. "Two Languages are More Informative than One". In Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL91). Berkeley.
Philipp Koehn, and Kevin Knight. 2000. “Estimating word translation probabilities from unrelated monolingual corpora using the EM algorithm”. In Proceedings of the 17th
National Conference on Artificial Intelligence, pages 711–715, Austin, TX.
Cong Li and Hang Li. 2002. “Word translation disam-biguation using bilingual bootstrapping”. In Proceed-ings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 343-351.
Yajuan Lü, Ming Zhou, Sheng Li, Changning Huang, Tiejun Zhao (2001b). “Automatic translation template acquisition based on bilingual structure alignment”. International Journal of Computational Linguistics and Chinese Language Processing. 6(1), pp. 1-26.
Hwee Tou Ng, BinWang, and Yee Seng Chan. 2003. “Exploiting parallel texts for word sense disambiguation: An empirical study”. In Proceedings of ACL-03, Sapporo, Japan, pages 455–462.
Karolina Owczarzak, Josef van Genabith, and Andy Way. 2007. “Dependency-Based Automatic Evaluation for Machine Translation” In Proceeding of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure and Structure in Statisical Translation, pages 80-87, Rochester, New York.
K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. “Bleu: a method for automatic evaluation of machine translation”. In Proceedings of 40th Annual Meeting of
the ACL, Philadelphia.
Ted Pedersen. 2000. “A simple approach to building ensembles of naive Bayesian classifiers for word sense disambiguation”. In Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle.
Thanh Phong Pham, Hwee Tou Ng, and Wee Sun Lee. 2005. “Word sense disambiguation with semi-supervised learning” AAAI-05, The Twentieth National Conference on Artificial Intelligence.
Dan Klein and Christopher D. Manning. 2003. “Fast exact inference with a factored model for natural language parsing”. In Suzanna Becker, Sebastian Thrun, and Klaus Obermayer, editors, Advances in Neural Information Processing Systems 15, Cambridge, MA.MIT Press.
D. Yarowsky. 1994. “Decision lists for lexical ambiguity resolution: Application to accent restoration in Spanish and French”. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, NM.
D. Yarowsky. 1995. “Unsupervised word sense disambiguation rivaling supervised methods”. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 189–196.
Ming Zhou, Yuan Ding, and Changning Huang. 2001. “Improving translation selection with a new translation model trained by independent monolingual corpora”. Computational linguistics and Chinese Language Processing. Vol. 6, No. 1, pp 1-26.