簡易檢索 / 詳目顯示

研究生: 蕭孟勤
Meng-Chin Hsiao
論文名稱: 利用依存關係之詞彙翻譯
Word Translation Disambiguation via Dependency
指導教授: 張俊盛
Jason S. Chang
口試委員:
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊系統與應用研究所
Institute of Information Systems and Applications
論文出版年: 2007
畢業學年度: 95
語文別: 中文
論文頁數: 74
中文關鍵詞: 翻譯選擇統計式機器翻譯平行語料庫決策表依存關係
外文關鍵詞: translation selection, statistical machine translation, parallel corpus, decision list, dependency
相關次數: 點閱:3下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文提出了一個利用依存關係解決詞彙翻譯的新方法。我們的方法包含了訓練階段及測試階段。在訓練階段,取得與實詞具依存關係的搭配字,並在這些依存關係的條件下,學習分辨翻譯歧義的決策表(decision list)。在測試階段,對於句子中每個實詞檢查跟其有依存關係的搭配字。在測試階段,比對決策表,給予這些字一個正確翻譯。
    我們實際撰寫了程式,並利用香港新聞及香港立法會議記錄作為訓練資料。在實驗中我們用了五種不同的方法去處理測試資料並透過一個自動的擬似BLEU的評估方法去比較實驗結果。由實驗結果顯示,依存關係的確可以顯著的幫助詞彙翻譯,而實驗也證實某些依存關係是比其他的依存關係更具影響力的。


    We introduce a new method for automatically disambiguation of word translations by using dependency relationships. In our approach, we learn the relationships between translations and dependency relationships from a parallel corpus.
    The method consists of a training stage and a runtime stage. During the training stage, the system automatically learns a translation decision list based on source sentences and its dependency relationships. At runtime, for each content word in the given sentence, we give a most appropriate Chinese translation relevant to the context of the given sentence according to the decision list.
    We also describe the implementation of the proposed method using bilingual Hong Kong news and Hong Kong Hansard corpus. In the experiment, we use five different ways to translate content words in the test data and evaluate the results based an automatic BLEU-like evaluation methodology. Experimental results indicate that dependency relations can obviously help us to disambiguate word translations and some kinds of dependency are more effective than others.

    摘要 i ABSTRACT ii 致謝辭 iii Table of Content iv List of Tables v List of Figures vi Chapter 1 Introduction 1 Chapter 2 Related Work 4 Chapter 3 the Dependency Method 7 3.1 Problem Statement 7 3.2 Training the Dependency-Based Translation Model 9 3.3 Word Translation Disambiguation at Runtime 17 Chapter 4 Experiments and Evaluation 23 4.1 Experimental Setting 23 4.2 Evaluation and Discussion 27 Chapter 5 Future Work and Conclusion 34 Rerferences 36 Appendix A – Examples of Test Data 38

    Bengt Altenberg and Sylviane Grange. 2001. “The grammatical and lexical patterning of make in native and non-native student writing”. Applied Linguisics, 22(2), 173-194.
    Clara Cabezas and Philip Resnik. 2005. “Using WSD Techniques for Lexical Selection in Statistical Machine Translation”. July http://handle.dtic.mil/100.2/ADA453538
    Marine Carpuat and Dekai Wu. 2005. “Word Sense Disambiguation vs. Statistical Machine Translation”. In 43th Annual Meeting of the Association for Computation Linguistics (ACL 2005).
    Dagan, Ido, Alon Itai, and Ulrike Schwall. 1991. "Two Languages are More Informative than One". In Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL91). Berkeley.
    Philipp Koehn, and Kevin Knight. 2000. “Estimating word translation probabilities from unrelated monolingual corpora using the EM algorithm”. In Proceedings of the 17th
    National Conference on Artificial Intelligence, pages 711–715, Austin, TX.
    Cong Li and Hang Li. 2002. “Word translation disam-biguation using bilingual bootstrapping”. In Proceed-ings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 343-351.
    Yajuan Lü, Ming Zhou, Sheng Li, Changning Huang, Tiejun Zhao (2001b). “Automatic translation template acquisition based on bilingual structure alignment”. International Journal of Computational Linguistics and Chinese Language Processing. 6(1), pp. 1-26.
    Hwee Tou Ng, BinWang, and Yee Seng Chan. 2003. “Exploiting parallel texts for word sense disambiguation: An empirical study”. In Proceedings of ACL-03, Sapporo, Japan, pages 455–462.

    Karolina Owczarzak, Josef van Genabith, and Andy Way. 2007. “Dependency-Based Automatic Evaluation for Machine Translation” In Proceeding of SSST, NAACL-HLT 2007 / AMTA Workshop on Syntax and Structure and Structure in Statisical Translation, pages 80-87, Rochester, New York.
    K. Papineni, S. Roukos, T. Ward, and W. Zhu. 2002. “Bleu: a method for automatic evaluation of machine translation”. In Proceedings of 40th Annual Meeting of
    the ACL, Philadelphia.
    Ted Pedersen. 2000. “A simple approach to building ensembles of naive Bayesian classifiers for word sense disambiguation”. In Proceedings of the First Meeting of the North American Chapter of the Association for Computational Linguistics, Seattle.
    Thanh Phong Pham, Hwee Tou Ng, and Wee Sun Lee. 2005. “Word sense disambiguation with semi-supervised learning” AAAI-05, The Twentieth National Conference on Artificial Intelligence.
    Dan Klein and Christopher D. Manning. 2003. “Fast exact inference with a factored model for natural language parsing”. In Suzanna Becker, Sebastian Thrun, and Klaus Obermayer, editors, Advances in Neural Information Processing Systems 15, Cambridge, MA.MIT Press.
    D. Yarowsky. 1994. “Decision lists for lexical ambiguity resolution: Application to accent restoration in Spanish and French”. In Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, NM.
    D. Yarowsky. 1995. “Unsupervised word sense disambiguation rivaling supervised methods”. In Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, pages 189–196.
    Ming Zhou, Yuan Ding, and Changning Huang. 2001. “Improving translation selection with a new translation model trained by independent monolingual corpora”. Computational linguistics and Chinese Language Processing. Vol. 6, No. 1, pp 1-26.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE