研究生: |
陳怡君 Chen, Yi-Jyun. |
---|---|
論文名稱: |
奠基於雙語自動對齊之動介片語翻譯改進 Improving Phrase Translation Based on Sentence Alignment of Chinese-English Parallel Corpus |
指導教授: |
張俊盛
Chang, Jason S. |
口試委員: |
張智星
JANG, JYH-SHING 陳浩然 Chen, Hao-Jan |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2020 |
畢業學年度: | 108 |
語文別: | 中文 |
論文頁數: | 71 |
中文關鍵詞: | 雙語句子對齊 、文法規則 、搭配詞 、片語翻譯 |
外文關鍵詞: | Sentence Alignment, Grammar Patterns, Collocations, Phrase Translations |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本研究呈現片語查詢的雛形系統「PrecisePhraseBook」,能從雙語語料庫中,自動擷取英文名詞與介系詞搭配的中文翻譯及例句,可輔助語言學習者學習語言,亦能改善機器翻譯或提供語言研究者撰寫文法規則之參考。本方法利用雙語語料庫擷取英文片語的中文翻譯。其方法為使用統計方法由雙語語料庫中的詞彙自動對齊,分別擷取名詞及介系詞的翻譯,再根據由中文語料庫統計而來的中文高頻搭配詞,將名詞及介系詞的翻譯做適當調整,並產生例句。系統執行時,使用者輸入一組英文名詞與介系詞的搭配,系統會呈現資料庫中此搭配的翻譯及例句。本研究的評估方式是隨機抽取三十組名詞及介系詞的搭配,人工評估本研究方法產生的翻譯。
This thesis presents a phrases searching system, PrecisePhraseBook, which provides Chinese translations and example sentences of English phrases with a noun and preposition to assist learners in learning English or Chinese. PrecisePhraseBook provides researchers a reference tool for generating grammar rules. We propose a method for extracting Chinese translations of English phrases from bilingual parallel corpora. We use statistical methods to extract translations of nouns and prepositions from bilingual parallel corpora with sentence alignment, and then adjust the translations according to the Chinese collocations extracted from a Chinese corpus. Finally, we generate example sentences for the translations. At run-time, the user enters an English phrase with a noun and a preposition, and the system retrieves translations and example sentences from the database and presents the results to the user. The evaluation is done using randomly 30 selected phrases. We used human judge in assess the translations.
Peter F. Brown, John Cocke, Stephen Della Pietra, Vincent J. Della Pietra, Fred- erick Jelinek, John D. La↵erty, Robert L. Mercer, and Paul S. Roossin. A sta- tistical approach to machine translation. Comput. Linguistics, 16:79–85, 1990.
Peter F. Brown, Jennifer C. Lai, and Robert L. Mercer. Aligning sentences in parallel corpora. In ACL, 1991.
Roberta Catizone, Graham Russell, and Susan J. Warwick. Deriving translation data from bilingual texts. 1989.
Stanley F. Chen. Aligning sentences in bilingual corpora using lexical information. In ACL, 1993.
Pascale Fung. A pattern matching method for finding noun and proper noun translations from noisy parallel corpora. In ACL, 1995.
William A. Gale and Kenneth Ward Church. Identifying word correspondences in parallel texts. In HLT, 1991.
Ming Hsien Ko. Alignment of multi-word expressions in parallel corpora. 2006. I. Dan Melamed. Automatic evaluation and uniform filter cascades for inducing n-best translation lexicons. ArXiv, cmp-lg/9505044, 1995.
Robert C. Moore. Towards a simple and accurate statistical approach to learning translation relationships among words. In DDMMT@ACL, 2001.
Michel Simard, George F. Foster, and Pierre Isabelle. Using cognates to align sentences in bilingual corpora. In CASCON, 1993.
Frank Smadja. Retrieving collocations from text: Xtract. Computational Linguistics, 19:143–177, 1993.
Dekai Wu and Xuanyin Xia. Learning an english-chinese lexicon from a parallel corpus. In AMTA, 1994.