學習使用知識庫以改善機器翻譯｜國立清華大學博碩士論文庫

簡易檢索 / 詳目顯示

回結果列表

研究生：	蔡名喬 Tsai, Ming-Chiao
論文名稱：	學習使用知識庫以改善機器翻譯 Learning to Use Knowledge Base to Improve Machine Translation
指導教授：	張俊盛 Chang, Jason S.
口試委員:	張智星陳浩然
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 資訊工程學系 Computer Science
論文出版年：	2020
畢業學年度：	108
語文別：	英文
論文頁數：	43
中文關鍵詞：	知識庫、機器翻譯
外文關鍵詞：	Knowledge Base, Machine Translation
相關次數：	點閱：1 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

我們提出一個機器翻譯的方法，可以翻譯帶有命名實體的句子。
在研究方法中，我們將命名實體和其翻譯從雙語知識庫中抽出，以求正確翻譯在平行語料中代表性不足的命名實體。
此方法涉及偵測、鏈接和替換輸入句子中的命名實體，識別命名實體類別，以及訓練類神經機器翻譯模型並基於命名實體類別生成翻譯。
在執行時，接受文本段落，並將命名實體換成命名實體類別，然後用類神經機器翻譯模型和雙語知識庫生成翻譯。
我們將該方法應用於平行語料庫和雙語知識庫，實際製作了一個雛型翻譯系統。
根據我們對句子翻譯的評估，此模型在命名實體翻譯和增強目標句子的流利性方面上，有顯著的改進。

We introduce a method for learning to generate machine translation of a given sentence with potential rare named entities (NE).
In our approach, NEs and their translations are extracted from a bilingual knowledge base, aimed at maximizing correct translations for under-represented named entities in a parallel corpus.
The method involves linking NEs in the bilingual training sentences, replacing NEs with NE-type labels, and training a neural machine translation (NMT) model for partially lexicalized training data with regular tokens and NE-type labels.
At run time, the system accepts a text passage, links and replaces NEs with NE-type labels, and then translates the text using the trained NMT model and translate NE-type labels using a bilingual knowledge base.
We present a prototype system, WikiTrans that applies the method to a parallel corpus and \textit{Wikipedia}.
Evaluation on a set of sentences shows that the method achieves reasonably good performance in terms of generating high quality NE translations and enhancing the fluency of target sentences.

Chapter 1:   Introduction                                                           1
Chapter 2:  Related Work                                                         4
Chapter 3:  Methodology                                                         6
3.1    Problem Statement .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  . 6
3.2    Training Neural Machine Translation Model with Partially Lexicalized Sentences  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  . 7
3.2.1     Linking entities to knowledge base    .  .  .  .  .  .  .  .  .  .  .  .  .  . 7
3.2.2    Collecting Named Entity Synonyms  .  .  .  .  .  .  .  .  .  .  .  .  .  . 8
3.2.3    Aligning words in a parallel corpus   .  .  .  .  .  .  .  .  .  .  .  .  .  . 9
3.2.4    Generating NE-type labels  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   10
3.2.5    Generating Partially Lexicalized Sentences  .  .  .  .  .  .  .  .  .  .   10
3.2.6    Training Machine Translation model   .  .  .  .  .  .  .  .  .  .  .  .  .   11
3.3    Run-time Machine Translation .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  13
Chapter 4:  Experiments                                                         15
4.1    Datasets  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   15
4.2    Preprocessing   .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   16
4.3    Setting Hyper-parameters of NMT model .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   17
4.4    Systems Compared   .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   18
4.5    Evaluation Metric .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .  .   19
Chapter 5:   Results and Discussion                                                        20
Chapter 6:   Conclusion and Future Work                                                        23
Appendices                                                        24
Reference                                                        41
                                

Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473, 2014.
Peter F Brown, Vincent J Della Pietra, Stephen A Della Pietra, and Robert L Mercer. The mathematics of statistical machine translation: Parameter estimation. Computational linguistics, 19(2):263–311, 1993.
Joseph Chang, Richard Tzong-Han Tsai, and Jason S Chang. Wikisense: Supersense tagging of wikipedia named entities based wordnet. In Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 1, pages 72–81, 2009.
Kyunghyun Cho, Bart Van Merri ̈enboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078, 2014.
Chris Dyer, Victor Chahuneau, and Noah A Smith. A simple, fast, and effective reparameterization of ibm model 2. 2013.
Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, and Yann N Dauphin. Convolutional sequence to sequence learning. In Proceedings of the 34th International Conference on Machine Learning-Volume 70, pages 1243–1252. JMLR. org, 2017.
Jiatao Gu, Zhengdong Lu, Hang Li, and Victor OK Li. Incorporating copying mechanism in sequence-to-sequence learning. arXiv preprint arXiv:1603.06393, 2016
Caglar Gulcehre, Sungjin Ahn, Ramesh Nallapati, Bowen Zhou, and Yoshua Bengio. Pointing the unknown words. arXiv preprint arXiv:1603.08148, 2016.
S ́ebastien Jean, Kyunghyun Cho, Roland Memisevic, and Yoshua Bengio. On using very large target vocabulary for neural machine translation. arXiv preprint arXiv:1412.2007, 2014.
Guillaume Klein, Yoon Kim, Yuntian Deng, Jean Senellart, and Alexander M
Rush. Opennmt: Open-source toolkit for neural machine translation. arXiv preprint arXiv:1701.02810, 2017.
Philipp Koehn, Franz Josef Och, and Daniel Marcu. Statistical phrase-based translation. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology-Volume 1, pages 48–54. Association for Computational Linguistics, 2003.
Nikolaos Kolitsas, Octavian-Eugen Ganea, and Thomas Hofmann. End-to-end neural entity linking. arXiv preprint arXiv:1808.07699, 2018.
Minh-Thang Luong, Ilya Sutskever, Quoc V Le, Oriol Vinyals, and Wojciech Zaremba. Addressing the rare word problem in neural machine translation. arXiv preprint arXiv:1410.8206, 2014.
Minh-Thang Luong, Hieu Pham, and Christopher D Manning. Effective approaches to attention-based neural machine translation. arXiv preprint arXiv:1508.04025, 2015.
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics, pages 311–318. Association for Computational Linguistics, 2002.
Rico Sennrich, Barry Haddow, and Alexandra Birch. Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709, 2015a.
Rico Sennrich, Barry Haddow, and Alexandra Birch. Neural machine translation of rare words with subword units. arXiv preprint arXiv:1508.07909, 2015b
Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In Advances in neural information processing systems, pages 3104–3112, 2014.
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in neural information processing systems, pages 5998–6008, 2017.

簡易檢索 / 詳目顯示

相關論文