研究生: |
吳瑀瑄 Wu, Yu-Hsuan |
---|---|
論文名稱: |
機器學習之英文動詞誤用改正 Learning to Correct English Verb Misuse based on Revision Log |
指導教授: |
張俊盛
Chang, Jason S. |
口試委員: |
蘇豐文
Soo, Von-Wun 陳浩然 Chen, Hao-Jan |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊系統與應用研究所 Institute of Information Systems and Applications |
論文出版年: | 2017 |
畢業學年度: | 105 |
語文別: | 英文 |
論文頁數: | 50 |
中文關鍵詞: | 自動錯誤改正 、動詞推薦 、編修紀錄 |
外文關鍵詞: | Verb Suggestion, The Wiked Error Corpus, Revision Log |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
我們提出了一個偵測錯誤與改正錯誤的方法。根據劍橋學習者語料庫 (Cambridge Learner Corpus) 的統計,在英文學習者常犯的錯誤中,除了拼字錯誤外,最多的就是動詞誤用 (Replace Verb Error)。我們使用了維基百科的編修紀錄來建立通道模型 (Channel Model) ,並結合搭配詞組 (Collocation) 來取出候選動詞組,用以來取代句子中可能錯誤的動詞。我們的改錯方法包含了句子剖析 (Parsing)、動詞辨認、應用迴歸分析於通道模型、 擷取候選詞組,最後重新排序候選詞組中的動詞,來達成動詞改正建議的目的。
According to the analysis of a sample of Cambridge Learner Corpus, the most frequent type of grammatical errors is using a wrong verb. We introduce a method for learning to detect and correct potential verb errors in a given sentence. In our approach, alternative verbs are considered to replace the verb based on verb- object collocations and a channel model trained on an error-annotated corpus. The method involves parsing the sentence, identifying the verbs, applying regression on channel model, retrieving a list of alternative verbs, and re-ranking each alternative. Our method combine channel and language models, resulting in high recall and precision rates in detecting and high MRR in correcting verb misuse.
Aoife Cahill, Nitin Madnani, Joel Tetreault, and Diane Napolitano. Robust sys- tems for preposition error correction using Wikipedia revisions. In Proceedings of NAACL-HLT, pages 507–517, 2013.
Daniel Dahlmeier and Hwee Tou Ng. Grammatical error correction with alter- nating structure optimization. In Proceedings of the 49th ACL, pages 915–923, 2011.
Rachele De Felice and Stephen G Pulman. A classifier-based approach to prepo- sition and determiner error correction in L2 English. In Proceedings of the 22nd Cocling, 2008.
Mariano Felice, Zheng Yuan, Øistein E Andersen, Helen Yannakoudakis, and Eka- terina Kochmar. Grammatical error correction using hybrid systems and type filtering. In Proceedings of the 18th CoNLL: Shared Task, pages 15–24, 2014.
Christiane Fellbaum. WordNet. Wiley Online Library, 1998.
Yoko Futagi, Paul Deane, Martin Chodorow, and Joel Tetreault. A computational approach to detecting collocation errors in the writing of non-native speakers of English. Computer Assisted Language Learning, 21(4):353–367, 2008.
Jeroen Geertzen, Theodora Alexopoulou, and Anna Korhonen. Automatic linguis- tic annotation of large scale L2 databases: The EF-Cambridge open language database (EFCAMDAT). In Proceedings of the 31st Second Language Research Forum. Somerville, MA: Cascadilla Proceedings Project, 2013.
Marcin Junczys-Dowmunt Roman Grundkiewicz. The AMU system in the CoNLL- 2014 Shared Task: Grammatical error correction by data-intensive and feature- rich statistical machine translation. Proceedings of the 18th CoNLL: Shared Task, page 25, 2014.
Roman Grundkiewicz and Marcin Junczys-Dowmunt. The Wiked Error Corpus: A corpus of corrective Wikipedia edits and its application to grammatical error correction. In International Conference on Natural Language Processing, 2014.
Claudia Leacock, Martin Chodorow, Michael Gamon, and Joel Tetreault. Auto- mated grammatical error detection for language learners. Synthesis Lectures on Human Language Technologies, 7(1):1–170, 2014.
John Lee and Stephanie Seneff. Correcting misuse of verb forms. In Proceedings of ACL-08: HLT, pages 174–182, 2008.
Anne Li-E Liu, David Wible, and Nai-Lung Tsao. Automated suggestions for miscollocations. In Proceedings of the 4th Workshop on Innovative Use of NLP for Building Educational Applications, pages 47–50, 2009.
Tomoya Mizumoto, Yuta Hayashibe, Mamoru Komachi, Masaaki Nagata, and Yuji Matsumoto. The effect of learner corpus size in grammatical error correction of ESL writings. In Proceedings of the 24th Cocling, page 863, 2012.
Diane Nicholls. The Cambridge Learner Corpus: Error coding and analysis for lex- icography and ELT. In Proceedings of the Corpus Linguistics 2003 Conference, volume 16, pages 572–581, 2003.
Alla Rozovskaya and Dan Roth. Algorithm selection and model adaptation for ESL correction tasks. In Proceedings of the 49th ACL, pages 924–933, 2011.
Alla Rozovskaya and Dan Roth. Grammatical error correction: Machine translation and classifiers. Urbana, 51:61820, 2016.
Alla Rozovskaya, Dan Roth, and Vivek Srikumar. Correcting grammatical verb errors. In Proceedings of the 14th EACL, pages 358–367, 2014.
Yu Sawai, Mamoru Komachi, and Yuji Matsumoto. A learner corpus-based approach to verb suggestion for ESL. In Proceedings of the 51st ACL, pages 708–713, 2013.
C-C Shei and Helen Pain. An ESL writer’s collocational aid. Computer Assisted Language Learning, 13(2):167–182, 2000.
Frank Smadja. Retrieving collocations from text: Xtract. Computational linguistics, 19(1):143–177, 1993.
Andreas Stolcke et al. Srilm-an extensible language modeling toolkit. In Inter- speech, volume 2002, page 2002, 2002.
Joel Tetreault, Jennifer Foster, and Martin Chodorow. Using parse features for preposition selection and error detection. In Proceedings of the 48th ACL, pages 353–358, 2010.
Jian-Cheng Wu, Yu-Chia Chang, Teruko Mitamura, and Jason S Chang. Automatic collocation suggestion in academic writing. In Proceedings of the 48th ACL, pages 115–119, 2010.
Jian-Cheng Wu, Tzu-Hsi Yen, Jim Chang, Guan-Cheng Huang, Jim Chang, Hsiang-Ling Hsu, Yu-Wei Chang, and Jason S Chang. NTHU at the CoNLL- 2014 Shared Task. In Proceedings of the 18th CoNLL: Shared Task, pages 91–95, 2014.
Yuanbin Wu and Hwee Tou Ng. Grammatical error correction using integer linear programming. In Proceedings of the 51st ACL, pages 1456–1465, 2013.