簡易檢索 / 詳目顯示

研究生: 李明璋
Lee, James M.
論文名稱: Leveraging Word Classes for Smarter Learning in a Disambiguation Task: The Case of in, at, and on
利用詞類來輔助歧異辨識: 以 in, at, 和 on 為例
指導教授: 張俊盛
Chang, Jason S.
口試委員:
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊系統與應用研究所
Institute of Information Systems and Applications
論文出版年: 2009
畢業學年度: 97
語文別: 英文
論文頁數: 36
中文關鍵詞: 文法糾正介係詞詞類歧異辨識
外文關鍵詞: grammar correction, prepositions, word classes, disambiguation, WordNet, decision lists
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • This work is a study of the problem of disambiguation of the often confused English locational prepositions, in, at, and on, and the leveraging of semantic knowledge in an automatic way to aid in solving this problem. Although native speakers of a language normally may not find any difficulty with the usage of prepositions, second language learners commonly find them troublesome and often err in their usage of them. This particular set of prepositions is interesting as the three prepositions are commonly mixed-up among second language speakers of English with Chinese as their first language and the prepositions can often be translated as or have a similar function to a single word, the Chinese word在 (zài).
    We use a decision list, a technique well-known to have been successfully applied to word sense disambiguation to this problem. Then, using a simple, yet effective method that does not require word sense disambiguation, we use WordNet to make overlapping, non-independent word class features available to the decision list learner. This helps us to achieve two goals: 1) we find that adding word class features significantly improve performance to as good as or better than doubling the amount of training data and 2) the combination of the decision list technique and word classes avoids the opaqueness of other black box-type machine learning methods, learning human-interpretable rules that are similar in quality to those described by grammarians.


    Abstract 2 List of Tables 5 Introduction 7 The Trouble with Prepositions 7 Motivations: Word Class Based Rule Learning and Avoiding Black Boxes 9 Background 12 Previous Works Using Word Classes 12 Method 13 Evaluation 14 WordNet to Word Classes 14 Decision Lists 15 Ranking Algorithm 16 Cutoffs for Feature Selection 17 Features 17 Corpus Selection and Processing 18 Instance Processing 19 Setting Parameters 19 Results 22 Importance of Various Feature Types 22 The Power of Word Classes 22 Rules Learned 25 Some Limits 30 Related Work 32 Conclusion 34

    Bitchener J., Young, S., and Cameron, D. “The Effect of Different Types of Corrective Feedback on ESL Student Writing,” Journal of Second Language Writing, 14, 192-205. 2005.

    Chodorow, M., Tetreault, J., Han, N. “Detection of Grammatical Errors Involving Prepositions,” In Proc. of the 4th ACL-SIGSEM Workshop on Prepositions, 2007.

    Clark S. and Weir D. 2002. “Class-based Probability Estimation Using a Semantic Hierarchy,” Computational Linguistics, 2002.

    Dalgish, G. “Computer-assisted ESL Research and Courseware Development,” Computers and Composition. 1985.

    De Felice, R. and Pulman S.G. “Automatically Acquiring Models of Preposition Use,” In Proc. of the 4th ACL-SIGSEM Workshop on Prepositions. Prague, Czech Republic, 2007.

    Eastwood, John. Oxford Practice Grammar: Intermediate. Oxford University Press.
    2006.

    Gamon, M., Gao J., Brockett, C., Klementiev, A., Dolan, W.B., Belenko, D., and Vanderwende, L. “Using Contextual Speller Techniques and Language Modeling for ESL Error Correction,” In Proc. of IJCNLP, Hyderabad, India, 2008.

    Graff, D., Kong J., Chen, K., and Maeda, K. English Gigaword Third Edition. Linguistic Data Consortium, Philadelphia, 2007.

    Hermet, M., Desilets, A., Szpakowicz, S. “Using the Web as a Linguistic Resource to Automatically Correct Lexico-Syntactic Errors,” The 6th Edition of the Language Resources and Evaluation Conference. Marrakech, Morocco, 2008.

    Lee, J. and Seneff, S. “Automatic Grammar Correction for Second-language Learners,” Proceedings of Interspeech, Pittsburgh, Pennsylvania, 2006.

    Miller, G. “Nouns in Wordnet,” WordNet: An Electronic Lexical Database. MIT Press, 1998.

    Miller, G., Beckwith, R., Fellbaum, C., Gross D., and Miller, K. “Introduction to Wordnet: An On-line Lexical Database,” International Journal of Lexicography. 1990.

    Resnik, P. “Selectional Preference and Sense Disambiguation,” In Proc. of ACL Siglex Workshop on Tagging Text with Lexical Semantics, 1997.

    Riloff, E. “An Empirical Study of Automated Dictionary Construction for Information Extraction in Three Domains,” Aritificial Intelligence. Vol. 85., 1996.

    Rivest, R. “Learning Decision Lists,” In Machine Learning. 1987.

    Ruiz-Casado, M., Alfonseca, E., and Castells, P. “Automatising the Learning of Lexical Patterns: An application to the enrichment of WordNet by extracting semantic relationships from Wikipedia,” Data Knowledge Engineering. 61(3): 484-499, 2007.

    Suchanek, F.M., Kasneci, G., and Weikum, G. “YAGO: A Large Ontology From Wikipedia and WordNet,” Journal of Web Semantics. 6(3): 203-217, 2008.

    Tsuruoka, Y. and Tsujii, J. “Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data,” In Proc. of HLT/EMNLP. 2005

    Webb, G. “Learning Decision Lists by Prepending Inferred Rules,” In Proc. of the Australian Workshop on Machine Learning and Hybrid Systems. 1993.

    Wu, S-H and Su C-Y. “An Evaluation of Adopting Language Model as the Checker of Preposition Usage,” In Proc. of ROCLING, Hsinchu, Taiwan, 2006.

    Yarowsky, D. “Unsupervised Word Sense Disambiguation Rivaling Supervised Methods,” In Proc. of the 33rd annual meeting of the Association for Computational Linguistics, 1995.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE