簡易檢索 / 詳目顯示

研究生: 張俊欽
Chang Chun-Chin
論文名稱: 自動字義判析-網路資源為本的非督導式學習策略
Web-Based Unsupervised Method for Word Translation Disambiguation
指導教授: 張俊盛
Jason S. Chang
口試委員:
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2005
畢業學年度: 93
語文別: 英文
論文頁數: 48
中文關鍵詞: 字義歧義辨析配搭詞機器翻譯網路基本決定表
外文關鍵詞: Machine Translation, Collocation, Web, Decision List
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文提出一個利用網路資源,解決詞彙翻譯歧義的自動學習機制。我們的方法包含了訓練階段及測試階段。在訓練階段,首先取得歧義字的常見搭配詞,並且使用網路資決定歧義字在搭配詞的條件下,最可能的翻譯。最後將利用歧義字與這些搭配詞,查詢搜尋引擎,得到不同翻譯的訓練資料。之後再利用這些訓練資料,學習分辨翻譯歧義的決策表(decision list)。在測試階段,對包含歧義字之句子,檢查歧義字鄰近的搭配字,比對決策表,辨別其所歧義字之正確翻譯。
    我們隨機選取英國國家語料庫的例句做為測試資料,人工所標記四個崎義的翻譯。最後評估本論文作法以及現有機器翻譯系統的詞彙翻譯正確率。由實驗結果顯示,本文的作法可以在沒有人工標示的訓練資料的條件下,以網路為資料庫,獲得比現有機器翻譯系統更高的詞彙翻譯正確率。


    We introduce a new method for automatically disambiguation of word translations by using collocations. In our approach, we learn the the relationships between translation categories and collocations using the information on the Web.
    The method consists of a training stage and a runtime stage. During the training stage, the method involves automatically acquisition of collocates of target words from a large corpus, distinguishing of collocations into two or more parts by translations of a given word, and learning a translation decision list based on sentences with the target word and its collocates automatically acquired from the Web. At runtime, the target word in the given sentence is translated according to the decision list model.
    We also describe the implementation of a prototype system of the proposed method, experiments, and evaluation. In the experiment, we used four polysemous words to assess the performance of the method compare the results against judgments made by human subjects. Experimental results indicate that the proposed unsupervised method based on the Web as corpus overcomes the knowledge acquisition bottleneck and provides a promising approach for word translation disambiguation.

    致謝辭 i 摘要 ii ABSTRACT iii Table of Content iv List of Tables v List of Figures vi Chapter 1 Introduction 1 1.1 Background 1 1.2 Motivation 1 Chapter 2. Related work 5 Chapter 3 Web-Based Method 10 3.1 Problem Statement 10 3.2 Training the Web-based model 12 3.2.1 Automatically Acquisition of Collocates of Words 12 3.2.2 Classifying Collocations according to relevant translation 13 3.2.3 Gather Training Data for Each Translation from the Web 17 3.2.4 Unsupervised Learning Algorithm 18 3.3 Word Translation Disambiguation at Runtime 20 Chapter 4 Experiment and Evaluation 22 4.1 Experimental Setting 22 4.2 Evaluation and Discussion 25 4.2.1 The Result of Distinguishing Collocations 25 4.2.2 Experiment and Test Data 27 Chapter 5 Conclusion and Future work 30 5.1 Conclusion 30 5.2 Future work 31 References 32 Appendix A 38 tank 38 plant 40 bass 47 palm 47

    Brown, Peter F., Della Pietra, Stephen, Della Pietra, Vincent J., and Mercer Robert L. (1991), “Word sense disambiguation using statistical methods”, Proceedings of the 29th Annual Meeting of Association for Computational Linguistics, Berkeley, California, 264-270
    Byrd, Roy J.; Calzolari, Nicoletta; Chodorov; Martin S.; Klavans, Judith L.; Neff, Mary s.; and Rizk, Omneya (1987). “Tools and methods for computational linguistics.” Computational Linguistics, 13(3/4), 219-240
    Calzolari, Nicoletta (1984). “Detecting patterns in a lexical data base”, Proceedings of the 10th International Conference on Computational Linguistics, COLING'84, 2-6 July 1984, Stanford University, California, 170-173.
    Chodorow, Martin S., Byrd, Roy J., and Heidorn, George E. (1985), “Extracting semantic hierarchies from a large on-line dictionary”, Proceedings of the 23rd Annual Meeting of the Association for Computational Linguistics, 8-12 July 1985, University of Chicago, Chicago, Illinois, 299-304.
    Church, Kenneth W, and Mercer, Robert L (1993), “Introduction to the special issue on computational linguistics using large corpora”, Computational Linguistics, 19(1), 1-24.
    Cong Li and Hang Li (2002), “Word translation disambiguation using bilingual bootstrapping”, Proceedings of 40th Annual Meeting of the Association for Computational Linguistics. 343-351.
    Cowie, Jim, Guthrie, Joe A., and Guthrie, Louise (1992), “Lexical disambiguation using simulated annealing”, Proceedings of the 14th International Conference on Computational Linguistics, COLING'92, 23-28 August, Nantes, France, vol. 1, 359-365.
    Dagan, Ido, Marcus, Shaul, and Markovitch, Shaul (1993), “Contextual word similarity and estimation from sparse data”, Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, 22-26 June 1993, Columbus, Ohio.
    Dagan, Ido, Peireira, Fernando and Lee, Lilian (1994), “Similarity-based estimation of word co-occurrence probabilities”, Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, New Mexico, 272-278.
    Diab, Mona and Philip Resnik (2002), “An unsupervised method for word sense tagging using parallel corpora”, Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, July. 255-262
    Gale, William A., Church, Kenneth W., and Yarowsky, David (1992a), “Using bilingual materials to develop word sense disambiguation methods.” Proceedings of the International Conference on Theoretical and Methodological Issues in Machine Translation, 101-112.
    Gale, William A., Church, Kenneth W., and Yarowsky, David (1992b), “Estimating upper and lower bounds on the performance of word-sense disambiguation programs”, Proceedings of the 30th Annual Meeting of the Association for Computational Linguistics, 28 June-2 July 1992, University of Delaware, Newark, Delaware, 249-256.
    Gale, William A., Church, Kenneth W., and Yarowsky, David (1992c), “One sense per discourse.” Proceedings of the Speech and Natural Language Workshop, San Francisco, Morgan Kaufmann, 233-37.
    Gale, William A., Church, Kenneth W., and Yarowsky, David (1992d), “Work on statistical methods for word sense disambiguation”, Probabilistic Approaches to Natural Language: Papers from the 1992 AAAI Fall Symposium, 23-25 October 1992, Cambridge,Massachusetts, 54-60.
    Gale, William A., Church, Kenneth W., and Yarowsky, David (1993). “A method for disambiguating word senses in a large corpus”, Computers and the Humanities, 26, 415-439.
    Hearst, Marti A. (1991). “Noun homograph disambiguation using local context in large corpora.” Proceedings of the 7th Annual Conf. of the University of Waterloo Centre for the New OED and Text Research, Oxford, United Kingdom, 1-19.
    Karov Yael and S. Edelman (1998), “Similarity-based word sense disambiguation”. Computational Linguistics, 24:41–59
    Kilgarriff, Adam (1993). “Dictionary word sense distinctions: An enquiry into their nature.” Computers and the Humanities, 26, 365-387.
    Kilgarriff, A. and Grefenstette, G (2003), “Introduction to the Special Issue on the Web as Corpus.“ Computational Linguistics 29(3), 333-348
    Klavans, Judith; Chodorow, Martin; and Wacholder, Nina (1990), “From dictionary to knowledge base via taxonomy.” Proceedings of the 6th Conference of the UW Centre for the New OED, Waterloo, Canada, 110-132.
    Kucera, Henri and Francis, Winthrop N. (1967), Computational Analysis of Present-Day American English, Brown University Press, Providence.
    Lesk, Michael (1986). “Automated Sense Disambiguation Using Machine-readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone.” Proceedings of the 1986 SIGDOC Conference, Toronto, Canada, June 1986, 24-26.
    Liddy, Elisabeth D. and Paik, Woojin (1993). “Statistically-guided word sense disambiguation.” Proceedings of the AAAI Fall Symposium Series, 98-107.
    Markowitz, Judith; Ahlswede, Thomas; and Evens, Martha (1986). “Semantically significant patterns in dictionary definitions.” Proceedings of the 24th Annual Meeting of the Association for Computational Linguistics, New York, 112-119.
    Michiels, Archibal; Mullenders, Jacques; and No□l, Jacques (1980). “Exploiting a large database by Longman.” Proceedings of the 8th International Conference on Computational Linguistics, COLING’80, Tokyo, Japan, 374-382.
    Miller, George A.; Beckwith, Richard T. Fellbaum, Christiane D.; Gross, Derek; and Miller, Katherine J. (1990). “WordNet: An on-line lexical database.” International Journal of Lexicography, 3(4), 235-244.
    Miller, George A.; Chodorow, Martin; Landes, Shari; Leacock, Claudia; and Thomas, Robert G. (1994). “Using a semantic concordance for sense identification.” ARPA Workshop on Human Language Technology, Plainsboro, New Jersey, March 1994, 240-243.
    Nakamura, Jun-Ichi and Nagao, Makoto (1988). “Extraction of semantic information from an ordinary English dictionary and its evaluation.” Proceedings of the 12th International Conference on Computational Linguistics, COLING'88, 22-27 August 1988, Budapest, Hungary, 459-464.
    Ng, H. T., & Wang, B., & Chan, Y. S. (2003). “Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study.” In the Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics (ACL-03), 455-462
    Resnik, Philip (1992). “WordNet and distributional analysis: a class-based approach to statistical discovery.” AAAI Workshop on Statistically-based Natural Language Processing Techniques, San Jose, California, 48-56.
    Resnik, Philip (1995a). “Disambiguating Noun Groupings with Respect to WordNet Senses.” Proceedings of the Third Workshop on Very Large Corpora, Cambridge, Massachusetts, 54-68.
    Resnik, Philip (1995b). “Using information content to evaluate semantic similarity in a taxonomy.”, Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI’95. Montr□al, 448-53.
    Resnik, P. and N. Smith. (2002). “The Web as a parallel corpus.” Computational Linguistics 29(3), 349-380
    Richardson, Ray and Smeaton, Alan F. (1994) “Automatic word sense disambiguation in a KBIR application.” Working paper CA-0595, School of Computer Applications, Dublin City University, Dublin, Ireland.
    Slator, Brian M. (1992). “Sense and preference.” Computer and Mathematics with Applications, 23(6/9), 391-402.
    Sussna, Michael (1993). “Word sense disambiguation for free-text indexing using a massive semantic network.” Proceedings of the Second International Conference on Information and Knowledge Base Management, CIKM'93, Arlington, Virginia, 67-74.
    Voorhes, Ellen M. (1993). “Using WordNet to disambiguate word senses for text retrieval.” Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 27 June-1 July 1993, Pittsburgh, Pennsylvania, 171-180.
    Wilks, Yorick A.; Fass, Dan; Guo, Cheng-Ming; MacDonald, James E.; Plate, Tony; and Slator, Brian A. (1990). “Providing Machine Tractable Dictionary Tools.” In Pustejovsky, James (Ed.), Semantics and the Lexicon. MIT Press, Cambridge, Massachusetts.
    Yarowsky, David (1992). “Word sense disambiguation using statistical models of Roget's categories trained on large corpora.” Proceedings of the 14th International Conference on Computational Linguistics, COLING'92, 23-28 August, Nantes, France, 454-460.
    Yarowsky, David (1994a). “Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French.” Proceedings of the 32nd Annual Meeting ofthe Association for Computational Linguistics, Las Cruces, New Mexico, 88-95.
    Yarowsky, David (1995). “Unsupervised word sense disambiguation rivaling supervised methods.” Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics, 26-30 June 1995, Cambridge, Massachusetts, 189-196.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE