Boosting the Accuracy of a Chinese Factoid Question Answering System with Hybrid Modules and Lightweight Methods

簡易檢索 / 詳目顯示

回結果列表

研究生：	李政緯 Lee, Cheng-Wei
論文名稱：	Boosting the Accuracy of a Chinese Factoid Question Answering System with Hybrid Modules and Lightweight Methods 以混和方法模組與輕量級方法建構高正確率中文專名問答系統
指導教授：	許聞廉 Hsu, Wen-Lian
口試委員:
學位類別：	博士 Doctor
系所名稱：	電機資訊學院 - 資訊工程學系 Computer Science
論文出版年：	2009
畢業學年度：	97
語文別：	英文
論文頁數：	102
中文關鍵詞：	問答系統、混和方法、輕量級方法、問題分類、答案過濾、答案排序
外文關鍵詞：	Question Answering, Hybrid Method, Lightweight Method, Question Classification, Answer Filtering, Answer Ranking
相關次數：	點閱：1 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

在資訊爆炸的時代，好的資訊搜尋技術幾乎等同於效率的代名詞。在眾多資訊搜尋技術中，專名問答(Factoid Question Answering)技術漸漸在學術圈內受到重視。從1999年開始，TREC、CLEF以及NTCIR等國際型評鑑會議開始興起，研究範圍涵蓋多種語言。中文雖然使用族群眾多，但在相關的研究上似乎仍與英文等主要語言有一段差距。在這一篇論文中，我們將探討此問題，尋求低成本可行的方式來提升中文專名問答系統的表現。我們將研究方向集中在兩個議題上：
(1) 使用整合式技術，整合以知識為基礎的方法與以機器學習為基礎的方法
大多數的問答系統相關研究都是單獨集中在以知識為基礎的方法，或是以機器學習為基礎的方法上，少有整合式技術在重要會議或期刊發表。為了填補此研究空缺，並驗證整合式技術對於實際應用的效益，我們選擇了問題分類器作為研究對象，分別以知識為基礎、以機器學習為基礎、以及使用整合式技術，開發了三個問題分類器。我們將幾組具備「異質」與「未見」特性的問題集，分別應用到這三個分類器。我們發現至少在這個受控制的實驗中，以知識為基礎的分類器表現優於以機器學習為基礎的分類器，同時整合式的分類器又優於以知識為基礎的分類器。驗證了知識與整合式技術的實用性。
(2) 輕量級問答系統技術
目前世上表現好的專名問答系統多少都會採用如句法剖析器、邏輯推理器等複雜費時的處理技術。這類型的技術雖有好處，但卻不適用在資源缺乏的語言或資源限制多的環境中。有鑑於此，我們試圖尋找有用的輕量級技術。我們提出了兩個輕量級問答系統技術，分別是「問題與答案關鍵詞共現總和法」(SCO-QAT, Sum of Co-occurrences of Question and Answer Terms)以及「以對齊法產生的表層模版」(ABSPs, Alignment-based Surface Patterns)。相較於其他以共現法為基礎的作法，SCO-QAT不需要額外的知識資料、不需要導入剔除規則供找不到共現頻率時之用、也不需要額外的工具支援，所有共現頻率資料都直接根據文句擷取模組所回傳的文句來計算。ABSPs則是使用一組根據問題答案集所訓練出來的文句模版，用來捕捉文句中問題關鍵字與答案的關係，計算信心分數用來過濾出可靠的答案。
經過測試，這兩個輕量級技術成功地在測試平台上將針對NTCIR-5資料集的RU正確率從0.445提升到0.535。同時在NTCIR-6資料集上也有0.5 RU正確率的好表現。

Factoid Question Answering (QA) is becoming an increasingly important research area in natural language processing. Since 1999, many international question answering contests have been held at conferences and workshops, such as TREC, CLEF, and NTCIR; and several languages have been tested in monolingual or cross-lingual question answering tasks. Although Chinese is growing in popularity worldwide, there seems to be a performance gap between Chinese question answering systems and some systems used for other languages. In this dissertation, our objective is to improve the performance of Chinese Factoid Question Answering systems. To this end, we investigate in the following two concepts.
(1) Hybrid Modules Comprised of Knowledge-based and Machine Learning based Methods
To date, most research on QA modules has focused on knowledge-based or machine learning based methods, possibly because hybrid methods are costly that both the knowledge-based and machine learning-based methods need to be adjusted, and it necessary to find an appropriate way to combine the methods in a hybrid model. To demonstrate the effect of hybrid modules, we developed a hybrid question classifier and used it to conduct a series of empirical experiments. Specifically, we compared the performances of the knowledge-based classifier, the machine learning based classifier and the hybrid classifier on several heterogeneous unseen questions from various sources. The results showed that the knowledge-based question classifier was more accurate than the machine learning-based classifier, but the proposed hybrid classifier achieved the highest accuracy.
(2) Lightweight Question Answering Methods
Nearly all the top performing systems use heavy methods that require sophisticated techniques, such as parsers or logic provers. However, such techniques are usually unavailable or unaffordable for under-resourced languages or in resource-limited situations. In contrast to state-of-the-art QA systems, we improve a top performing Chinese QA system by using lightweight methods effectively. We propose two lightweight methods, namely the Sum of Co-occurrences of Question and Answer Terms (SCO-QAT) and Alignment-based Surface Patterns (ABSPs). SCO-QAT is a co-occurrence-based answer ranking method that does not need extra knowledge, word-ignoring heuristic rules, or tools. It simply calculates co-occurrence scores based on the passage retrieval results. ABSPs are syntactic patterns trained from question-answer pairs with an alignment algorithm. They are used to capture the relations between terms; and the relations are used to filter answers. We attribute the success of the ABSP and SCO-QAT methods to the effective use of local syntactic information and global co-occurrence information.
By using SCO-QAT and ABSPs, we improved the RU-Accuracy of our testbed QA system, ASQA, from 0.445 to 0.535 on the NTCIR-5 dataset. The system also achieved the top 0.5 RU-Accuracy on the NTCIR-6 dataset. The result shows that lightweight methods are not only less expensive to implement, but also have the potential to achieve state-of-the-art performances.

中文摘要    iii
ABSTRACT    v
Chapter 1    INTRODUCTION    1
Chapter 2    Related Work    8
1.    Chinese Question Answering Systems    8
2.    Question Classification    10
3.    QA with Surface Patterns    11
4.    QA with Co-occurrence Information    13
Chapter 3    The Host QA System: ASQA    15
1.    InfoMap-A Knowledge Representation and Matching Engine    16
1.1.    InfoMap Framework and Knowledge Representation    17
1.2.    InfoMap Applications    20
2.    System Modules for Chinese QA    21
2.1.    Question Processing    21
2.2.    Passage Retrieval    29
2.3.    Answer Extraction    32
2.4.    Answer Ranking    34
3.    System Modules for English-Chinese Cross-Lingual QA    35
3.1.    English Question Classification    35
Chapter 4    Proposed Shallow Methods    38
1.    ABSPs - Alignment-Based Surface Patterns    38
1.1.    The Alignment Algorithm    38
1.2.    ABSP Generation    40
1.3.    ABSPs Selection    42
1.4.    Relation Extraction and Score Calculation    43
2.    SCO-QAT: Sum of Co-occurrences of Question and Answer Terms    46
3.    Enhancing SCO-QAT with Distance Information    48
Chapter 5    Evaluation Setup    51
1.    Question Classification Datasets    51
2.    Question Classification Evaluation Metrics    52
3.    Question Answering Datasets    53
4.    Question Answering Evaluation Metrics    55
5.    Variable Dependencies    57
Chapter 6    Experiments    59
1.    An Empirical Study of Question Classifiers    59
2.    Monolingual Experiments    62
2.1.    Comparing SCO-QAT with Other Single Ranking Features    63
2.2.    Enhancing SCO-QAT with Distance Information    66
2.3.    ABSP-based answer filter    67
3.    Cross-Lingual Experiments    69
3.1.    Single Shallow Features    69
3.2.    Influence of Machine Translation Quality    75
3.3.    Influence of Passage Quality Introduced by Deep Passages    77
3.4.    Influence of Answer Quality    80
Chapter 7    Discussion    86
Chapter 8    Conclusions and Future Work    92
Acknowledgments    95
APPENDIX A: ABSPs Used in ASQA at NTCIR-6 CLQA    96
APPENDIX B: Stop Word List for SCO-QAT    98
Bibliography    99

                                

BARZILAY, R. AND LEE, L. 2003. Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment. In Proceedings of HLT-NAACL, 16-23.
BOUMA, G., MUR, J. AND NOORD, G.V. 2005. Reasoning over dependency relations for QA. In Proceedings of the IJCAI workshop on Knowledge and Reasoning for Answering Questions, 15-21.
CHEUNG, Z., PHAN, K.L., MAHIDADIA, A. AND HOFFMANN, A. 2004. Feature Extraction for Learning to Classify Questions. Proceedings of Advances in Artificial Intelligence (AI 2004), 1069-1075.
CLARKE, C.L.A., CORMACK, G., KEMKES, G., LASZLO, M., LYNAM, T., TERRA, E. AND TILKER, P. 2002. Statistical Selection of Exact Answers (MultiText Experiments for TREC 2002). In Proc. of TREC, 823–831.
CLARKE, C.L.A., CORMACK, G.V. AND LYNAM, T.R. 2001. Exploiting redundancy in question answering. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, 358-365.
COOPER, R.J. AND RUGER, S.M. 2000. A Simple Question Answering System. In Proc. of TREC.
CUI, H., SUN, R., LI, K., KAN, M.-Y. AND CHUA, T.-S. 2005. Question answering passage retrieval using dependency relations. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, Salvador, Brazil, 400-407.
DAY, M.-Y., LU, C.-H., ONG, C.-S., WU, S.-H. AND HSU, W.-L. 2006. Integrating Genetic Algorithms with Conditional Random Fields to Enhance Question Informer Prediction. In Proceedings of the the IEEE International Conference on Information Reuse and Integration (IEEE IRI 2006), Waikoloa, Hawaii, USA, Sep 16-18 2006, 414-419.
DAY, M.-Y., LU, C.-H., YANG, J.-T.D., CHIOU, G.-F., ONG, C.-S. AND HSU, W.-L. 2005. Designing an Ontology-based Intelligent Tutoring Agent with Instant Messaging. In Proceedings of ICALT2005.
DAY, M.-Y., ONG, C.-S. AND HSU, W.-L. 2007. Question Classification in English-Chinese Cross-Language Question Answering: An Integrated Genetic Algorithm and Machine Learning Approach. In Proceedings of the the IEEE International Conference on Information Reuse and Integration (IEEE IRI 2007), Las Vegas, Nevada, USA, August, 13-15 2007, 203-208.
DAY, M.-Y., TSAI, T.-H., SUNG, C.-L., LEE, C.-W., WU, S.-H., ONG, C.-S. AND HSU, W.-L. 2005. A Knowledge-based Approach to Citation Extraction. In Proceedings of IEEE IRI.
DAY, M.Y., LU, C.H., ONG, C.S., WU, S.H. AND HSU, W.L. 2006. Integrating Genetic Algorithms with Conditional Random Fields to Enhance Question Informer Prediction. In Proceedings of the IEEE International Conference on Information Reuse and Integration (IEEE IRI 2006), Waikoloa, Hawaii, USA, 414-419.
GANG, Z., TING, L., SHI-FU, Z., WAN-XIANG, C., BING, Q. AND SHENG, L. 2001. Research on Open-domain Chinese Question-Answering System. In 20th Annual Meeting of the Chinese Information Processing Society of China.
GUO, Y. 2004. Chinese Question Answering with Full-Text Retrieval Re-Visited Waterloo.
HACIOGLU, K. AND WARD, W. 2003. Question Classification with Support Vector Machines and Error Correcting Codes. In Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (HLT-NAACL), Edmonton, Canada, 28-30.
HARABAGIU, S., MOLDOVAN, D., CLARK, C., BOWDEN, M., HICKL, A. AND WANG, P. 2005. Employing Two Question Answering Systems in TREC 2005. In Proceedings of the Fourteenth Text REtrieval Conference.
HSU, W.-L., WU, S.-H. AND CHEN, Y.-S. 2001. Event identification based on the information map-INFOMAP. In IEEE International Conference on Systems, Man, and Cybernetics, Tucson, AZ, USA, 1661-1666.
HSU, W.-L., WU, S.-H. AND CHEN, Y.-S. 2001. Event Identification Based on the Information Map - INFOMAP. In IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE).
HUANG, M., ZHU, X., HAO, Y., PAYAN, D.G., QU, K. AND LI, M. 2004. Discovering patterns to extract protein - protein interactions from full texts. Bioinformatics 20, 3604-3612.
KRISHNAN, V., DAS, S. AND CHAKRABARTI, S. 2005. Enhanced Answer Type Inference from Questions using Sequential Models. Proceedings of HLT/EMNLP, 315–322.
KWOK, K.-L. AND DENG, P. 2006. Chinese Question-Answering:Comparing Monolingual with English-Chinese Cross-Lingual Results. In Asia Information Retrieval Symposium, 244-257.
LAURENT, D., S GU LA, P. AND N GRE, S. 2006. Cross Lingual Question Answer ing using QRISTAL for CLEF 2006. In CLEF.
LEE, C.-W., DAY, M.-Y., SUNG, C.-L., LEE, Y.-H., JIANG, T.-J., WU, C.-W., SHIH, C.-W., CHEN, Y.-R. AND HSU, W.-L. 2007. Chinese-Chinese and English-Chinese Question Answering with ASQA at NTCIR-6 CLQA. In Proceedings of NTCIR-6 Workshop, Tokyo, Japan, 175-181.
LEE, C.-W., SHIH, C.-W., DAY, M.-Y., TSAI, T.-H., JIANG, T.-J., WU, C.-W., SUNG, C.-L., CHEN, Y.-R., WU, S.-H. AND HSU, W.-L. 2005. ASQA: Academia Sinica Question Answering System for NTCIR-5 CLQA. In NTCIR.
LEE, C.W., SHIH, C.W., DAY, M.Y., TSAI, T.H., JIANG, T.J., WU, C.W., SUNG, C.L., CHEN, Y.R., WU, S.H. AND HSU, W.L. 2005. ASQA: Academia Sinica Question Answering System for NTCIR-5 CLQA.
LI, X. AND ROTH, D. 2002. Learning Question Classifiers. In International Conference on Computational Linguistics, Taipei, Taiwan, 1-7.
LIN, C.-J. 2004. A Study on Chinese Open-Domain Question Answering Systems. In Department of Computer Science and Information Engineering National Taiwan University.
LIN, F., SHIMA, H., WANG, M. AND MITAMURA, T. 2005. CMU JAVELIN System for NTCIR5 CLQA1. In Proceedings of the 5th NTCIR Workshop.
LIN, J. 2005. Evaluation of resources for question answering evaluation. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, 392-399.
LIN, S.-J., SHIA, M.-S., LIN, K.-H., LIN, J.-H., YU, S. AND LU, W.-H. 2005. Improving answer ranking using cohesion between answer and keywords. In NTCIR Workshop, 2005.
MAGNINI, B., PREVETE, M.N.R. AND TANEV, H. 2001. Is it the right answer?: exploiting web redundancy for Answer Validation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, 425-432.
MENG, I.H. AND YANG, W.P. 2003. The design and implementation of chinese question and answering system. In Computational Science and Its Applications - Iccsa 2003, Pt 1, Proceedings, 601-613.
METZLER, D. AND CROFT, W.B. 2005. Analysis of Statistical Question Classification for Fact-Based Questions. Information Retrieval 8, 481-504.
MOLDOVAN, D., PACA, M., HARABAGIU, S. AND SURDEANU, M. 2003. Performance issues and error analysis in an open-domain question answering system. ACM Transactions on Information System 21, 133-154.
MOLLA, D. AND GARDINER, M. 2005. AnswerFinder — Question Answering by Combining Lexical, Syntactic and Semantic Information. In Australasian Language Technology Workshop (ALTW) 2004.
MUSLEA, I. 1999. Extraction Patterns for Information Extraction Tasks: A Survey. In Workshop on Machine Learning for Information Extraction, Orlando.
QIN, B., LIU, T., WANG, Y., ZHENG, S.-F. AND LI, S. 2003. Chinese Question Answering System Based on Frequently Asked Questions. Journal of Harbin Institute of Technology.
RAVICHANDRAN, D. AND HOVY, E. 2001. Learning surface text patterns for a Question Answering system. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, Philadelphia, Pennsylvania, 41-47.
RAVICHANDRAN, D. AND HOVY, E. 2002. Learning Surface Text Patterns for a Question Answering System. Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, 41-47.
RENNIE, J.D.M. AND RIFKIN, R. 2001. Improving multiclass text classification with the support vector machine. Online at:[http://www. ai. mit. edu/…] Available: May 23, 2002.
ROTH, D., CUMBY, C., LI, X., MORIE, P., NAGARAJAN, R., RIZZOLO, N., SMALL, K. AND YIH, W.T. 2003. Question-Answering via Enhanced Understanding of Questions. NIST SPECIAL PUBLICATION SP, 667-685.
SAIZ-NOEDA, M., SU´AREZ, A. AND PALOMAR, M. 2001. Semantic Pattern Learning Through Maximum Entropy-based WSD Technique. In Computational Natural Language Learning (CoNLL-2001), Toulouse, France.
SASAKI, Y., CHEN, H.-H., CHEN, K.-H. AND LIN, C.-J. 2005. Overview of the NTCIR-5 Cross-Lingual Question Answering Task. In Proceedings of the 5th NTCIR Workshop Meeting, Tokyo, Japan, 175-185.
SEBASTIANI, F. 2002. Machine learning in automated text categorization. Journal of ACM Computing Survey 34, 1-47.
SMITH, T.F. AND WATERNMAN, M.S. 1981. Identification of Common Molecular Subsequences. Journal of Molecular Biology 147, 195-197.
SOUBBOTIN, M.M. AND SOUBBOTIN, S.M. 2001. Patterns of Potential Answer Expressions as Clues to the Right Answers. Proceedings of the Tenth Text REtrieval Conference (TREC 2001).
STAAB, S., ERDMANN, M. AND MAEDCHE, A. 2001. Engineering Ontologies using Semantic Patterns. In Proceedings of the IJCAI-2001 Workshop on E-Business & Intelligent Web, Seattle.
SUZUKI, J., TAIRA, H., SASAKI, Y. AND MAEDA, E. 2003. Question classification using HDAG kernel Association for Computational Linguistics Morristown, NJ, USA, 61-68.
TAKAHASHI, T., NAWATA, K., INUI, K. AND MATSUMOTO, Y. 2004. NAIST QA System for QAC2. In NTCIR-4, Tokyo.
TSAI, T.-H., WU, S.-H., LEE, C.-W., SHIH, C.-W. AND HSU, W.-L. 2004. Mencius: A Chinese Named Entity Recognizer Using Maximum Entropy-based Hybrid Model. Computational Linguistics & Chinese Language Processing 9, 65-82.
VAPNIK, V.N. 1995. The Nature of Statistical Learning Theory. Springer.
WONG, W.-K., HSU, S.-C., WU, S.-H., LEE, C.-W. AND HSU, W.-L. 2007. LIM-G: Learner-initiating Instruction Model based on Cognitive Knowledge for Geometry Word Problem Comprehension. Computers and Education, 582-601.
WU, C.-H., YEH, J.-F. AND CHEN, M.-J. 2005. Domain-specific FAQ Retrieval Using Independent Aspects. ACM Transaction on Asian Language Information Processing 4, 1-17.
WU, C.-W., JAN, S.-Y., TSAI, R.T.-H. AND HSU, W.-L. 2006. On Using Ensemble Methods for Chinese Named Entity Recognition. In Proceeding of SIGHAN Workshop.
WU, S.-H., DAY, M.-Y. AND HSU, W.-L. 2001. FAQ-centered Qrganizational Memory. In Knowledge Management and Organizational Momery workshop on the Seventeenth International Joint Conference on Artificial Intelligence, 112-120.
ZHANG, D. AND LEE, W.S. 2003. Question Classification Using Support Vector Machines. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, Toronto, Canada, 26-32.
ZHAO, Y., XU, Z.M., GUAN, Y. AND LI, P. 2005. Insun05QA on QA track of TREC2005. In TREC.
ZHENG, Z. 2002. AnswerBus Question Answering System. Human Language Technology Conference, 24-27.

全文公開日期本全文未授權公開 (校內網路)
全文公開日期本全文未授權公開 (校外網路)

簡易檢索 / 詳目顯示

相關論文