研究生: |
陳正晏 Chen, Cheng-Yen |
---|---|
論文名稱: |
Discovering Episode Evolution Relationships from News Corpora 從新聞文集中找出事件階段的演化關係 |
指導教授: |
魏志平
Wei, Chih-Ping |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
科技管理學院 - 科技管理研究所 Institute of Technology Management |
論文出版年: | 2009 |
畢業學年度: | 97 |
語文別: | 英文 |
論文頁數: | 40 |
中文關鍵詞: | 事件階段演化 、事件階段演化關係 、文件探勘 、字詞挑選 、文件表達 |
外文關鍵詞: | Episode Evolution Discovery, Text Mining, Feature Selection, Document Representation |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
Owing to the widespread of the Internet, it is easier to reach information through the Internet. When performing environment scanning, organizations typically deal with a numerous of episodes and events about their core business, relevant technique standards, competitors, and market, among many others, where each episode or event to monitor or track generally is associated with many news documents. To reduce such information overload and information fatigues when monitoring or tracking events, it is essential to develop an effective episode evolution discovery technique to organize all news documents pertaining to an event of interest into an episode evolution graph. In this thesis, we propose a new feature selection metric, referred to as TF□□2 and develop an episode evolution discovery technique that uses the TF□□2 metric as its feature selection method and TF□IDF as the document representation scheme. Using the traditional TF□IDF as the performance benchmark, our empirical evaluation results suggest that our proposed TF□□2 technique outperforms its benchmarks and demonstrates the utility of TF□□2 metric for discovering episode evolution relationships.
[1] Allan, J., Harding, S., Fisher, D., Bolivar, A., Guzman-Lara, S., and Amstutz, P., “Taking Topic Detection from Evaluation to Practice,” Proceedings of the Thirty-Eighth Annual Hawaii International Conference on System Sciences (HICSS), Big Island, Hawaii, January 2005.
[2] Allan, J., Topic Detection and Tracking: Event based Information Organization, Kluwer Academic Publishers, 2000.
[3] Allan, J., “Detection as Multi-Topic Tracking,” Information Retrieval, Vol. 5, No. 2-3, April 2002, pp.139-157.
[4] Allan, J., Carbonell, J., Doddington, G., Yamron, J., and Yang, Y., “Topic Detection and Tracking Pilot Study: Final Report,” Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, 1998.
[5] Allan, J., Papka, R., and Lavrenko, V., “On-line New Event Detection and tracking, ”Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 1998, pp.37-45.
[6] Chen, J. D., “Event Episode Discovery: An Enhanced Temporal-based Approach”, Unpublished Master Thesis, Institute of Technology Management, National Tsing Hua University, Taiwan, ROC, 2008.
[7] Kuo, J.J., and Chen, H.H., “Multidocument Summary Generation: Using Informative and Event Words,” ACM Transactions on Asian Language Information Processing, Vol. 7, No. 1, February 2008.
[8] Makkonen, J., “Investigations on Event Evolution in TDT,” Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Proceedings of the HLT-NAACL 2003 Student Research Workshop, Edmonton, Canada, 2003, pp.43-48.
[9] Nallapati, R., Feng, A., Peng, F., and Allen, J., “Event Threading within News Topics,” Proceedings of the Thirteenth ACM International Conference on Information and Knowledge Management, ACM: Washington, D.C., 2004
[10] Wang, T.Y., Chiang, H.M., “One-against-one Fuzzy Support Vector Machine Classifier: An Approach to Text Categorization,” Expert Systems with Application, Vol. 36, No. 6, August 2009, pp.10030-10034.
[11] Wei, C.P., “Accommodating Individual Preferences in the Categorization of Documents: A Personalized Clustering Approach” Journal of Management Information Systems (JMIS), Vol. 23, No. 2, Fall 2006, pp.173-201.
[12] Wei, C.P. and Chang, Y., “Discovering Event Evolution Patterns from Document Sequences,” IEEE Transactions on Systems, Man, and Cybernetics Part A: Systems and Humans, Vol. 37, No. 2, March 2007, pp.273-283.
[13] Wei, C., Lee, Y.H., Chiang, Y.S., Chen, J.D., and Yang, C.C., Discovering Event Episodes from News Corpora: A Temporal-based Approach,” Proceedings of 11th International Conference on Electronic Commerce, Taipei, Taiwan. August 2009.
[14] Wei, C., Lee, Y.H., and Hsiao, H.W., “A Text Mining Approach for Ontology Enrichment,” Proceedings of the Fourth Workshop on e-Business (Web 2005), Las Vegas, Nevada, December 2005.
[15] Yang, C.C., Shi, X. and Wei C.P., “Discovering Event Evolution Graphs from News Corpora”, IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, Vol. 39, No. 4, July 2009, pp.850-863.
[16] Yang, Y., Pierce, T., and Carbonell, J., “A Study on Retrospective and On-Line Event Detection,” Proceedings of 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 1998, pp.28-36.
[17] Yang, Y., Carbonell, J., Brown, R., T. Pierce, B. T. Archibald, and X. Liu, “Learning Approaches for Detecting and Tracking News Events,” IEEE Intelligent Systems, Vol. 14, No. 4, July/August 1999, pp.32-43.
[18] Yang, Y., Ault, T., Pierce, T., and Lattimer, C.W., “Improving Text Categorization Methods for Event Tracking,” Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece, 2000, pp.65-72.
[19] Yang, Y., Carbonell, J., Brown, R., Lafferty, J., Pierce, T., and Ault, T., “Multi-Strategy Learning for TDT,” Topic Detection and Tracking: Event-based Information Organization, J. Allan (Ed.), Kluwer Academic Publishers, Norvell, 2002, pp.85-114.
[20] Yang, Y., Zhang, J., Carbonell, J., and Jin, C., “Topic-conditioned Novelty Detection,” Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, Canada, 2002, pp.688-693.