研究生: |
徐珮玲 Pei-Ling Hsu |
---|---|
論文名稱: |
Using Ontology to Map Categories in Blog 以Ontology比對部落格上的記事分類 |
指導教授: |
陳宜欣
Yi-Shin Chen |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊系統與應用研究所 Institute of Information Systems and Applications |
論文出版年: | 2006 |
畢業學年度: | 94 |
語文別: | 英文 |
論文頁數: | 43 |
中文關鍵詞: | Ontology 、Semantic Similarity 、Blog 、Category 、Classification |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
近年來有越來越多的使用者在網路上擁有自已的部落格。每位使用者在部落格中都有極大的自由去決定部落格的形式及文章的分類,這樣的自由也造成了分類不一致的現象。
為解決這樣的問題,本篇論文討論了許多現行方法,最後決定藉著
Ontology 的特性,找出內容相似的記事分類,以建立出一個專屬於
每一位使用者的記事分類推薦系統。
就目前所知,以往使用 Ontology 來比較語意上的相近(semantic
similarity),只能同時比較兩個詞(term / node)的相近程度。但在
這篇論文中,我們發展出新方法可以同時比較兩個群體的詞在
Ontology 中的語意相近程度。我們使用蕃薯藤的部落格來當作實驗
的主體,根據實驗結果顯示,這兩種方法不但可行而且成效顯著。
This paper proposes a framework to automatically map user-defined categories in blog. The proposed
framework is composed of a series of procedures including information extraction from
blog, building the personal ontology, and comparing semantic similarities between user-defined
categories. Our novel semantic similarity techniques can determine how similar two sets of information
concepts are, based on a given ontology. The experimental results demonstrated that
our framework and our proposed semantic similarity techniques are effective.
[1] Agirre, Eneko, Enrique Alfonseca, and Oier Lopez de Lacalle. Approximating hierarchybased
similarity for wordnet nominal synsets using topic signatures. Proceedings of the
Second Global WordNet Conference, pages 15–22, January 20-23 2004.
[2] E. Agirre and G. Rigau. Word sense disambiguation using conceptual density, 1996.
[3] Arul Prakash Asirvatham and Kranthi Kumar. Ravi. Web page categorization based on
document structure. SayYa-Publisher, 2000.
[4] B.Chandrasekaran. The ontology of tasks and methods. In Proceedings of the 11th Knowledge
Acquisition Modeling and Management Workshop, April 1998.
[5] A. Budanitsky. Semantic distance in wordnet: An experimental, application-oriented evaluation
of five measures, 2001.
[6] Diego Calvanese, Giuseppe De Giacomo, and Maurizio Lenzerini. Ontology of integration
and integration of ontologies. In Description Logics, 2001.
[7] B. Chandrasekaran, J. R. Josephson, and V. R. Benjaminse. What are ontologies, and why
do we need them? Intelligent Systems and Their Applications, IEEE [see also IEEE Intelligent
Systems, ((1)):20–26, 1999.
[8] Valerie Cross. Fuzzy semantic distance measures between ontological concepts. Fuzzy
Information Processing NAFIPS ’04 IEEE, 2004.
[9] Michal Cutler, Yungming Shih, and Weiyi Meng. Using the structure of html documents to
improve retrieval. In USENIX Symposium on Internet Technologies and Systems, 1997.
[10] Marc Ehrig and Steffen Staab. Efficiency of ontology mapping approaches. In International
Workshop on Semantic Intelligent Middleware for the Web and the Grid at ECAI 04,
Valencia and Spain, 8 2004.
[11] J. EUZENAT and P. VALTCHEV. An integrative proximity measure for ontology alignment.
[12] D.Fenseland I. Horrocks and F. Harmelenand D. McGuinnessand D. Patel-Schneider. Oil:
Ontology infrastructure to enable the semantic web, 2001.
[13] A. Hotho, A. Maedche, and S. Staab. Ontology-based text clustering.
[14] Jay J. Jiang and DavidW. Conrath. Semantic similarity based on corpus statistics and lexical
taxonomy. CoRR, cmp-lg/9709008, 1997.
[15] Yannis Kalfoglou and Marco Schorlemmer. Ontology mapping: The state of the art. In
Y. Kalfoglou, M. Schorlemmer, A. Sheth, S. Staab, and M. Uschold, editors, Semantic
Interoperability and Integration, number 04391 in Dagstuhl Seminar Proceedings. Internationales
Begegnungs- und Forschungszentrum (IBFI), Schloss Dagstuhl, Germany, 2005.
[16] Y. Liu, C. Wang, M. Zhang, and S. Ma. Finding ‘abstract fields ’of web pages and query
specific retrieval– thuir at trec 2004 web track–. Tsinghua University, 2004.
[17] Yiming Lu, Jian Hu, and Fanyuan Ma. Sjtu at trec2004: Web track experiments. TREC
2004, 2004.
[18] A. Maedche and S. Staab. Comparing ontologies— similarity measures and a comparison
study, 2001.
[19] A. Maedche and S. Staab. Measuring similarity between ontologies, 2002.
[20] G. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. Introduction to wordnet: An
on-line lexical database. Lexicography, (4):235–244, 1990.
[21] George A. Miller. Wordnet: A lexical database for english. Commun. ACM, 38(11):39–41,
1995.
[22] George A Miller 1990: Miller. Nouns in wordnet: a lexical inheritance system. In: International
Journal of Lexicography, (4), 1990.
[23] Saikat Mukherjee, Guizhen Yang, and I. V. Ramakrishnan. Automatic annotation of
content-rich html documents: Structural and semantic analysis. In International Semantic
Web Conference, pages 533–549, 2003.
[24] Gregory B. Newby. Information space based on html structure. In TREC, 2000.
[25] Sanguk Noh, Haesung Seo, Jaehyuk Choi, Kyunghee Choi, and Gihyun Jung. Classifying
web pages using adaptive ontology. Proceedings of the IEEE International Conference on
Systems, pages 2144–2149, October 2003.
[26] Kieron O’Hara. Knowledge representation with ontologies: The present and future.
Christopher Brewster, (1):72–81, January/February 2004.
[27] Philip Resnik. Semantic classes and syntactic ambiguity, 1993.
[28] R. Richardson, A.F. Smeaton, and J. Murphy. Using wordnet as a knowledge base for
measuring semantic similarity between words. In Proc. AICS Conference, Trinity College,
Dublin, September 1994.
[29] Ray Richardson and Alan F. Smeaton. Using WordNet in a knowledge-based approach to
information retrieval. Technical Report CA-0395, Dublin, Ireland, 1995.
[30] M. Andrea Rodr´ıguez and Max J. Egenhofer. Determining semantic similarity among entity
classes from different ontologies. IEEE Trans. Knowl. Data Eng., 15(2):442–456, 2003.
[31] Rosso, Paolo, Edgardo Ferretti, Daniel Jimenez, and Vicente Vidal. Text categorization and
information retrieval using wordnet senses. In: Proceedings of the Second Global WordNet
Conference, pages 299–304, January 20-23 2004.
[32] J. Sauvola and M. Pietikainen. Page segmentation and classification using fast feature extraction
and connectivity analysis, 1995.
[33] Hedda R. Schmidtke, Pinto H. Sofia, Asuncion Gomez-Perez, and Joao P. Martins. Some
issues on ontology integration. Proceedings of the IJCAI’99 Workshop on Ontology and
Problem-Solving Methods: Lesson Learned and Future Trends, pages 7–1 – 7–12, 1999.
[34] L. K. Shih and David R. Karger. Using urls and table layout for web classification tasks. In
WWW, pages 193–202, 2004.
[35] Michael Sussna. Word sense disambiguation for free-text indexing using a massive semantic
network. In CIKM 93, Proceedings of the Second International Conference on Information
and Knowledge Management, Washington, DC, USA, November 1-5, pages 67–74,
1993.
[36] P. Visser. A comparison of two legal ontologies, 1997.
[37] James Z. Wang and Farha Ali. An efficient ontology comparison tool for semantic web
applications. In 2005 IEEE / WIC / ACM International Conference on Web Intelligence (WI
2005), 19-22 September 2005, Compiegne, France, pages 372–378, 2005.
[38] James Z. Wang, Farha Ali, and Rashmy Appaneravanda. A web service for efficient ontology
comparison. In 2005 IEEE International Conference on Web Services (ICWS 2005,
pages 843–844, 2005.
[39] Min Zhang, Ruihua Song, and Shaoping Ma. Df or idf? on the use of html primary feature
fields for web ir. In WWW (Posters), 2003.