研究生: |
駱孝倫 Luo, Siao-Lun |
---|---|
論文名稱: |
以文件內容架構為基礎之評論文件價值評估模式 A Comment Valuation Model Based on Comment Contents and Structure |
指導教授: |
侯建良
Hou, Jiang-Liang |
口試委員: |
黃雪玲
Sheue-Ling Hwang 吳建瑋 Chien-Wei Wu |
學位類別: |
碩士 Master |
系所名稱: |
工學院 - 工業工程與工程管理學系 Department of Industrial Engineering and Engineering Management |
論文出版年: | 2014 |
畢業學年度: | 102 |
語文別: | 中文 |
論文頁數: | 334 |
中文關鍵詞: | 評論文件 、資訊價值 、資訊視覺化 、粒子群演算法 |
外文關鍵詞: | comments, information value, visualization, Particle Swarm Optimization |
相關次數: | 點閱:1 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
當讀者欲對於其感興趣之知識有更深入之認識時,其往往透過各類網路管道發掘其感興趣之知識相關文件並逐一選讀之,再由其選讀之文件中吸收其認可或感興趣之內容,達成獲得相關知識之目的。然而,讀者透過各類管道所蒐集之相關知識文件的數量眾多且其內容繁雜,使讀者需花費大量時間逐一閱讀尋得之文件,才能由所有尋得之文件中整理並吸收相關知識。其次,目前讀者選擇閱讀文件之依據仍以其對於文件第一印象之主觀評價或來自其他讀者之主觀評價為主,此作法容易遺漏其他未閱讀文件中所傳達之重要資訊。
為解決上述問題,本研究乃發展一「以文件內容架構為基礎之評論文件價值評估」模式,為建構此價值評估模式,本研究乃先釐清評論文件關鍵特質之擷取方式,並根據所整理之擷取方式擷取評論文件之關鍵特質;之後,為建構評論文件關鍵特質與其對應資訊價值指標間之關係,本研究乃運用粒子群演算法之概念求解「評論文件關鍵特質對資訊價值表現影響推論」最佳化模型,以取得評論文件之最佳關鍵特質權重組合,並可依獲得之最佳關鍵特質權重組合建構評論文件關鍵特質對資訊價值表現之關係模式和對應之系統,以藉由視覺化方式呈現評論文件之資訊價值,協助讀者選讀文件群中之評論文件;最後,本研究乃以「Mobile01」網路論壇和「Yahoo!奇摩知識+」之討論文件為基礎進行系統之績效驗證,以藉由實際之評論文件測試所開發的評論文件價值評估系統,而由驗證結果可得知,本研究所開發之系統可有效地擷取評論文件之關鍵特質,並使讀者有效率的選讀符合其需求之評論文件。
Readers often search relevant articles via various search engines as they want to have better understanding about domain knowledge they are interested in. They can read through the articles to make sure if the articles meet their needs. After that, readers can absorb contents of the article they are interested in to acquire the domain knowledge. However, articles collected from multiple search engines are great in number and complicated and readers tend to rely on their first impressions on the articles or other subjective evaluations for article selection. Under such circumstances, the important information might be missed.
In order to solve the above problem, a comment valuation model based on comment content and structure is developed in this study. Firstly, key attributes of a comment are extracted and defined. After that, the Particle Swarm Optimization (PSO) algorithm is applied to solve the optimization model for the impact of key attributes on comment value evaluation in order to derive an optimal weight for the attributes. The derived weights can be further used to establish the relationship between key attributes and valuation indexes of comments. Values of a target comment can be calculated and visualized via the relationship model. Furthermore, real-world cases from “Mobile01” internet forum and “Yahoo! Knowledge+” are used to evaluate the feasibility and performance of the proposed methodology and platform. As a whole, the proposed methodology and platform can effectively and efficiently extract key attributes from comments in order to assist readers select the important comments from discussion threads.
1. Mobile01討論區,http://www.mobile01.com/。
2. Yahoo!奇摩知識+,https://tw.knowledge.yahoo.com/。
3. 陳永德,1997,「中文斷詞中「長詞優先」、「詞頻對比」、「前詞優先」規則之使用」,博士論文(指導教授:鄭昭明),國立台灣大學心理學系
4. 胡志偉、方文熙、李美綾,1994,「文章的易讀性指標」,國立台灣大學心理學系
5. El-Sherbiny, M., 2007, “A Combined Particle Swarm Optimization Algorithm Based on the Previous Global Best and the Global Best Positions,” International Journal of Computers and Information, Vol. 1, No. 1, pp. 13-26.
6. Stoffel, A., Spretke, D., Kinnemann, H. and Keim, D. A., 2010, "Enhancing document structure analysis using visual analytics," Proceedings of the 2010 ACM Symposium on Applied Computing, pp. 8-12.
7. Chang, F., Chu, S.-Y. and Chen, C.-Y., 2005, "Chinese document layout analysis using an adaptive regrouping strategy," Pattern Recognition, Vol. 38, No. 2, pp. 261-271.
8. Wang, Y., Phillips, I. T. and Haralick, R. M., 2006, "Document zone content classification and its performance evaluation," Pattern Recognition, Vol. 39, No. 1, pp. 57-73.
9. Mohemad, R., Hamdan, A.R., Othman, Z.A., Maizura, N. and Noor, M., 2011, "Automatic document structure analysis of structured PDF files," International Journal of New Computer Architectures and their Applications, Vol. 2, No. 1, pp. 404-411.
10. Gao, L., Tang, Z., Lin, X., Liu, Y., Qiu, R. and Wang, Y., 2011, "Structure extraction from PDF-based book documents," Proceedings of the 11th Annual International ACM/IEEE Joint Conference on Digital libraries, pp. 11-20.
11. Wang, C., Lu, J. and Zhang, G., 2007, "Mining key information of web pages: A method and its application," Expert Systems with Applications, Vol. 33, No. 2, pp. 425-433.
12. Collins, C., Carpendale, S. and Penn, G., 2009, "DocuBurst: Visualizing document content using language structure," Computer Graphics Forum, Vol. 28, No. 3, pp. 1039-1046.
13. Chau, M. and Chen, H., 2008, "A machine learning approach to web page filtering using content and structure analysis," Decision Support Systems, Vol. 44, No. 2, pp. 482-494.
14. Drieger, P., 2013, "Semantic network analysis as a method for visual text analytics," Procedia - Social and Behavioral Sciences, Vol. 79, pp. 4-17.
15. Qureshi, P. A. R. and Memon, N., 2012, "Hybrid model of content extraction," Journal of Computer and System Sciences, Vol 78, No 4, pp. 1248-1257.
16. Sauvagnat, K., Boughanem, M. and Chrisment, C., 2006, "Answering content and structure-based queries on XML documents using relevance propagation," Information Systems, Vol. 31, No. 7, pp. 621-635.
17. Zubrinic, K., Kalpic, D. and Milicevic, M., 2012, "The automatic creation of concept maps from documents written using morphologically rich languages," Expert Systems with Applications, Vol. 39, No. 16, pp. 12709-12718.
18. Klink, S. and Kieninger, T., 2001, "Rule-based document structure understanding with a fuzzy combination of layout and textual features," International Journal on Document Analysis and Recognition, Vol. 4, No. 1, pp. 18-26.
19. Maderlechner, G., Suda, P. and Brückner, T., 1997, "Classification of documents by form and content," Pattern Recognition Letters, Vol. 18, No. 11–13, pp. 1225-1231.
20. Tao, X, Tang, Z. and Xu, C., 2014, "Contextual modeling for logical labeling of PDF documents," Computers and Electrical Engineering, Vol. 40, No. 4, pp. 1363-1375.
21. Huang, S.-H., Ke, H.-R. and Yang, W.-P., 2008, "Structure clustering for Chinese patent documents," Expert Systems with Applications, Vol. 34, No. 4, pp. 2290-2297.
22. Siciarek, J and Wiszniewski, B., 2011, “IODA-an interactive open document architecture,” Procedia Computer Science, Vol. 4, pp. 668-677.
23. Daniłowicz, C. and Baliński, J., 2001, "Document ranking based upon Markov chains," Information Processing and Management, Vol. 37, No. 4, pp. 623-637.
24. Vechtomova, O., and Karamuftuoglu, M., 2008, "Lexical cohesion and term proximity in document ranking," Information Processing & Management, Vol. 44, No. 4, pp. 1485-1502.
25. Du, Y., and Hai, Y., 2013, "Semantic ranking of web pages based on formal concept analysis," Journal of Systems and Software, Vol. 86, No. 1, pp. 187-197.
26. Luk, R. W. P. and Chen, C. M., 2001, "Document ranking for variable bit-block compression signatures," Information Processing & Management, Vol 37, No1, pp. 39-51.
27. Derhami, V., Khodadadian, E., Ghasemzadeh, M. and Bidok, A.M.Z.,2013, “Applying reinforcement learning for web pages ranking algorithms,” Applied Soft Computing, Vol. 13, No. 4, pp. 1686-1692.
28. Silveira, M. L. and Ribeiro-Neto, B., 2004, "Concept-based ranking: A case study in the juridical domain," Information Processing & Management, Vol. 40, No. 5, pp. 791-805.
29. Ganu, G., Kakodkar, Y. and Marian, A., 2013, "Improving the quality of predictions using textual information in online user reviews," Information Systems, Vol. 38, No. 1, pp. 1-15.
30. You, G.W. and Hwang, S.W., 2008, "Search structures and algoritms for personalized ranking," Information Sciences, Vol. 178, No. 20, pp. 3925-3942.
31. Hassan, O. A., Al-Rousan, T., Taleb, A. A. and Maaita, A., 2014, “An efficient and scalable ranking technique for mashups involving RSS data sources,” Journal of Network and Computer Applications, Vol. 39, pp. 179-190.
32. Materne, A., and Sleightholme, G., 2014, “Methods of ranking search results for searches based on multiple search concepts carried out in multiple databases,” World Patent Information, Vol. 36, pp.4-15.
33. Zareh Bidoki, A. M., Ghodsnia, P., Yazdani, N. and Oroumchian, F., 2010, "A3CRank: An adaptive ranking method based on connectivity, content and click-through data," Information Processing & Management, Vol. 46, No. 2, pp. 159-169.
34. Wei, W., Barnaghi, P. and Bargiela, A., 2011, "Rational Research model for ranking semantic entities," Information Sciences, Vol. 181, No. 13, pp. 2823-2840.
35. Eirinaki, M., Pisal, S. and Singh, J., 2012, "Feature-based opinion mining and ranking," Journal of Computer and System Sciences, Vol. 78, No. 4, pp. 1175-1184.
36. Hu, N., Bose, I., Koh, N. S. and Liu, L., 2012, "Manipulation of online reviews: An analysis of ratings, readability, and sentiments," Decision Support Systems, Vol. 52, No. 3, pp. 674-684.
37. Agrawal, J., Sharma, N., Kumar, P., Parshav, V. and Goudar, R. H., 2013, "Ranking of searched documents using semantic technology," Procedia Engineering, Vol. 64, pp. 1-7.
38. Besiki, S., Corinne J. and Shuheng W., 2012, "Establishing the value of socially-created metadata to image indexing," Library & Information Science Research, Vol. 34, No. 2, pp. 99-109.
39. Kim, H.-N., Rawashdeh, M., Alghamdi, A. and El Saddik, A., 2012, "Folksonomy-based personalized search and ranking in social media services," Information Systems, Vol. 37, No. 1, pp. 61-76.
40. Balakrishnan, V. and Zhang, X., 2014, "Implicit user behaviours to improve post-retrieval document relevancy," Computers in Human Behavior, Vol. 33, pp. 104-112.
41. Van der Weide, T. P. and Van Bommel, P., 2006, "Measuring the incremental information value of documents," Information Sciences, Vol 176, No 2, pp. 91-119.
42. Serrano, J. I., Del Castillo, M. D. and Iglesias, Á., 2014, "Effects of text essay quality on readers’ working memory by a computational model," Biologically Inspired Cognitive Architectures, Vol. 7, pp. 39-52.
43. Luarn, P., Yang, J.-C. and Chiu, Y.-P., 2014, "The network effect on information dissemination on social network sites," Computers in Human Behavior, Vol. 37, pp. 1-8.
44. Picard, W., 2013, "A formalization of social requirements for human interactions with service protocols," Information Sciences, Vol. 238, pp. 1-21.
45. Dass, M., Reddy, S. K. and Iacobucci, D., 2014, "Social networks among auction bidders: The role of key bidders and structural properties on auction prices," Social Networks, Vol. 37, pp. 14-28.
46. Cengiz Turker, U. and Balcisoy, S., 2014, "A visualisation technique for large temporal social network datasets in hyperbolic space," Journal of Visual Languages & Computing, Vol. 25, No. 3, pp. 227-242.
47. Lee, C., Scherngell, T. and Barber, M. J., 2011, "Investigating an online social network using spatial interaction models," Social Networks, Vol. 33, No. 2, pp. 129-133.
48. Sharmeen, F., Arentze, T. and Timmermans, H., 2014, "Dynamics of face-to-face social interaction frequency: role of accessibility, urbanization, changes in geographical distance and path dependence," Journal of Transport Geography, Vol. 34, pp. 211-220.
49. Benevenuto, F., Rodrigues, T., Cha, M. and Almeida, V., 2012, "Characterizing user navigation and interactions in online social networks," Information Sciences, Vol. 195, pp. 1-24.
50. Zhao, X., Yuan, J., Li, G., Chen, X. and Li, Z., 2012, "Relationship strength estimation for online social networks with the study on Facebook," Neurocomputing, Vol. 95, pp. 89-97.
51. Sosik, V. S. and Bazarova, N. N., 2014, "Relational maintenance on social network sites: How Facebook communication predicts relational escalation," Computers in Human Behavior, Vol. 35, pp. 124-131.
52. Li, W. and Xu, H., 2014, "Text-based emotion classification using emotion cause extraction," Expert Systems with Applications, Vol. 41, No. 4, pp. 1742-1749.
53. Rao, Y., Li, Q., Mao, X. and Wenyin, L., 2014, "Sentiment topic models for social emotion mining," Information Sciences, Vol. 266, pp. 90-100.
54. Chung, J. E., 2013, "Social interaction in online support groups: Preference for online social interaction over offline social interaction," Computers in Human Behavior, Vol. 29, No. 4, pp. 1408-1414.
55. Chang, C. C., 2013, "Examining users' intention to continue using social network games: A flow experience perspective," Telematics and Informatics, Vol. 30, No. 4, pp. 311-321.
56. Xie, Y., Luo, B. and Xu, R., 2013, "The learning system of collective behavior in students’ social network," Computers & Electrical Engineering, Vol. 39, No. 8, pp. 2521-2530.
57. Abrahams, A. S., Jiao, J., Fan, W., Wang, G. A. and Zhang, Z., 2013, "What's buzzing in the blizzard of buzz? Automotive component isolation in social media postings," Decision Support Systems, Vol. 55, No. 4, pp. 871-882.
58. Du, J., Xu, H. and Huang, X., 2014, "Box office prediction based on microblog," Expert Systems with Applications, Vol. 41, No. 4, pp. 1680-1689.
59. Mowbray, O., 2014, "The moderating role of social networks in the relationship between alcohol consumption and treatment utilization for alcohol-related problems," Journal of Substance Abuse Treatment, Vol. 46, No. 5, pp. 597-601.