研究生: |
許芙瑲 Fu-Chiang Hsu |
---|---|
論文名稱: |
智慧型專利文件分析-以群集及分類方法為基 Intelligent Patent Document Analysis Based on Clustering and Categorization Methods |
指導教授: |
張瑞芬
Amy J.C. Trappey 張力元 Charles V. Trappey |
口試委員: | |
學位類別: |
博士 Doctor |
系所名稱: |
工學院 - 工業工程與工程管理學系 Department of Industrial Engineering and Engineering Management |
論文出版年: | 2006 |
畢業學年度: | 94 |
語文別: | 英文 |
論文頁數: | 105 |
中文關鍵詞: | 專利分析 、資料探勘 、專利地圖分析 、專利群集分析 、專利技術群集分析 、專利技術成熟度 |
外文關鍵詞: | Patent analysis, Data mining, Patent map analysis, Patent technology clustering, Patent document clustering, Technology maturity measurement |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在智慧財產逐漸受到重視的今日的,如何善用專利資訊以瞭解技術發展現狀並據此提升研發效率,為企業研發乃至未來產品上市能否成功的關鍵之一,擁有完善的專利管理與分析制度,將可輔助企業在創新研發上獲得支援,除可協助決策階層擬定研發策略外,更能發掘既有專利分佈,以避免因侵犯他人專利而可能遭受之鉅額損失。因此,本研究以智慧型專利文件分析為題,發展以群集及分類方法為基之專利文件分析系統,以資料探勘之技術,將專利資訊作有效之運用,也由於專利文件所提供之法律與技術揭露特性,透過專利分析也將可協助企業釐清特定技術發展現狀,並瞭解主要競爭對手分佈,由此提升企業競爭優勢。
本研究提出專利知識之擷取與專利分析之方法論,包含專利地圖分析、專利技術群集分析、專利文件群集分析、專利技術成熟度評估、與專利自動分類系統,期望透過這些分析輔助,協助企業提升專利分析效率,並提供企業擬定未來研究發展策略之輔助。此外,本研究也將依據以上所提出之方法論,建置一套整合性專利分析系統,並以動力手工具分析與無線射頻(RFID)分析為案例,探討以上分析流程,期能為企業在專利管理上帶來助益。
With the fast pace of technology development and the global nature of competitors in the marketplace, patent management has become an important issue for R&D knowledge management. In this thesis, we develop an integrated framework based on data mining techniques to help companies manage patent documents automatically and effectively. Since patents provide exclusive rights and legal protection for patent inventors, these documents play an important role in the development of technology. Through patent analysis, the companies determine the state of technology development and the degree of competition in the market.
This thesis proposes the process of patent knowledge extraction and methodologies of patent analysis to improve the efficiency of patent analysis. Furthermore, the methodologies proposed in this thesis include patent map analysis, patent technology clustering, patent document clustering and technology maturity measurement. Through these methodologies, companies derive rich information and achieve a better patent management. Moreover, the strategic plans of R&D can also be developed with the result of methodologies proposed in this thesis. In this research, the prototype is implemented and patents related to designs of innovative power hand-tools and radio frequency identification technologies are used to demonstrate the results of proposed framework.
1. Aizawa, A., 2003, “An information-theoretic perspective of tf–idf measures,” Information Processing and Management, Vol. 39, pp. 45-65.
2. Andersen, B., 1998, “The evolution of technological trajectories 1890-1990,” Structural Change and Economic Dynamics, Vol. 9, pp. 5-34.
3. Antonie, M.L., and Zaiane, O.R., 2002, “Text document categorization by term association,” Proceedings, IEEE International Conference on Data Mining, pp.19-26.
4. Bebenham, J., 1998, Knowledge Engineering, Springer-Verlag, Berline.
5. Berry, M., and Linoff, G., 1997, Data Mining Techniques: For Marketing, Sales, and Customer Support, John Wiley & Sons, Inc., NY.
6. Breschi, S., Lissoni, F., and Malerba, F., 2003, “Knowledge-relatedness in firm technological diversification,” Research Policy, Vol. 32, pp. 69-87.
7. Bryson, M. J., 2004, Strategic Planning for Public and Nonprofit Organizations: A Guide to Strengthening and Sustaining Organizational Achievement, Jossey-Bass, San Francisco.
8. Cantwell, J. and Vertova, G., 2004, “Historical evolution of technological diversification,” Research Policy, Vol. 33, pp. 511-529.
9. Chakrabarti, S., Dom, B., Agrawal, R., and Raghavan, P., 1998, “Scalable feature selection, classification and signature generation for organizing large text databases into hierarchical topic taxonomies,” The International Journal on Very Large Data Bases, Vol. 7, No. 3, pp. 163-178.
10. Chen, H, Schuffels, C., and Orwig, R., 1996, “Internet categorization and search: A self-organizing approach,” Journal of Visual Communication and Image Representation, Vol. 7, No. 1, pp 88-102.
11. Chen, Z., 2001, Data Mining and Uncertain Reasoning: An Integrated Approach, John Wiley & Sons, Inc., NY.
12. Cheng, C.H., Motwani, J., Kumar, A. and Jiang, J., 1997, “Clustering cases for case-based reasoning systems,” Journal of Computer Information Systems, Vol. 38, No. 1, pp. 30-37.
13. Chiang, J., and Chen, Y., 2001, “Hierarchical fuzzy-knn networks for news documents categorization,” Proceedings, the 10th IEEE International Conference on Fuzzy Systems, Vol. 2, pp. 720-723.
14. Deng, W., and Wu, W., 2001, “Document categorization and retrieval using semantic microfeatures and growing cell structures,” Proceeding of the 12th International Workshop on Database and Expert Systems Applications, pp. 270-274.
15. Dou, H.J.M., 2004, “Benchmarking R&D and companies through patent analysis using free databases and special software: A tool to improve innovative thinking,” World Patent Information, Vol. 26, No. 4, pp. 297-309.
16. Fall, C.J., Törcsvári, A., Benzineb, K. and Karetka, G., 2003, “Automated Categorization in the International Patent Classification,” ACM SIGIR Forum, Vol. 37, No. 1, pp. 10-25.
17. Fall, C.J., Törcsvári, A., Fiévet, P. and Karetka, G., 2004, “Automated categorization of German-language patent documents,” Expert Systems with Applications, Vol. 26, No. 2, pp. 269-277.
18. Fattori, M., Pedrazzi, G. and Turra, R., 2003, “Text mining applied to patent mapping: a practical business case,” World Patent Information, Vol. 25, No. 4, pp. 335-342.
19. Feigenbaum, E., and McCorduck, P., 1983, The Fifth Generation. Reading, Addison-Wesley, MA.
20. Gruber, T.R., 1993, “A translation approach to portable ontologies” Knowledge Acquisition, Vol. 5, No. 2, pp.199-220.
21. Gupta, V.K., and Pangannaya, N.B., 2000, Carbon nanotubes: bibliometric analysis of patents, World Patent Information, Vol. 22, pp.185-189.
22. Hicks, D., Breitzman, T.; Olivastro, D., and Hamilton, K., 2001, “The changing composition of innovative activity in the US - a portrait based on patent analysis,” Research Policy, Vol. 30, No. 4, pp. 681-703.
23. Holl, B., Jaffe, A., Trajtenberg, M., 2000, Market Value and Patent Citations: A First Look. NBER Working Paper Series, Cambridge, MA.
24. Hou, J.L., Chuo, H.C., and Sun, M.T., 2003, "Heuristic and integrated approach for technical document authority and authentication sequence determination," International Journal of Production Research, Vol. 42, No. 9, pp.1747-1768.
25. Hou, J.L., Trappey, C.V., and Trappey, A.J.C., 2003, “Enabling centralized enterprise knowledge management services for the technology value chain,” International Journal Services Technology and Management, Vol. 4, No. 4-6, pp. 424-441.
26. Hsu, F.C., Trappey, A.J.C., Trappey, C.V., Hou, J.L., and Liu, S.J., 2004, "Technology and knowledge document cluster analysis for enterprise R&D strategic planning," International Journal of Technology Management, Vol. 36, No. 4, pp. 336-353.
27. Hull, D., Aït-Mokhtar, S., Chuat, M., Eisele, A., and Gaussier, É., 2001, “Language technologies and patent search and classification,” World Patent Information, Vol. 23, No. 3, pp. 265-268.
28. Idris, K., 2004, Intellectual property: A Power Tool for Economic Growth, World Intellectual Property Organization, Geneva.
29. Jones, K.S., 1972, “A statistical interpretation of term specificity and its application in retrieval,” Journal of Documentation, Vol. 28, No. 1, pp. 11-20.
30. Kantrowitz, M., Mohit, B., and Mittal, V.O., 2000, “Stemming and its effects on TFIDF ranking,” Proceedings, The 23th Annual International ACM SIGIR’00 Conference on Research and Development in Information Retrieval, Athens, Greece, July 24-28, pp. 357-359.
31. Karki, M., 1997, “Patent citation analysis: A policy analysis tool,” World Patent Information, Vol. 19, No. 4, pp.269-272.
32. Kohonen, T., Kaski S., Lagus, K., Salojarvi, J., Honkela, J., Paatero, V., and Saarela, A., 2000, “Self organization of a massive document collection,” IEEE Transactions on Neural Networks, Vol. 11, No. 3, pp. 574-585.
33. Kostoff, R.N., 1998, “The use and misuse of citation analysis in research evaluation,” Scientometrics, Vol.43, No. 1, pp. 27-43.
34. Kostoff, R.N., Toothman, D.R., Eberhart, H.J., and Humenik, J. A., 2001, “Text mining using database tomography and bibliometrics: A review,” Technological Forecasting and Social Change, Vol. 68, No. 3, pp. 223-253.
35. Kostoff, R.N., Tshiteya, R., Pfeil, K.M., and Humenik, J.A., 2002, “Electrochemical power text mining using bibliometrics and database tomography,” Journal of Power Sources, Vol. 110, No. 1, pp. 163-176.
36. Krier, M., and Zacca, F., 2002, “Automatic categorisation applications at the European patent office,” World Patent Information, Vol. 24, pp.187–196.
37. Lai, K.K., and Wu, S.J., 2005, “Using the patent co-citation approach to establish a new patent classification system,” Information Processing and Management, Vol. 41, No. 2, 2005, pp. 313-330.
38. Lamus, J.F., 1997, “Evaluation of the rank command as a tool for the bibliometric analysis,” World Patent Information, Vol. 19, No. 1, pp. 83-84.
39. Larkey, L.S., 1999, “A patent search and classification system,” Proceedings of the Fourth ACM Conference on Digital Libraries, pp.179-183.
40. Lee, C.S., Chen, Y.J., and Jian Z.W., 2003, “Ontology-based fuzzy event extraction agent for chinese e-news summarization,” Expert Systems with Applications, Vol. 25, No. 3, pp. 431-447.
41. Lee, J.H., and Kim, Y.G., 2001, “A stage model of organizational knowledge management: a latent content analysis”, Expert Systems with Applications, Vol. 20, No. 4, pp. 299-311.
42. Liao, C.H. (Advisor: Prof. Kuo, Y.H.), 2002, “Automatic ontology construction approach and its application for information classification,” M.S. Thesis, Dept. of Computer Science and Information Engineering, National Cheng Kung University, Taiwan.
43. Lin, S.H. (Advisor: Prof. Lin, Z.C.), 2005, Integrating CMP of innovative knowledge and commercial knowledge in multi-agent prototype model, M. S. Thesis, Department of Mechanical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan.
44. Loh, H., Tong; H.C., and Shen, L., 2006, “Automatic classification of patent documents for TRIZ users,” World Patent Information, Vol. 28, No. 1, pp. 6-13.
45. Lou, X., and Zincir-Heywood, A.N., 2003, “A comparison of SOM based document categorization systems,” Proceedings of the International Joint Conference on Neural Networks, Vol. 3, pp. 1786-1791.
46. Luhn, H.P., 1957, “A statistical approach to mechanized encoding and searching of literary information,” IBM Journal of Research and Development, Vol. 1, No. 4, pp. 309–317.
47. Matsuo, Y., and Ishizuka, M., 2004, “Keyword extraction from a single document using word co-occurrence statistical information,” International Journal on Artificial Intelligence Tools, Vol. 13, No 1, pp. 157-169.
48. Massey, L., 2003, “On the quality ART1 text clustering,” Neural Networks, Vol. 16, pp. 771-778.
49. Meier, J., and Sprague, R., 1996, “Towards a better understanding of electronic document management,” Proceedings of the Twenty-Ninth Hawaii International Conference on Systems Sciences, Vol. 5, pp. 53-61.
50. Meyer, M., 2000, “What is special about patent citations? Differences between scientific and patent citations,” Scientometrics, Vol. 49, No. 1, pp. 93-123.
51. Mladenic, D., and Grobelnik, M., 2003, “Feature selection on hierarchy of web documents,” Decision Support Systems, Vol. 35, pp. 45-87.
52. Moehrle, M.G., Walter, L., Geritz, A. and Müller, S., 2005, “Patent-based inventor profiles as a basis for human resource decisions in research and development,” R&D Management, Vol. 35, pp.513-524.
53. Mogee, M., 1991, “Using patent data for technology analysis and planning,” Research-Technology Management, Vol. 34, pp. 43-49.
54. Murata, M., Ma, Q., Uchimoto, K., Ozaku, H., Isahara, H., and Utiyama, M., 2000, “Japanese probabilistic information retrieval using location and category information,” Proceedings of the fifth international workshop on on Information retrieval with Asian languages, pp. 81-88.
55. Narin, F., 1994, “Patent bibliometrics,” Scientometics, Vol. 30, No. 1, pp. 147-155.
56. Narin, F., and Hamilton, K., 1996, “Biblometric performance measures,” Scientometrics, Vol. 36, No. 3, pp. 293-310.
57. O'Connor M.C., 2005, “Intermec says symbol violating trade law,” RFID Journal, http://www.rfidjournal.com/article/articleview/1773/1/1/, 2006/04/27
58. OECD, 2004, Patents and Innovation: Trend and Policy Challenges, OECD Publications, Paris.
59. Porter, M.F., 1980, “An algorithm for suffix stripping,” Program, Vol. 14, No. 3, pp. 130-137.
60. Richter, G., and MacFarlane, A., 2005, “The impact of metadata on the accuracy of automated patent classification,” World Patent Information, Vol. 27, No. 1, pp. 13-26.
61. Rumelhart, D., Hinton, G., and Williams R., 1986, Learning Internal Representations By Error Propagation, Parallel Distributed Processing Vol.1, MIT Press, Cambridge, MA
62. Salton, G., 1973, “Recent studies in automatic text analysis and document retrieval,” Journal of the ACM, Vol. 20, No. 2, pp. 258-278.
63. Schellner, I., 2002, “Japanese File Index classification and F-terms,” World Patent Information, Vol. 24, No. 3, pp. 197-201.
64. Sharma, S.C., 1996, Applied Multivariate Techniques, John Willy&Sons, NY.
65. Smith, H., 2002 “Automation of patent classification,” World Patent Information, Vol. 24, No. 4, pp. 269-271.
66. Tam, V., Santoso, A., and Setiono, R., 2002, “A comparative study of centroid-based, neighborhood-based and statistical approaches for effective document categorization,” Proceedings, 16th International Conference on Pattern Recognition, Vol. 4, pp.235-238.
67. Trappey, A.J.C., Hsu, F.C., Trappey, C.V., and Lin, C.I., 2005, Development of a patent document classification and search, Expert Systems with Applications, (accepted).
68. Trappey, A.J.C., and Kao, Burgess, H.S., 2005, "IP knowledge document summarization," Proceedings of the 2005 IEEE International Conference on Service Operations and Logistics, and Informatics (IEEE SOLI 2005).
69. Trappey, A.J.C., and Trappey, C.V., 2004, “Global content management services for product providers and purchasers,” Computers in Industry, Vol. 53, pp. 39-58.
70. Trappey, A.J.C., Hsu, F.C., Hou, J.L., Trappey, C.V., and Liu, S.J., 2004, “Designing a multi-channel legal knowledge service center using data analysis and contact center technology,” Proceedings of the 8th World Multiconference on Systemics, Cybernetics and Informatics (SCI 2004), Vol. 1, pp. 132-137.
71. Trappey, A.J.C., Trappey, C.V., Hsu, F.C., Kuo, J., Chao, C.L.J., and Lu, T.T.H., 2004, “The analysis and evaluation of customer satisfaction and service management,” Proceedings of the Annual International Symposium and 4th European Systems Engineering Conference (INCOSE 2004).
72. Trappey, C.V., Trappey, A.J.C., and Hsu, F.C., 2003, “Adapted risk control approach for SoC design project management,” Proceedings, the International Symposium on Product Life Cycle Management (PLM’2003).
73. Tu, C.L., and Trappey, A.J.C., 1998, “Decision making of mortgage loan approval using artificial neural network approach,” Journal of the Chinese Fuzzy Systems Association (Taiwan), Vol. 4, No.1, pp. 31-44.
74. Turban, E., and Aronson, J.E., 2001, Decision Support Systems and Intelligent Systems, Sixth Edition, Prentice-Hall International, Inc, Upper Saddle River, NJ.
75. von, Wartburg, I., Teichert, T. and Rost, K., 2005 “Inventive progress measured by multi-stage patent citation analysis,” Research Policy, Vol. 34, No. 10, pp. 1591-1607.
76. Wang, B.B., McKay, R.I., Abbass, H.A., and Barlow, M., 2002, “Learning text classifier using the domain concept hierarchy,” Proceedings, International Conference on Communications, Circuits and Systems and West Sino Expositions, Vol. 2, pp. 1230-1234.
77. Wang, W., Meng, W., and Yu, C., 2000, “Concept hierarchy based text database categorization in a metasearch engine environment,” Proceedings of the First International Conference on Web Information Systems Engineering, Vol. 1, pp. 283-290.
78. Yao, Y.H., Trappey, A.J.C., and Ho, P.S., 2003, “XML-based ISO9000 electronic document management system,” Robotics and CIM, Vol. 19, No. 4, pp. 355-370.
79. Yoon, B., and Park, Y., 2004, “A text-mining-based patent network: Analytical tool for high-technology trend,” Journal of High Technology Management Research, Vol. 15, pp. 37-50.
80. Yoon, B., Yoon, C. and Park, Y., 2002, “On the development and application of a self-organizing feature map-based patent map,” R&D Management, Vol. 32, No. 4, pp. 291-300.