簡易檢索 / 詳目顯示

研究生: 亞歷杭德拉
Alejandra Campero Diaz
論文名稱: A Data Mining and Time Series Integrated Approach for Analyzing Semiconductor MES and FDC Data to Enhance Overall Usage Effectiveness
整合資料挖礦和時間序列以分析半導體製造執行系統和事故預測及分類系統資料以提升綜合使用效益之研究
指導教授: 簡禎富
Chen-Fu Chien
口試委員:
學位類別: 碩士
Master
系所名稱: 工學院 - 工業工程與工程管理學系
Department of Industrial Engineering and Engineering Management
論文出版年: 2008
畢業學年度: 96
語文別: 英文
論文頁數: 71
中文關鍵詞: Overall usage effectivenessData miningDecision treeClusteringTime seriesIndirect material usageSemiconductor manufacturing
相關次數: 點閱:1下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • Wafer fabrication is a complex, costly and lengthy process that involves hundreds of process steps with monitoring of the corresponding process parameters at the same time to enhance the yield. Large amount of data is automatically collected during these processes in wafer fabrication facility. Thus, potential useful information can be extracted from huge data sources to enhance decision quality and enhance operational effectiveness. This study aims to develop a framework to integrate FDC and MES data and then propose an approach based on data mining and time series techniques to investigate the data in order to enhance the overall usage effectiveness (OUE) for cost reduction. We validated this approach with an empirical study in a semiconductor company in Taiwan and the results demonstrated the practical viability of this approach. The extracted information and knowledge is helpful to engineers for identifying the major tools factors affecting indirect material usage effectiveness as well as for indentify periods of time when a specific tool is working using either low or high quantity of material.


    ABSTRACT iv Acknowledgments v 1 CHAPTER: INTRODUCTION 1 1.1 Background and Motivation 1 1.2 Research aims 1 1.3 Overview of this thesis 2 2 CHAPTER: LITERATURE REVIEW 3 2.1 Semiconductor manufacturing industry 3 2.1.1 Industry overview 3 2.1.2 Semiconductor manufacturing process 4 2.1.3 Chemical Mechanical Polishing (CMP) 6 2.2 Fault Detection and Classification (FDC) 8 2.3 Overall Equipment Effectiveness (OEE) 10 2.4 Data mining 12 2.4.1 Data mining model functions 12 2.4.2 Data mining process 13 2.5 Time series concept 15 2.6 Clustering 16 2.6.1 Objective function-based clustering 17 2.7 Decision tree 18 2.7.1 Decision tree construction 19 2.7.2 Decision tree algorithms 20 2.7.3 Decision tree advantages in comparisons with other methods 21 3 CHAPTER: APROACH PROPOSED 23 3.1 Research framework 23 3.2 Problem definition 25 3.3 Data selection and Calculation 26 3.4 Data preparation 27 3.4.1 Data cleaning 28 3.4.2 Clustering of time series data 29 3.4.3 Data integration 30 3.5 Decision tree construction 31 3.5.1 Decision tree growing 31 3.5.2 Decision rules extraction and Evaluation 32 3.6 Evaluation and Interpretation 33 4 CHAPTER: AN EMPIRICAL STUDY 35 4.1 Case 1 37 4.1.1 Problem definition of real case 37 4.1.2 Data selection and Calculation 38 4.1.3 Data preparation 39 4.1.4 Decision tree construction 45 4.1.5 Evaluation and Interpretation 59 4.2 Case 2 62 4.2.1 Problem definition 62 4.2.2 Data selection and Calculation 62 4.2.3 Data preparation 62 4.2.4 Decision tree construction 64 4.2.5 Evaluation and Interpretation 66 4.3 Summary 67 5 CONCLUSION 69 REFERENCES 70

    Armes, V.A., Jerry Gililland, John Konopka, and Rich Schnabl (1995), “Semiconductor Manufacturing Productivity Overall Equipment Effectiveness (OEE) Guidebook, Revision 1”, SEMATECH.

    Berry, Michael J.A., and Gordon S. Linoff (2004), “Data Mining techniques: For Marketing, Sales, and Customer Relationship Management”, Wiley Publishing, Inc., Indianapolis, Indiana.

    Box, George, Gwilym M. Jenkins, and Gregory Reinsel (1976), “Time series analysis: Forecasting and control”, San Francisco: Holden-Day, 2nd ed.

    Braha, Dan, and Armin Shmilovici (2003), “On the use of Decision Tree Induction for Discovery of Interactions in a Photolithographic Process”, IEEE Transactions on Semiconductor Manufacturing, 16(4):644-652.

    Chang, Yung-Cheng, and Fan-Tien Cheng (2006), “Manufacturability of Multivariate Applications in the Semiconductor Industry”, 2006 IEEE, International Conference on Automation Science and Engineering.

    Chen, Argon, “Basic of FDC”, 3rd AEC/APC symposium-Asia Tutorial.

    Chen, Mao-Shiung, T.F. Yen, Barry Coonan (2004), “Real-time Fault Detection and Classification for Manufacturing Etch Tools”, 2004 Institute of Electrical and Electronics Engineers (IEEE).

    Chien, Chen-Fu and C. Hsu (2006), “A novel method for determining machine subgroups and backups with an empirical study for semiconductor manufacturing,” Journal of Intelligent Manufacturing, 17, 429-440.

    Chien, Chen-Fu, C. Hsu, H. Chou and C. Lin (2006), “Overall Wafer Effectiveness (OWE): A Novel Industry Standard for Wafer Productivity,” Proceedings of International Symposium of Semiconductor Manufacturing, pp. 317-320.

    Chien, Chen-Fu, H. Chen, J. Wu and C. Hu (2007), “Construct the OGE for promoting tool group productivity in semiconductor manufacturing,” International Journal of Production Research, 45(3), 509-524.

    Chien, Chen-Fu and L. Chen (2007), “Using Rough Set Theory to Recruit and Retain High-Potential Talents for Semiconductor Manufacturing,” IEEE Transactions on Semiconductor Manufacturing, 20(4), 528-541.

    Chien, Chen-Fu, W. Wang and J. Cheng (2007), “Data mining for yield enhancement in semiconductor manufacturing and an empirical study,” Expert Systems with Applications, 33(1), 192-198.

    CMP, Chemical Mechanical Planarization, polishing equipment, Crystec Technology Trading GmbH, from www.crystec.com.

    Cotofrei, Cotofrei, and Kilian Stoffel (2000), “Rule Extraction from Time Series Database using Classification Trees”, Swiss National Foundation.

    Dabbas, R.M., and H.Chen (2001). “Mining semiconductor manufacturing data for productivity improvement- an integrated relational database approach”, Computers in Industry, vol.45, pp.29-44(16).

    Fayyad, U.M., and K.B. Irani (1992), “On the Handling of Continuous-Values attributes in Decision Tree Generation”, Machine Learning, vol. 8, pp. 87-102.

    Geurts, Pierre (2002), “Contribution to Decision Tree Induction: Bias/Variance Trade off and Time Series Classification”, Annee Academique, http://www.montefiore.ulg.ac.be/services/stochastic/pubs/2002/Geu02.

    Geurts, Pierre (2001), “Pattern Extraction for Time Series Classification”, Springer-Verlag Berlin Heidelberg, pp. 115-127.

    Gionis, Aristides, and Heikki Mannila, “Segmentation algorithms for time series and sequence data, Helsinki Institute for Information Technology”, University of Helsinki, Finland, from http://www.cs.helsinki.fi/u/gionis/sdm05tutorial.pdf.

    Goodlin, Brian E., Duane S. Boning, Herbert H. Sawin, and Barry M. Wise (2003), “Simultaneous Fault Detection and Classification for Semiconductor Manufacturing Tools”, Journal of The Electromechanical Society, 150 (12) G778-G784.

    Han, J., and M. Kamber (2001), “Data Mining: Concepts and Techniques”, Morgan Kaufmann, Second Edition, pp. 346-389.

    Hechenbichler, Klaus (2004), “Clustering and Classification” Max-Planck-Institut für Psychiatrie.

    Hsu, S. and Chen-Fu Chien (2007), “Hybrid Data Mining Approach for Pattern Extraction from Wafer Bin Map to Improve Yield in Semiconductor Manufacturing,” International Journal of Production Economics, 107, 88-103.

    Iwata, Yoshio, and Samuel C. Wood (2000), “Effect of Fab Scale, Process Diversity and Setup on Semiconductor Wafer Processing Cost”, 2000 Institute of Electrical and Electronics Engineers (IEEE).

    Kass, G.V., (1980), “An Exploratory Technique for Investigating Large Quantities of Categorical Data”, Applied Statistic, pp. 119-127.

    Kaufmann, L., and P.J. Rousseeuw (1990), “Finding Groups in Data: An introduction to Cluster Analysis”, John Wiley & Sond Ltd., New York.

    Keogh, Eamonn J., Selina Chu, David Hart, and Michael J. Pazzani (2001), “An Online Algorithm for Segmenting Time Series”, Proceedings of IEEE international conference on Data Mining, pp. 289-296.

    Krishna, K and Murt, Narasimha M (1999), “Genetic K- Means Algorithms”, IEEE Trans. Syst. Man Cybernet. 29 (3) pp.433-439.

    Last, Mark, Abraham Kandel, and Horst Bunke, (2004) “Data Mining in Time Series Database”, World Scientific Press.

    Liao, T. Warren (2005), “Clustering of time series data – a survey”, Elsevier - The journal of pattern recognition 38 pp. 1857-1874.

    Lopez, Llanos Mora, Inmaculada Fortes Ruiz, Rafael Morales Bueno, and Francisco Triguero Ruiz (2004), “Dynamic Discretization of Continuous Values from Time Series”, Springer-Verlag Berlin Heidelberg, pp. 280-291.

    Mason, Scott J., (2000), “Overview of Semiconductor Fabrication Technology”, from www.engr.uky.edu.

    May, Gary S., and Costas J. Spanos (2006), “Fundamentals of Semiconductor Manufacturing and Process Control”, John Wiley & Sons, Inc.

    McQueen, J., (1967), “Some Methods for Classification and Analysis of Multivariate Observation”, Symposium on Mathematical Statistic and Probability, vol. 1, pp. 281-297.

    Mitra, Sushmita, Kishori M. Konwar, and Sankar K. Pal (2002), “Fuzzy Decision Tree, Linguistic Rules and Fuzzy Knowledge-Based Network: Generation and Evaluation”. IEEE Transactions on Systems, Man, and Cybernetics, pp. 328-339.

    Pace, John, Maxime Zagrebnov, and Joost van Herk (2003), “Data Analysis Theorical & Practical Approach”, AEC/APC symposium XV, Colorado Springs, CO.

    Pfitzner, Lothar, Nobert Benesch, Richard Ochsner, Christian Schmidt, Claus Schneider, Thomas Tschaftary, Ralph Trunk , and Hans-Martin Dudenhausen (2001), “Cost reduction strategies for wafer expenditures”, Elsevier- Microelectronic Engineering 56 pp. 61-71.

    Qin, S. Joe (2000), “Fault Detection and Classification Theory for the User”, Department of Chemical Engineering, University of Texas, from control/che.utexas.edu/qinlab.

    Rendell, David, and David Albrecht (2007), “Case Study: CMP Slurry Savings Using Precise Flow Control”, Controlled Environments Magazine (April 2007) .

    Sanderson, Rob (2008), COMP527: Data mining, Department of Computer Science, University of Liverpool.

    Shatkay, H., and S.B. Zdonik (1996), “Approximate queries and representations for large data sequences”, Proceeding of the Twelfth International Conference on Data Engineering, pp. 536-545.

    Sullivan, David G. (2008), “Data Mining: Preparing the Data”, Computer Science, Boston University.

    Tsai, Yi- Chun, and J. Jao (2002), “Activity-Based Costing Application in Indirect Material Cost Control-Photo-resist”, No.9, Creation RD.1, science-Based Industrial Park Hsin-Chu, Taiwan, R.O.C.

    Velleman, P.F., and D.C. Hoaglin (1981), “Applications, Basic, and Computing of Exploratory Data Analysis”. Boston: Duxbury Press

    Wu, Muh-Cherng, C.S. Chien, and K.S. Lu (2005), “Yield improvement planning for the recycle processes of test wafers”, The International Journal of advanced Manufacturing Technology, vol. 27, pp. 1228-1234.

    Yang, Ying (2003), “Discretization for Naive-Bayes Learning”, PhD thesis, School of Computer Science and Software Engineering of Monash University.

    Yamada, Yuu,Einoshin Suzuki, Hideto Yokio, and Katsuhiko Takabayashi (2003), “Decision-tree Induction from Time-series Data Based on a Standard-example Split Test”, Electrical and Computer Engineering, Yokohama National University.

    Y. Zhou, J. Hahn, M. S. Mannan (2003), “Fault detection and classification in chemical processes based on neural networks with feature extraction”, ISA Transactions, Vol.42, No.4, pp.651-664.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE