研究生: |
林國照 Kuo-Chao Lin |
---|---|
論文名稱: |
以Gabor小波為基礎之熱影像與光學影像辨識 Infrared Image and Optical Image Recognitions Based on Gabor Wavelets |
指導教授: |
陳文良
Wen-Liang Chen |
口試委員: | |
學位類別: |
博士 Doctor |
系所名稱: |
工學院 - 動力機械工程學系 Department of Power Mechanical Engineering |
論文出版年: | 2007 |
畢業學年度: | 95 |
語文別: | 英文 |
論文頁數: | 94 |
中文關鍵詞: | 捷伯小波 、紅外線影像 、光學影像 、影像濾波 、影像辨識 |
外文關鍵詞: | Gabor wavelet, Infrared image, Optical image, Image filter, Image recognition |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
由於Gabor小波(Wavelet)具有多頻濾波能力之優異特性,以Gabor小波為影像表示(Representation)之辨識方法,被廣泛應用於臉型辨識。雖然,過去有諸多以Gabor小波函數,調整濾波器的中心頻率(Center frequency)與方向(Filter orientation)為基礎,進行影像辨識之研究,但尚有許多主題值得深入探討。
以Gabor小波為基礎之影像辨識,傳統方法大部分注重特徵擷取或降階處理。但對於完整影像辨識方法,應包含影像前處理、影像表示、特徵擷取(Feature extraction)、降階處理(Dimension reduction)與分類模式(Classification model)。本文將針對紅外線影像(Infrared image)與光學影像(Optical image)兩種影像類別,以Gabor小波為基礎,建置一套影像識別系統。因此,提出一種強化Gabor小波之表示方法,以全部原始影像之頻率響應為基準,去設計選擇濾波器之中心頻率與濾波器之方向變化之數量,做為Gabor小波所需影像表示之依據。其次,在分類模式方面,發展兩種方法去增強辨識效果。修正K值最接近鄰近規則(Modified K-nearest neighbor rule),是以原始影像間彼此關聯性(Cross correlation)大小,來決定對距離上之K值選擇。另外一個辨識模式,是以一個自動調整方式(Self-adaptive)用於以輻射函數為基底之類神經網路(Radial basis function networks) ,此方法可經由訓練過程,選擇隱藏層(Hidden neurons)之數量增減,藉由逼近誤差(Approximation error)去防範過度訓練(Over fitting)或訓練不足(Under fitting)。
最後,在紅外線影像與光學影像實用上,所提出之辨識系統,分成訓練影像與測試影像,經由試驗結果,顯示具有良好辨識效果,證明此方法可行性。
Gabor wavelets have been successfully applied in various areas from image processing to pattern recognition. This success is mostly due to the fact that Gabor wavelets are equipped with a multi-channel filter capability such that a desired representation of the filter banks can be developed through selecting the parameters, i.e., the center frequency and the orientation of the filter. Although many types of image recognition based on Gabor wavelets have been studied and reported, there are still many valuable topics to be further investigated.
The conventional image recognition based on Gabor wavelets is focusing on feature extraction in order to reduce the dimension of the training images. However, this feature extraction is only one of the processes in pattern recognition. For the recognition of the infrared image and optical image, it includes image representation, feature extraction or dimension reduction, classification technique, and the object character. The aim of this dissertation is to construct an image recognition system based on Gabor wavelets for infrared image and optical image. Therefore, An enhanced representation of the Gabor wavelets is proposed, in which the properties of Gaussian mask in Gabor wavelets is developed to enhance the enveloped function, and simultaneously the parameters of the filter based on Gabor wavelets is designed depending on the frequency response of the training images.
In addition, the classification technique is important in the pattern recognition. In this dissertation, two classification methods are developed. The modified K-nearest neighbor rule is employed, in which the proposed modified rule combined with variance of the training images is used to find the optimal K value of the nearest distance between training images and testing images. For the neural network classification model, the self-adaptive radial basis function networks is employed, in which the self-adaptive networks has the property that during the training iteration the number of hidden neurons can be either increased or decreased according to the approximation error to prevent over fitting or under fitting.
Furthermore, the practical application is also an important issue on the infrared image and optical image. Almost all of the reported Gabor-based image recognitions are applied on the face image. In order to consider the different kind of image, this dissertation proposes an image recognition method based on the Gabor wavelets, which can be applied on both the infrared image and optical image. Some experiments including infrared image and optical image recognitions are given. The good performances are verified through using the proposed scheme in this dissertation.
[1] J. G. Daugman, "Complete discrete 2-d Gabor transforms by neural networks for image analysis and compression", IEEE Transactions on Acoustics, Speech, and Signal Processin, vol.36, no. 7, pp. 1169-1179, 1988.
[2] T. S. Lee, "Image representation using 2D Gabor wavelets", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, no 10, pp. 959-971, 1996.
[3] J. K. Kamarainen, V. Kyrki, and H. Kalviainen, "Invariance properties of Gabor filter-based feature - - overview and applications", IEEE Transactions on Image Processing, vol.15, no. 5, pp. 1088-1099, 2006.
[4] I. Kalliomaki and J. Lampinen, "On steerability of Gabor-type filters for feature detection", Pattern Recognition Letters, vol. 28, pp. 904-911, 2007.
[5] V. Kyrki, J. K. Kamarainen and H. Kalviainen, "Simple Gabor feature space for invariant object recognition", Pattern Recognition Letters, vol. 25, pp. 311-318, 2004.
[6] L. Wiskott, J.M. Fellous, N. Kruger and C. Malsburg, "Face recognition by elastic bunch graph matching", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.19, no.7, pp. 775–779, 1997.
[7] D. H. Liu, K. M. Lam, and L. S. Shen, "Optimal sampling of Gabor feature for face recognition", Pattern Recognition Letters, vol. 25, pp. 267-276, 2004.
[8] F. Li and K. Xu, "Optimal Gabor kernel's scale and orientation selection for face classification", Optics & Laser Technology, vol. 39, Issue 4, pp. 852-857, 2007.
[9] S. Arivazhagan, L. Ganesan and S. Padam Priyal, "Texture classification using Gabor wavelets based rotation invariant features", Pattern Recognition Letters, vol. 27, pp. 1976-1982, 2006.
[10] C. S. Sastry, M. Ravindranath, A. K. Pujari and B.L. Deekshatulu, "A modified Gabor function for content based image retrieval", Pattern Recognition Letters, vol. 28, pp. 293-300, 2007.
[11] F. Benhammadi, M. N. Amirouche, H. Hentous, K. B. Beghdad and M. Aissani, "Fingerprint matching from minutiae texture maps", Pattern Recognition, vol. 40, Issue 1, pp. 189-197, 2007.
[12] J. Yang, L. Liu, T. Jiang and Y. Fan, "A modified Gabor filter design method for fingerprint image enhancement", Pattern Recognition Letters, vol. 24, pp. 1805-1817, 2003.
[13] Y. M. Su and J. F. Wang, "A novel stroke extraction method for Chinese characters using Gabor filters", Pattern Recognition, vol. 36, pp. 635-647, 2003.
[14] X. Wang, X. Ding and C. Liu, "Gabor filters-based feature extraction for character recognition", Pattern Recognition, vol. 38, Issue 3, pp. 369-379, 2005.
[15] L. L. Shen, L. Bai, and M. Fairhurst, "Gabor wavelets and general discriminant analysis for face identification and verification", Image and Vision Computing, vol. 25, Issue 5, pp. 553-563, 2007.
[16] C. Liu, "Gabor-based kernel PCA with fractional power polynomial models for face recognition", IEEE Transactions On Pattern Analysis and Machine Intelligence, vol. 26, no. 5, pp. 572-581, 2004.
[17] B. Zhu, L. Jiang, Y. Luo, and Y. Tao, "Gabor feature-based apple quality inspection using kernel principal component analysis", Journal of Food Engineering, vol. 81, Issue 4, pp. 741-749, 2007.
[18] C. Liu, H. Wechsler, "Gabor feature based classification using the enhanced fisher linear discriminant model for face recognition", IEEE Transactions on Image Processing, vol. 11, no. 4, pp 467-476, 2002.
[19] L. C. Lin, C. J. Cheng, "Pattern discrimination of joint transform correlator based on wavelet subband filtering", Optics Communications 233, pp.283-296, 2004.
[20] R. Alterson, M. Spetsakis, "Object recognition with adaptive Gabor features", Image and Vision Computing, vol. 22, Issue 12, pp. 1007-1014, 2004.
[21] Y. Xiang, V. Yingling, and J. Silverberg, "Quantification of trabecular bone mass and orientation using Gabor wavelets", Proceedings of the 2003 ACM Symposium on Applied Computing SAC’03, pp. 183-188, 2003.
[22] D.A. Pollen and S.F. Ronner, "Visual cortical neurons as localized spatial frequency filters", IEEE Transactions on Systems, Man and Cybernetics, vol.13, no.5, pp. 907–916, 1983.
[23] R. C. Gonzalez, P. Wintz, Digital Image Processing, Addison-Wesley Publishing company, 1987.
[24] H. Xiong, T. Zhang, and Y. S. Moon, " A translation- and scale-invariant adaptive wavelet transform", IEEE Transactions on Image Processing, vol. 9, pp. 2100-2108, 2000.
[25] T. Suk, J. Flusser, "Combined blur and affine moment invariants and their use in pattern recognition", Pattern Recognition, vol.36, pp. 2895-2907, 2003.
[26] L. Hermes, J. M. Buhmann, "A minimum entropy approach to adaptive image polygonization", IEEE Transactions on Image Processing, vol. 12, pp. 1243-1258, 2003.
[27] X. Fu, L. Wang, "Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance", IEEE Transactions on Systems, Man and Cybernetics—part B: cybernetics, vol. 33, no.3, pp. 399-409, 2003.
[28] P.N. Belhumeur, J.P. Hespanha, and D.J. Kriegman, "Eigenfaces vs. fisherface: recognition using class specific linear projection", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 711–720, 1997.
[29] A. Hyvarinen, E. Oja, "Independent component analysis:algorithms and applications", Neural Networks, vol. 13, pp 411-430, 2000.
[30] X.Y. Jing, D. Zhang and Y.F. Yao, "Improvements on the linear discrimination technique with application to face recognition", Pattern Recognition Letter, vol. 24, pp. 2695–2701, 2003.
[31] A. Chakrabarti, A. N. Rajagopalan, and R. Chellappa, "Super-Resolution of Face Images Using Kernel PCA-Based Prior", IEEE Transactions on Multimedia, vol. 9, no. 4, pp. 888–892, 2007.
[32] Lu, J., Plataniotis, K.N., and Venetsanopoulos, A.N., "Face recognition using kernel direct discriminant analysis algorithms". IEEE Transactions on Neural Networks, vol.14, no. 1, pp. 117-126, 2003.
[33] Z. Zhu, M. Tang, and H. Lu, "A new robust circular Gabor based object matching by using weighted Hausdorff distance", Pattern Recognition Letters, vol. 25, pp. 515-523, 2004.
[34] J. S. R. Jang, C. T. Sun, and E. Mizutani, Neuro-Fuzzy and Soft Computing, Prentice Hall, 1997.
[35] M. B. Saintey, D. P. Almond, "An artificial neural network interpreter for transient thermography image data", NDT & E International, vol. 30, no.5, pp. 291-295, 1997.
[36] W. Ko, H. S. Cho, "Solder joints inspection using a neural network and fuzzy rule-based classification method", IEEE Transactions on Electronics Packaging Manufacturing, vol. 23, no. 2, pp. 93-103, 2000.
[37] W. Pedrycz, "Conditional fuzzy clustering in the design of radial basis function neural networks", IEEE Transactions on Neural Networks, vol. 9, no. 4, pp. 601–612, 1988.
[38] A. Esposito, M. Marinaro, D. Oricchio and S. Scarpetta, "Approximation of continuous and discontinuous mappings by a growing neural RBF-based algorithm", Neural Networks, vol. 13, no. 6, pp. 651–665, 2000.
[39] K.Z. Mao, "RBF neural network center selection based on Fisher ratio class separability measure", IEEE Transactions on Neural Networks, vol.13, no. 5, pp. 1211–1217, 2002.
[40] M.J. Er, S.Q. Wu, J.W. Lu and H.L. Toh, "Face recognition with radial basis function (RBF) neural networks", IEEE Transactions on Neural Networks, vol. 13, no. 3, pp. 697–710, 2002.
[41] F. Girosi and T. Poggio, "Networks and the best approximation property", Biol. Cybern, vol. 63, no. 3, pp. 169–176, 1990.
[42] D. J. C. Mackay, "Bayesian interpolation", Neural Computing, vol. 4, no. 3, pp. 415-447, 1992.
[43] S. Lee, R. M. Kil, "A Gaussian potential function network with hierarchically self-organizating learning", Neural Network, vol. 4, pp. 207-224, 1991.
[44] J. C. Platt, "A resource allocating network for function interpolation", Neural Computing, vol. 3, pp. 213-225, 1991.
[45] Y. W. Lu, N. Sundararajan, P. Saratchandran, "A sequential learning scheme for function approximation and using minimal radial basis neural networks", Neural Computing, vol. 9, no.2, pp. 1-18, 1997.
[46] X. Xie, K. M. Lam, "Gabor-based kernel PCA with doubly nonlinear mapping for face recognition with a single face image", IEEE Transactions on Image Processing, vol. 15, no. 9, pp 2481-2492, 2006.
[47] C. Liu, "Independent component analysis of Gabor features for face recognition", IEEE Transactions on neural networks, vol. 14, no. 4, pp 919-928, 2003.
[48] W. F. Liu, Z. F. Wang, "Facial expression recognition based on fusion of multiple Gabor feature", The 18th international conference on pattern recognition, 2006.
[49] M. Grudin, "On internal representations in face recognition", Pattern Recognition, vol. 33, pp. 1161–1177, 2000.
[50] B. Moghaddam, "Principal manifolds and probabilistic subspaces for visual recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 6, pp. 780–788, 2002.
[51] T.K. Kim, H. Kim, W. Hwang and J. Kittler, "Independent component analysis in a local facial residue space for face recognition", Pattern Recognition, vol. 37, pp. 1873–1885, 2004.
[52] Liu and H. Wechsler, "Evolutionary pursuit and its application to face recognition", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 6, pp. 570–582, 2000.
[53] L. Wiskott, J.M. Fellous, N. Kruger and C. Malsburg, "Face recognition by elastic bunch graph matching", IEEE Transactions on Pattern Analysis and Machine Intelligence, 19, pp. 775–779, 1997.
[54] A. Tefas, C. Kotropoulos and I. Pitas, "Using support vector machines to enhance the performance of elastic graph matching for frontal face authentication", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 7, pp. 735–746, 2001.
[55] R. Newport, "Infrared Electrical Inspection Myths ", SPIE Vol. Thermosense XIX, vol. 3056, pp124-132, 1997.
[56] Dong-Ho Lee, "Thermal Analysis of Integrated-Circuit Chips Using Thermographic Imaging Techniques" IEEE Transaction on Instrumentation and Measurement, vol. 43, no.6, pp. 824-829, 1994.
[57] C. H. Blazquez, "Detection of Problems in High Power Voltage Transmission and Distribution Lines with an Infrared Scanner/Video System ", SPIE, vol. 2245, Thermosense XⅤI, pp.27-32, 1994.
[58] K. W. Whang, J. K. Kim, "Discharge physics of alternating current plasma display panels", Journal of Display Technology, vol. 1, no. 2. pp. 295-303, 2005.
[59] C. K. Kelch, P. E. Grover, "Using Thermography to Detect Misalignment in Coupled Equipment", SPIE, vol. 2766, Thermosense XⅧ, pp.91-100, 1996.
[60] M. .Prystay, H. Wang, and A. Garcia-Rejon., "Application of Thermographic Temperature Measurements in Injection Moulding and Blow Moulding of Plastics", SPIE, vol. 2766, Thermosense XⅧ, pp.5- 14, 1996.
[61] R. P. Madding, N. A. MacNamara, "Steam Leak Cost Estimation Using Thermographically Acquired Pipe Temperature Data", SPIE, vol. 3056, Thermosense XIX, pp.146-152, 1997.
[62] S. Johnson, R. Neff, "Improving thermographic analysis using image histories", SPIE, vol. 3056, Thermosense XIX, pp.164-166, 1997.
[63] D. H. Lee, "Thermal Analysis of Integrated-Circuit Chips Using Thermographic Imaging Techniques" IEEE Transactions on Instrumentation and Measurement, vol. 43, no.6, pp.824-829, 1994.
[64] E. Keenan, R. G. Wright, M. Zgol, R. Mulligan, V. Tagliava, and L. V. Kirkland, " IR circuit board and IC failure detection" IEEE Transactions on Aerospace and Electronic Systems Magazine, vol. 19, no.6, pp.9-15, 2004.
[65] L. G. Allred, T. R. Howard, and G. Serpen, "Thermal Imaging is the Sole Basis for Repairing Circuit Cards in the F-16 Flight Control Panel ", AUTOTESTCON '96, Test Technology and Commercialization Conference, pp. 418-424, 1996.
[66] V. Szekely, M. Rencz, "Image Processing Procedures for the Thermal Measurements" IEEE Transactions on Components and Packaging Technology, vol. 22, no. 2, pp.259-265, 1999.
[67] M. J. Berg. , "Comparison of Component and System Level thermal performance for various thermally enhanced packages and PCB layouts", IEEE /CPMT Electronic Packaging Technology Conference, pp.69-74, 1997.
[68] S. A. Merryman, R. M. Nelms, "Diagnostic Technique for Power Systems Utilizing Infrared Thermal Imaging", IEEE Transactions on Industrial Electronics, vol. 42, no.6, pp.615-628, 1995.
[69] M. Sonka, V. Hlavac, and R. Boyle, Image Processing, Analysis, and Machine Vision, PWS Publising, 1999.
[70] R. L. Allen, D. W. Mills, Signal Analysis, IEEE, 2004.
[71] J. Duchene and S. Leclercq, "An Optimal Transformation for Discriminant and Principal Component Analysis", IEEE Transactions on pattern analysis and Machine intelligence, vol. 10 no. 6, pp. 978-983, 1986.
[72] G. C. Goodwin, K. S. Sin, Adaptive Filtering Prediction and Control, Prentice-Hall, 1984.
[73] Accessible at http://www.cam-orl.co.uk/facedatabase.html