研究生: |
劉圳堯 Liu, Zhen-Yao |
---|---|
論文名稱: |
基於簡化群體演算法的深度學習模型超參數優化及其在圖像分類與圖結構數據中的應用研究 Simplified Swarm Optimization for Hyperparameter Optimization of Deep Learning Models and Applications in Image Classification and Graph-Structured Data |
指導教授: |
葉維彰
Yeh, Wei-Chang |
口試委員: |
惠霖
Hui, Lin 黃煌文 Huang, Huang-Wen 陳以錚 Chen, Yi-Cheng 陳永輝 Chen, Yung-Hui |
學位類別: |
博士 Doctor |
系所名稱: |
工學院 - 工業工程與工程管理學系 Department of Industrial Engineering and Engineering Management |
論文出版年: | 2025 |
畢業學年度: | 113 |
語文別: | 英文 |
論文頁數: | 97 |
中文關鍵詞: | 簡化群體演算法 、卷積神經網路 、二分圖 、圖卷積網路 、環境聲音分類 、推薦系統 |
外文關鍵詞: | Simplified Swarm Optimization, Convolutional Neural Network, Bipartite Graph, Graph Convolutional Network, Environmental Sound Classification, Recommendation System |
相關次數: | 點閱:3 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
近年來,環境聲音分類(Environmental Sound Classification, ESC)逐漸受到學術界的廣泛關注。然而,由於環境聲音源的多樣性與複雜性——其受眾多參數驅動——使得聲音分類成為一項頗具挑戰性的任務。與此同時,推薦系統已成為現代數位平臺(如電子商務、社交媒體、流媒體服務、音樂應用程式和學術搜尋引擎)的核心組成部分。這些系統通常採用二分圖結構來建模使用者與物品之間的關係。在本研究中,環境音訊信號被轉換為圖像形式(如頻譜圖),從而可以利用卷積神經網路進行基於圖像的聲音事件分類。該方法充分發揮了視覺模式識別的優勢,更有效地提取了環境聲音的時域和頻域特徵。 借助強大的結構化資料處理能力和高階資訊建模能力,圖神經網路(Graph Neural Networks, GNNs),特別是圖卷積網路(Graph Convolutional Networks, GCNs)已成為眾多推薦問題中的先進解決方案。GCN能夠有效融合結構關係(例如使用者-物品交互)與節點特徵(例如使用者偏好和物品屬性),從而實現更為準確和個性化的推薦。在本研究中,還引入了眼動追蹤技術,用於分析使用者的視覺注意模式,從而在圖結構中增強使用者偏好行為的建模。為應對超參數調整與模型優化的挑戰,本研究採用簡化群體演算法(Simplified Swarm Optimization, SSO)對卷積神經網路和二分圖卷積網路的超參數進行優化。通過在圖像分類任務(基於ESC資料集生成的頻譜圖)和圖結構推薦任務上的大量實驗驗證,SSO訓練的模型在性能和準確率方面均顯著優於過去傳統的多數研究方法。這些結果突顯了所提出的基於SSO的優化框架在電腦視覺和圖結構資料處理等多個深度學習領域中的有效性、適應性和廣泛適用性。
Noise is an ever-present aspect of daily life, and ambient sounds play a crucial role in a variety of domains, including urban intelligence, spatial localization, security monitoring, mechanical auditory systems, and ecological observation. In recent years, Environmental Sound Classification (ESC) has garnered increasing attention in the academic community. However, the diversity and complexity of environmental sound sources—driven by numerous parameters—make sound classification a particularly challenging task. Simultaneously, recommendation systems have become integral to modern digital platforms, such as e-commerce, social media, streaming services, music applications, and academic search engines. These systems commonly employ bipartite graphs to model relationships between users and items.
In this study, environmental audio signals are converted into image representations (e.g., spectrograms), allowing the use of convolutional neural networks for image-based classification of sound events. This approach leverages the strengths of visual pattern recognition to better capture the temporal and spectral features of environmental sounds. With their strong capacity to process structured data and capture high-order interactions, Graph Neural Networks (GNNs)—particularly Graph Convolutional Networks (GCNs)—have become state-of-the-art solutions for many recommendation problems. GCNs effectively integrate structural relationships (e.g., user-item interactions) with node features (e.g., user preferences and item attributes), enabling more accurate and personalized recommendations. In our research, eye-tracking technology is incorporated to analyze visual attention patterns and enhance the modeling of user preference behaviors within the graph structure.
To address the challenge of hyperparameter tuning and model optimization, this study optimizes the hyperparameters of both convolutional neural networks and bipartite GCNs by Simplified Swarm Optimization algorithm. Through extensive experiments on both image classification tasks (using spectrograms derived from ESC datasets) and graph-based recommendation tasks, the models trained via SSO demonstrate consistently high performance and improved accuracy compared to traditional research methods. These results highlight the effectiveness, adaptability, and broad applicability of the proposed SSO-based optimization framework across distinct deep learning domains, including computer vision and graph-structured data processing.
[1] S. Chu, S. Narayanan, and C.-C. J. Kuo, "Environmental sound recognition with time–frequency audio features," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 6, pp. 1142-1158, 2009.
[2] F. Demir, M. Turkoglu, M. Aslan, and A. Sengur, "A new pyramidal concatenated CNN approach for environmental sound classification," Applied Acoustics, vol. 170, p. 107520, 2020.
[3] P. Aumond et al., "A study of the accuracy of mobile technology for measuring urban noise pollution in large scale participatory sensing campaigns," Applied Acoustics, vol. 117, pp. 219-226, 2017.
[4] J. Cao, M. Cao, J. Wang, C. Yin, D. Wang, and P.-P. Vidal, "Urban noise recognition with convolutional neural network," Multimedia Tools and Applications, vol. 78, no. 20, pp. 29021-29041, 2019/10/01 2019, doi: 10.1007/s11042-018-6295-8.
[5] R. Radhakrishnan, A. Divakaran, and A. Smaragdis, "Audio analysis for surveillance applications," in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005., 2005: IEEE, pp. 158-161.
[6] M. Crocco, M. Cristani, A. Trucco, and V. Murino, "Audio surveillance: A systematic review," ACM Computing Surveys (CSUR), vol. 48, no. 4, pp. 1-46, 2016.
[7] P. Laffitte, Y. Wang, D. Sodoyer, and L. Girin, "Assessing the performances of different neural network architectures for the detection of screams and shouts in public transportation," Expert systems with applications, vol. 117, pp. 29-41, 2019.
[8] A. J. Eronen et al., "Audio-based context recognition," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 1, pp. 321-329, 2005.
[9] M. Xu, C. Xu, L. Duan, J. S. Jin, and S. Luo, "Audio keywords generation for sports video analysis," ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 4, no. 2, pp. 1-23, 2008.
[10] D. Barchiesi, D. Giannoulis, D. Stowell, and M. D. Plumbley, "Acoustic scene classification: Classifying environments from the sounds they produce," IEEE Signal Processing Magazine, vol. 32, no. 3, pp. 16-34, 2015.
[11] R. F. Lyon, "Machine Hearing: An Emerging Field [Exploratory DSP]," IEEE Signal Processing Magazine, vol. 27, no. 5, pp. 131-139, 2010, doi: 10.1109/MSP.2010.937498.
[12] M. Green and D. Murphy, "Environmental sound monitoring using machine learning on mobile devices," Applied Acoustics, vol. 159, p. 107041, 2020.
[13] P. Intani and T. Orachon, "Crime warning system using image and sound processing," in 2013 13th International Conference on Control, Automation and Systems (ICCAS 2013), 2013: IEEE, pp. 1751-1753.
[14] V. P. Romero, L. Maffei, G. Brambilla, and G. Ciaburro, "Modelling the soundscape quality of urban waterfronts by artificial neural networks," Applied Acoustics, vol. 111, pp. 121-128, 2016.
[15] A. Agha, R. Ranjan, and W.-S. Gan, "Noisy vehicle surveillance camera: A system to deter noisy vehicle in smart city," Applied Acoustics, vol. 117, pp. 236-245, 2017.
[16] K. J. Piczak, "ESC: Dataset for environmental sound classification," in Proceedings of the 23rd ACM international conference on Multimedia, 2015, pp. 1015-1018.
[17] J. Salamon, C. Jacoby, and J. P. Bello, "A dataset and taxonomy for urban sound research," in Proceedings of the 22nd ACM international conference on Multimedia, 2014, pp. 1041-1044.
[18] V. Bisot, R. Serizel, S. Essid, and G. Richard, "Feature learning with matrix factorization applied to acoustic scene classification," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 6, pp. 1216-1229, 2017.
[19] D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, "Detection and classification of acoustic scenes and events," IEEE Transactions on Multimedia, vol. 17, no. 10, pp. 1733-1746, 2015.
[20] P. Dhanalakshmi, S. Palanivel, and V. Ramalingam, "Classification of audio signals using AANN and GMM," Applied soft computing, vol. 11, no. 1, pp. 716-723, 2011.
[21] J. Ludena-Choez and A. Gallardo-Antolin, "Acoustic Event Classification using spectral band selection and Non-Negative Matrix Factorization-based features," Expert Systems with Applications, vol. 46, pp. 77-86, 2016.
[22] J. Salamon and J. P. Bello, "Unsupervised feature learning for urban sound classification," in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015: IEEE, pp. 171-175.
[23] J. T. Geiger and K. Helwani, "Improving event detection for audio surveillance using gabor filterbank features," in 2015 23rd European Signal Processing Conference (EUSIPCO), 2015: IEEE, pp. 714-718.
[24] M. Mulimani and S. G. Koolagudi, "Segmentation and characterization of acoustic event spectrograms using singular value decomposition," Expert Systems with Applications, vol. 120, pp. 413-425, 2019.
[25] J. Xie and M. Zhu, "Investigation of acoustic and visual features for acoustic scene classification," Expert Systems with Applications, vol. 126, pp. 20-29, 2019.
[26] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in neural information processing systems, vol. 25, 2012.
[27] J. Deng, A. Berg, S. Satheesh, H. Su, A. Khosla, and L. Fei-Fei, "Imagenet large scale visual recognition competition 2012 (ILSVRC2012)," See net. org/challenges/LSVRC, vol. 41, p. 6, 2012.
[28] K. J. Piczak, "Environmental sound classification with convolutional neural networks," in 2015 IEEE 25th international workshop on machine learning for signal processing (MLSP), 2015: IEEE, pp. 1-6.
[29] J. Salamon and J. P. Bello, "Deep convolutional neural networks and data augmentation for environmental sound classification," IEEE Signal processing letters, vol. 24, no. 3, pp. 279-283, 2017.
[30] N. Takahashi, M. Gygli, B. Pfister, and L. Van Gool, "Deep convolutional neural networks and data augmentation for acoustic event detection," arXiv preprint arXiv:1604.07160, 2016.
[31] Y. Tokozume, Y. Ushiku, and T. Harada, "Learning from between-class examples for deep sound recognition," arXiv preprint arXiv:1711.10282, 2017.
[32] V. Boddapati, A. Petef, J. Rasmusson, and L. Lundberg, "Classifying environmental sounds using image recognition networks," Procedia computer science, vol. 112, pp. 2048-2056, 2017.
[33] Z. Mushtaq and S.-F. Su, "Environmental sound classification using a regularized deep convolutional neural network with data augmentation," Applied Acoustics, vol. 167, p. 107389, 2020.
[34] Z. Mushtaq, S.-F. Su, and Q.-V. Tran, "Spectral images based environmental sound classification using CNN with meaningful data augmentation," Applied Acoustics, vol. 172, p. 107581, 2021.
[35] Y. Chen, Q. Guo, X. Liang, J. Wang, and Y. Qian, "Environmental sound classification with dilated convolutions," Applied Acoustics, vol. 148, pp. 123-132, 2019.
[36] S. Abdoli, P. Cardinal, and A. L. Koerich, "End-to-end environmental sound classification using a 1D convolutional neural network," Expert Systems with Applications, vol. 136, pp. 252-263, 2019.
[37] F. Medhat, D. Chesmore, and J. Robinson, "Masked conditional neural networks for sound classification," Applied Soft Computing, vol. 90, p. 106073, 2020.
[38] H. Knebl, "Algorithms and data structures," Cham: Springer Nature Switzerland AG, 2020.
[39] F. AS and T. B. Antony, "Spectral graph theory," 2023.
[40] F. Zhou, Q. Yang, T. Zhong, D. Chen, and N. Zhang, "Variational graph neural networks for road traffic prediction in intelligent transportation systems," IEEE Transactions on Industrial Informatics, vol. 17, no. 4, pp. 2802-2812, 2020.
[41] S. Casale-Brunet, P. Ribeca, P. Doyle, and M. Mattavelli, "Networks of Ethereum Non-Fungible Tokens: A graph-based analysis of the ERC-721 ecosystem," in 2021 IEEE International Conference on Blockchain (Blockchain), 2021: IEEE, pp. 188-195.
[42] J. Chang et al., "Sequential recommendation with graph neural networks," in Proceedings of the 44th international ACM SIGIR conference on research and development in information retrieval, 2021, pp. 378-387.
[43] X. Liu, J. Chen, and Q. Wen, "A Survey on Graph Classification and Link Prediction based on GNN," arXiv preprint arXiv:2307.00865, 2023.
[44] N. N. Daud, S. H. Ab Hamid, M. Saadoon, F. Sahran, and N. B. Anuar, "Applications of link prediction in social networks: A review," Journal of Network and Computer Applications, vol. 166, p. 102716, 2020.
[45] S. Xiao, S. Wang, Y. Dai, and W. Guo, "Graph neural networks in node classification: survey and evaluation," Machine Vision and Applications, vol. 33, no. 1, p. 4, 2022.
[46] K. Shah, H. Patel, D. Sanghvi, and M. Shah, "A comparative analysis of logistic regression, random forest and KNN models for the text classification," Augmented Human Research, vol. 5, no. 1, p. 12, 2020.
[47] W. Lotter, G. Kreiman, and D. Cox, "A neural network trained for prediction mimics diverse features of biological neurons and perception," Nature machine intelligence, vol. 2, no. 4, pp. 210-219, 2020.
[48] F. Zhou and Y. Chai, "Near-sensor and in-sensor computing," Nature Electronics, vol. 3, no. 11, pp. 664-671, 2020.
[49] T. P. Lillicrap, A. Santoro, L. Marris, C. J. Akerman, and G. Hinton, "Backpropagation and the brain," Nature Reviews Neuroscience, vol. 21, no. 6, pp. 335-346, 2020.
[50] D. Bhatt et al., "CNN variants for computer vision: History, architecture, application, challenges and future scope," Electronics, vol. 10, no. 20, p. 2470, 2021.
[51] F. Emmert-Streib, Z. Yang, H. Feng, S. Tripathi, and M. Dehmer, "An introductory review of deep learning for prediction models with big data," Frontiers in Artificial Intelligence, vol. 3, p. 4, 2020.
[52] A. Egri-Nagy and A. Törmänen, "The game is not over yet—go in the post-alphago era," Philosophies, vol. 5, no. 4, p. 37, 2020.
[53] L. Floridi and M. Chiriatti, "GPT-3: Its nature, scope, limits, and consequences," Minds and Machines, vol. 30, pp. 681-694, 2020.
[54] Z. Liu and J. Zhou, Introduction to graph neural networks. Springer Nature, 2022.
[55] W. Fan et al., "A graph neural network framework for social recommendations," IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 5, pp. 2033-2047, 2020.
[56] W. Jiang and J. Luo, "Graph neural network for traffic forecasting: A survey," Expert Systems with Applications, vol. 207, p. 117921, 2022.
[57] S. Wu, F. Sun, W. Zhang, X. Xie, and B. Cui, "Graph neural networks in recommender systems: a survey," ACM Computing Surveys, vol. 55, no. 5, pp. 1-37, 2022.
[58] I. Ullah, M. Manzo, M. Shah, and M. G. Madden, "Graph convolutional networks: analysis, improvements and results," Applied Intelligence, pp. 1-12, 2022.
[59] T. N. Kipf and M. Welling, "Semi-supervised classification with graph convolutional networks," arXiv preprint arXiv:1609.02907, 2016.
[60] S. Zhang, H. Tong, J. Xu, and R. Maciejewski, "Graph convolutional networks: a comprehensive review," Computational Social Networks, vol. 6, no. 1, pp. 1-23, 2019.
[61] M. M. Bronstein, J. Bruna, Y. LeCun, A. Szlam, and P. Vandergheynst, "Geometric deep learning: going beyond euclidean data," IEEE Signal Processing Magazine, vol. 34, no. 4, pp. 18-42, 2017.
[62] M. Gori, G. Monfardini, and F. Scarselli, "A new model for learning in graph domains," in Proceedings. 2005 IEEE international joint conference on neural networks, 2005., 2005, vol. 2: IEEE, pp. 729-734.
[63] F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner, and G. Monfardini, "The graph neural network model," IEEE transactions on neural networks, vol. 20, no. 1, pp. 61-80, 2008.
[64] C. Gallicchio and A. Micheli, "Graph echo state networks," in The 2010 international joint conference on neural networks (IJCNN), 2010: IEEE, pp. 1-8.
[65] Z. Wu, S. Pan, F. Chen, G. Long, C. Zhang, and S. Y. Philip, "A comprehensive survey on graph neural networks," IEEE transactions on neural networks and learning systems, vol. 32, no. 1, pp. 4-24, 2020.
[66] L. Zhao et al., "T-GCN: A temporal graph convolutional network for traffic prediction," IEEE transactions on intelligent transportation systems, vol. 21, no. 9, pp. 3848-3858, 2019.
[67] W.-L. Chiang, X. Liu, S. Si, Y. Li, S. Bengio, and C.-J. Hsieh, "Cluster-gcn: An efficient algorithm for training deep and large graph convolutional networks," in Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, 2019, pp. 257-266.
[68] S. Vashishth, S. Sanyal, V. Nitin, and P. Talukdar, "Composition-based multi-relational graph convolutional networks," arXiv preprint arXiv:1911.03082, 2019.
[69] M. Chen, Z. Wei, Z. Huang, B. Ding, and Y. Li, "Simple and deep graph convolutional networks," in International conference on machine learning, 2020: PMLR, pp. 1725-1735.
[70] J. You, B. Liu, Z. Ying, V. Pande, and J. Leskovec, "Graph convolutional policy network for goal-directed molecular graph generation," Advances in neural information processing systems, vol. 31, 2018.
[71] D. Wang, P. Cui, and W. Zhu, "Structural deep network embedding," in Proceedings of the 22nd ACM SIGKDD international conference on Knowledge discovery and data mining, 2016, pp. 1225-1234.
[72] J. Qiu, J. Tang, H. Ma, Y. Dong, K. Wang, and J. Tang, "Deepinf: Social influence prediction with deep learning," in Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, 2018, pp. 2110-2119.
[73] H. Wan, Y. Luo, B. Peng, and W.-S. Zheng, "Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding," in IJCAI, 2018: Stockholm, Sweden, pp. 949-956.
[74] J. Yang, J. Lu, S. Lee, D. Batra, and D. Parikh, "Graph r-cnn for scene graph generation," in Proceedings of the European conference on computer vision (ECCV), 2018, pp. 670-685.
[75] C. He et al., "Bipartite graph neural networks for efficient node representation learning," arXiv preprint arXiv:1906.11994, vol. 120, 2019.
[76] A. T. Duchowski, "A breadth-first survey of eye-tracking applications," Behavior Research Methods, Instruments, & Computers, vol. 34, no. 4, pp. 455-470, 2002.
[77] R. L. Solso, Cognition and the visual arts. MIT press, 1994.
[78] M. Resnick and W. Albert, "The impact of advertising location and user task on the emergence of banner ad blindness: An eye-tracking study," International Journal of Human-Computer Interaction, vol. 30, no. 3, pp. 206-219, 2014.
[79] I. Portugal, P. Alencar, and D. Cowan, "The use of machine learning algorithms in recommender systems: A systematic review," Expert Systems with Applications, vol. 97, pp. 205-227, 2018.
[80] M. Krol and M. Krol, "A novel approach to studying strategic decisions with eye-tracking and machine learning," Judgment and Decision Making, vol. 12, no. 6, p. 596, 2017.
[81] S.-I. Amari, "Natural gradient works efficiently in learning," Neural computation, vol. 10, no. 2, pp. 251-276, 1998.
[82] S. Ruder, "An overview of gradient descent optimization algorithms," arXiv preprint arXiv:1609.04747, 2016.
[83] X. Huang, Q. Song, Y. Li, and X. Hu, "Graph recurrent networks with attributed random walks," in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019, pp. 732-740.
[84] C. Deng, Z. Zhao, Y. Wang, Z. Zhang, and Z. Feng, "Graphzoom: A multi-level spectral approach for accurate and scalable graph embedding," arXiv preprint arXiv:1910.02370, 2019.
[85] X. Wang, M. Zhu, D. Bo, P. Cui, C. Shi, and J. Pei, "Am-gcn: Adaptive multi-channel graph convolutional networks," in Proceedings of the 26th ACM SIGKDD International conference on knowledge discovery & data mining, 2020, pp. 1243-1253.
[86] Y. Jiang, H. Lin, Y. Li, Y. Rong, H. Cheng, and X. Huang, "Exploiting node-feature bipartite graph in graph convolutional networks," Information Sciences, vol. 628, pp. 409-423, 2023.
[87] W.-C. Yeh, Y.-P. Lin, Y.-C. Liang, C.-M. Lai, and C.-L. Huang, "Simplified swarm optimization for hyperparameters of convolutional neural networks," Computers & Industrial Engineering, vol. 177, p. 109076, 2023.
[88] D. Dev, Deep learning with hadoop. Packt Publishing Ltd, 2017.
[89] N. Kumar, J. Mayank, and A. Mondal, "Reliability aware energy optimized scheduling of non-preemptive periodic real-time tasks on heterogeneous multiprocessor system," IEEE Transactions on Parallel and Distributed Systems, vol. 31, no. 4, pp. 871-885, 2019.
[90] K. Huang et al., "Expected energy optimization for real-time multiprocessor SoCs running periodic tasks with uncertain execution time," IEEE Transactions on Sustainable Computing, 2018.
[91] D. Konar, S. Bhattacharyya, K. Sharma, S. Sharma, and S. R. Pradhan, "An improved Hybrid Quantum-Inspired Genetic Algorithm (HQIGA) for scheduling of real-time task in multiprocessor system," Applied Soft Computing, vol. 53, pp. 296-307, 2017.
[92] G. Manimaran and C. S. R. Murthy, "An efficient dynamic scheduling algorithm for multiprocessor real-time systems," IEEE Transactions on Parallel and Distributed Systems, vol. 9, no. 3, pp. 312-319, 1998.
[93] İ. Özkan and E. Ülker, "Derin öğrenme ve görüntü analizinde kullanılan derin öğrenme modelleri," Gaziosmanpaşa Bilimsel Araştırma Dergisi, vol. 6, no. 3, pp. 85-104, 2017.
[94] K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
[95] S. Li, Y. Yao, J. Hu, G. Liu, X. Yao, and J. Hu, "An ensemble stacked convolutional neural network model for environmental event sound recognition," Applied Sciences, vol. 8, no. 7, p. 1152, 2018.
[96] Y. Su, K. Zhang, J. Wang, and K. Madani, "Environment sound classification using a two-stream CNN based on decision-level fusion," Sensors, vol. 19, no. 7, p. 1733, 2019.
[97] X. Zhang, Y. Zou, and W. Shi, "Dilated convolution neural network with LeakyReLU for environmental sound classification," in 2017 22nd international conference on digital signal processing (DSP), 2017: Ieee, pp. 1-5.
[98] M. Lim et al., "Convolutional Neural Network based Audio Event Classification," KSII Transactions on Internet & Information Systems, vol. 12, no. 6, 2018.
[99] E. Akbal, "An automated environmental sound classification methods based on statistical and textural feature," Applied Acoustics, vol. 167, p. 107413, 2020.
[100] A. M. Tripathi and A. Mishra, "Adv-ESC: Adversarial attack datasets for an environmental sound classification," Applied Acoustics, vol. 185, p. 108437, 2022.
[101] T. Tuncer, A. Subasi, F. Ertam, and S. Dogan, "A novel spiral pattern and 2D M4 pooling based environmental sound classification method," Applied Acoustics, vol. 170, p. 107508, 2020.
[102] Ö. İnik, "CNN hyper-parameter optimization for environmental sound classification," Applied Acoustics, vol. 202, p. 109168, 2023/01/01/ 2023, doi: https://doi.org/10.1016/j.apacoust.2022.109168.
[103] B. Ma, X. Li, Y. Xia, and Y. Zhang, "Autonomous deep learning: A genetic DCNN designer for image classification," Neurocomputing, vol. 379, pp. 152-161, 2020.
[104] C. B. Gonçalves, J. R. Souza, and H. Fernandes, "CNN architecture optimization using bio-inspired algorithms for breast cancer detection in infrared images," Computers in Biology and Medicine, vol. 142, p. 105205, 2022/03/01/ 2022, doi: https://doi.org/10.1016/j.compbiomed.2021.105205.
[105] P. Singh, S. Chaudhury, and B. K. Panigrahi, "Hybrid MPSO-CNN: Multi-level Particle Swarm optimized hyperparameters of Convolutional Neural Network," Swarm and Evolutionary Computation, vol. 63, p. 100863, 2021/06/01/ 2021, doi: https://doi.org/10.1016/j.swevo.2021.100863.
[106] Z. Zhang, S. Xu, S. Zhang, T. Qiao, and S. Cao, "Attention based convolutional recurrent neural network for environmental sound classification," Neurocomputing, vol. 453, pp. 896-903, 2021/09/17/ 2021, doi: https://doi.org/10.1016/j.neucom.2020.08.069.
[107] R. Ying, R. He, K. Chen, P. Eksombatchai, W. L. Hamilton, and J. Leskovec, "Graph convolutional neural networks for web-scale recommender systems," in Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining, 2018, pp. 974-983.
[108] J. Bruna, W. Zaremba, A. Szlam, and Y. LeCun, "Spectral networks and locally connected networks on graphs," arXiv preprint arXiv:1312.6203, 2013.
[109] M. Niepert, M. Ahmed, and K. Kutzkov, "Learning Convolutional Neural Networks for Graphs," in 33rd International Conference on Machine Learning, New York, NY, Jun 20-22 2016, vol. 48, SAN DIEGO: Jmlr-Journal Machine Learning Research, in Proceedings of Machine Learning Research, 2016. [Online]. Available: <Go to ISI>://WOS:000684193702011. [Online]. Available: <Go to ISI>://WOS:000684193702011
[110] M. Defferrard, X. Bresson, and P. Vandergheynst, "Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering," in 30th Conference on Neural Information Processing Systems (NIPS), Barcelona, SPAIN, 2016, vol. 29, LA JOLLA: Neural Information Processing Systems (Nips), in Advances in Neural Information Processing Systems, 2016. [Online]. Available: <Go to ISI>://WOS:000458973700045. [Online]. Available: <Go to ISI>://WOS:000458973700045
[111] K. Y. L. Xu, C. T. Li, Y. L. Tian, T. Sonobe, K. Kawarabayashi, and S. Jegelka, "Representation Learning on Graphs with Jumping Knowledge Networks," in 35th International Conference on Machine Learning (ICML), Stockholm, SWEDEN, Jul 10-15 2018, vol. 80, SAN DIEGO: Jmlr-Journal Machine Learning Research, in Proceedings of Machine Learning Research, 2018. [Online]. Available: <Go to ISI>://WOS:000683379205060. [Online]. Available: <Go to ISI>://WOS:000683379205060
[112] G. H. Li, M. Müller, A. Thabet, B. Ghanem, and Ieee, "DeepGCNs: Can GCNs Go as Deep as CNNs?," in IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, SOUTH KOREA, Oct 27-Nov 02 2019, NEW YORK: Ieee, in IEEE International Conference on Computer Vision, 2019, pp. 9266-9275, doi: 10.1109/iccv.2019.00936. [Online]. Available: <Go to ISI>://WOS:000548549204039
[113] X. Wang, X. N. He, Y. X. Cao, M. Liu, T. S. Chua, and M. Assoc Comp, "KGAT: Knowledge Graph Attention Network for Recommendation," in 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD), Anchorage, AK, Aug 04-08 2019, NEW YORK: Assoc Computing Machinery, 2019, pp. 950-958, doi: 10.1145/3292500.3330989. [Online]. Available: <Go to ISI>://WOS:000485562500097
[114] Y. Wang, C. F. Jing, S. S. Xu, and T. Guo, "Attention based spatiotemporal graph attention networks for traffic flow forecasting," (in English), Information Sciences, vol. 607, pp. 869-883, Aug 2022, doi: 10.1016/j.ins.2022.05.127.
[115] C. Zhuang and Q. Ma, "Dual graph convolutional networks for graph-based semi-supervised classification," in Proceedings of the 2018 world wide web conference, 2018, pp. 499-508.
[116] B. Jiang, Z. Y. Zhang, D. D. Lin, J. Tang, B. Luo, and I. C. Soc, "Semi-supervised Learning with Graph Learning-Convolutional Networks," in 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, Jun 16-20 2019, NEW YORK: Ieee, in IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 11305-11312, doi: 10.1109/cvpr.2019.01157. [Online]. Available: <Go to ISI>://WOS:000542649304093
[117] K. Hassani and A. H. Khasahmadi, "Contrastive Multi-View Representation Learning on Graphs," in International Conference on Machine Learning (ICML), Electr Network, Jul 13-18 2020, vol. 119, SAN DIEGO: Jmlr-Journal Machine Learning Research, in Proceedings of Machine Learning Research, 2020. [Online]. Available: <Go to ISI>://WOS:000683178504022. [Online]. Available: <Go to ISI>://WOS:000683178504022
[118] X. Wang et al., "AM-GCN: Adaptive Multi-channel Graph Convolutional Networks," in 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), Electr Network, Aug 23-27 2020, NEW YORK: Assoc Computing Machinery, 2020, pp. 1243-1253, doi: 10.1145/3394486.3403177. [Online]. Available: <Go to ISI>://WOS:000749552301025
[119] C. Y. He et al., "Cascade-BGNN: Toward Efficient Self-supervised Representation Learning on Large-scale Bipartite Graphs," (in English), Arxiv, preprint 2020, doi: arXiv:1906.11994.
[120] J. N. Sun et al., "Multi-Graph Convolution Collaborative Filtering," in 19th IEEE International Conference on Data Mining (ICDM), Beijing, PEOPLES R CHINA, Nov 08-11 2019, NEW YORK: Ieee, in IEEE International Conference on Data Mining, 2019, pp. 1306-1311, doi: 10.1109/icdm.2019.00165. [Online]. Available: <Go to ISI>://WOS:000555729900156
[121] Y. H. Liu et al., "Cross-view Correspondence Reasoning based on Bipartite Graph Convolutional Network for Mammogram Mass Detection," in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Electr Network, Jun 14-19 2020, LOS ALAMITOS: Ieee Computer Soc, in IEEE Conference on Computer Vision and Pattern Recognition, 2020, pp. 3811-3821, doi: 10.1109/cvpr42600.2020.00387. [Online]. Available: <Go to ISI>://WOS:000620679504009
[122] Z. C. Wang, M. Zhou, and C. Arnold, "Toward heterogeneous information fusion: bipartite graph convolutional networks for <i>in silico</i> drug repurposing," (in English), Bioinformatics, ; Proceedings Paper vol. 36, pp. 525-533, Jul 2020, doi: 10.1093/bioinformatics/btaa437.
[123] R. D. Jin, T. Q. Xia, X. Liu, T. Murata, and K. S. Kim, "Predicting Emergency Medical Service Demand With Bipartite Graph Convolutional Networks," (in English), IEEE Access, vol. 9, pp. 9903-9915, 2021, doi: 10.1109/access.2021.3050607.
[124] P. F. Jiao, M. H. Tang, H. T. Liu, Y. P. Wang, C. Y. Lu, and H. M. Wu, "Variational autoencoder based bipartite network embedding by integrating local and global structure," (in English), Information Sciences, vol. 519, pp. 9-21, May 2020, doi: 10.1016/j.ins.2020.01.033.
[125] L. Katus et al., "Implementing neuroimaging and eye tracking methods to assess neurocognitive development of young infants in low-and middle-income countries," Gates Open Research, vol. 3, 2019.
[126] S. P. Devlin, N. L. Brown, S. Drollinger, C. Sibley, J. Alami, and S. L. Riggs, "Scan-based eye tracking measures are predictive of workload transition performance," Applied ergonomics, vol. 105, p. 103829, 2022.
[127] A. Moran, M. Campbell, and D. Ranieri, "Implications of eye tracking technology for applied sport psychology," Journal of Sport Psychology in Action, vol. 9, no. 4, pp. 249-259, 2018.
[128] M. Kuhar and T. Merčun, "Exploring user experience in digital libraries through questionnaire and eye-tracking data," Library & Information Science Research, vol. 44, no. 3, p. 101175, 2022.
[129] M. L. Mele and S. Federici, "Gaze and eye-tracking solutions for psychological research," Cognitive processing, vol. 13, pp. 261-265, 2012.
[130] M. Nückles, "Investigating visual perception in teaching and learning with advanced eye-tracking methodologies: Rewards and challenges of an innovative research paradigm," Educational Psychology Review, vol. 33, no. 1, pp. 149-167, 2021.
[131] P. Majaranta and A. Bulling, "Eye tracking and eye-based human–computer interaction," in Advances in physiological computing: Springer, 2014, pp. 39-65.
[132] Y. Mashhadi Hashem Marandi and H. Sajedi, "Intelligent Web Advertisement Based on Eye-Tracking and Machine Learning," International Journal of Web Research, vol. 3, no. 2, pp. 31-44, 2020.
[133] Q. Wei, H. Cao, Y. Shi, X. Xu, and T. Li, "Machine learning based on eye-tracking data to identify Autism Spectrum Disorder: A systematic review and meta-analysis," Journal of biomedical informatics, vol. 137, p. 104254, 2023.
[134] C. Schweikert, L. Gobin, S. Xie, S. Shimojo, and D. Frank Hsu, "Preference prediction based on eye movement using Multi-layer Combinatorial Fusion," in Brain Informatics: International Conference, BI 2018, Arlington, TX, USA, December 7–9, 2018, Proceedings 11, 2018: Springer, pp. 282-293.
[135] E. Tuba, N. Bačanin, I. Strumberger, and M. Tuba, "Convolutional neural networks hyperparameters tuning," in Artificial intelligence: theory and applications: Springer, 2021, pp. 65-84.
[136] W.-C. Yeh, "A two-stage discrete particle swarm optimization for the problem of multiple multi-level redundancy allocation in series systems," Expert Systems with Applications, vol. 36, no. 5, pp. 9192-9200, 2009.
[137] R. Poli, J. Kennedy, and T. Blackwell, "Particle swarm optimization," Swarm intelligence, vol. 1, no. 1, pp. 33-57, 2007.
[138] W.-C. Yeh, "Orthogonal simplified swarm optimization for the series–parallel redundancy allocation problem with a mix of components," Knowledge-Based Systems, vol. 64, pp. 1-12, 2014.
[139] F. Dusberger and G. R. Raidl, "Solving the 3-staged 2-dimensional cutting stock problem by dynamic programming and variable neighborhood search," Electronic Notes in Discrete Mathematics, vol. 47, pp. 133-140, 2015.
[140] Q. Chen, Y. Cui, and Y. Chen, "Sequential value correction heuristic for the two-dimensional cutting stock problem with three-staged homogenous patterns," Optimization Methods and Software, vol. 31, no. 1, pp. 68-87, 2016.
[141] W.-C. Yeh, W.-W. Chang, and Y. Y. Chung, "A new hybrid approach for mining breast cancer pattern using discrete particle swarm optimization and statistical method," Expert Systems with Applications, vol. 36, no. 4, pp. 8204-8211, 2009.
[142] W.-C. Yeh, "New parameter-free simplified swarm optimization for artificial neural network training and its application in the prediction of time series," IEEE Transactions on Neural Networks and Learning Systems, vol. 24, no. 4, pp. 661-665, 2013.
[143] W.-C. Yeh, Y.-P. Lin, Y.-C. Liang, and C.-M. Lai, "Convolution Neural Network Hyperparameter Optimization Using Simplified Swarm Optimization," arXiv preprint arXiv:2103.03995, 2021.
[144] P.-C. Su, S.-Y. Tan, Z. Liu, and W.-C. Yeh, "A Mixed-Heuristic Quantum-Inspired Simplified Swarm Optimization Algorithm for scheduling of real-time tasks in the multiprocessor system," Applied Soft Computing, vol. 131, p. 109807, 2022.
[145] Y. Jiang, Z. Liu, J.-H. Chen, W.-C. Yeh, and C.-L. Huang, "A novel binary-addition simplified swarm optimization for generalized reliability redundancy allocation problem," Journal of Computational Design and Engineering, vol. 10, no. 2, pp. 758-772, 2023.
[146] J. Kennedy and R. Eberhart, "Particle swarm optimization," in Proceedings of ICNN'95-international conference on neural networks, 1995, vol. 4: ieee, pp. 1942-1948.
[147] C.-L. Huang, "A particle-based simplified swarm optimization algorithm for reliability redundancy allocation problems," Reliability Engineering & System Safety, vol. 142, pp. 221-230, 2015.
[148] Z. Liu, L.-M. Hu, and W.-C. Yeh, "Risk-averse two-stage stochastic programming-based closed-loop supply chain network design under uncertain demand," Applied Soft Computing, vol. 147, p. 110743, 2023/11/01/ 2023, doi: https://doi.org/10.1016/j.asoc.2023.110743.
[149] Z. Liu and W.-C. Yeh, "Simplified swarm optimisation for CNN hyperparameters: a sound classification approach," International Journal of Web and Grid Services, vol. 20, no. 1, pp. 93-113, 2024.
[150] W.-C. Yeh, "An improved simplified swarm optimization," Knowledge-Based Systems, vol. 82, pp. 60-69, 2015.
[151] D. Karaboga and B. Akay, "A comparative study of artificial bee colony algorithm," Applied mathematics and computation, vol. 214, no. 1, pp. 108-132, 2009.
[152] C.-M. Lai, C.-C. Chiu, Y.-C. Shih, and H.-P. Huang, "A hybrid feature selection algorithm using simplified swarm optimization for body fat prediction," Computer Methods and Programs in Biomedicine, vol. 226, p. 107183, 2022.
[153] X. Zhang, W.-c. Yeh, Y. Jiang, Y. Huang, Y. Xiao, and L. Li, "A case study of control and improved simplified swarm optimization for economic dispatch of a stand-alone modular microgrid," Energies, vol. 11, no. 4, p. 793, 2018.
[154] C.-M. Lai, W.-C. Yeh, and C.-Y. Chang, "Gene selection using information gain and improved simplified swarm optimization," Neurocomputing, vol. 218, pp. 331-338, 2016.
[155] Z. K. Peng and F. L. Chu, "Application of the wavelet transform in machine condition monitoring and fault diagnostics: a review with bibliography," Mechanical Systems and Signal Processing, vol. 18, no. 2, pp. 199-221, 2004/03/01/ 2004, doi: https://doi.org/10.1016/S0888-3270(03)00075-X.
[156] A. K. McCallum, K. Nigam, J. Rennie, and K. Seymore, "Automating the construction of internet portals with machine learning," Information Retrieval, vol. 3, pp. 127-163, 2000.
[157] X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks," in Proceedings of the thirteenth international conference on artificial intelligence and statistics, 2010: JMLR Workshop and Conference Proceedings, pp. 249-256.
[158] W. Liu, Y. Cao, and R. W. Proctor, "How do app icon color and border shape influence visual search efficiency and user experience? Evidence from an eye-tracking study," International Journal of Industrial Ergonomics, vol. 84, p. 103160, 2021.
[159] M. Lee and R. J. Faber, "Effects of product placement in on-line games on brand memory: A perspective of the limited-capacity model of attention," Journal of advertising, vol. 36, no. 4, pp. 75-90, 2007.
[160] K.-C. Hamborg, M. Bruns, F. Ollermann, and K. Kaspar, "The effect of banner animation on fixation behavior and recall performance in search tasks," Computers in Human Behavior, vol. 28, no. 2, pp. 576-582, 2012.
[161] M. Koester, M. Rüth, K. C. Hamborg, and K. Kaspar, "Effects of personalized banner ads on visual attention and recognition memory," Applied Cognitive Psychology, vol. 29, no. 2, pp. 181-192, 2015.