研究生: |
劉寧漢 Ning-Han Liu |
---|---|
論文名稱: |
複音音樂資料庫之近似搜尋 Similarity Search in Polyphonic Music Database |
指導教授: |
陳良弼
Arbee L.P. Chen |
口試委員: | |
學位類別: |
博士 Doctor |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2005 |
畢業學年度: | 93 |
語文別: | 英文 |
論文頁數: | 101 |
中文關鍵詞: | 多音音樂 、內涵式音樂查詢 、索引及查詢處理 、音樂分類 、重複樣型 、近似重覆樣型 |
外文關鍵詞: | Polyphonic Music, Content-based Music Retrieval, kNN Search, Index and Query Processing, Music Classification, Repeating Pattern |
相關次數: | 點閱:4 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在本論文中,我們首先討論從多音資料庫中取回資訊時所常用的技術,並提出一個新的近似搜尋的方法。從一個巨大的音樂資料庫中查詢多音音樂是一個很有趣的問題,近來,很多的研究者嘗試提供能在多音資料庫有效率地依據內涵來查詢音樂的方法。但是大部份在近似搜尋時均無法順利進行。在本論文中,我們提出了三種多音音樂的表示法及對應的近似量測法,此外還提出一個能取回k首包含與查詢最近似之片段的樂曲。由於我們的近似量測是根據修訂距離,我們提出的方法先在查詢的音樂使用刪除及替換兩種運算來產生部份可能的結果之樣式,再借由串列形態之索引結構找出解答並提供額外的資訊以求得修訂距離,最後經由比較已知查詢與可能結果之修訂距離來加速查詢處理的速度。
隨著網路及多媒體的發展,越來越多的音樂可以在網際網路找到,因良好的分類可增加查詢結果的正確率,使得自動音樂分類成為重要的研究課題。在本論文中,我們提出依據內涵來自動分類音樂。在此方法中,我們使用樂曲中的重複樣型用來代表音樂的特色。我們首先量測各個重複樣型在各類別音樂中的重要程度,並計算需分類樂曲之重複樣型與已分類之重複樣型之距離及重要程度,最後依據各類別之分數高低決定該樂曲所屬類別。
從音樂擷取樣型是一個很有趣的問題,擷取出的樣型可用來作為音樂查詢或音樂分析的特徵。以往的研究大都專注於重覆樣型的擷取,但實際上音樂中重複的片段是會有變化的。在此論文的第三部份,我們提出一個新的方法能快速的找出所有的近似重覆樣型。在此方法中,先產生所有的音樂片段並對應至向量空間的位置,經由索引結構及範圍搜尋法預判可能的重複數量以決定是否可能為答案。
最後經由實驗的驗證,在此論文中所提出的三個方法在多音音樂查詢、音樂分類及樣型擷取均優於傳統的方法。
In this thesis, we first discuss the techniques used in polyphonic music information retrieval and propose a novel technique of similarity search. Querying polyphonic music from a large data collection is an interesting topic. Recently, researchers attempt to provide efficient methods for content-based retrieval in polyphonic music databases where queries are polyphonic. However, most of them do not work well for similarity search, which is important to many applications. In this thesis, we propose three polyphonic representations with the associated similarity measures and a novel method to retrieve k music works that contain segments most similar to the query. In general, most of the index-based methods for similarity search generate all the possible answers to the query and then perform exact matching on the index for each possible answer. Based on the edit distance, our method generates only a few possible answers by performing the deletion or replacement operations on the query. Each possible answer is then used to perform exact matching on a list-based index, which allows the insertion operation to be performed. For each possible answer, its edit distance to the query is regarded as a lower bound of the edit distances between the matched results and the query. Based on the kNN results that match a possible answer, the possible answers that cannot provide better results are skipped. By using this mechanism, we design a method for efficient kNN search in polyphonic music databases.
With the popularity of multimedia applications, a large amount of music data has been accumulated on the Internet. Automatic classification of music data becomes a critical technique for providing an efficient and effective retrieval of music data. In this thesis, we propose a new approach for classifying music data based on their contents. In this approach, we focus on music features represented as rhythmic and melodic sequences. Moreover, we use repeating patterns of music data to do music classification. For each pattern discovered from a group of music data, we employ a series of measurements to estimate its usefulness for classifying this group of music data. According to the patterns contained in a music piece, we determine which class it should be assigned to.
Pattern extraction from music strings is an important problem. The patterns extracted from music strings can be used as features for music retrieval or analysis. Previous works on music pattern extraction only focus on exact repeating patterns. However, music segments with minor differences may sound similar. The concept of the prototypical melody has therefore been proposed to represent these similar music segments. In musicology, the number of music segments that are similar to a prototypical melody implies the importance degree of the prototypical melody to the music work. In this thesis, a novel approach is developed to extract all the prototypical melodies in a music work. Our approach considers each music segment as a candidate for the prototypical melody and uses the edit distance to determine the set of music segments that are similar to this candidate. A lower bounding mechanism, which estimates the number of similar music segments for each candidate and prunes the impossible candidates is designed to speed up the process.
[1] Agrawal, R. and R. Srikant, “Mining Sequential Patterns,” in Proceedings of IEEE Conference on Data Engineering, pp: 3-14, 1995.
[2] Anagnostopoulou, C. and G. Westermann, “Classification in Music: A Computational Model for Paradigmatic Analysis,” in Proceedings of the International Computer Music Conference, 1997.
[3] Aucouturier, J. J. and F. Pachet, “Music Similarity Measures: What’s the Use?” in Proceedings of International Symposium on Music Information Retrieval, 2002.
[4] Baeza-Yates, R. and B. Ribeiro-Neto, Modern Information Retrieval, Addison Wesley, 1999.
[5] Bakhmutova, V., V. D. Gusev, and T. N. Titkova, “The Search for Adaptations in Song Melodies,” Computer Music Journal, Vol. 21, No. 1, pages 58~67, Spring 1997.
[6] Baroni, M., R. Dalmonte, and C. Jacoboni, “Theory and Analysis of European Melody,” in A. Marsden and A. Pople (eds.), Computer Representation and Models in Music, Academic Press, 1992.
[7] Beckmann, N., H. P. Kriegel, R. Schneider, and B. Seeger, "The R*-tree: An efficient and Robust Access Method for Points and Rectangles," in Proceedings of ACM SIGMOD Int'l. Conf. on Management of Data, 1990.
[8] Bello, JP, L. Daudet and MB Sandler, “Time-domain Polyphonic Transcription using Self-generating Databases,” in Proceeding of the 112th convention of the audio engineering Society, 2002.
[9] Blackburn, S. and D. DeRoure , ”A Tool for Content-based Navigation of Music,” in Proceedings of the ACM Conference on Multimedia, 1998.
[10] Chai, W. and B. Vercoe, “Folk Music Classification Using Hidden Markov Models,” in Proceedings of International Conference on Artificial Intelligence, 2001.
[11] Chen, C. C. and Arbee L.P. Chen, “Query by Rhythm: An Approach for Song Retrieval in Music Database,” in Proceedings of IEEE Workshop on Research Issues in Data Engineering, pp: 139-146, 1998.
[12] Clausen, M., R. Engelbrecht, D. Meyer and J. Schmitz, “Proms: A web-based Tool for Searching in Polyphonic Music,” in Proceedings of the 1st International Symposium on Music Information Retrieval (ISMIR’00), 2000.
[13] Crochemore, M. and W. Rytter, Text Algorithms, Oxford University Press, 1994.
[14] Dannenberg, R. B., B. Thom, and D. Watson, “A Machine Learning Approach to Musical Style Recognition,” in Proceedings of International Computer Music Conference, 1997.
[15] DeRoure, D. and S. Blackburn, “Content-based Navigation of Music using Melodic Pitch Contours,” Multimedia Systems, 8(3), Springer, 2000.
[16] Dovey, MJ, ”A Technique for "Regular Expression" Style Searching in Polyphonic Music,” in Proceedings of the 2nd International Symposium on Music Information Retrieval (ISMIR’01), 2001.
[17] Downie, S. and M. Nelson, “Evaluation of a Simple and Effective Music Information Retrieval Method,” in Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR’00), 2000.
[18] Durey, AS and MA Clements, “Melody Spotting using Hidden Markov Models,” in Proceedings of the 2nd International Symposium on Music Information Retrieval (ISMIR’01), 2001.
[19] Ghias, A., H. Logan, D. Chamberlin, and BC Smith, “Query by Humming: Musical Information Retrieval in an Audio Database,” in Proceedings of the ACM Conference on Multimedia, 1995.
[20] Gonnet, G. H. and R. Baeza-Yates, Handbook of Algorithms and Data Structures: in Pascal and C, Addison-Wesley Publishing Company, 1991.
[21] Gusfield, D., Algorithms on Strings, Trees, and Sequences, Cambridge University Press, 1997.
[22] Han J., and M. Kamber, Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, 2001.
[23] Hsu, J. L., Arbee L.P. Chen, H. C. Chen and N. H. Liu, “The Effectiveness Study of Various Music Information Retrieval Approaches,” In Proceedings of ACM International Conference on Information and knowledge Management (CIKM’02), 2002.
[24] Hsu, J. L., C. C. Liu, and A. L.P. Chen, “Discovering Non-trivial Repeating Patterns in Music Data,” in IEEE Transactions on Multimedia, Vol. 3, No. 3, 2001.
[25] Hsu, J. L., C. C. Liu, and A. L.P. Chen, “Efficient Repeating Pattern Finding in Music Databases,” in Proceedings of Seventh International Conference on Information and Knowledge Management (CIKM'98), 1998.
[26] Jones, G. T., Music Theory, Harper & Row, Publishers, New York, 1974.
[27] Kornstadt, A., “Themefinder: A Web-Based Melodic Search Tool,” Computing in Musicology 11, MIT Press, 1998.
[28] Krumhansl, C. L., Cognitive Foundations of Musical Pitch, Oxford University Press, New York, 1990.
[29] Lemstrőm, K. and S. Perttu, “SEMEX – An Efficient Music Retrieval Prototype,” in Proceedings of the 1st International Symposium on Music Information Retrieval (ISMIR’00), 2000.
[30] Lin, C. R., N. H. Liu, Y. H. Wu and A. L.P. Chen, "Music Classification Using Significant Repeating Patterns," in Proceedings of International Conference on Database Systems for Advanced Applications (DASFAA’04), 2004.
[31] Liu, C. C., J. L. Hsu, and A. L.P. Chen, “Efficient Theme and Non-Trivial Repeating Pattern Discovering in Music Databases,” in Proceedings of the 15th International Conference on Data Engineering, 1999.
[32] MacNab, RJ, LA Smith, D. Bainbridge and IH Witten, “The New Zealand Digital Library MELodyinDEX,” Digital Library Magazine, 1997.
[33] McCreight, EM, “A Space-economical Suffix Tree Construction Algorithm,” Journal of the ACM, 23(2): 262—272, 1976.
[34] McNab, RJ, LS Smith, IH Witten and CL Henderson, “Tune Retrieval in the Multimedia Library,” Multimedia Tools and Applications, 10(2/3) Kluwer Academic Publishers, 2000.
[35] MIDI Manufaturers Association, Los Angeles, California. The Complete Detailed MIDI 1.0 Specification, 1996.
[36] Monti, G. and M. Samder, “Pitch Locking Monophonic Music Analysis,” in Proceeding of the 112th convention of the audio engineering Society, 2002.
[37] Moshe, S., Dynamic Programming, Marcel Dekker Inc., 1992.
[38] Narmour, E., The Analysis and Cognition of Basic Melodic Structures, The University of Chicago Press, Chicago, 1990.
[39] Park, S., W. W. Chu, J. Yoon and C. Hsu, “Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases,” in Proceedings of the International Conference on Data Engineering (ICDE’00), 2000.
[40] Paulus, J. and A. Klapuri, “Measuring the Similarity of Rhythmic Patterns,” in Proceedings of International Symposium on Music Information Retrieval (ISMIR’02), 2002.
[41] Pei, J., J. W. Han, B. Mortazavi-Asi, and H. Pinto, “PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth,” in Proceedings of IEEE Conference on Data Engineering, 2001.
[42] Pickens, J., “Feature Selection for Polyphonic Music Retrieval,” in Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR’01), 2001.
[43] Pickens, J., JP Bello, G. Monti, T. Crawford, M. Dovey, M. Sandler and D. Byrd, “Polyphonic Score Retrieval using Polyphonic Audio Queries: A Harmonic Modeling Approach,” in Proceedings of the 3rd International Symposium on Music Information Retrieval (ISMIR’02), 2002.
[44] Pickens, J. and T. Crawford, “Harmonic Models for Polyphonic Music Retrieval,” in Proceedings of ACM International Conference on Information and knowledge Management (CIKM’02), 2002.
[45] Pienimäk, A. “Indexing Music Databases Using Automatic Extraction of Frequent Phrases,” in Proceedings of the 3rd International Symposium on Music Information Retrieval (ISMIR’02), 2002.
[46] Pogue, D. and S. Speck, Classical Music for Dummies, IDG books worldwide Inc., 1999.
[47] Raphael, C., “Automatic Transcription of Piano Music,” in Proceedings of the 3rd International Symposium on Music Information Retrieval (ISMIR’00), 2000.
[48] Rolland, P. Y., “FlExPat: a Novel Algorithm for Musical Pattern Discovery,” in Proceedings of the 12th Colloquium on Musical Informatics (XII CIM), Gorizia, Italy, September 24-26, 1998.
[49] Rolland, P. Y. and J. G. Ganascia, “Musical Pattern Extraction and Similarity Assessment,” in Miranda, E. (eds.), Readings in Music and Artificial Intelligence, New York and London: Gordon & Breach - Harwood Academic Publishers, 1999.
[50] Rolland, P. Y., “FIExPat: Flexible Extraction of Sequential Patterns,” in Proceedings of the IEEE International Conference on Data Mining (ICDM’01), 2001.
[51] Salton, G., Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer, Addison Wesley, 1989.
[52] Selfridge-Field, E., “Conceptual and Representational Issues in Melodic Comparison,” in Hewlett, W. B. and E. Selfridge-Field (eds.), Melodic Similarity: Concepts, Procedures, and Applications (Computing in Musicology: 11), The MIT Press, 1998.
[53] Shih, H. H., S. S. Narayanan, and C. C. Jay Kuo, “Automatic Main Melody Extraction From MIDI Files with a Modified Lempel-Ziv Algorithm,” in Proceedings of International Symposium on Intelligent Multimedia, Video and Speech Processing, 2001.
[54] Sundberg, J., A. Friberg, and L. Fryden, “Common Secrets of Musicians and Listeners: An Analysis-by-synthesis Study of Musical Performance,” in Representing Musical Structure, P. Howell, R. West, and I. Cross (eds.), Academic Press, London, 1991.
[55] Tseng, Y. H. “Content-based Retrieval for Music Collections,” in Proceedings of the ACM International Conference on Research and Development in Information Retrieval (SIGIR’99), 1999.
[56] Tzanetakis, G., A. Ermolinskyi, and P. Cook, “Pitch Histograms in Audio and Symbolic Music Information Retrieval,” in Proceedings of International Symposium on Music Information Retrieval, 2002.
[57] Tzanetakis, G., G. Essl, and P. Cook, “Automatic Musical Genre Classification Of Audio Signals,” in Proceedings of International Symposium on Music Information Retrieval, 2001.
[58] Uitdenbogerd, A. and J. Zobel, “Melodic Matching Techniques for Large Music Databases,” in Proceedings of the ACM Conference on Multimedia, 1999.
[59] Waterman, M. S., Introduction to Computational Biology: Maps, Sequences and Genomes, Chapman & Hall, 1995.
[60] Wiggins, G., E. Miranda, A. Smaill and M. Harris, “A Framework for the Evaluation of Music Representation Systems,” Computer Music Journal, 17(3): 31—42, 1997.
[61] Whitman, B. and P. Smaragdis, “Combining Musical and Cultural Features for Intelligent Style Detection,” in Proceedings of International Symposium on Music Information Retrieval, 2002.
[62] Witten, I. (project leader) et al., “The New Zealand Digital Library Project,” http://nzdl2.cs.waikato.ac.nz/, University of Waikato, New Zealand, April 2000.
[63] Wu, S. and U. Manber, “Fast Text Searching Allowing Errors,” Communications of the ACM, 35(10): 83—91, 1992.
[64] Yanase, T. and A. Takasu, “Phrase Based Feature Extraction for Musical Information Retrieval,” in Proceedings of IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing, 1999.
[65] Yip, CL and B. Kao, “A Study on N-gram Indexing of Musical Features,” in Proceedings of IEEE International Conference on Multimedia and Expo (ICME’00), 2000.