研究生: |
方一帆 Fang, Yi-Fan |
---|---|
論文名稱: |
哼唱選歌的改進方法與實作 Improvement and Implementation of Query by Singing/Humming Systems |
指導教授: |
張智星
Jang, Jyh-Shing Roger |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2010 |
畢業學年度: | 99 |
語文別: | 中文 |
論文頁數: | 27 |
中文關鍵詞: | 哼唱選歌 、音樂檢索 、旋律辨識 |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
卡拉OK在許多國家是一項普遍的娛樂活動。卡拉OK機以動態歌詞來輔助使用者的歌唱過程,但使用者偶爾會遭遇到選歌的問題。時至今日,當使用者要點歌時,使用者必須回憶起歌曲的名稱,再於歌本查找並輸入歌曲。
相較於一般需要記憶歌曲名的歌本查找方式,哼唱選歌是一項方便快速的音樂檢索方式,哼唱選歌系統需要使用者輸入一片段哼唱旋律,而系統則以此旋律利用旋律辨識比對歌曲資料庫中,辨識哪些歌曲之旋律較符合使用者哼唱的片段,並輸出這些歌曲曲名給使用者參考是否為使用者想要點的歌曲,以此達到點歌的目的。卡拉OK歌唱系統的點歌部分,如果能在一般的歌本查找法之外,加入哼唱選歌功能,更能增進唱歌娛樂的順暢度。
本論文的歌曲資料庫較特別的地方為,資料庫每首歌曲中皆有標示歌詞開始唱的起點,本研究利用此資料庫有標示起唱位置的優勢,判斷出歌曲中重複的片段,並於辨識時避免重複辨識,而順利降低辨識時間。
為更進一步增進辨識率,我們研究辨識失敗的個案,發現使用者哼唱旋律存在調性不穩的現象,為針對使用者哼唱的走調現象進行校正,本論文提出分段移調的比對方式,並驗證此方法增進辨識率的效果。
Karaoke singing is a popular activity in many countries. Karaoke machine provide dynamic lyrics to help user singing, but user may encounter a problem in querying a song. So far when user queries a song, user must recall the song names then lookup it in a song list given by the Karaoke machine.
Compared to query by lookup system, query by singing/humming system is a more convenient way to make a query. Query by singing/humming system get a melody singing/humming by user, and then compare this melody to the melodies in the database to recognize the songs which have the melody similar to the melody given by user. Finally, the system output the recognized songs to user as reference to make a query. If we can add the query by singing/humming system into Karaoke machine to work with query by lookup system, we can improve the fluency.
The database adapted by this research has all sentence onsets tagged, we will take advantage of this feature to recognize repeating melody fragment within a song and remove the unnecessary melody fragment. By this way, we can improve the recognition speed.
By studying the cases which the system failed to recognize them correctly, we found that the tonality may vary during the user’s singing/humming period. So we fragment the singing/humming clip into sections and then do transposition with different sections, respectively. We also do some experiments to verify the benefit from this method.
【1】 Lawrencer R. Rabiner, "On the Use of Autocorrelation Analysis for Pitch Detection", IEEE Trans. ASSP, vol. 25, pp. 24-33, Feb 1977.
【2】 Jang, J.-S. Roger, Hong-Ru Lee, Ming-Yang Kao, "Content-based Music Retrieval Using Linear Scaling and Branch-and-bound Tree Search", IEEE International Conference on Multimedia and Expo, Waseda University, Tokyo, Japan, August 2001.
【3】 林子喬, 「哼唱選歌於嵌入式卡拉OK系統之研究與實作」, 碩士論文, 國立清華大學資訊工程學系, 民國九十八年六月。
【4】 Jang, J.-S. Roger, Ming-Yang Kao, "A Query-by-Singing System based on Dynamic Programming", International Workshop on Intelligent Systems Resolutions (the 8th Bellman Continuum), PP. 85-89, Hsinchu, Taiwan, Dec 2000.
【5】 Chung-Che Wang, Jyh-Shing Roger Jang, Wennen Wang, "An Improved Query by Singing/Humming System Using Melody and Lyrics Information", International Symposium on Music Information Retrieval, Utrecht, Netherlands, Aug 2010.
【6】 Jang, J.-S. Roger, "Audio Signal Processing and Recognition", http://mirlab.org/jang/books/audioSignalProcessing/, last accessed 17 Sep 2010.
【7】 Parsons, Thomas W, “Voice and Speech Processing”, McGraw-Hill, 1986.
【8】 冼鏡光, 「C名題精選-使用C語言-技巧篇」, 儒林, 2002.
【9】 M. Suzuki, T. Hosoya, A. Ito, and S. Makino, "Music Information Retrieval from a Singing Voice Using Lyrics and Melody Information", EURASIP Journal on Advances in Signal Processing, vol. 2007, Article ID 38727, 8 pages, 2007.
【10】 MIREX 2010, http://www.music-ir.org/mirex/wiki/2010:MIREX_HOME, 2010
【11】 MIDOMI, http://www.midomi.com/, 2010