
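Author: 王惠瑩
Hui-Ying Wang
Thesis title: Self-Organized Picture Browsing For Photo Gallery
(Chinese title: 自動組織之圖像瀏覽技術)
Advisor: 王家祥
Jia-Shung Wang
Oral defense committee:
Degree: Master
Department: College of Electrical Engineering and Computer Science - Department of Computer Science
Year of publication: 2008
Academic year of graduation: 96 (ROC calendar, 2007-2008)
Language: English
Number of pages: 88
Keywords (Chinese): 照片瀏覽 (photo browsing)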
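  • Abstract (translated from Chinese)
    When life's important moments are all recorded in photographs, looking back on the past becomes as important as looking ahead to the future. Many tools use the time at which a photo was taken to help users browse their photos. Once people begin sharing photos, however, relying on time information becomes infeasible. We therefore propose two photo-browsing frameworks based on visual similarity, which avoid the confusion caused by time information.
    Ring browsing arranges the photos in a one-dimensional circular order. Starting from any photo, this order traverses every photo in the collection exactly once, and it guarantees that the overall visual dissimilarity is minimal; that is, consecutive photos show scenes that are as similar as possible. Grid browsing provides two-dimensional browsing: representative photos are selected and arranged in a grid structure that effectively presents an overview of the whole collection. The remaining, non-representative photos are hidden beneath the representative photos according to similarity, so that photos placed together mostly show similar scenes and the details of the collection remain accessible.
    Experimental results show that our methods can effectively present large collections on screens of different sizes. The evaluation shows that, when searching for a given photo, both ring browsing and grid browsing offer enough search capability to return acceptable results.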
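    The two browsing structures described in the abstract can be illustrated with a minimal sketch. It assumes a symmetric pairwise dissimilarity matrix `dist` (for example, distances between image descriptors) and deliberately swaps in simple heuristics: a nearest-neighbour tour refined by 2-opt for the ring, and farthest-point selection of representatives for the grouping. The table of contents indicates that the thesis itself relies on an exact TSP solver and on multidimensional scaling with K-means, so this is not the author's method, and heuristics like these do not guarantee a minimal ring. All function names here are hypothetical.

```python
from typing import Dict, List, Sequence

Matrix = Sequence[Sequence[float]]  # assumed symmetric, zero diagonal


def ring_cost(order: List[int], dist: Matrix) -> float:
    """Total dissimilarity around the closed ring (last photo links to first)."""
    n = len(order)
    return sum(dist[order[i]][order[(i + 1) % n]] for i in range(n))


def ring_order(dist: Matrix) -> List[int]:
    """Heuristic ring: nearest-neighbour tour, then 2-opt segment reversals."""
    n = len(dist)
    if n == 0:
        return []
    # Greedy construction: always step to the most similar unvisited photo.
    order = [0]
    unvisited = set(range(1, n))
    while unvisited:
        last = order[-1]
        nxt = min(unvisited, key=lambda j: (dist[last][j], j))
        order.append(nxt)
        unvisited.remove(nxt)
    # 2-opt: reverse a segment whenever that strictly shortens the ring.
    improved = True
    while improved:
        improved = False
        for i in range(1, n - 1):
            for j in range(i + 1, n):
                candidate = order[:i] + order[i:j + 1][::-1] + order[j + 1:]
                if ring_cost(candidate, dist) < ring_cost(order, dist) - 1e-12:
                    order = candidate
                    improved = True
    return order


def group_under_representatives(dist: Matrix, k: int) -> Dict[int, List[int]]:
    """Pick up to k mutually dissimilar representatives (farthest-point
    heuristic) and file every other photo under its most similar one."""
    n = len(dist)
    if n == 0 or k <= 0:
        return {}
    reps = [0]
    while len(reps) < min(k, n):
        # Next representative: the photo farthest from all chosen ones.
        far = max(
            (i for i in range(n) if i not in reps),
            key=lambda i: min(dist[i][r] for r in reps),
        )
        reps.append(far)
    groups: Dict[int, List[int]] = {r: [] for r in reps}
    for i in range(n):
        if i not in groups:
            nearest = min(reps, key=lambda r: dist[i][r])
            groups[nearest].append(i)
    return groups


# Toy data: five "photos" described by a single made-up feature value.
values = [0, 10, 1, 11, 2]
dist = [[abs(a - b) for b in values] for a in values]
order = ring_order(dist)
groups = group_under_representatives(dist, 2)
```

    Because the ring is closed, browsing can start at any index of `order` and wrap around; the keys of `groups` play the role of the photos shown in the grid, with the listed photos hidden beneath them.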


    CHAPTER 1 INTRODUCTION
      1-1 INFORMATION RETRIEVAL AND IMAGE RETRIEVAL
      1-2 CATEGORIZATION OF IMAGES
      1-3 THE PROPOSED IMAGE BROWSING
    CHAPTER 2 RELATED WORKS
    CHAPTER 3 IMAGE REPRESENTATION
      3-1 MPEG-7
      3-2 COLOR LAYOUT DESCRIPTOR (CLD)
      3-3 EDGE HISTOGRAM DESCRIPTOR (EHD)
      3-4 HOMOGENEOUS TEXTURE DESCRIPTOR (HTD)
    CHAPTER 4 PHOTO RING
      4-1 BROWSING SEQUENCE
      4-2 TRAVELING SALESPERSON SOLUTION
        Mapping photos to a TSP Tour
        Solutions to TSP
        The selected TSP Algorithm
        Linear programming based cutting-plane
        Branch-and-cut
    CHAPTER 5 PHOTO GRID
      5-1 REPRESENTATIVE DETERMINATION
      5-2 MULTIDIMENSIONAL SCALING
      5-3 DIRECT APPROACH
      5-4 INDIRECT APPROACH
        Pre-clustering by K-means
    CHAPTER 6 EXPERIMENTAL RESULTS
      6-1 TESTING DATASET
      6-2 RESULTS ON IMAGE REPRESENTATION
        Evaluation metrics of content-based image retrieval
        Precision
        Recall
        mAP: Average precision [2]
      6-3 RESULTS ON PHOTO RING
        Evaluation metrics of photo ring
        Degree of fragmentation
        Retrieval efficiency on photo rings
      6-4 RESULTS ON PHOTO GRID
        Evaluation metrics of photo grid
        Average rank ratio (aveRankRatio(range))
        Average coverage of Top X (aveCoverageTopX(range))
    CHAPTER 7 CONCLUSION
    REFERENCES

    [1] J. Luo, M. Boutell, and C. Brown, "Pictures are not taken in a vacuum - an overview of exploiting context for semantic scene content understanding," IEEE Signal Processing Magazine, vol. 23, pp. 101-114, 2006.
    [2] R. Datta, D. Joshi, J. Li, and J. Z. Wang, "Image Retrieval: Ideas, Influences, and Trends of the New Age," ACM Computing Surveys, 2008, to appear.
    [3] Riya, "http://www.riya.com/."
    [4] J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman, "Object retrieval with large vocabularies and fast spatial matching," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2007, pp. 1-8.
    [5] P. Viola and M. Jones, "Rapid object detection using a boosted cascade of simple features," in Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2001, pp. 511-518.
    [6] R. Datta, W. Ge, J. Li, and J. Z. Wang, "Toward bridging the annotation-retrieval gap in image search by a generative modeling approach," in Proceedings of the 14th annual ACM international conference on Multimedia, Santa Barbara, CA, USA: ACM, 2006.
    [7] J.-Y. Chen, C. A. Bouman, and J. C. Dalton, "Similarity pyramids for browsing and organization of large image databases," in Proceedings of the Conference on Human Vision and Electronic Imaging III, San Jose: SPIE/IS&T, 1998.
    [8] T. T. A. Combs and B. B. Bederson, "Does zooming improve image browsing?," in Proceedings of the fourth ACM conference on Digital libraries, Berkeley, California, United States: ACM, 1999.
    [9] A. Kuchinsky, C. Pering, M. L. Creech, D. Freeze, B. Serra, and J. Gwizdka, "FotoFile: a consumer multimedia organization and retrieval system," in Proceedings of the SIGCHI conference on Human factors in computing systems: the CHI is the limit, Pittsburgh, Pennsylvania, United States: ACM, 1999.
    [10] Z. Pecenovic, M. Do, M. Vetterli, and P. Pu, "Integrated Browsing and Searching of Large Image Collections," in Proceedings of the 4th International Conference on Advances in Visual Information Systems: Springer-Verlag, 2000.
    [11] H. Kang and B. Shneiderman, "Visualization methods for personal photo collections: browsing and searching in the PhotoFinder," in Proceedings of IEEE International Conference on Multimedia and Expo, 2000, pp. 1539-1542 vol.3.
    [12] J. C. Platt, "AutoAlbum: Clustering Digital Photographs using Probabilistic Model Merging," in Proceedings of the IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL'00): IEEE Computer Society, 2000.
    [13] B. B. Bederson, "PhotoMesa: a zoomable image browser using quantum treemaps and bubblemaps," in Proceedings of the 14th annual ACM symposium on User interface software and technology, Orlando, Florida: ACM, 2001.
    [14] K. Rodden, W. Basalaj, D. Sinclair, and K. Wood, "Does organisation by similarity assist image browsing?," in Proceedings of the SIGCHI conference on Human factors in computing systems, Seattle, Washington, United States: ACM, 2001.
    [15] C. Shen, N. Lesh, and F. Vernier, "Personal digital historian: story sharing around the table," vol. 10: ACM, 2003, pp. 15-22.
    [16] C. Shen, N. Lesh, B. Moghaddam, P. Beardsley, and R. S. Bardsley, "Personal digital historian: user interface design," in CHI '01 extended abstracts on Human factors in computing systems, Seattle, Washington: ACM, 2001.
    [17] J. C. Platt, M. Czerwinski, and B. A. Field, "PhotoTOC: automatic clustering for browsing personal photographs," in Proceedings of the 2003 Joint Conference of the Fourth International Conference on Information, Communications and Signal Processing, 2003, pp. 6-10.
    [18] S. M. Drucker, C. Wong, A. Roseway, S. Glenner, and S. De Mar, "MediaBrowser: reclaiming the shoebox," in Proceedings of the working conference on Advanced visual interfaces, Gallipoli, Italy: ACM, 2004.
    [19] D. F. Huynh, S. M. Drucker, P. Baudisch, and C. Wong, "Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections," in CHI '05 extended abstracts on Human factors in computing systems, Portland, OR, USA: ACM, 2005.
    [20] G. P. Nguyen and M. Worring, "Interactive access to large image collections using similarity-based visualization," Journal of Visual Languages and Computing, vol. 19, pp. 203-224, 2008.
    [21] ACDSee, "http://www.acdsee.com/."
    [22] Picasa, "http://picasa.google.com/index.html."
    [23] iPhoto, "http://www.apple.com/ilife/iphoto/."
    [24] Flickr, "http://www.flickr.com/."
    [25] Webshots, "http://www.webshots.com/."
    [26] X.-S. Hua, L. Lu, and H.-J. Zhang, "Content based photograph slide show with incidental music," in Proceedings of the 2003 International Symposium on Circuits and Systems, 2003, pp. 648-651.
    [27] J.-C. Chen, W.-T. Chu, J.-H. Kuo, C.-Y. Weng, and J.-L. Wu, "Tiling slideshow," in Proceedings of the 14th annual ACM international conference on Multimedia, Santa Barbara, CA, USA: ACM, 2006.
    [28] X.-S. Hua, L. Lu, and H.-J. Zhang, "Automatically converting photographic series into video," in Proceedings of the 12th annual ACM international conference on Multimedia, New York, NY, USA: ACM, 2004.
    [29] A. W. M. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, "Content-based image retrieval at the end of the early years," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, pp. 1349-1380, 2000.
    [30] E. Kasutani and A. Yamada, "The MPEG-7 color layout descriptor: a compact image feature description for high-speed image/video segment retrieval," in Proceedings of International Conference on Image Processing, 2001, pp. 674-677 vol.1.
    [31] C. S. Won, D. K. Park, and S.-J. Park, "Efficient use of MPEG-7 edge histogram descriptor," ETRI Journal, vol. 24, pp. 23-30, 2002.
    [32] Y. M. Ro, M. Kim, and H. K. Kang, "MPEG-7 homogeneous texture descriptor," ETRI Journal, vol. 23, pp. 41-51, 2001.
    [33] R. Durbin and D. Willshaw, "An analogue approach to the travelling salesman problem using an elastic net method," Nature, vol. 326, pp. 689-691, 1987.
    [34] D. Applegate, R. Bixby, V. Chvátal, and W. Cook, "On the solution of traveling salesman problems," Documenta Mathematica, Extra Volume ICM III, pp. 645-656, 1998.
    [35] Concorde, "http://www.tsp.gatech.edu/concorde/index.html."

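    Full-text release date (on-campus network): full text not authorized for public release
    Full-text release date (off-campus network): full text not authorized for public release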