基於全景接圖導引以維持時軸一致性之視訊畫面濃縮技術

簡易檢索 / 詳目顯示

回結果列表

研究生：	顏子傑 Yen, Tzu-Chieh
論文名稱：	基於全景接圖導引以維持時軸一致性之視訊畫面濃縮技術 Maintaining Temporal Coherence in Video Retargeting Using Mosaic-Guided Scaling
指導教授：	林嘉文 Lin, Chia-Wen
口試委員:
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 電機工程學系 Department of Electrical Engineering
論文出版年：	2010
畢業學年度：	98
語文別：	英文
論文頁數：	42
中文關鍵詞：	視訊畫面調適、視訊畫面濃縮、視訊畫面縮放、空間及時軸一致性
外文關鍵詞：	Video Adaptation, Video Retargeting, Video Scaling, Spatio-temporal Coherence
相關次數：	點閱：4 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

隨著科技的發展，影像/視訊內容的分享越來越普及，許多人喜歡將生活周遭發生的事物透過行動裝置，例如：手機、PDA、…等，立即分享給親朋好友。但是受限於行動裝置螢幕大小的限制，我們必須對於這些影像/視訊進行畫面濃縮處理才能在這些行動裝置上進行播放。但無可避免的這個過程會造成資訊上的損失。近年來已經有不少研究課題討論如何實現基於內容為主的影像/視訊畫面濃縮，目的是希望畫面經過濃縮之後能盡量維持畫面中人眼感興趣的區域，並且縮小或裁切掉人眼較不感興趣的區域，使得濃縮後的影像/視訊能呈現原始影像/視訊所要分享的訊息。然而，對於濃縮後的視訊如何確保畫面之間在空間軸上以及時軸上的一致性將會是影響視訊品質的關鍵。目前現存的方法對於同時包含有背景鏡頭移動與物體移動的影片很難在畫面濃縮後保持在時軸上的一致性。因此這裡我們提出了一個新的視訊濃縮方法，利用全景接圖導引的方式去決定每一個相對應區域的縮放比例，以確保畫面之間在時軸上的一致性。我們提出的方法先將影片中屬於同一個場景的每一張畫面接合產生一張全景圖，並根據全景圖產生一張全域縮放圖，接著每一張畫面參考全域縮放圖上的資訊與提出的空間軸上限制做最佳化處理，最後得到每一張畫面個別的局部縮放圖和濃縮後的畫面。實驗的結果顯示我們提出的視訊濃縮方法可以有效維持濃縮後的視訊畫面在時軸上的一致性，甚至對於包含有背景鏡頭移動與物體移動的影片也能有不錯的效果。

Video retargeting from a full-resolution video to a lower-resolution display will inevitably cause information loss. Content-aware video retargeting techniques have been studied to avoid critical visual information loss while retargeting a video. Maintaining the spatio-temporal coherence of a retargeted video is very critical on visual quality. Camera motions and object motions, however, usually make it difficult to maintain temporal coherence with existing video retargeting schemes. In this thesis, we propose the use of a panoramic mosaic to guide the scaling of corresponding regions of video frames in a video shot to ensure good temporal coherence. In the proposed method, after aligning video frames in a shot to a panoramic mosaic constructed for the shot, a global scaling map for these frames is derived from the panoramic mosaic. Subsequently, the local scaling maps of individual frames are derived from the global map and is further refined according to spatial coherence constraints. Our experimental results show that the proposed method can effectively maintain temporal coherence so as to achieve good visual quality even a video contains camera motions and object motions.

Content
摘  要    i
Abstract    ii
Content    iii
Chapter 1 Introduction    1
Chapter 2 Related Work    5
Chapter 3 Formulation of Video Resizing    9
Chapter 4 Proposed Video Retargeting Scheme    11
4.1 Initialization    12
4.1.1 The Frame-Level Saliency Maps    12
4.1.2 The Initial Local Scaling Maps    13
4.1.3 The Shot-Level Panoramic Mosaic    14
4.2 Mosaic-Guided Video Retargeting    15
4.2.1 The Global Scaling Map    15
4.2.2 Global Map Constraint    17
4.2.3 Spatial Coherence Constraints    18
4.2.4 Iterative Optimization Procedure    19
Chapter 5 Experiments and Discussion    23
5.1 Performance Evaluation    23
5.2 Limitations    27
Chapter 6 Conclusion    40
References    41

                                

[1] S. Avidan and A. Shamir, “Seam carving for content-aware image resizing,” ACM Trans. Graphics, vol. 26, no. 3, pp. 16, 2007.
[2] C.-K. Chiang, S.-F. Wang, Y.-L. Chen and S.-H. Lai, “Fast JND-based video carving with GPU acceleration for real-time video retargeting,” IEEE Trans. Circuits Syst. Video Technol, vol. 19, no. 11, pp. 1588□1597, Nov. 2009.
[3] M. A. Fischler and R. C. Bolles, “Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography,” ACM Commun., vol. 24, no. 6, pp. 381□395, June 1981.
[4] Y. Guo, F. Liu, J. Shi, Z.-H. Zhou, and M. Gleicher, “Image retargeting using mesh parametrization,” IEEE Trans. Multimedia, vol. 11, no. 5, pp. 856□867, Aug. 2009.
[5] D. Han, X. Wu and M. Sonka, “Optimal multiple surfaces searching for video/image resizing - a graph-theoretic approach,” in Proc. IEEE Int. Conf. Comput. Vis., Sept. □Oct. 2009, Kyoto, Japan.
[6] L. Itti, C. Koch and E. Niebur, “A model of saliency-based visual attention for rapid scene analysis,” IEEE Trans. Pattern Anal. Match. Intell, vol. 20, no. 11, pp. 1254□1259, Nov. 1998.
[7] J.-S. Kim, J.-H. Kim and C.-S. Kim, "Adaptive image and video retargeting technique based on fourier analysis," in Proc. IEEE Int. Conf. Computer Vision and Pattern Recognition, pp. 1730□1737, Sept. 2009, Kyoto, Japan.
[8] S. Kopf, J. Kiess, H. Lemelson and W. Effelsberg, “FSCAV-fast seam carving for size adaptation of videos,” in Proc. ACM Int. Conf. Multimedia, pp. 321□330, Oct. 2009, Beijing, China.
[9] F. Liu and M. Gleicher, “Video retargeting: automating pan and scan,” in Proc. ACM Int. Conf. Multimedia, pp. 241□250. Oct. 2006, Santa Barbara, CA.
[10] D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” Int. J. Comput. Vis., vol. 60, no. 2, pp. 91□110, 2004.
[11] Z. Lu, W. Lin, X. Yang, E. Ong and S. Yao, “Modeling visual attention's modulatory aftereffects on visual sensitivity and quality evaluation,” IEEE Trans. Image Process., vol. 14, no. 11, pp. 1928□1942, Nov. 2005.
[12] T. Ren, Y. Liu and G. Wu, "Image retargeting based on global energy optimization,” in Proc. IEEE Int. Conf. Multimedia Expo, pp. 406□409. June 2009, New York, USA.
[13] M. Rubinstein, A. Shamir and S. Avidan, “Improved seam carving for video retargeting,” ACM Trans. Graphics, vol. 27, no. 3, pp. 16, 2008.
[14] V. Setlur, T. Lechner, M. Nienhaus and B. Gooch, “Retargeting images and video for preserving information saliency,” IEEE Computer Graphics and Applications, vol. 27, no. 5, pp. 80□88, Sept.-Oct. 2007.
[15] A. Shamir and O. Sorkine, “Visual media retargeting,” in ACM SIGGRAPH ASIA Courses (SIGGRAPH ASIA '09), 2009, pp. 1□13.
[16] D. Simakov, Y. Caspi, E. Shechtman and M. Irani, “Summarizing visual data using bidirectional similarity,” in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognit., pp. 1□8, June 2008, Anchorage, Alaska.
[17] R. Szeliski, “Image alignment and stitching: a tutorial,” Foundations and Trends in Computer Graphics and Vision (FTCGV), vol. 2, no. 1, pp. 1□104, 2006.
[18] Y.-S. Wang, C.-L. Tai, O. Sorkine and T.-Y. Lee, "Optimized scale-and-stretch for image resizing," ACM Trans. Graphics, vol. 27, no. 5, Dec. 2008.
[19] Y.-S. Wang, H. Fu, O. Sorkine, T.-Y. Lee and H.-P. Seidel, “Motion-aware temporal coherence for video resizing,” ACM Trans. Graphics, vol. 28, no. 5, 2009.
[20] L. Wolf, M. Guttmann and D. Cohen-Or, "Non-homogeneous content-driven video-retargeting," in Proc. IEEE Int. Conf. Comput. Vis., pp. 1□6, Oct. 2007, Rio de Janeiro, Brazil.
[21] Y.-F. Zhang, S.-M. Hu and R. R. Martin, “Shrinkability maps for content-aware video resizing,” Computer Graphics Forum, vol. 27, no. 7, pp. 1797□1804, 2008.

全文公開日期本全文未授權公開 (校內網路)
全文公開日期本全文未授權公開 (校外網路)

簡易檢索 / 詳目顯示

相關論文