研究生: |
林志偉 Chih-Wei Lin |
---|---|
論文名稱: |
以影像為基礎之新景合成 Image-Based Novel View Synthesis |
指導教授: |
黃仲陵
Chung-Lin Huang |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2006 |
畢業學年度: | 94 |
語文別: | 英文 |
論文頁數: | 54 |
中文關鍵詞: | 影像合成 、三焦張量 、基礎矩陣 、影像變形 |
外文關鍵詞: | trifocal tensor, view synthesis, fundamental matrix, view morphing |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
摘要
近年來,我們可以在許多電影或者是電視轉播上可以看到很多種類的特效。其中有一項就是利用視角的變化所造成的特殊效果。不過,大部分的影片都是利用快速的相機切換來達成這樣的效果,這是一個很高成本的工作。在本篇論文中,我們提出了一個以影像為基礎新景合成的方法,可以利用電腦視覺的方法來達到這樣的效果。我們提出的方法主要是利用視角的變形(View Morphing)和三焦張量的轉換(Trifocal Tensor Transfer)來達成視角合成的特殊效果。因此我們可以只利用兩台相機來達成多台相機的效果。首先,我們先利用角點偵測(Harris Corner Detector)找出影像當中的特殊點。接著再利用正規劃的相關運算(Normalized Cross Correlation) 先去找到初步的對應點,不過這些對應點必定會存在一些錯誤的對應點。因此,我們再使用一致隨機取樣(Random Sample Consensus)去將這些錯誤的對應點從初步的對應點中挑出來。一致隨機取樣所使用的模型為基礎矩陣(Fundamental matrix)並利用極點限制來判斷此對應點是否為有效的對應點。我們就可以利用正規劃直接線性轉換(Normalized Direct Linear Transformation)來算出所利用的多視角幾何的參數。
為了簡化尋找密集對應點的問題,我們假設我們所拍攝的場景可以切成好幾個的平面。所以,就可利用單應矩陣(Homography matrix) 來找出密集的對應點。再利用視角的變形和三焦張量的轉換來製造新視角的影像。我們使用的方法只利用簡單的幾何關係而且是以影像為基礎,所以我們並不需要強烈的相機校正和三維空間的模型就可以達成視角的合成。
Abstract
In this thesis, we present a novel view synthesis approach which encapsulates view morphing and trifocal transfer. We can achieve the effect of having many virtual cameras, but in practice we only have two real ones. First, we use Harris corner detector to detect feature points. Then apply the normalized cross correlation and random sample consensus (RANSAC) to extract correspondences and make use of normalized direct linear transformation to solve the parameters of the multiple-view geometry.
For simplifying the problem of finding dense correspondences, we assume that the scene is piecewise planar. Thus, we can make use of the homography matrices to determine the dense correspondences between the two images. The method we use does not need the strong calibration and the complex model.
Bibliographies
[1] S.Avidan, A.Shashua, “Novel View Synthesis by Cascading Trilinear Tensors,” IEEE Trans. on Visualization and Computer Graphics, Vol.4, No.4, pp.293-306, 1998.
[2] M.A. Fischler and R.C. Bolles, “A paradigm for model Fitting with application to
image analysis and automated cartography,” Communications of the ACM,
24(6):381{395, 1981.
[3] A. Shashua, “Algebraic functions for recognition,”IEEE Trans. Pattern Analysis
Machine Intelligence,vol. 17, pp. 779--789.
[4] Richard Hartley and Andrew Zisserman (2003), Multiple View Geometry in computer vision, 2nd edition, Cambridge University Press.
[6] Sun Zhaohui, A.M. Tekalp,”Image registration using a 3-D scene representation,” International Conference on Image Processing ,3-Volume Set-Volume 1, 1998
[7] N. Inamoto and H. Saito, “Fly-Through Viewpoint Video System for Multi-View Soccer Movie Using Viewpoint Interpolation,” Proc. of Visual Communications and Image Processing, SPIE, Vol. 5150, 122, Jul. 2003.
[8] J. P. Lewis, Fast Normalized Cross-Correlation, Industrial Light & Magic ,
http://www.idiom.com/~zilla/Work/nvisionInterface/nip.html
[9] S. Seitz, C. Dyer,’’View morphing,’’ Proc. ACM SIGGRAPH 1996, 1996, pp.
21–30.
[10] R. Manning, C. Dyer,’’Interpolating view and scene motion by dynamic view morphing,’’Proc. CVPR, 1999, pp. 388–394.
[11] S.Vedula, S.Baker,T. Kanade,’’Spatio-Temporal View Interpolation Eurographics Workshop on Rendering,’’ 2002.
[12] Z. Zhang, L. Wang, B. Guo, H. Shum, ”Feature-based light field morphing,”in:
Proc. ACM SIGGRAPH 2002, 2002, pp. 457–464.