Center for Nonlinear Studies

Monday, July 20, 200911:00 AM - 12:00 PMCNLS Conference Room (TA-3, Bldg 1690)
Seminar
Fast Reconstruction of the World from Photos and Videos
Jan-Michael FrahmDept of Computer Science, University of North Carolina at Chapel Hill
In recent years photo/video sharing web sites like Flickr and YouTube have become increasingly popular. Nowadays, every day terra bytes of photos and videos are uploaded. These data survey large parts of the world throughout the different seasons, various weather conditions and all times of the day. In the talk I will present my work on the highly efficient reconstruction of 3D models from these data. It addresses a variety of the current challenges that have to be addressed to achieve a concurrent 3D model from these data. The challenges are: estimation of the geometric and radiometric camera calibration from videos and photos, efficient robust camera motion estimation for (quasi-)degenerate estimation problems, high performance stereo estimation from multiple views, automatic selection of correct views from noisy image/video collections, image based location recognition for topology detection. In the talk I will discuss the details of our real-time camera motion estimation from video using our Adaptive Real-Time Random Sample Consensus (ARRSAC) and our high performance salient feature tracker, which simultaneously estimates the radiometric camera calibration and tracks the motion of the salient feature points. Furthermore our technique to achieve robustness against (quasi-) degenerate data will be introduced. It allows to detect and overcome the case of data, which under-constrain the camera motion estimation problem. Additionally our optimal stereo technique for determining the scene depths with constant precision throughout the scene volume will be explained during the talk. It allows to perform the scene depth estimation from a large set of views with optimal computational effort while obtaining the depth with constant precision throughout the reconstruction volume. I also discuss our fast technique for the image based location recognition, which uses commodity graphics processors to achieve real-time performance while providing high recognition rates. Furthermore in the talk I present our work on 3D reconstruction from internet photo collections. It combines image based recognition with geometric constraints to efficiently perform the simultaneous selection of correct views and the 3D reconstruction from large collections of photos. The talk will also explain the future challenges in all the mentioned areas.

Host: Alexei N. Skurikhin, ISR-2, alexei@lanl.gov, 7-5067