FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent
This paper introduces FlowMap, an end-to-end differentiable method that solves for precise camera poses, camera intrinsics, and per-frame dense depth of a video sequence. Our method performs per-video gradient-descent minimization of a simple least-squares objective that compares the optical flow induced by depth, intrinsics, and poses against correspondences obtained via off-the-shelf optical flow and point tracking. Alongside the use of point tracks to encourage long-term geometric consistency, we introduce differentiable re-parameterizations of depth, intrinsics, and pose that are amenable to first-order optimization. We empirically show that camera parameters and dense depth recovered by our method enable photo-realistic novel view synthesis on 360° trajectories using Gaussian Splatting. Our method not only far outperforms prior gradient-descent-based bundle adjustment methods, but surprisingly performs on par with COLMAP, the state-of-the-art SfM method, on the downstream task of 360° novel view synthesis, even though our method is purely gradient-descent based, fully differentiable, and presents a complete departure from conventional SfM.
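To make the objective described above concrete, the following is a minimal NumPy sketch of an induced-flow least-squares loss for one pair of frames. It is an illustration under stated assumptions, not the authors' implementation: the function names `induced_flow` and `flow_loss`, the pinhole intrinsics matrix `K`, and the relative pose given as a rotation `R` and translation `t` are all hypothetical choices, and the sketch omits point tracks and the re-parameterizations mentioned in the abstract.

```python
import numpy as np


def induced_flow(depth, K, R, t):
    """Flow implied by depth, intrinsics, and a relative pose (assumed pinhole model).

    depth: (h, w) per-pixel depth in frame i.
    K:     (3, 3) intrinsics matrix.
    R, t:  rotation (3, 3) and translation (3,) mapping frame-i camera
           coordinates to frame-j camera coordinates.
    Returns an (h, w, 2) array of pixel displacements from frame i to frame j.
    """
    h, w = depth.shape
    # Homogeneous pixel grid, shape (h, w, 3).
    u, v = np.meshgrid(np.arange(w, dtype=float), np.arange(h, dtype=float))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)
    # Unproject each pixel: X = depth * K^{-1} [u, v, 1]^T.
    points = (pix @ np.linalg.inv(K).T) * depth[..., None]
    # Transform into frame j and project back to pixels.
    proj = (points @ R.T + t) @ K.T
    uv_j = proj[..., :2] / proj[..., 2:3]
    return uv_j - pix[..., :2]


def flow_loss(depth, K, R, t, target_flow):
    """Mean squared difference between induced flow and observed flow."""
    diff = induced_flow(depth, K, R, t) - target_flow
    return float(np.mean(diff ** 2))


# Toy usage: constant depth and a small sideways camera translation.
K = np.array([[100.0, 0.0, 4.0],
              [0.0, 100.0, 3.0],
              [0.0, 0.0, 1.0]])
depth = np.full((6, 8), 2.0)
R = np.eye(3)
t = np.array([0.1, 0.0, 0.0])
flow = induced_flow(depth, K, R, t)
loss = flow_loss(depth, K, R, t, target_flow=flow)
```

In this toy setup the induced flow is a uniform horizontal shift of focal length times translation divided by depth, here 100 × 0.1 / 2 = 5 pixels, and the loss against that same flow is zero. In a differentiable framework, the same computation would be minimized by gradient descent with respect to depth, intrinsics, and poses, with the target flow supplied by an off-the-shelf estimator.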

