Results 1 - 10
of
12
Combining Discriminative and Generative Methods for 3D Deformable Surface and Articulated Pose Reconstruction
"... Historically non-rigid shape recovery and articulated pose estimation have evolved as separate fields. Recent methods for non-rigid shape recovery have focused on improving the algorithmic formulation, but have only considered the case of reconstruction from point-to-point correspondences. In contra ..."
Abstract
-
Cited by 6 (2 self)
- Add to MetaCart
Historically non-rigid shape recovery and articulated pose estimation have evolved as separate fields. Recent methods for non-rigid shape recovery have focused on improving the algorithmic formulation, but have only considered the case of reconstruction from point-to-point correspondences. In contrast, many techniques for pose estimation have followed a discriminative approach, which allows for the use of more general image cues. However, these techniques typically require large training sets and suffer from the fact that standard discriminative methods do not enforce constraints between output dimensions. In this paper, we combine ideas from both domains and propose a unified framework for articulated pose estimation and 3D surface reconstruction. We address some of the issues of discriminative methods by explicitly constraining their prediction. Furthermore, our formulation allows for the combination of generative and discriminative methods into a single, common framework. 1.
Non-Rigid Structure from Locally-Rigid Motion
"... We introduce locally-rigid motion, a general framework for solving the M-point, N-view structure-from-motion problem for unknown bodies deforming under orthography. The key idea is to first solve many local 3-point, N-view rigid problems independently, providing a “soup ” of specific, plausibly rigi ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
We introduce locally-rigid motion, a general framework for solving the M-point, N-view structure-from-motion problem for unknown bodies deforming under orthography. The key idea is to first solve many local 3-point, N-view rigid problems independently, providing a “soup ” of specific, plausibly rigid, 3D triangles. The main advantage here is that the extraction of 3D triangles requires only very weak assumptions: (1) deformations can be locally approximated by near-rigid motion of three points (i.e., stretching not dominant) and (2) local motions involve some generic rotation in depth. Triangles from this soup are then grouped into bodies, and their depth flips and instantaneous relative depths are determined. Results on several sequences, both our own and from related work, suggest these conditions apply in diverse settings—including very challenging ones (e.g., multiple deforming bodies). Our starting point is a novel linear solution to 3-point structure from motion, a problem for which no general algorithms currently exist. 1.
Capturing 3D stretchable surfaces from single images in closed form
- In CVPR
, 2009
"... We present a closed-form solution to the problem of recovering the 3D shape of a non-rigid potentially stretchable surface from 3D-to-2D correspondences. In other words, we can reconstruct a surface from a single image without a priori knowledge of its deformations in that image. State-of-the-art so ..."
Abstract
-
Cited by 3 (2 self)
- Add to MetaCart
We present a closed-form solution to the problem of recovering the 3D shape of a non-rigid potentially stretchable surface from 3D-to-2D correspondences. In other words, we can reconstruct a surface from a single image without a priori knowledge of its deformations in that image. State-of-the-art solutions to non-rigid 3D shape recovery rely on the fact that distances between neighboring surface points must be preserved and are therefore limited to inelastic surfaces. Here, we show that replacing the inextensibility constraints by shading ones removes this limitation while still allowing 3D reconstruction in closed-form. We demonstrate our method and compare it to an earlier one using both synthetic and real data. 1.
Exploring Ambiguities for Monocular Non-Rigid Shape Estimation
"... Recovering the 3D shape of deformable surfaces from single images is difficult because many different shapes have very similar projections. This is commonly addressed by restricting the set of possible shapes to linear combinations of deformation modes and by imposing additional geometric constraint ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Recovering the 3D shape of deformable surfaces from single images is difficult because many different shapes have very similar projections. This is commonly addressed by restricting the set of possible shapes to linear combinations of deformation modes and by imposing additional geometric constraints. Unfortunately, because image measurements are noisy, such constraints do not always guarantee that the correct shape will be recovered. To overcome this limitation, we introduce an efficient approach to exploring the set of solutions of an objective function based on point-correspondences and to proposing a small set of candidate 3D shapes. This allows the use of additional image information to choose the best one. As a proof of concept, we use either motion or shading cues to this end and show that we can handle a complex objective function without having to solve a difficult non-linear minimization problem. Key words: 3D shape recovery, deformation model, nonrigid surfaces. 1
EPFL-CVLab
"... It has recently been shown that deformable 3D surfaces couldberecoveredfromsinglevideostreams. However,existing techniques either require a reference view in which theshapeof the surfaceis knownapriori,which oftenmay not be available, or require tracking points over long sequences,whichishardtodo. I ..."
Abstract
- Add to MetaCart
It has recently been shown that deformable 3D surfaces couldberecoveredfromsinglevideostreams. However,existing techniques either require a reference view in which theshapeof the surfaceis knownapriori,which oftenmay not be available, or require tracking points over long sequences,whichishardtodo. Inthispaper,weovercometheselimitations. Tothisend, we establish correspondences between pairs of frames in whichtheshapeisdifferentandunknown. Wethenestimate homographiesbetweencorrespondinglocalplanarpatches in both images. These yield approximate 3D reconstructionsof pointswithin eachpatchup to a scale factor. Since we consider overlapping patches, we can enforce them to beconsistentoverthewholesurface. Finally,alocaldeformation model is used to fit a triangulated mesh to the 3D point cloud, which makes the reconstruction robust to both noiseandoutliersintheimagedata. 1.
Informàtica Industrial (CSIC-UPC)
"... Recent works have shown that 3D shape of non-rigid surfaces can be accurately retrieved from a single image given a set of 3D-to-2D correspondences between that image and another one for which the shape is known. However, existing approaches assume that such correspondences can be readily establishe ..."
Abstract
- Add to MetaCart
Recent works have shown that 3D shape of non-rigid surfaces can be accurately retrieved from a single image given a set of 3D-to-2D correspondences between that image and another one for which the shape is known. However, existing approaches assume that such correspondences can be readily established, which is not necessarily true when large deformations produce significant appearance changes between the input and the reference images. Furthermore, it is either assumed that the pose of the camera isknown, or the estimatedsolution ispose-ambiguous. Inthispaperwerelaxalltheseassumptionsand,givena setof3Dand2Dunmatchedpoints,wepresentanapproach tosimultaneouslysolvetheircorrespondences,computethe camera pose and retrieve the shape of the surface in the
Implicitly Constrained Gaussian Process Regression for Monocular Non-Rigid Pose Estimation
"... Estimating 3D pose from monocular images is a highly ambiguous problem. Physical constraints can be exploited to restrict the space of feasible configurations. In this paper we propose an approach to constraining the prediction of a discriminative predictor. We first show that the mean prediction of ..."
Abstract
- Add to MetaCart
Estimating 3D pose from monocular images is a highly ambiguous problem. Physical constraints can be exploited to restrict the space of feasible configurations. In this paper we propose an approach to constraining the prediction of a discriminative predictor. We first show that the mean prediction of a Gaussian process implicitly satisfies linear constraints if those constraints are satisfied by the training examples. We then show how, by performing a change of variables, a GP can be forced to satisfy quadratic constraints. As evidenced by the experiments, our method outperforms state-of-the-art approaches on the tasks of rigid and non-rigid pose estimation. 1
Mathieu Salzmann
"... We present a closed-form solution to the problem of recovering the 3D shape of a non-rigid potentially stretchable surface from 3D-to-2D correspondences. In other words, we can reconstruct a surface from a single image without a priori knowledge of its deformations in that image. State-of-the-art so ..."
Abstract
- Add to MetaCart
We present a closed-form solution to the problem of recovering the 3D shape of a non-rigid potentially stretchable surface from 3D-to-2D correspondences. In other words, we can reconstruct a surface from a single image without a priori knowledge of its deformations in that image. State-of-the-art solutions to non-rigid 3D shape recovery rely on the fact that distances between neighboring surface points must be preserved and are therefore limited to inelastic surfaces. Here, we show that replacing the inextensibility constraints by shading ones removes this limitation while still allowing 3D reconstruction in closed-form. We demonstrate our method and compare it to an earlier one using both synthetic and real data. 1.
CVPR #774 CVPR 2012 Submission #774. CONFIDENTIAL REVIEW COPY. DO NOT DISTRIBUTE. CVPR
"... Latent variable models provide valuable compact representations for learning and inference in many computer vision tasks. However, most existing models cannot directly encode prior knowledge about the specific problem at hand. In this paper, we introduce a constrained latent variable model whose gen ..."
Abstract
- Add to MetaCart
Latent variable models provide valuable compact representations for learning and inference in many computer vision tasks. However, most existing models cannot directly encode prior knowledge about the specific problem at hand. In this paper, we introduce a constrained latent variable model whose generated output inherently accounts for such knowledge. To this end, we propose an approach that explicitly imposes equality and inequality constraints on the model’s output during learning, thus avoiding the computational burden of having to account for these constraints at inference. Our learning mechanism can exploit non-linear kernels, while only involving sequential closedform updates of the model parameters. We demonstrate the effectiveness of our constrained latent variable model on the problem of non-rigid 3D reconstruction from monocular images, and show that it yields qualitative and quantitative improvements over several baselines. 1.
Resolving Occlusion in Multiframe Reconstruction of Deformable Surfaces
"... Occlusion is troublesome for almost all computer vision algorithms. To a certain extent, the difficulty is alleviated when multiple frames are given. On the other hand, when we consider the recovery of shapes of moving deformable objects, observed using a monocular camera, the problem appears diffic ..."
Abstract
- Add to MetaCart
Occlusion is troublesome for almost all computer vision algorithms. To a certain extent, the difficulty is alleviated when multiple frames are given. On the other hand, when we consider the recovery of shapes of moving deformable objects, observed using a monocular camera, the problem appears difficult again. In this paper, we show a method that outperforms previous approaches to reconstruction when feature data is unavailable, perhaps due to occlusion. Our key intuition is that portions of the surface that are visible in some frame can be reliably reconstructed in that frame; further, the reliable portions can be stitched together to find even missing portions, much the way a human eye would hallucinate. Our techniques are based on optimization in Riemannian shape spaces, and is demonstrated on isometric surfaces without involving any kind of machine learning methods. 1.

