Jiang, Chenfanfu; Qi, Siyuan; Zhu, Yixin; Huang, Siyuan; Lin, Jenny; Yu, Lap-Fai; Terzopoulos, Demetri; Zhu, Song-Chun
Published in
International Journal of Computer Vision
We propose a systematic learning-based approach to the generation of massive quantities of synthetic 3D scenes and arbitrary numbers of photorealistic 2D images thereof, with associated ground truth information, for the purposes of training, benchmarking, and diagnosing learning-based computer vision and robotics algorithms. In particular, we devis...
Li, Hongyang; Liu, Yu; Ouyang, Wanli; Wang, Xiaogang
Published in
International Journal of Computer Vision
In this paper, we propose a zoom-out-and-in network for generating object proposals. A key observation is that it is difficult to classify anchors of different sizes with the same set of features. Anchors of different sizes should therefore be placed at different depths within the network: smaller boxes on high-resolution layers with a smaller...
Brégier, Romain; Devernay, Frédéric; Leyrit, Laetitia; Crowley, James L.
Published in
International Journal of Computer Vision
The pose of a rigid object is usually regarded as a rigid transformation, described by a translation and a rotation. However, equating the pose space with the space of rigid transformations is in general abusive, as it does not account for objects with proper symmetries—which are common among man-made objects. In this article, we define pose as a d...
Hofmann, Martin; Seeland, Marco; Mäder, Patrick
Published in
International Journal of Computer Vision
Projecting a real-world scene onto a planar image sensor entails a loss of information about the 3D structure as well as the absolute dimensions of the scene. For image analysis and object classification tasks, however, absolute size information can make results more accurate. Today, the creation of size-annotated image datasets is effort...
Peng, Xi; Feris, Rogerio S.; Wang, Xiaoyu; Metaxas, Dimitris N.
Published in
International Journal of Computer Vision
We propose a novel method for real-time face alignment in videos based on a recurrent encoder–decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback lo...
Liu, Bo; Jing, Liping; Li, Jia; Yu, Jian; Gittens, Alex; Mahoney, Michael W.
Published in
International Journal of Computer Vision
With significant advances in imaging technology, multiple images of a person or an object are becoming readily available in a number of real-life scenarios. In contrast to single images, image sets can capture a broad range of variations in the appearance of a single face or object. Recognition from these multiple images (i.e., image set classifica...
Chen, Da; Mirebeau, Jean-Marie; Cohen, Laurent D.
Published in
International Journal of Computer Vision
In this paper, we propose a novel curvature penalized minimal path model via an orientation-lifted Finsler metric and the Euler elastica curve. The original minimal path model computes the globally minimal geodesic by solving an Eikonal partial differential equation (PDE). Essentially, this first-order model is unable to penalize curvature which is...
Xiao, Guobao; Wang, Hanzi; Yan, Yan; Suter, David
Published in
International Journal of Computer Vision
Geometric model fitting is a fundamental research topic in computer vision and it aims to fit and segment multiple-structure data. In this paper, we propose a novel superpixel-guided two-view geometric model fitting method (called SDF), which can obtain reliable and consistent results for real images. Specifically, SDF includes three main parts: a ...
Lenc, Karel; Vedaldi, Andrea
Published in
International Journal of Computer Vision
Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aimed at filling this gap, we investigate two key mathematical properties of representations: equivariance and equivalence. Equivariance studies how transformat...
Zhang, Miaomiao; Fletcher, P. Thomas
Published in
International Journal of Computer Vision
This paper introduces Fourier-approximated Lie algebras for shooting (FLASH), a fast geodesic shooting algorithm for diffeomorphic image registration. We approximate the infinite-dimensional Lie algebra of smooth vector fields, i.e., the tangent space at the identity of the diffeomorphism group, with a low-dimensional, bandlimited space. We show th...