• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Using spin images for efficient object recognition in cluttered 3d scenes”, (1999)

by A E Johnson, M Hebert
Venue:IEEE Trans. on PAMI,
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 582
Next 10 →

Local features and kernels for classification of texture and object categories: a comprehensive study

by J. Zhang, S. Lazebnik, C. Schmid - International Journal of Computer Vision , 2007
"... Recently, methods based on local image features have shown promise for texture and object recognition tasks. This paper presents a large-scale evaluation of an approach that represents images as distributions (signatures or histograms) of features extracted from a sparse set of keypoint locations an ..."
Abstract - Cited by 653 (34 self) - Add to MetaCart
Recently, methods based on local image features have shown promise for texture and object recognition tasks. This paper presents a large-scale evaluation of an approach that represents images as distributions (signatures or histograms) of features extracted from a sparse set of keypoint locations and learns a Support Vector Machine classifier with kernels based on two effective measures for comparing distributions, the Earth Mover’s Distance and the χ 2 distance. We first evaluate the performance of our approach with different keypoint detectors and descriptors, as well as different kernels and classifiers. We then conduct a comparative evaluation with several state-of-the-art recognition methods on four texture and five object databases. On most of these databases, our implementation exceeds the best reported results and achieves comparable performance on the rest. Finally, we investigate the influence of background correlations on recognition performance via extensive tests on the PASCAL database, for which ground-truth object localization information is available. Our experiments demonstrate that image representations based on distributions of local features are surprisingly effective for classification of texture and object images under challenging real-world conditions, including significant intra-class variations and substantial background clutter.
(Show Context)

Citation Context

...ll changes in the position of the support region and puts more emphasis on the gradients that are near the center of the region. The SPIN descriptor, based on spin images used for matching range data =-=[26]-=-, is a rotation-invariant two-dimensional histogram of intensities within an image region. The two dimensions of the histogram are d, the distance of the center, and i, the intensity value. The entry ...

Computer Vision: Algorithms and Applications

by Richard Szeliski , 2010
"... ..."
Abstract - Cited by 252 (2 self) - Add to MetaCart
Abstract not found

A sparse texture representation using local affine regions

by Svetlana Lazebnik, Cordelia Schmid, Jean Ponce - IEEE Transactions on Pattern Analysis and Machine Intelligence , 2005
"... This article introduces a texture representation suitable for recognizing images of textured surfaces under a wide range of transformations, including viewpoint changes and non-rigid deformations. At the feature extraction stage, a sparse set of affine Harris and Laplacian regions is found in the im ..."
Abstract - Cited by 210 (15 self) - Add to MetaCart
This article introduces a texture representation suitable for recognizing images of textured surfaces under a wide range of transformations, including viewpoint changes and non-rigid deformations. At the feature extraction stage, a sparse set of affine Harris and Laplacian regions is found in the image. Each of these regions can be thought of as a texture element having a characteristic elliptic shape and a distinctive appearance pattern. This pattern is captured in an affine-invariant fashion via a process of shape normalization followed by the computation of two novel descriptors, the spin image and the RIFT descriptor. When affine invariance is not required, the original elliptical shape serves as an additional discriminative feature for texture recognition. The proposed approach is evaluated in retrieval and classi-fication tasks using the entire Brodatz database and a publicly available collection of 1000 photographs of textured surfaces taken from different viewpoints.
(Show Context)

Citation Context

...d by computing rotation-invariant descriptors over the normalized regions. In Section 3.2, we introduce two novel rotation-invariant descriptors: one based on spin images used for matching range data =-=[13]-=-, and one based on Lowe’s SIFT descriptor [27]. 3. Perform clustering on the affine-invariant descriptors to obtain a more compact repre1 For the sake of this illustration, we disregard the orthogonal...

Recognizing Objects in Range Data Using Regional Point Descriptors

by Andrea Frome, Daniel Huber, Ravi Kolluri, Thomas Bülow, Jitendra Malik - EUROPEAN CONFERENCE ON COMPUTER VISION , 2004
"... Recognition of three dimensional (3D) objects in noisy and cluttered scenes is a challenging problem in 3D computer vision. One approach that has been successful in past research is the regional shape descriptor. In this paper, we introduce two new regional shape descriptors: 3D shape contexts a ..."
Abstract - Cited by 205 (8 self) - Add to MetaCart
Recognition of three dimensional (3D) objects in noisy and cluttered scenes is a challenging problem in 3D computer vision. One approach that has been successful in past research is the regional shape descriptor. In this paper, we introduce two new regional shape descriptors: 3D shape contexts and harmonic shape contexts. We evaluate the performance of these descriptors on the task of recognizing vehicles in range scans of scenes using a database of 56 cars. We compare the two novel descriptors to an existing descriptor, the spin image, showing that the shape context based descriptors have a higher recognition rate on noisy scenes and that 3D shape contexts outperform the others on cluttered scenes.
(Show Context)

Citation Context

...t we follow in this paper. Methods which use regional point descriptors have proven successful in the context of image-based recognition [17][15][2] as well as 3D recognition and surface matching [22]=-=[13]-=-[5][21]. A regional point descriptor characterizes some property of the scene in a local support region surrounding a basis point. In our case, the descriptors characterize regional surface shape. Ide...

Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA

by Greg Mori, Jitendra Malik , 2003
"... In this paper we explore object recognition in clutter. We test our object recognition techniques on Gimpy and EZGimpy, examples of visual CAPTCHAs. A CAPTCHA ("Completely Automated Public Turing test to Tell Computers and Humans Apart") is a program that can generate and grade tests that ..."
Abstract - Cited by 202 (4 self) - Add to MetaCart
In this paper we explore object recognition in clutter. We test our object recognition techniques on Gimpy and EZGimpy, examples of visual CAPTCHAs. A CAPTCHA ("Completely Automated Public Turing test to Tell Computers and Humans Apart") is a program that can generate and grade tests that most humans can pass, yet current computer programs can't pass. EZ-Gimpy (see Fig. 1, 5), currently used by Yahoo, and Gimpy (Fig. 2,9) are CAPTCHAs based on word recognition in the presence of clutter. These CAPTCHAs provide excellent test sets since the clutter they contain is adversarial; it is designed to confuse computer programs. We have developed efficient methods based on shape context matching that can identify the word in an EZGimpy image with a success rate of 92%, and the requisite 3 words in a Gimpy image 33% of the time. The problem of identifying words in such severe clutter provides valuable insight into the more general problem of object recognition in scenes. The methods that we present are instances of a framework designed to tackle this general problem.
(Show Context)

Citation Context

...nearby pixels, which suggests the use of a log-polar coordinate system. An example is shown in Fig. 3(c). A related approach, developed for 3D data, is the spin images technique of Johnson and Hebert =-=[9]. 2.1. Matchin-=-g Framework The work by Belongie et al. [3] resulted in � extremely ��� good performance, e.g. accuracy on the MNIST handwrit��� ten digit set, as well as on a variety of 3D object recog...

On Visual Similarity Based 3D Model Retrieval

by Ding-Yun Chen, Xiao-Pei Tian, Yu-te Shen, Ming Ouhyoung , 2003
"... A large number of 3D models are created and available on the Web, since more and more 3D modelling and digitizing tools are developed for ever increasing applications. The techniques for content-based 3D model retrieval then become necessary. In this paper, a visual similarity-based 3D model retriev ..."
Abstract - Cited by 197 (4 self) - Add to MetaCart
A large number of 3D models are created and available on the Web, since more and more 3D modelling and digitizing tools are developed for ever increasing applications. The techniques for content-based 3D model retrieval then become necessary. In this paper, a visual similarity-based 3D model retrieval system is proposed.

A Large-Scale Hierarchical Multi-View RGB-D Object Dataset

by Kevin Lai, Liefeng Bo, Xiaofeng Ren, Dieter Fox
"... Abstract — Over the last decade, the availability of public image repositories and recognition benchmarks has enabled rapid progress in visual object category and instance detection. Today we are witnessing the birth of a new generation of sensing technologies capable of providing high quality synch ..."
Abstract - Cited by 160 (11 self) - Add to MetaCart
Abstract — Over the last decade, the availability of public image repositories and recognition benchmarks has enabled rapid progress in visual object category and instance detection. Today we are witnessing the birth of a new generation of sensing technologies capable of providing high quality synchronized videos of both color and depth, the RGB-D (Kinectstyle) camera. With its advanced sensing capabilities and the potential for mass adoption, this technology represents an opportunity to dramatically increase robotic object recognition, manipulation, navigation, and interaction capabilities. In this paper, we introduce a large-scale, hierarchical multi-view object dataset collected using an RGB-D camera. The dataset contains 300 objects organized into 51 categories and has been made publicly available to the research community so as to enable rapid progress based on this promising technology. This paper describes the dataset collection procedure and introduces techniques for RGB-D based object recognition and detection, demonstrating that combining color and depth information substantially improves quality of results. I.
(Show Context)

Citation Context

...image is a view of an object and we extract one set of features capturing the shape of the view and another set capturing the visual appearance. We use state-of-the-art features including spin images =-=[17]-=- from the shape retrieval community and SIFT descriptors [21] from the computer vision community. Shape features are extracted from the 3D locations of each depth pixel in physical space, expressed in...

Discriminative learning of Markov random fields for segmentation of 3d scan data

by Dragomir Anguelov, Ben Taskar, Vassil Chatalbashev, Daphne Koller, Dinkar Gupta, Geremy Heitz, Andrew Ng - In Proc. of the Conf. on Computer Vision and Pattern Recognition (CVPR , 2005
"... We address the problem of segmenting 3D scan data into objects or object classes. Our segmentation framework is based on a subclass of Markov Random Fields (MRFs) which support efficient graph-cut inference. The MRF models incorporate a large set of diverse features and enforce the preference that a ..."
Abstract - Cited by 156 (3 self) - Add to MetaCart
We address the problem of segmenting 3D scan data into objects or object classes. Our segmentation framework is based on a subclass of Markov Random Fields (MRFs) which support efficient graph-cut inference. The MRF models incorporate a large set of diverse features and enforce the preference that adjacent scan points have the same classification label. We use a recently proposed maximummargin framework to discriminatively train the model from a set of labeled scans; as a result we automatically learn the relative importance of the features for the segmentation task. Performing graph-cut inference in the trained MRF can then be used to segment new scenes very efficiently. We test our approach on three large-scale datasets produced by different kinds of 3D sensors, showing its applicability to both outdoor and indoor environments containing diverse objects. 1.
(Show Context)

Citation Context

...ta which is often noisy and sparse. The 3D scan segmentation problem has been addressed primarily in the context of detect1 ing known rigid objects for which reliable features can be extracted (e.g., =-=[11, 5]-=-). The more difficult task of segmenting out object classes or deformable objects from 3D scans requires the ability to handle previously unseen object instances or configurations. This is still an op...

Salient geometric features for partial shape matching and similarity

by Ran Gal, Daniel Cohen-or - j-TOG
"... This article introduces a method for partial matching of surfaces represented by triangular meshes. Our method matches surface regions that are numerically and topologically dissimilar, but approximately similar regions. We introduce novel local surface descriptors which efficiently represent the ge ..."
Abstract - Cited by 148 (4 self) - Add to MetaCart
This article introduces a method for partial matching of surfaces represented by triangular meshes. Our method matches surface regions that are numerically and topologically dissimilar, but approximately similar regions. We introduce novel local surface descriptors which efficiently represent the geometry of local regions of the surface. The descriptors are defined independently of the underlying triangulation, and form a compatible representation that allows matching of surfaces with different triangulations. To cope with the combinatorial complexity of partial matching of large meshes, we introduce the abstraction of salient geometric features and present a method to construct them. A salient geometric feature is a compound high-level feature of nontrivial local shapes. We show that a relatively small number of such salient geometric features characterizes the surface well for various similarity applications. Matching salient geometric features is based on indexing rotation-invariant features and a voting scheme accelerated by geometric hashing. We demonstrate the effectiveness of our method with a number of applications, such as computing self-similarity, alignments, and subparts similarity.

Fast Point Feature Histograms (FPFH) for 3D Registration

by Radu Bogdan Rusu, Nico Blodow, Michael Beetz - in In Proceedings of the International Conference on Robotics and Automation (ICRA , 2009
"... Abstract — In our recent work [1], [2], we proposed Point Feature Histograms (PFH) as robust multi-dimensional features which describe the local geometry around a point p for 3D point cloud datasets. In this paper, we modify their mathematical expressions and perform a rigorous analysis on their rob ..."
Abstract - Cited by 143 (7 self) - Add to MetaCart
Abstract — In our recent work [1], [2], we proposed Point Feature Histograms (PFH) as robust multi-dimensional features which describe the local geometry around a point p for 3D point cloud datasets. In this paper, we modify their mathematical expressions and perform a rigorous analysis on their robustness and complexity for the problem of 3D registration for overlapping point cloud views. More concretely, we present several optimizations that reduce their computation times drastically by either caching previously computed values or by revising their theoretical formulations. The latter results in a new type of local features, called Fast Point Feature Histograms (FPFH), which retain most of the discriminative power of the PFH. Moreover, we propose an algorithm for the online computation of FPFH features for realtime applications. To validate our results we demonstrate their efficiency for 3D registration and propose a new sample consensus based method for bringing two datasets into the convergence basin of a local non-linear optimizer: SAC-IA (SAmple Consensus Initial Alignment). I.
(Show Context)

Citation Context

...The ICP method has seen many improvements from its original form, from using non-linear optimization methods [7], [8], finding good initial guesses [9], [10], or estimating better point features [9], =-=[11]-=-, [12], to addressing the problem of ICP’s computational complexity [13], [14], to name a few. Our present contributions fall within the area of feature estimation and selection for point corresponden...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University