• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

DMCA

Discovering objects and their location in images (2005)

Cached

  • Download as a PDF

Download Links

  • [www.cs.ucsd.edu]
  • [cseweb.ucsd.edu]
  • [cseweb.ucsd.edu]
  • [www.ai.mit.edu]
  • [www.ri.cmu.edu]
  • [www.di.ens.fr]
  • [people.csail.mit.edu]
  • [people.csail.mit.edu]
  • [www.cs.washington.edu]
  • [www.cs.cmu.edu]
  • [luthuli.cs.uiuc.edu]
  • [www.cs.cmu.edu]
  • [www.cs.cmu.edu]
  • [people.csail.mit.edu]
  • [homes.cs.washington.edu]
  • [www.csd.uwo.ca]
  • [www.cs.cmu.edu]
  • [www.vision.jhu.edu]
  • [people.csail.mit.edu]
  • [people.csail.mit.edu]
  • [luthuli.cs.uiuc.edu]
  • [people.csail.mit.edu]
  • [www.vision.jhu.edu]
  • [www.cs.cmu.edu]
  • [bryanrussell.org]
  • [homes.cs.washington.edu]
  • [www.csd.uwo.ca]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Josef Sivic , Bryan C. Russell , Alexei A. Efros , Andrew Zisserman , William T. Freeman
Venue:In ICCV
Citations:272 - 9 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Sivic05discoveringobjects,
    author = {Josef Sivic and Bryan C. Russell and Alexei A. Efros and Andrew Zisserman and William T. Freeman},
    title = {Discovering objects and their location in images},
    booktitle = {In ICCV},
    year = {2005}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

We seek to discover the object categories depicted in a set of unlabelled images. We achieve this using a model developed in the statistical text literature: probabilistic Latent Semantic Analysis (pLSA). In text analysis this is used to discover topics in a corpus using the bag-of-words document representation. Here we treat object categories as topics, so that an image containing instances of several categories is modeled as a mixture of topics. The model is applied to images by using a visual analogue of a word, formed by vector quantizing SIFT-like region descriptors. The topic discovery approach successfully translates to the visual domain: for a small set of objects, we show that both the object categories and their approximate spatial layout are found without supervision. Performance of this unsupervised method is compared to the supervised approach of Fergus et al. [8] on a set of unseen images containing only one object per image. We also extend the bag-of-words vocabulary to include ‘doublets ’ which encode spatially local co-occurring regions. It is demonstrated that this extended vocabulary gives a cleaner image segmentation. Finally, the classification and segmentation methods are applied to a set of images containing multiple objects per image. These results demonstrate that we can successfully build object class models from an unsupervised analysis of images. 1.

Keyphrases

object category    image segmentation    bag-of-words document representation    visual analogue    topic discovery approach    small set    multiple object    object class model    approximate spatial layout    probabilistic latent semantic analysis    unsupervised analysis    statistical text literature    unsupervised method    sift-like region descriptor    unlabelled image    several category    visual domain    supervised approach    segmentation method    unseen image    local co-occurring region    text analysis   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University