Download:
|
by Paul Viola, Michael Jones
http://www.ai.mit.edu/people/viola/research/publications/CVPR-2001.ps.gz
Add To MetaCart
Abstract:
This paper describes a machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates. This work is distinguished by three key contributions. The first is the introduction of a new image representation called the "Integral Image " which allows the features used by our detector to be computed very quickly. The second is a learning algorithm, based on AdaBoost, which selects a small number of critical visual features from a larger set and yields extremely efficient classifiers[6]. The third contribution is a method for combining increasingly more complex classifiers in a "cascade " which allows background regions of the image to be quickly discarded while spending more computation on promising object-like regions. The cascade can be viewed as an object specific focus-of-attention mechanism which unlike previous approaches provides statistical guarantees that discarded regions are unlikely to contain the object of interest. In the domain of face detection the system yields detection rates comparable to the best previous systems. Used in real-time applications, the detector runs at 15 frames per second without resorting to image differencing or skin color detection. 1.
Citations
|
1175
|
A decision-theoretic generalization of on-line learning and an application to boosting
– Freund, Schapire
- 1997
|
|
622
|
Neural network-based face detection
– Rowley, Baluja, et al.
- 1998
|
|
542
|
The Design and Use of Steerable Filters
– Freeman, Adelson
- 1991
|
|
498
|
Boosting the margin: A new explanation for the effectiveness of voting methods
– Schapire, Freund, et al.
- 1997
|
|
379
|
Training support vector machines: an application to face detection
– Osuna, Freund, et al.
- 1997
|
|
341
|
A model of saliency-based visualattention for rapid scene analysis
– Itti, Koch, et al.
- 1998
|
|
238
|
A statistical method of 3d object detection applied to faces and cars
– SCHNEIDERMAN, KANADE
|
|
158
|
A General Framework for Object Detection
– Papageorgiou, Oren, et al.
- 1998
|
|
157
|
P.: Boosting image retrieval
– Tieu, Viola
|
|
134
|
A: Statistical Pattern Recognition
– Webb
- 1999
|
|
125
|
Summed-area tables for texture mapping
– Crow
- 1984
|
|
124
|
Modeling Visual Attention via Selective Tuning
– Tsotsos, Culhane, et al.
- 1995
|
|
87
|
A SNoW-based face detector
– Yang, Roth, et al.
- 2000
|
|
63
|
Joint Induction of Shape Features and Tree Classifiers
– Amit, Geman, et al.
- 1997
|
|
44
|
Coarse-to-fine face detection
– Fleuret, Geman
|
|
41
|
Overcomplete steerable pyramid filters and rotation invariance
– Greenspan, Belongie, et al.
- 1994
|
|
20
|
Example-based learning for view-based face detection
– Sung, Poggio
- 1998
|
|
1
|
Overcomplete steerable pyramid filISBN 0-7695-1272-0/01 $10.00 (C) 2001 IEEE ters and rotation invariance
– Greenspan, Belongie, et al.
- 1994
|