• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

An Approach to the Parameterization of Structure for Fast Categorization,” Int (2010)

by C Rasche
Venue:Journal of Computer Vision
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 18
Next 10 →

Content-Based Video Description for Automatic Video Genre Categorization

by Bogdan Ionescu, Klaus Seyerlehner, Christoph Rasche, Constantin Vertan
"... Abstract. In this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses action contents with respect to human perception. Further ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
Abstract. In this paper, we propose an audio-visual approach to video genre categorization. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At temporal structural level, we asses action contents with respect to human perception. Further, color perception is quantified with statistics of color distribution, elementary hues, color properties and relationship of color. The last category of descriptors determines statistics of contour geometry. An extensive evaluation of this multi-modal approach based on on more than 91 hours of video footage is presented. We obtain average precision and recall ratios within [87 % − 100%] and [77 % − 100%], respectively, while average correct classification is up to 97%. Additionally, movies displayed according to feature-based coordinates in a virtual 3D browsing environment tend to regroup with respect to genre, which has potential application with real content-based browsing systems.
(Show Context)

Citation Context

...direction histograms, edge direction coherence, which are highly low-level edge pixel statistics. Our approach in contrast, proposes a novel method which uses curve partitioning and curve description =-=[13]-=-. The contour description is based on a characterization of geometric attributes for each individual contour, e.g. degree ofContent-Based Video Description for Automatic Video Genre Categorization 57...

CoPhIR Image Collection under the Microscope

by Michal Batko, Petra Budíková, David Novak, Michal Batko, Petra Kohoutkova, David Novak - In Proc. SISAP , 2009
"... Abstract—The Content-based Photo Image Retrieval (CoPhIR) dataset is the largest available database of digital images with corresponding visual descriptors. It contains five MPEG-7 global descriptors extracted from more than 106 million images from Flickr photo-sharing system. In this paper, we anal ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
Abstract—The Content-based Photo Image Retrieval (CoPhIR) dataset is the largest available database of digital images with corresponding visual descriptors. It contains five MPEG-7 global descriptors extracted from more than 106 million images from Flickr photo-sharing system. In this paper, we analyze this dataset focusing on 1) efficiency of similarity-based indexing and searching and on 2) expressiveness of combination of the descriptors with respect to subjective perception of visual similarity. We treat the descriptors as metric spaces and then combine them into a multi-metric space. We analyze distance distributions of individual descriptors, measure intrinsic dimensionality of these datasets and statistically evaluate correlation between these descriptors. Further, we use two methods to assess subjective accuracy and satisfaction of similarity retrieval based on a combination of descriptors that is recommended for CoPhIR, and we compare these results on databases of 10 and 100 million CoPhIR images. Finally, we suggest, explore and evaluate two approaches to improve the accuracy: 1) applying logarithms in order to weaken influence of a single descriptor contribution if it deviates from the rest, and 2) the possibility of categorization of the dataset and identifying visual characteristics important for individual categories. Keywords-metric space; MPEG-7; visual descriptors; CoPhIR dataset; dataset analysis I.
(Show Context)

Citation Context

...ifficult because category instances are structurally variable. (. . . ) The variability persists at different levels ranging from the single contour to the entire configuration of features or parts.” =-=[15]-=- Our aim is not to introduce a novel approach to automatic categorization but to study how categorization of CoPhIR database could improve the similarity retrieval according to the visual descriptors ...

A Novel Structural-Description Approach For Image Retrieval

by Christoph Rasche, Constantin Vertan
"... We tested our image classification methodology in the photo-annotation task of the ImageCLEF competition [Nowak, 2010] using a visual-only approach performing automated labeling. Our labeling process consisted of three phases: 1) feature extraction using color histogramming and using a novel method ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
We tested our image classification methodology in the photo-annotation task of the ImageCLEF competition [Nowak, 2010] using a visual-only approach performing automated labeling. Our labeling process consisted of three phases: 1) feature extraction using color histogramming and using a novel method of structural description, that was exploited in a statistical manner only; 2) classification using Linear Discriminant (LD) or Average-Retrieval Rank (ARR) methods that provided the confidence (scalar) values, which were then thresholded to obtain the binary values; 3) eliminating labels (setting binary values to 0) on the testing set thereby exploiting the calculated joint-probabilities for pairs of concepts from the training set. The results show that our present system performs better on ’whole-image’ labels than on object labels.

N.: Fisher kernel based relevance feedback for multimodal video retrieval

by Bogdan Ionescu, Jasper Uijlings, Nicu Sebe
"... This paper proposes a novel approach to relevance feedback based on the Fisher Kernel representation in the context of multimodal video retrieval. The Fisher Kernel representa-tion describes a set of features as the derivative with respect to the log-likelihood of the generative probability distribu ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
This paper proposes a novel approach to relevance feedback based on the Fisher Kernel representation in the context of multimodal video retrieval. The Fisher Kernel representa-tion describes a set of features as the derivative with respect to the log-likelihood of the generative probability distribu-tion that models the feature distribution. In the context of relevance feedback, instead of learning the generative prob-ability distribution over all features of the data, we learn it only over the top retrieved results. Hence during relevance feedback we create a new Fisher Kernel representation based on the most relevant examples. In addition, we propose to use the Fisher Kernel to capture temporal information by cutting up a video in smaller segments, extract a feature vector from each segment, and represent the resulting fea-ture set using the Fisher Kernel representation. We evaluate our method on the MediaEval 2012 Video Genre Tagging Task, a large dataset, which contains 26 categories in 15.000 videos totalling up to 2.000 hours of footage. Results show that our method significantly improves results over existing state-of-the-art relevance feedback techniques. Furthermore, we show significant improvements by using the Fisher Ker-nel to capture temporal information, and we demonstrate that Fisher kernels are well suited for this task.
(Show Context)

Citation Context

...l frames. Global HoG (81 values) [32] - from this category, we compute global Histogram of oriented Gradients (HoG) over all frames. Structural descriptors (1,430 values) - the structural description =-=[33]-=- is based on a characterization of geometric attributes for each individual contour, e.g. degree of curvature, angularity, circularity, symmetry and ”wiggliness”, as proposed in [33]. These descriptor...

Background Invariant Static Hand Gesture Recognition based on Hidden Markov Models

by Radu-laurențiu Vieriu, Ionuț Mironică, Bogdan-tudor Goraș
"... Abstract — This paper addresses the problem of Static Hand Gesture Recognition (SHGR) and proposes a fast yet simple solution based on Discrete Hidden Markov Models (DHMMs) that use features extracted from the hand contours. In addition to previous work, the use of depth information ensures robustne ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract — This paper addresses the problem of Static Hand Gesture Recognition (SHGR) and proposes a fast yet simple solution based on Discrete Hidden Markov Models (DHMMs) that use features extracted from the hand contours. In addition to previous work, the use of depth information ensures robustness to the overall system, making it background invariant. Experiments carried on a challenging noisy dataset reveal the superior discriminating as well as generalizing abilities of statistical models, when compared to state-of-the-art methods. I.
(Show Context)

Citation Context

...vasive and reduce considerably the naturalness of gestures.sMore recently, in [6] fingertips are obtained by analyzingscurvature segments extracted from contours, using an approachssimilar to that in =-=[7]-=-. While the features themselves seemspromising, they do require appropriate classification algorithmssin order to use the considerable amount of information theysprovide. Low level features (e.g. appe...

Video Genre Categorization and Representation using Audio-Visual Information

by Bogdan Ionescua, Klaus Seyerlehnerb, Christoph Raschea, Constantin Vertana, Patrick Lambertc , 2012
"... We propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, w ..."
Abstract - Add to MetaCart
We propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, we consider action content in relation to human perception. Color perception is quantified using statistics of color distribution, elementary hues, color properties, and relationships between colors. Further, we compute statistics of contour geometry and relationships. The main contribution of our work lies in harnessing the descriptive power of the combination of these descriptors in genre classification. Validation was carried out on over 91 hours of video footage encompassing 7 common video genres, yielding average precision and recall ratios of 87%−100 % and 77%−100%, respectively, and an overall average correct classification of up to 97%. Also, experimental comparison as part of 1 the MediaEval 2011 benchmarking campaign demonstrated the superiority of the proposed audio-visual descriptors over other existing approaches. Finally, we discuss a 3D video browsing platform that displays movies using feature-based coordinates and thus regroups them according to genre.
(Show Context)

Citation Context

...rams, edge direction coherence, [43], which do not exploit real contour geometry and properties. Our approach, in contrast, proposes a novel method which uses curve partitioning and curve description =-=[29]-=-. The contour description is based on a characterization of geometric attributes of each individual contour, for instance, degree of curvature, angularity, and ”wiggliness”. These 17 attributes are us...

Approaching Shape Matching with the Local/Global Space Affiliation and mailing address:

by Christoph Rasche, Laboratorul De Analiza Si Prelucrarea
"... A shape matching approach is introduced, which is based on a novel curve description, namely a (lo-cal/global) amplitude space. Two matching princi-ples are tested with this description. First, a point-based (correspondence) matching is carried out with the entire amplitude space, for which the MPG7 ..."
Abstract - Add to MetaCart
A shape matching approach is introduced, which is based on a novel curve description, namely a (lo-cal/global) amplitude space. Two matching princi-ples are tested with this description. First, a point-based (correspondence) matching is carried out with the entire amplitude space, for which the MPG7 re-trieval score is 78.74%. Second, a segment-based matching with abstracted boundary segments is in-troduced, with the goal to move away from the typical constraints of point-based matching. Those segments are obtained by analyzing the local/global space. The retrieval score for this type of matching is 70.48 % and although it is lower than the former, it can be applied to gray-scale images. When the two matching metrics are combined, a retrieval score of 84.80 % is obtained, which is near top-performing, reported methods. Us-ing an optimization method for the distance matrix, the score can be driven up to 95.01 % (2nd best re-ported so far). The particular advantage of the pre-sented approach is that it allows part interpretation (irrespective of the matching type).
(Show Context)

Citation Context

...ng (optimization), and the latter with learning (see also table 1 for comparison of scores). Presented Approach The shape description used here is based on a (multi-resolution) local/global analysis (=-=Rasche, 2010-=-), in which no modification of the contour occurs - as opposed to the curvaturescale space, which is generated by lowpass filtering the contour and hence creating a fine/coarse scale (Mokhtarian and B...

classification of

by Bogdan E. Ionescu, Christoph Rasche, Constantin Vertan, Patrick Lambert
"... contour-color-action approach to automatic ..."
Abstract - Add to MetaCart
contour-color-action approach to automatic
(Show Context)

Citation Context

...ment, etc.). The method is transposed from static image indexing, where it has been successfully validated on retrieving tens of semantic concepts, e.g. outdoor, doors/entrances, fruits, people, etc. =-=[7]-=-. The main novel aspect is however the combination of all these parameters for the classification of 7 common genres. Each genre shows some specificity for these parameters (empirically determined), f...

Grouping and Description of Partitioned Segments Affiliation and mailing address:

by Christoph Rasche, Laboratorul De Analiza Si Prelucrarea
"... A methodology for the detection and geometric char-acterization of groups of segments is introduced. One set of groups focuses on a precise geometric charac-terization of the alignment of two and four segments; and on a geometric characterization of shapes up to five corners, whose outlines are obta ..."
Abstract - Add to MetaCart
A methodology for the detection and geometric char-acterization of groups of segments is introduced. One set of groups focuses on a precise geometric charac-terization of the alignment of two and four segments; and on a geometric characterization of shapes up to five corners, whose outlines are obtained from iso-contours. Another set of groups focuses on a loose geometric characterization of three or more segments. The grouping processes occur relatively fast as only keypoints are used, such as the segments end- and midpoints. The grouping output is tested in an im-age classification task, evaluated on three image col-lections (Urban&Natural, Landuse and Caltech 101), whereby a structural as well as a statistical form of representation is tested. The classification accuracy is comparable to other approaches.
(Show Context)

Citation Context

... The parameters were taken from the output of various types of image preprocessing. The input to most grouping processes are a list of partitioned segments S, partitioned by the method introduced in (=-=Rasche, 2010-=-). The segments (coarse) geometry can be straight, curved or elongated (amplitude larger than chord length) - hereafter the term ’curved’ includes the elongated case. We use the term straight for stra...

The Representative Capacity of Parameters Derived from the Radial Signature

by Christoph Rasche
"... A method for the boundary representation of ’simple ’ shapes is presented. It is based on the radial signature and exploits its extrema information to arrive at a low-dimensional geometric description (ca. 10 dimensions). This short description can represent shapes well, which is demonstrated on the ..."
Abstract - Add to MetaCart
A method for the boundary representation of ’simple ’ shapes is presented. It is based on the radial signature and exploits its extrema information to arrive at a low-dimensional geometric description (ca. 10 dimensions). This short description can represent shapes well, which is demonstrated on the Corel and MPEG7 collection. Its key advantage is the short computation duration. If this description is extended by further radial-based parameters and combined with Fourier descriptors, it leads to almost the same retrieval performance as the best-performing signature approach, which also uses Fourier descriptors. In a classification task, the radial-based descriptors clearly outperform the Fourier descriptors.
(Show Context)

Citation Context

...y shapes (in gray-scale images) that cannot be assigned clearly to a single bias only. The strength of the description lies in allowing multiple biases and avoids a ’feature classification’ (see also =-=[4]-=- for explanations). - Elongation, η: measures the spatial extent of the shape and is 0 for symmetric shapes such as circles, squares and pentagons; it is proportional to the range (rng) of radii other...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University