Results 1 - 10
of
21
A Fully Automated Content-Based Video Search Engine Supporting Spatiotemporal Queries
- IEEE Transactions on Circuits and Systems for Video Technology
, 1998
"... The rapidity with which digital information, particularly video, is being generated has necessitated the development of tools for efficient search of these media. Content-based visual queries have been primarily focused on still image retrieval. In this paper, we propose a novel, interactive system ..."
Abstract
-
Cited by 85 (4 self)
- Add to MetaCart
The rapidity with which digital information, particularly video, is being generated has necessitated the development of tools for efficient search of these media. Content-based visual queries have been primarily focused on still image retrieval. In this paper, we propose a novel, interactive system on the Web, based on the visual paradigm, with spatiotemporal attributes playing a key role in video retrieval. We have developed innovative algorithms for automated video object segmentation and tracking, and use real-time video editing techniques while responding to user queries. The resulting system, called VideoQ (demo available at http://www.ctr.columbia.edu/VideoQ/), is the first on-line video search engine supporting automatic objectbased indexing and spatiotemporal queries. The system performs well, with the user being able to retrieve complex video clips such as those of skiers and baseball players with ease. Index Terms---Content based, information retreival, object oriented, spat...
VideoQ: An Automated Content Based Video Search System Using Visual Cues
- In Proceedings of ACM Multimedia
, 1997
"... The rapidity with which digital information, particularly video, is being generated, has necessitated the development of tools for efficient search of these media. Content based visual queries have been primarily focussed on still image retrieval. In this paper, we propose a novel, real-time, intera ..."
Abstract
-
Cited by 75 (1 self)
- Add to MetaCart
The rapidity with which digital information, particularly video, is being generated, has necessitated the development of tools for efficient search of these media. Content based visual queries have been primarily focussed on still image retrieval. In this paper, we propose a novel, real-time, interactive system on the Web, based on the visual paradigm, with spatio-temporal attributes playing a key role in video retrieval. We have developed algorithms for automated video object segmentation and tracking and use real-time video editing techniques while responding to user queries. The resulting system performs well, with the user being able to retrieve complex video clips such as those of skiers, baseball players, with ease. 1. Introduction The ease of capture and encoding of digital images has caused a massive amount of visual information to be produced and disseminated rapidly. Hence efficient tools and systems for searching and retrieving visual information are needed. While there are...
Information retrieval on the Web
- ACM Computing Surveys
, 2000
"... In this paper we review studies of the growth of the Internet and technologies that are useful for information search and retrieval on the Web. We present data on the Internet from several different sources, e.g., current as well as projected number of users, hosts, and Web sites. Although numerical ..."
Abstract
-
Cited by 58 (0 self)
- Add to MetaCart
In this paper we review studies of the growth of the Internet and technologies that are useful for information search and retrieval on the Web. We present data on the Internet from several different sources, e.g., current as well as projected number of users, hosts, and Web sites. Although numerical figures vary, overall trends cited
NeTra-V: Towards an Object-based Video Representation
- IEEE Transactions on Circuits and Systems for Video Technology
, 1998
"... There is a growing need for new representations of video that allow not only compact storage of data but also content-based functionalities such as search and manipulation of objects. We present here a prototype system, called NeTra-V, that is currently being developed to address some of these conte ..."
Abstract
-
Cited by 57 (2 self)
- Add to MetaCart
There is a growing need for new representations of video that allow not only compact storage of data but also content-based functionalities such as search and manipulation of objects. We present here a prototype system, called NeTra-V, that is currently being developed to address some of these content related issues. The system has a twostage video processing structure: a global feature extraction and clustering stage, and a local feature extraction and object-based representation stage. Key aspects of the system include a new spatio-temporal segmentation and objecttracking scheme, and a hierarchical object-based video representation model. The spatio-temporal segmentation scheme combines the color/texture image segmentation and affine motion estimation techniques. Experimental results show that the proposed approach can handle large motion. The output of the segmentation, the alpha plane as it is referred to in the MPEG-4 terminology, can be used to compute local image properties. Thi...
Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain
- IEEE Trans. Multimedia
, 2000
"... Object-based video representation, such as the one suggested by the MPEG-4 standard, offers a framework that is better suited for object-based video indexing and retrieval. In such a framework, the concept of a "key frame" is replaced by that of a "key video object plane". In this paper, we propo ..."
Abstract
-
Cited by 17 (4 self)
- Add to MetaCart
Object-based video representation, such as the one suggested by the MPEG-4 standard, offers a framework that is better suited for object-based video indexing and retrieval. In such a framework, the concept of a "key frame" is replaced by that of a "key video object plane". In this paper, we propose a method for key video object plane selection using the shape information in the MPEG-4 compressed domain. The shape of the video object is approximated using information on the shape coding modes in the MPEG-4 bitstream. Two popular shape distance measures, the Hamming and Hausdorff distance measures, are modified to measure the similarities between the approximated shapes of the video objects. Although they feature different computational and implementation complexity tradeoffs, the corresponding algorithms achieve essentially the same performance levels in selecting key video object planes that represent efficiently the salient content of the video objects. Key words: Key video ...
Model-Based Classification of Visual Information for Content-Based Retrieval
- STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES VII, IS&T/SPIE
, 1999
"... Most existing approaches to content-based retrieval rely on query by example or user sketch based on low-level features; these are not suitable for semantic (object level) distinctions. In other approaches, information is classified according to a predefined set of classes and classification is eith ..."
Abstract
-
Cited by 17 (8 self)
- Add to MetaCart
Most existing approaches to content-based retrieval rely on query by example or user sketch based on low-level features; these are not suitable for semantic (object level) distinctions. In other approaches, information is classified according to a predefined set of classes and classification is either performed manually or using class-specific algorithms. Most of these systems lack flexibility: the user does not have the ability to define or change the classes, and new classification schemes require implementation of new class-specific algorithms and/or the input of an expert. In this paper, we present a different approach to content-based retrieval and a novel framework for classification of visual information in which (1) users define their own visual classes and classifiers are learned automatically; and (2) multiple fuzzy-classifiers and machine learning techniques are combined for automatic classification at multiple levels (region, perceptual, object-part, object and scene). We p...
VIDEX: An Integrated Generic Video Indexing Approach
- In Proceedings of the ACM Multimedia Conference
, 2000
"... ABSTRACT This paper presents an integrated generic technique for lowand high-level video indexing. The proposed approach tries to integrate the advantages of existing low- and high-level video indexing approaches by reducing their shortcomings. Furthermore, the model introduces concepts for a detail ..."
Abstract
-
Cited by 16 (2 self)
- Add to MetaCart
ABSTRACT This paper presents an integrated generic technique for lowand high-level video indexing. The proposed approach tries to integrate the advantages of existing low- and high-level video indexing approaches by reducing their shortcomings. Furthermore, the model introduces concepts for a detailed structuring of video streams, and for correlations of lowand high-level video objects. The proposed model is called generic, as it only defines a framework of classes for an integrated video indexing system. It has been verified by implementing a prototype of a distributed multimedia information system supporting content-based video retrieval. Categories and Subject Descriptors H.3.1 [Information Storage and Retrieval]: Content Analysis and Indexing--abstracting methods, indexing methods
Integration of Visual and Text-Based Approaches for the Content Labeling and Classification of Photographs
- Classification of Photographs. ACM SIGIR'99 Workshop on Multimedia Indexing and Retrieval
, 1999
"... Annotating photographs automatically with content descriptions facilitates organization, storage, and search over visual information. We present an integrated approach for scene classification that combines image-based and text-based approaches. On the text side, we use the text accompanying an imag ..."
Abstract
-
Cited by 13 (5 self)
- Add to MetaCart
Annotating photographs automatically with content descriptions facilitates organization, storage, and search over visual information. We present an integrated approach for scene classification that combines image-based and text-based approaches. On the text side, we use the text accompanying an image in a novel TF*IDF vector-based approach to classification. On the image side, we present a novel OF*IIF (object frequency) vector-based approach to classification. Objects are defined by clustering of segmented regions of training images. The image based OF*IIF approach is synergistic with the text based TF*IDF approach. By integrating the TF*IDF approach and the OF*IIF approach, we achieved a classification accuracy of 86%. This is an improvement of approximately 12% over existing image classifiers, an improvement of approximately 3% over the TF*IDF image classifier based on textual information, and an improvement of approximately 4% over the OF*IIF image classifier based on visual inform...
A Framework for Video Modelling
- In the Proc. of International Conference on Applied Informatics
, 2000
"... In recent years, research in video databases has increased greatly, but relatively little work has been done in the area of semantic content-based retrieval. In this paper, we present a framework for video modelling with emphasis on semantic content of video data. The video data model presented dist ..."
Abstract
-
Cited by 11 (5 self)
- Add to MetaCart
In recent years, research in video databases has increased greatly, but relatively little work has been done in the area of semantic content-based retrieval. In this paper, we present a framework for video modelling with emphasis on semantic content of video data. The video data model presented distinguishes four layers: the raw data layer, the feature layer, the object layer and the event layer. It supports automatic definition of high-level concepts, such as video objects and events, based on extracted features. We focus our attention on event descriptions in this paper and give two modelling examples in the medical and soccer domain to show that the proposed event grammar can be efficiently used in different domains. Key Words: multimedia, video modelling, content-based retrieval

