ClassView: Hierarchical Video Shot Classification, Indexing, and Accessing (2004)
Venue: | IEEE TRANS. ON MULTIMEDIA |
Citations: | 42 - 4 self |
Citations
4373 | Induction of decision trees
- Quinlan
- 1986
Citation Context ...feature-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for supervised data classification [33]–[35], but it may consist of too many internal nodes which are consequently very difficult to comprehend and interpret. Even after pruning, the decision tree structures induced by the existing machine... |
2843 | Pattern classification
- Duda, Hart, et al.
- 2001
Citation Context ...ication rules for the visual concept nodes at the scene level in the classifier. The posterior probability of a video shot with the feature values being in a video scene can be computed via Bayes law [47]: where and indicate the conditional probabilities for the presence and absence of the video scene , (7) (8) FAN et al.: ClassView: HIERARCHICAL VIDEO SHOT CLASSIFICATION 77 Fig. 7. Bottom-up procedur... |
2750 | R-trees: a dynamic index structure for spatial searching
- Guttman
- 1984
Citation Context ...nsitive video classification problem: When very large video data set comes into view, efficient video database indexing can no longer be ignored [12]. However, the traditional database indexing trees [13]–[18], such as R-tree, SR-tree, and SS-tree, are unsuitable for video database indexing and management because of the curse of dimensionality [49]. Video retrieval can be performed in an efficient way... |
1976 | Introduction to WordNet: An On-line Lexical Database
- Miller, Beckwith, et al.
- 1993
Citation Context ...ical tree structure of our semantics-sensitive video classifier is derived from the domain-dependent concept hierarchy of video contents and is provided by domain experts or obtained by using WordNet [43], [44]. Each visual concept node in this classifier defines a specific semantic visual concept which makes sense to human beings, the contextual and logical relationships between the higher level visu... |
1618 | Content-based image retrieval at the end of the early years.
- Smeulders, Worring, et al.
- 2000
Citation Context ...r from the following challenging problems. Semantics-sensitive video classification problem: When very large video data set comes into view, efficient video database indexing can no longer be ignored [12]. However, the traditional database indexing trees [13]–[18], such as R-tree, SR-tree, and SS-tree, are unsuitable for video database indexing and management because of the curse of dimensionality [49... |
1052 | Query by image and video content: the QBIC system.
- Flickner, et al.
- 1995
Citation Context ... in video databases [1], [2]. The recent development of content-based video retrieval systems has advanced our capabilities for searching videos via color, layout, texture, motion, and shape features [3]–[11]. However, these content-based video retrieval systems still suffer from the following challenging problems. Semantics-sensitive video classification problem: When very large video data set comes... |
722 | CURE: An efficient clustering algorithm for large databases
- Guha, Rastogi, et al.
- 1998
Citation Context ...sual features and high-level semantic visual concepts [22]–[32]. The traditional pure feature-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for supervised data classification [33]–[35], but it may consist of too many internal nodes which are consequently very difficult to comprehend and inter... |
709 | Efficient and effective clustering methods for spatial data mining
- Ng, Han
- 1994
Citation Context ...el visual features and high-level semantic visual concepts [22]–[32]. The traditional pure feature-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for supervised data classification [33]–[35], but it may consist of too many internal nodes which are consequently very difficult to comprehend and ... |
652 | Relevance Feedback: A Power Tool for Interactive Content-Based Image Retrieval
- Rui, Huang, et al.
- 1998
Citation Context ...e the naive users can exchange their subjective judgments with the database system interactively, the online relevance feedback approach is more suitable for serving a large population of naive users [5], [28], [29]. [p. 84, IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 6, NO. 1, FEBRUARY 2004] However, the conventional online relevance feedback techniques suffer from the following problems when they are applied ... |
590 | The X-tree: An index structure for high-dimensional data
- Berchtold
- 1996
Citation Context ...ve video classification problem: When very large video data set comes into view, efficient video database indexing can no longer be ignored [12]. However, the traditional database indexing trees [13]–[18], such as R-tree, SR-tree, and SS-tree, are unsuitable for video database indexing and management because of the curse of dimensionality [49]. Video retrieval can be performed in an efficient way by c... |
576 | BIRCH: an efficient data clustering method for very large databases. - Zhang, Ramakrishnan, et al. - 1996 |
551 | SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries
- Wang, Li, et al.
- 2001
Citation Context ...o classify the unlabeled video clips to the known semantic visual concepts defined by the domain-dependent concept hierarchy so that more efficient video database indexing and access can be supported [26]. After the classification, the unlabeled video shots inherit the semantic labels assigned for the visual concept nodes they belong to, thus automatic video annotation is supported by using the widely... |
542 | Photobook: Content-based manipulation of image databases. - Pentland, Picard, et al. - 1994 |
438 | The SR-tree: An index structure for high-dimensional nearest neighbor queries - Katayama, Satoh - 1997 |
345 | Similarity indexing with the SS-tree - White, Jain - 1996 |
235 | Relevance feedback in image retrieval: A comprehensive review. Multimedia Systems
- Zhou, Huang
- 2003
Citation Context ...ode on the classifier and database indexing structure), its feature subspace, feature weights (i.e., importances) and classification rule are predetermined without considering the user’s subjectivity [50]–[52]. The same domain-dependent concept hierarchy is used as the inherent database indexing tree structure for video management and holden for all the users. While it is very important to enable real... |
218 | The TV-tree: an index structure for high dimensional data - Lin, Jagadish, et al. - 1995 |
211 | Mindreader: Querying databases through multiple examples.
- Ishikawa, Subramanya, et al.
- 1998
Citation Context ...importance for video shot representation and classification for the corresponding visual concept node. Instead of searching the weights from the high-dimensional original feature space (i.e., ) [28], [29], we first use decision tree to obtain the feature subsets, and , for the corresponding visual concept node [33]–[35]. Determining the feature subsets first via decision tree has reduced the search bu... |
207 | Small sample size effects in statistical pattern recognition: Recommendations for practitioners.
- Raudys, Jain
- 1991
Citation Context ...12]. However, the traditional database indexing trees [13]–[18], such as R-tree, SR-tree, and SS-tree, are unsuitable for video database indexing and management because of the curse of dimensionality [49]. Video retrieval can be performed in an efficient way by classifying the similar videos into the same cluster [10], [11]. Unfortunately, there is a semantic gap between low-level visual features and ... |
187 | The hB-tree: A multiattribute indexing method with good guaranteed performance - Lomet, Salzberg - 1990 |
168 | Interactive Learning using a Society of Models - Minka, Picard - 1996 |
138 | Dimensionality Reduction Using Genetic Algorithms
- Raymer, Punch, et al.
- 2000
Citation Context ...xing are normally in high-dimensions [49]. One reasonable solution is first to classify videos into a set of clusters and then to perform the dimension reduction on these clusters independently [41], [42], the traditional database indexing trees can supposedly be used for indexing these video clusters independently with relatively low-dimensional features. However, the pure feature-based clustering te... |
123 | Video visualization for compact presentation and fast browsing of pictorial content. Circuits and Systems for Video Technology
- Yeung, Yeo
- 1997
Citation Context ...ase browsing can be supported. III. SEMANTICS-SENSITIVE VIDEO CLASSIFIER Video analysis and feature extraction are necessary steps for supporting hierarchical semantics-sensitive video classification [45]. In our approach, a MPEG video sequence is first partitioned into a set of video shots by using our automatic video shot detection technique. In general, threshold setting plays a critical role in au... |
114 | Automated construction of classifications: conceptual clustering versus numerical taxonomy, IEEE Trans. on
- Michalski, Stepp
- 1983
Citation Context ...efficient way by classifying the similar videos into the same cluster [10], [11]. Unfortunately, there is a semantic gap between low-level visual features and high-level semantic visual concepts [22]–[32]. The traditional pure feature-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for supervis... |
102 | An Integrated System for Content-based Video Retrieval and Browsing.
- Zhang, Wu, et al.
- 1997
Citation Context ...le for video database indexing and management because of the curse of dimensionality [49]. Video retrieval can be performed in an efficient way by classifying the similar videos into the same cluster [10], [11]. Unfortunately, there is a semantic gap between low-level visual features and high-level semantic visual concepts [22]–[32]. The traditional pure feature-based data clustering techniques are un... |
83 | Constructing table-of-content for videos
- Rui, Huang, et al.
- 1999
Citation Context ... hierarchical video database browsing because of the lack of efficient video summary presentation structure [12]. In order to support video browsing, some pioneer works have been proposed in the past [36]–[39]. However, these existing techniques just focus on browsing a video sequence and they did not address how to support the concept-oriented hierarchical video database browsing [40]. A key issue to... |
80 | Clustering methods for video browsing and annotation
- Zhong, Zhang, et al.
- 1996
Citation Context ...sed in the past [36]–[39]. However, these existing techniques just focus on browsing a video sequence and they did not address how to support the concept-oriented hierarchical video database browsing [40]. A key issue to the concept-oriented hierarchical video database browsing is whether the visual summaries found make sense to the naive users and how to interpret the contextual and logical relations... |
73 | Virage video engine
- Hampapur, Gupta, et al.
- 1997
Citation Context ...ual concepts and enable concept-oriented hierarchical video database browsing [12]. – Query-by-keywords is also used in some content-based video retrieval systems based on manual text annotation [3], [6]. The keywords, which are used for describing and indexing the videos in the database, are subjectively added by database constructionist without a well-defined structure. Since the keywords used for ... |
65 | NeTra-V: toward an object-based video representation - Deng, Manjunath - 1998 |
61 | Relevance feedback techniques in image retrieval.
- Rui, Huang
- 2001
Citation Context ... high importance for video shot representation and classification for the corresponding visual concept node. Instead of searching the weights from the high-dimensional original feature space (i.e., ) [28], [29], we first use decision tree to obtain the feature subsets, and , for the corresponding visual concept node [33]–[35]. Determining the feature subsets first via decision tree has reduced the sea... |
61 | Unifying keywords and visual contents in image retrieval
- Zhou, Huang
Citation Context ...n the classifier and database indexing structure), its feature subspace, feature weights (i.e., importances) and classification rule are predetermined without considering the user’s subjectivity [50]–[52]. The same domain-dependent concept hierarchy is used as the inherent database indexing tree structure for video management and holden for all the users. While it is very important to enable real-time... |
56 | Rule-based video classification system for basketball video indexing
- Zhou, Vellaikal, et al.
- 2000
Citation Context ...n an efficient way by classifying the similar videos into the same cluster [10], [11]. Unfortunately, there is a semantic gap between low-level visual features and high-level semantic visual concepts [22]–[32]. The traditional pure feature-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for sup... |
50 | Query by video clip
- Jain, Vailaya, et al.
- 1999
Citation Context ...video databases [1], [2]. The recent development of content-based video retrieval systems has advanced our capabilities for searching videos via color, layout, texture, motion, and shape features [3]–[11]. However, these content-based video retrieval systems still suffer from the following challenging problems. Semantics-sensitive video classification problem: When very large video data set comes into... |
50 | ViBE: A compressed video database structured for active browsing and search
- Taskiran, Chen, et al.
- 2004
Citation Context ...aracterize video in the database: shot-based and object-based. In this paper, we focus on the shot-based approach because video shots are good choice as the basic unit for video content indexing [36]–[38]. In order to support more efficient video database management, we classify video shots into a set of hierarchical database management units as shown in Fig. 1. In order to achieve hierarchical video ... |
48 | MediaNet: a multimedia information network for knowledge representation
- Benitez, Smith, et al.
- 2000
Citation Context ...ree structure of our semantics-sensitive video classifier is derived from the domain-dependent concept hierarchy of video contents and is provided by domain experts or obtained by using WordNet [43], [44]. Each visual concept node in this classifier defines a specific semantic visual concept which makes sense to human beings, the contextual and logical relationships between the higher level visual con... |
45 | An Automatic Hierarchical Image Classification Scheme - Huang, Kumar, et al. - 1998 |
37 | Semantic clustering and querying on heterogeneous features for visual data - Sheikholeslami, Chang, et al. - 1998 |
27 | Adaptive nearest neighbor search for relevance feedback in large image databases
- Wu, Manjunath
- 2001
Citation Context ...s have been done to integrate the online relevance feedback with the inherent database indexing structure, thus the conventional online relevance feedback techniques cannot scale to the database size [53]. The conventional nearest neighbor search is also unsuitable for supporting online relevance feedback because it treats all the visual features with the same importance. If the naive users do not hav... |
26 | A statistical approach to decision tree modeling
- Jordan
- 1994
Citation Context ...re-based data clustering techniques are unsuitable for video classification because of the semantic gap [19]–[21]. Decision tree classifier is also widely used for supervised data classification [33]–[35], but it may consist of too many internal nodes which are consequently very difficult to comprehend and interpret. Even after pruning, the decision tree structures induced by the existing machine lear... |
26 | Clustering and singular value decomposition for approximate indexing in high dimensional spaces.
- Thomasian, Castelli, et al.
- 1998
Citation Context ...d indexing are normally in high-dimensions [49]. One reasonable solution is first to classify videos into a set of clusters and then to perform the dimension reduction on these clusters independently [41], [42], the traditional database indexing trees can supposedly be used for indexing these video clusters independently with relatively low-dimensional features. However, the pure feature-based cluster... |
26 | Advanced Database Indexing.
- Manolopoulos, Theodoridis, et al.
- 2000
Citation Context ...in each group is less than a predefined threshold , where is the total number of video shots in the group, and is the dimensions of the discriminating features for the corresponding leaf cluster node [48]. A. Integrated Video Query Our hierarchical video database indexing structure can also support more powerful query-by-example. As mentioned above, the naive users can select two approaches to achieve... |
24 | A fully automatic content-based video search engine supporting multi-object spatio-temporal queries - Chang, Chen, et al. - 1998 |
22 | Scenic classification methods for image and video database - Yu, Wolf - 1995 |
21 | Name-It: Association of face and name - Satoh, Kanade - 1997 |
20 | Automatic semantic structure reconstruction and representation generation for broadcast news - Huang, Liu, et al. - 1999 |
14 | Classification, Simplification and Dynamic Visualization of Scene Transition Graphs for Video Browsing - Yeo, Yeung - 1998 |
11 | Multimedia Knowledge Integration, Summarization and Evaluation
- Benitez, Chang
- 2002
Citation Context ...very important to generate this concept hierarchy automatically according to the user’s subjectivity, however, it is very hard if not impossible for current machine learning techniques to achieve this [54]. Using the common knowledge or the domain knowledge from the experts is a good tradeoff for us to address this hard problem now, obviously, automatic techniques are expected to be provided in the fut... |
7 | Adaptive motion-compensated video coding scheme towards content-based bit rate allocation
- Fan, Yau, et al.
- 2000
Citation Context ...eo sequence is first partitioned into a set of video shots by using our automatic video shot detection technique. In general, threshold setting plays a critical role in automatic video shot detection [46]. The thresholds for shot detection should be adapted to the activities of video contents. It is impossible to use a universal threshold that can satisfy various conditions because the thresholds for ... |
6 | Description Schemes for Video Programs, Users and devices
- Salembier, Qian, et al.
- 2000
Citation Context ...109/TMM.2003.819583 1520-9210/02$17.00 © 2004 IEEE nologies need to be developed for indexing, browsing, filtering, searching, and updating the vast amount of information available in video databases [1], [2]. The recent development of content-based video retrieval systems has advanced our capabilities for searching videos via color, layout, texture, motion, and shape features [3]–[11]. However, thes... |
6 | Active Learning for Information Retrieval: Using 3D Models As An Example, Carnegie Mellon Technical Report - Zhang, Chen |
5 | On Image Classification: City versus Landscape
- Vailaya, Jain, et al.
- 1998
Citation Context ...ideo sources: movies, video news, and medical videos. The video scene generation results from two sources are shown in Figs. 8 and 9. For one-level and two-state image classification techniques [26], [30], their classification accuracy can be achieved higher than 90%. As compared with these traditional one-level and [p. 79] TABLE I THE AVERAGE P... |
5 | VideoZoom spatial-temporal video browsing
- Smith
- 1999
Citation Context ...archical video database browsing because of the lack of efficient video summary presentation structure [12]. In order to support video browsing, some pioneer works have been proposed in the past [36]–[39]. However, these existing techniques just focus on browsing a video sequence and they did not address how to support the concept-oriented hierarchical video database browsing [40]. A key issue to the ... |
3 | Object-based multimedia content description schemes and applications for MPEG-7. Signal Processing: Image Commun
- Benitez, Paek, et al.
- 2000
Citation Context ...MM.2003.819583 1520-9210/02$17.00 © 2004 IEEE nologies need to be developed for indexing, browsing, filtering, searching, and updating the vast amount of information available in video databases [1], [2]. The recent development of content-based video retrieval systems has advanced our capabilities for searching videos via color, layout, texture, motion, and shape features [3]–[11]. However, these con... |
3 | Learning classification trees, Statist - Buntine - 1992 |