Results 1 - 10
of
17
A Survey on Visual Content-Based Video Indexing and Retrieval
"... Abstract—Video indexing and retrieval have a wide spectrum of promising applications, motivating the interest of researchers worldwide. This paper offers a tutorial and an overview of the landscape of general strategies in visual content-based video indexing and retrieval, focusing on methods for vi ..."
Abstract
-
Cited by 24 (0 self)
- Add to MetaCart
(Show Context)
Abstract—Video indexing and retrieval have a wide spectrum of promising applications, motivating the interest of researchers worldwide. This paper offers a tutorial and an overview of the landscape of general strategies in visual content-based video indexing and retrieval, focusing on methods for video structure analysis, including shot boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, video retrieval including query interfaces, similarity measure and relevance feedback, and video browsing. Finally, we analyze future research directions. Index Terms—Feature extraction, video annotation, video browsing, video retrieval, video structure analysis. I.
Diverse Sequential Subset Selection for Supervised Video Summarization
"... Video summarization is a challenging problem with great application potential. Whereas prior approaches, largely unsupervised in nature, focus on sampling use-ful frames and assembling them as summaries, we consider video summarization as a supervised subset selection problem. Our idea is to teach t ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
Video summarization is a challenging problem with great application potential. Whereas prior approaches, largely unsupervised in nature, focus on sampling use-ful frames and assembling them as summaries, we consider video summarization as a supervised subset selection problem. Our idea is to teach the system to learn from human-created summaries how to select informative and diverse subsets, so as to best meet evaluation metrics derived from human-perceived quality. To this end, we propose the sequential determinantal point process (seqDPP), a proba-bilistic model for diverse sequential subset selection. Our novel seqDPP heeds the inherent sequential structures in video data, thus overcoming the deficiency of the standard DPP, which treats video frames as randomly permutable items. Mean-while, seqDPP retains the power of modeling diverse subsets, essential for summa-rization. Our extensive results of summarizing videos from 3 datasets demonstrate the superior performance of our method, compared to not only existing unsuper-vised methods but also naive applications of the standard DPP model. 1
VISTO: VIsual STOryboard for Web Video Browsing ABSTRACT
"... Web video browsing is rapidly becoming a very popular activity in the Web scenario, causing the production of a concise video content representation a real need. Currently, static video summary techniques can be used to this aim. Unfortunately, they require long processing time and hence all the sum ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
(Show Context)
Web video browsing is rapidly becoming a very popular activity in the Web scenario, causing the production of a concise video content representation a real need. Currently, static video summary techniques can be used to this aim. Unfortunately, they require long processing time and hence all the summaries are produced in advance without any users customization. With an increasing number of videos and with the large users heterogeneousness, this is a burden. In this paper we propose VISTO, a summarization technique that produces customized on-the-fly video storyboards. The mechanism uses a fast clustering algorithm that selects the most representative frames using their HSV color distribution and allows users to select the storyboard length and the processing time. An objective and subjective evaluation shows that the storyboards are produced with good quality and in a time that allows on-the-fly usage. Categories and Subject Descriptors
Vehicle Tracking in Occlusion and Clutter
"... I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. ii Vehicle tracking in environments containing occlusion ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
(Show Context)
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. ii Vehicle tracking in environments containing occlusion and clutter is an active research area. The problem of tracking vehicles through such environments presents a variety of challenges. These challenges include vehicle track initialization, tracking an unknown number of targets and the variations in real-world lighting, scene conditions and camera vantage. Scene clutter and target occlusion present additional challenges. A stochastic framework is proposed which allows for vehicles tracks to be identified from a sequence of images. The work focuses on the identification of vehicle tracks present in transportation scenes, namely, vehicle movements at intersections. The framework combines background subtraction and motion history based approaches to deal with the segmentation problem. The tracking problem is solved using a Monte Carlo Markov Chain Data Association (MCMCDA) method. The method includes a novel concept of including the notion of discrete, independent regions in the MCMC scoring function. Results are presented which show that the framework is capable of tracking vehicles in scenes containing multiple vehicles that occlude one another, and that are occluded by foreground scene objects. iii Acknowledgments I would like to express my sincerest appreciation to Professor David Clausi and Professor Paul Fieguth for their support and guidance in both scholastic and personal matters over the course of my Masters research. I would also like to acknowledge the support of the Ontario Centres of Excellence
VSUMM: An Approach Based on Color Features for Automatic Summarization and a Subjective Evaluation Method ∗†
"... Abstract—The fast evolution of digital video has brought many new multimedia applications and, as a consequence, research into new technologies that aim at improving the effectiveness and efficiency of video acquisition, archiving, cataloging and indexing, as well as increasing the usability of stor ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract—The fast evolution of digital video has brought many new multimedia applications and, as a consequence, research into new technologies that aim at improving the effectiveness and efficiency of video acquisition, archiving, cataloging and indexing, as well as increasing the usability of stored videos. Among all possible research areas, video summarization is one of the most important topics, which may enable a quick browsing of a large collection of video data and to achieve efficient content access and representation. Essentially, this research area consists of automatically generating a short summary of a video, which can either be a static summary or a dynamic summary. In this paper, we present VSUMM, a methodology for the development of static video summaries. The method is based on color feature extraction from video frames and unsupervised classification. We also develop a new subjective method to evaluate video static summaries. The video summaries are manually created by users and compared with different approaches found in the literature. Experimental results show – with a confidence level of 98 % – that the proposed solution provided static video summaries with superior quality relative to the approaches to which it was compared. Keywords-Video summarization; Static video summary; Keyframes; Subjective evaluation; Clustering; Color histogram. I.
Kernel Methods for Unsupervised Domain Adaptation
, 2015
"... This thesis concludes a wonderful four-year journey at USC. I would like to take the chance to express my sincere gratitude to my amazing mentors and friends during my Ph.D. training. First and foremost I would like to thank my adviser, Prof. Fei Sha, without whom there would be no single page of th ..."
Abstract
- Add to MetaCart
This thesis concludes a wonderful four-year journey at USC. I would like to take the chance to express my sincere gratitude to my amazing mentors and friends during my Ph.D. training. First and foremost I would like to thank my adviser, Prof. Fei Sha, without whom there would be no single page of this thesis. Fei is smart, knowledgeable, and inspiring. Being truly fortunate, I got an enormous amount of guidance and support from him, financially, academically, and emotionally. He consistently and persuasively conveyed the spirit of adventure in research and academia of which I appreciate very much and from which my interests in trying out the faculty life start. On one hand, Fei is tough and sets a high standard on my research at “home”— the TEDS lab he leads. On the other hand, Fei is enthusiastically supportive when I reach out to conferences and the job market. These combined make a wonderful mix. I cherish every mind-blowing discussion with him, which sometimes lasted for hours. I would like to thank our long-term collaborator, Prof. Kristen Grauman, whom I see as my other academic adviser. Like Fei, she has set such a great model for me to follow on the road of becoming a good researcher. She is a deep thinker, a fantastic writer, and a hardworking professor. I will never forget how she praised our good work, how she hesitated on my poor
Reusing a Compound-Based Infrastructure for Searching and Annotating Video Stories
"... Abstract-The fast evolution of technology has led to a growing demand for multimedia data, increasing the amount of research into efficient systems to manage those materials. Significant in those efforts is the work being done by the Content-Based Image Retrieval (CBIR) community in processing and r ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract-The fast evolution of technology has led to a growing demand for multimedia data, increasing the amount of research into efficient systems to manage those materials. Significant in those efforts is the work being done by the Content-Based Image Retrieval (CBIR) community in processing and retrieving images, along with their further combination with annotations. Nowadays, images play a key role in digital applications. Contextual integration of images with different sources is vital – it involves reusing and aggregating a large amount of information with other media types. In particular, if we consider video data, annotations can be used to summarize textual descriptions and metadata, while images can be used to summarize videos into storyboards, providing an easy way to navigate and to
Multimedia Tools and Applications manuscript No.
"... (will be inserted by the editor) Scalable storyboards in handheld devices: applications and evaluation metrics ..."
Abstract
- Add to MetaCart
(Show Context)
(will be inserted by the editor) Scalable storyboards in handheld devices: applications and evaluation metrics
VideoTrees: Improving the presentation of video surrogates using hierarchy
"... As the amount of available video content increases, so does the need for better ways of browsing all this material. Because the na-ture of video makes it hard to process, the need arises for adequate surrogates for video that can readily be skimmed and browsed. This paper explores the effects of the ..."
Abstract
- Add to MetaCart
(Show Context)
As the amount of available video content increases, so does the need for better ways of browsing all this material. Because the na-ture of video makes it hard to process, the need arises for adequate surrogates for video that can readily be skimmed and browsed. This paper explores the effects of the use of hierarchy in a pic-torial summary of keyframes. A novel type of video surrogate is presented: the VideoTree. To test whether using hierarchy makes for a better user experience and performance, a prototype browser was developed and tested in a preliminary usability study. Users performed better using the VideoTrees browser than using a regular storyboard-based browser. They also found it more flex-ible, yet more difficult and confusing to use. 1.
Video summarization Video annotation
"... Compact composite descriptors Self-growing and self-organized neural gas network the most representative example of multimedia data. More partic-ularly, every single day a quite large quantity of digital videos is produced and therefore, the amount of information seems huge and uncontrollable. Thus, ..."
Abstract
- Add to MetaCart
(Show Context)
Compact composite descriptors Self-growing and self-organized neural gas network the most representative example of multimedia data. More partic-ularly, every single day a quite large quantity of digital videos is produced and therefore, the amount of information seems huge and uncontrollable. Thus, video web sites have become over-crowded, since they have to deal at the same time with this high amount of data and with the visiting users. It is no coincidence that in 2011, YouTube video-sharing website, had more than 1 trillion n of many mmed up al videos to their store and transmission and on the other hand as th ysis of their visual content. As far as the second goal is con the interest of the researches can be considered that is focu methods, that are connected either with the generation of a repre-sentative video abstraction or with an automatic annotation of the semantic content of each single video (Larson et al., 2011; Money