MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Content-based classification and retrieval of audio (1998) [14 citations — 0 self]

Download:
Download as a PDF | Download as a PS
by Tong Zhang, C. -c. Jay Kuo
in SPIE’s 43rd Annual Meeting - Conference on Advanced Signal Processing Algorithms, Architectures, and Implementations VIII
http://biron.usc.edu/~tzhang/respie98.ps.gz
Add To MetaCart

Abstract:

An online audio classification and segmentation system is presented in this research, where audio recordings are classified and segmented into speech, music, several types of environmental sounds and silence based on audio content analysis. This is the first step of our continuing work towards a general content-based audio classification and retrieval system. The extracted audio features include temporal curves of the energy function, the average zerocrossing rate, the fundamental frequency of audio signals, as well as statistical and morphological features of these curves. The classification result is achieved through a threshold-based heuristic procedure. The audio database that we have built, details of feature extraction, classification and segmentation procedures, and experimental results are described. It is shown that, with the proposed new system, audio recordings can be automatically segmented and classified into basic types in real time with an accuracy of over 90%. Outlines of further classification of audio into finer types and a query-by-example audio retrieval system on top of the coarse classification are also introduced.

Citations

160 Construction and evaluation of a robust multifeature speech/music discriminator – Scheirer, Slaney - 1997
93 Content-Based Retrieval of Music and Audio – Foote - 1997
92 Digital processing of speech signal – Rabiner, Schafer - 1978
63 Real-Time Discrimination of Broadcast Speech/Music – Saunders - 1996
30 Acoustic segmentation for audio browsers – Kimber, Wilcox - 1996
6 The Master Handbook of Acoustics – Everest - 1994
2 Smoliar: "Toward Content-based Audio Indexing and Retrieval and a New Speaker Discrimination Technique", downloaded from http://www.iss.nus.sg/People/lwyse/lwyse.html – Wyse, S - 1995
2 Chamberlin: "Query By Humming - Musical Information Retrieval in An Audio Database – Ghias, Logan, et al. - 1995
2 et al.: "Content-Based Classification – Wold, Blum, et al. - 1996
2 et al.: "Audio Feature Extraction and Analysis for Scene Classification – Liu, Huang, et al. - 1997
2 Sethi: "Audio Characterization for Video Indexing – Patel, I - 1996