Content-based classification and retrieval of audio (1998) [14 citations — 0 self]
Abstract:
An online audio classification and segmentation system is presented in this research, where audio recordings are classified and segmented into speech, music, several types of environmental sounds, and silence based on audio content analysis. This is the first step of our continuing work towards a general content-based audio classification and retrieval system. The extracted audio features include temporal curves of the energy function, the average zero-crossing rate, and the fundamental frequency of audio signals, as well as statistical and morphological features of these curves. Classification is performed by a threshold-based heuristic procedure. The audio database that we have built, details of the feature extraction, classification, and segmentation procedures, and experimental results are described. It is shown that, with the proposed system, audio recordings can be automatically segmented and classified into basic types in real time with an accuracy of over 90%. Outlines of further classification of audio into finer types and of a query-by-example audio retrieval system built on top of the coarse classification are also introduced.
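The abstract names the features but gives neither their formulas nor the thresholds used. As a rough illustration only, the sketch below computes two of them, short-time energy and zero-crossing rate, over fixed-length frames and applies a toy threshold rule. The function names, frame sizes, threshold values, and output labels are all assumptions made for this example and are not taken from the paper.

```python
import math

def frames(signal, frame_len, hop):
    """Split a sequence of samples into overlapping fixed-length frames."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def label_frame(frame, silence_energy=1e-4, zcr_split=0.1):
    """Toy threshold rule. Both thresholds are illustrative assumptions."""
    if short_time_energy(frame) < silence_energy:
        return "silence"
    if zero_crossing_rate(frame) > zcr_split:
        return "high-zcr"
    return "low-zcr"

# Synthetic test signals (assumed 8 kHz sample rate).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(800)]
silence = [0.0] * 800
alternating = [1.0 if n % 2 == 0 else -1.0 for n in range(800)]

for name, sig in [("tone", tone), ("silence", silence),
                  ("alternating", alternating)]:
    labels = [label_frame(f) for f in frames(sig, 256, 128)]
    print(name, labels)
```

A real system of the kind described would track these values as curves over time and combine them with fundamental-frequency estimates and statistics of the curves before applying its heuristics; this sketch covers only the per-frame computation.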
Citations
| 160 | Construction and evaluation of a robust multifeature speech/music discriminator – Scheirer, Slaney - 1997 |
| 93 | Content-Based Retrieval of Music and Audio – Foote - 1997 |
| 92 | Digital processing of speech signals – Rabiner, Schafer - 1978 |
| 63 | Real-Time Discrimination of Broadcast Speech/Music – Saunders - 1996 |
| 30 | Acoustic segmentation for audio browsers – Kimber, Wilcox - 1996 |
| 6 | The Master Handbook of Acoustics – Everest - 1994 |
| 2 | Toward Content-based Audio Indexing and Retrieval and a New Speaker Discrimination Technique, downloaded from http://www.iss.nus.sg/People/lwyse/lwyse.html – Wyse, Smoliar - 1995 |
| 2 | Query By Humming - Musical Information Retrieval in An Audio Database – Ghias, Logan, Chamberlin, et al. - 1995 |
| 2 | Content-Based Classification – Wold, Blum, et al. - 1996 |
| 2 | Audio Feature Extraction and Analysis for Scene Classification – Liu, Huang, et al. - 1997 |
| 2 | Audio Characterization for Video Indexing – Patel, Sethi - 1996 |

