| G. Tzanetakis and P. Cook. A framework for audio analysis based on classification and temporal segmentation. In Proc.25th Euromicro Conference. Workshop on Music Technology and Audio Processing, 1999. |
....of 98 is reported. Scheirer and Slaney [5] used thirteen features, of whicheight are extracted from the power spectrum density, for classifying audio segments. A correct classification percentage of 94.2 is reported for 20 msec segments and 98.6 for 2.4 sec segments. Tzanetakis and Cook [8] proposed a general framework for integrating, experimenting and evaluating different techniques of audio segmentation and classification. In addition they proposed a segmentation method based on feature change detection. For their experiments on a large data set a classifier performance of about ....
....of the signal is more clear in music than in speech. This is due to the fact that speechcontains frequent short pauses, where both the RMS and ZC are close to zero, and therefore correlated in this case. We exploit this possible discrimination in a feature defined for the classification. In [5] [8] and [9] the classification uses features extracted from the power spectrum density computed by the FFT as the spectral centroid, whichhowever is strongly correlated with the zero crossing rate. The maximal frequency and the pitchhave been also used, as well as the power spectrum ....
G. Tzanetakis and P. Cook. A framework for audio analysis based on classification and temporal segmentation. In Proc.25th Euromicro Conference. Workshop on Music Technology and Audio Processing, 1999.
....addition, the user can combine time regions to form a time tree that can be used for multi resolution browsing and annotation. The tree captures the hierarchical nature of music pieces, and therefore can be used for musical analysis. 5 Applications The initial description of MARSYAS appeared in [Tzanetakis and Cook, 1999a] A more detailed description of the segmentation algorithm as well as some user experiments performed for evaluation are given in [Tzanetakis and Cook, 1999b] A number of di#erent applications have been developed using our system. Most of them are undergraduate senior theses and class ....
Tzanetakis, G. and Cook, P. (1999a). A framework for audio analysis based on classification and temporal segmentation. In Proc.25th Euromicro Conference. Workshop on Music Technology and Audio Processing, Milan, Italy. IEEE Computer Society.
....the feature calculation an MP3 decoder was implemented in C . The source code is loosely based on the Fraunhaufer institute reference software implementation and other open source implementations. The feature calculation, classification and segmentation componenents were integrated using MARSYAS [10] an object oriented framework for building audio analysis tools written in C and JAVA. The combined decoding and classification segmentation runs real time on a Pentium II PC. The source code for the feature calculation is available upon request. 8. FUTURE WORK For statistical pattern ....
G. Tzanetakis and P. Cook, "A framework for audio analysis based on classification and temporal segmentation, " in Proc.25th Euromicro Conference. Workshop on Music Technology and Audio Processing, Milan, Italy, 1999, IEEE Computer Society.
....can then be edited by adding deleting boundaries until the desired segmentation is reached. Finally, the plug in architecture of the system easily allows the use of segmentation results from other analysis tools such as a speech recognition system. The system has been implemented using MARSYAS [8] an object oriented framework for building audio analysis applications. A client server architecture is used. The graphical user interface (written in JAVA) acts as a client to the server engine (written in C ) where all the signal processing is done. The system runs on Solaris, SGI, Linux and ....
G. Tzanetakis and P. Cook, "A framework for audio analysis based on classification and temporal segmentation," in Proc.25th Euromicro Conference. Workshop on Music Technology and Audio Processing, Milan, Italy, 1999, IEEE Computer Society.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC