64 citations found. Retrieving documents...
Eric Scheirer and Malcolm Slaney, "Construction and evaluation of a robust multifeatures speech/music discriminator," IEEE Transactions on Acoustics, Speech, and Signal Processing (ICASSP'97), 1997, pp. 1331--1334.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Semantic Indexing Of Multimedia Using Audio, Text And Visual.. - Iyengar Nock Neti (2002)   (1 citation)  (Correct)

....combinations. The intermediate level labels thus generated are combined for the high level rocket launch event retrieval. We used 24 dim Mel Frequency Cepstral Coefficients, common in ASR systems, as our low level features. Features used by other authors include centroid frequency, pitch etc [1, 11, 12, 13]. In the first experiment, we study the effect of using a HMM for duration modeling of a single intermediate concept (Explosion) In the second experiment, we look at the effect of different audioonly fusion strategies. We note here that Scheme 3 in section 3 can be viewed as implicit fusion ....

E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator," in Proc. ICASSP '97, Munich, Germany, 1997, pp. 1331--1334.


Computationally Measurable Temporal Differences between Speech.. - Gerhard (2003)   (Correct)

.... The power of a waveform is typically calculated on a short time basis, by windowing the waveform, as in the STFT, squaring the samples and taking the mean [83] The square root of this result is the engineering quantity known as the root mean square value, which has been used by other researchers [63, 79]. Average normalized power (P ) for a digital waveform, is equivalent to the average normalized energy (E) per sample. The definitions of these quantities, for a waveform w(t) are presented in Equations 1.1 through 1.4 [31] The value p(t) will be used throughout this work to represent the ....

....consist of periods of high power (voiced phonemes) followed by periods of low power (unvoiced phonemes, inter word pauses) while music tends to have a more consistent power distribution. A measure of the power distribution is used in [62] while a measure of the power modulation rate is used in [63], where the authors claim that speech tends to have a power modulation rate of around 4 Hz. 1.5.2 Fundamental Frequency (f 0 ) Only periodic or pseudo periodic waveforms can have a valid f 0 . Perceptually, periodic and pseudo periodic signals have a pitch. Periodic signals exactly repeat to ....

[Article contains additional citation context not shown here]

Eric Scheirer and Malcolm Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. In International Conference on Acoustics, Speech and Signal Processing, volume II, pages 1331--1334. IEEE, 1997.


FDU at TREC2002: Filtering, QA, Web and Video Tasks - Wu, Huang, Niu, Xia, Feng..   (Correct)

.... Energy, Low Short Time Energy Ratio, Noise Frame Ratio, Mean and Covariance of Brightness, Spectral Flux, Spectral Roll off Point, Mean and Covariance of LPC, Mean and Covariance of MFCC, Mean and Covariance of Pitch, Mean and Covariance of Band Spectrum, Mean and Covariance of Band Width [Lu01][ Scheirer97]. Nearest Neighbor Model and Gaussian Mixture Model are trained by TREC 10 Videos. Applying these trained models on 1 second window, we can get the type of each window. In our submission, Run01 uses NN Model and Run02 uses 16 mixture GMM Model. The Ranking Value of speech, music and monologue ....

E. Scheirer and M. Slaney, Construction and Evaluation of a Robust Multifeature Music/Speech Discriminator, Proc. of ICASSP'97, vol. II, pp 1331-1334. IEEE, April 1997


TREC 2002 Video Track Experiments at MediaTeam Oulu and .. - Rautiainen, Penttilä..   (Correct)

....of the signal was divided into frames of 50 ms overlapping by 10 ms, and the power inside every frame was calculated. The four used features were the variance of the frame by frame power, and the variance of the first and the second order differentials of the power, and finally, low energy ratio [5], which is computed as the percentage of 50ms frames with RMS power less than the threshold percentage of the mean RMS power. A threshold level of 20 for low energy ratio was found to give best results, and the spread of the four features was increased by log transformations. In the training ....

Scheirer E & Slaney M (1997) Construction and evaluation of a robust multifeature speech/music discriminator. Proc. ICASSP.


The Importance Of Sequences In Musical Similarity - Michael Casey Goldsmiths (2006)   Self-citation (Slaney)   (Correct)

No context found.

Eric Scheirer and Malcolm Slaney, "Construction and evaluation of a robust multifeatures speech/music discriminator," IEEE Transactions on Acoustics, Speech, and Signal Processing (ICASSP'97), 1997, pp. 1331--1334.


Speech Discrimination Based On Multiscale Spectro--Temporal - Modulations Nima Mesgarani (2004)   Self-citation (Slaney)   (Correct)

No context found.

E. Scheirer, M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator", ICASSP'97, 1997.


A Hierarchical Approach To Automatic Musical Genre Classification - Burred, Lerch (2003)   (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. ICASSP, 1997.


Comparing MFCC and MPEG-7 Audio Features for Feature.. - Xiong, Regunathan (2004)   (Correct)

No context found.

E. Scheirer and S. Malcolm, "Construction and evaluation of a robust multifeature speech/music discriminator, " Proc. ICASSP-97, April 1997, Munich, Germany.


Speech/music Discrimination Using A Single Warped - Lpc-Based Feature Mu   (Correct)

No context found.

E. Scheirer and M. Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. Proc. IEEE ICASSP'97, pages 1331--1334, 1997.


Fusion Of Descriptors For Speech / Music Classification - Julie Mauclair And   (Correct)

No context found.

E. Scheirer and M. Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. In Processing, pages 1331--1334, Munich, Germany, April 1997. IEEE.


Speech Identification Using a Sequence-Based Heuristic - Heinrich   (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator". In Proc. ICASSP97, 1997, pp. 1331--1334.


Content Analysis for Audio Classification and Segmentation - Lu, Zhang, Jiang (2002)   (8 citations)  (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature music/speech discriminator," in Proc. ICASSP' 97, Apr. 1997, vol. II, pp. 1331--1334.


Pitch Extraction and Fundamental Frequency: History and Current.. - Gerhard (2003)   (Correct)

No context found.

Eric Scheirer and Malcolm Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. In International Conference on Acoustics, Speech and Signal Processing, volume II, pages 1331--1334. IEEE, 1997.


Automatic extraction of music descriptors from acoustic.. - Zils, Pachet (2004)   (Correct)

No context found.

Eric D. Scheirer, and Malcolm Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. Proc. ICASSP '97.


st International Symposium on Computer Music Modeling and.. - Cmmr Springer Verlag   (Correct)

No context found.

Eric D. Scheirer, and Malcolm Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. Proc ICASSP '97, pp. 13311334.


Automatic Musical Instrument Recognition - Eronen (2001)   (2 citations)  (Correct)

No context found.

Scheirer, Slaney. (1997). "Construction and evaluation of a robust multifeature speech/music discriminator ". In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP'97, pp. 1331 - 1334.


Amadeus: A Scalable Hmm-Based Audio Information Retrieval System - Eloi Batlle Jaume (2004)   (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator, " in Proc. ICASSP, 1997, pp. 1331--1334.


Silence As A Cue To Rhythm In The Analysis Of Speech And Song - David Gerhard Computer   (Correct)

No context found.

Scheirer, E. and Slaney, M. (1997). Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator. IEEE International Conference on Acoustics, Speech and Signal Processing, II:1331-1334.


A Hierarchical Approach To Automatic Musical Genre Classification - Burred, Lerch (2003)   (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator," in Proc. ICASSP, 1997.


Audio Signal Classification: History and Current Techniques - Gerhard (2003)   (Correct)

No context found.

Eric Scheirer and Malcolm Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. In International Conference on Acoustics, Speech and Signal Processing, volume II, pages 1331--1334. IEEE, 1997.


Robust Singing Detection in Speech/Music Discriminator Design - Wu Chou And (2001)   (1 citation)  (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator", Proc. ICASSP'97, pp.1331-1334, 1997.


Computational Auditory Scene Recognition - Peltonen (2001)   (4 citations)  (Correct)

No context found.

Scheirer, E. D. and Slaney, M. "Construction and Evaluation of A Robust Multifeature Speech/Music Discriminator". In Proceedings of the 1997 IEEE Conference on Acoustics, Speech and Signal Processing, volume 2, pp. 1331 - 1334, Munich, Germany, April 1997.


Speech, Music and Songs Discrimination in the Context of.. - Ezzaidi, Rouat (2002)   (Correct)

No context found.

Scheirer E. and Stanley M., "Construction and evaluation of a robust multifeature speech/music discriminator," in ICASSP'97, 1997, vol. II, pp. 1331--1334.


Spectral Sound Gap Filling - Iddo Drori Alon   (Correct)

No context found.

E. Scheirer and M. Slaney. Construction and evaluation of a robust multifeature speech/music discriminator. In Proceedings of International Conference on Acoustic, Speech, and Signal Processing, pages 1331--1334, 1997.


Incorporating Audio Cues into Dialog and Action Scene Extraction - Chen, Rizvi, Özsu (2003)   (2 citations)  (Correct)

No context found.

E. Scheirer and M. Slaney, "Construction and evaluation of a robust multifeature speech/music discriminator, pp. 21--24, April 1997.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC