Results 1 - 10
of
2,829
Construction And Evaluation Of A Robust Multifeature Speech/music Discriminator
, 1997
"... We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in se ..."
Abstract
-
Cited by 350 (5 self)
- Add to MetaCart
We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on systemperformanceand the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound. 1. OVERVIEW The problem of distinguishing speech signals from music signals has become increasingly important as automatic speech recognition (ASR) systems are applied to more and more "real-world" multimedia domains. If we wish to build systems that perform ASR on soundtrack data, for example, it is important to be able to distinguish which segments...
Reconstructing individual monophonic instruments from musical mixtures using scene completion
"... Monaural sound source separation is the process of separating sound sources from a single channel mixture. In mixtures of pitched musical instru-ments, the problem of overlapping harmonics poses a significant challenge to source separation and reconstruction. One standard method to resolve over-lapp ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
Monaural sound source separation is the process of separating sound sources from a single channel mixture. In mixtures of pitched musical instru-ments, the problem of overlapping harmonics poses a significant challenge to source separation and reconstruction. One standard method to resolve over
Probabilistic Modelling of Note Events in the Transcription of Monophonic Melodies
, 2004
"... This thesis concerns the problem of automatic transcription of music and proposes a method for transcribing monophonic melodies. Recently, computational music content analysis has received considerable attention among researchers due to the rapid growth of music databases. In this area of research, ..."
Abstract
-
Cited by 10 (2 self)
- Add to MetaCart
This thesis concerns the problem of automatic transcription of music and proposes a method for transcribing monophonic melodies. Recently, computational music content analysis has received considerable attention among researchers due to the rapid growth of music databases. In this area of research
Probabilistic Classication of Monophonic Instrument Playing Techniques
"... Understanding the underlying intentions of a music per-former is crucial to enable a machine such as an automated accompaniment system to interact intelligently with a musi-cian. Particularly, understanding the symbol associated with ..."
Abstract
- Add to MetaCart
Understanding the underlying intentions of a music per-former is crucial to enable a machine such as an automated accompaniment system to interact intelligently with a musi-cian. Particularly, understanding the symbol associated with
Convolutive speech bases and their application to supervised speech separation
- IEEE Transactions on Audio, Speech and Language Processing
, 2007
"... In this paper we present a convolutive basis decomposition method and its application on simultaneous speakers separation from monophonic recordings. The model we propose is a convolutive version of the non-negative matrix factorization algorithm. Due to the non-negativity constraint this type of co ..."
Abstract
-
Cited by 92 (6 self)
- Add to MetaCart
of coding is very well suited for intuitively and efficiently representing magnitude spectra. We present results that reveal the nature of these basis functions and we introduce their utility in separating monophonic mixtures of known speakers.
arXiv: Untangling Phase and Time in Monophonic Sounds UNTANGLING PHASE AND TIME IN MONOPHONIC SOUNDS
"... We are looking for a mathematical model of monophonic sounds with independent time and phase dimensions. With such a model we can resynthesise a sound with arbitrarily modulated frequency and progress of the timbre. We propose such a model and show that it exactly fulfils some natural properties, li ..."
Abstract
- Add to MetaCart
We are looking for a mathematical model of monophonic sounds with independent time and phase dimensions. With such a model we can resynthesise a sound with arbitrarily modulated frequency and progress of the timbre. We propose such a model and show that it exactly fulfils some natural properties
Blind Source Separation by Sparse Decomposition in a Signal Dictionary
, 2000
"... Introduction In blind source separation an N-channel sensor signal x(t) arises from M unknown scalar source signals s i (t), linearly mixed together by an unknown N M matrix A, and possibly corrupted by additive noise (t) x(t) = As(t) + (t) (1.1) We wish to estimate the mixing matrix A and the M- ..."
Abstract
-
Cited by 270 (33 self)
- Add to MetaCart
Introduction In blind source separation an N-channel sensor signal x(t) arises from M unknown scalar source signals s i (t), linearly mixed together by an unknown N M matrix A, and possibly corrupted by additive noise (t) x(t) = As(t) + (t) (1.1) We wish to estimate the mixing matrix A and the M-dimensional source signal s(t). Many natural signals can be sparsely represented in a proper signal dictionary s i (t) = K X k=1 C ik ' k (t) (1.2) The scalar functions ' k
Monophonic sound Source Separation with an unsupervised network of spiking neurones
, 2006
"... ..."
Improving Performance of an HMM-Ba Monophone-Level Normalized C
"... In this paper, we propose a novel confidence scoring method that is applied to N-best hypotheses output from an HMM-based classifier. In the first pass of the proposed method, the HMM-based classifier with monophone models outputs N-best hypotheses (word candidates) and boundaries of all the monopho ..."
Abstract
- Add to MetaCart
In this paper, we propose a novel confidence scoring method that is applied to N-best hypotheses output from an HMM-based classifier. In the first pass of the proposed method, the HMM-based classifier with monophone models outputs N-best hypotheses (word candidates) and boundaries of all
Continuous Hindi Speech Recognition using Monophone based Acoustic Modeling
"... Speech is a natural way of communication and it provides an intuitive user interface to machines. Although the performance of automatic speech recognition (ASR) system is far from perfect. The overall performance of any speech recognition system is highly depends on the acoustic modeling. Hence gene ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
) and perceptual linear prediction (PLP) both are used as a feature extraction techniques in our proposed system. Monophone based acoustic modeling is done by Hidden Markov Model (HMM) at the back-end of an ASR system. HTK 3.4.1 toolkit is used for the implementation of this system. The system is trained for 70
Results 1 - 10
of
2,829