CiteSeerX

Results 1 - 10 of 2,829

Construction And Evaluation Of A Robust Multifeature Speech/music Discriminator

by Eric Scheirer, Malcolm Slaney, 1997
"... We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in se ..."
Abstract - Cited by 350 (5 self)
We report on the construction of a real-time computer system capable of distinguishing speech signals from music signals over a wide range of digital audio input. We have examined 13 features intended to measure conceptually distinct properties of speech and/or music signals, and combined them in several multidimensional classification frameworks. We provide extensive data on system performance and the cross-validated training/test setup used to evaluate the system. For the datasets currently in use, the best classifier classifies with 5.8% error on a frame-by-frame basis, and 1.4% error when integrating long (2.4 second) segments of sound. 1. OVERVIEW The problem of distinguishing speech signals from music signals has become increasingly important as automatic speech recognition (ASR) systems are applied to more and more "real-world" multimedia domains. If we wish to build systems that perform ASR on soundtrack data, for example, it is important to be able to distinguish which segments...

Reconstructing individual monophonic instruments from musical mixtures using scene completion

by Jinyu Han, Bryan Pardo
"... Monaural sound source separation is the process of separating sound sources from a single channel mixture. In mixtures of pitched musical instruments, the problem of overlapping harmonics poses a significant challenge to source separation and reconstruction. One standard method to resolve over-lapp ..."
Abstract - Cited by 1 (1 self)
Monaural sound source separation is the process of separating sound sources from a single channel mixture. In mixtures of pitched musical instruments, the problem of overlapping harmonics poses a significant challenge to source separation and reconstruction. One standard method to resolve over

Probabilistic Modelling of Note Events in the Transcription of Monophonic Melodies

by Matti Ryynänen, 2004
"... This thesis concerns the problem of automatic transcription of music and proposes a method for transcribing monophonic melodies. Recently, computational music content analysis has received considerable attention among researchers due to the rapid growth of music databases. In this area of research, ..."
Abstract - Cited by 10 (2 self)
This thesis concerns the problem of automatic transcription of music and proposes a method for transcribing monophonic melodies. Recently, computational music content analysis has received considerable attention among researchers due to the rapid growth of music databases. In this area of research

Probabilistic Classification of Monophonic Instrument Playing Techniques

by Akira Maezawa, Katsutoshi Itoyama, Toru Takahashi, Tetsuya Ogata, Hiroshi G. Okuno
"... Understanding the underlying intentions of a music performer is crucial to enable a machine such as an automated accompaniment system to interact intelligently with a musician. Particularly, understanding the symbol associated with ..."
Abstract
Understanding the underlying intentions of a music performer is crucial to enable a machine such as an automated accompaniment system to interact intelligently with a musician. Particularly, understanding the symbol associated with

Convolutive speech bases and their application to supervised speech separation

by Paris Smaragdis - IEEE Transactions on Audio, Speech and Language Processing, 2007
"... In this paper we present a convolutive basis decomposition method and its application on simultaneous speakers separation from monophonic recordings. The model we propose is a convolutive version of the non-negative matrix factorization algorithm. Due to the non-negativity constraint this type of co ..."
Abstract - Cited by 92 (6 self)
of coding is very well suited for intuitively and efficiently representing magnitude spectra. We present results that reveal the nature of these basis functions and we introduce their utility in separating monophonic mixtures of known speakers.

arXiv: Untangling Phase and Time in Monophonic Sounds

by Henning Thielemann
"... We are looking for a mathematical model of monophonic sounds with independent time and phase dimensions. With such a model we can resynthesise a sound with arbitrarily modulated frequency and progress of the timbre. We propose such a model and show that it exactly fulfils some natural properties, li ..."
Abstract
We are looking for a mathematical model of monophonic sounds with independent time and phase dimensions. With such a model we can resynthesise a sound with arbitrarily modulated frequency and progress of the timbre. We propose such a model and show that it exactly fulfils some natural properties

Blind Source Separation by Sparse Decomposition in a Signal Dictionary

by M. Zibulevsky, B. A. Pearlmutter, P. Bofill, P. Kisilev, 2000
"... Introduction In blind source separation an N-channel sensor signal $x(t)$ arises from $M$ unknown scalar source signals $s_i(t)$, linearly mixed together by an unknown $N \times M$ matrix $A$, and possibly corrupted by additive noise $\xi(t)$: $x(t) = A s(t) + \xi(t)$ (1.1). We wish to estimate the mixing matrix $A$ and the M- ..."
Abstract - Cited by 270 (33 self)
Introduction In blind source separation an N-channel sensor signal $x(t)$ arises from $M$ unknown scalar source signals $s_i(t)$, linearly mixed together by an unknown $N \times M$ matrix $A$, and possibly corrupted by additive noise $\xi(t)$: $x(t) = A s(t) + \xi(t)$ (1.1). We wish to estimate the mixing matrix $A$ and the M-dimensional source signal $s(t)$. Many natural signals can be sparsely represented in a proper signal dictionary: $s_i(t) = \sum_{k=1}^{K} C_{ik} \varphi_k(t)$ (1.2). The scalar functions $\varphi_k$

Monophonic sound Source Separation with an unsupervised network of spiking neurones

by Ramin Pichevar, Jean Rouat, 2006
"... ..."
Abstract - Cited by 6 (2 self)
Abstract not found

Improving Performance of an HMM-Ba Monophone-Level Normalized C

by Muhammad Ghulam, Takashi Fukuda
"... In this paper, we propose a novel confidence scoring method that is applied to N-best hypotheses output from an HMM-based classifier. In the first pass of the proposed method, the HMM-based classifier with monophone models outputs N-best hypotheses (word candidates) and boundaries of all the monopho ..."
Abstract
In this paper, we propose a novel confidence scoring method that is applied to N-best hypotheses output from an HMM-based classifier. In the first pass of the proposed method, the HMM-based classifier with monophone models outputs N-best hypotheses (word candidates) and boundaries of all

Continuous Hindi Speech Recognition using Monophone based Acoustic Modeling

by Ankit Kumar, Mohit Dua, Tripti Choudhary
"... Speech is a natural way of communication and it provides an intuitive user interface to machines. Although the performance of automatic speech recognition (ASR) system is far from perfect. The overall performance of any speech recognition system is highly depends on the acoustic modeling. Hence gene ..."
Abstract - Cited by 1 (0 self)
) and perceptual linear prediction (PLP) both are used as a feature extraction techniques in our proposed system. Monophone based acoustic modeling is done by Hidden Markov Model (HMM) at the back-end of an ASR system. HTK 3.4.1 toolkit is used for the implementation of this system. The system is trained for 70

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University