• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Burst Detection Based on Measurements of Intensity Discrimination (2000)

by J P Hosom, R A Cole
Venue:ICSLP 2000
Add To MetaCart

Tools

Sorted by:
Results 1 - 2 of 2

Automatic phoneme alignment based on acoustic-phonetic modeling

by John-paul Hosom - In ICSLP , 2002
"... This paper presents a method for speaker-independent automatic phonetic alignment that is distinguished from standard HMM-based “forced alignment ” in three respects: (1) specific acoustic-phonetic features are used, in addition to PLP features, by the phonetic classifier; (2) the units of classific ..."
Abstract - Cited by 12 (2 self) - Add to MetaCart
This paper presents a method for speaker-independent automatic phonetic alignment that is distinguished from standard HMM-based “forced alignment ” in three respects: (1) specific acoustic-phonetic features are used, in addition to PLP features, by the phonetic classifier; (2) the units of classification consist of distinctive phonetic features instead of phonemes; and (3) observation probabilities depend not only on the current state, but also on the state transition information. This proposed method is compared with a state-of-the-art baseline forcedalignment system on a number of corpora, including telephone speech, microphone speech, and children’s speech. The new method has agreement of 92.57 % within 20 msec on the TIMIT corpus, which is a 26 % reduction in error over the baseline method (with 89.95 % agreement on TIMIT). Average reduction in error over all corpora is 28%. 1.
(Show Context)

Citation Context

...ed to compute VOT for distinguishing between consonants. The burst detection method uses intensity discrimination to select a number of candidate frames for further spectral-domain ANN classification =-=[7]-=-. This method has a total error rate of 13.20% (5.14% insertions and 8.06% deletions) on the TIMIT corpus; on telephone and cellular speech corpora, the deletion rate remains about the same, and the i...

unknown title

by John-paul Hosom
"... This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and education use, including for instruction at the authors institution and sharing with colleagues. Other uses, including reproduction and distribution, or sel ..."
Abstract - Add to MetaCart
This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and education use, including for instruction at the authors institution and sharing with colleagues. Other uses, including reproduction and distribution, or selling or licensing copies, or posting to personal, institutional or third party websites are prohibited. In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier’s archiving and manuscript policies are encouraged to visit: http://www.elsevier.com/copyright Author's personal copy Available online at www.sciencedirect.com
(Show Context)

Citation Context

...y-discrimination feature for the entire signal and for seven frequency bands, (2) the time derivatives of these intensity-discrimination features, (3) a relative-energy based burst-detection feature (=-=Hosom and Cole, 2000-=-), and (4) normalized log-scale energy computed with a 40-ms Hamming window, to focus on energy changes that are more rapid than can be measured with the 100-ms energy window used in Section 3.1.Auth...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University