15 citations found.
Klatt D. and Klatt L. "Analysis, synthesis, and perception of voice quality variations among female and male talkers." J. Acoust. Soc. Am., 87(2):820--857, 1990.


This paper is cited in the following contexts:
SYNTHESIS OF INITIAL (/s/-) STOP-LIQUID CLUSTERS USING HLsyn - David Williams Sensimetrics (1996) (1 citation)

.... an articulo-acoustic utterance specification which is then transformed by means of a set of physiologically and acoustically motivated mapping relations into a specification in terms of the larger set of lower-level (LL) acoustic parameters needed to control a KLSYN88 formant synthesizer [3]. In effect, the HLsyn synthesis system provides an articulatory interface to a formant synthesizer. 1.1. Functions of the HLsyn parameters. Ten user-settable parameters are included in the HLsyn synthesis system. The functions of these parameters can be described in terms of three broad ....

Klatt, D. H, and L. C. Klatt (1990) "Analysis, synthesis, and perception of voice quality variations among female and male talkers." JASA 53: 1070-1082.


Acoustic Variability In Spontaneous Conversational Speech Of.. - Ann Syrdal Att

....measures reflecting glottal characteristics were made on the conversational speech excerpts. These are H2-H1 (the level in dB of the second harmonic minus that of the first harmonic) and a count of the number of episodes of vocal creak. H2-H1 has been used as an index of glottal configuration [8], and for wide-band speech it is positive for laryngealized or pressed phonation, negative for breathy phonation, and approximately null for modal phonation. To minimize the influence of F1 on H1 and H2 amplitudes, research on glottal characteristics has focused on a highly restricted inventory, ....

Klatt, D. H., and Klatt, L. C. (1990). "Analysis, synthesis, and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Amer., 87, 820-857.
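
The H2-H1 measure described in the context above can be sketched as a level difference in dB. This is only an illustration, not the cited authors' procedure: the function names are hypothetical, the harmonic amplitudes are assumed to have been measured already (for example from a spectrum), and the width of the "approximately null" band (`modal_band_db`) is an arbitrary placeholder, since the excerpt gives no threshold.

```python
import math


def h2_minus_h1_db(h1_amplitude, h2_amplitude):
    """Level in dB of the second harmonic minus that of the first.

    Both arguments are linear (not dB) amplitudes and must be positive.
    """
    if h1_amplitude <= 0 or h2_amplitude <= 0:
        raise ValueError("harmonic amplitudes must be positive")
    return 20.0 * math.log10(h2_amplitude / h1_amplitude)


def phonation_label(difference_db, modal_band_db=1.0):
    """Map an H2-H1 value to the phonation type named in the excerpt.

    modal_band_db is a hypothetical tolerance for "approximately null";
    the excerpt does not specify one.
    """
    if difference_db > modal_band_db:
        return "laryngealized or pressed"
    if difference_db < -modal_band_db:
        return "breathy"
    return "modal"


if __name__ == "__main__":
    diff = h2_minus_h1_db(h1_amplitude=1.0, h2_amplitude=0.5)
    print(round(diff, 2), phonation_label(diff))
```

The sign convention follows the excerpt: positive values indicate laryngealized or pressed phonation, negative values indicate breathy phonation.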


A New Speech Synthesis System Based On The Arx Speech.. - Weizhong Zhu And (1996) (1 citation)

....voice quality. We have proposed an adaptive pitch-synchronous analysis method to estimate the vocal tract (formant-antiformant) and voice source parameters from a natural speech waveform [2]. Using this method, a voicing source waveform is approximated by the Rosenberg-Klatt (RK) model [3] and the unvoiced source is represented by a white noise. The Kalman filter algorithm is used to estimate the formant-antiformant parameters from the coefficients of the ARX model. After having automatically obtained estimates of the source and vocal tract parameters from natural speech, a ....

Klatt, D. H. and Klatt, L. C., "Analysis, synthesis, and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Amer., Vol. 87, pp.820-857, 1990.


Concatenation-based MIDI-to-Singing Voice Synthesis - Macon, Jensen-Link, al. (1997) (4 citations)

....this adjustment (e.g. [18]). Pitch variation: Since the prosody modification step in the sinusoidal synthesis algorithm transforms the pitch of every frame to match its MIDI-specified target, the result is a signal that does not exhibit the natural pitch fluctuations of the human voice. In [20], a simple equation for quasirandom pitch fluctuations in speech is proposed: $\Delta F_0 = \frac{F_0}{100} \cdot \frac{\sin(12.7t) + \sin(7.1t) + \sin(4.7t)}{3}$ (4). The addition of this fluctuation to the desired pitch contour gives the voice a more human feel, since a slight aperiodic wavering is present. ....

....component and T_in is a spectral tilt parameter controlled by a MIDI vocal effort control function input by the user. This function produces a frequency-dependent gain scaling function parameterized by T_in, as shown in Figure 3. In studies of acoustic correlates of perceived voice qualities [21, 20], it has been shown that utterances perceived as soft and breathy also exhibit a higher level of high-frequency aspiration noise than fully phonated utterances, especially in females. In other work with the ABS/OLA model, it was shown that a frequency-dependent noiselike character could be ....

D. H. Klatt and L. C. Klatt, "Analysis, synthesis, and perception of voice quality variations among female and male talkers," Journal of the Acoustical Society of America, vol. 87, pp. 820--857, February 1990.
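
The quasirandom pitch fluctuation quoted in the first context above can be sketched in a few lines. This is an illustration under stated assumptions, not the cited authors' code: the function names are hypothetical, the three sine terms are assumed to be summed and divided by 3, and the sine arguments are used exactly as they appear in the extracted text (radians, with t in seconds). The original typesetting may have included a further constant factor that did not survive extraction.

```python
import math


def pitch_fluctuation_hz(f0_hz, t_seconds):
    """Quasirandom deviation in Hz for a target pitch f0_hz at time t_seconds.

    Assumed form: (F0 / 100) * (sin(12.7 t) + sin(7.1 t) + sin(4.7 t)) / 3.
    """
    sines = (
        math.sin(12.7 * t_seconds)
        + math.sin(7.1 * t_seconds)
        + math.sin(4.7 * t_seconds)
    )
    return (f0_hz / 100.0) * sines / 3.0


def add_fluctuation(f0_contour_hz, frame_rate_hz):
    """Add the fluctuation to a frame-by-frame pitch contour.

    f0_contour_hz: target pitch per frame, in Hz.
    frame_rate_hz: frames per second (hypothetical parameter; sets t per frame).
    """
    return [
        f0 + pitch_fluctuation_hz(f0, index / frame_rate_hz)
        for index, f0 in enumerate(f0_contour_hz)
    ]


if __name__ == "__main__":
    flat_contour = [220.0] * 5  # a perfectly steady 220 Hz target
    for value in add_fluctuation(flat_contour, frame_rate_hz=100.0):
        print(round(value, 3))
```

Because each sine lies between -1 and 1, the deviation in this form never exceeds 1% of the target pitch, which is consistent with the "slight aperiodic wavering" the excerpt describes.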


A Singing Voice Synthesis System Based On Sinusoidal.. - Macon, Jensen-Link.. (1997) (2 citations)

....can draw contours that control vibrato depth over the course of the musical phrase, thus providing a mechanism for adding expressiveness to the vocal passage. A global setting of the vibrato rate is also possible. Addition of a slight nonperiodic drift of the pitch period (as suggested by [5], [7], and others) also contributes to a more human-sounding result. Vocal effort scaling: Another important attribute of the vocal source in singing is the variation of spectral tilt with loudness. Crescendo of the voice is accompanied by a leveling of the usual downward tilt of the source spectrum ....

....the sinusoidal model is a frequency domain representation, spectral tilt changes can be quite easily implemented by adjusting the slope of the sinusoidal amplitudes. Breathiness, which manifests itself as high-frequency noise in the speech spectrum, is another acoustic correlate of vocal intensity [7]. This frequency-dependent noise energy can be generated within the ABS/OLA model framework by employing a phase modulation technique during synthesis [8]. Vocal tract length scaling: In synthesis of bass voices using a voice inventory recorded from a baritone male vocalist, it was found that ....

Klatt, D.H.; Klatt, L., "Analysis, synthesis, and perception of voice quality variations among female and male talkers," Journal of the Acoustical Society of America, vol. 87, pp. 820--57, February 1990.


Survey of Data-Driven Approaches to Speech Synthesis - Ng (1998)

.... to the standard prosodic parameters of fundamental frequency, duration, and energy, other information related to the characteristics of the glottal source have also been found to be important in determining voice quality; these parameters include spectral tilt, open quotient, and aspiration noise [16]. The desire to improve the quality of synthesized speech has prompted work in developing more realistic models of the voice source. For example, a source model that can be dynamically controlled during synthesis using rules and whose parameters can be automatically estimated from natural speech ....

D. H. Klatt and L. C. Klatt, "Analysis, synthesis, and perception of voice quality variations among female and male talkers," Journal of the Acoustical Society of America, vol. 87, no. 2, pp. 820--857, 1990.


Spectral Methods For Voice Source Parameters Estimation - Boris Doval (1997)

....domain); it seems more closely linked to the perceptual features of voice quality than time domain processing; we shall show that one can design simpler methods (both conceptually and in terms of processing) for parameter estimations. The main spectral parameters found in the literature [7], [6] for synthesizing voices with different qualities are: 1) spectral tilt; 2) amplitude of the first few harmonics; 3) increase in the first formant bandwidth; 4) noise in the voice source. In contrast, the parameters generally used for glottal signal modelling are defined in time domain (for ....

....are mostly used for speech analysis because no exact formulas are available for linking these parameters with time domain glottal flow models used for synthesis. Therefore a spectral model of the periodic glottal flow spectrum is needed, because the glottal flow models that have been proposed [4], [7] but are defined in time domain. We first made an analytic spectral study of these models, to link their parameters with spectral parameters [3]. Following this work, it seemed possible to process voice quality by processing the speech amplitude spectrum, using simple linear filtering schemes. ....

[Article contains additional citation context not shown here]

Klatt D. and Klatt L. "Analysis, synthesis, and perception of voice quality variations among female and male talkers." J. Acoust.


Extraction of Vocal-Tract System Characteristics from.. - Yegnanarayana, Veldhuis (2 citations)

....regions. In practice the closed phase can be very short to the extent that it may vanish completely. For example, in high-pitched (e.g. female) voices the vocal folds have been observed to start opening directly after closure [5]. Closure may also not be complete, in which case some leakage occurs [11]. We will only consider voiced speech and we will adopt the well-known source-filter model [6], [7], [12] for the analysis of the speech signal. The estimation problem is then a model parameter estimation problem. The source-filter model consists of a source that generates a sequence of glottal ....

....not only the vocal tract, but also the trachea and the coupling of the trachea and the vocal tract is time varying, due to the vocal fold motion. Third, the overall system shows some nonlinear behavior. The presence of the subglottal tract including the trachea has several effects [3], [4], [5], [11], [14]. It will increase the damping, shift the resonance frequencies, and may introduce additional poles and zeros. Furthermore, it is difficult to analyze the nonlinear behavior because good models for it do not exist. In this study we assume that the nonlinear effects are not significant. Thus, ....

D.H. Klatt and L.C. Klatt, "Analysis synthesis, and perception of voice quality variations among female and male talkers," Journal of the Acoustical Society of America, vol. 87, no. 2, pp. 820--856, 1990.


Glottal Source Estimation: Methods Of Applying The.. - Edward Riegelsberger

....gradient descent produces more consistent fits than the more tractable Prony-based techniques. 1. INTRODUCTION. In simple speech synthesis and speech coding systems, the glottal source in voiced speech is frequently modeled as a series of impulses at the fundamental frequency. Recent research [2, 3] has indicated that more accurate modeling of the glottal source waveform results in more natural-sounding synthesis and coding. Appropriate glottal source waveforms have been shown to help distinguish between breathy, pressed, or normal phonations and incorporate speaker-distinctive qualities. ....

D. H. Klatt and L. C. Klatt, "Analysis, synthesis and perception of voice quality variations among male and female talkers," Journal of the Acoustical Society of America, vol. 87, pp. 820--857, February 1990.


Robust Text-Independent Speaker Identification over.. - Murthy, Beaufays.. (1997) (1 citation)

....a fact that humans are able to distinguish among speakers based on their voices. Studies on interspeaker variations and factors affecting voice quality have revealed that there are various parameters at both the segmental and suprasegmental levels that contribute to speaker variability [3], [4], [5], [6], [7]. Despite the fact that one cannot exactly quantify interspeaker variability in terms of features, current speaker identification systems perform very well with clean speech. However, the performance of these systems can decrease significantly under certain acoustic conditions, such as ....

D. H. Klatt and L. C. Klatt, "Analysis, synthesis and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Amer., vol. 87, no. 2, pp. 820-857, 1990.


The voice source as a causal/anticausal linear filter - Boris Doval Christophe

No context found.

Klatt D. and Klatt L. "Analysis, synthesis, and perception of voice quality variations among female and male talkers." J. Acoust. Soc. Am., 87(2):820--857, 1990.


Spectral correlates of voice open quotient and glottal.. - Limits And Experimental

No context found.

Klatt D. and Klatt L. (1990) "Analysis, synthesis, and perception of voice quality variations among female and male talkers" J. Acoust. Soc. Am. 87, 820-857.


Glottal Flow Derivative Modeling With The - Wavelet Smoothed Excitation

No context found.

D. Klatt and L. Klatt, "Analysis, synthesis, and perception of voice quality variations among female and male talkers," J. Acoust. Soc. Am., 87, pp. 820-857.


Acoustic-Phonetic Feature-Based Signal Processing for Automatic.. - Ali

No context found.

Klatt, D.H. and Klatt, L.C., "Analysis, synthesis and perception of voice quality variations among female and male talkers", J. Acoust. Soc. Am., 87, pp. 820-857, 1990.


On The Relation Between Voice Source Parameters And Prosodic.. - Strik And (1992) (1 citation)

No context found.

D.H. Klatt and L. Klatt (1990), "Analysis, synthesis, and perception of voice quality variations among female and male talkers", J. Acoust. Soc. Am., Vol. 87, pp. 820-857.
