• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Learning to speak. Sensori-motor control of speech movements (1997)

by G Bailly
Venue:Speech Commun
Add To MetaCart

Tools

Sorted by:
Results 1 - 8 of 8

Functional Phonology -- Formalizing the interactions between articulatory and perceptual drives

by Paulus Petrus Gerardus Boersma , 1998
"... ..."
Abstract - Cited by 119 (20 self) - Add to MetaCart
Abstract not found

Audiovisual Speech Synthesis

by G. Bailly - International Journal of Speech Technology , 2001
"... This paper presents the main approaches used to synthesize talking faces, and provides greater detail on a handful of these approaches. No system is described exhaustively, however, and, for purposes of conciseness, not all existing systems are reviewed. An attempt is made to distinguish between fac ..."
Abstract - Cited by 15 (3 self) - Add to MetaCart
This paper presents the main approaches used to synthesize talking faces, and provides greater detail on a handful of these approaches. No system is described exhaustively, however, and, for purposes of conciseness, not all existing systems are reviewed. An attempt is made to distinguish between facial synthesis itself (i.e the manner in which facial movements are rendered on a computer screen), and the way these movements may be controlled and predicted using phonetic input.

How phonological structures can be culturally selected for learnability

by Pierre-Yves Oudeyer - ADAPTIVE BEHAVIOR , 2005
"... ..."
Abstract - Cited by 9 (5 self) - Add to MetaCart
Abstract not found

The weight of phonetic substance in the structure of sound inventories

by Nathalie Vallée, Louis-jean Boë, Jean-luc Schwartz, Pierre Badin, Christian Abry - ZAS Papers in Linguistics , 2002
"... In the research field initiated by Lindblom & Liljencrants in 1972, we illustrate the possibility of giving substance to phonology, predicting the structure of phonological systems with nonphonological principles, be they listener-oriented (perceptual contrast and stability) or speaker-oriented (art ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
In the research field initiated by Lindblom & Liljencrants in 1972, we illustrate the possibility of giving substance to phonology, predicting the structure of phonological systems with nonphonological principles, be they listener-oriented (perceptual contrast and stability) or speaker-oriented (articulatory contrast and economy). We proposed for vowel systems the Dispersion-Focalisation Theory (Schwartz et al., 1997b). With the DFT, we can predict vowel systems using two competing perceptual constraints weighted with two parameters, respectively λ and α. The first one aims at increasing auditory distances between vowel spectra (dispersion), the second one aims at increasing the perceptual salience of each spectrum through formant proximities (focalisation). We also introduced new variants based on research in physics- namely, phase space (λ,α) and polymorphism of a given phase, or superstructures in phonological organisations (Vallée et al., 1999) which allow us to generate 85.6 % of 342 UPSID systems from 3- to 7-vowel qualities. No similar theory for consonants seems to exist yet. Therefore we present in detail a typology of consonants, and then suggest ways to explain plosive vs. fricative and voiceless vs. voiced consonants predominances by i)

Towards the Use of a Virtual Talking Head and of Speech Mapping tools for pronunciation training

by Pierre Badin, Gérard Bailly, Louis-Jean Boë - In Proceedings of the ESCA Tutorial and Research Workshop on Speech Technology in Language Learning , 1998
"... The Speech Mapping concept posits that speech sequences can be represented by trajectories in a multiparametric space, whose elements are related to each other by relationships representing speech production mechanisms. This concept is viewed as a useful framework for pronunciation training, in a sc ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
The Speech Mapping concept posits that speech sequences can be represented by trajectories in a multiparametric space, whose elements are related to each other by relationships representing speech production mechanisms. This concept is viewed as a useful framework for pronunciation training, in a scheme where the teacher uses a Virtual Talking Head, to manipulate audio-visual speech stimuli in order to fulfil a double task: (1) evaluating and improving the learner's perception of the target language sounds, and (2) helping the learner produce the corresponding articulations by acquiring the internalisation of the relations between articulatory gestures and resulting sounds. We describe a set of data and models, including a virtual talking head, that can be useful for pronunciation training, present a few experiments supporting this approach, and suggest some directions for the future. 1. Introduction A second language learner can be considered phonologically deaf, i.e. he/she may not...

Invariance and variability in the production of the height feature in French vowels

by Lucie Ménard A, Jean-luc Schwartz B, Jérôme Aubin A
"... This paper investigates the organization of the vowel space in French speakers. Speakers from 4 years of age to adulthood were recorded in order to generate significant betweenspeaker variability. Each speaker produced repetitions of the ten French oral vowels /i y u e ø o � œ � a/. Acoustic analyse ..."
Abstract - Add to MetaCart
This paper investigates the organization of the vowel space in French speakers. Speakers from 4 years of age to adulthood were recorded in order to generate significant betweenspeaker variability. Each speaker produced repetitions of the ten French oral vowels /i y u e ø o � œ � a/. Acoustic analyses show that despite considerable between-speaker variability in the relative positions of the vowels within the vowel space, speakers tend to

Data-driven articulatory inversion incorporating articulator priors

by Adam Lammert, Daniel P. W. Ellis
"... Recovering the motions of speech articulators from the acoustic speech signal has a long history, starting from the observation that a simple concatenated tube model is a reasonable model for the origin of formant resonances. In this work, we take a different approach making minimal assumptions abou ..."
Abstract - Add to MetaCart
Recovering the motions of speech articulators from the acoustic speech signal has a long history, starting from the observation that a simple concatenated tube model is a reasonable model for the origin of formant resonances. In this work, we take a different approach making minimal assumptions about the interdependence of acoustics and articulators by estimating the full joint distribution of the two spaces based on a corpus of paired data, derived from an articulatory synthesizer. This approach allows us to estimate posterior distributions of articulator state as well as finding the maximum-likelihood trajectories. We present examples comparing this approach to a related, earlier approach that did not incorporate prior distributions over articulator space, and demonstrate the advantages of learning the models from realistic utterances. We also indicate benefits available from jointly estimating particular pairs of articulators that have high mutual dependence. Index Terms: articulatory inversion, speech acoustics 1.

Correspondence:

by Yunusova Yana, Rosenthal Jeffrey S, Rudy Krista, Baljko Melanie, Daskalogiannakis John, Yana Yunusova
"... Positional targets for lingual consonants defined using electromagnetic articulography ..."
Abstract - Add to MetaCart
Positional targets for lingual consonants defined using electromagnetic articulography
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University