The ubiquity of the telephone suggests it as an ideal messaging tool. The slow, serial output of speech, however, makes it difficult to find important messages quickly. MailCall, a telephone-based messaging system using speech recognition, takes a step toward effective conversational messaging with a combination of filtering, random access, and recognition error handling. Incoming voice and text messages are categorized and prioritized based on the user's current interests as inferred from the calendar, rolodex, etc. MailCall then updates the speech recognizer's vocabulary on the fly to support random access---so that the user can go to interesting messages directly rather than having to step through the list one message at a time. Inevitable recognition errors are handled by an interface algorithm which verifies potential mistakes and lets the user correct them quickly; further, touch-tone equivalents for several speech commands are provided. And the user can send replies to messages or place calls using the rolodex. The result is a system which retrieves the user's most important messages first, supports
|
814
|
Attention, intentions, and the structure of discourse
– Grosz, Sidner
- 1986
|
|
414
|
Toward Memory-Based Reasoning
– Stanfill, Waltz
- 1986
|
|
413
|
Using collaborative filtering to weave an information tapestry
– Goldberg, Nichols, et al.
|
|
240
|
Put-that-there’: Voice and gesture at the graphics interface
– Bolt
- 1980
|
|
175
|
Collaborative interface agents
– Lashkari, Metral, et al.
- 1994
|
|
63
|
Voicenotes: a speech interface for a handheld voice notetaker
– Stifleman, Arons, et al.
- 1993
|
|
49
|
The Information Lens: An Intelligent System for Information Sharing in Organizations
– Malone, Grant, et al.
- 1986
|
|
48
|
Expressive richness: a comparison of speech and text as media for revision
– Chalfonte, Fish, et al.
- 1991
|
|
47
|
Hyperspeech: Navigating in speech-only hypermedia
– Arons
- 1991
|
|
41
|
Technopoly-the surrender of culture to technology. Vintage Books
– Postman
- 1992
|
|
39
|
Steps toward graceful interaction in spoken and written manmachine communication
– Hayes, Reddy
- 1983
|
|
29
|
User interfaces for voice applications
– Kamm
- 1995
|
|
26
|
An introduction to discourse analysis
– Coulthard
- 1977
|
|
26
|
The cost of errors in a spoken language system
– Hirschman, Pao
- 1993
|
|
25
|
Voice Communication With Computers - Conversational Systems
– Schmandt
- 1994
|
|
24
|
Phoneshell: the Telephone as Computer Terminal
– Schmandt
- 1993
|
|
19
|
A Conversational Telephone Messaging System
– Schmandt, Arons
- 1984
|
|
18
|
Pronouncing Surnames Automatically
– Spiegel
- 1985
|
|
17
|
Design of a Generic Learning Interface Agent
– Metral
- 1993
|
|
17
|
Doppelganger goes to school: Machine learning for user modeling
– Orwant
- 1993
|
|
17
|
Multimedia Nomadic Services on Today’s Hardware
– Schmandt
- 1994
|
|
14
|
Toward a definition of voice documents
– Muller, Daniel
- 1990
|
|
13
|
A robust parser and dialog generator for a conversational office system
– Schmandt, Arons
- 1986
|
|
12
|
Surfing the web by voice
– Hemphill, Thrift
- 1995
|
|
11
|
Speechacts: A testbed for continuous speech applications
– Martin, Kehler
- 1994
|
|
10
|
Let your fingers do the spelling: Implicit disambiguation of words spelled with the telephone keypad
– Davis
- 1991
|
|
10
|
Dialing for documents: an experiment in information theory
– Rau, Skiena
- 1996
|
|
9
|
Tools for Building Asynchronous Servers to Support Speech and Audio Applications
– Arons
- 1992
|
|
9
|
Chatter: A Conversational Telephone Agent
– Ly
- 1993
|
|
9
|
Chatter: A Conversational Learning Speech Interface
– Ly, Schmandt
- 1994
|
|
9
|
Speech synthesis gives voiced access to an electronic mail system
– Schmandt
- 1984
|
|
7
|
Logic and Conversation," Syntax and Semantics: Speech Acts
– Grice
- 1975
|
|
7
|
Phonetool: Integrating telephones and workstations
– Schmandt, Casner
- 1989
|
|
6
|
Speech recognition architectures for multimedia environments
– Ly, Schmandt, et al.
- 1993
|
|
6
|
The smart environment for retrieval system evaluation-advantages and problem areas
– Salton
- 1981
|
|
6
|
Not just another voice mail system
– Stifelman
- 1991
|
|
5
|
VoiceNotes: An Application for a Voice-Controlled Hand-Held Computer
– Stifelman
- 1992
|
|
4
|
Voice Activated Interaction System Based on HMM-based Speaker-Independent Word Spotting
– Kitai, Imamura, et al.
- 1991
|
|
4
|
Perception of synthetic speech generated by rule
– Pisoni, Nusbaum, et al.
- 1985
|
|
3
|
NewsTalk: A Speech Interface to a Personalized Information Agent
– Herman
- 1995
|
|
2
|
StoryWriter: A speechoriented editor
– Danis, Comerford, et al.
- 1994
|
|
2
|
Eudora: Bringing the P.O. Where You Live
– Dorner
- 1988
|
|
2
|
The Effects of Several
– Engelbeck, Roberts
- 1989
|
|
2
|
User's Guide. BBN Systems and Technologies: A Division of Bolt Beranek and Newman
– Prototyper
- 1993
|
|
2
|
DAGGER: a parser for Directed Acyclic Graphs of Grammars Enhancing Recognition
– Hemphill
- 1993
|
|
2
|
EPHOD: an Electronic PHOnetic Dictionary and tool set
– Hemphill
- 1993
|
|
2
|
Reliable Spelling Despite Unreliable Letter Recognition
– Marx, Schmandt
- 1994
|
|
1
|
FLANGE: Formal LANguage Grammars for Everyone
– Hemphill
- 1993
|
|
1
|
Semi-Structured Messages are Surprisingly Useful for Computer-Supported Coordination
– al
- 1987
|
|
1
|
Putting People First: Specifying Names in Speech Interfaces
– Marx, Schmandt
- 1994
|