| M. Muller and J. Daniel. Toward a Definition of Voice Documents. In Proc. COIS 90, 1990. |
....speech recognition systems have employed a visual component for all or part of the feedback, a telephone based system does not enjoy such a luxury. Described below are several early investigations into systems employing speech as the sole interactional medium. 2.2. 1 Hyperphone Hyperphone [Muller Daniel 1990] is a speech only document browser using speech recognition for input and speech synthesis for output. Typed links were constructed between different sections of the text to enable voice only browsing; the user may follow a linear sequence of navigation known as a Parsed Voice String or interrupt ....
....10 from Paul Martin about anyone for hiking Figure 5.7: Scanning headers in SpeechActs. An obvious shortcoming of this technique is the need to remember the number of each message. If the system supported barge in, SpeechActs might be able to use voice direct 69 manipulation [Stifelman 1992, Muller Daniel 1990] to say that one right after the header of the interesting message was read. 5.4.2 Clustering by sender An alternative method of summarization, the one explored in MailCall, is clustering messages by sender. The goal is to provide an effective summary of the messages from which the user can ....
M. Muller and J. Daniel. "Toward a definition of voice documents." Proceedings of COIS, 1990.
....other interfaces, so that the user felt that they were navigating and in control. Arons acknowledged that representing and manipulating a hypermedia database becomes much more complex in the speech domain than with traditional media. Related systems include those described by Resnick (1990) and Muller (1990), both cited by 10 Arons (1991) 2.4 Previous work with spoken language extensions to WWW browsers Many groups around the country, and presumably around the world, are working on projects that are similar in many ways to OGI s SLAM system. Earlier versions of MacMosaic had been compiled with ....
Muller, M. & Daniel, J. (1990). Toward a definition of voice documents, Proceedings of Conference on Office Information Systems, Cambridge, MA, 25-27 April 1990.
....is first heard and (2) latency in the system s response time, while receiving the spoken command from the remote speech server. Generally, the user s command follows (lags behind) the playback of the target message. This can cause the wrong message to be selected for playback. Muller and Daniel [Muller90] suggest a partially overlapping temporal window to select the correct target. In Nomadic Radio, the temporal target window for a message being scanned, extends 2 seconds after it has finished playing (see figure 4.4) Figure 4.4: Scanning email messages and selecting the current message within ....
Muller, M. and J. Daniel. Toward a definition of voice documents. Proceedings of COIS '90, pp. 174-182, ACM, 1990.
....system [3] Speech and audio, however, exist only as a time varying signal the auditory system cannot browse through a set of recordings the way the eye can scan a display. Speech interfaces must present information sequentially while visual interfaces can present information simultaneously [5, 10]. These factors lead to significantly different design issues when using speech [15] as opposed to text, video, or graphics. Recorded speech cannot be manipulated, viewed, or organized on a display in the same manner as text or video images. Schematic representations of speech signals (e.g. ....
M. J. Muller and J. E. Daniel. Toward a definition of voice documents. In Proceedings of COIS '90, 1990.
....are also discussed. While speech is a powerful communications medium, it exists only temporally the ear cannot browse around a set of recordings the way the eye can scan a screen of text and images. Speech and audio interfaces must be sequential, while visual interfaces can be simultaneous [Gave86, Mull90]. These confounding features lead to significantly different design issues when using speech [Schm89] rather than text, video, or graphics. 1 The word hyperspeech is used much like hypertext or hypermedia, as a generic term for speech only hypermedia it is not the name of the application ....
....application encourages free form browsing, allowing users to focus on accessing information rather than navigation. Zellweger s paths are appropriate for scripted documents and narrations; this system focuses on conversational interactions. Muller and Daniel s description of the HyperPhone system [Mull90] provides a good overview of many important issues in voice I O hypermedia. They state that navigation tends to be modeled spatially in almost any interface, and that voice navigation is particularly difficult to map into the spatial domain. HyperPhone voice documents are a collection of ....
M. J. Muller and J. E. Daniel. Toward a definition of voice documents. In Proceedings of COIS '90, 1990.
No context found.
M. Muller and J. Daniel. Toward a Definition of Voice Documents. In Proc. COIS 90, 1990.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC