14 citations found. Retrieving documents...
Srihari, R. K. 1995 Computational Models for Integrating Linguistic and Visual Information: A Survey. Artificial Intelligence Review 8(5-6):349-369.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Multilevel Integration of Vision and Speech Understanding.. - Wachsmuth, al. (1999)   (Correct)

....scheme. Finally, we will give some experimental results of the implemented system (section 6) showing the robustness of our approach and a short conclusion (section 7) 2 Related Work In literature the topic of integrating vision and speech understanding is referenced from different viewpoints [21]. The construction of mental pictures [7] can be induced by verbal descriptions or previously seen objects. They are used to reason about scenes which are currently not visible. This is an important aspect in language understanding when spatial knowledge is involved [29, 1, 16] Other systems [13, ....

R. K. Srihari. Computational models for integrating linguistic and visual information: A survey. In Artificial Intelligence Review, 8, pages 349--369, Netherlands, 1994. Kluwer Academic Publishers.


Using Speech in Visual Object Recognition - Wachsmuth, al. (2000)   (Correct)

....mental abilities for human communication. Therefore, intuitive human computer interfaces especially have to support these two input modalities. In artificial intelligence, there has been a long tradition to realize these capabilities as separate tasks. But as mentioned by many researchers (e.g. [14]) integrating language and vision has implications on both processing tasks. In particular, a separated approach is inherently error prone because all decisions are exclusively based on one part of the whole input. One possibility to overcome these drawbacks but still using a separated approach is ....

R. K. Srihari. Computational models for integrating linguistic and visual information: A survey. In Artificial Intelligence Review, 8, pages 349--369, Netherlands, 1994. Kluwer Academic Publishers.


Co-Operative Neural Networks and `integrated' Classification - Ahmad, Vrusias, Tariq (2002)   (Correct)

....are subjective. The content based image retrieval paradigm was conceived in order to address these limitations. Here retrieval is based on the visual properties of an image such as colour and texture. Though successful to a certain extent this method has its own limitations. Srihari [13] is one of the first researchers to contemplate the notion of a more effective understanding and retrieval of pictures if the two modalities of vision and language are combined. For humans, the correlation of visual information with speech text is a given in that such a correspondence expedites ....

Srihari R.K., "Computational Models for Integrating Linguistic and Visual Information: A Survey, Artificial Intelligence Review, special issue on Integrating Language and Vision", Volume 8, pp. 349--369, 1995.


Communicative Rhythm in Gesture and Speech - Wachsmuth (1999)   (2 citations)  (Correct)

.... multimodal input systems requires, on the one hand, the processing of single modalities and, on the other hand, the integration of multiple modalities [5] To enable a technical system to coordinate and integrate perceived speech and gestures in their natural flow, two problems have to be solved [23]: The segmentation problem: Given that the system is to process open input, how is the right chunk of information determined that the system takes in for processing at a time How are consecutive chunks linked together The correspondence problem: Given that the system is to integrate ....

. R.K. Srihari. Computational models for integrating linguistic and visual information: a survey. Artificial Intelligence Review 8: 349-369, 1995.


Communicative Rhythm in Gesture and Speech - Wachsmuth (1999)   (2 citations)  (Correct)

.... systems requires, on the one hand, the processing of single modalities and, on the other hand, the integration of multiple modalities (Coutaz et al. 1995) To enable a technical system to coordinate and integrate perceived speech and gestures in their natural flow, two problems have to be solved (Srihari, 1995): The segmentation problem: Given that the system is to process open input, how is the right chunk of information determined that the system takes in for processing at a time How are consecutive chunks linked together The correspondence problem: Given that the system is to integrate ....

Srihari, R.K. (1995). Computational models for integrating linguistic and visual information: a survey. Artificial Intelligence Review, 8, 349-369.


An Architecture For Exploiting Qualitative, Scene-Specific.. - Chopra (1997)   (Correct)

....the caption. As we shall see later, we extend the use of collateral information in this domain, as well as make the ideas developed in Srihari s thesis suitable for application in other domains. Recently, there has been a lot of academic interest in the Integration of Natural Language and Vision [Srihari, 1995b; McKevitt, 1994; Srihari, 1995a ] One of the objectives of that research effort is to use the interpretation of data in one modality to drive the interpretation of data in the other. We highlight the fact that collateral based vision exploits a reliable hypothesis of scene contents. We obtain ....

Rohini K. Srihari. Computational models for integrating linguistic and visual information: A survey. Artificial Intelligence Review (special issue on integration of NLP and Vision), 8(5):349--369, 1995.


An Architecture For Exploiting Qualitative, Scene-Specific.. - Chopra   Self-citation (Srihari)   (Correct)

....the caption. As we shall see later, we extend the use of collateral information in this domain, as well as make the ideas developed in Srihari s thesis suitable for application in other domains. Recently, there has been a lot of academic interest in the Integration of Natural Language and Vision [Srihari, 1995b; McKevitt, 1994; Srihari, 1995a ] One of the objectives of that research effort is to use the interpretation of data in one modality to drive the interpretation of data in the other. We highlight the fact that collateral based vision exploits a reliable hypothesis of scene contents. We obtain ....

Rohini K. Srihari. Computational models for integrating linguistic and visual information: A survey. Artificial Intelligence Review (special issue on integration of NLP and Vision), 8(5):349--369, 1995.


Andrew Salway, Mike Graham, Eleftheria Tomadaki and Yan Xu, - Linking Video And   (Correct)

No context found.

Srihari, R. K. 1995 Computational Models for Integrating Linguistic and Visual Information: A Survey. Artificial Intelligence Review 8(5-6):349-369.


Vision-Language Integration in AI: a reality check - Katerina Pastra And   (Correct)

No context found.

R. Srihari, `Computational models for integrating linguistic and visual information: A survey', Artificial Intelligence Review, 8(5/6), 349--369, (1994).


Unknown -   (Correct)

No context found.

Srihari (1995a). Rohini K. Srihari, "Computational Models for Integrating Linguistic and Visual Information: A Survey," Artificial Intelligence Review, special issue on Integrating Language and Vision, Vol. 8 (5-6), pp.349-369.


A Self-Referential Perceptual Inference Framework for Video.. - Town, Sinclair (2003)   (Correct)

No context found.

R. Srihari. Computational models for integrating linguistic and visual information: A survey. Artificial Intelligence Review, special issue on Integrating Language and Vision, 8:349--369, 1995.


Vision-Language Integration in AI: a reality check - Katerina Pastra And   (Correct)

No context found.

R. Srihari, `Computational models for integrating linguistic and visual information: A survey', Artificial Intelligence Review, 8(5/6), 349--369, (1994).


A Computational Model to Connect Gestalt Perception and Natural.. - Dhande (2003)   (2 citations)  (Correct)

No context found.

R. K. Srihari. Computational models for integrating linguistic and visual information: A survey. Artificial Intelligence Review, special issue on Integrating Language and Vision, 8:349--369, 1995.


Integrated Analysis of Speech and Images as a . . . - Obtained (2002)   (Correct)

No context found.

Rohini K. Srihari. Computational Models for Integrating Linguistic and Visual Information: A Survey. Artificial Intelligence Review, 8:349--369, 1994.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC