• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling (2007)

by A Vinciarelli
Venue:IEEE Transactions on Multimedia 9
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 34
Next 10 →

Social Signal Processing: Survey of an Emerging Domain

by Alessandro Vinciarelli , Maja Pantic , Hervé Bourlard , 2008
"... The ability to understand and manage social signals of a person we are communicating with is the core of social intelligence. Social intelligence is a facet of human intelligence that has been argued to be indispensable and perhaps the most important for success in life. This paper argues that next- ..."
Abstract - Cited by 150 (32 self) - Add to MetaCart
The ability to understand and manage social signals of a person we are communicating with is the core of social intelligence. Social intelligence is a facet of human intelligence that has been argued to be indispensable and perhaps the most important for success in life. This paper argues that next-generation computing needs to include the essence of social intelligence – the ability to recognize human social signals and social behaviours like turn taking, politeness, and disagreement – in order to become more effective and more efficient. Although each one of us understands the importance of social signals in everyday life situations, and in spite of recent advances in machine analysis of relevant behavioural cues like blinks, smiles, crossed arms, laughter, and similar, design and development of automated systems for Social Signal Processing (SSP) are rather difficult. This paper surveys the past efforts in solving these problems by a computer, it summarizes the relevant findings in social psychology, and it proposes a set of recommendations for enabling the development of the next generation of socially-aware computing.

Bridging the Gap Between Social Animal and Unsocial Machine: A Survey of Social Signal Processing

by Alessandro Vinciarelli, Maja Pantic, Dirk Heylen, Catherine Pelachaud, Isabella Poggi, Marc Schröder, et al. - IEEE TRANSACTIONS ON AFFECTIVE COMPUTING
"... Social Signal Processing is the research domain aimed at bridging the social intelligence gap between humans and machines. This article is the first survey of the domain that jointly considers its three major aspects, namely modeling, analysis and synthesis of social behaviour. Modeling investigate ..."
Abstract - Cited by 34 (6 self) - Add to MetaCart
Social Signal Processing is the research domain aimed at bridging the social intelligence gap between humans and machines. This article is the first survey of the domain that jointly considers its three major aspects, namely modeling, analysis and synthesis of social behaviour. Modeling investigates laws and principles underlying social interaction, analysis explores approaches for automatic understanding of social exchanges recorded with different sensors, and synthesis studies techniques for the generation of social behaviour via various forms of embodiment. For each of the above aspects, the paper includes an extensive survey of the literature, points to the most important publicly available resources, and outlines the most fundamental challenges ahead.

Role recognition for meeting participants: an approach based on lexical information and Social Network Analysis

by N. P. Garg, S. Favre, H. Salamin, D. Hakkani Tür, A. Vinciarelli - in Proceedings of the ACM International Conference on Multimedia, 2008
"... This paper presents experiments on the automatic recognition of roles in meetings. The proposed approach combines two sources of information: the lexical choices made by people playing different roles on one hand, and the Social Networks describing the interactions between the meeting participants o ..."
Abstract - Cited by 30 (13 self) - Add to MetaCart
This paper presents experiments on the automatic recognition of roles in meetings. The proposed approach combines two sources of information: the lexical choices made by people playing different roles on one hand, and the Social Networks describing the interactions between the meeting participants on the other hand. Both sources lead to role recognition results significantly higher than chance when used separately, but the best results are obtained with their combination. Preliminary experiments obtained over a corpus of 138 meeting recordings (over 45 hours of material) show that around 70 % of the time is labeled correctly in terms of role.
(Show Context)

Citation Context

...ns in the data used in this work). To the best of our knowledge, only few works have been dedicated to the automatic recognition of roles. Some of them recognize functional roles in broadcast data [2]=-=[9]-=-, i.e. the tasks that different people perform in television and radio programs (e.g. anchorman or guest), and another recognizes functional roles in movies [11] (e.g. hero or hero’s friends). The rec...

Social Signal Processing: State-of-the-art and future perspectives of an emerging domain

by Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard, Alex Pentland - IN PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA , 2008
"... The ability to understand and manage social signals of a person we are communicating with is the core of social intelligence. Social intelligence is a facet of human intelligence that has been argued to be indispensable and perhaps the most important for success in life. This paper argues that next- ..."
Abstract - Cited by 26 (7 self) - Add to MetaCart
The ability to understand and manage social signals of a person we are communicating with is the core of social intelligence. Social intelligence is a facet of human intelligence that has been argued to be indispensable and perhaps the most important for success in life. This paper argues that next-generation computing needs to include the essence of social intelligence – the ability to recognize human social signals and social behaviours like politeness, and disagreement – in order to become more effective and more efficient. Although each one of us understands the importance of social signals in everyday life situations, and in spite of recent advances in machine analysis of relevant behavioural cues like blinks, smiles, crossed arms, laughter, and similar, design and development of automated systems for Social Signal Processing (SSP) are rather difficult. This paper surveys the past efforts in solving these problems by a computer, it summarizes the relevant findings in social psychology, and it proposes aset of recommendations for enabling the development of the next generation of socially-aware computing.
(Show Context)

Citation Context

...ormed with different algorithms including Dynamic Bayesian Networks [54] and layered hidden Markov models [57]. The recognition of roles has been addressed in two main contexts: broadcast material [7]=-=[90]-=-[92] and small scale meetings [6][23][96]. The works in [90][92] apply Social Network Analysis [91] to detect the role of people in broadcast news and movies, respectively. The approach in [7] recogni...

Role Recognition in Multiparty Recordings using Social Affiliation Networks and Discrete Distributions

by Sarah Favre, Hugues Salamin, John Dines, Ro Vinciarelli - In Proceedings of the ACM International Conference on Multimodal Interfaces , 2008
"... This paper presents an approach for the recognition of roles in multiparty recordings. The approach includes two major stages: extraction of Social Affiliation Networks (speaker diarization and representation of people in terms of their social interactions), and role recognition (application of disc ..."
Abstract - Cited by 20 (6 self) - Add to MetaCart
This paper presents an approach for the recognition of roles in multiparty recordings. The approach includes two major stages: extraction of Social Affiliation Networks (speaker diarization and representation of people in terms of their social interactions), and role recognition (application of discrete probability distributions to map people into roles). The experiments are performed over several corpora, including broadcast data and meeting recordings, for a total of roughly 90 hours of material. The results are satisfactory for the broadcast data (around 80 percent of the data time correctly labeled in terms of role), while they still must be improved in the case of the meeting recordings (around 45 percent of the data time correctly labeled). In both cases, the approach outperforms significantly chance.
(Show Context)

Citation Context

...led with Gaussian distributions. To the best of our knowledge, only a few works have been dedicated to the automatic recognition of roles. Some of them recognize functional roles in broadcast data [4]=-=[13]-=-, i.e. the tasks that different people perform in television and radio programs (e.g. anchorman or guest), and another recognizes functional roles in movies [15] (e.g. hero or hero’s friends). The rec...

Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction

by Hugues Salamin, Sarah Favre, Ro Vinciarelli
"... Abstract—Automatic analysis of social interactions attracts increasing attention in the multimedia community. This paper considers one of the most important aspects of the problem, namely the roles played by individuals interacting in different settings. In particular, this work proposes an automati ..."
Abstract - Cited by 18 (6 self) - Add to MetaCart
Abstract—Automatic analysis of social interactions attracts increasing attention in the multimedia community. This paper considers one of the most important aspects of the problem, namely the roles played by individuals interacting in different settings. In particular, this work proposes an automatic approach for the recognition of roles in both production environment contexts (e.g., news and talk-shows) and spontaneous situations (e.g., meetings). The experiments are performed over roughly 90 hours of material (one of the largest databases used for role recognition in the literature) and show that the recognition effectiveness depends on how much the roles influence the behavior of people. Furthermore, this work proposes the first approach for modeling mutual dependences between roles and assesses its effect on role recognition performance.
(Show Context)

Citation Context

...a. Furthermore, the role can be used to segment the data into semantically coherent segments [11][12]. The main contributions of this paper with respect to previous approaches proposed by the authors =-=[13]-=- and the rest of the literature are as follows: • The approach proposed in [13] can be applied only to groups involving at least 8-10 persons because it is based on simple Social Networks and these ne...

Predicting Two Facets of Social Verticality in Meetings from Five-Minute Time Slices and Nonverbal Cues

by Dinesh Babu Jayagopi, Sileye Ba, Jean-marc Odobez, Daniel Gatica-perez
"... This paper addresses the automatic estimation of two aspects of social verticality (status and dominance) in small-group meetings using nonverbal cues. The correlation of nonverbal behavior with these social constructs have been extensively documented in social psychology, but their value for comput ..."
Abstract - Cited by 18 (6 self) - Add to MetaCart
This paper addresses the automatic estimation of two aspects of social verticality (status and dominance) in small-group meetings using nonverbal cues. The correlation of nonverbal behavior with these social constructs have been extensively documented in social psychology, but their value for computational models is, in many cases, still unknown. We present a systematic study of automatically extracted cues- including vocalic, visual activity, and visual attention cues- and investigate their relative effectiveness to predict both the most-dominant person and the high-status project manager from relative short observations. We use five hours of task-oriented meeting data with natural behavior for our experiments. Our work suggests that, although dominance and role-based status are related concepts, they are not equivalent and are thus not equally explained by the same nonverbal cues. Furthermore, the best cues can correctly predict the person with highest dominance or role-based status with an accuracy of 70 % approximately.

Social Signals, their Function, and Automatic Analysis: A Survey

by Alessandro Vinciarelli, Maja Pantic, Hervé Bourlard, Alex Pentland - ICMI'08 , 2008
"... Social Signal Processing (SSP) aims at the analysis of social behaviour in both Human-Human and Human-Computer interactions. SSP revolves around automatic sensing and interpretation of social signals, complex aggregates of nonverbal behaviours through which individuals express their attitudes toward ..."
Abstract - Cited by 13 (2 self) - Add to MetaCart
Social Signal Processing (SSP) aims at the analysis of social behaviour in both Human-Human and Human-Computer interactions. SSP revolves around automatic sensing and interpretation of social signals, complex aggregates of nonverbal behaviours through which individuals express their attitudes towards other human (and virtual) participants in the current social context. As such, SSP integrates both engineering (speech analysis, computer vision, etc.) and human sciences (social psychology, anthropology, etc.) as it requires multimodal and multidisciplinary approaches. As of today, SSP is still in its early infancy, but the domain is quickly developing, and a growing number of works is appearing in the literature. This paper provides an introduction to nonverbal behaviour involved in social signals and a survey of the main results obtained so far in SSP. It also outlines possibilities and challenges that SSP is expected to face in the next years if it is to reach its full maturity.
(Show Context)

Citation Context

...of precision in role assignment licly available, 5 roles) [24] AMI Meeting Corpus (138 recordings, 45h.00m simulated 67.9% of the data time correctly lapublicly available, 4 roles) beledintermsofrole =-=[70]-=- Radio news bulletins (96 recordings, 6 25h.00m real 80% of the data time correctly laroles) beledintermsofrole [72] Movies (3 recordings , 4 roles) 5h.46m real 95% of roles correctly assigned [74] Th...

Mining group nonverbal conversational patterns using probabilistic topic models

by Dinesh Babu Jayagopi, Daniel Gatica-perez - IEEE TRANS. MULTIMED , 2010
"... The automatic discovery of group conversational behavior is a relevant problem in social computing. In this paper, we present an approach to address this problem by defining a novel group descriptor called bag of group-nonverbal-patterns (NVPs) defined on brief observations of group interaction, an ..."
Abstract - Cited by 11 (5 self) - Add to MetaCart
The automatic discovery of group conversational behavior is a relevant problem in social computing. In this paper, we present an approach to address this problem by defining a novel group descriptor called bag of group-nonverbal-patterns (NVPs) defined on brief observations of group interaction, and by using principled probabilistic topic models to discover topics. The proposed bag of group NVPs allows fusion of individual cues and facilitates the eventual comparison of groups of varying sizes. The use of topic models helps to cluster group interactions and to quantify how different they are from each other in a formal probabilistic sense. Results of behavioral topics discovered on the Augmented Multi-Party Interaction (AMI) meeting corpus are shown to be meaningful using human annotation with multiple observers. Our method facilitates “group behavior-based” retrieval of group conversational segments without the need of any previous labeling.
(Show Context)

Citation Context

...s online using multimodal visualization [13], [39], [47], interactive furniture [1] or wearable devices [26]. While modeling individuals allows to study dominance [23], [43], status [21], roles [11], =-=[48]-=-, personality [38], modeling groups could reveal group interest [16], interactivity and centrality in groups [36], and identify cooperative groups against competitive groups [24]. The methods studied ...

CHARACTERIZING CONVERSATIONAL GROUP DYNAMICS USING NONVERBAL BEHAVIOUR

by Dinesh Babu Jayagopi, Bogdan Raducanu, Daniel Gatica-perez
"... This paper addresses the novel problem of characterizing conversational group dynamics. It is well documented in social psychology that depending on the objectives a group, the dynamics are different. For example, a competitive meeting has a different objective from that of a collaborative meeting. ..."
Abstract - Cited by 10 (6 self) - Add to MetaCart
This paper addresses the novel problem of characterizing conversational group dynamics. It is well documented in social psychology that depending on the objectives a group, the dynamics are different. For example, a competitive meeting has a different objective from that of a collaborative meeting. We propose a method to characterize group dynamics based on the joint description of a group members ’ aggregated acoustical nonverbal behaviour to classify two meeting datasets (one being cooperative-type and the other being competitive-type). We use 4.5 hours of real behavioural multi-party data and show that our methodology can achieve a classification rate of upto 100%. Index Terms — Competitive and cooperative meetings, group dynamics, nonverbal cues 1.
(Show Context)

Citation Context

...ng only nonverbal behaviour? Various works have analyzed face-to-face meetings group conversations [2], attempting to characterize individual social attributes like dominance [3, 4],status [5], roles =-=[6, 7]-=- and personalities [8]. The works differ widely in the cues and the models they employ. Very few works however have attempted to characterize groups as a whole. In [9] four short conversations were ch...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University