hello, who is calling? can words reveal the social nature of conversations?

48
Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Upload: dustin-johnston

Post on 21-Jan-2016

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Hello, Who is Calling? Can Words Reveal the Social

Nature of Conversations?

Page 2: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

OUTLINE

Abstract Introduction Corpus Automatic Speech Recognition Nature of Telephone Conversations Supervised Classification of Types of

Social Relationships Conclusions

Page 3: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Abstract

Aims to infer the social nature of conversation from their content automatically This motivation is stem from the need to

understand how social disengagement affects cognitive decline or depression among older adult.

Page 4: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Abstract

First step:Learned a binary classifier to filter out business related conversation.

Page 5: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Abstract

For classifying different type, we investigated feature related to: Language use (entropy) Hand-craft dictionary (LIWC) Latent Dirichlet models (LDA, unsupervised)

Page 6: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Introduction

This paper focus on understanding the social interaction of an individual over a period of time.

Page 7: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Introduction

Telephone conversation present several advantages for analysis Solely to an audio channel, without recourse

to gestures or facial expression. Simplify both collection and analysis. Transcribe using automatic speech

recognition (ASR) systems.

Page 8: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Introduction

our first task was to classify social and non-social conversations and reverse listing was useful to a certain extent.

Page 9: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Introduction

This study focus on using the resulting classifier as a tool to probe the nature of telephone conversations as well as to test whether the scores obtained from it can serve as a proxy for degree of social familiarity.

Page 10: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Corpus

Corpus consists of 12067 lane-line telephone conversation 10 volunteers, 79years or older Approximately 12 month Native English speaker form USA

Page 11: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Corpus

Corpus includes meta-data such as Call direction (incoming or outgoing) Time of call Time of duration DTMF/caller ID (if available)

Page 12: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Corpus

For each subject, twenty telephone numbers were corresponding to Top ten most frequent call Top ten longest call

Page 13: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Corpus

Subject were asked to identify relationship at these calls as Immediate family Near relatives Close friends Casual friends Strangers Business

Page 14: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Corpus

Of the 8,558 available conversations, 2,728 were identified as residential conversations and 1,095 were identified as business conversations using reverse listings from multiple sources phone directory lookup exit interviews internet lookup.

Page 15: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Automatic Speech Recognition

The acoustic models were trained on about 2000 hours of telephone speech from Switchboard and Fisher corpora.

Page 16: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Automatic Speech Recognition

Decoding is performed in three stages speaker-independent models vocal-tract normalized models speaker-adapted models

Page 17: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Nature of Telephone Conversations

Classification Experiments Can the Scores of the Binary Classifier

Differentiate Types of Social Relationship? How Informative are Openings and

Closings in Differentiating Telephone Conversations?

Why are Short Conversations difficult to Classify?

Can Openings Help Predict Relative Lengths of Conversations?

Page 18: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Classification Experiments

We assume that the majority of our training set reflect the true nature of the conversations. and expect to employ the classifier

subsequently for correcting the errors arising when personal conversations occur on business lines.

Page 19: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Classification Experiments

We learned a baseline SVM classifier using a balanced training set. Found that RBF kernel is most effective,

accuracy achieve 87.5% on verification data.

Page 20: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Can the Scores of the Binary Classifier Differentiate Types of

Social Relationship? We computed SVM score statistics for

all conversations with family and friends And for comparison, we also computed the

statistics for all conversations automatically tagged as residential as well as all conversations in the data.

Page 21: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 22: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

How Informative are Openings and Closings in Differentiating Telephone Conversations?

The structure of openings facilitate establishing identity of the conversants and the purpose of their call.

Closings in personal conversations are likely to include a pre-closing signal that allows either party to mention any unmentioned mentionables before conversation ends.

Page 23: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 24: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Why are Short Conversations difficult to Classify?

The results from the above section appear to contradict that shorter conversations suffer from poor classification, as a 30-word window can give very good performance.

Page 25: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 26: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Why are Short Conversations difficult to Classify?

These observations suggest that the long and short conversations are inherently different in nature, at least in their openings.

Page 27: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Can Openings Help Predict Relative

Lengths of Conversations? It is natural to ask whether openings can predict relative lengths of conversations.

Page 28: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Can Openings Help Predict Relative

Lengths of Conversations? Features from very short conversations may contain both openings and closings. both a hello and a goodbye.

Page 29: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Supervised Classification of Types of

Social Relationships We investigate the performance of classifiers to differentiate the following binary classes. Residential vs business Family vs all other Family vs other residential Familiar vs non-familiar

Page 30: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Supervised Classification of Types of

Social Relationships

Page 31: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Lexical Statistics

Speakers who share close social ties are likely to engage in conversations on a wide variety of topics and this is likely to reflect in the entropy of their language use.

Page 32: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 33: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Lexical Statistics

It is shown that people talk longer, more rapidly and have wider range of language use when conversing with a familiar contact and/or family member.

Page 34: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Lexical Statistics

Only the speaking rate showed significant differences among the residential/business categories, with business conversations being conducted at a slower pace at least for the elderly demographic in our corpus.

Page 35: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Linguistic inquiry and Word Count

The categories have significant overlap and a given word can map to zero or more categories.

Page 36: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 37: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Linguistic inquiry and Word Count

The clear benefit of LIWC is that the word categories have very clear and pre-labeled meanings.

Page 38: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 39: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Latent Dirichlet allocation

Unsupervised clustering and feature selection can make use of data for which we have no labels.

Page 40: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Latent Dirichlet allocation

LDA models a conversation as a bag of words.

Experimentally, we found best cross-validation results were obtained when α and β were set to 0.01 and 0.1 respectively.

Page 41: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Latent Dirichlet allocation

Most interesting, when the number of clusters are reduced to two, the LDA model managed to segment residential and business conversations with relatively high accuracy (80%).

Page 42: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 43: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Classifying Types of Social Relationships

Before performing classification, we produce balanced datasets that have equal numbers of conversations for each category.

Page 44: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?
Page 45: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Conclusions

We show that the business calls can be separated from social calls with accuracies as high as 85%.

Page 46: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Conclusions

When compared to language use (entropy)and hand-crafted dictionaries (LIWC), posteriors over topics computed using a latent Dirichlet model provide superior performance.

Page 47: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Conclusions

openings of conversations were found to be more informative in classifying conversation than closings or random segments, when using automated transcripts.

Page 48: Hello, Who is Calling? Can Words Reveal the Social Nature of Conversations?

Conclusions

In future work, we plan to examine subject specific language use, turn taking and affect to further improve the classification of social calls.