Back Channel Communication
Antoine Raux
Dialogs on Dialogs 02/25/2005
Outline
• From Back Channel to backchannels
• Function of the Back Channel
• Characteristics of the Back Channel
• The Back Channel in Spoken Dialogue Systems
From back channel…
• 1970s: Conversation Analysts attempt to describe systematic rules for turn-taking management
  – Goal: minimize gaps and overlaps between speakers
• BUT many overlaps in natural speech
  – E.g.: “mm-hmm”, “okay”, “yeah”…
• “Back channel” (Yngve 1970): Parallel channel for communication (Duncan 1972)
  – “Back channel communication does not constitute a turn or a claim for a turn”
  – But it “may participate in a variety of communication functions, including the regulation of speaking turns.”
…to backchannels
• “Backchannel”: listener-produced signal such as “mm-hmm”, “yeah”… (“To backchannel”: to produce such signals)
• Does not imply the will to take the turn
• Implies some form of acknowledgment (in general)
Front vs Back Channel

| | Front Channel | Back Channel |
|---|---|---|
| Function | Propositional, transactional, conversation management, social | Conversation management, social |
| Protocol | Turn-taking, floor sharing | ? (controlled by FC?), no floor to share |
| Lexical content | Anything | Vocalizations, short words, phrases (“That’s true”) |
Front-channel cues to back-channel signals
• Koiso et al (1998)
• Analyze the relationship between different syntactic and prosodic features and the occurrence of backchannels
Koiso et al (Methodology)
• Data: 8 dialogs from the Japanese Map Task corpus:
  – Replica of the Edinburgh Map Task
  – Face-to-face and speech only (no difference)
• Features
  – Syntactic: POS
  – Duration of last mora (normal/long/short)
  – F0 pattern of last mora (flat-fall, rise…)
  – Peak F0 (low/high)
  – Energy pattern (late-decr, decr, no-decr)
  – Peak energy (low/high)
Koiso et al (Results)
• Frequency of feature values
BC > no-BC:
  – POS=verb-phrase, post-position, conjunction
  – F0 pat=flat-fall or rise-fall
  – Energy pat=late-decr
  – Peak energy=high
no-BC > BC:
  – POS=adv, conjunction, interjection, filler
  – Dur=short
  – F0 pat=fall or flat
  – Energy pat=non-decr
  – Peak energy=low
Koiso et al (Results)
• Decision Tree analysis
• Compare the loss in performance by not using each feature
  – POS: single best feature
  – Prosodic features altogether: as good as POS
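The "loss in performance by not using each feature" comparison can be sketched as a leave-one-feature-out loop. This is a minimal illustration only: it swaps the decision tree for a simple majority-vote lookup, the function names `accuracy_with` and `ablation` are hypothetical, and the four toy rows are invented to exercise the code, not taken from the Map Task data.

```python
from collections import Counter, defaultdict


def accuracy_with(features, rows):
    """Training-set accuracy of a majority-vote lookup over the chosen features.

    Stand-in for a decision tree (an assumption of this sketch): each distinct
    combination of feature values predicts its most frequent BC label.
    """
    votes = defaultdict(Counter)
    for row in rows:
        key = tuple(row[f] for f in features)
        votes[key][row["bc"]] += 1
    correct = sum(max(counts.values()) for counts in votes.values())
    return correct / len(rows)


def ablation(features, rows):
    """Accuracy lost when each feature is removed in turn."""
    full = accuracy_with(features, rows)
    return {
        f: full - accuracy_with([g for g in features if g != f], rows)
        for f in features
    }


# Invented toy rows (not corpus data): "bc" says whether a backchannel followed.
toy_rows = [
    {"pos": "verb", "f0": "rise-fall", "bc": True},
    {"pos": "verb", "f0": "flat", "bc": True},
    {"pos": "filler", "f0": "rise-fall", "bc": False},
    {"pos": "filler", "f0": "flat", "bc": False},
]

losses = ablation(["pos", "f0"], toy_rows)
print(losses)
```

A larger loss means the removed feature carried more information. Removing a whole group of features at once (for example all prosodic ones) would follow the same pattern with a list of names per group.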
Koiso et al (Discussion)
• Some POS strongly inhibit BC
• Individual prosodic features are not good indicators of BC occurrence
• BC occurrence is conditioned by both POS and prosody (as a whole)
• What about other languages?
• What about BC overlapping with speech?
BC cues in English and Japanese
• Ward and Tsukahara (2000)
• Tests one hypothesis (“BC are triggered by low pitch cues”) for two languages
The Low Pitch Cue
• Both in American English and Japanese, it appears that “after a region of low pitch lasting 110 ms the listener tends to produce back-channel feedback”.
• Goal of this paper: quantitatively test this on naturally occurring conversations
Ward and Tsukahara (Methodology)
• Data:
  – English: 8 conversations, 12 speakers (first author participates in 5 conversations!)
  – Japanese: 18 conversations, 24 speakers
• Prediction:
  – Every 10ms decide BC/no-BC by applying a hand coded rule with 5 parameters tuned to the data
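A frame-by-frame low-pitch rule of this kind might look like the sketch below. Only the 10 ms decision step and the 110 ms low-pitch duration come from the slides. The pitch threshold, the refractory period, and the name `predict_backchannels` are illustrative assumptions, and the sketch uses three parameters rather than the five of the original rule.

```python
def predict_backchannels(pitch_hz, low_threshold_hz, frame_ms=10,
                         min_low_ms=110, refractory_ms=800):
    """Return the times (in ms) at which a backchannel is predicted.

    pitch_hz: one F0 value per frame, or None for unvoiced frames.
    low_threshold_hz: pitch below this counts as "low" (assumed parameter).
    refractory_ms: minimum gap between two predictions (assumed parameter).
    """
    predictions = []
    low_run_ms = 0       # length of the current uninterrupted low-pitch region
    last_bc_ms = None    # time of the most recent prediction, if any
    for i, f0 in enumerate(pitch_hz):
        if f0 is not None and f0 < low_threshold_hz:
            low_run_ms += frame_ms
        else:
            low_run_ms = 0
        now_ms = (i + 1) * frame_ms
        recently_fired = (last_bc_ms is not None
                          and now_ms - last_bc_ms < refractory_ms)
        if low_run_ms >= min_low_ms and not recently_fired:
            predictions.append(now_ms)
            last_bc_ms = now_ms
            low_run_ms = 0
    return predictions


# 200 ms of high pitch, 150 ms of low pitch, then 100 ms of high pitch.
track = [200.0] * 20 + [90.0] * 15 + [200.0] * 10
print(predict_backchannels(track, low_threshold_hz=100.0))  # [310]
```

The prediction fires at the end of the eleventh consecutive low frame, i.e. once the low region has lasted 110 ms.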
Ward and Tsukahara (Results)
• Each predicted BC was considered correct if it fell within 500ms of an actual BC
• Low pitch region rule is better than chance both in English and Japanese
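The tolerance-window scoring can be sketched as follows. The 500 ms window is from the slide; the function name `score_predictions`, the recall-style coverage measure, and the example times are assumptions added for illustration.

```python
def score_predictions(predicted_ms, actual_ms, tolerance_ms=500):
    """Score predicted BC times against actual BC times.

    A prediction counts as correct if it falls within tolerance_ms of any
    actual BC. Coverage (share of actual BCs with a nearby prediction) is
    an addition of this sketch, not something stated on the slide.
    """
    hits = [p for p in predicted_ms
            if any(abs(p - a) <= tolerance_ms for a in actual_ms)]
    covered = [a for a in actual_ms
               if any(abs(p - a) <= tolerance_ms for p in predicted_ms)]
    precision = len(hits) / len(predicted_ms) if predicted_ms else 0.0
    coverage = len(covered) / len(actual_ms) if actual_ms else 0.0
    return precision, coverage


# Hypothetical times in ms: only the first prediction is near an actual BC.
precision, coverage = score_predictions([1000, 3000, 7000], [1200, 5000])
print(precision, coverage)
```

Widening `tolerance_ms` can only raise both numbers, which is one reason the choice of window size matters when comparing results.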
Ward and Tsukahara (Results)
• Issues:
  – Evaluation (tolerance window size, speakers produce BCs with different frequencies…)
  – No actual comparison between languages
  – Are low pitch regions and BCs simply correlated to other phenomena (syntactic completion, disfluencies…) or is there a direct cause/consequence relationship?
Effects of Native Language and Gender on BC
• Feke (2003)
• Conversation Analysis study of BC in native-English and native-Spanish, same- and mixed-gender dialogs
Definition of BC
• BC: responses of the participant who is “clearly not holding the floor”…
• Very loose compared to previous papers:
  – e.g. “How did you find Quechua?” is a BC
• Distinguishes In-Between BC and Overlap BC
Feke (Methodology)
• Recorded 8 non-scripted conversations between 8 different speakers (2 native languages x 2 genders x 2 subjects)
• Manually coded In-Between BCs and Overlap BCs
Feke (Results)
• No differences observed across cultures
• Participants of both genders tend to use more BC when conversing with someone of the opposite gender
• Difference seems bigger for females than for males
Feke (Discussion)
• Interesting/surprising result from the ethnological/sociological point of view
• Very few data points, no significance analysis
• Only looked at number of BCs
• Consequences on SDS? (e.g. using gender information in BC prediction, selecting the gender of an agent…)
BC in Practical Systems…
• Takeuchi et al (2003)
• Method to determine the timing of turn transitions and aizuchi (≈BC) on Japanese Human-Human corpus
Takeuchi (Approach)
• Similar to Koiso et al, but only using automatically extracted features
• Every 100 ms decide between:
  – Take turn
  – Aizuchi (BC)
  – Leave turn (wait)
Takeuchi (Approach)
• Decision Tree using
  – Syntax (POS, content/function words)
  – Utterance duration
  – Pause duration/pause since last content word
  – Content word duration
  – F0
  – Power
Takeuchi (Results)
• Precision/Recall of frame classification:
  – Around 80% on the training set
  – Less than 50% on a test set
• Subjective evaluation:
  – Artificially insert BC at predicted time
  – Timing was judged “good” in 70-80%
  – On real utterances: 72% (!)
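Frame-level precision and recall for the three decisions can be computed per class, as in the sketch below. The label strings, the function name `per_class_precision_recall`, and the six example frames are illustrative assumptions; the slides do not say how the reported figures were aggregated across classes.

```python
def per_class_precision_recall(reference, predicted,
                               labels=("take", "aizuchi", "wait")):
    """Return {label: (precision, recall)} over aligned 100 ms frames.

    reference and predicted are equal-length sequences of frame labels.
    A class with no predicted (or no reference) frames scores 0.0.
    """
    scores = {}
    for label in labels:
        true_pos = sum(1 for r, p in zip(reference, predicted)
                       if r == label and p == label)
        n_pred = sum(1 for p in predicted if p == label)
        n_ref = sum(1 for r in reference if r == label)
        precision = true_pos / n_pred if n_pred else 0.0
        recall = true_pos / n_ref if n_ref else 0.0
        scores[label] = (precision, recall)
    return scores


# Hypothetical frame labels, one per 100 ms decision point.
reference = ["wait", "wait", "aizuchi", "wait", "take", "wait"]
predicted = ["wait", "aizuchi", "aizuchi", "wait", "wait", "wait"]
print(per_class_precision_recall(reference, predicted))
```

Because most frames are "wait", per-class figures are more informative than overall frame accuracy, which a classifier that always waits would already score well on.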
Takeuchi (Discussion)
• Found that syntactic information did not help (contradicts Koiso?)
• Underscores the difficulty of evaluating turn-taking/backchanneling systems
Conclusion
• Hard to account for simultaneous turns in conversation
• Back Channel framework offers one explanation
• But most work remains very specific
• Missing a good theory of conversation…