natural language processing for signed languages

131
Natural Language Processing for Signed Languages Kayo Yin DeepMind SL Reading Group November 5 2021

Upload: others

Post on 01-Mar-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Natural Language Processing for Signed Languages

Natural Language Processing for Signed LanguagesKayo YinDeepMind SL Reading Group November 5 2021

Page 2: Natural Language Processing for Signed Languages

Signed Languages

2

● Fully-fledged natural languages● Expressed through various cues● Independent of spoken languages

Page 3: Natural Language Processing for Signed Languages

Signed Languages

3

● 200 signed languages● ~70m deaf people

Page 4: Natural Language Processing for Signed Languages

Signed Languages

4

● Primary and preferred means of communication for Deaf communities

Page 5: Natural Language Processing for Signed Languages

Who Benefits from Natural Language Processing?

5

Page 6: Natural Language Processing for Signed Languages

Who Benefits from Natural Language Processing?

6

Page 7: Natural Language Processing for Signed Languages

Who Benefits from Natural Language Processing?

7

Page 8: Natural Language Processing for Signed Languages

Who Benefits from Natural Language Processing?

8

Let’s allow everyone to benefit from technology using their preferred language!

Page 9: Natural Language Processing for Signed Languages

Who is Working on Sign Language Processing?

9

Page 10: Natural Language Processing for Signed Languages

Who is Working on Sign Language Processing?

10

Little NLP involvement

Mostly computer vision

Page 11: Natural Language Processing for Signed Languages

Who is Working on Sign Language Processing?

11

Current models ignore the linguistic structure of signed languages

Page 12: Natural Language Processing for Signed Languages

Who is Working on Sign Language Processing?

12

Incorporate linguistic insight into Sign Language Processing

Page 13: Natural Language Processing for Signed Languages

Extending NLP to Signed Languages

13POS TaggingSyntactic Parsing

Named Entity Recognition

Coreference Resolution

● Both spoken and signed languages express the grammar of natural languages

Page 14: Natural Language Processing for Signed Languages

Extending NLP to Signed Languages

14POS TaggingSyntactic Parsing

Named Entity Recognition

Coreference Resolution

● Both spoken and signed languages express the grammar of natural languages

● Extend core NLP tools to signed languages

Page 15: Natural Language Processing for Signed Languages

Challenges: Data Scarcity

15

Taxonomy of language resources(Joshi et al., 2020)

● Need large, realistic datasets

Page 16: Natural Language Processing for Signed Languages

Challenges: Data Scarcity

16

Taxonomy of language resources(Joshi et al., 2020)

● Need large, realistic datasets● All signed languages are

extremely low-resource

Page 17: Natural Language Processing for Signed Languages

Challenges: Data Scarcity

17

● Difficult to recruit and record signers for data collection

Page 18: Natural Language Processing for Signed Languages

Challenges: Data Scarcity

18

● Difficult to recruit and record signers for data collection● Finding / training annotators is challenging

Page 19: Natural Language Processing for Signed Languages

Challenges: Data Scarcity

19

● Difficult to recruit and record signers for data collection● Finding / training annotators is challenging● 1 minute of labelled data requires 600 minutes of data collection

Page 20: Natural Language Processing for Signed Languages

Challenges: Spatial Dependencies

20

● Grounding in signing space

Page 21: Natural Language Processing for Signed Languages

Challenges: Spatial Dependencies

21

● Grounding in signing space● We need to model the spatial

discourse

Page 22: Natural Language Processing for Signed Languages

Natural Language Processing for Signed Languages

22

In this talk, we explore:

● Data augmentation for Sign Language Translation

Page 23: Natural Language Processing for Signed Languages

Natural Language Processing for Signed Languages

23

In this talk, we explore:

● Data augmentation for Sign Language Translation

● Coreference resolution for pronominal indexing signs

Page 24: Natural Language Processing for Signed Languages

Data Augmentation for Sign Language Gloss Translation

Amit Moryossef*, Kayo Yin*, Graham Neubig, Yoav Goldberg(MTSummit21 AT4SSL Workshop)

*Equal contribution

24

Page 25: Natural Language Processing for Signed Languages

Sign Language Translation

25

Page 26: Natural Language Processing for Signed Languages

Sign Language Translation

26

Page 27: Natural Language Processing for Signed Languages

Overcoming Data Scarcity

27

● Gloss-to-text translation = extremely low resource MT

Page 28: Natural Language Processing for Signed Languages

Overcoming Data Scarcity

28

● Gloss-to-text translation = extremely low resource MT

● How is the relationship between a signed and spoken language different from two spoken languages?

Page 29: Natural Language Processing for Signed Languages

Overcoming Data Scarcity

29

● Gloss-to-text translation = extremely low resource MT

● How is the relationship between a signed and spoken language different from two spoken languages?

● Can we improve gloss-to-text translation using pseudo-parallel data?

Page 30: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

30

● Lexical similarity

● Syntactic similarity

Page 31: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

31

● Lexical similarity

● Syntactic similarity

Page 32: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

32

● Lexical similarity

● Syntactic similarity

Page 33: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

33

Page 34: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

34

Page 35: Natural Language Processing for Signed Languages

Signed vs. Spoken Languages

35

➔ Signed-spoken language pairs are lexically similar but syntactically different

● Syntactic similarity

Page 36: Natural Language Processing for Signed Languages

Data Augmentation

36

I’m looking forward to seeing the children tomorrow.

Page 37: Natural Language Processing for Signed Languages

Data Augmentation

37

I’m looking forward to seeing the children tomorrow.

Page 38: Natural Language Processing for Signed Languages

Data Augmentation

38

I’m looking forward to seeing the children tomorrow.

LOOK FORWARD SEE CHILD TOMORROW

Page 39: Natural Language Processing for Signed Languages

Data Augmentation

39

I’m looking forward to seeing the children tomorrow.

FORWARD LOOK TOMORROW CHILD SEE

Page 40: Natural Language Processing for Signed Languages

Data

40

● NCSLGR (SignStream, 2007)○ American Sign Language (ASL) - English○ 1,875 parallel sentences

● PHOENIX 2014T (Camgoz et al., 2018)○ German Sign Language (DGS) - German○ 8,257 parallel sentences

Page 41: Natural Language Processing for Signed Languages

Model Training

41

Real Data

NMT (Yin and Read, 2020)

Synthesized Data

Page 42: Natural Language Processing for Signed Languages

Model Training

42

Real Data

NMT (Yin and Read, 2020)

Synthesized Data

Page 43: Natural Language Processing for Signed Languages

Model Training

43

Real Data

NMT (Yin and Read, 2020)

Synthesized Data

Page 44: Natural Language Processing for Signed Languages

Model Training

44

Real Data

NMT (Yin and Read, 2020)

Synthesized Data

Page 45: Natural Language Processing for Signed Languages

Results

45

Page 46: Natural Language Processing for Signed Languages

Results

46

Page 47: Natural Language Processing for Signed Languages

Results

47

Page 48: Natural Language Processing for Signed Languages

Results

48

● Consistent translation improvements using data augmentation to leverage lexical similarities and handle syntactic differences

● Data augmentation using monolingual spoken language data is a promising approach

Page 49: Natural Language Processing for Signed Languages

Signed Coreference Resolution

Kayo Yin, Kenneth DeHaan, Malihe Alikhani(EMNLP 2021)

49

Page 50: Natural Language Processing for Signed Languages

Coreference Resolution

Page 51: Natural Language Processing for Signed Languages

Coreference Resolution

Page 52: Natural Language Processing for Signed Languages

Signed Coreference Resolution

Page 53: Natural Language Processing for Signed Languages

Signed Coreference Resolution

Page 54: Natural Language Processing for Signed Languages

➔ Novel challenges in modeling discourse and spatial context

Signed Coreference Resolution

Page 55: Natural Language Processing for Signed Languages

➔ Novel challenges in modeling discourse and spatial context

➔ Better understanding of grounding in different forms of communication

Signed Coreference Resolution

Page 56: Natural Language Processing for Signed Languages

➔ Novel challenges in modeling discourse and spatial context

➔ Better understanding of grounding in different forms of communication

➔ Broaden the scope of NLP to multiple modalities

Signed Coreference Resolution

Page 57: Natural Language Processing for Signed Languages

➔ Novel challenges in modeling discourse and spatial context

➔ Better understanding of grounding in different forms of communication

➔ Broaden the scope of NLP to multiple modalities

➔ Enable Sign Language Processing technologies

Signed Coreference Resolution

Page 58: Natural Language Processing for Signed Languages

1. Pronominal Pointing Signs

2. Signed Coreference Resolution

3. Unsupervised Continuous Multigraph

4. Results & Discussion

Outline

Page 59: Natural Language Processing for Signed Languages

➔ Pointing signs with a pronominal function

➔ Referents are established in the signing space

➔ Point to the actual location of the referent

➔ Assign a locus to the referent

Pronominal Pointing Signs

Page 60: Natural Language Processing for Signed Languages

➔ Pointing signs with a pronominal function

➔ Referents are established in the signing space

➔ Point to the actual location of the referent

➔ Assign a locus to the referent

Pronominal Pointing Signs

Page 61: Natural Language Processing for Signed Languages

➔ Pointing signs with a pronominal function

➔ Referents are established in the signing space

➔ Point to the actual location of the referent

Pronominal Pointing Signs

Page 62: Natural Language Processing for Signed Languages

➔ Pointing signs with a pronominal function

➔ Referents are established in the signing space

➔ Point to the actual location of the referent

➔ Assign a locus to the referent

Pronominal Pointing Signs

Page 63: Natural Language Processing for Signed Languages

Pronominal Pointing Signs

Page 64: Natural Language Processing for Signed Languages

Pronominal Pointing Signs

Page 65: Natural Language Processing for Signed Languages

➔ Pointing signs can serve other functions

Complexities of Pointing Signs

Page 66: Natural Language Processing for Signed Languages

➔ Pointing signs can serve other functions

➔ Difficult to distinguish between different pointing signs based solely on

local visual features

Complexities of Pointing Signs

Page 67: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own

Complexities of Pointing Signs

ASL Pointing Signs

Page 68: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

Page 69: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own- The same word can refer to

multiple entities at once

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

Page 70: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own- The same word can refer to

multiple entities at once

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

My mother never liked Alice, she thought she was up to no good.

Page 71: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own- The same word can refer to

multiple entities at once

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

+ 1 locus = 1 referent

Page 72: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own- The same word can refer to

multiple entities at once

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

+ 1 locus = 1 referent- Loci can be reassigned to

different referents

Page 73: Natural Language Processing for Signed Languages

English Pronouns

+ Carry some meaning on its own- The same word can refer to

multiple entities at once

Complexities of Pointing Signs

ASL Pointing Signs

- Use the same handshape, harder to distinguish on its own

+ 1 locus = 1 referent- Loci can be reassigned to

different referents- Referents can be assigned

multiple loci

Page 74: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

Why study Signed Coreference Resolution in NLP?

Page 75: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016)

Why study Signed Coreference Resolution in NLP?

Page 76: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016)

◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020)

Why study Signed Coreference Resolution in NLP?

Page 77: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016)

◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020)

➔ It can help us better understand multimodal communication

Why study Signed Coreference Resolution in NLP?

Page 78: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016)

◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020)

➔ It can help us better understand multimodal communication

◆ Spatial iconicity and situated referents in signed languages

Why study Signed Coreference Resolution in NLP?

Page 79: Natural Language Processing for Signed Languages

➔ Theories of coreference in spoken languages may be extended to

signed languages

◆ Discourse Representation Theory (Kamp et al., 2011; Steinbach and Onea 2016)

◆ First mention effect (Gernsbacher and Hargreaves, 1988; Wienholz et al., 2020)

➔ It can help us better understand multimodal communication

◆ Spatial iconicity and situated referents in signed languages

➔ Widen the accessibility of language technologies

Why study Signed Coreference Resolution in NLP?

Page 80: Natural Language Processing for Signed Languages

1. Pronominal Pointing Signs

2. Signed Coreference Resolution

3. Unsupervised Continuous Multigraph

4. Results & Discussion

Outline

Page 81: Natural Language Processing for Signed Languages

Signed Coreference Resolution

Page 82: Natural Language Processing for Signed Languages

Signed Coreference Resolution

1. Mention Detection

Page 83: Natural Language Processing for Signed Languages

Signed Coreference Resolution

2. Coreference Resolution

Page 84: Natural Language Processing for Signed Languages

DGS-Coref Dataset

Public DGS Corpus (Hanke et al., 2020)

Page 85: Natural Language Processing for Signed Languages

DGS-Coref Dataset

Page 86: Natural Language Processing for Signed Languages

➔ 16m30s of signing

➔ 3 conversations

➔ 5 different signers

➔ 288 signed sentences

➔ 1,457 glosses

◆ 95 <I> signs

◆ 8 <YOU> signs

◆ 93 <INDEX> signs

DGS-Coref Dataset

A: WITH TRIP INDEX SHIP INDEX

A: We went there with an excursion boat.

Page 87: Natural Language Processing for Signed Languages

1. Pronominal Pointing Signs

2. Signed Coreference Resolution

3. Unsupervised Continuous Multigraph

4. Results & Discussion

Outline

Page 88: Natural Language Processing for Signed Languages

Unsupervised Continuous Multigraph

Page 89: Natural Language Processing for Signed Languages

Unsupervised Continuous Multigraph

Page 90: Natural Language Processing for Signed Languages

Unsupervised Continuous Multigraph

Page 91: Natural Language Processing for Signed Languages

Unsupervised Continuous Multigraph

Page 92: Natural Language Processing for Signed Languages

1. I and I

Positive Relations

Page 93: Natural Language Processing for Signed Languages

1. I and I2. You and You

Positive Relations

Page 94: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You

Positive Relations

Page 95: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You4. Temporally Close Index

Positive Relations

Page 96: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You4. Temporally Close Index5. Noun Phrase

Positive Relations

Page 97: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You4. Temporally Close Index5. Noun Phrase6. Spatially Close Index

Positive Relations

Page 98: Natural Language Processing for Signed Languages

1. I and I

Negative Relations

Page 99: Natural Language Processing for Signed Languages

1. I and I2. You and You

Negative Relations

Page 100: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You

Negative Relations

Page 101: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You4. Different Person

Negative Relations

Page 102: Natural Language Processing for Signed Languages

1. I and I2. You and You3. I and You4. Spatially Far Index

Negative Relations

Page 103: Natural Language Processing for Signed Languages

Positive Relations

1. I and I

2. You and You

3. I and You

4. Temporally Close Index

5. Noun Phrase

6. Spatially Close Index

Weight Assignment

Negative Relations

1. I and I

2. You and You

3. I and You

4. Spatially Far Index

Page 104: Natural Language Processing for Signed Languages

Positive Relations

1. I and I

2. You and You

3. I and You

4. Temporally Close Index

5. Noun Phrase

6. Spatially Close Index

Weight Assignment

Negative Relations

1. I and I

2. You and You

3. I and You

4. Spatially Far Index

-∞

Page 105: Natural Language Processing for Signed Languages

Positive Relations

1. I and I

2. You and You

3. I and You

4. Temporally Close Index

5. Noun Phrase

6. Spatially Close Index

Weight Assignment

Negative Relations

1. I and I

2. You and You

3. I and You

4. Spatially Far Index

-∞+0.5

Page 106: Natural Language Processing for Signed Languages

Positive Relations

1. I and I

2. You and You

3. I and You

4. Temporally Close Index

5. Noun Phrase

6. Spatially Close Index

Weight Assignment

Negative Relations

1. I and I

2. You and You

3. I and You

4. Spatially Far Index

-∞+0.5

+(10-t)/20

Page 107: Natural Language Processing for Signed Languages

Positive Relations

1. I and I

2. You and You

3. I and You

4. Temporally Close Index

5. Noun Phrase

6. Spatially Close Index

Weight Assignment

Negative Relations

1. I and I

2. You and You

3. I and You

4. Spatially Far Index

-∞+0.5

+(10-t)/20

+(50-s)/50

Page 108: Natural Language Processing for Signed Languages

Clustering

Page 109: Natural Language Processing for Signed Languages

Clustering

Page 110: Natural Language Processing for Signed Languages

1. Pronominal Pointing Signs

2. Signed Coreference Resolution

3. Unsupervised Continuous Multigraph

4. Results & Discussion

Outline

Page 111: Natural Language Processing for Signed Languages

Results

Page 112: Natural Language Processing for Signed Languages

Results

Page 113: Natural Language Processing for Signed Languages

Results

Page 114: Natural Language Processing for Signed Languages

Examples

TO-SEE YOU GOOD YOU

I think you could do a good job there.

GEST-DECLINE I CAN NOT TO-SAY TO-HOLD-ON I

I can’t keep that promise

Page 115: Natural Language Processing for Signed Languages

Examples

TO-SEE YOU GOOD YOU

I think you could do a good job there.

GEST-DECLINE I CAN NOT TO-SAY TO-HOLD-ON I

I can’t keep that promise

P_IAndIP_YouAndYou

P_IAndYou

Page 116: Natural Language Processing for Signed Languages

Examples

STUTTGART NUM-1 NAME INDEX NUM-1 FREIBURG

Once we were in Stuttgart, once in Ingolstadt and once in Freiburg.

Page 117: Natural Language Processing for Signed Languages

Examples

STUTTGART NUM-1 NAME INDEX NUM-1 FREIBURG

Once we were in Stuttgart, once in Ingolstadt and once in Freiburg.

P_NounPhrase

Page 118: Natural Language Processing for Signed Languages

Examples

WITH TRIP INDEX SHIP INDEX

We went there with an excursion boat.

Page 119: Natural Language Processing for Signed Languages

Examples

WITH TRIP INDEX SHIP INDEX

We went there with an excursion boat.

P_TemporallyCloseIndexP_SpatiallyCloseIndex

Page 120: Natural Language Processing for Signed Languages

Examples

I TO-LEARN INDEX HAMBURG INDEX

I learned it in Hamburg.

Page 121: Natural Language Processing for Signed Languages

Examples

I TO-LEARN INDEX HAMBURG INDEX

I learned it in Hamburg.

P_TemporallyCloseIndexP_SpatiallyCloseIndex

Page 122: Natural Language Processing for Signed Languages

Conclusion

● Signed language data is scarce and hard to obtain

Page 123: Natural Language Processing for Signed Languages

Conclusion

● Signed language data is scarce and hard to obtain○ Data augmentation from monolingual spoken language data is

one promising way to mitigate this

Page 124: Natural Language Processing for Signed Languages

Conclusion

● Signed language data is scarce and hard to obtain○ Data augmentation from monolingual spoken language data is

one promising way to mitigate this

● The meaning of certain signs rely on spatial context

Page 125: Natural Language Processing for Signed Languages

Conclusion

● Signed language data is scarce and hard to obtain○ Data augmentation from monolingual spoken language data is

one promising way to mitigate this

● The meaning of certain signs rely on spatial context ○ Signed Coreference Resolution as a new challenge○ Unsupervised Continuous Multigraph for SCR

Page 126: Natural Language Processing for Signed Languages

Future Work

● Pre-training the target side decoder with spoken language data

Page 127: Natural Language Processing for Signed Languages

Future Work

● Pre-training the target side decoder with spoken language data

● Resolve other types of ambiguous signs

Page 128: Natural Language Processing for Signed Languages

Summary

➔ New challenge: Signed Coreference Resolution

➔ Annotation software & DGS-Coref dataset

Page 129: Natural Language Processing for Signed Languages

Summary

➔ New challenge: Signed Coreference Resolution

➔ Annotation software & DGS-Coref dataset

➔ Unsupervised Continuous Multigraph for SCR

Page 130: Natural Language Processing for Signed Languages

Summary

➔ New challenge: Signed Coreference Resolution

➔ Annotation software & DGS-Coref dataset

➔ Unsupervised Continuous Multigraph for SCR

➔ Code & data: https://github.com/kayoyin/scr

Page 131: Natural Language Processing for Signed Languages

Future Work

➔ Detect reassignment of loci

➔ Detect different functions of indexing signs

➔ Keep track of the dynamic signing space

➔ Directly process videos

➔ Resolve other types of pronominal signs

➔ Resolve other types of ambiguous signs