zero-shot sequence labeling: transferring ... - marek rei · zero-shot sequence labeling 3 i like...

13
1 Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens Marek Rei Anders Søgaard

Upload: others

Post on 01-Oct-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

1

Zero-shot Sequence Labeling:Transferring Knowledge

from Sentences to TokensMarek Rei Anders Søgaard

Page 2: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

2

Sequence labeling

_ _ C C _ _ _ _ _ _

Our data indicate that increased NF-kappa B DNA binding is ...

Hedge cue detection:

+ + + - + + + + + - +

I like to playing the guitar and sing very louder .

Error detection:

Page 3: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

3

Zero-shot sequence labeling

+ + + - + + + + + - +

I like to playing the guitar and sing very louder .

It was so long time to wait in the theatre .

I look forward to receiving reply to my enquiry .

This is a great opportunity to learn more about whales .

Therefore, houses will be built on high supports .

Page 4: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

4

Main idea

Neural sentence classification model01

02

03 Make attention weights behave like sequence labeling output

Based on self-attention

Page 5: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

5

Model architecture

Page 6: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

6

Based on softmax:

Based on sigmoid + normalisation:

Soft attention weights

Page 7: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

7

Optimising the attention

We can constrain the attention values based on the sentence-level label.

1. Only some, but not all, tokens in the sentence can have a positive label.

2. There are positive tokens in a sentence only if the overall sentence is positive.

Page 8: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

8

Alternative methods

1. Labeling through backpropagation

2. Relative frequency

3. Supervised sequence labeling

Selvaraju et al (2016)

Page 9: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

9

Evaluation: CoNLL 2010Detection of uncertain language in scientific articles

Page 10: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

10

Evaluation: FCEDetecting grammatical errors in essays written by language learners.

Page 11: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

11

Examples

Page 12: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

12

Applications

Sequence labeling without data01

Data exploration and feature analysis02

Model visualisation and interpretation03

Page 13: Zero-shot Sequence Labeling: Transferring ... - Marek Rei · Zero-shot sequence labeling 3 I like to playing the guitar and sing very louder . It was so long time to wait in the theatre

13

Thank you! Any questions?