Post on 01-Oct-2020

1

Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

Marek Rei, Anders Søgaard

2

Sequence labeling

Hedge cue detection:

_ _ C C _ _ _ _ _ _
Our data indicate that increased NF-kappa B DNA binding is ...

Error detection:

+ + + - + + + + + - +
I like to playing the guitar and sing very louder .

3

Zero-shot sequence labeling

+ + + - + + + + + - +

I like to playing the guitar and sing very louder .

It was so long time to wait in the theatre .

I look forward to receiving reply to my enquiry .

This is a great opportunity to learn more about whales .

Therefore, houses will be built on high supports .

4

Main idea

01 Neural sentence classification model

02 Based on self-attention

03 Make attention weights behave like sequence labeling output

5

Model architecture

6

Soft attention weights

Based on softmax:

Based on sigmoid + normalisation:
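The formulas for this slide did not survive in the transcript. As a rough illustration only, the two variants named on the slide could be sketched as follows, assuming each token already has a raw scalar score. The function names and the example scores are hypothetical and not taken from the slides.

```python
import math

def softmax_attention(scores):
    """Softmax variant: exponentiate the raw scores and normalise to sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid_attention(scores):
    """Sigmoid variant: squash each score independently into (0, 1),
    then normalise the squashed values to sum to 1.
    Returns (unnormalised, normalised) weights."""
    unnorm = [1.0 / (1.0 + math.exp(-s)) for s in scores]
    total = sum(unnorm)
    return unnorm, [u / total for u in unnorm]

# Hypothetical raw scores for a four-token sentence.
scores = [-4.0, 3.0, -4.0, 3.5]
soft = softmax_attention(scores)
unnorm, norm = sigmoid_attention(scores)
print(soft)
print(unnorm)
print(norm)
```

In the sigmoid variant each unnormalised weight lies in (0, 1) independently of the other tokens, so it can be read as a per-token score, which is what makes it a candidate for sequence labeling output.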

7

Optimising the attention

We can constrain the attention values based on the sentence-level label.

1. Only some, but not all, tokens in the sentence can have a positive label.

2. There are positive tokens in a sentence only if the overall sentence is positive.
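The two constraints above can be turned into auxiliary loss terms. The sketch below is one possible formulation, assuming squared penalties on the smallest and largest unnormalised attention value in a sentence; the slide states the constraints only in words, so the exact form, the function name, and the example values are assumptions.

```python
def attention_constraint_losses(unnorm_attention, sentence_label):
    """Return (min_loss, max_loss) for one sentence.

    unnorm_attention: per-token attention values in [0, 1] (before normalisation)
    sentence_label:   0.0 for a negative sentence, 1.0 for a positive one
    """
    # Constraint 1: not all tokens are positive, so the smallest
    # attention value in the sentence is pushed towards 0.
    min_loss = (min(unnorm_attention) - 0.0) ** 2
    # Constraint 2: positive tokens exist only in positive sentences, so the
    # largest attention value is pushed towards the sentence-level label.
    max_loss = (max(unnorm_attention) - sentence_label) ** 2
    return min_loss, max_loss

# Hypothetical attention values for a four-token sentence.
attention = [0.02, 0.95, 0.02, 0.97]
pos_min, pos_max = attention_constraint_losses(attention, 1.0)
neg_min, neg_max = attention_constraint_losses(attention, 0.0)
print(pos_min, pos_max)  # both small: the pattern fits a positive sentence
print(neg_min, neg_max)  # max_loss is large: a negative sentence should have no high values
```

In training, terms like these would be added to the sentence classification loss with some weighting, so that only sentence-level labels are needed.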

8

Alternative methods

1. Labeling through backpropagation (Selvaraju et al., 2016)

2. Relative frequency

3. Supervised sequence labeling
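The slide does not spell out how the relative frequency baseline is computed. One plausible reading, shown here purely as an assumption, scores each token by the fraction of its occurrences that fall in positive sentences. The function name and the toy data are made up for illustration.

```python
from collections import Counter

def relative_frequency_scores(sentences, labels):
    """Score each token by (occurrences in positive sentences) / (all occurrences)."""
    pos_counts, all_counts = Counter(), Counter()
    for tokens, label in zip(sentences, labels):
        for tok in tokens:
            all_counts[tok] += 1
            if label == 1:
                pos_counts[tok] += 1
    return {tok: pos_counts[tok] / all_counts[tok] for tok in all_counts}

# Toy sentence-level data (1 = sentence contains a positive token somewhere).
sentences = [["this", "may", "work"], ["this", "works"], ["it", "may", "fail"]]
labels = [1, 0, 1]
rf_scores = relative_frequency_scores(sentences, labels)
print(rf_scores)
```

Tokens seen only in positive sentences get a score of 1, tokens seen only in negative sentences get 0, and a threshold on the score would then yield token-level labels.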

9

Evaluation: CoNLL 2010

Detection of uncertain language in scientific articles

10

Evaluation: FCE

Detecting grammatical errors in essays written by language learners.

11

Examples

12

Applications

01 Sequence labeling without data

02 Data exploration and feature analysis

03 Model visualisation and interpretation

13

Thank you! Any questions?