
Transition Based Dependency Parsing with Deep Learning

Omer Kırnap

Koc University

okirnap@ku.edu.tr

September 27, 2018

Omer Kırnap (Koc University) MSc Thesis September 27 2018 1 123

Overview

1. Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing
2. Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models
3. Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser
4. Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning
5. Conclusion
6. Future Work & Discussions


1 Introduction


Introduction

What is dependency parsing?

Dependency parsing aims to detect word relations by finding the tree structure of a sentence, inspired by dependency grammar.

Figure: Dependency annotations for the sentence "Economic news had little effect on financial markets."

1. Figure from S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Why do we need dependency parsing?

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2. Figure from http://www.phontron.com/slides/nlp-programming-en-11-depend.pdf

Introduction

Dependency Parsing Categorization

Grammar-based: relying on a formal grammar defining a formal language, and asking whether a given input sentence is in the language defined by the grammar.

Data-driven: making essential use of machine learning from linguistic data in order to parse new sentences.

3. From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack-based algorithms to build the dependency tree with incremental steps in linear time.

4. From S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.


Introduction

Transition Based Dependency Parsing

Transition System: an abstract machine with a set of configurations (states) and transitions. We use the ArcHybrid transition system [Kuhlmann et al., 2011].

Configurations (σ, β, A):
- σ: stack of tree fragments, initially empty
- β: buffer of words, initially containing the whole sentence
- A: set of dependency arcs (head, relation, modifier), initially empty

Transitions:
- shift(σ, b|β, A) = (σ|b, β, A)
- left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
- right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

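The three ArcHybrid transitions above can be sketched directly in code. This is an illustrative pure-Python rendering; the names `Config`, `shift`, `left`, and `right` are mine, not from the thesis implementation.

```python
from collections import namedtuple

# A configuration (σ, β, A): stack, buffer, and arc set (head, relation, dependent)
Config = namedtuple("Config", "stack buffer arcs")

def shift(c):
    # shift(σ, b|β, A) = (σ|b, β, A): move the buffer front onto the stack
    return Config(c.stack + [c.buffer[0]], c.buffer[1:], c.arcs)

def left(c, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): buffer front heads the stack top
    return Config(c.stack[:-1], c.buffer, c.arcs | {(c.buffer[0], d, c.stack[-1])})

def right(c, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): second stack item heads the top
    return Config(c.stack[:-1], c.buffer, c.arcs | {(c.stack[-2], d, c.stack[-1])})

# Parse "Economic news had": Economic <-ATT- news <-SBJ- had
c = Config([], ["Economic", "news", "had"], set())
c = left(shift(c), "ATT")   # "news" becomes the head of "Economic"
c = left(shift(c), "SBJ")   # "had" becomes the head of "news"
c = shift(c)                # "had" remains on the stack as the root
```

Note how each transition is a pure function from configuration to configuration, which is what makes the greedy, linear-time parsing loop possible.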

An example parsing of a sentence


Problem Definition

Find a model that learns to decide the correct transition from the current state.


2 Related Work


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).


Related Work

Solution: use dense embeddings for input features.

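The contrast with one-hot inputs is easy to see in code: a dense lookup costs O(dim) per feature regardless of vocabulary size. The vocabulary, dimension, and table below are toy assumptions for illustration, not values from the thesis.

```python
import random

random.seed(0)

# Hypothetical toy vocabulary and embedding dimension (illustrative only)
vocab = ["economic", "news", "had", "little", "effect"]
dim = 5
table = {w: [random.uniform(-0.1, 0.1) for _ in range(dim)] for w in vocab}

def embed(word):
    # Dense lookup: cost depends only on dim, not on |vocab|,
    # unlike a one-hot input layer whose weights scale with vocabulary size.
    return table[word]

# A parser-state feature vector: concatenate a few feature embeddings
features = embed("news") + embed("had")
```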

Overview

1. Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing
2. Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models
3. Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser
4. Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning
5. Conclusion
6. Future Work & Discussions


3 Model


Model Overview

Two shared tasks on Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
- Koc-University team, with the MLP Parser using context embeddings

CoNLL18
- KParse team, with the Tree-stack LSTM Parser using context and morph-feat embeddings


a Language Model


Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

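The two LM components can be sketched in pure Python. This is a toy rendering under stated assumptions: hidden size, initialization, and the LSTM cell details below are mine, not the thesis values; only the structure (character LSTM → word vector, word BiLSTM → context vector) follows the slides.

```python
import math
import random

random.seed(0)
H = 4  # toy hidden size (an assumption, not the thesis value)

def rand_mat(rows, cols):
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def lstm_step(x, h, c, W):
    # One LSTM step; W maps [x; h] to the 4 gate pre-activations (i, f, o, g)
    z = [sum(w * v for w, v in zip(row, x + h)) for row in W]
    sig = lambda v: 1.0 / (1.0 + math.exp(-v))
    i, f, o, g = z[:H], z[H:2 * H], z[2 * H:3 * H], z[3 * H:]
    c = [sig(fj) * cj + sig(ij) * math.tanh(gj) for ij, fj, gj, cj in zip(i, f, g, c)]
    h = [sig(oj) * math.tanh(cj) for oj, cj in zip(o, c)]
    return h, c

def run_lstm(xs, W):
    h, c, hs = [0.0] * H, [0.0] * H, []
    for x in xs:
        h, c = lstm_step(x, h, c, W)
        hs.append(h)
    return hs

CHARS = {ch: [random.uniform(-0.5, 0.5) for _ in range(H)]
         for ch in "abcdefghijklmnopqrstuvwxyz"}
W_char = rand_mat(4 * H, 2 * H)                          # character LSTM
W_fwd, W_bwd = rand_mat(4 * H, 2 * H), rand_mat(4 * H, 2 * H)  # word BiLSTM

def word_vector(word):
    # Character-based LSTM: the final hidden state is the word vector
    return run_lstm([CHARS[ch] for ch in word], W_char)[-1]

def context_vectors(sentence):
    # Word-based BiLSTM: concatenated forward/backward states are context vectors
    vs = [word_vector(w) for w in sentence]
    fwd = run_lstm(vs, W_fwd)
    bwd = run_lstm(vs[::-1], W_bwd)[::-1]
    return [f + b for f, b in zip(fwd, bwd)]
```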

Language Model - Word vectors

Character-based LSTM generates word vectors.

Figure: Character LSTM, from Kırnap et al., 2017.

Language Model - Context Vectors

Word-based BiLSTM generates context vectors.

Figure: Word BiLSTM, from Kırnap et al., 2017.

b MLP Parser (CoNLL17)


MLP Parser

The MLP Parser consists of 4 components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al., 2017.

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

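The decision module can be sketched as a one-hidden-layer MLP that scores the three transitions and greedily picks the best legal one. The tiny hand-picked weights below are purely illustrative assumptions, not learned parameters from the thesis.

```python
import math

TRANSITIONS = ["shift", "left", "right"]

def mlp_scores(x, W1, b1, W2, b2):
    # One hidden tanh layer followed by a linear output layer
    h = [math.tanh(sum(w * v for w, v in zip(row, x)) + bi) for row, bi in zip(W1, b1)]
    return [sum(w * v for w, v in zip(row, h)) + bi for row, bi in zip(W2, b2)]

def decide(x, W1, b1, W2, b2, valid):
    # Greedy choice among the transitions legal in the current configuration
    scores = mlp_scores(x, W1, b1, W2, b2)
    return max(valid, key=lambda t: scores[TRANSITIONS.index(t)])

# Tiny hand-picked weights, purely for illustration
W1, b1 = [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]
W2, b2 = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]], [0.0, 0.0, 0.0]
choice = decide([0.5, -0.5], W1, b1, W2, b2, valid={"shift", "left", "right"})
```

Masking out illegal transitions (the `valid` set) matters in practice: for example, `left` and `right` are impossible on an empty stack.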

Experiments & Dataset (MLP), CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
- 17 universal part-of-speech tags
- 37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example ("Economic news had"):
- Gold tree (ATT, SBJ): LAS = 1
- Pred 1 (PRED, OBJ): LAS = 0
- Pred 2 (ATT, OBJ): LAS = (1/2) · 100 = 50

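The metric is straightforward to compute. A minimal sketch, using the slide's example; representing a tree as a word → (head, label) mapping is my simplification.

```python
def las(gold, pred):
    # Labeled Attachment Score: percentage of words assigned both the
    # correct head and the correct dependency label.
    correct = sum(1 for w, arc in gold.items() if pred.get(w) == arc)
    return 100.0 * correct / len(gold)

gold  = {"Economic": ("news", "ATT"),  "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "PRED"), "news": ("had", "OBJ")}  # both arcs wrong
pred2 = {"Economic": ("news", "ATT"),  "news": ("had", "OBJ")}  # one of two right
```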

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers.5

5. Source: CoNLL17 official results page.

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context and Word Embeddings

Context vectors provide an independent contribution on top of POS tags.

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors.

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors.

Issues with MLP

However:
- Choosing the correct state representation of the parser remains critical.
- We are unable to represent the whole parsing history with feature extraction.

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.


c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM; the head word's embedding is modified with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al., 2013].

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (β-LSTM and σ-LSTM over the buffer and stack, an Action-LSTM, and a t-RNN; their outputs are concatenated and fed to an MLP).

We propose the Tree-stack LSTM model with 4 components:
- β-LSTM
- σ-LSTM
- Action-LSTM
- Tree-RNN

Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:
- Character-based LSTM's word vectors
- Word-based BiLSTM's context vectors
- Part-of-speech (POS) vectors
- Morph-feat vectors

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

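A UD-style feature string like the example above can be turned into one vector by splitting it into `Feature=Value` pairs and combining their embeddings. A minimal sketch: the dimension is a toy assumption, and summing the per-pair embeddings is my assumption about the composition, which the slides do not fix.

```python
import random

random.seed(0)
DIM = 4      # toy embedding size (an assumption, not the thesis value)
TABLE = {}   # one embedding per "Feature=Value" pair, created lazily

def morph_feat_vector(feats):
    # Split "Case=Nom|Gender=Neut|..." into feature=value pairs and sum
    # their embeddings into a single morph-feat vector.
    total = [0.0] * DIM
    for kv in feats.split("|"):
        if kv not in TABLE:
            TABLE[kv] = [random.uniform(-0.1, 0.1) for _ in range(DIM)]
        total = [t + e for t, e in zip(total, TABLE[kv])]
    return total

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```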

Tree-stack LSTM

Model components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM


β-LSTM

Figure: Buffer's β-LSTM over words wi, wi+1, wi+2.

σ-LSTM


σ-LSTM

Figure: Stack's σ-LSTM over si, si+1, si+2.

Action-LSTM


Action-LSTM

Figure: Action-LSTM.

How are the components of the tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependency relation, and dependent word.

whead_new = tanh(Wrnn · [whead_old; dl; wdep] + brnn)   (1)
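Equation (1) is a single tanh layer over the concatenation of the old head embedding, the relation embedding, and the dependent embedding. A minimal sketch; the dimension and random weights are assumptions for illustration.

```python
import math
import random

random.seed(0)
D = 4  # toy embedding size; real dimensions are not taken from the thesis

W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(w_head_old, d_l, w_dep):
    # Eq. (1): w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)
    x = w_head_old + d_l + w_dep
    return [math.tanh(sum(w * v for w, v in zip(row, x)) + bi)
            for row, bi in zip(W_rnn, b_rnn)]

new_head = t_rnn([0.1] * D, [0.2] * D, [0.3] * D)
```

Because the output has the same dimension as a word embedding, the new head vector can be fed straight back into the σ-LSTM or β-LSTM, which is what connects the t-RNN to the other components.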

Tree-RNN with:
1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition.

Final overview of Tree-stack LSTM

Figure: Final overview of the Tree-stack LSTM (β-LSTM, σ-LSTM, Action-LSTM, and t-RNN outputs concatenated and fed to an MLP).

Overview

1. Introduction
   - Overview of Dependency Parsing
   - Transition Based Dependency Parsing
2. Related Work
   - Linear Models and their Drawbacks
   - Neural Network Models
3. Model
   - Language Model
   - MLP Parser
   - Tree-stack LSTM Parser
4. Results
   - MLP vs Tree-stack LSTM
   - Morphological Feature Embeddings
   - Static vs Dynamic Oracle Training
   - Transfer Learning
5. Conclusion
6. Future Work & Discussions


4 Results & Comparisons


Results amp Comparisons

Dataset

CoNLL17
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. train/test split change; 2. annotation.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:
1. If the annotation of the treebank has improved, the older parser is handicapped.
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP).

Only Action LSTM

Figure: Only action LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does the morphological feature embedding provide?

Contribution of Morph-feat Embeddings

Experimental settings: we divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:
- Languages having less than 20k tokens
- Languages having more than 20k, less than 50k tokens
- Languages having more than 50k, less than 100k tokens
- Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.70        75.80           75134
gl ctg         79.02        79.02           79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.68           204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.87           417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves during training.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

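The two training regimes differ only in which move the parser *follows* after computing the loss. The sketch below is a deliberately simplified illustration: `model_predict` is a random stand-in for the parser's scorer, and it reuses the gold move as supervision, whereas a real dynamic oracle recomputes the cost-optimal move from the current, possibly-wrong configuration.

```python
import random

random.seed(0)
gold = ["shift", "shift", "left", "shift", "right"]  # hypothetical gold moves

def model_predict(step):
    # Stand-in for the parser's scorer (random here; a real model scores transitions)
    return random.choice(["shift", "left", "right"])

def training_pass(dynamic):
    followed, supervised = [], []
    for step, gold_move in enumerate(gold):
        supervised.append(gold_move)  # log p of the gold move is maximized either way
        # Static: follow the gold move. Dynamic: follow the model's prediction,
        # so training visits the states the parser will actually reach at test time.
        followed.append(model_predict(step) if dynamic else gold_move)
    return followed, supervised

static_path, _ = training_pass(dynamic=False)
dynamic_path, _ = training_pass(dynamic=True)
```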

Static vs Dynamic Oracle Training

Figure: Results are very close for training sizes of less than 20k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sizes between 20k and 50k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sizes of more than 50k tokens.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

Transition based parsers can only build projective trees.6

6. Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language     Projectivity %  Best (LAS)  Our (LAS)
grc perseus  90.70           79.39       55.03 (20)
eu bdt       95.13           84.22       74.13 (17)
hu szeged    97.80           82.66       68.18 (14)
da ddt       98.26           86.28       76.40 (17)
en gum       99.60           85.05       76.44 (15)
gl treegal   100             74.25       70.45 (10)
gl ctg       100             82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7. From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 2 123

1 Introduction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 3 123

Introduction

What is dependency parsing

Dependency parsing aims to detect word relations by finding the treestructure of a sentence inspired by dependency grammar

Figure Dependency annotations for a sentence ldquo Economic news had little effecton financial marketsrdquo

1

1Figure from S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 4 123

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System: An abstract machine with a set of configurations (states) and transitions. We use the ArcHybrid transition system [Kuhlmann et al., 2011].

Configurations (σ, β, A):
• σ: stack of tree fragments, initially empty
• β: buffer of words, initially containing the whole sentence
• A: set of dependency arcs (head, relation, modifier), initially empty

Transitions:
• shift(σ, b|β, A) = (σ|b, β, A)
• left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
• right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
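The three transitions above can be sketched directly in code. A minimal Python sketch of the ArcHybrid system follows; the `Config` class and the label strings are illustrative, not the thesis implementation:

```python
from dataclasses import dataclass, field

@dataclass
class Config:
    """Arc-hybrid configuration (σ, β, A)."""
    stack: list                              # σ: top of stack is the last element
    buffer: list                             # β: front of buffer is the first element
    arcs: set = field(default_factory=set)   # A: (head, relation, modifier) triples

def shift(c):
    """shift(σ, b|β, A) = (σ|b, β, A): move the front of the buffer onto the stack."""
    c.stack.append(c.buffer.pop(0))

def left(c, d):
    """left_d(σ|s, b|β, A): pop s, attach it to the front of the buffer with label d."""
    s = c.stack.pop()
    c.arcs.add((c.buffer[0], d, s))

def right(c, d):
    """right_d(σ|s|t, β, A): pop t, attach it to the next stack element s with label d."""
    t = c.stack.pop()
    c.arcs.add((c.stack[-1], d, t))
```

For the buffer [1, 2, 3], the sequence shift, left("nsubj"), shift, shift, right("obj") leaves word 2 on the stack with arcs {(2, "nsubj", 1), (2, "obj", 3)}.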


An example parsing of a sentence

[Figures: step-by-step arc-hybrid parse of an example sentence]

Problem Definition

Find a model that learns to decide the correct transition from the current state

2 Related Work


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Related Work

Solution: using dense embeddings for input features

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

3 Model


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings


a Language Model


Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
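As a sketch of how the two components fit together, here is a minimal NumPy LSTM cell with random weights and tiny illustrative dimensions; the real model's sizes, training, and details live in the thesis, not here:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def make_lstm(in_dim, hidden):
    """Random LSTM parameters: W maps [x; h] to the 4 gate pre-activations."""
    return rng.normal(scale=0.1, size=(4 * hidden, in_dim + hidden)), np.zeros(4 * hidden)

def lstm_run(xs, W, b, hidden):
    """Run an LSTM over a sequence, returning the hidden state at every step."""
    h, c, hs = np.zeros(hidden), np.zeros(hidden), []
    for x in xs:
        z = W @ np.concatenate([x, h]) + b
        i, f = sigmoid(z[:hidden]), sigmoid(z[hidden:2 * hidden])
        o, g = sigmoid(z[2 * hidden:3 * hidden]), np.tanh(z[3 * hidden:])
        c = f * c + i * g
        h = o * np.tanh(c)
        hs.append(h)
    return hs

CDIM, WDIM = 8, 16                       # character / word vector sizes (illustrative)
char_W, char_b = make_lstm(CDIM, WDIM)
fwd_W, fwd_b = make_lstm(WDIM, WDIM)
bwd_W, bwd_b = make_lstm(WDIM, WDIM)
char_emb = {}

def word_vector(word):
    """Character based LSTM: the final hidden state over the word's characters."""
    chars = [char_emb.setdefault(ch, rng.normal(size=CDIM)) for ch in word]
    return lstm_run(chars, char_W, char_b, WDIM)[-1]

def context_vectors(sentence):
    """Word based BiLSTM: concatenation of forward and backward states per word."""
    ws = [word_vector(w) for w in sentence]
    fwd = lstm_run(ws, fwd_W, fwd_b, WDIM)
    bwd = lstm_run(ws[::-1], bwd_W, bwd_b, WDIM)[::-1]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]
```

Each word thus gets a character-derived word vector and, through the BiLSTM, a context vector that depends on the whole sentence.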


Language Model - Word vectors

Character based LSTM generates word Vectors

Figure: Character LSTM, from Kırnap et al. 2017

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure: Word BiLSTM, from Kırnap et al. 2017

b MLP Parser (CoNLL17)


MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition


MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

MLP Parser - Decision Module

Decision module (MLP) decides the next transition
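A minimal sketch of such a decision module, with random weights and made-up dimensions (the real parser's sizes and feature set are in the thesis); the argmax is restricted to the moves that are valid in the current configuration:

```python
import numpy as np

rng = np.random.default_rng(2)
F, H = 12, 16                             # feature / hidden sizes (illustrative)
MOVES = ["shift", "left", "right"]
W1, b1 = rng.normal(scale=0.1, size=(H, F)), np.zeros(H)
W2, b2 = rng.normal(scale=0.1, size=(len(MOVES), H)), np.zeros(len(MOVES))

def next_transition(features, valid):
    """One hidden ReLU layer over the extracted state features,
    then argmax over the currently valid transitions."""
    h = np.maximum(0.0, W1 @ features + b1)
    scores = W2 @ h + b2
    return max((m for m in MOVES if valid[m]),
               key=lambda m: scores[MOVES.index(m)])
```

If only shift is valid (e.g. the stack is empty), the module must return shift regardless of the scores.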


Experiments & Dataset (MLP), CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label

Gold tree (LAS = 1): had -SBJ-> news -ATT-> Economic

Pred 1 (LAS = 0): had -PRED-> news -OBJ-> Economic

Pred 2 (LAS = (1/2)·100 = 50): had -OBJ-> news -ATT-> Economic
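The metric itself is a one-liner. A sketch, where `gold` and `pred` map each word to its (head, label) pair:

```python
def las(gold, pred):
    """Labeled Attachment Score: fraction of words whose predicted
    head AND dependency label both match the gold tree."""
    correct = sum(pred[w] == gold[w] for w in gold)
    return correct / len(gold)

# The example above: gold tree vs. Pred 2 (one of two words fully correct).
gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}
```

Here `las(gold, pred2)` gives 0.5, matching Pred 2 on the slide.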


Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v) and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2


Context vectors provide an independent contribution on top of POS tags


Our BiLSTM language model word vectors perform better than FB vectors


Both POS tags and context vectors have significant contributions on top of word vectors

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

[Figure: Tree-stack LSTM — the σ-, β- and Action-LSTM states and the t-RNN head representations are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors


Input Representation

Morph-feat Vectors

Example (word "It"): Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure: Morph-feat Embeddings
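One plausible way to embed such a FEATS string is to split it on "|" and pool per-feature embeddings; the summation pooling and the dimension here are assumptions for illustration, not the thesis recipe:

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8
feat_emb = {}            # one embedding per "Key=Value" feature, created lazily

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as 'Case=Nom|Number=Sing'.
    '_' marks a word without morphological features."""
    v = np.zeros(DIM)
    if feats == "_":
        return v
    for f in feats.split("|"):
        if f not in feat_emb:
            feat_emb[f] = rng.normal(scale=0.1, size=DIM)
        v += feat_emb[f]
    return v
```

This yields a fixed-size vector regardless of how many features a word carries, which is what the concatenated input representation needs.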


Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM

[Figure: Tree-stack LSTM architecture]

β-LSTM

Figure: Buffer's β-LSTM over the word representations wi, wi+1, wi+2

σ-LSTM

[Figure: Tree-stack LSTM architecture]

σ-LSTM

Figure: Stack's σ-LSTM over the stack elements si, si+1, si+2

Action-LSTM

[Figure: Tree-stack LSTM architecture]

Action-LSTM

Figure: Action-LSTM

How are the components of tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combining the dependent word, dependency relation and head word embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
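Equation (1) in code, with random weights and a tiny illustrative dimension (the actual sizes are a model hyperparameter):

```python
import numpy as np

D = 4                                   # shared embedding size (illustrative)
rng = np.random.default_rng(1)
W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))
b_rnn = np.zeros(D)

def t_rnn(w_head, d_rel, w_dep):
    """Eq. (1): new head embedding from the old head embedding,
    the dependency-relation embedding and the dependent embedding."""
    return np.tanh(W_rnn @ np.concatenate([w_head, d_rel, w_dep]) + b_rnn)
```

Because of the tanh, the composed head embedding stays bounded no matter how many dependents are folded in.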


Tree-RNN with

1. Left Transition
2. Right Transition

Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture]

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP)

Only Action LSTM

Figure: Only action LSTM

Only β-LSTM

Figure: Only β-LSTM

Only σ-LSTM

Figure: Only σ-LSTM

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture]

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more
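The grouping above amounts to a simple threshold function; a trivial sketch (the exact boundary handling for treebanks right at a threshold is an assumption):

```python
def size_group(n_tokens):
    """Assign a treebank to one of the four experimental groups
    by its number of training tokens."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```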


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.6             18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48,325
fr sequoia      84.36         82.17            50,543
en gum          76.44         75.34            53,686
ko gsd          73.74         72.54            56,687
eu bdt          74.55         73.32            72,974
nl lassysmall   76.7          75.8             75,134
gl ctg          79.02         79.018           79,327
lv lvtb         72.33         72.24            80,666
id gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121,064
bg btb      84.53         84.55            124,336
en ewt      75.77         75.682           204,585
ar padt     68.02         68.14            223,881
de gsd      71.59         71.32            263,804
ca ancora   85.89         85.874           417,587
es ancora   84.99         84.78            444,617
cs cac      83.57         83.63            472,608
cs pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases the log-probability of the gold moves is maximized
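A sketch of the difference; the exploration probability is an assumption, and in both regimes the training loss still maximizes the log-probability of the gold move:

```python
import random

def followed_transitions(gold_moves, model_moves, dynamic, explore=0.1, seed=0):
    """Which move the parser actually executes at each training step.
    Static oracle: always the gold move. Dynamic oracle: with some
    probability, the model's own (possibly wrong) prediction, so that
    training states look more like the states seen at test time."""
    rnd = random.Random(seed)
    out = []
    for gold, pred in zip(gold_moves, model_moves):
        if dynamic and rnd.random() < explore:
            out.append(pred)     # explore the model's prediction
        else:
            out.append(gold)     # follow the gold move
    return out
```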

[Figure: Tree-stack LSTM architecture]

Static vs Dynamic Oracle Training

Figure: Results are very close for fewer than 20k training tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for between 20k and 50k training tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for more than 50k training tokens

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Projectivity

Transition based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf
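Whether a tree is projective can be checked by testing every pair of arcs for crossing; a sketch, where heads are 1-based word positions and 0 is the artificial root:

```python
def is_projective(heads):
    """heads[i] is the head position of word i+1 (0 = root).
    The tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for (c, d) in arcs[i + 1:]:
            if a < c < b < d or c < a < d < b:   # strictly interleaved = crossing
                return False
    return True
```

The quadratic pair check is fine for sentence-length inputs; linear-time checks exist but are not needed here.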


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion: we introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

1 Introduction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 3 123

Introduction

What is dependency parsing

Dependency parsing aims to detect word relations by finding the treestructure of a sentence inspired by dependency grammar

Figure Dependency annotations for a sentence ldquo Economic news had little effecton financial marketsrdquo

1

1Figure from S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 4 123

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings with two components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

A character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

A word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Gold tree: Economic news had, with arcs ATT (Economic ← news) and SBJ (news ← had): LAS 1

Pred 1: arcs OBJ and PRED: LAS 0

Pred 2: arcs ATT and OBJ: LAS (1/2) × 100 = 50
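The LAS computation above can be written as a small function. This is an illustrative sketch (the gold and predicted trees are represented as word → (head, label) maps), not the official evaluation script.

```python
# Labeled Attachment Score: fraction of words whose predicted head AND
# dependency label both match the gold tree (illustrative sketch).

def las(gold, pred):
    # gold, pred: dicts mapping each word to its (head, label) pair
    correct = sum(1 for w in gold if pred.get(w) == gold[w])
    return 100.0 * correct / len(gold)

gold = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}
print(las(gold, pred))  # 50.0  (one of two words has both head and label right)
```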

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors make significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

the character-based LSTM's word vectors

the word-based BiLSTM's context vectors

part-of-speech (POS) vectors

morph-feat vectors
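The four-way concatenation above can be sketched as follows; the dimensions here are illustrative assumptions, not the thesis hyperparameters.

```python
import numpy as np

# Sketch of the parser's input word representation: four component
# vectors are simply concatenated (all sizes below are assumptions).

word_vec    = np.zeros(350)   # from the character-based LSTM
context_vec = np.zeros(300)   # from the word-based BiLSTM
pos_vec     = np.zeros(128)   # learned POS-tag embedding
morph_vec   = np.zeros(128)   # learned morph-feat embedding

x = np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
print(x.shape)  # (906,)
```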

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
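One way a UD feature string like the one in the figure could be mapped to a single morph-feat embedding is to look up a learned vector per Feature=Value pair and combine them. This is an illustrative sketch: the averaging scheme, dimension, and lazily grown table are assumptions, not necessarily the thesis model.

```python
import numpy as np

feat_dim = 64
rng = np.random.default_rng(0)
feat_table = {}   # lazily grown embedding table: one vector per Feature=Value

def morph_feat_vector(feats):
    """Split 'Case=Nom|Gender=Neut|...' on '|' and average the feature vectors."""
    vecs = []
    for f in feats.split("|"):
        if f not in feat_table:
            feat_table[f] = rng.normal(size=feat_dim)
        vecs.append(feat_table[f])
    return np.mean(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (64,)
```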

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head^new = tanh(W_rnn · [w_head^old; d_l; w_dep] + b_rnn)   (1)
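Equation (1) composes the old head embedding, the dependency-relation embedding d_l, and the dependent embedding into a new head embedding. A direct rendering in code (the dimension and initialization are illustrative assumptions):

```python
import numpy as np

d = 100                                   # embedding size (assumed)
W_rnn = np.random.randn(d, 3 * d) * 0.01  # t-RNN weight matrix
b_rnn = np.zeros(d)

def t_rnn(w_head_old, d_l, w_dep):
    # Concatenate [w_head_old; d_l; w_dep] and apply the affine + tanh of Eq. (1)
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_new = t_rnn(np.ones(d), np.ones(d), np.ones(d))
print(w_new.shape)  # (100,)
```

The tanh keeps the composed head embedding in the same range as the other embeddings, so it can be pushed back onto the stack LSTM unchanged.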

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What do morphological feature embeddings provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.60           18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.70        75.80           75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121064
bg_btb      84.53        84.55           124336
en_ewt      75.77        75.682          204585
ar_padt     68.02        68.14           223881
de_gsd      71.59        71.32           263804
ca_ancora   85.89        85.874          417587
es_ancora   84.99        84.78           444617
cs_cac      83.57        83.63           472608
cs_pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: training transitions follow the gold moves
Dynamic oracle: training transitions follow the predicted moves

In both cases the log-probability of the gold moves is maximized
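The two regimes can be contrasted in a single schematic training loop. The toy state and model classes below are illustrative stand-ins (the real states are ArcHybrid configurations and the model is the tree-stack LSTM); the point is that only the executed move differs, never the scored one.

```python
import random

class ToyState:
    """Stand-in for a parser configuration: just counts remaining steps."""
    def __init__(self, n): self.left = n
    def is_final(self): return self.left == 0

def gold_move(state): return "shift"               # stand-in oracle

class ToyModel:
    def log_prob(self, state, move): return -0.1   # stand-in gold-move score
    def predict(self, state): return random.choice(["shift", "left"])

def apply_move(state, move):
    state.left -= 1                                # stand-in transition
    return state

def train_sentence(model, state, dynamic=False):
    loss = 0.0
    while not state.is_final():
        gold = gold_move(state)
        loss -= model.log_prob(state, gold)        # always maximize log p(gold)
        move = model.predict(state) if dynamic else gold
        state = apply_move(state, move)            # static: follow gold; dynamic: follow prediction
    return loss

print(round(train_sentence(ToyModel(), ToyState(5)), 2))  # 0.5
```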

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for languages with fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for languages with between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for languages with more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on limited data does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language     Projectivity %  Best (LAS)  Ours (LAS)
grc_perseus  90.70           79.39       55.03 (20)
eu_bdt       95.13           84.22       74.13 (17)
hu_szeged    97.80           82.66       68.18 (14)
da_ddt       98.26           86.28       76.40 (17)
en_gum       99.60           85.05       76.44 (15)
gl_treegal   100             74.25       70.45 (10)
gl_ctg       100             82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring performance improvements

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Introduction

What is dependency parsing

Dependency parsing aims to detect word relations by finding the treestructure of a sentence inspired by dependency grammar

Figure Dependency annotations for a sentence ldquo Economic news had little effecton financial marketsrdquo

1

1Figure from S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 4 123

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Contributions in CoNLL17


Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Issues with MLP

However

Choosing the correct state of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview: β-LSTM over the buffer, σ-LSTM over the stack, Action-LSTM over past transitions, and a t-RNN combining head word, dependent word and dependency relation; their outputs are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Input Representation

Morph-feat Vectors

Example: the word "It" with features Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure Morph-feat Embeddings
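One plausible way to turn a Feats string into a single morph-feat vector and assemble the concatenated word representation is sketched below; the per-feature summing, the lazily created random embeddings, and all dimensions are assumptions for illustration, not the thesis specification.

```python
import random

random.seed(2)
MDIM = 8  # morph-feat embedding size (illustrative)
morph_table = {}

def embed(key):
    # lazily create a random embedding per feature (toy initialization)
    if key not in morph_table:
        morph_table[key] = [random.uniform(-0.5, 0.5) for _ in range(MDIM)]
    return morph_table[key]

def morph_feat_vector(feats):
    # e.g. "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs"
    vec = [0.0] * MDIM
    for feat in feats.split("|"):
        vec = [a + b for a, b in zip(vec, embed(feat))]
    return vec

def word_representation(word_vec, context_vec, pos_vec, feats):
    # concatenation of the four inputs listed above
    return word_vec + context_vec + pos_vec + morph_feat_vector(feats)

rep = word_representation([0.1] * 16, [0.2] * 32, [0.3] * 4,
                          "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```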


Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM

[Figure: Tree-stack LSTM overview with the β-LSTM (buffer) component highlighted]

β-LSTM

Figure Buffer's β-LSTM reading the buffer words wi, wi+1, wi+2

σ-LSTM

[Figure: Tree-stack LSTM overview with the σ-LSTM (stack) component highlighted]

σ-LSTM

Figure Stack's σ-LSTM reading the stack items si, si+1, si+2

Action-LSTM

[Figure: Tree-stack LSTM overview with the Action-LSTM component highlighted]

Action-LSTM

Figure Action-LSTM

How are the components of tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

Figure t-RNN combining the dependent word, the dependency relation and the head word

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
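Equation (1) as a runnable sketch (the embedding size D and the weights are illustrative):

```python
import math
import random

random.seed(3)
D = 4  # embedding size (illustrative)

def t_rnn(w_head, d_rel, w_dep, W, b):
    # new head embedding = tanh(W_rnn * [head; relation; dependent] + b_rnn), Eq. (1)
    x = w_head + d_rel + w_dep
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D
new_head = t_rnn([0.1] * D, [0.2] * D, [0.3] * D, W_rnn, b_rnn)
```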


Tree-RNN with

1. Left Transition
2. Right Transition

Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready for the next transition

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready for the next transition
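The left and right transitions, together with shift, can be written as plain operations on the symbolic state (σ, β, A) over word indices; the embeddings and LSTM updates are left out of this sketch.

```python
def shift(stack, buf, arcs):
    # shift(σ, b|β, A) = (σ|b, β, A)
    return stack + [buf[0]], buf[1:], arcs

def left(stack, buf, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): buffer front b heads stack top s
    return stack[:-1], buf, arcs | {(buf[0], d, stack[-1])}

def right(stack, buf, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): s heads the popped top t
    return stack[:-1], buf, arcs | {(stack[-2], d, stack[-1])}

# "Economic(1) news(2) had(3)": attach 1 under 2, then 2 under 3
state = ([], [1, 2, 3], set())
state = shift(*state)          # σ=[1], β=[2, 3]
state = left(*state, "ATT")    # adds (2, ATT, 1)
state = shift(*state)          # σ=[2], β=[3]
state = left(*state, "SBJ")    # adds (3, SBJ, 2)
stack, buf, arcs = state
```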


Final overview of Tree-stack LSTM

[Figure: final overview of Tree-stack LSTM: the β-LSTM, σ-LSTM, Action-LSTM and t-RNN outputs are concatenated and fed to an MLP]

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure Initial model (MLP)

Only Action LSTM

Figure Only action LSTM

Only β-LSTM

Figure Only β-LSTM

Only σ-LSTM

Figure Only σ-LSTM

Ablation Analysis Results

Lang Code   MLP     Only Action  Only-β  Only-σ
hu szeged   66.21   66.87        66.94   67.03
sv lines    71.12   72.05        72.17   72.45
tr imst     57.12   56.87        57.02   57.12
ar padt     67.83   66.67        66.89   66.92
cs cac      83.89   82.23        83.13   83.17
en ewt      75.54   75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Ablation of t-RNN

[Figure: Tree-stack LSTM overview with the t-RNN component highlighted]

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21   66.87   66.94   67.03   66.12      68.18
sv lines    71.12   72.05   72.17   72.45   74.04      75.46
tr imst     57.12   56.87   57.02   57.12   58.12      58.75
ar padt     67.83   66.67   66.89   66.92   68.04      68.14
cs cac      83.89   82.23   83.13   83.17   82.89      83.57
en ewt      75.54   75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

What does Morphological Feature Embedding provide


Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more
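The split can be expressed as a small bucketing function (the bucket names and exact boundary handling are my own, for illustration):

```python
def size_bucket(n_tokens):
    # the four groups used in the experiments, by training-token count
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

buckets = {lang: size_bucket(n) for lang, n in
           [("no nynorsklia", 3583), ("sv lines", 48325),
            ("eu bdt", 72974), ("cs pdt", 1173282)]}
```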


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia   51.13        53.33           3583
ru taiga        58.32        60.55           10479
sme giella      52.78        53.39           16385
la perseus      49.93        51.6            18184
ug udt          52.78        53.39           19262
sl sst          46.72        48.77           19473
hu szeged       66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases the log-probability of the gold moves is maximized
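The two regimes can be contrasted with a toy sketch. The scorer, state update and parameters below are stand-ins for the parser network; the loops only show the one real difference, namely which move advances the state, while the loss stays the negative log-probability of the gold move.

```python
import math
import random

random.seed(4)
MOVES = ["shift", "left", "right"]

def log_probs(state, params):
    # toy scorer: softmax over made-up per-state scores (stand-in for the parser)
    scores = [params[(m, state % 3)] for m in MOVES]
    z = math.log(sum(math.exp(s) for s in scores))
    return {m: s - z for m, s in zip(MOVES, scores)}

def oracle_loss(gold_moves, params, dynamic):
    # accumulate -log p(gold move); advance by the gold move (static oracle)
    # or by the model's predicted move (dynamic oracle)
    loss, state = 0.0, 0
    for gold in gold_moves:
        lp = log_probs(state, params)
        loss -= lp[gold]
        taken = max(lp, key=lp.get) if dynamic else gold
        state += 1 if taken == "shift" else 2  # toy state update
    return loss

params = {(m, k): random.uniform(-1, 1) for m in MOVES for k in range(3)}
gold_moves = ["shift", "shift", "left", "shift", "right"]
static_loss = oracle_loss(gold_moves, params, dynamic=False)
dynamic_loss = oracle_loss(gold_moves, params, dynamic=True)
```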

[Figure: Tree-stack LSTM architecture]

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k


Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k


How about languages with less than 20k training tokens


Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1), (2), (3) and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
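Projectivity can be checked by testing the arcs for crossings; a small sketch over head indices (words numbered from 1, head 0 denoting the root):

```python
def is_projective(heads):
    # heads[i] is the head of word i + 1 (0 denotes an artificial root at position 0)
    arcs = [tuple(sorted((h, i + 1))) for i, h in enumerate(heads)]
    # a tree is non-projective iff two arcs cross: a < c < b < d
    return not any(a < c < b < d for a, b in arcs for c, d in arcs)
```

For example, heads [2, 3, 0] ("Economic news had" rooted at "had") is projective, while a four-word sentence whose arcs span 1-3 and 2-4 is not.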


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7          79.39       55.03 (20)
eu bdt        95.13         84.22       74.13 (17)
hu szeged     97.8          82.66       68.18 (14)
da ddt        98.26         86.28       76.40 (17)
en gum        99.6          85.05       76.44 (15)
gl treegal    100           74.25       70.45 (10)
gl ctg        100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion: we introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Introduction

Why do we need dependency parsing

Dependencies resolve ambiguity

Useful for some down-stream tasks in NLP

2

2Figure from httpwwwphontroncomslidesnlp-programming-en-11-dependpdfOmer Kırnap (Koc University) MSc Thesis September 27 2018 5 123

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Diagram: Tree-stack LSTM — head word, dependent word and dependency relation feed the t-RNN; the t-RNN, β-LSTM, σ-LSTM and Action-LSTM outputs are concatenated (Concat) and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123
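A minimal numpy sketch of the decision step described by this diagram. Dimensions, weight names and the 3-way transition set are illustrative stand-ins, not the thesis configuration:

```python
import numpy as np

# Illustrative sketch: the β-LSTM, σ-LSTM and Action-LSTM summaries
# are concatenated and scored by an MLP over the transitions.
rng = np.random.default_rng(0)
H = 8                 # hidden size of each component LSTM (stand-in)
N_TRANS = 3           # shift, left, right

h_beta, h_sigma, h_action = (rng.normal(size=H) for _ in range(3))

W1 = rng.normal(size=(16, 3 * H)) * 0.1
b1 = np.zeros(16)
W2 = rng.normal(size=(N_TRANS, 16)) * 0.1
b2 = np.zeros(N_TRANS)

def decide(h_beta, h_sigma, h_action):
    x = np.concatenate([h_beta, h_sigma, h_action])   # the "Concat" box
    hidden = np.tanh(W1 @ x + b1)                     # the "MLP" box
    return W2 @ hidden + b2                           # transition scores

scores = decide(h_beta, h_sigma, h_action)
print(scores.shape)  # (3,)
```

The highest-scoring valid transition would then be applied to the parser state.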

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
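The concatenation itself is simple; a sketch with illustrative dimensions (the real vectors come from trained networks):

```python
import numpy as np

# Illustrative sizes only; the actual vectors are produced by the
# character LSTM, the word BiLSTM, and the POS / morph-feat lookups.
D_WORD, D_CTX, D_POS, D_FEAT = 4, 4, 2, 2

def word_representation(word_vec, context_vec, pos_vec, feat_vec):
    # No hand-built feature templates: the parser input is just the
    # concatenation of the four learned vectors.
    return np.concatenate([word_vec, context_vec, pos_vec, feat_vec])

x = word_representation(np.ones(D_WORD), np.ones(D_CTX),
                        np.zeros(D_POS), np.zeros(D_FEAT))
print(x.shape)  # (12,)
```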

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
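One simple way to embed such a feature string — shown here as a hypothetical sketch, not the exact composition used in the thesis — is to look up a vector per Key=Value pair and sum them:

```python
import numpy as np

# Hypothetical composition: one embedding per "Key=Value" pair, summed.
D = 4
rng = np.random.default_rng(1)
feat_table = {}   # embedding per pair, created on first use

def morph_feat_vector(feat_string):
    vec = np.zeros(D)
    for pair in feat_string.split("|"):
        if pair not in feat_table:
            feat_table[pair] = rng.normal(size=D)
        vec += feat_table[pair]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape, len(feat_table))  # (4,) 5
```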

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM: an LSTM over buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM: an LSTM over stack words s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM: an LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN: combines the head word, dependent word and dependency relation embeddings

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
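Equation (1) in code, with illustrative dimensions and randomly initialized W_rnn and b_rnn:

```python
import numpy as np

# Equation (1) with stand-in sizes and random parameters.
D = 4
rng = np.random.default_rng(2)
W_rnn = rng.normal(size=(D, 3 * D)) * 0.1
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    # New head embedding from the old head embedding, the dependency
    # relation embedding d_l and the dependent embedding.
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

w_new = t_rnn(np.ones(D), np.ones(D), np.ones(D))
print(w_new.shape)  # (4,)
```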

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready to decide the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready to decide the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
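The shift/left/right rules walked through above can be sketched as a tiny arc-hybrid state machine (illustrative Python, not the thesis code; a state is (stack, buffer, arcs), and arcs hold (head, relation, dependent) triples):

```python
# Minimal arc-hybrid state machine matching the rules above.

def shift(stack, buffer, arcs):
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d: top of the stack becomes a dependent of the buffer front.
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs + [(b, d, s)]

def right(stack, buffer, arcs, d):
    # right_d: top of the stack becomes a dependent of the item below it.
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs + [(s, d, t)]

state = ([], ["news", "had"], [])
state = shift(*state)             # stack: [news]  buffer: [had]
state = left(*state, d="nsubj")   # news becomes a dependent of had
stack, buffer, arcs = state
print(arcs)  # [('had', 'nsubj', 'news')]
```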

Final overview of Tree-stack LSTM

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru taiga (10k) | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k) | 56.78 | 58.75
ar padt (120k) | 67.83 | 68.14
en ewt (205k) | 74.87 | 75.77
cs cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure Initial model: MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87 | 66.94 | 67.03
sv lines | 71.12 | 72.05 | 72.17 | 72.45
tr imst | 57.12 | 56.87 | 57.02 | 57.12
ar padt | 67.83 | 66.67 | 66.89 | 66.92
cs cac | 83.89 | 82.23 | 83.13 | 83.17
en ewt | 75.54 | 75.43 | 75.56 | 75.67

Table Comparison between MLP and “Only” models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78 | 53.33
ru taiga (11k) | 59.13 | 60.55
gl treegal (15k) | 69.76 | 70.45
hu szeged (20k) | 66.12 | 68.18
sv lines (49k) | 74.04 | 75.46
tr imst (50k) | 58.12 | 58.75
ar padt (120k) | 68.04 | 68.14
en ewt (204k) | 74.87 | 75.77
cs cac (473k) | 82.89 | 83.57
cs pdt (1M) | 81.17 | 81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no nynorsklia | 51.13 | 53.33 | 3583
ru taiga | 58.32 | 60.55 | 10479
sme giella | 52.78 | 53.39 | 16385
la perseus | 49.93 | 51.6 | 18184
ug udt | 52.78 | 53.39 | 19262
sl sst | 46.72 | 48.77 | 19473
hu szeged | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
sv lines | 72.18 | 74.81 | 48325
fr sequoia | 84.36 | 82.17 | 50543
en gum | 76.44 | 75.34 | 53686
ko gsd | 73.74 | 72.54 | 56687
eu bdt | 74.55 | 73.32 | 72974
nl lassymal | 76.7 | 75.8 | 75134
gl ctg | 79.02 | 79.018 | 79327
lv lvtb | 72.33 | 72.24 | 80666
id gsd | 75.76 | 73.97 | 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa seraji | 81.18 | 81.12 | 121064
bg btb | 84.53 | 84.55 | 124336
en ewt | 75.77 | 75.682 | 204585
ar padt | 68.02 | 68.14 | 223881
de gsd | 71.59 | 71.32 | 263804
ca ancora | 85.89 | 85.874 | 417587
es ancora | 84.99 | 84.78 | 444617
cs cac | 83.57 | 83.63 | 472608
cs pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of the gold moves is maximized

[Diagram: Tree-stack LSTM architecture — t-RNN, β-LSTM, σ-LSTM, Action-LSTM, Concat, MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
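The contrast can be sketched in a toy training step (illustrative only; `oracle`, `model_pick` and `apply_move` are stand-ins, and the real loss is the negative log-probability of the gold move in both regimes — the difference is which states the parser visits while training):

```python
import random

random.seed(0)

# Toy contrast between static and dynamic oracle training.
def train_step(state, oracle, model_pick, apply_move, dynamic):
    gold = oracle(state)                  # -log p(gold|state) computed here
    # Static: follow the gold move. Dynamic: follow the model's own move,
    # so training also visits states the parser reaches by its mistakes.
    move = model_pick(state) if dynamic else gold
    return apply_move(state, move), gold

oracle = lambda s: "shift"                          # stand-in gold oracle
model_pick = lambda s: random.choice(["shift", "left"])
apply_move = lambda s, m: s + [m]                   # stand-in state update

static_state, gold = train_step([], oracle, model_pick, apply_move,
                                dynamic=False)
print(static_state)  # ['shift'] — static training always follows gold moves
```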

Static vs Dynamic Oracle Training

Figure Results are very close for languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for languages with between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for languages with more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
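As an illustration of this constraint (not code from the thesis): a tree is projective iff no two of its arcs cross when drawn above the sentence. A small check over (head, dependent) position pairs:

```python
# Arcs are (head, dependent) word positions; 0 stands for the root.

def is_projective(arcs):
    spans = [tuple(sorted(a)) for a in arcs]
    for (l1, r1) in spans:
        for (l2, r2) in spans:
            if l1 < l2 < r1 < r2:   # the two arcs cross
                return False
    return True

print(is_projective([(2, 1), (0, 2), (2, 3)]))  # True: no crossing arcs
print(is_projective([(1, 3), (2, 4)]))          # False: 1-3 crosses 2-4
```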

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language | Projectivity | Best (LAS) | Our (LAS)
grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced “Context, Word and Morph-feat” embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Introduction

Dependency Parsing Categorization

Grammar BasedRelying on a formal grammardefining a formal languageasking whether a given inputsentence is in the languagedefined by the grammar or not

Data-drivenMaking essential use of machinelearning from linguistic data in orderto parse new sentences

3

3From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 6 123

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition.
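The shift/left/right formulas used throughout can be sketched as plain state updates. A minimal arc-hybrid sketch (the tuple layout and word-string arcs are illustrative choices):

```python
def shift(stack, buffer, arcs):
    # shift(sigma, b|beta, A) = (sigma|b, beta, A)
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    # left_d(sigma|s, b|beta, A) = (sigma, b|beta, A U {(b, d, s)})
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    # right_d(sigma|s|t, beta, A) = (sigma|s, beta, A U {(s, d, t)})
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# attach both modifiers of "economic news had ..." step by step
state = ([], ["economic", "news", "had"], set())
state = shift(*state)
state = left(*state, "amod")    # news --amod--> economic
state = shift(*state)
state = left(*state, "nsubj")   # had --nsubj--> news
```

Each transition either moves a word or pops the stack while adding one arc, which is why a sentence of n words is parsed in a linear number of steps.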

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview — the σ-LSTM, β-LSTM, and Action-LSTM summaries are concatenated with t-RNN head embeddings and fed to an MLP]


4. Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
• Koc University ranked 16th out of 30 participants (2nd among transition-based parsers)

Differences between CoNLL17 and CoNLL18: (1) train/test split changes; (2) annotation changes.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP)

Only Action LSTM

Figure: Only Action-LSTM

Only β-LSTM

Figure: Only β-LSTM

Only σ-LSTM

Figure: Only σ-LSTM

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

[Figure: Tree-stack LSTM overview with the t-RNN highlighted]

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves.
Dynamic oracle: transitions using predicted moves.

In both cases, the log-probability of gold moves is maximized.
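The two regimes differ only in which state sequence the trainer visits. A schematic sketch, where `ToyState`, `ToyModel`, and the constant oracle are placeholder stubs (not the thesis code):

```python
import math

class ToyState:
    """Stub parser state: any move applies, parsing ends after 3 moves."""
    def __init__(self, n=0):
        self.n = n
    def is_final(self):
        return self.n >= 3
    def apply(self, move):
        return ToyState(self.n + 1)

class ToyModel:
    """Stub scorer returning fixed transition probabilities."""
    def predict(self, state):
        return {"shift": 0.6, "left": 0.3, "right": 0.1}

def train_sentence(state, oracle, model, dynamic=False):
    """Static oracle: visit states reached by gold moves.
    Dynamic oracle: visit states reached by the model's own predictions.
    Either way, the loss maximizes log p(gold move) at each visited state."""
    loss = 0.0
    while not state.is_final():
        gold = oracle(state)               # correct move for this state
        probs = model.predict(state)
        loss -= math.log(probs[gold])      # -log p(gold)
        move = max(probs, key=probs.get) if dynamic else gold
        state = state.apply(move)
    return loss

loss = train_sentence(ToyState(), lambda s: "shift", ToyModel(), dynamic=True)
```

With dynamic training the model sees states its own mistakes produce, so the oracle must be able to name the best gold move from any state, not just gold states.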


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition-based parsers can only build projective trees.

(Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf)
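A tree is projective when no two dependency arcs cross. A small check of that property; the 1-based head-array encoding (heads[i] is the head of word i+1, 0 is the root) is an assumption for illustration:

```python
def is_projective(heads):
    """heads[i] is the head index of word i+1 (0 denotes the root).
    Returns False if any two dependency arcs cross each other."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, d in arcs[i + 1:]:
            # crossing: exactly one endpoint lies strictly inside the other arc
            if a < c < b < d or c < a < d < b:
                return False
    return True

projective = is_projective([2, 0, 2])         # simple chain around word 2
crossing = is_projective([3, 4, 0, 1])        # arcs (1,3) and (2,4) cross
```

A greedy shift/left/right parser can never produce the second tree, which is why highly non-projective treebanks are harder for transition-based systems.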

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. (From the official results page and our projectivity table.)

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Introduction

Data-driven Dependency Parsing

Graph Based Algorithms

Using maximum spanning tree algorithms from graph theory

Transition Based Algorithms

Capitalizing on greedy stack based algorithms to build dependency treewith incremental steps in linear time

4

4From S Kbler R McDonald and J Nivre 2009 Dependency parsing Morgan ampClaypool US

Omer Kırnap (Koc University) MSc Thesis September 27 2018 7 123

Introduction

Transition Based Dependency Parsing

Transition System Abstract machine with a set of configurations(states) and transitions We use the ArcHybrid transition system[Kuhlmann et al 2011]

Configurations (σ β A)bull σ Stack of tree fragments initially emptybull β Buffer of words initially containing the whole sentencebull A Set of dependency arcs (head relation modifier) initially empty

Transitionsbull shift(σ b|βA) = (σ|b βA)bull leftd(σ|s b|βA) = (σ b|βA cup (b d s))bull rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 10 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions use gold moves.
Dynamic oracle: transitions use predicted moves.

In both cases, the log-probability of the gold moves is maximized.
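The difference between the two regimes can be sketched as follows (a toy illustration: `oracle` and `model` are stand-in functions, not the thesis components):

```python
import random
random.seed(0)

def collect_training_pairs(oracle, model, n_steps, dynamic):
    """Gather (state, gold_move) training pairs for one sentence.
    oracle(state): gold move from this state; model(state): predicted move.
    In both regimes the loss maximizes log p(gold_move | state)."""
    state, pairs = [], []
    for _ in range(n_steps):
        gold = oracle(state)
        pairs.append((tuple(state), gold))
        # static oracle: advance with the gold move;
        # dynamic oracle: advance with the model's (possibly wrong) move
        state.append(model(state) if dynamic else gold)
    return pairs

oracle = lambda state: "shift"                        # toy stand-in oracle
model = lambda state: random.choice(["shift", "left", "right"])
static_pairs = collect_training_pairs(oracle, model, 3, dynamic=False)
dynamic_pairs = collect_training_pairs(oracle, model, 3, dynamic=True)
# static training only visits states reachable by gold moves; dynamic
# training also visits states reached after predicted (possibly wrong) moves
```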

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
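The projectivity constraint — no two dependency arcs may cross — can be checked with a short sketch (the `heads` encoding, head index per word with 0 for ROOT, is an assumption for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of word i (0 = ROOT); heads[0] is a dummy entry.
    A tree is projective iff no two arcs (drawn above the sentence) cross."""
    arcs = [(min(i, h), max(i, h)) for i, h in enumerate(heads) if i > 0]
    # arcs (a, b) and (c, d) cross exactly when a < c < b < d
    return not any(a < c < b < d for (a, b) in arcs for (c, d) in arcs)

is_projective([0, 2, 0, 2])     # "news had effect"-style chain: projective
is_projective([0, 3, 4, 0, 2])  # arcs (1,3) and (2,4) cross: non-projective
```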

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion, we introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Introduction

Transition Based Dependency Parsing

Transition System: An abstract machine with a set of configurations (states) and transitions. We use the ArcHybrid transition system [Kuhlmann et al 2011].

Configurations (σ, β, A):
• σ: Stack of tree fragments, initially empty
• β: Buffer of words, initially containing the whole sentence
• A: Set of dependency arcs (head, relation, modifier), initially empty

Transitions:
• shift(σ, b|β, A) = (σ|b, β, A)
• left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
• right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
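In code, the three transitions can be sketched as follows (a minimal illustration with list/set configurations and a toy 4-word parse; not the thesis implementation):

```python
# Configurations are (sigma, beta, A): sigma/beta are lists of word
# indices, A is a set of (head, deprel, dependent) arcs.

def shift(sigma, beta, A):
    # shift(σ, b|β, A) = (σ|b, β, A)
    return sigma + [beta[0]], beta[1:], A

def left(sigma, beta, A, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
    return sigma[:-1], beta, A | {(beta[0], d, sigma[-1])}

def right(sigma, beta, A, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
    return sigma[:-1], beta, A | {(sigma[-2], d, sigma[-1])}

# Toy parse of "Economic news had effect" (words indexed 1..4):
cfg = ([], [1, 2, 3, 4], set())
cfg = shift(*cfg)            # σ=[1]
cfg = left(*cfg, "amod")     # arc: news(2) -> Economic(1)
cfg = shift(*cfg)            # σ=[2]
cfg = left(*cfg, "nsubj")    # arc: had(3) -> news(2)
cfg = shift(*cfg)            # σ=[3]
cfg = shift(*cfg)            # σ=[3,4]
cfg = right(*cfg, "obj")     # arc: had(3) -> effect(4); root word 3 remains
```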

Omer Kırnap (Koc University) MSc Thesis September 27 2018 8 123

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123


Problem Definition

Find a model that learns to decide the correct transition from the current state.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Use dense embeddings for input features.
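A back-of-the-envelope sketch of why dense embeddings shrink the first layer, with made-up vocabulary and layer sizes:

```python
# First-layer parameter counts scale with input dimensionality.
# All sizes below are illustrative assumptions.
vocab_size, n_pos, hidden = 50_000, 17, 200
one_hot_input = vocab_size + n_pos        # sparse indicator features
emb_dim = 64
dense_input = emb_dim + emb_dim           # word + POS embeddings instead

one_hot_params = one_hot_input * hidden   # hidden layer sees vocab-sized input
dense_params = dense_input * hidden       # hidden layer sees only emb_dim-sized
# input (the vocab-sized table moves into a cheap embedding lookup)
```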

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example over "Economic news had":
Gold tree, with arcs ATT (Economic ← news) and SBJ (news ← had): LAS 1
Pred 1, with arcs OBJ and PRED: LAS 0
Pred 2, with arcs ATT and OBJ: LAS = (1/2) · 100 = 50
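The metric can be sketched in code (a toy illustration mirroring the slide's example; the word-to-arc dictionaries are an assumed representation, not the evaluation script):

```python
# LAS: a word counts as correct only if BOTH its head and its
# dependency label match the gold tree.

def las(gold, pred):
    """gold, pred: dicts mapping word -> (head, label)."""
    hits = sum(pred.get(w) == arc for w, arc in gold.items())
    return 100.0 * hits / len(gold)

gold  = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}  # both wrong
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}   # one right
# las(gold, pred1) -> 0.0, las(gold, pred2) -> 50.0
```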

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. The head word's embedding is modified with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only use word2vec embeddings [Mikolov et al 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

We propose the Tree-stack LSTM model with 4 components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
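A sketch of turning a morph-feat string like the one above into a vector (the per-feature lookup table and the summation are illustrative assumptions; the thesis may combine features differently):

```python
import numpy as np
np.random.seed(0)

dim = 8                                   # toy size; real sizes differ
features = ["Case=Nom", "Gender=Neut", "Number=Sing",
            "Person=3", "PronType=Prs"]
emb = {f: np.random.randn(dim) for f in features}   # stand-in lookup table

def morph_feat_vector(feat_string):
    """Split 'Case=Nom|Gender=Neut|...' on '|' and combine the per-feature
    embeddings; summation here is an assumption made for illustration."""
    vecs = [emb[f] for f in feat_string.split("|") if f in emb]
    return np.sum(vecs, axis=0) if vecs else np.zeros(dim)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```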

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM over upcoming words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM over stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM over the sequence of past transitions

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN combining the head word, dependency relation and dependent word

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
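Equation (1) can be sketched numerically (toy sizes; `W_rnn` and `b_rnn` stand in for parameters that would be learned by the parser):

```python
import numpy as np
np.random.seed(0)

d = 4                                    # toy embedding size (assumption)
W_rnn = 0.1 * np.random.randn(d, 3 * d)  # learned in the real parser
b_rnn = np.zeros(d)

def t_rnn(w_head_old, d_l, w_dep):
    """Eq. (1): compose head, relation and dependent embeddings; the new
    head embedding has the same size as the old one."""
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

w_new = t_rnn(np.ones(d), np.zeros(d), -np.ones(d))
```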

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with:
1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM overview: t-RNN composes head word, dependent word and dependency relation; LSTM outputs are concatenated and fed to the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of gold moves is maximized

Figure: Tree-stack LSTM architecture (t-RNN over head word, dependent word, and dependency relation; σ-, β-, and Action-LSTM outputs concatenated into an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training-token counts below 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training-token counts between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training-token counts above 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language        (1)            (2)     (3)     (4)
af afribooms    not provided   75.46   77.43   78.12
kk ktb          20.19          22.31   21.96   23.86
bxr bdt          7.64           9.76    9.93    8.98
kmr mg          20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
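To make "projective" concrete: a tree is projective if no two of its arcs cross. A minimal checker (my own sketch, not from the thesis; `heads[i]` is the 1-based head of token i+1, with 0 for the root):

```python
def is_projective(heads):
    """heads[i] is the head (1-based, 0 = root) of token i+1.
    Return True iff no two arcs (including the root arc) cross."""
    arcs = [(heads[i], i + 1) for i in range(len(heads))]
    for h1, d1 in arcs:
        lo1, hi1 = sorted((h1, d1))
        for h2, d2 in arcs:
            lo2, hi2 = sorted((h2, d2))
            # two arcs cross if exactly one endpoint of the second
            # lies strictly inside the span of the first
            if lo1 < lo2 < hi1 < hi2:
                return False
    return True
```

For example, the chain `[2, 0, 2]` (tokens 1 and 3 attach to token 2) is projective, while `[3, 4, 0, 2]` contains crossing arcs (3,1) and (2,4) and is not.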

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus    90.7          79.39        55.03 (20)
eu bdt         95.13         84.22        74.13 (17)
hu szeged      97.8          82.66        68.18 (14)
da ddt         98.26         86.28        76.40 (17)
en gum         99.6          85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions

An example parsing of a sentence

Omer Kırnap (Koc University) MSc Thesis September 27 2018 9 123

Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

A character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example: "Economic news had ..."
Gold tree (ATT, SBJ): LAS = 1
Pred 1 (PRED, OBJ): LAS = 0
Pred 2 (OBJ, ATT): LAS = (1/2) · 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
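The LAS computation on the slide can be written directly (a sketch; each tree is represented as a list of (head, label) pairs, one per word):

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    head AND dependency label both match the gold tree."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# Slide example "Economic news (had)":
# gold arcs: Economic -ATT-> news, news -SBJ-> had
gold  = [(2, "ATT"), (3, "SBJ")]
pred1 = [(2, "OBJ"), (3, "PRED")]   # both wrong  -> LAS 0
pred2 = [(2, "ATT"), (3, "OBJ")]    # one of two  -> LAS 50
```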

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser-state features still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (t-RNN over head word, dependent word, and dependency relation; σ-, β-, and Action-LSTM outputs concatenated into an MLP)

We propose the Tree-stack LSTM model with 4 components:
β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
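A sketch of how a morph-feat vector could be built from the UD feature string and concatenated into the word representation. The lookup table and the choice to sum per-feature embeddings are my assumptions for illustration, not necessarily the thesis's exact scheme:

```python
def morph_feat_vector(feats, table, dim):
    """Map a UD feature string like 'Case=Nom|Number=Sing' to a single
    vector; here (an assumption) by summing per-feature embeddings.
    Unknown features fall back to a zero vector."""
    total = [0.0] * dim
    for f in feats.split("|"):
        for i, v in enumerate(table.get(f, [0.0] * dim)):
            total[i] += v
    return total

def word_representation(word_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four ingredients listed on the slide:
    char-LSTM word vector, BiLSTM context vector, POS vector, morph-feat vector."""
    return word_vec + context_vec + pos_vec + feat_vec

# Toy 2-d feature embeddings (hypothetical values)
table = {"Case=Nom": [1.0, 0.0], "Number=Sing": [0.0, 1.0]}
rep = word_representation([0.1], [0.2], [0.3],
                          morph_feat_vector("Case=Nom|Number=Sing", table, 2))
# rep == [0.1, 0.2, 0.3, 1.0, 1.0]
```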

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM diagram with the β-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM diagram with the σ-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM diagram with the Action-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependent word, and dependency relation

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
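Equation (1) in pure Python (a sketch; W_rnn is given as a list of rows, and [· ; · ; ·] is vector concatenation):

```python
import math

def t_rnn(w_head_old, d_rel, w_dep, W_rnn, b_rnn):
    """New head embedding: tanh(W_rnn @ [w_head_old; d_rel; w_dep] + b_rnn)."""
    x = w_head_old + d_rel + w_dep   # concatenation of the three inputs
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

# 1-d toy: W_rnn picks out only the old head embedding
out = t_rnn([1.0], [0.0], [0.0], W_rnn=[[1.0, 0.0, 0.0]], b_rnn=[0.0])
# out == [math.tanh(1.0)]
```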

Tree-RNN with:
1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to make the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to make the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
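The two transitions above can be written as list operations (a sketch; tokens are integer ids, `arcs` holds (head, label, dependent) triples in the slide's (b, d, s) / (s, d, t) notation):

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop s from the stack; it becomes a d-dependent of buffer front b."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop t from the stack; it becomes a d-dependent of s below it."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run (hypothetical token ids and labels)
stack, buffer, arcs = [0, 2], [3, 4], set()
left_arc(stack, buffer, arcs, "nsubj")    # adds (3, "nsubj", 2), stack -> [0]
stack2 = [0, 1, 3]
right_arc(stack2, buffer, arcs, "obj")    # adds (1, "obj", 3), stack2 -> [0, 1]
```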

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview (t-RNN over head word, dependent word, and dependency relation; σ-, β-, and Action-LSTM outputs concatenated into an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank was improved, the older parser is handicapped

2. If the training-test split changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the Action-LSTM feeding the decision module

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only the β-LSTM feeding the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only the σ-LSTM feeding the MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM diagram with the t-RNN highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis:

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of the dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123




Problem Definition

Find a model that learns to decide the correct transition from the current parser state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123
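The scaling argument above can be made concrete with a toy sketch (all sizes and names here are illustrative, not taken from the thesis): explicit one-hot feature conjunctions multiply the input dimension, while concatenated dense embeddings keep it small and let the network learn conjunctions itself.

```python
# Toy illustration (hypothetical sizes): a parser state represented either
# via one-hot feature conjunctions or via concatenated dense embeddings.
import random

n_words, n_pos = 20000, 17

# Explicit pairwise conjunction of (word, pos): the one-hot input
# dimension grows multiplicatively.
conjunction_dim = n_words * n_pos
print(conjunction_dim)  # 340000 input dimensions for a single feature pair

# Dense alternative: each atomic feature gets a small embedding and the
# network input is just their concatenation.
d_word, d_pos = 100, 16
random.seed(0)
word_emb = {i: [random.gauss(0, 1) for _ in range(d_word)] for i in range(3)}
pos_emb = {i: [random.gauss(0, 1) for _ in range(d_pos)] for i in range(3)}

def dense_input(word_id, pos_id):
    """Concatenate dense embeddings; the hidden layer learns conjunctions."""
    return word_emb[word_id] + pos_emb[pos_id]

x = dense_input(1, 2)
print(len(x))  # 116 input dimensions instead of 340000
```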

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123
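The two LM components can be sketched structurally in a few lines. This is a simplified, dependency-free stand-in, not the thesis implementation: mean-pooling of character embeddings plays the role of the character LSTM, and forward/backward running summaries play the role of the word-level BiLSTM.

```python
# Minimal stand-in for the LM components (illustrative only): the real
# system uses a character LSTM and a word-level BiLSTM; here mean-pooling
# and cumulative sums play their structural roles.
import random

random.seed(0)
DIM = 8
char_emb = {c: [random.gauss(0, 1) for _ in range(DIM)]
            for c in "abcdefghijklmnopqrstuvwxyz"}

def word_vector(word):
    """Word vector built from characters (stand-in for the char LSTM)."""
    vecs = [char_emb[c] for c in word]
    return [sum(col) / len(vecs) for col in zip(*vecs)]

def context_vectors(words):
    """One context vector per word: forward plus backward running
    summaries (stand-in for the word-based BiLSTM)."""
    wv = [word_vector(w) for w in words]
    fwd, acc = [], [0.0] * DIM
    for v in wv:                        # left-to-right summary
        acc = [a + x for a, x in zip(acc, v)]
        fwd.append(acc)
    bwd, acc = [], [0.0] * DIM
    for v in reversed(wv):              # right-to-left summary
        acc = [a + x for a, x in zip(acc, v)]
        bwd.append(acc)
    bwd.reverse()
    return [f + b for f, b in zip(fwd, bwd)]   # concatenation: 2*DIM dims

ctx = context_vectors(["economic", "news", "had"])
print(len(ctx), len(ctx[0]))  # 3 16
```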

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
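The decision module above can be sketched as a single MLP forward pass over the concatenated state features (all dimensions, weights, and the transition set here are illustrative assumptions, not the thesis configuration):

```python
# Sketch of the decision module: an MLP maps the extracted state features
# to scores over transitions; the highest-scoring transition is chosen.
import math
import random

random.seed(1)
TRANSITIONS = ["SHIFT", "LEFT-ARC", "RIGHT-ARC"]
IN_DIM, HID = 12, 6   # illustrative sizes

W1 = [[random.gauss(0, 0.5) for _ in range(IN_DIM)] for _ in range(HID)]
b1 = [0.0] * HID
W2 = [[random.gauss(0, 0.5) for _ in range(HID)] for _ in range(len(TRANSITIONS))]
b2 = [0.0] * len(TRANSITIONS)

def matvec(W, x, b):
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def decide(state_features):
    """One forward pass: tanh hidden layer, then transition scores."""
    h = [math.tanh(v) for v in matvec(W1, state_features, b1)]
    scores = matvec(W2, h, b2)
    return TRANSITIONS[max(range(len(scores)), key=scores.__getitem__)]

features = [random.gauss(0, 1) for _ in range(IN_DIM)]
print(decide(features))  # one of SHIFT / LEFT-ARC / RIGHT-ARC
```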

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

[Figure: three dependency trees over "Economic news had ...": the gold tree with arcs ATT and SBJ (LAS 1); Prediction 1 with arcs OBJ and PRED, both wrong (LAS 0); Prediction 2 with arcs ATT and OBJ, one of two correct (LAS = (1/2) · 100 = 50%)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
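The metric above can be computed directly; the example mirrors the slide's "one of two words correct" case (head indices here are illustrative):

```python
# LAS: a word counts as correct only if BOTH its head and its dependency
# label match the gold tree.
def las(gold, pred):
    """gold/pred: lists of (head_index, label) per word. Returns percent."""
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news": a prediction getting one of two attachments right
gold = [(2, "ATT"), (3, "SBJ")]
pred = [(2, "ATT"), (3, "OBJ")]
print(las(gold, pred))  # 50.0
```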

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers.5

5Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c) features (LAS):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags.

Our BiLSTM language-model word vectors perform better than the FB (Facebook) vectors.

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct features of the parser state remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123


c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview: σ-LSTM, β-LSTM, and Action-LSTM outputs together with t-RNN head embeddings are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview with the β-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM — an LSTM running over the buffer word vectors wi, wi+1, wi+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview with the σ-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM — an LSTM running over the stack word vectors si, si+1, si+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview with the Action-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM — an LSTM running over the embeddings of past transitions]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the dependent word, dependency relation, and head word into a new head embedding]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
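Equation (1) maps directly to code. The dimensions below are illustrative; the essential point is that the new head embedding has the same size as the old one, so composition can be applied repeatedly as the tree grows:

```python
# Equation (1): new head embedding as a tanh-transformed affine map of
# the concatenation [w_head_old ; d_l ; w_dep].
import math
import random

random.seed(0)
D = 4                       # embedding size (illustrative)
IN = 3 * D                  # concatenated input size
W_rnn = [[random.gauss(0, 0.3) for _ in range(IN)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(w_head_old, d_l, w_dep):
    x = w_head_old + d_l + w_dep            # concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

head = [0.1] * D
rel = [0.2] * D             # dependency-relation embedding
dep = [0.3] * D
new_head = t_rnn(head, rel, dep)
print(len(new_head))  # 4, same size as the old head embedding
```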

Tree-RNN with:
1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure sequence: (1) each embedding is initialized by concatenating POS, language, and morph-feat embeddings; (2) the stack's top LSTM is reduced; (3) the t-RNN computes the new head embedding; (4) the β-LSTM recomputes its hidden state from the new input; (5) the tree-stack LSTM is ready to predict the next transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure sequence: (1) each embedding is initialized by concatenating POS, language, and morph-feat embeddings; (2) the stack's top LSTM is reduced; (3) the t-RNN computes the new head embedding; (4) the σ-LSTM recomputes its hidden state from the new input; (5) the tree-stack LSTM is ready to predict the next transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Final overview of the Tree-stack LSTM: σ-LSTM, β-LSTM, and Action-LSTM outputs and t-RNN head embeddings are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
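The left/right transition rules defined above can be executed literally on a small configuration (stack, buffer, arcs). The word indices and relation names below are illustrative:

```python
# Left/right transitions as defined on the slides, applied to a
# configuration (stack, buffer, arcs).
def left(stack, buffer, arcs, d):
    """left_d: stack top s becomes a dependent of buffer front b."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))       # (head b, relation d, dependent s)

def right(stack, buffer, arcs, d):
    """right_d: stack top t becomes a dependent of the element s below it."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))       # (head s, relation d, dependent t)

def shift(stack, buffer):
    stack.append(buffer.pop(0))

stack, buffer, arcs = [], [1, 2, 3], set()   # word indices (illustrative)
shift(stack, buffer)                  # stack=[1]    buffer=[2,3]
left(stack, buffer, arcs, "nsubj")    # adds arc (2, nsubj, 1)
shift(stack, buffer)                  # stack=[2]    buffer=[3]
shift(stack, buffer)                  # stack=[2,3]  buffer=[]
right(stack, buffer, arcs, "obj")     # adds arc (2, obj, 3)
print(sorted(arcs))  # [(2, 'nsubj', 1), (2, 'obj', 3)]
```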

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition-based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers).

Differences from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru taiga (10k)    58.89  60.55
hu szeged (20k)   66.21  68.18
tr imst (50k)     56.78  58.75
ar padt (120k)    67.83  68.14
en ewt (205k)     74.87  75.77
cs cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

[Figure: ablation model using only the Action-LSTM]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

[Figure: ablation model using only the β-LSTM feeding the MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

[Figure: ablation model using only the σ-LSTM feeding the MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview with the t-RNN component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code     Morph-Feats  no Morph-Feats  # of tokens
sv lines      72.18        74.81           48325
fr sequoia    84.36        82.17           50543
en gum        76.44        75.34           53686
ko gsd        73.74        72.54           56687
eu bdt        74.55        73.32           72974
nl lassysmall 76.7         75.8            75134
gl ctg        79.02        79.018          79327
lv lvtb       72.33        72.24           80666
id gsd        75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions are taken using gold moves. Dynamic oracle: transitions are taken using predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM overview; the same architecture is trained under either oracle]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
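The difference between the two regimes can be sketched in one training step (schematic, not the thesis code): both compute the loss as the negative log-probability of the gold move, but the next parser state follows the gold move under a static oracle and the model's own prediction under a dynamic one.

```python
# Static vs dynamic oracle: same loss, different choice of next move.
import math

def log_softmax(scores):
    m = max(scores)
    z = math.log(sum(math.exp(s - m) for s in scores)) + m
    return [s - z for s in scores]

def training_step(scores, gold_index, oracle):
    logp = log_softmax(scores)
    loss = -logp[gold_index]                  # maximize gold log-probability
    if oracle == "static":
        next_move = gold_index                # follow the gold move
    else:                                     # "dynamic"
        next_move = max(range(len(scores)), key=scores.__getitem__)
    return loss, next_move

scores, gold = [0.2, 1.5, -0.3], 0
print(training_step(scores, gold, "static")[1])   # 0 (gold move)
print(training_step(scores, gold, "dynamic")[1])  # 1 (model's own argmax)
```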

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch on limited data does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees.6

6Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
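Projectivity can be checked directly on a head array with the standard crossing-arcs test (a sketch; word indices are 1-based, 0 denotes the root):

```python
# A dependency tree is projective iff no two arcs cross.
def is_projective(heads):
    """heads[i-1] is the head of word i (0 = root)."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:     # arcs (l1,r1) and (l2,r2) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # True: simple nested tree
print(is_projective([3, 4, 0, 3]))  # False: arcs 1->3 and 2->4 cross
```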

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 11 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 12 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 13 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 14 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 15 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example: "Economic news had ..."

Gold tree (arcs SBJ, ATT): LAS = 1
Pred 1 (arcs PRED, OBJ): LAS = 0
Pred 2 (arcs OBJ, ATT): LAS = (1/2) · 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
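The LAS metric above is straightforward to compute. A minimal sketch, where gold and predicted trees are given as per-word (head, label) pairs; the toy arcs are illustrative, not the slide's exact trees:

```python
def las(gold, pred):
    """Labeled Attachment Score: the fraction of words whose predicted
    (head, label) pair exactly matches the gold tree."""
    assert len(gold) == len(pred)
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

# Toy 3-word sentence; each entry is (head index, dependency label),
# with head 0 meaning the root.
gold = [(2, "ATT"), (3, "SBJ"), (0, "PRED")]
pred = [(2, "ATT"), (3, "OBJ"), (0, "PRED")]   # one label wrong
print(round(las(gold, pred), 2))  # 0.67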

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct features to describe the parser state remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
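One plausible way to turn a UD feature string like the one above into a single morph-feat vector is to sum per-feature embeddings; the slides do not spell out the exact composition, so treat the summing scheme (and the toy dimension) as assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8                 # embedding size, chosen arbitrarily
feat_emb = {}           # one embedding per key=value pair, grown on demand

def morph_feat_vector(feats):
    """Compose a single vector from a UD FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs'.
    Summing per-feature embeddings handles the variable number of
    features per word; '_' is the UD convention for 'no features'."""
    v = np.zeros(DIM)
    if feats == "_":
        return v
    for pair in feats.split("|"):
        if pair not in feat_emb:
            feat_emb[pair] = rng.normal(scale=0.1, size=DIM)
        v += feat_emb[pair]
    return v

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (8,)
```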

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi  wi+1  wi+2

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn ∗ [whead old; dl; wdep] + brnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
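Equation (1) translates directly into code. A sketch with an arbitrary embedding size and random, untrained weights:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8                                            # embedding size, chosen arbitrarily
W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))   # untrained weights
b_rnn = np.zeros(D)

def trnn(w_head_old, d_label, w_dep):
    # Equation (1): the head embedding is updated from the old head
    # embedding, the dependency-label embedding and the dependent
    # embedding, concatenated and passed through tanh.
    x = np.concatenate([w_head_old, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = trnn(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
print(new_head.shape)  # (8,)
```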

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head  Dependent

Figure Each embedding initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
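The left and right transition formulas above can be transcribed directly. A sketch operating on word indices, with arcs stored as (head, label, dependent); preconditions and root handling are omitted for brevity:

```python
def shift(stack, buffer):
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # leftd(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s)):
    # the buffer front b becomes the head of the popped stack top s.
    s = stack.pop()
    arcs.append((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # rightd(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t)):
    # the popped stack top t is attached to the element s below it.
    t = stack.pop()
    arcs.append((stack[-1], d, t))

# Toy run over word indices 1..3, with 0 acting as the root:
stack, buffer, arcs = [0], [1, 2, 3], []
shift(stack, buffer)                  # σ=[0,1]   β=[2,3]
left_arc(stack, buffer, arcs, "SBJ")  # word 2 heads word 1
shift(stack, buffer)                  # σ=[0,2]   β=[3]
shift(stack, buffer)                  # σ=[0,2,3] β=[]
right_arc(stack, buffer, arcs, "OBJ") # word 2 heads word 3
print(arcs)  # [(2, 'SBJ', 1), (2, 'OBJ', 3)]
```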

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang code        MLP     Tree-stack
ru taiga (10k)   58.89   60.55
hu szeged (20k)  66.21   68.18
tr imst (50k)    56.78   58.75
ar padt (120k)   67.83   68.14
en ewt (205k)    74.87   75.77
cs cac (473k)    83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang code   MLP     Only Action  Only-β  Only-σ
hu szeged   66.21   66.87        66.94   67.03
sv lines    71.12   72.05        72.17   72.45
tr imst     57.12   56.87        57.02   57.12
ar padt     67.83   66.67        66.89   66.92
cs cac      83.89   82.23        83.13   83.17
en ewt      75.54   75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
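The distinction can be sketched abstractly: both modes maximize the log-probability of the gold move, but they differ in which move is executed to produce the next training state. The toy move sequences below are illustrative, not from the thesis experiments.

```python
def training_trajectory(gold_moves, model_moves, mode):
    """Return the transition sequence actually executed during training.
    Static oracle: always execute the gold move, so the parser only ever
    sees states on the gold derivation. Dynamic oracle: execute the
    model's predicted move, so training also visits states that follow
    the model's own mistakes. In both modes the loss maximizes the
    log-probability of the gold move at each visited state."""
    return [g if mode == "static" else p
            for g, p in zip(gold_moves, model_moves)]

gold = ["SHIFT", "LEFT", "SHIFT", "RIGHT"]
pred = ["SHIFT", "SHIFT", "LEFT", "RIGHT"]   # the model disagrees at steps 2-3
print(training_trajectory(gold, pred, "static"))   # follows gold
print(training_trajectory(gold, pred, "dynamic"))  # follows the model
```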

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  75.46  77.43  78.12
kk ktb         20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg         20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
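Projectivity is easy to check: a tree is projective iff no two dependency arcs cross when drawn above the sentence. A small sketch, where heads[i] is the head of word i+1 and 0 is the root:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1; head 0 is the root. A tree is
    projective iff no two arcs cross when drawn above the sentence."""
    arcs = [(min(h, i + 1), max(h, i + 1)) for i, h in enumerate(heads)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:       # strictly interleaved endpoints
                return False
    return True

print(is_projective([2, 0, 2]))     # True:  1<-2, root->2, 2->3
print(is_projective([3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```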

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
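Equation (1) can be sketched directly with toy parameters (W_rnn and b_rnn are random here, and D is an illustrative dimension, not the thesis's actual size):

```python
# Sketch of the t-RNN composition in Eq. (1): the new head embedding is a
# tanh-squashed affine function of [old head; relation; dependent].
import math, random

random.seed(1)
D = 8                                              # toy embedding size

W_rnn = [[random.uniform(-0.5, 0.5) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(head_old, rel, dep):
    x = head_old + rel + dep                       # concatenation [h; d_l; w_dep]
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

head_new = t_rnn([0.1] * D, [0.2] * D, [0.3] * D)
assert len(head_new) == D
```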

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
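The left and right transitions above can be sketched on plain Python lists; an arc (h, d, c) records that head h governs child c with label d (the words and labels are illustrative):

```python
# Sketch of the two transitions shown above, on plain lists.
# left_d(σ|s, b|β, A)  = (σ, b|β, A ∪ {(b, d, s)}):
#   pop s from the stack, attach it to the front of the buffer b.
# right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
#   pop t from the stack, attach it to the item s below it.
def left(stack, buffer, arcs, label):
    s = stack.pop()
    arcs.append((buffer[0], label, s))

def right(stack, buffer, arcs, label):
    t = stack.pop()
    arcs.append((stack[-1], label, t))

stack, buffer, arcs = ["ROOT", "news"], ["had", "effect"], []
left(stack, buffer, arcs, "nsubj")        # "had" becomes head of "news"
assert arcs == [("had", "nsubj", "news")] and stack == ["ROOT"]
```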

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code (tokens)   MLP     Tree-stack
ru taiga (10k)       58.89   60.55
hu szeged (20k)      66.21   68.18
tr imst (50k)        56.78   58.75
ar padt (120k)       67.83   68.14
en ewt (205k)        74.87   75.77
cs cac (473k)        83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.6             18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48325
fr sequoia      84.36         82.17            50543
en gum          76.44         75.34            53686
ko gsd          73.74         72.54            56687
eu bdt          74.55         73.32            72974
nl lassysmall   76.7          75.8             75134
gl ctg          79.02         79.018           79327
lv lvtb         72.33         72.24            80666
id gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
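The contrast can be sketched as a toy training step: both oracles maximize log p of the gold move, but only the dynamic oracle follows the model's own prediction during training (the model here is a random stand-in, not the thesis parser):

```python
# Sketch: static vs dynamic oracle. The loss is the same negative
# log-probability of the gold move in both cases; the difference is which
# move is actually executed to reach the next parser state.
import math, random

random.seed(2)
MOVES = ["shift", "left", "right"]

def model_probs(state):                    # toy stand-in for the real parser
    ps = [random.random() for _ in MOVES]
    z = sum(ps)
    return [p / z for p in ps]

def training_step(state, gold_move, dynamic):
    probs = model_probs(state)
    loss = -math.log(probs[MOVES.index(gold_move)])        # same either way
    # static: follow the gold move; dynamic: follow the model's prediction
    followed = MOVES[probs.index(max(probs))] if dynamic else gold_move
    return loss, followed

loss, followed = training_step(state=0, gold_move="shift", dynamic=False)
assert followed == "shift"                 # static oracle always follows gold
```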

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
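Projectivity can be checked by testing whether any two dependency arcs cross; a minimal sketch, with heads[i] giving the head index of word i+1 (0 = artificial root):

```python
# Sketch: a dependency tree is projective iff no two arcs cross, i.e. no
# pair of arc spans is strictly interleaved.
def is_projective(heads):
    # heads[d-1] is the head of word d (words numbered from 1, root is 0)
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:          # strictly interleaved -> crossing
                return False
    return True

assert is_projective([2, 0, 2])            # simple projective tree
assert not is_projective([0, 4, 1, 2])     # arcs 1->3 and 2->4 cross
```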

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
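The decision module can be sketched as a one-hidden-layer MLP over the extracted state features; all sizes and parameters here are illustrative, not the thesis's actual configuration:

```python
# Sketch: an MLP maps the extracted state features to one score per
# transition; the highest-scoring transition is chosen.
import math, random

random.seed(3)

def layer(n_in, n_out):
    return [[random.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]

def mlp(features, W1, W2):
    h = [math.tanh(sum(w * f for w, f in zip(row, features))) for row in W1]
    return [sum(w * hi for w, hi in zip(row, h)) for row in W2]

TRANSITIONS = ["shift", "left-arc", "right-arc"]
W1, W2 = layer(20, 10), layer(10, len(TRANSITIONS))   # toy sizes

scores = mlp([0.1] * 20, W1, W2)
best = TRANSITIONS[scores.index(max(scores))]
assert best in TRANSITIONS
```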

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example sentence: "Economic news had ..."
Gold tree (arcs SBJ, ATT): LAS = 1
Pred 1 (arcs PRED, OBJ): LAS = 0
Pred 2 (arcs OBJ, ATT): LAS = (1/2) * 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
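The LAS computation above can be sketched directly; mirroring the slide's example, one correct word out of two gives 50% (the head/label values are illustrative):

```python
# Sketch: LAS counts a word as correct only when both its predicted head
# and its predicted dependency label match the gold annotation.
def las(gold, pred):
    # gold/pred: one (head, label) pair per word
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

gold = [("had", "SBJ"), ("had", "ATT")]
pred = [("had", "OBJ"), ("had", "ATT")]   # one label wrong out of two words
assert las(gold, pred) == 50.0
```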

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture with the t-RNN highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
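The difference between the two training regimes can be sketched as a single loop: both accumulate the negative log-probability of the gold move, and they differ only in which move is actually applied to advance the state. The `ToyState`, `score_moves`, and `gold_oracle` interfaces below are hypothetical stand-ins, not the thesis implementation.

```python
import random

class ToyState:
    """Minimal stand-in parser state: finishes after n moves."""
    def __init__(self, n):
        self.n = n
    def is_final(self):
        return self.n == 0
    def apply(self, move):
        return ToyState(self.n - 1)

def train_sentence(score_moves, state, gold_oracle, dynamic=False, explore=0.1):
    """One pass over a sentence; returns the NLL loss of the gold moves."""
    loss = 0.0
    while not state.is_final():
        gold = gold_oracle(state)          # best (reachable) gold move
        scores = score_moves(state)        # move -> log-probability
        loss -= scores[gold]               # maximize log p of the gold move
        if dynamic and random.random() < explore:
            move = max(scores, key=scores.get)  # follow the model's prediction
        else:
            move = gold                         # follow the gold move (static)
        state = state.apply(move)
    return loss

# static-oracle pass over a toy 3-move "sentence"
loss = train_sentence(lambda s: {"shift": -0.1, "left": -2.3},
                      ToyState(3), lambda s: "shift")
```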

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees. 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
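A tree is projective iff no two dependency arcs cross. A minimal check over a head array (1-based word positions, 0 for the root) could look like this; the quadratic scan is for clarity, not efficiency:

```python
def is_projective(heads):
    """heads[i] = head of word i+1 (positions are 1-based; 0 is the root).
    Returns True iff no two arcs strictly interleave (i.e., cross)."""
    # each arc as an interval (min(head, dep), max(head, dep))
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, d in arcs[i + 1:]:
            if a < c < b < d or c < a < d < b:   # strictly interleaved spans
                return False
    return True
```

For "Economic news had" with heads [2, 3, 0] the tree is projective; a 4-word tree with heads [3, 4, 0, 3] has crossing arcs (1,3) and (2,4) and is not.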

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Use dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
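The two components can be sketched as follows. Plain tanh-RNN cells stand in for the actual LSTM cells, and all weights, dimensions, and character features are toy values, not the trained language model:

```python
import numpy as np

rng = np.random.default_rng(0)
H = 16                                          # toy hidden size
Wc = rng.normal(scale=0.1, size=(H, H + 1))     # char-level RNN weights

def word_vector(word):
    """Char-level RNN over the word's characters; the final hidden
    state serves as the word embedding."""
    h = np.zeros(H)
    for ch in word:
        x = np.array([ord(ch) / 128.0])         # toy per-character feature
        h = np.tanh(Wc @ np.concatenate([h, x]))
    return h

Wf = rng.normal(scale=0.1, size=(H, 2 * H))     # forward word-level RNN
Wb = rng.normal(scale=0.1, size=(H, 2 * H))     # backward word-level RNN

def context_vectors(words):
    """Word-level BiRNN over word vectors; concatenating the forward and
    backward hidden states gives each word's context embedding."""
    vs = [word_vector(w) for w in words]
    fwd, h = [], np.zeros(H)
    for v in vs:                                # left-to-right pass
        h = np.tanh(Wf @ np.concatenate([h, v]))
        fwd.append(h)
    bwd, h = [], np.zeros(H)
    for v in reversed(vs):                      # right-to-left pass
        h = np.tanh(Wb @ np.concatenate([h, v]))
        bwd.append(h)
    bwd.reverse()
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

ctx = context_vectors(["Economic", "news", "had"])
```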

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
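A minimal sketch of such a decision module: one hidden layer over the concatenated state features, then a softmax over the candidate transitions. The layer sizes and the three-move inventory are illustrative, not the thesis configuration:

```python
import numpy as np

def mlp_decide(features, W1, b1, W2, b2, transitions):
    """Score transitions from extracted state features; the argmax of the
    softmax is taken as the parser's next move."""
    h = np.tanh(W1 @ features + b1)      # hidden layer
    logits = W2 @ h + b2                 # one logit per transition
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                 # softmax
    return transitions[int(np.argmax(probs))], probs

rng = np.random.default_rng(0)
feats = rng.normal(size=20)              # toy extracted state features
W1, b1 = rng.normal(size=(32, 20)), np.zeros(32)
W2, b2 = rng.normal(size=(3, 32)), np.zeros(3)
move, probs = mlp_decide(feats, W1, b1, W2, b2, ["shift", "left", "right"])
```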

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

[Figure] Example with "Economic news had": the gold tree has arcs ATT and SBJ. Pred 1 (PRED, OBJ) gets neither word right: LAS = 0. Pred 2 (OBJ, ATT) gets one of the two words right: LAS = (1/2) × 100 = 50.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
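The metric itself is straightforward to compute once gold and predicted (head, label) pairs are aligned per word; a small sketch with the hypothetical three-word example:

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted head
    AND dependency label both match the gold tree.
    gold, pred: lists of (head, label) tuples, one per word."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# "Economic news had": heads are word indices (0 = root); labels illustrative
gold = [(2, "ATT"), (3, "SBJ"), (0, "ROOT")]
pred = [(2, "OBJ"), (3, "SBJ"), (0, "ROOT")]   # one label wrong -> 2/3 correct
```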

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):


Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than the FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the right features of the parser state still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17:
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18:
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture overview]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
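One plausible way to embed such a UD FEATS string is one embedding per Key=Value pair. Whether the thesis sums or otherwise composes these per-feature embeddings is not shown on the slide, so summing is an assumption here, and all dimensions and table entries are illustrative:

```python
import numpy as np

DIM = 8                        # toy embedding size
rng = np.random.default_rng(0)
feat_table = {}                # one embedding per "Key=Value" feature

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs'
    by summing one learned vector per Key=Value pair."""
    vec = np.zeros(DIM)
    if feats == "_":           # UD marks 'no features' with an underscore
        return vec
    for kv in feats.split("|"):
        if kv not in feat_table:
            feat_table[kv] = rng.normal(scale=0.1, size=DIM)
        vec += feat_table[kv]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```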

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture with the β-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM runs over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture with the σ-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM runs over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture with the Action-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
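Equation (1) can be sketched directly: concatenate the old head embedding, the dependency-label embedding d_l, and the dependent embedding, then apply an affine map and tanh. The toy dimensions and random weights below are illustrative only:

```python
import numpy as np

def trnn_update(w_head, d_label, w_dep, W_rnn, b_rnn):
    """Equation (1): w_head_new = tanh(W_rnn @ [w_head; d_label; w_dep] + b_rnn).
    The output replaces the head word's embedding after an arc is added."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

# toy sizes: head/dependent embeddings of size 4, label embedding of size 2
rng = np.random.default_rng(0)
W = rng.normal(scale=0.5, size=(4, 10))   # output size = head embedding size
b = np.zeros(4)
new_head = trnn_update(rng.normal(size=4), rng.normal(size=2),
                       rng.normal(size=4), W, b)
```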

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
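The left_d and right_d transitions above (arc-hybrid style, as in Kuhlmann et al. 2011) can be sketched as plain list operations on the state (stack σ, buffer β, arc set A). Integer word positions and the helper names are illustrative:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) -> (σ, b|β, A ∪ {(b, d, s)})"""
    s = stack.pop()                  # s leaves the stack ...
    arcs.add((buffer[0], d, s))      # ... attached to the buffer front b

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) -> (σ|s, β, A ∪ {(s, d, t)})"""
    t = stack.pop()                  # t leaves the stack ...
    arcs.add((stack[-1], d, t))      # ... attached to the new stack top s

# left: word 2 becomes a dependent of buffer-front word 3
stack, buffer, arcs = [1, 2], [3], set()
left_arc(stack, buffer, arcs, "amod")

# right: word 1 becomes a dependent of word 0 (the root) below it
stack2, arcs2 = [0, 1], set()
right_arc(stack2, [], arcs2, "root")
```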

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123


Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions


Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: use dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123
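The character-based half of the LM can be sketched as follows. This is a deliberately simplified stand-in — a plain tanh RNN with fixed mixing weights and random character embeddings — that only shows the shape of the computation; the thesis uses a trained character LSTM.

```python
import math
import random

random.seed(0)
D = 5          # word-vector dimension (illustrative)
char_emb = {}  # lazily created embedding per character

def char_rnn_word_vector(word):
    """Run a simple recurrent cell over the characters of `word`;
    the final hidden state serves as the word vector (the real model
    uses an LSTM cell instead of this fixed-weight tanh update)."""
    h = [0.0] * D
    for ch in word:
        x = char_emb.setdefault(ch, [random.uniform(-1, 1) for _ in range(D)])
        h = [math.tanh(0.5 * hi + 0.5 * xi) for hi, xi in zip(h, x)]
    return h

v = char_rnn_word_vector("news")
print(len(v))  # 5
```

Because the vector is built from characters, unseen words still get a representation — one motivation for the character-based component.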

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
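The decision module can be sketched as a one-hidden-layer MLP producing a softmax over transitions. This is an illustrative stand-in, not the thesis's trained model: the dimensions, weights, and move inventory here are made up.

```python
import math
import random

random.seed(3)

def mlp_decide(features, n_moves=4):
    """Sketch of the decision module: one hidden tanh layer over the
    concatenated feature vector, then a softmax over parser moves.
    Dimensions are illustrative and weights are random."""
    H = 8
    W1 = [[random.uniform(-0.1, 0.1) for _ in features] for _ in range(H)]
    W2 = [[random.uniform(-0.1, 0.1) for _ in range(H)] for _ in range(n_moves)]
    h = [math.tanh(sum(w * f for w, f in zip(row, features))) for row in W1]
    scores = [sum(w * hi for w, hi in zip(row, h)) for row in W2]
    z = [math.exp(s) for s in scores]
    total = sum(z)
    return [p / total for p in z]

probs = mlp_decide([0.1, -0.2, 0.3])
print(round(sum(probs), 6))  # 1.0
```

In the real parser the input would be the concatenated feature-extractor output and the argmax of the returned distribution would be the next transition.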

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
17 universal part-of-speech tags
37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

[Figure: LAS example for "Economic news had". Gold tree: Economic -ATT-> news, news -SBJ-> had. Pred 1 (OBJ, PRED labels): LAS 0. Pred 2 (ATT correct, OBJ wrong): LAS = (1/2) · 100 = 50]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
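The LAS computation on the slide above can be made concrete with a small helper. This is hypothetical illustration code mirroring the slide's example; only the two dependent words of "Economic news had" are scored.

```python
def las(gold, pred):
    """Labeled Attachment Score: the fraction of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return correct / len(gold)

# "Economic news had": (head index, dependency label) per dependent word
gold  = [(1, "ATT"), (2, "SBJ")]   # Economic -> news, news -> had
pred1 = [(1, "OBJ"), (2, "PRED")]  # both labels wrong
pred2 = [(1, "ATT"), (2, "OBJ")]   # one of two words fully correct

print(las(gold, pred1))  # 0.0
print(las(gold, pred2))  # 0.5
```

Multiplying by 100 gives the percentage form used in the result tables.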

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5. Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with MLP Parser using Context Embeddings

CoNLL18: KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:
β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

[Figure: Morph-feat embeddings for the word "It" with features Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
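A morph-feat vector can be sketched as one embedding per Key=Value pair from the UD feature string, combined into a single vector. This is a minimal sketch: the per-feature embeddings here are random, and summation is an assumption — the thesis may combine the feature embeddings differently.

```python
import random

random.seed(0)
DIM = 4
feat_emb = {}  # lazily created embedding per "Key=Value" feature

def embed_feats(feat_string):
    """Map a UD feature string like 'Case=Nom|Gender=Neut|Number=Sing'
    to one vector by summing an embedding per Key=Value pair
    (combination by summation is an illustrative assumption)."""
    vec = [0.0] * DIM
    if feat_string == "_":  # CoNLL-U convention for "no features"
        return vec
    for feat in feat_string.split("|"):
        if feat not in feat_emb:
            feat_emb[feat] = [random.uniform(-1, 1) for _ in range(DIM)]
        vec = [a + b for a, b in zip(vec, feat_emb[feat])]
    return vec

v = embed_feats("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(len(v))  # 4
```

The resulting vector is then concatenated with the word, context, and POS vectors listed above.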

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM running over words w_i, w_{i+1}, w_{i+2}]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM running over stack elements s_i, s_{i+1}, s_{i+2}]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM running over the transition history]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependency relation, and dependent word embeddings]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
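Equation (1) can be written out directly: concatenate the old head embedding, the dependency-label embedding, and the dependent embedding, apply a linear map, and squash with tanh. The dimensions and random weights below are illustrative only.

```python
import math
import random

random.seed(1)
D = 3  # embedding dimension (illustrative)
# W_rnn maps the concatenation [head; label; dependent] (3*D) back to D
W_rnn = [[random.uniform(-0.1, 0.1) for _ in range(3 * D)] for _ in range(D)]
b_rnn = [0.0] * D

def t_rnn(w_head, d_label, w_dep):
    """Eq. (1): new head = tanh(W_rnn · [w_head; d_label; w_dep] + b_rnn)."""
    x = w_head + d_label + w_dep  # list concatenation = vector concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

new_head = t_rnn([0.2] * D, [0.1] * D, [-0.3] * D)
print(len(new_head))  # 3
```

The output replaces the head word's embedding, so repeated attachments compose recursively along the tree.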

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: left transition — each embedding is initialized by concatenating POS, language, and morph-feat embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: the stack's top LSTM is reduced]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: t-RNN calculates the new head embedding]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: β-LSTM recalculates its hidden state based on the new input]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: Tree-stack LSTM is ready to give a new transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: right transition — each embedding is initialized by concatenating POS, language, and morph-feat embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: the stack's top LSTM is reduced]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: t-RNN calculates the new head embedding]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: σ-LSTM recalculates its hidden state from the new input]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: Tree-stack LSTM is ready to give a new transition]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
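The left and right transition definitions on these slides can be read as updates on a (stack, buffer, arc set) triple. The sketch below follows those formulas — the buffer front heads the stack top in a left arc, and the stack top attaches to the element below it in a right arc — but the in-place list representation and word indices are illustrative choices, not the thesis's implementation.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the stack top s."""
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t is attached to the element s below it."""
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

stack, buffer, arcs = [0, 2], [3, 4], set()  # word indices
left_arc(stack, buffer, arcs, "nsubj")
print(stack, arcs)  # [0] {(3, 'nsubj', 2)}
```

Each arc triple is (head, label, dependent); a shift transition (not shown) would move the buffer front onto the stack.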

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview — σ-LSTM, β-LSTM, Action-LSTM, and t-RNN outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split changes 2. Annotation changes

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

[Figure: Only Action-LSTM model]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

[Figure: Only β-LSTM model — β-LSTM output fed to the MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

[Figure: Only σ-LSTM model — σ-LSTM output fed to the MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions use gold moves
Dynamic oracle: transitions use predicted moves

In both cases, the log-probability of the gold moves is maximized

[Figure: Tree-stack LSTM overview diagram]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
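The only operational difference between the two regimes is which move the parser follows to reach the next training state; a minimal sketch:

```python
def choose_followed_move(gold_move, predicted_move, dynamic):
    """Static oracle: always follow the gold move, so training only
    visits gold states. Dynamic oracle: follow the model's predicted
    move, so training also visits states the model reaches on its own.
    In both regimes the loss term maximizes log p(gold_move)."""
    return predicted_move if dynamic else gold_move

print(choose_followed_move("SHIFT", "LEFT-ARC", dynamic=False))  # SHIFT
print(choose_followed_move("SHIFT", "LEFT-ARC", dynamic=True))   # LEFT-ARC
```

The move names here are illustrative labels; the real training loop would apply the chosen move to the parser state before scoring the next step.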

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors, trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
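Projectivity can be checked by testing whether any two dependency arcs cross. The helper below is hypothetical illustration code, with heads[i] giving the head of word i and index 0 reserved for the artificial root.

```python
def is_projective(heads):
    """heads[i] = head index of word i (index 0 is the artificial root).
    A tree is projective iff no two arcs cross when drawn above the
    sentence, i.e. there is no pair of arcs with a1 < a2 < b1 < b2."""
    arcs = [(min(i, h), max(i, h)) for i, h in enumerate(heads[1:], start=1)]
    for a1, b1 in arcs:
        for a2, b2 in arcs:
            # one endpoint strictly inside the other arc, one outside
            if a1 < a2 < b1 < b2:
                return False
    return True

print(is_projective([None, 2, 0, 2]))     # True: nested arcs only
print(is_projective([None, 3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```

Non-projective gold trees therefore put an upper bound on the accuracy such a parser can reach, which motivates the comparison on the next slide.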

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7         79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.8         82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.6         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7. From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 16 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model that learns to decide the correct transition from the current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123
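The problem statement above can be sketched as a greedy parsing loop: a learned model repeatedly picks the next transition given the current state (stack, buffer, arcs). This is a minimal illustration using arc-hybrid-style transitions; `next_transition` stands in for the learned classifier and is a hypothetical callback, not the thesis code.

```python
def parse(n_words, next_transition):
    """Greedy transition-based parsing: repeatedly ask a model for the
    next transition until the buffer is empty and the stack holds only
    the artificial root (index 0)."""
    stack, buffer, arcs = [0], list(range(1, n_words + 1)), []
    while buffer or len(stack) > 1:
        t = next_transition(stack, buffer, arcs)
        if t == "SHIFT":
            stack.append(buffer.pop(0))
        elif t == "LEFT":                  # stack top becomes dependent of buffer front
            arcs.append((buffer[0], stack.pop()))
        else:                              # RIGHT: stack top depends on the word below it
            dep = stack.pop()
            arcs.append((stack[-1], dep))
    return arcs                            # list of (head, dependent) pairs

# Toy "oracle" for a 2-word right-branching sentence:
oracle = lambda stack, buffer, arcs: "SHIFT" if buffer else "RIGHT"
```

Running `parse(2, oracle)` shifts both words and then reduces them with two RIGHT transitions, yielding the arcs (1, 2) and (0, 1).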

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123
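The dimensionality argument can be made concrete. The 17 universal POS tags come from the slides; the 50k vocabulary size and the embedding values are illustrative assumptions.

```python
# One-hot feature conjunctions explode in dimension: crossing an assumed
# 50k-word vocabulary with the 17 universal POS tags already yields
# 850,000 binary dimensions for a single word-tag pairing.
vocab_size, n_pos_tags = 50_000, 17
one_hot_conjunction_dims = vocab_size * n_pos_tags

# Dense embeddings instead map each atomic feature to a short learned
# vector; the network's hidden layer learns the conjunctions.
embeddings = {"news": [0.2, -0.1], "NOUN": [0.5, 0.3]}   # toy values
dense_input = embeddings["news"] + embeddings["NOUN"]     # 4 dims, not 850k
```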

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
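The two-stage language-model pipeline above can be sketched as follows. A plain recurrent fold stands in for the LSTM cells used in the thesis, and `char_emb` is a toy embedding table; the point is the data flow, not the cell internals.

```python
import math

def encode(vectors):
    """Toy recurrent encoder (a stand-in for an LSTM cell): fold a
    sequence of vectors into a final hidden state."""
    h = [0.0] * len(vectors[0])
    for v in vectors:
        h = [math.tanh(a + b) for a, b in zip(h, v)]
    return h

def word_vector(word, char_emb):
    """Character-based encoder: build a word vector from the word's
    character embeddings."""
    return encode([char_emb[c] for c in word])

def context_vectors(word_vecs):
    """Word-level bidirectional pass: concatenate forward and backward
    hidden states so each word's vector reflects its whole sentence."""
    fwd, h = [], [0.0] * len(word_vecs[0])
    for v in word_vecs:
        h = [math.tanh(a + b) for a, b in zip(h, v)]
        fwd.append(h)
    bwd, h = [], [0.0] * len(word_vecs[0])
    for v in reversed(word_vecs):
        h = [math.tanh(a + b) for a, b in zip(h, v)]
        bwd.append(h)
    return [f + b for f, b in zip(fwd, reversed(bwd))]

char_emb = {"a": [0.1, 0.2], "b": [0.3, 0.4]}   # toy character embeddings
```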

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Gold tree for "Economic news had" (LAS = 1): Economic -ATT-> news, news -SBJ-> had

Pred 1 (labels OBJ and PRED; both arcs wrong): LAS = 0

Pred 2 (ATT correct, OBJ wrong): LAS = (1/2) × 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
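The LAS computation can be sketched as below. This is a minimal illustration over (head, label) pairs, not the official CoNLL evaluation script; the predicted labels mirror the example on the slide.

```python
def las(gold, pred):
    """Labeled Attachment Score: fraction of words whose predicted head
    AND dependency label both match the gold annotation."""
    assert len(gold) == len(pred)
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

# (head, label) per scored word of "Economic news had" ('had' is the root):
gold = [("news", "ATT"), ("had", "SBJ")]
pred1 = [("news", "OBJ"), ("had", "PRED")]   # both arcs mislabeled -> LAS 0
pred2 = [("news", "ATT"), ("had", "OBJ")]    # one of two arcs fully correct -> LAS 0.5
```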

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5. Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings


Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the right representation of the parser state remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and the stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated except on a reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, and Action-LSTM states, plus t-RNN head updates, are concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:
β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
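The initialization above is a plain concatenation of the four embedding sources. A minimal sketch (the dimensions below are illustrative, not the thesis' actual sizes):

```python
def initial_word_repr(word_vec, context_vec, pos_vec, morph_feat_vec):
    """Initialize a word's representation by concatenating the four
    embedding sources listed above."""
    return word_vec + context_vec + pos_vec + morph_feat_vec

# Toy dimensions: 3-dim word, 4-dim context, 2-dim POS, 2-dim morph-feat.
repr_ = initial_word_repr([0.1] * 3, [0.2] * 4, [0.3] * 2, [0.4] * 2)
```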

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
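Embedding a UD feature string like the one shown above can be sketched as follows. Summing one vector per `key=value` pair is one plausible composition; the thesis' exact scheme may differ, and the table values are toy numbers.

```python
def morph_feat_vector(feats, table, dim=4):
    """Compose a morph-feat embedding from a UD FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs' by summing
    a learned vector per key=value pair (unknown pairs map to zeros)."""
    vec = [0.0] * dim
    for f in feats.split("|"):
        vec = [a + b for a, b in zip(vec, table.get(f, [0.0] * dim))]
    return vec

# Hypothetical per-feature embedding table:
table = {"Case=Nom": [1, 0, 0, 0], "Number=Sing": [0, 1, 0, 0]}
```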

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture (β-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the upcoming words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack words s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture (Action-LSTM highlighted)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the sequence of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, the dependency relation, and the dependent word

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
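Equation (1) can be sketched directly in code. `W` and `b` stand in for the learned parameters W_rnn and b_rnn and are hypothetical toy values here.

```python
import math

def trnn(w_head, d_rel, w_dep, W, b):
    """Eq. (1): new head embedding = tanh(W · [w_head; d_l; w_dep] + b).
    The inputs are concatenated, then passed through an affine map and tanh."""
    x = w_head + d_rel + w_dep                       # concatenation [w_head; d_l; w_dep]
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# Toy dimensions: 2-dim head, 1-dim relation, 2-dim dependent -> 5-dim input.
W_zero = [[0, 0, 0, 0, 0], [0, 0, 0, 0, 0]]
W_pick = [[1, 0, 0, 0, 0], [0, 0, 0, 0, 1]]          # picks first and last input
```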

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Left transition; each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Right transition; each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
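The two transition definitions can be sketched on list-based stacks and buffers, with arcs stored as (head, relation, dependent) triples exactly as in A ∪ {(b, d, s)} and A ∪ {(s, d, t)}:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop stack top s and attach it to buffer front b with relation d."""
    s = stack.pop()
    arcs.append((buffer[0], d, s))        # (head, relation, dependent)

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop stack top t and attach it to s, the word just below it."""
    t = stack.pop()
    arcs.append((stack[-1], d, t))
```

After each transition the affected component LSTMs recompute their hidden states, as the slides above illustrate.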

Final overview of Tree-stack LSTM

Figure: Final overview of the Tree-stack LSTM (β-LSTM, σ-LSTM, and Action-LSTM hidden states and t-RNN outputs are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
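The decision step in the overview (Concat then MLP) can be sketched as below. The `score` callable is a hypothetical stand-in for the trained MLP, and the three-action set is a simplification of the full labeled transition inventory.

```python
def decide(beta_h, sigma_h, action_h, score):
    """Decision step: concatenate the components' hidden states and let
    a scoring function (the MLP in the thesis) rank candidate transitions."""
    features = beta_h + sigma_h + action_h        # Concat
    actions = ["SHIFT", "LEFT", "RIGHT"]
    return max(actions, key=lambda a: score(a, features))

# Hypothetical MLP outputs for one state:
mlp_scores = {"SHIFT": 0.1, "LEFT": 0.9, "RIGHT": 0.2}
```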

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru_taiga (10k)    58.89   60.55
hu_szeged (20k)   66.21   68.18
tr_imst (50k)     56.78   58.75
ar_padt (120k)    67.83   68.14
en_ewt (205k)     74.87   75.77
cs_cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu_szeged   66.21   66.87         66.94    67.03
sv_lines    71.12   72.05         72.17    72.45
tr_imst     57.12   56.87         57.02    57.12
ar_padt     67.83   66.67         66.89    66.92
cs_cac      83.89   82.23         83.13    83.17
en_ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture (t-RNN highlighted for ablation)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code             without t-RNN   with t-RNN
no_nynorsklia (3k)    51.78           53.33
ru_taiga (11k)        59.13           60.55
gl_treegal (15k)      69.76           70.45
hu_szeged (20k)       66.12           68.18
sv_lines (49k)        74.04           75.46
tr_imst (50k)         58.12           58.75
ar_padt (120k)        68.04           68.14
en_ewt (204k)         74.87           75.77
cs_cac (473k)         82.89           83.57
cs_pdt (1M)           81.17           81.164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv_lines    71.12   72.05    72.17    74.04    72.17       75.46
tr_imst     57.12   56.87    57.02    57.12    58.12       58.75
ar_padt     67.83   66.67    66.89    66.92    68.04       68.14
cs_cac      83.89   82.23    83.13    83.17    82.89       83.57
en_ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3583
ru_taiga        58.32         60.55            10479
sme_giella      52.78         53.39            16385
la_perseus      49.93         51.6             18184
ug_udt          52.78         53.39            19262
sl_sst          46.72         48.77            19473
hu_szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages with training tokens between 50k and 100k:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48325
fr_sequoia      84.36         82.17            50543
en_gum          76.44         75.34            53686
ko_gsd          73.74         72.54            56687
eu_bdt          74.55         73.32            72974
nl_lassysmall   76.7          75.8             75134
gl_ctg          79.02         79.018           79327
lv_lvtb         72.33         72.24            80666
id_gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121064
bg_btb      84.53         84.55            124336
en_ewt      75.77         75.682           204585
ar_padt     68.02         68.14            223881
de_gsd      71.59         71.32            263804
ca_ancora   85.89         85.874           417587
es_ancora   84.99         84.78            444617
cs_cac      83.57         83.63            472608
cs_pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of the gold moves is maximized

Figure: Tree-stack LSTM architecture (β-LSTM, σ-LSTM, Action-LSTM, t-RNN, Concat, MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
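The distinction above fits in a few lines: both regimes supervise toward the gold move, and differ only in which move is executed to produce the next training state. This is a minimal sketch of that control flow, not the training code itself.

```python
def training_step(gold_action, model_action, dynamic):
    """One training decision. Static and dynamic oracle training both
    maximize log p(gold action); they differ only in which action is
    executed to reach the next state."""
    supervise = gold_action                            # loss target: always gold
    execute = model_action if dynamic else gold_action # state update differs
    return supervise, execute
```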

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
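Projectivity has a simple characterization: a tree is projective iff no two arcs cross when drawn above the sentence. A minimal check (quadratic, fine for sentence lengths):

```python
def is_projective(heads):
    """heads[i-1] is the head index of word i (0 = artificial root).
    Projective iff no two arcs cross, i.e. no arc starts strictly inside
    another arc's span and ends strictly outside it."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    return not any(l1 < l2 < r1 < r2
                   for l1, r1 in arcs for l2, r2 in arcs)
```

For example, `[2, 0, 2]` (a simple nested tree) is projective, while `[3, 4, 0, 0]` contains the crossing arcs (1, 3) and (2, 4) and is not.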

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity   Best (LAS)   Our (LAS)
grc_perseus   90.7           79.39        55.03 (20)
eu_bdt        95.13          84.22        74.13 (17)
hu_szeged     97.8           82.66        68.18 (14)
da_ddt        98.26          86.28        76.40 (17)
en_gum        99.6           85.05        76.44 (15)
gl_treegal    100            74.25        70.45 (10)
gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 17 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 18 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 19 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 20 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v) and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Issues with MLP

However

Choosing the correct features of the parser state still remains critical

We are unable to represent the whole parsing history with feature extraction

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

c Tree-stack LSTM Parser (CoNLL18)


Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (σ-, β- and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word and dependency relation)

We propose the Tree-stack LSTM model with 4 components:

1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN
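A sketch of how these components could be wired together (a plain tanh recurrence stands in for the LSTMs, and all dimensions are made up; in the real model the concatenated summary is fed to an MLP over transitions):

```python
import numpy as np

def rnn_last_hidden(vectors, W, U, b):
    """Toy stand-in for an LSTM: run a tanh recurrence over a
    sequence of vectors and return the final hidden state."""
    h = np.zeros(b.shape[0])
    for x in vectors:
        h = np.tanh(W @ x + U @ h + b)
    return h

def state_vector(sigma, beta, actions, params):
    """Summarize the stack (σ-LSTM), buffer (β-LSTM) and transition
    history (Action-LSTM), then concatenate the three summaries."""
    return np.concatenate([rnn_last_hidden(seq, *params)
                           for seq in (sigma, beta, actions)])

rng = np.random.default_rng(0)
params = (rng.normal(size=(2, 3)), rng.normal(size=(2, 2)), np.zeros(2))
state = state_vector([rng.normal(size=3)] * 2,   # two stack items
                     [rng.normal(size=3)] * 3,   # three buffer items
                     [rng.normal(size=3)],       # one past action
                     params)
```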


Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
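The concatenation itself is straightforward; a minimal sketch with made-up dimensions:

```python
import numpy as np

def word_representation(char_vec, context_vec, pos_vec, morph_vec):
    """Build one word representation by concatenating the four input
    vectors; no hand-designed feature templates are involved."""
    return np.concatenate([char_vec, context_vec, pos_vec, morph_vec])

# Illustrative sizes only: 4 + 6 + 3 + 2 = 15 dimensions.
rep = word_representation(np.ones(4), np.zeros(6), np.ones(3), np.zeros(2))
```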


Input Representation

Morph-feat Vectors

Figure: Morph-feat Embeddings (e.g. the FEATS string Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs for the word "It")
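One way such an embedding could be built (a sketch only; summing one learned vector per feature=value pair is an assumption made here, not necessarily the exact composition used in the thesis):

```python
import numpy as np

DIM = 4                         # illustrative embedding size
rng = np.random.default_rng(1)
feat_table = {}                 # "Feature=Value" -> embedding, grown on demand

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as 'Case=Nom|Gender=Neut|Number=Sing'
    by summing one vector per feature=value pair; '_' means no features."""
    vec = np.zeros(DIM)
    if feats in ("", "_"):
        return vec
    for pair in feats.split("|"):
        if pair not in feat_table:
            feat_table[pair] = rng.normal(size=DIM)
        vec += feat_table[pair]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```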


Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

β-LSTM

Figure: Tree-stack LSTM overview with the β-LSTM highlighted

β-LSTM

Figure: Buffer's β-LSTM running over word representations wi, wi+1, wi+2

σ-LSTM

Figure: Tree-stack LSTM overview with the σ-LSTM highlighted

σ-LSTM

Figure: Stack's σ-LSTM running over stack items si, si+1, si+2

Action-LSTM

Figure: Tree-stack LSTM overview with the Action-LSTM highlighted

Action-LSTM

Figure: Action-LSTM running over past transitions

How are the components of the tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, the dependent word and the dependency relation

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
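Equation (1) translated directly into code (NumPy; the zero weights and vector sizes are illustrative only):

```python
import numpy as np

def t_rnn(w_head, d_rel, w_dep, W_rnn, b_rnn):
    """Compute the new head embedding from the old head embedding, the
    dependency-relation embedding and the dependent embedding, eq. (1)."""
    x = np.concatenate([w_head, d_rel, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

# 3-dim word vectors, 2-dim relation vectors: W_rnn maps 8 -> 3 dims.
W_rnn, b_rnn = np.zeros((3, 8)), np.zeros(3)
new_head = t_rnn(np.ones(3), np.ones(2), np.ones(3), W_rnn, b_rnn)
```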


Tree-RNN with

1 Left Transition
2 Right Transition

Left Transition


Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Transitions - Left

leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for a new transition

Right Transition


Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Transitions - Right

rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for a new transition
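Stripped of the embedding updates (t-RNN composition and LSTM recomputation), the two transitions reduce to simple stack/buffer/arc-set operations; a sketch with word ids standing in for words:

```python
def shift(stack, buffer):
    """shift: move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    """leftd(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes the head of the popped stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, arcs, d):
    """rightd(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes the head of the popped top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run over word ids 1..3 ("Economic news had" style):
stack, buffer, arcs = [], [1, 2, 3], set()
shift(stack, buffer)                  # stack [1], buffer [2, 3]
left_arc(stack, buffer, arcs, "ATT")  # arc (2, ATT, 1)
shift(stack, buffer)                  # stack [2], buffer [3]
shift(stack, buffer)                  # stack [2, 3], buffer []
right_arc(stack, arcs, "OBJ")         # arc (2, OBJ, 3)
```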

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview (σ-, β- and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word and dependency relation)

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing
2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models
3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser
4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning
5 Conclusion
6 Future Work & Discussions

4 Results & Comparisons

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18:
1 Train/test split change
2 Annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only)

Only Action LSTM

Figure: Only the Action-LSTM added to the MLP

Only β-LSTM

Figure: Only the β-LSTM added to the MLP

Only σ-LSTM

Figure: Only the σ-LSTM added to the MLP

Ablation Analysis Results

Lang Code   MLP     Only Action  Only-β  Only-σ
hu szeged   66.21   66.87        66.94   67.03
sv lines    71.12   72.05        72.17   72.45
tr imst     57.12   56.87        57.02   57.12
ar padt     67.83   66.67        66.89   66.92
cs cac      83.89   82.23        83.13   83.17
en ewt      75.54   75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

Figure: Tree-stack LSTM overview with the t-RNN highlighted

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code             without t-RNN  with t-RNN
no nynorsklia (3k)    51.78          53.33
ru taiga (11k)        59.13          60.55
gl treegal (15k)      69.76          70.45
hu szeged (20k)       66.12          68.18
sv lines (49k)        74.04          75.46
tr imst (50k)         58.12          58.75
ar padt (120k)        68.04          68.14
en ewt (204k)         74.87          75.77
cs cac (473k)         82.89          83.57
cs pdt (1M)           81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21   66.87   66.94   67.03   66.12      68.18
sv lines    71.12   72.05   72.17   74.04   72.17      75.46
tr imst     57.12   56.87   57.02   57.12   58.12      58.75
ar padt     67.83   66.67   66.89   66.92   68.04      68.14
cs cac      83.89   82.23   83.13   83.17   82.89      83.57
en ewt      75.54   75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training set size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.7         75.8            75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa seraji   81.18        81.12           121064
bg btb      84.53        84.55           124336
en ewt      75.77        75.682          204585
ar padt     68.02        68.14           223881
de gsd      71.59        71.32           263804
ca ancora   85.89        85.874          417587
es ancora   84.99        84.78           444617
cs cac      83.57        83.63           472608
cs pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: transitions are made using gold moves
Dynamic oracle: transitions are made using predicted moves

In both cases, the log-probability of the gold moves is maximized
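Schematically (NumPy, with the scoring model abstracted away), the two regimes share the same loss and differ only in which move the parser executes next:

```python
import numpy as np

def oracle_step(scores, gold_move, mode):
    """One training decision. Both regimes maximize log p(gold move);
    they differ in which move is executed to reach the next state:
    the gold move (static) or the model's best move (dynamic)."""
    probs = np.exp(scores - scores.max())   # stable softmax
    probs = probs / probs.sum()
    loss = -np.log(probs[gold_move])
    next_move = gold_move if mode == "static" else int(np.argmax(scores))
    return loss, next_move

scores = np.array([2.0, 0.5, -1.0])  # the model prefers move 0
_, nxt_static = oracle_step(scores, gold_move=1, mode="static")
_, nxt_dynamic = oracle_step(scores, gold_move=1, mode="dynamic")
```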

Figure: Tree-stack LSTM overview (σ-, β- and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word and dependency relation)

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with less than 20k training tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with between 20k and 50k training tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with more than 50k training tokens

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af afribooms   not provided  75.46  77.43  78.12
kk ktb         20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg         20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Projectivity

Transition based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf
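A tree is projective iff no two dependency arcs cross; a small checker (assuming 1-based token positions with head 0 for the root, a convention chosen here for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (tokens 1..n, head 0 = root).
    Return True iff no two dependency arcs cross."""
    arcs = [(min(i + 1, h), max(i + 1, h))
            for i, h in enumerate(heads) if h != 0]
    # Arcs (a, b) and (c, e) cross when a < c < b < e.
    return not any(a < c < b < e for a, b in arcs for c, e in arcs)

# 1->2, 2->3, 3=root: a projective chain.
# 1->3, 2->4, 3=root, 4->3: arcs (1,3) and (2,4) cross.
```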


Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity (%)  Best (LAS)  Ours (LAS)
grc perseus   90.7              79.39       55.03 (20)
eu bdt        95.13             84.22       74.13 (17)
hu szeged     97.8              82.66       68.18 (14)
da ddt        98.26             86.28       76.40 (17)
en gum        99.6              85.05       76.44 (15)
gl treegal    100               74.25       70.45 (10)
gl ctg        100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Conclusions


Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions




MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: the tree-stack LSTM is ready to predict the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
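The left transition above can be sketched directly from the formula: the stack top s becomes a dependent of the buffer front b, and the arc (b, d, s) is recorded. This is a minimal illustration of the transition semantics only (the LSTM/t-RNN updates are omitted); the function name `left_arc` is mine.

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})
    s = stack.pop()   # dependent: top of the stack
    b = buffer[0]     # head: front of the buffer (stays in place)
    arcs.add((b, d, s))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1], [2, 3], set()
stack, buffer, arcs = left_arc(stack, buffer, arcs, "nsubj")
```

Here word 1 is attached to word 2 with label "nsubj", and the buffer is left untouched.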

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: parser state before the right transition — stack and buffer LSTM chains, t-RNN with Head, Dependent, and Dependency Relation inputs.]

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: the tree-stack LSTM is ready to predict the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
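The right transition mirrors the left one: the stack top t is popped and attached to s, the element below it, which stays on the stack as the head. Again this is a sketch of the transition semantics only (LSTM/t-RNN updates omitted); `right_arc` is my own name.

```python
def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
    t = stack.pop()   # dependent: top of the stack
    s = stack[-1]     # head: the element below it (remains on the stack)
    arcs.add((s, d, t))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1, 2], [3], set()
stack, buffer, arcs = right_arc(stack, buffer, arcs, "obj")
```

Word 2 becomes an "obj" dependent of word 1; word 1 remains available for further attachments.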

Final overview of Tree-stack LSTM

[Figure: tree-stack LSTM overview — the β-, σ-, and Action-LSTM chains and the t-RNN (head word, dependent word, dependency relation) feed a Concat layer into the MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
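The overview slide can be summarized as a single forward pass: encode the buffer, stack, and action history with recurrent networks, concatenate their final hidden states, and score the possible transitions with an MLP. The sketch below uses plain tanh-RNN cells in place of LSTMs for brevity, random toy weights, and made-up dimensions — it shows the data flow, not the thesis implementation.

```python
import numpy as np

def rnn_encode(xs, W, U, b):
    # plain tanh-RNN cell standing in for an LSTM; returns the final hidden state
    h = np.zeros(U.shape[0])
    for x in xs:
        h = np.tanh(W @ x + U @ h + b)
    return h

rng = np.random.default_rng(1)
D, H, A = 6, 5, 3   # embedding dim, hidden dim, number of transitions (all toy)
make = lambda *s: 0.1 * rng.standard_normal(s)

# one encoder per component: β-LSTM (buffer), σ-LSTM (stack), Action-LSTM
params = {n: (make(H, D), make(H, H), make(H)) for n in ("buffer", "stack", "action")}
W_mlp, b_mlp = make(A, 3 * H), make(A)

# a parser state: sequences of embeddings for each component
state = {n: [rng.standard_normal(D) for _ in range(4)]
         for n in ("buffer", "stack", "action")}

features = np.concatenate([rnn_encode(state[n], *params[n])
                           for n in ("buffer", "stack", "action")])
scores = W_mlp @ features + b_mlp   # one score per candidate transition
```

The predicted transition would be `scores.argmax()`, after masking out transitions that are invalid in the current configuration.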

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
  Dependency parsing of 81 treebanks in 49 languages.
  All treebanks use standardized annotation:
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18:
  Dependency parsing of 82 treebanks in 57 languages.
  All treebanks use standardized annotation:
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1 train/test split change, 2 annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results for the CoNLL17 and CoNLL18 systems, tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped.

2 If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure: Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code    MLP    Only Action  Only-β  Only-σ
hu szeged    66.21  66.87        66.94   67.03
sv lines     71.12  72.05        72.17   72.45
tr imst      57.12  56.87        57.02   57.12
ar padt      67.83  66.67        66.89   66.92
cs cac       83.89  82.23        83.13   83.17
en ewt       75.54  75.43        75.56   75.67

Table: Comparison between the MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: tree-stack LSTM overview — the β-, σ-, and Action-LSTM chains and the t-RNN (head word, dependent word, dependency relation) feed a Concat layer into the MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
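The four-way split can be expressed as a small bucketing function. A minimal sketch; the function name and labels are mine, and the token counts in the checks come from the tables on the following slides.

```python
def size_group(tokens):
    """Bucket a treebank by its training-token count, following the 4-way split."""
    if tokens < 20_000:
        return "<20k"
    if tokens < 50_000:
        return "20k-50k"
    if tokens < 100_000:
        return "50k-100k"
    return ">=100k"

groups = [size_group(n) for n in (3_583, 56_687, 204_585)]
```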

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81            48325
fr sequoia   84.36        82.17            50543
en gum       76.44        75.34            53686
ko gsd       73.74        72.54            56687
eu bdt       74.55        73.32            72974
nl lassymal  76.7         75.8             75134
gl ctg       79.02        79.018           79327
lv lvtb      72.33        72.24            80666
id gsd       75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability (logp) of the gold moves is maximized.

[Figure: tree-stack LSTM overview — the β-, σ-, and Action-LSTM chains and the t-RNN (head word, dependent word, dependency relation) feed a Concat layer into the MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
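The difference between the two regimes is purely in the control flow: the loss is always the negative log-probability of the gold move, but the next parser state follows either the gold move (static) or the model's prediction (dynamic). A minimal sketch with a hand-made score table; `step` and the transition names are my own.

```python
import math

def step(scores, gold, dynamic):
    """One training step: NLL of the gold move; the followed move depends on the regime."""
    z = sum(math.exp(s) for s in scores.values())
    loss = -math.log(math.exp(scores[gold]) / z)   # -log p(gold) under softmax
    predicted = max(scores, key=scores.get)
    follow = predicted if dynamic else gold        # dynamic oracle follows the model
    return loss, follow

scores = {"shift": 2.0, "left": 0.5, "right": -1.0}
loss_static, next_static = step(scores, gold="left", dynamic=False)
loss_dynamic, next_dynamic = step(scores, gold="left", dynamic=True)
```

With the same scores the two losses are identical; only the state the parser moves to differs, which is what exposes the model to its own mistakes during dynamic-oracle training.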

Static vs Dynamic Oracle Training

Figure: Results are very close for fewer than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for between 20k and 50k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for more than 50k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with fewer than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
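Strategy (4) amounts to a warm start: initialize the low-resource parser from a pre-trained parser's parameters and then fine-tune. A minimal sketch with plain dicts standing in for parameter stores; the names (`warm_start`, the parameter keys) are hypothetical, not from the thesis.

```python
def warm_start(pretrained, fresh):
    """Copy every matching pre-trained parameter into the new parser
    before fine-tuning on the low-resource treebank."""
    for name, value in pretrained.items():
        if name in fresh:            # only shapes/keys shared by both models
            fresh[name] = value
    return fresh

pretrained = {"lstm.W": [1, 2], "mlp.W": [3]}
fresh = {"lstm.W": [0, 0], "mlp.W": [0], "embed.new_lang": [9]}
fresh = warm_start(pretrained, fresh)
```

Parameters specific to the new language (here `embed.new_lang`) keep their fresh initialization; everything shared starts from the pre-trained values.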

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.6

6Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
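A tree is projective when no two dependency arcs cross. The check below is a minimal sketch (my own helper, not from the thesis): treat every arc as a span and look for strictly interleaved span pairs.

```python
def is_projective(heads):
    """heads[i-1] = head of token i (0 denotes the root; tokens are 1..n).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, e in arcs[i + 1:]:
            if a < c < b < e or c < a < e < b:   # strictly interleaved spans
                return False
    return True

ok = is_projective([2, 0, 2])        # 1<-2->3, no crossing arcs
bad = is_projective([3, 4, 0, 3])    # arcs (1,3) and (2,4) cross
```

Non-projective gold trees put a ceiling on the LAS a purely projective transition system can reach, which is what the table above measures.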

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus   90.7             79.39       55.03 (20)
eu bdt        95.13            84.22       74.13 (17)
hu szeged     97.8             82.66       68.18 (14)
da ddt        98.26            86.28       76.40 (17)
en gum        99.6             85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low resource languages.

As the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens per language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33            3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens in between 50k and 100k

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12             121,064
bg btb     84.53        84.55             124,336
en ewt     75.77        75.682            204,585
ar padt    68.02        68.14             223,881
de gsd     71.59        71.32             263,804
ca ancora  85.89        85.874            417,587
es ancora  84.99        84.78             444,617
cs cac     83.57        83.63             472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions use gold moves.
Dynamic oracle: transitions use predicted moves.

In both cases, the log-probability of gold moves is maximized.

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
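A toy contrast between the two regimes (all names and the state update are illustrative, not the thesis's parser): both maximize the probability of the gold move, but the dynamic oracle advances the parser with its own predicted move, so training visits states the model actually reaches at test time.

```python
def run_epoch(gold_moves, predict, dynamic):
    state, losses = 0, []
    for gold in gold_moves:
        probs = predict(state)                    # move -> probability
        losses.append(-probs[gold])               # stand-in for -log p(gold move)
        chosen = max(probs, key=probs.get) if dynamic else gold
        state += 1 if chosen == "SHIFT" else 0    # toy state update
    return losses

predict = lambda state: {"SHIFT": 0.6, "LEFT": 0.3, "RIGHT": 0.1}
gold = ["SHIFT", "LEFT", "SHIFT"]
static_losses  = run_epoch(gold, predict, dynamic=False)
dynamic_losses = run_epoch(gold, predict, dynamic=True)
print(static_losses == dynamic_losses)   # True: same loss targets here
```

The loss targets the gold move in both runs; only the trajectory of visited states differs once the predictor depends on the state.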

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
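A quick sketch of what projectivity means operationally: a tree is projective iff no two dependency arcs cross when drawn above the sentence. The encoding below (heads[i] is the head of 1-based word i, 0 = ROOT) is an illustrative assumption.

```python
def is_projective(heads):
    # One arc per word, stored as an ordered (left, right) endpoint pair.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, e in arcs:
            if a < c < b < e:     # arc (c, e) crosses arc (a, b)
                return False
    return True

print(is_projective([2, 0, 2]))     # True: no crossing arcs
print(is_projective([3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```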

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7         79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.8         82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.6         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



Problem Definition

Find a model that learns to decide the correct transition from the current state.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
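As a toy sketch of these two components (a plain recurrent cell stands in for an LSTM, and all dimensions and names are illustrative, not the thesis's implementation): a character-level network summarizes each word's spelling into a word vector, and a word-level bidirectional pass produces one context vector per position.

```python
import numpy as np

rng = np.random.default_rng(0)

def rnn_summary(seq, W):
    # Stand-in for an LSTM: folds a variable-length sequence into one vector.
    h = np.zeros(W.shape[0])
    for x in seq:
        h = np.tanh(W @ np.concatenate([h, x]))
    return h

char_emb = {c: rng.normal(size=8) for c in "abcdefghijklmnopqrstuvwxyz"}
W_char = rng.normal(size=(16, 16 + 8)) * 0.1    # char-level weights
W_word = rng.normal(size=(16, 16 + 16)) * 0.1   # word-level weights

def word_vector(word):
    # Character-based network: spelling -> fixed-size word vector.
    return rnn_summary([char_emb[c] for c in word], W_char)

def context_vectors(words):
    # Word-based bidirectional pass: forward and backward states per position.
    vecs = [word_vector(w) for w in words]
    fwd, bwd, h = [], [], np.zeros(16)
    for v in vecs:
        h = np.tanh(W_word @ np.concatenate([h, v])); fwd.append(h)
    h = np.zeros(16)
    for v in reversed(vecs):
        h = np.tanh(W_word @ np.concatenate([h, v])); bwd.append(h)
    return [np.concatenate([f, b]) for f, b in zip(fwd, reversed(bwd))]

ctx = context_vectors(["economic", "news", "had"])
print(len(ctx), ctx[0].shape)   # 3 (32,)
```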

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
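A minimal sketch of such a decision module (sizes, feature count, and the transition inventory are all illustrative assumptions, not the thesis's configuration): embeddings of the extracted state features are concatenated and passed through one hidden layer and a softmax over candidate transitions.

```python
import numpy as np

rng = np.random.default_rng(1)
TRANSITIONS = ["SHIFT", "LEFT-SBJ", "RIGHT-OBJ"]   # illustrative subset

def decide(feature_vecs, W1, b1, W2, b2):
    x = np.concatenate(feature_vecs)   # state description: concatenated features
    h = np.tanh(W1 @ x + b1)           # one hidden layer
    s = W2 @ h + b2
    e = np.exp(s - s.max())            # softmax over candidate transitions
    return e / e.sum()

feats = [rng.normal(size=4) for _ in range(3)]  # e.g. stack top, 2nd item, buffer front
W1, b1 = rng.normal(size=(8, 12)) * 0.1, np.zeros(8)
W2, b2 = rng.normal(size=(3, 8)) * 0.1, np.zeros(3)
p = decide(feats, W1, b1, W2, b2)
print(TRANSITIONS[int(p.argmax())])
```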

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

[Figure: LAS examples for "Economic news had". Gold tree (arcs ATT, SBJ): LAS 1. Pred 1 (PRED, OBJ): LAS 0. Pred 2 (OBJ, ATT): LAS (1/2) · 100]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
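The metric can be written directly; this is a hedged sketch in which the head indices and labels are illustrative:

```python
def las(gold, pred):
    # Fraction of words whose (head, label) pair exactly matches the gold tree.
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# (head_index, label) per word for "Economic news had", ROOT = index 0.
gold  = [(2, "ATT"), (3, "SBJ"), (0, "PRED")]
pred  = [(2, "OBJ"), (3, "SBJ"), (0, "PRED")]  # heads right, one label wrong
score = las(gold, pred)
print(round(score, 2))   # 66.67: 2 of 3 words fully correct
```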

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated except on reduce.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
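A sketch of how a UD FEATS string like the one above could be mapped to a single vector. The pooling by averaging and the dimension are assumptions for illustration, not necessarily the thesis's scheme.

```python
import numpy as np

rng = np.random.default_rng(2)
DIM = 8
pair_emb = {}   # one embedding per "Feature=Value" pair, created on first use

def morph_feat_vector(feats):
    if feats == "_":                   # UD convention for "no features"
        return np.zeros(DIM)
    pairs = feats.split("|")
    for p in pairs:
        pair_emb.setdefault(p, rng.normal(size=DIM))
    # Average the per-pair embeddings into one fixed-size vector.
    return np.mean([pair_emb[p] for p in pairs], axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)   # (8,)
```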

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM reading upcoming words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM reading stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM reading the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, the dependent word, and the dependency relation.

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
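Equation (1) is a single affine map plus tanh over the concatenated head, relation, and dependent vectors; a direct sketch with illustrative dimensions:

```python
import numpy as np

rng = np.random.default_rng(3)
D, R = 6, 3                                   # word / relation embedding sizes
W_rnn = rng.normal(size=(D, D + R + D)) * 0.1
b_rnn = np.zeros(D)

def t_rnn(w_head, d_rel, w_dep):
    # Eq. (1): new head embedding from [head; relation; dependent].
    return np.tanh(W_rnn @ np.concatenate([w_head, d_rel, w_dep]) + b_rnn)

head, dep, rel = rng.normal(size=D), rng.normal(size=D), rng.normal(size=R)
new_head = t_rnn(head, rel, dep)
print(new_head.shape)   # (6,)
```

The output has the same size as a word vector, so the composed head can itself become the input of a later t-RNN application.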

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
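The two transition rules above can be written as pure functions over (stack, buffer, arcs). This is a hedged sketch on toy word indices, not the thesis's parser; the SHIFT step is added only to complete the tiny example.

```python
def left_arc(stack, buffer, arcs, d):
    # left_d: pop stack top s; add arc (b, d, s) with buffer front b as head.
    s = stack[-1]
    return stack[:-1], buffer, arcs | {(buffer[0], d, s)}

def right_arc(stack, buffer, arcs, d):
    # right_d: pop stack top t; add arc (s, d, t) with s below it as head.
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Words: 0 = ROOT, 1 = "news", 2 = "had".
stack, buffer, arcs = [0, 1], [2], set()
stack, buffer, arcs = left_arc(stack, buffer, arcs, "SBJ")   # news <- had
stack, buffer = stack + [buffer[0]], buffer[1:]              # SHIFT "had"
stack, buffer, arcs = right_arc(stack, buffer, arcs, "PRED") # had <- ROOT
print(sorted(arcs))   # [(0, 'PRED', 2), (2, 'SBJ', 1)]
```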

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: β-, σ-, and action-LSTMs with the t-RNN; their states are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 7th out of 33 participants (1st among transition-based parsers).

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split change. 2. Annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity %  Best (LAS)  Our (LAS)
grc perseus  90.7            79.39       55.03 (20)
eu bdt       95.13           84.22       74.13 (17)
hu szeged    97.8            82.66       68.18 (14)
da ddt       98.26           86.28       76.40 (17)
en gum       99.6            85.05       76.44 (15)
gl treegal   100             74.25       70.45 (10)
gl ctg       100             82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention across σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Omer Kırnap (Koc University) MSc Thesis September 27 2018 21 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

For "Economic news had":
Gold tree (arcs ATT, SBJ): LAS = 1
Pred 1 (arcs PRED, OBJ): LAS = 0
Pred 2 (arcs OBJ, ATT): LAS = (1/2) · 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
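The metric can be sketched as a comparison of (head, label) pairs (a minimal illustration; the official CoNLL evaluation script additionally aligns system and gold tokenizations):

```python
def las(gold, pred):
    """Labeled Attachment Score: the fraction of words whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

# "Economic news had": one (head, label) pair per dependent word
gold  = [("news", "ATT"), ("had", "SBJ")]   # Economic<-news, news<-had
pred1 = [("had", "PRED"), ("had", "OBJ")]   # both arcs wrong -> LAS 0
pred2 = [("news", "ATT"), ("had", "OBJ")]   # one arc right   -> LAS 0.5
```

Here `las(gold, pred2)` gives 0.5, matching the slide's ½ example.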

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source: CoNLL17 official results page.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated except on reduce transitions.

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure Tree-stack LSTM overview (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:
β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
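A morph-feat string like the one above can be mapped to a single vector; the sketch below sums one learned embedding per Key=Value feature (the summing scheme and embedding table are hypothetical, for illustration only):

```python
import numpy as np

def morph_feat_vector(feats, table, dim=8):
    """Embed a UD FEATS string such as 'Case=Nom|Number=Sing' by summing
    one embedding per Key=Value pair (hypothetical composition scheme)."""
    vec = np.zeros(dim)
    if feats == "_":                  # UD marks 'no features' with '_'
        return vec
    for kv in feats.split("|"):
        if kv not in table:           # lazily create per-feature embeddings
            table[kv] = np.random.randn(dim)
        vec += table[kv]
    return vec
```

The table grows one entry per distinct Key=Value pair seen in training; in a real model these entries would be trained jointly with the parser.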

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Tree-stack LSTM overview (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure Buffer's β-LSTM over the words wi, wi+1, wi+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Tree-stack LSTM overview (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure Stack's σ-LSTM over the stack items si, si+1, si+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Tree-stack LSTM overview (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN combining the head word, dependency relation, and dependent word embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
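Equation (1) can be sketched directly; the dimensions below are illustrative, not the thesis's actual sizes:

```python
import numpy as np

def t_rnn(w_head, d_rel, w_dep, W_rnn, b_rnn):
    """Eq. (1): compose a new head embedding from the old head embedding,
    the dependency-relation embedding, and the dependent embedding."""
    x = np.concatenate([w_head, d_rel, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

dim, rel_dim = 4, 2                                  # illustrative sizes
rng = np.random.default_rng(0)
W_rnn = rng.normal(size=(dim, 2 * dim + rel_dim))    # maps the concat back to dim
b_rnn = np.zeros(dim)
```

Because the output has the same dimension as a word embedding, the composed head can be fed back into the σ-LSTM or β-LSTM in place of the original head word.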

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Tree-stack LSTM is ready for the next transition

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure Tree-stack LSTM with all components connected (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank has improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.

Figure Tree-stack LSTM overview (outputs of the β-LSTM, σ-LSTM, Action-LSTM, and t-RNN are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
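The static/dynamic distinction can be sketched as a single training step (a simplified illustration; `train_step` and its arguments are hypothetical, and the real model scores moves with the MLP over the concatenated LSTM states):

```python
import math

def train_step(gold_move, model_probs, dynamic=False):
    """One oracle step: the loss term always maximizes log p(gold move);
    the oracles differ only in which move advances the parser state."""
    loss = -math.log(model_probs[gold_move])           # -log p(gold)
    if dynamic:
        taken = max(model_probs, key=model_probs.get)  # follow the model
    else:
        taken = gold_move                              # follow the gold tree
    return loss, taken
```

With a dynamic oracle the parser visits states its own mistakes produce, so at test time it has seen configurations off the gold path; the loss on the gold move is identical in both regimes.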

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 22 123

Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character-based LSTM generates word vectors

Figure Character LSTM from Kırnap et al 2017
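The idea can be sketched with a toy numpy LSTM run over a word's characters, where the final hidden state serves as the word vector. This is an illustrative sketch, not the thesis implementation; `TinyLSTM`, the dimensions, and the character embeddings are all assumptions:

```python
import numpy as np

class TinyLSTM:
    """Minimal LSTM cell (illustrative sketch, not the thesis implementation)."""
    def __init__(self, din, dh, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(4 * dh, din + dh))
        self.b = np.zeros(4 * dh)
        self.dh = dh

    def step(self, h, c, x):
        z = self.W @ np.concatenate([x, h]) + self.b
        i, f, o, g = np.split(z, 4)
        sig = lambda a: 1.0 / (1.0 + np.exp(-a))
        c = sig(f) * c + sig(i) * np.tanh(g)   # cell state update
        h = sig(o) * np.tanh(c)                # hidden state
        return h, c

def word_vector(word, lstm, char_emb):
    """Use the final hidden state over the word's characters as its vector."""
    h = c = np.zeros(lstm.dh)
    for ch in word:
        h, c = lstm.step(h, c, char_emb[ch])
    return h

rng = np.random.default_rng(1)
char_emb = {ch: rng.normal(size=5) for ch in "abcdefghijklmnopqrstuvwxyz"}
lstm = TinyLSTM(din=5, dh=6)
print(word_vector("news", lstm, char_emb).shape)  # (6,)
```

Because the vector is built from characters, unseen words still receive a representation, which matters for morphologically rich languages.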

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word-based BiLSTM generates context vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example: "Economic news had"
Gold tree (arcs ATT, SBJ): LAS = 1
Pred 1 (arcs OBJ, PRED, both wrong): LAS = 0
Pred 2 (arcs ATT, OBJ, one of two correct): LAS = (1/2) · 100
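The metric is simple to compute; a minimal sketch, assuming each word is annotated as a (head index, label) pair:

```python
def las(gold, pred):
    """Labeled Attachment Score: fraction of words whose predicted head
    AND dependency label both match the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for (gh, gl), (ph, pl) in zip(gold, pred)
                  if gh == ph and gl == pl)
    return correct / len(gold)

# "Economic news had" as (head_index, label) per word; 0 = root
gold = [(2, "ATT"), (3, "SBJ"), (0, "ROOT")]
pred = [(2, "ATT"), (3, "OBJ"), (0, "ROOT")]  # one label wrong
print(las(gold, pred))  # 2 of 3 words fully correct
```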

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers

Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63
c       72.2       76         63.5
v-c     76         79         67.6
p-c     78         82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors
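The concatenation itself is straightforward; a minimal sketch with toy dimensions (the real vectors are much larger, and the sizes here are assumptions):

```python
import numpy as np

# Toy dimensions (assumptions; the thesis uses larger vectors)
word_vec    = np.zeros(3)   # character-based LSTM word vector
context_vec = np.zeros(4)   # word-based BiLSTM context vector
pos_vec     = np.zeros(2)   # part-of-speech embedding
morph_vec   = np.zeros(2)   # morph-feat embedding

# A token's input representation is simply the concatenation of the four
token_repr = np.concatenate([word_vec, context_vec, pos_vec, morph_vec])
print(token_repr.shape)  # (11,)
```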

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
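One simple way to embed such a Universal Dependencies feature string is to split it on `|` and combine per-feature embeddings; averaging is used here as an illustrative choice, and the exact composition in the thesis may differ:

```python
import numpy as np

DIM = 8
rng = np.random.default_rng(0)
feat_emb = {}  # one embedding per "Feature=Value" pair (lazily created)

def morph_feat_vector(feats):
    """Embed a UD morphological feature string such as 'Case=Nom|Number=Sing'
    by averaging per-feature embeddings (illustrative sketch)."""
    vecs = []
    for fv in feats.split("|"):
        if fv not in feat_emb:
            feat_emb[fv] = rng.normal(size=DIM)
        vecs.append(feat_emb[fv])
    return np.mean(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (8,)
```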

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM reading words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composes the head word, dependency relation and dependent word embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)
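Equation (1) can be sketched directly in numpy; the dimension `D` and the random initialization are toy assumptions:

```python
import numpy as np

D = 4  # toy embedding / relation dimension (assumption)
rng = np.random.default_rng(0)
W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))  # maps [head; rel; dep] -> new head
b_rnn = np.zeros(D)

def trnn_compose(w_head, d_l, w_dep):
    """Eq. (1): new head embedding from old head, relation and dependent."""
    x = np.concatenate([w_head, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

h = trnn_compose(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
print(h.shape)  # (4,)
```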

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing

2 Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models

3 Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser

4 Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems of the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens per language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang Code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang Code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang Code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of the gold moves is maximized
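The difference between the two regimes can be sketched as a single training step. This is a toy sketch, not the thesis code; `oracle_move`, `model_move` and `log_prob` are hypothetical names, and the state is reduced to a simple transition history:

```python
def train_step(state, oracle_move, model_move, log_prob, dynamic):
    """One training step (sketch). The loss always maximizes log p of the
    oracle's move; the regimes differ only in which move is executed:
    static follows the oracle, dynamic follows the model's prediction."""
    gold = oracle_move(state)
    loss = -log_prob(state, gold)
    taken = model_move(state) if dynamic else gold
    return loss, state + [taken]  # state as a simple transition history

# Toy setting: the oracle always says "shift", the model predicts "left"
oracle = lambda s: "shift"
model = lambda s: "left"
logp = lambda s, m: -0.5

_, static_state = train_step([], oracle, model, logp, dynamic=False)
_, dynamic_state = train_step([], oracle, model, logp, dynamic=True)
print(static_state, dynamic_state)  # ['shift'] ['left']
```

Following predicted moves exposes the model to its own mistakes during training, which is the motivation for dynamic oracles.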

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train the LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees

Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

Source: official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the states of the σ-LSTM, β-LSTM or Action-LSTM may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Omer Kırnap (Koc University) MSc Thesis September 27 2018 23 123

Problem Definition

Find a model learning to decide correct transition from current state

Omer Kırnap (Koc University) MSc Thesis September 27 2018 24 123

2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
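One plausible reading of the figure is that each Key=Value pair in the UD morphology string gets its own embedding, and the per-feature vectors are combined into a single morph-feat vector. Combining by summation is an assumption made here for illustration:

```python
import numpy as np

EMB_DIM = 16
rng = np.random.default_rng(1)
feat_table = {}  # one vector per "Key=Value" pair, grown on first sight

def morph_feat_vector(feats):
    # Combine per-feature embeddings into one morph-feat vector (sum assumed).
    total = np.zeros(EMB_DIM)
    for kv in feats.split("|"):
        if kv not in feat_table:
            feat_table[kv] = rng.normal(size=EMB_DIM)
        total += feat_table[kv]
    return total

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```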

Tree-stack LSTM

Model Components: 1. β-LSTM 2. σ-LSTM 3. Action-LSTM 4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the items s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over past parser actions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN composing the head word, dependent word, and dependency relation into a new head embedding

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
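Eq. (1) is a single recursive composition step. A direct sketch, with an assumed shared dimension D for the word and relation embeddings:

```python
import numpy as np

D = 8  # assumed shared dimension of word and relation embeddings
rng = np.random.default_rng(2)
W_rnn = rng.normal(size=(D, 3 * D)) * 0.1
b_rnn = np.zeros(D)

def trnn(w_head_old, d_l, w_dep):
    # Eq. (1): w_head_new = tanh(W_rnn [w_head_old; d_l; w_dep] + b_rnn)
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

w_head_new = trnn(np.ones(D), np.ones(D), np.ones(D))
```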

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
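The left_d and right_d formulas on the preceding slides can be mirrored directly on a toy parser state (stack, buffer, arc set). Token indices and relation names here are illustrative:

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s is removed and attached to the buffer front b.
    s = stack.pop()
    arcs.add((buffer[0], d, s))
    return stack, buffer, arcs

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t is removed and attached to the next stack item s.
    t = stack.pop()
    arcs.add((stack[-1], d, t))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1], [2, 3], set()
left_arc(stack, buffer, arcs, "nsubj")  # arcs now {(2, 'nsubj', 1)}
```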

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves. Dynamic oracle: transitions using predicted moves.

In both cases, the log probability of gold moves is maximized

[Figure: Tree-stack LSTM architecture — σ-LSTM, β-LSTM, and Action-LSTM hidden states, with head/dependent words and the dependency relation composed by the t-RNN, concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
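The two regimes share the same loss and differ only in which move is followed to reach the next state. A toy training loop, where `model_probs` and the list-valued state are illustrative stand-ins for the real parser:

```python
import math

def train_sentence(gold_moves, model_probs, dynamic):
    # Both regimes maximize log p(gold); they differ only in the move
    # that is *followed* to produce the next parser state.
    state, loss = [], 0.0
    for gold in gold_moves:
        probs = model_probs(state)            # distribution over transitions
        loss -= math.log(probs[gold])         # negative log-likelihood of gold
        if dynamic:
            move = max(probs, key=probs.get)  # follow the model's own prediction
        else:
            move = gold                       # static oracle: follow gold moves
        state.append(move)                    # apply the transition (toy state)
    return loss

uniform = lambda state: {"shift": 1/3, "left": 1/3, "right": 1/3}
loss = train_sentence(["shift", "left", "right"], uniform, dynamic=False)
```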

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]

3. Using our own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
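Projectivity can be checked by testing whether any two dependency arcs cross when drawn above the sentence. A small sketch over head indices (0 denotes the artificial root; token indexing and examples are illustrative):

```python
def is_projective(heads):
    # heads[i-1] = head index of token i (1-based tokens, 0 = root).
    # A tree is projective iff no two arcs cross.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:  # crossing arcs found
                return False
    return True

proj = is_projective([2, 0, 2, 3])     # True: a simple projective tree
nonproj = is_projective([3, 4, 0, 3])  # False: arcs (1,3) and (2,4) cross
```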

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7. From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta 2011 Dynamic programming algorithms for transition-based dependency parsers In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 Association for Computational Linguistics pages 673-682

S Kübler, R McDonald and J Nivre 2009 Dependency parsing Morgan & Claypool US

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A Smith 2015 Transition-based dependency parsing with stack long short-term memory CoRR abs/1505.08075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru_taiga (10k)    58.89  60.55
hu_szeged (20k)   66.21  68.18
tr_imst (50k)     56.78  58.75
ar_padt (120k)    67.83  68.14
en_ewt (205k)     74.87  75.77
cs_cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM overview. The hidden states of the σ-LSTM, β-LSTM, and Action-LSTM and the t-RNN inputs (head word, dependent word, dependency relation) are concatenated and fed to an MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more
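The four-way grouping above can be expressed as a small helper. This is an illustrative sketch: the bucket labels are ours, and the example token counts are taken from the morph-feat tables that follow.

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four analysis groups (thresholds
    follow the slide; the string labels are ours)."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Training-token counts for a few treebanks (from the tables below).
treebanks = {"no_nynorsklia": 3_583, "sv_lines": 48_325,
             "id_gsd": 97_531, "cs_pdt": 1_173_282}
groups = {tb: size_bucket(n) for tb, n in treebanks.items()}
print(groups["no_nynorsklia"])  # <20k
```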

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3,583
ru_taiga       58.32        60.55           10,479
sme_giella     52.78        53.39           16,385
la_perseus     49.93        51.60           18,184
ug_udt         52.78        53.39           19,262
sl_sst         46.72        48.77           19,473
hu_szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.70        75.80           75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121,064
bg_btb     84.53        84.55           124,336
en_ewt     75.77        75.682          204,585
ar_padt    68.02        68.14           223,881
de_gsd     71.59        71.32           263,804
ca_ancora  85.89        85.874          417,587
es_ancora  84.99        84.78           444,617
cs_cac     83.57        83.63           472,608
cs_pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.
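A toy rollout loop can make the contrast concrete. This is an illustrative sketch only: in the real parser the state is the stack/buffer configuration and `predict` would be the trained model; both names here are ours.

```python
import random

def oracle_rollout(gold_moves, predict, dynamic=False, seed=0):
    """Collect (state, gold_move) training pairs for one sentence.

    Static oracle: the parser always executes the gold move, so the states
    visited are exactly those of the gold derivation.  Dynamic oracle: the
    parser executes its own prediction, so it also visits states reached
    through mistakes.  Either way the gold move is what gets supervised.
    """
    rng = random.Random(seed)
    state = []                      # toy state: the moves executed so far
    pairs = []
    for gold in gold_moves:
        pairs.append((tuple(state), gold))        # supervise the gold move
        move = predict(state, rng) if dynamic else gold
        state.append(move)          # dynamic training may follow a mistake
    return pairs

# A deliberately bad predictor: always chooses SHIFT.
bad = lambda state, rng: "SHIFT"
gold = ["SHIFT", "LEFT", "SHIFT", "RIGHT"]

static_pairs = oracle_rollout(gold, bad, dynamic=False)
dynamic_pairs = oracle_rollout(gold, bad, dynamic=True)
print(static_pairs[2][0])   # ('SHIFT', 'LEFT')  -- gold-prefix state
print(dynamic_pairs[2][0])  # ('SHIFT', 'SHIFT') -- model's own history
```

Static training only ever sees gold-prefix states, while dynamic training also supervises states reached through the model's own mistakes.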

Figure: Tree-stack LSTM overview. The hidden states of the σ-LSTM, β-LSTM, and Action-LSTM and the t-RNN inputs (head word, dependent word, dependency relation) are concatenated and fed to an MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch.
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017].
3. Using my own word and context vectors, trained on a different language from the same language family.
4. Applying transfer learning with a pre-trained parser.

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
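Projectivity can be checked by testing whether any two dependency arcs cross. A small sketch (the encoding is ours: `heads[i]` is the head of word `i+1`, with words numbered from 1 and 0 standing for the root):

```python
def crosses(arc1, arc2):
    """Two arcs cross when their spans overlap without nesting."""
    (a1, b1), (a2, b2) = arc1, arc2
    return a1 < a2 < b1 < b2 or a2 < a1 < b2 < b1

def is_projective(heads):
    """A dependency tree (root attached at position 0) is projective
    iff no two of its arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    return not any(crosses(arcs[i], arcs[j])
                   for i in range(len(arcs))
                   for j in range(i + 1, len(arcs)))

# Word 2 is the root in both toy trees below.
print(is_projective([2, 0, 4, 2]))  # projective: True
print(is_projective([2, 0, 2, 1]))  # arc 0-2 crosses arc 1-4: False
```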

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. (7)


7. From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


2 Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 25 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in the input dimension (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

Two shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team, with an MLP Parser using Context Embeddings

CoNLL18: KParse team, with a Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123


a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors
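The data flow of the two components can be sketched as follows. This is a toy stand-in only: a vanilla tanh RNN replaces the LSTMs, and all dimensions and weights are made up; only the wiring (characters → word vector, words → bidirectional context vectors) matches the slides.

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # toy dimensionality

def rnn_step(W, h, x):
    # One step of a vanilla tanh RNN cell (the thesis uses LSTMs).
    return np.tanh(W @ np.concatenate([h, x]))

def word_vector(word, W, char_emb):
    """Character-based RNN: fold the characters into one word vector."""
    h = np.zeros(D)
    for ch in word:
        h = rnn_step(W, h, char_emb[ch])
    return h

def context_vectors(word_vecs, Wf, Wb):
    """Word-based bidirectional RNN: concatenated forward and backward
    states give one context vector per position."""
    fwd, h = [], np.zeros(D)
    for v in word_vecs:
        h = rnn_step(Wf, h, v)
        fwd.append(h)
    bwd, h = [], np.zeros(D)
    for v in reversed(word_vecs):
        h = rnn_step(Wb, h, v)
        bwd.append(h)
    bwd.reverse()
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

sent = ["economic", "news", "had", "little", "effect"]
char_emb = {c: rng.normal(size=D) for c in set("".join(sent))}
W, Wf, Wb = (rng.normal(size=(D, 2 * D)) * 0.1 for _ in range(3))
wvecs = [word_vector(w, W, char_emb) for w in sent]
cvecs = context_vectors(wvecs, Wf, Wb)
print(len(cvecs), cvecs[0].shape)  # 5 (16,)
```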

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition
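The decision module can be sketched as a one-hidden-layer network scoring candidate transitions. The transition inventory, layer sizes, and random features here are illustrative, not the thesis's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(1)

def mlp_decide(state_features, W1, b1, W2, b2, transitions):
    """Feed the extracted state representation through one hidden layer
    and a softmax over candidate transitions; return the argmax."""
    h = np.tanh(W1 @ state_features + b1)
    scores = W2 @ h + b2
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return transitions[int(np.argmax(probs))], probs

transitions = ["SHIFT", "LEFT-nsubj", "RIGHT-obj"]   # toy inventory
F, H = 12, 6                                         # toy sizes
W1, b1 = rng.normal(size=(H, F)), np.zeros(H)
W2, b2 = rng.normal(size=(len(transitions), H)), np.zeros(len(transitions))
feats = rng.normal(size=F)   # stands in for the extracted state features
move, probs = mlp_decide(feats, W1, b1, W2, b2, transitions)
print(move in transitions)   # True
```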

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words assigned both the correct syntactic head and the correct dependency label.

Example for the fragment "Economic news had":

Gold tree arcs: ATT, SBJ
Pred 1 arcs: PRED, OBJ => LAS 0
Pred 2 arcs: ATT, OBJ => LAS (1/2) * 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers. (5)

5. Source: CoNLL17 official results page.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings


Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings


Our BiLSTM language model word vectors perform better than the Facebook vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings


Both POS tags and context vectors make significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct features of the parser state remains critical.

We are unable to represent the whole parsing history with hand-crafted feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history, as well as the word sequences in the buffer and the stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

Two shared tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team, with an MLP Parser using Context Embeddings

CoNLL18: KParse team, with a Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview. The hidden states of the σ-LSTM, β-LSTM, and Action-LSTM and the t-RNN inputs (head word, dependent word, dependency relation) are concatenated and fed to an MLP.

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
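A FEATS string like the one above can be mapped to a single vector and concatenated with the word, context, and POS vectors. This sketch makes two assumptions of ours: per-feature vectors are pooled by averaging (the thesis may combine them differently), and the tiny dimensions are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
D = 4  # toy embedding size per vector type

feat_emb = {}  # one vector per feature=value pair, created on first use

def morph_feat_vector(feat_string):
    """Turn a UD FEATS string like 'Case=Nom|Number=Sing' into one vector
    by averaging the embeddings of its feature=value pairs (assumption)."""
    if feat_string == "_":              # UD uses '_' for "no features"
        return np.zeros(D)
    vecs = [feat_emb.setdefault(fv, rng.normal(size=D))
            for fv in feat_string.split("|")]
    return np.mean(vecs, axis=0)

def word_representation(word_vec, context_vec, pos_vec, feat_string):
    """Input representation: word, context, POS, and morph-feat vectors
    concatenated, with no hand-crafted feature extractor."""
    return np.concatenate([word_vec, context_vec, pos_vec,
                           morph_feat_vector(feat_string)])

w = word_representation(np.ones(D), np.ones(2 * D), np.ones(D),
                        "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(w.shape)  # (20,)
```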

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM overview, with the β-LSTM component highlighted.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM


Figure: Buffer's β-LSTM over the upcoming buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM overview, with the σ-LSTM component highlighted.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM


Figure: Stack's σ-LSTM over the stack words s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM overview, with the Action-LSTM component highlighted.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM


Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)


Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)    (1)
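Equation (1) maps directly to a few lines of code. The dimensions and random vectors below are toy values for illustration.

```python
import numpy as np

rng = np.random.default_rng(3)
D = 6  # toy embedding size

W_rnn = rng.normal(size=(D, 3 * D)) * 0.1
b_rnn = np.zeros(D)

def t_rnn(w_head, d_label, w_dep):
    """Equation (1): update the head word's embedding from its old value,
    the dependency-label embedding, and the dependent's embedding."""
    return np.tanh(W_rnn @ np.concatenate([w_head, d_label, w_dep]) + b_rnn)

head, dep = rng.normal(size=D), rng.normal(size=D)
nsubj = rng.normal(size=D)          # embedding of the relation label
new_head = t_rnn(head, nsubj, dep)
print(new_head.shape)  # (6,)
```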

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure: β-LSTM recomputes its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123
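The left and right transition rules shown on the preceding slides can be sketched as plain functions over a (stack, buffer, arcs) configuration; an arc (h, d, t) records head h, relation d, and dependent t. This is a minimal sketch of the transition system itself, without any of the LSTM machinery.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the stack top s becomes a d-dependent of the buffer front b."""
    *rest, s = stack
    b = buffer[0]
    return rest, buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t becomes a d-dependent of the word s below it."""
    *rest, s, t = stack
    return rest + [s], buffer, arcs | {(s, d, t)}

def shift(stack, buffer, arcs):
    """Move the front of the buffer onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

# "news had": attach 'news' as the subject of 'had'.
stack, buffer, arcs = [], ["news", "had"], set()
stack, buffer, arcs = shift(stack, buffer, arcs)
stack, buffer, arcs = left_arc(stack, buffer, arcs, "SBJ")
print(arcs)           # {('had', 'SBJ', 'news')}
print(stack, buffer)  # [] ['had']
```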

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
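A standard projectivity check (a generic sketch, not from the thesis): a tree is projective iff for every arc, every token strictly between head and dependent has its own head inside that span.

```python
def is_projective(heads):
    # heads[i] = head of token i+1 (tokens are 1-indexed, 0 denotes the root)
    n = len(heads)
    for dep in range(1, n + 1):
        head = heads[dep - 1]
        lo, hi = min(dep, head), max(dep, head)
        # every token strictly inside the arc span must attach inside it
        for mid in range(lo + 1, hi):
            if not (lo <= heads[mid - 1] <= hi):
                return False
    return True
```

Crossing arcs are detected from whichever arc's span contains exactly one endpoint of the other, so checking all arcs suffices.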

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language | Projectivity % | Best (LAS) | Our (LAS)
grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 26 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high-dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123
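The idea can be sketched with a toy character-level recurrence: read a word character by character and keep the final hidden state as the word vector. An Elman-style cell with fixed toy weights stands in for the trained character LSTM here; `char_word_vector` and the toy character embedding are ours.

```python
import math

def char_word_vector(word, dim=4):
    # h_t = tanh(0.5 * h_{t-1} + emb(c_t)); the final h is the word vector.
    h = [0.0] * dim
    for ch in word:
        # toy deterministic character embedding (a trained table in the real model)
        emb = [((ord(ch) * (i + 1)) % 7 - 3) / 10.0 for i in range(dim)]
        h = [math.tanh(0.5 * h[i] + emb[i]) for i in range(dim)]
    return h
```

Because the state is carried across characters, morphologically related words (shared stems, suffixes) end up with related vectors, which is what makes this useful for out-of-vocabulary words.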

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
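A minimal sketch of such a decision module: a one-hidden-layer MLP scores the candidate transitions and the best *valid* one is chosen. The function name, weight layout and toy values below are illustrative, not the thesis implementation.

```python
import math

def mlp_decide(x, W1, b1, W2, b2, valid):
    # hidden = tanh(W1·x + b1); scores = W2·hidden + b2; argmax over valid transitions
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(W1, b1)]
    scores = [sum(w * hi for w, hi in zip(row, h)) + b
              for row, b in zip(W2, b2)]
    return max(valid, key=lambda t: scores[t])
```

Restricting the argmax to `valid` matters in parsing: e.g. a reduce transition is illegal when the stack is too small, so the highest-scoring *legal* move is taken instead.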

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS):
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Figure: LAS example on "Economic news had": gold tree arcs SBJ, ATT (LAS 1); Pred 1 arcs PRED, OBJ (LAS 0); Pred 2 arcs OBJ, ATT (LAS = (1/2) · 100 = 50)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
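The metric can be computed directly over (head, label) pairs; a word counts as correct only when *both* match, which is why getting the head right with the wrong label still scores zero. A minimal sketch (the function name and data layout are ours):

```python
def las(gold, pred):
    # gold/pred: one (head, label) pair per word, in sentence order
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)
```

On the two-word case from the example, one fully correct arc out of two gives LAS = (1/2) · 100 = 50.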

Experiments (MLP)

CoNLL 2017 Results (all treebanks, LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats | Hungarian | En-ParTUT | Latvian
p | 63.6 | 76.6 | 55.9
v | 73.5 | 75.9 | 63
c | 72.2 | 76 | 63.5
v-c | 76 | 79 | 67.6
p-c | 78 | 82.5 | 70.6
p-v | 76.6 | 80.8 | 67.7
p-fb | 74.7 | 79.7 | 66.3
p-v-c | 79.3 | 83.2 | 74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Figure: Morph-feat embeddings for the word "It" (Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
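One way to realize this, sketched below: split the FEATS string on `|`, look up one embedding per feature=value pair, and combine them (summation here; the combination used in the thesis may differ). The lookup table with toy deterministic initialization stands in for trained parameters.

```python
import random

DIM = 8
_table = {}  # hypothetical embedding table: one vector per feature=value pair

def feat_vec(feat, dim=DIM):
    # lazily initialized toy embedding (trained parameters in the real model)
    if feat not in _table:
        rng = random.Random(feat)  # deterministic toy init keyed on the feature
        _table[feat] = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
    return _table[feat]

def morph_feat_embedding(feats_str):
    # "Case=Nom|Gender=Neut|..." -> sum of per-feature embeddings
    if feats_str == "_":          # CoNLL-U uses "_" for "no features"
        return [0.0] * DIM
    vecs = [feat_vec(f) for f in feats_str.split("|")]
    return [sum(col) for col in zip(*vecs)]
```

Sharing one embedding per feature=value pair means rare feature *bundles* still get sensible vectors, since each component has been seen in other bundles.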

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM overview, highlighting the β-LSTM component

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, an LSTM running over the words wi, wi+1, wi+2 remaining in the buffer

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM overview, highlighting the σ-LSTM component

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, an LSTM running over the stack entries si, si+1, si+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM overview, highlighting the Action-LSTM component

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, an LSTM running over the embeddings of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN, composing the head word, dependency relation and dependent word embeddings

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
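Equation (1) in plain Python, with `W` and `b` standing in for the trained parameters W_rnn and b_rnn (a minimal sketch, not the thesis implementation):

```python
import math

def t_rnn(w_head, d_rel, w_dep, W, b):
    # Eq. (1): w_head_new = tanh(W · [w_head; d_rel; w_dep] + b)
    x = w_head + d_rel + w_dep  # list concatenation = vector concatenation
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]
```

The output replaces the head's embedding, so repeated attachments fold the whole subtree into the head vector.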

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to produce the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
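Both transitions can be sketched as list operations on the configuration (σ, β, A), following the two formulas above. `shift`, `left_arc` and `right_arc` are illustrative names, not the thesis code:

```python
def shift(stack, buffer):
    # move the buffer front onto the stack
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top becomes a d-dependent of the buffer front
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top becomes a d-dependent of the element below it
    t = stack.pop()
    arcs.add((stack[-1], d, t))
```

Run on "Economic news had" (tokens 1-3 with root 0), the sequence shift, left_arc(ATT), shift, left_arc(SBJ), shift, right_arc(ROOT) recovers the gold arcs from the earlier LAS example.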

Final overview of Tree-stack LSTM

Figure: Final overview of the Tree-stack LSTM (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru taiga (10k) | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k) | 56.78 | 58.75
ar padt (120k) | 67.83 | 68.14
en ewt (205k) | 74.87 | 75.77
cs cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87 | 66.94 | 67.03
sv lines | 71.12 | 72.05 | 72.17 | 72.45
tr imst | 57.12 | 56.87 | 57.02 | 57.12
ar padt | 67.83 | 66.67 | 66.89 | 66.92
cs cac | 83.89 | 82.23 | 83.13 | 83.17
en ewt | 75.54 | 75.43 | 75.56 | 75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78 | 53.33
ru taiga (11k) | 59.13 | 60.55
gl treegal (15k) | 69.76 | 70.45
hu szeged (20k) | 66.12 | 68.18
sv lines (49k) | 74.04 | 75.46
tr imst (50k) | 58.12 | 58.75
ar padt (120k) | 68.04 | 68.14
en ewt (204k) | 74.87 | 75.77
cs cac (473k) | 82.89 | 83.57
cs pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of the ablation analysis

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no nynorsklia | 51.13 | 53.33 | 3,583
ru taiga | 58.32 | 60.55 | 10,479
sme giella | 52.78 | 53.39 | 16,385
la perseus | 49.93 | 51.6 | 18,184
ug udt | 52.78 | 53.39 | 19,262
sl sst | 46.72 | 48.77 | 19,473
hu szeged | 66.23 | 68.18 | 20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 27 123

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with two components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
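A minimal sketch of what such a decision module could look like: an MLP maps the extracted state features to a score per transition, and the highest-scoring transition is chosen. The transition names, layer sizes, and ReLU nonlinearity are illustrative assumptions; the slide does not specify the exact layout.

```python
import random

random.seed(0)

def mlp_scores(features, W1, b1, W2, b2):
    """One hidden layer: scores = W2 @ relu(W1 @ features + b1) + b2."""
    h = [max(0.0, sum(w * x for w, x in zip(row, features)) + b)
         for row, b in zip(W1, b1)]
    return [sum(w * x for w, x in zip(row, h)) + b
            for row, b in zip(W2, b2)]

TRANSITIONS = ["shift", "left-arc", "right-arc"]  # hypothetical inventory
in_dim, hid = 8, 5
W1 = [[random.uniform(-1, 1) for _ in range(in_dim)] for _ in range(hid)]
b1 = [0.0] * hid
W2 = [[random.uniform(-1, 1) for _ in range(hid)] for _ in range(len(TRANSITIONS))]
b2 = [0.0] * len(TRANSITIONS)

state_features = [random.uniform(-1, 1) for _ in range(in_dim)]
scores = mlp_scores(state_features, W1, b1, W2, b2)
best = TRANSITIONS[scores.index(max(scores))]  # the next transition
```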

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Example ("Economic news had ..."):

Gold tree (arcs ATT, SBJ): LAS 1

Pred 1 (arcs OBJ, PRED, both wrong): LAS 0

Pred 2 (arcs ATT, OBJ, one of two correct): LAS = (1/2) · 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
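The metric on this slide is easy to sketch directly: a word counts as correct only if its predicted head and its predicted label both match gold. The word indices below are hypothetical stand-ins for the slide's two-arc example.

```python
def las(gold, pred):
    """Labeled Attachment Score: fraction of words whose predicted
    head AND dependency label both match the gold annotation.
    gold/pred: one (head_index, label) pair per word."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return correct / len(gold)

# Slide example, "Economic news had ..." (indices are assumptions):
gold  = [(2, "ATT"), (3, "SBJ")]   # Economic -> news, news -> had
pred1 = [(3, "OBJ"), (2, "PRED")]  # both arcs wrong      -> LAS 0
pred2 = [(2, "ATT"), (3, "OBJ")]   # one of two correct   -> LAS 0.5
```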

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers5

5Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and the stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize word representations by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
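The FEATS string shown in the figure can be sketched as code: split the UD `Key=Value|Key=Value` annotation into pairs and look each pair up in an embedding table. Summing the pair vectors into one word-level morph vector is an assumption for illustration; the thesis may combine them differently.

```python
import random

def parse_feats(feats):
    """Split a UD FEATS string like 'Case=Nom|Number=Sing' into pairs."""
    if feats in ("_", ""):
        return []
    return [tuple(kv.split("=", 1)) for kv in feats.split("|")]

class MorphFeatEmbedder:
    """Toy morph-feat embedder: one random vector per feature=value pair;
    a word's morph vector is the sum of its pairs' vectors (an assumption)."""
    def __init__(self, dim=4, seed=0):
        self.dim, self.table, self.rng = dim, {}, random.Random(seed)

    def vector(self, pair):
        if pair not in self.table:  # lazily allocate an embedding per pair
            self.table[pair] = [self.rng.uniform(-1, 1) for _ in range(self.dim)]
        return self.table[pair]

    def embed(self, feats):
        vecs = [self.vector(p) for p in parse_feats(feats)]
        if not vecs:
            return [0.0] * self.dim
        return [sum(col) for col in zip(*vecs)]
```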

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
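Equation (1) can be sketched directly: concatenate the old head embedding, the relation embedding, and the dependent embedding, then apply an affine map and tanh. The toy dimensions and weight values below are assumptions for illustration.

```python
import math

def t_rnn(w_head, d_rel, w_dep, W, b):
    """Eq. (1): w_head_new = tanh(W_rnn @ [w_head; d_rel; w_dep] + b_rnn),
    where [;] denotes vector concatenation."""
    x = w_head + d_rel + w_dep  # list concatenation == vector concatenation
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# Toy sizes: head/dep embeddings of dim 2, relation embedding of dim 1.
w_head, w_dep, d_rel = [0.5, -0.2], [0.1, 0.3], [1.0]
W = [[0.1, 0.2, 0.3, 0.0, -0.1],   # output dim 2, input dim 2 + 1 + 2 = 5
     [0.0, -0.2, 0.1, 0.4, 0.2]]
b = [0.0, 0.1]
new_head = t_rnn(w_head, d_rel, w_dep, W, b)
```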

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
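The two transition rules on the preceding slides can be sketched over a configuration (stack σ, buffer β, arc set A). The word ids and labels in the usage example are hypothetical; only the state updates follow the slides' formulas.

```python
def shift(stack, buffer, arcs):
    """shift: move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    stack top s becomes a d-dependent of buffer front b."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    stack top t becomes a d-dependent of s, the word below it."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy derivation over word ids 1..3 (0 = root):
stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer, arcs)           # stack [0, 1], buffer [2, 3]
left(stack, buffer, arcs, "ATT")     # arc (2, ATT, 1)
shift(stack, buffer, arcs)           # stack [0, 2], buffer [3]
left(stack, buffer, arcs, "SBJ")     # arc (3, SBJ, 2)
shift(stack, buffer, arcs)           # stack [0, 3], buffer []
right(stack, buffer, arcs, "root")   # arc (0, root, 3)
```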

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split change, 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What do Morphological Feature Embeddings provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.70        75.80           75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions use gold moves
Dynamic oracle: transitions use predicted moves

In both cases the log-probability of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
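The difference above fits in a few lines: both regimes accumulate −log p(gold move), but the static oracle advances the state with the gold move while the dynamic oracle advances it with the model's prediction. The toy state, oracle, and score function are stand-ins; the real state is the tree-stack LSTM configuration.

```python
import math

class ToyState:
    """Minimal stand-in for a parser configuration (an assumption)."""
    def __init__(self, n_steps):
        self.t, self.n = 0, n_steps
        self.history = []
    def is_final(self):
        return self.t >= self.n
    def apply(self, move):
        self.history.append(move)
        self.t += 1

def train_sentence(state, gold_oracle, score_fn, dynamic):
    """Static: follow the gold move at every step.
    Dynamic: follow the predicted move instead.
    Either way the loss maximizes log p(gold move)."""
    loss = 0.0
    while not state.is_final():
        scores = gold_oracle(state), score_fn(state)  # gold move, move -> prob
        gold, probs = scores
        loss += -math.log(probs[gold])                # -log p(gold)
        move = max(probs, key=probs.get) if dynamic else gold
        state.apply(move)
    return loss

# Toy setup: gold is always "shift"; the model prefers "left".
oracle = lambda s: "shift"
probs = lambda s: {"shift": 0.25, "left": 0.5, "right": 0.25}

static = ToyState(4)
train_sentence(static, oracle, probs, dynamic=False)
dynamic = ToyState(4)
train_sentence(dynamic, oracle, probs, dynamic=True)
```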

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees6

6Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
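The projectivity constraint mentioned above is checkable in a few lines: a tree is projective iff for every arc (h, d), every word strictly between h and d is a descendant of h. This uses the standard head-array encoding, not anything specific to the thesis.

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (words 1..n, 0 = root).
    Projective iff for every arc (h, d), every word strictly between
    h and d is a descendant of h."""
    n = len(heads)

    def ancestors(i):
        seen = set()
        while i != 0 and i not in seen:  # walk up the head chain
            seen.add(i)
            i = heads[i - 1]
        seen.add(0)  # the root dominates everything
        return seen

    for d in range(1, n + 1):
        h = heads[d - 1]
        lo, hi = min(h, d), max(h, d)
        if any(h not in ancestors(k) for k in range(lo + 1, hi)):
            return False
    return True
```

For example, `[2, 0, 2]` (word 2 is the root, words 1 and 3 attach to it) is projective, while `[3, 0, 2, 2]` has crossing arcs and is not.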

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Related Work

Omer Kırnap (Koc University) MSc Thesis September 27 2018 28 123

Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearityHoweverImpractical for high dimensional inputs they scale linearly in inputdimensions (in both time and space assuming fixed number of hiddenunits)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
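A toy sketch of turning a UD FEATS string like the one above into a fixed-size vector. The summing scheme, lazy embedding table, and dimension are illustrative assumptions, not necessarily the thesis design:

```python
import numpy as np

DIM = 32                      # hypothetical morph-feat embedding size
rng = np.random.default_rng(0)
feat_table = {}               # one embedding per observed "Key=Value" pair

def morph_feat_vector(feats):
    """Embed each Key=Value pair of a FEATS string and sum the
    embeddings into one fixed-size morph-feat vector."""
    vec = np.zeros(DIM)
    for pair in feats.split("|"):
        if pair not in feat_table:               # lazily allocate embeddings
            feat_table[pair] = 0.1 * rng.normal(size=DIM)
        vec += feat_table[pair]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```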

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture diagram (as on slide 58)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM running over the upcoming words wi, wi+1, wi+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture diagram (as on slide 58)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM running over the stack items si, si+1, si+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture diagram (as on slide 58)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM running over the sequence of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
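Equation (1) can be written out directly in code. The sizes below are hypothetical, chosen only to make the sketch runnable:

```python
import numpy as np

H, D = 16, 8                                     # hypothetical word and relation dims
rng = np.random.default_rng(0)
W_rnn = 0.1 * rng.normal(size=(H, 2 * H + D))    # composition weights
b_rnn = np.zeros(H)

def compose(w_head_old, d_l, w_dep):
    """Eq. (1): the new head embedding is a tanh of an affine map over
    the concatenated [head; relation; dependent] vectors."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_new = compose(rng.normal(size=H), rng.normal(size=D), rng.normal(size=H))
```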

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
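The two transitions above, together with shift, can be sketched on a simple parser state following the set definitions on the slides. Words are represented as integer positions and the label "amod" is a hypothetical example; this is a sketch, not the thesis implementation:

```python
def shift(stack, buffer, arcs):
    """shift: move the buffer front b onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    """left_d(sigma|s, b|beta, A) = (sigma, b|beta, A U {(b, d, s)}):
    pop the stack top s and attach it to the buffer front b with label d."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    """right_d(sigma|s|t, beta, A) = (sigma|s, beta, A U {(s, d, t)}):
    pop the stack top t and attach it to the element s below it with label d."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# One possible sequence for a 2-word sentence [1, 2] where word 2 heads word 1:
state = shift([], [1, 2], set())
state = left(*state, d="amod")
```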

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM architecture: the hidden states of the σ-LSTM, β-LSTM, and Action-LSTM are concatenated and fed to an MLP; the t-RNN combines a head word, a dependent word, and their dependency relation into a new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (parser state features fed directly to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu_szeged  66.21  66.87        66.94   67.03
sv_lines   71.12  72.05        72.17   72.45
tr_imst    57.12  56.87        57.02   57.12
ar_padt    67.83  66.67        66.89   66.92
cs_cac     83.89  82.23        83.13   83.17
en_ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture diagram (as on slide 58)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.6            18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens between 50k and 100k:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.7         75.8            75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121064
bg_btb     84.53        84.55           124336
en_ewt     75.77        75.682          204585
ar_padt    68.02        68.14           223881
de_gsd     71.59        71.32           263804
ca_ancora  85.89        85.874          417587
es_ancora  84.99        84.78           444617
cs_cac     83.57        83.63           472608
cs_pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of gold moves is maximized

Figure: Tree-stack LSTM architecture diagram (as on slide 58)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
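The distinction can be sketched as one training pass over a toy state. The interfaces here are illustrative stand-ins, not the thesis code: both modes add -log p(gold) to the loss and differ only in which move is actually executed.

```python
import math

class ToyState:
    """Stand-in parser state that just counts remaining steps."""
    def __init__(self, steps=3):
        self.steps = steps
    def done(self):
        return self.steps == 0
    def apply(self, move):
        return ToyState(self.steps - 1)

def train_sentence(state, gold_fn, model_fn, oracle="static"):
    loss = 0.0
    while not state.done():
        gold = gold_fn(state)            # gold move at this state
        probs = model_fn(state)          # model distribution over moves
        loss -= math.log(probs[gold])    # maximize log p(gold) in both modes
        # static: follow the gold move; dynamic: follow the model's prediction
        move = gold if oracle == "static" else max(probs, key=probs.get)
        state = state.apply(move)
    return loss

uniform = lambda s: {"shift": 0.5, "left": 0.5}
loss = train_sentence(ToyState(3), lambda s: "shift", uniform, oracle="dynamic")
```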

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with fewer than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train a LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
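Projectivity itself can be checked with a short sketch: a dependency tree is projective iff no two arcs cross when drawn above the sentence (heads are 1-based word positions, with 0 marking the root).

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (0 = artificial root).
    Return True iff no two arcs have strictly interleaved endpoints."""
    spans = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in spans:
        for l2, r2 in spans:
            if l1 < l2 < r1 < r2:   # strictly interleaved endpoints = crossing
                return False
    return True
```

For example, heads = [2, 3, 0] (a chain like "Economic" -> "news" -> "had") is projective, while [0, 4, 1, 1] contains the crossing arcs (1, 3) and (2, 4).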

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, β-LSTM states, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Related Work

Neural Networks for Feature Conjunctions

Neural networks can handle feature conjunctions and nonlinearity. However, they are impractical for high dimensional inputs: they scale linearly in input dimensions (in both time and space, assuming a fixed number of hidden units).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 29 123

Related Work

Solution: Using dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain Context and Word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word vectors

Figure: Character LSTM, from Kırnap et al., 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates context vectors

Figure: Word BiLSTM, from Kırnap et al., 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123
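The two components can be sketched with toy recurrences. The update rules and dimensions below are simplified stand-ins for the actual LSTMs, chosen only to illustrate the data flow (characters -> word vector, word vectors -> context vectors):

```python
import numpy as np

DIM = 8   # hypothetical hidden size

def char_word_vec(word):
    """Stand-in for the character LSTM: fold the characters of a word
    left-to-right into one fixed-size word vector."""
    h = np.zeros(DIM)
    for ch in word:
        x = np.full(DIM, (ord(ch) % 13) / 13.0)   # toy character embedding
        h = np.tanh(0.5 * h + x)                  # toy recurrent update
    return h

def context_vecs(word_vecs):
    """Stand-in for the word BiLSTM: each word's context vector is the
    concatenation of a forward and a backward recurrence state."""
    n = len(word_vecs)
    f, fs = np.zeros(DIM), []
    for v in word_vecs:
        f = np.tanh(0.5 * f + v)
        fs.append(f)
    b, bs = np.zeros(DIM), []
    for v in reversed(word_vecs):
        b = np.tanh(0.5 * b + v)
        bs.append(b)
    return [np.concatenate([fs[i], bs[n - 1 - i]]) for i in range(n)]

sentence = [char_word_vec(w) for w in ["economic", "news", "had"]]
ctx = context_vecs(sentence)
```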

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al., 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Figure: Example for "Economic news had": the gold tree (arcs SBJ and ATT) scores LAS 1; a prediction with both arcs wrong (PRED, OBJ) scores LAS 0; a prediction with one of the two arcs correct (OBJ, ATT) scores LAS (1/2)·100 = 50.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
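The metric itself reduces to a few lines. Gold and prediction each give one (head, label) pair per word; the labels below follow the slide's example, with the head/label pairing arranged for illustration:

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of words whose predicted
    head AND dependency label both match the gold tree."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

# "Economic news": gold analysis vs a prediction with one of two arcs correct
gold = [(2, "ATT"), (3, "SBJ")]
pred = [(2, "ATT"), (3, "OBJ")]
```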

Experiments (MLP)

CoNLL 2017 Results (all treebanks, LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview, highlighting the t-RNN over head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33            3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.6            18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.7         75.8            75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12            121064
bg_btb     84.53        84.55            124336
en_ewt     75.77        75.682           204585
ar_padt    68.02        68.14            223881
de_gsd     71.59        71.32            263804
ca_ancora  85.89        85.874           417587
es_ancora  84.99        84.78            444617
cs_cac     83.57        83.63            472608
cs_pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases the log-probability of the gold moves is maximized.

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
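The distinction between the two oracles can be reduced to one flag in the training loop. In this sketch, `model_predict` is a hypothetical stand-in for the scoring model and the 0/1 loss stands in for -log p(gold); the thesis uses an actual probabilistic model.

```python
def train_step(gold_move, model_predict, follow_predicted):
    """One oracle training step. Both regimes maximize log p(gold move);
    they differ only in which move the parser executes to reach the
    next state."""
    predicted = model_predict()
    loss = 0.0 if predicted == gold_move else 1.0  # stand-in for -log p(gold)
    executed = predicted if follow_predicted else gold_move
    return executed, loss

# Static oracle executes the gold move; dynamic oracle executes the prediction
ex_static, _ = train_step("SHIFT", lambda: "LEFT", follow_predicted=False)
ex_dynamic, _ = train_step("SHIFT", lambda: "LEFT", follow_predicted=True)
```

With a dynamic oracle the parser visits states that its own (possibly wrong) predictions reach, so it learns to recover from errors at test time.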

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af_afribooms   not provided  75.46  77.43  78.12
kk_ktb         20.19         22.31  21.96  23.86
bxr_bdt         7.64          9.76   9.93   8.98
kmr_mg         20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees. 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
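Projectivity is easy to test: a tree is projective iff no two dependency arcs cross when drawn above the sentence. A small checker (my own illustrative sketch, using 1-based token indices with head 0 for the artificial root):

```python
def is_projective(heads):
    """heads[i-1] is the head index of token i (0 = artificial root).
    Returns True iff no two arcs cross."""
    arcs = [(min(h, i), max(h, i)) for i, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:  # arcs (l1,r1) and (l2,r2) interleave
                return False
    return True

# heads [2, 3, 0]: a chain 1->2->3->root, no crossings
# heads [3, 4, 0, 3]: arcs (1,3) and (2,4) cross
```

A transition-based parser as defined on the previous slides cannot produce a tree for which this check fails, which is why low projectivity ratios hurt its scores.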

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity %  Best (LAS)  Our (LAS)
grc_perseus   90.7            79.39       55.03 (20)
eu_bdt        95.13           84.22       74.13 (17)
hu_szeged     97.8            82.66       68.18 (14)
da_ddt        98.26           86.28       76.40 (17)
en_gum        99.6            85.05       76.44 (15)
gl_treegal   100              74.25       70.45 (10)
gl_ctg       100              82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Related Work

Solution: use dense embeddings for input features

Omer Kırnap (Koc University) MSc Thesis September 27 2018 30 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 31 123

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks on Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with the MLP Parser using Context Embeddings

CoNLL18: KParse team with the Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks on Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17: Koc-University team with the MLP Parser using Context Embeddings

CoNLL18: KParse team with the Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

The LM is used to obtain context and word embeddings, with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
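The two components above can be sketched as follows. This is a toy stand-in (plain tanh RNNs instead of LSTMs, random untrained weights, illustrative dimensions) meant only to show the data flow: a character-level RNN produces a word vector, and a forward plus backward pass over word vectors produces per-token context vectors.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 8  # shared character/word hidden size, illustrative only

char_emb = {c: rng.normal(size=DIM) for c in "abcdefghijklmnopqrstuvwxyz"}
W = rng.normal(scale=0.1, size=(DIM, DIM))  # hidden-to-hidden weights
U = rng.normal(scale=0.1, size=(DIM, DIM))  # input-to-hidden weights

def word_vector(word):
    """Run a simple RNN over characters; final hidden state = word vector."""
    h = np.zeros(DIM)
    for c in word:
        h = np.tanh(W @ h + U @ char_emb[c])
    return h

def context_vectors(words):
    """Forward and backward RNN over word vectors, concatenated per token."""
    vecs = [word_vector(w) for w in words]
    def run(seq):
        h, out = np.zeros(DIM), []
        for v in seq:
            h = np.tanh(W @ h + U @ v)
            out.append(h)
        return out
    fwd = run(vecs)
    bwd = run(vecs[::-1])[::-1]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

ctx = context_vectors(["economic", "news", "had"])
```

In the actual model both RNNs are LSTMs trained with a language-modeling objective; only the shapes and the character-to-word-to-context pipeline carry over from this sketch.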

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123
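A minimal sketch of such a decision module, assuming a one-hidden-layer MLP over a feature vector describing the parser state. The transition inventory, dimensions, and weights here are illustrative, not the thesis configuration:

```python
import numpy as np

rng = np.random.default_rng(1)

TRANSITIONS = ["SHIFT", "LEFT-nsubj", "RIGHT-obj"]  # illustrative subset
FEAT_DIM, HIDDEN = 12, 16

W1 = rng.normal(scale=0.1, size=(HIDDEN, FEAT_DIM))
b1 = np.zeros(HIDDEN)
W2 = rng.normal(scale=0.1, size=(len(TRANSITIONS), HIDDEN))
b2 = np.zeros(len(TRANSITIONS))

def decide(state_features):
    """Score every transition with a one-hidden-layer MLP and return
    the highest-scoring one."""
    h = np.tanh(W1 @ state_features + b1)
    scores = W2 @ h + b2
    return TRANSITIONS[int(np.argmax(scores))]

next_move = decide(rng.normal(size=FEAT_DIM))
```

At parse time this function is called once per step, and the chosen transition updates the stack and buffer before the next state is featurized.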

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Example for the phrase "Economic news had":

Gold tree: arcs SBJ and ATT, so LAS = 1

Prediction 1: arcs PRED and OBJ, both wrong, so LAS = 0

Prediction 2: arcs OBJ and ATT, one of two correct, so LAS = (1/2) × 100 = 50

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
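The metric can be made concrete with a few lines of code. This is a simplified scorer over (head, label) pairs, not the official CoNLL evaluator:

```python
def las(gold, pred):
    """Labeled Attachment Score: percentage of tokens whose predicted
    (head, label) pair exactly matches the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# "Economic news had": one (head, label) pair per dependent word
gold = [("news", "ATT"), ("had", "SBJ")]
pred1 = [("had", "OBJ"), ("news", "PRED")]  # both arcs wrong
pred2 = [("news", "ATT"), ("had", "OBJ")]   # one of two arcs correct
```

Running `las` on the two predictions reproduces the slide: 0 for the first and 50 for the second.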

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition-based parsers. 5

5 Source: CoNLL17 official results page

Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors make significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview: σ-LSTM, β-LSTM, and Action-LSTM outputs, combined with t-RNN head embeddings, are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (for the word "It")

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview, here focusing on the β-LSTM component]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview, here focusing on the σ-LSTM component]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview, here focusing on the Action-LSTM component]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the sequence of past transitions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn ∗ [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
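Equation (1) can be written directly in code; the dimensions and random weights below are illustrative, not the trained parameters:

```python
import numpy as np

rng = np.random.default_rng(3)
D = 8  # shared embedding size for words and dependency labels

W_rnn = rng.normal(scale=0.1, size=(D, 3 * D))
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_label, w_dep):
    """Equation (1): the new head embedding is computed from the
    concatenation of the old head, dependency-label, and dependent
    embeddings."""
    x = np.concatenate([w_head_old, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

new_head = t_rnn(rng.normal(size=D), rng.normal(size=D), rng.normal(size=D))
```

Because the output has the same size as a word embedding, the updated head can be pushed back onto the stack or buffer in place of the old one.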

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
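The left and right transitions above can be sketched on a plain (stack, buffer, arcs) configuration. This shows only the bookkeeping of the transition system, not the LSTM and t-RNN updates:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    stack top s becomes a d-dependent of buffer front b and is popped."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    stack top t becomes a d-dependent of the item s below it and is popped."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

def shift(stack, buffer, arcs):
    """Move the buffer front onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

# "news had": attach "news" to "had" with a left arc labeled SBJ
stack, buffer, arcs = shift(["ROOT"], ["news", "had"], set())
stack, buffer, arcs = left_arc(stack, buffer, arcs, "SBJ")
```

Arcs are stored as (head, label, dependent) triples, matching the set-union notation in the transition definitions.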

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM: σ-LSTM, β-LSTM, and Action-LSTM outputs, combined with t-RNN head embeddings, are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language      Projectivity (%)  Best (LAS)  Ours (LAS)
grc perseus   90.7              79.39       55.03 (20)
eu bdt        95.13             84.22       74.13 (17)
hu szeged     97.8              82.66       68.18 (14)
da ddt        98.26             86.28       76.40 (17)
en gum        99.6              85.05       76.44 (15)
gl treegal    100               74.25       70.45 (10)
gl ctg        100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases7

7From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
  • Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
  • Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
  • Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
  • Conclusion
  • Future Work & Discussions


t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

3 Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 32 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18bull KParse team with Tree-stack LSTM Parser using Context and

Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 33 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using ContextEmbeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser using Context and Morph-featEmbeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 34 123

a Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors make significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTM's word vectors

Word Based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
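The concatenation above can be sketched as follows (toy dimensions; the helper name and vector sizes are illustrative, not the thesis code):

```python
def word_representation(word_vec, context_vec, pos_vec, morph_vec):
    """One input vector per word: the four embedding sources,
    concatenated in order (real sizes are model hyperparameters)."""
    return word_vec + context_vec + pos_vec + morph_vec  # list concatenation

# Toy vectors standing in for the four embedding sources
x = word_representation([0.1] * 3, [0.2] * 4, [0.3] * 2, [0.4] * 2)
```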

Input Representation

Morph-feat Vectors

Example: the word "It" with FEATS Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
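A minimal sketch of turning a UD FEATS string into a vector, assuming one learned embedding per Key=Value pair combined by summation (the summation and the dimension are assumptions; the exact combination in the thesis may differ):

```python
import random

random.seed(0)
DIM = 8            # toy embedding size (a hyperparameter)
feat_emb = {}      # one vector per "Key=Value" feature, created lazily

def morph_feat_vector(feats):
    """Embed a UD FEATS string such as 'Case=Nom|Number=Sing' by
    summing one vector per Key=Value pair."""
    if feats == "_":                      # UD convention for "no features"
        return [0.0] * DIM
    total = [0.0] * DIM
    for kv in feats.split("|"):
        if kv not in feat_emb:            # lazily initialize an embedding
            feat_emb[kv] = [random.gauss(0.0, 1.0) for _ in range(DIM)]
        total = [t + e for t, e in zip(total, feat_emb[kv])]
    return total

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```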

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure: Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure: Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
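Equation (1) can be sketched in plain Python with toy dimensions (W_rnn and b_rnn below are random stand-ins for the learned parameters):

```python
import math
import random

random.seed(0)
D = 4            # toy embedding dimension
IN = 3 * D       # [head; dependency-label; dependent] concatenated

# Stand-ins for the learned parameters W_rnn and b_rnn of Eq. (1)
W_rnn = [[random.uniform(-0.1, 0.1) for _ in range(IN)] for _ in range(D)]
b_rnn = [0.0] * D

def trnn(w_head, d_label, w_dep):
    """Eq. (1): new head = tanh(W_rnn . [w_head; d_label; w_dep] + b_rnn)."""
    x = w_head + d_label + w_dep          # concatenate the three inputs
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

new_head = trnn([0.5] * D, [0.1] * D, [-0.2] * D)
```

The composed head embedding replaces the old one on the stack, so later transitions see the subtree it already governs.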

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
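The two transition definitions update the parser state (stack, buffer, arc set) as sketched below; the shift helper is an assumption added only to make the toy run complete, since shift is not shown on these slides:

```python
def shift(stack, buffer):
    """Hypothetical helper: move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    """left_d(sigma|s, b|beta, A) = (sigma, b|beta, A u {(b, d, s)}):
    pop s and attach it to the buffer front b with relation d."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))     # (head, relation, dependent)

def right_arc(stack, buffer, arcs, d):
    """right_d(sigma|s|t, beta, A) = (sigma|s, beta, A u {(s, d, t)}):
    pop t and attach it to the new stack top s with relation d."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run over word indices 1..3
stack, buffer, arcs = [], [1, 2, 3], set()
shift(stack, buffer)                    # stack=[1]
left_arc(stack, buffer, arcs, "amod")   # word 2 becomes head of word 1
shift(stack, buffer)
shift(stack, buffer)
right_arc(stack, buffer, arcs, "obj")   # word 2 becomes head of word 3
```

After the run, the stack holds only word 2 and the arc set records both attachments.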

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation
  17 universal part-of-speech tags
  37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation
  17 universal part-of-speech tags
  37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.70        75.80           75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.68           204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.87           417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, log p of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
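The distinction can be sketched as a single training step; `model_predict` and the 0/1 loss below are hypothetical stand-ins for the trained classifier and for -log p(gold):

```python
def train_step(state, gold_move, model_predict, dynamic):
    """One training step: the loss always targets the gold move; the two
    oracles differ only in which move advances the parser state."""
    predicted = model_predict(state)
    loss = 0.0 if predicted == gold_move else 1.0   # stand-in for -log p(gold)
    next_move = predicted if dynamic else gold_move
    return next_move, loss

model_predict = lambda state: "SHIFT"               # toy model, always shifts
static_move, loss = train_step("s0", "LEFT", model_predict, dynamic=False)
dynamic_move, _ = train_step("s0", "LEFT", model_predict, dynamic=True)
```

With the static oracle the parser keeps following gold moves even when the model is wrong; with the dynamic oracle it follows its own prediction and so visits the error states it will face at test time.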

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors and use them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained with a different language but from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training the LM from scratch does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
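A common way to check this property (a sketch, not the thesis code) is to test whether any two dependency arcs cross:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (words are 1-based, 0 is the root).
    A tree is projective iff no two dependency arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:     # arcs overlap without nesting
                return False
    return True

# heads=[2, 0, 2]: words 1 and 3 attach to word 2 (the root) -> projective
# heads=[3, 4, 0, 3]: arcs (1,3) and (2,4) cross -> non-projective
```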

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
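The left and right transitions defined above can be sketched directly on a (stack, buffer, arcs) configuration; the toy words and labels below are illustrative only:

```python
def left_arc(stack, buffer, arcs, label):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes the head of the stack top s, which is popped.
    s = stack.pop()
    arcs.add((buffer[0], label, s))

def right_arc(stack, buffer, arcs, label):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t becomes a dependent of s, the item below it.
    t = stack.pop()
    arcs.add((stack[-1], label, t))

# tiny walk-through on word strings instead of indices
stack, buffer, arcs = ["news"], ["had"], set()
left_arc(stack, buffer, arcs, "nsubj")   # 'had' heads 'news'
assert stack == [] and ("had", "nsubj", "news") in arcs

stack2 = ["had", "effect"]
right_arc(stack2, [], arcs, "obj")       # 'had' heads 'effect'
assert stack2 == ["had"] and ("had", "obj", "effect") in arcs
```

Arcs are stored as (head, label, dependent) triples, matching the (b, d, s) and (s, d, t) notation in the transition definitions.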

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview: the σ-, β-, and action-LSTM states are concatenated and passed to an MLP, while the t-RNN combines the head word, dependent word, and dependency-relation embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
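The Concat + MLP decision step in this overview can be sketched as follows; the ReLU hidden layer, the vector sizes, and the four-transition inventory are assumptions for illustration, not the thesis' exact configuration:

```python
import numpy as np

def predict_transition(hiddens, W1, b1, W2, b2):
    """Concatenate the component summaries (e.g. σ-LSTM, β-LSTM,
    action-LSTM states) and score transitions with a one-hidden-layer MLP."""
    x = np.concatenate(hiddens)
    h = np.maximum(0.0, W1 @ x + b1)   # ReLU hidden layer (assumed)
    logits = W2 @ h + b2
    e = np.exp(logits - logits.max())  # numerically stable softmax
    return e / e.sum()                 # distribution over transitions

rng = np.random.default_rng(1)
hiddens = [rng.normal(size=8) for _ in range(3)]  # three component states
W1, b1 = rng.normal(size=(16, 24)), np.zeros(16)
W2, b2 = rng.normal(size=(4, 16)), np.zeros(4)    # e.g. shift/left/right/swap
p = predict_transition(hiddens, W1, b1, W2, b2)
assert p.shape == (4,) and abs(p.sum() - 1.0) < 1e-9
```

At parse time, the argmax of this distribution (restricted to transitions that are valid in the current configuration) is applied to the parser state.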

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations

Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

Dependency parsing of 82 treebanks in 57 languages

All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations

Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the tasks: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM overview: the σ-, β-, and action-LSTM states are concatenated and passed to an MLP, while the t-RNN combines the head word, dependent word, and dependency-relation embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123
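A minimal sketch of a morph-feat embedding, assuming each "Feature=Value" pair gets its own vector and the pairs of a word are summed (the thesis may combine them differently, e.g. with an RNN over the features):

```python
import numpy as np

DIM = 4                       # made-up embedding size
rng = np.random.default_rng(2)
feat_table = {}               # one vector per "Feature=Value" pair, on demand

def morph_feat_embedding(feats):
    """Turn a UD morphology string such as
    'Case=Nom|Gender=Neut|Number=Sing' into one vector by summing
    per-feature embeddings."""
    total = np.zeros(DIM)
    for fv in feats.split("|"):
        if fv not in feat_table:
            feat_table[fv] = rng.normal(size=DIM)
        total += feat_table[fv]
    return total

v = morph_feat_embedding("Case=Nom|Gender=Neut|Number=Sing")
assert v.shape == (DIM,)
```

Summation keeps the representation fixed-size no matter how many features a word carries, so it can be concatenated with the POS, word, and context vectors.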

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.60            18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81            48325
fr sequoia     84.36        82.17            50543
en gum         76.44        75.34            53686
ko gsd         73.74        72.54            56687
eu bdt         74.55        73.32            72974
nl lassysmall  76.70        75.80            75134
gl ctg         79.02        79.018           79327
lv lvtb        72.33        72.24            80666
id gsd         75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.

Figure: Tree-stack LSTM overview: the σ-, β-, and action-LSTM states are concatenated and passed to an MLP, while the t-RNN combines the head word, dependent word, and dependency-relation embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
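The static/dynamic distinction can be sketched as follows; the toy move inventory and the fixed model distribution are invented for illustration:

```python
import math

def run_oracle_training(gold_moves, model_probs, dynamic):
    """Schematic per-sentence loss: each step adds -log p(gold move).
    A static oracle then *takes* the gold move (teacher forcing);
    a dynamic oracle takes the model's argmax move instead, so training
    visits configurations the parser will actually reach at test time."""
    loss, taken = 0.0, []
    for step, gold in enumerate(gold_moves):
        probs = model_probs(step, taken)           # distribution over moves
        loss += -math.log(max(probs[gold], 1e-9))  # gold log-prob, both modes
        taken.append(gold if not dynamic else max(probs, key=probs.get))
    return loss, taken

# a toy fixed model that always prefers "shift"
model = lambda step, taken: {"shift": 0.5, "left": 0.3, "right": 0.2}
gold = ["shift", "left", "right"]
static_loss, static_path = run_oracle_training(gold, model, dynamic=False)
dynamic_loss, dynamic_path = run_oracle_training(gold, model, dynamic=True)
assert static_path == gold
assert dynamic_path == ["shift", "shift", "shift"]
```

Both runs accumulate the same gold log-probability terms; they differ only in which configurations the training trajectory visits.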

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
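Projectivity can be checked by testing for crossing arcs; this is a standard check, not code from the thesis:

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (0 denotes the artificial root).
    A dependency tree is projective iff no two arcs cross when drawn
    above the sentence; transition based parsers of the kind described
    here can only produce such trees."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, d in arcs:
            if a < c < b < d:   # spans (a,b) and (c,d) interleave: crossing
                return False
    return True

assert is_projective([2, 0, 2])        # 1 <- 2 -> 3, no crossings
assert not is_projective([4, 3, 0, 3]) # arc 4->1 crosses the root arc to 3
```

Equivalently, a tree is projective when every head's descendants form a contiguous span of the sentence.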

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.70        79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.80        82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.60        85.05       76.44 (15)
gl treegal   100.00        74.25       70.45 (10)
gl ctg       100.00        82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or over the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language     | (1)          | (2)   | (3)   | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb       | 20.19        | 22.31 | 21.96 | 23.86
bxr bdt      | 7.64         | 9.76  | 9.93  | 8.98
kmr mg       | 20.12        | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on very limited data does not bring useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
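A tree is projective when no two arcs cross if all arcs are drawn above the sentence. A small self-contained check (an illustrative sketch, not from the thesis; the encoding `heads[i-1]` = head of 1-based word `i`, with 0 for the root, is my assumption):

```python
def is_projective(heads):
    """heads[i-1] is the head of word i (words are 1-based, 0 is the root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, e in arcs:
            if a < c < b < e:  # arc (a, b) crosses arc (c, e)
                return False
    return True

projective_tree = [2, 0, 2, 3]  # no crossing arcs
crossing_tree = [3, 4, 0, 3]    # arcs (1, 3) and (2, 4) cross
```

This is the property that limits a plain transition-based parser: a sentence like `crossing_tree` cannot be produced by shift/left/right transitions alone.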

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language    | Projectivity | Best (LAS) | Our (LAS)
grc perseus | 90.7         | 79.39      | 55.03 (20)
eu bdt      | 95.13        | 84.22      | 74.13 (17)
hu szeged   | 97.8         | 82.66      | 68.18 (14)
da ddt      | 98.26        | 86.28      | 76.40 (17)
en gum      | 99.6         | 85.05      | 76.44 (15)
gl treegal  | 100          | 74.25      | 70.45 (10)
gl ctg      | 100          | 82.12      | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


a) Language Model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 35 123

Language Model (LM)

LM is used to obtain Context and Word embeddings with two components:

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123
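As a rough sketch of this two-part pipeline (toy dimensions; a plain tanh RNN stands in for the LSTMs, and all weights are random stand-ins rather than trained LM parameters):

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8  # toy hidden size (assumption; the thesis uses larger dimensions)

def rnn(inputs, W, U, b):
    """Minimal tanh RNN; returns the list of hidden states."""
    h = np.zeros(D)
    states = []
    for x in inputs:
        h = np.tanh(W @ x + U @ h + b)
        states.append(h)
    return states

# Random stand-ins for the trained weights (char, forward, backward).
Wc, Uc, bc = rng.normal(size=(D, D)), rng.normal(size=(D, D)), np.zeros(D)
Wf, Uf, bf = rng.normal(size=(D, D)), rng.normal(size=(D, D)), np.zeros(D)
Wb, Ub, bb = rng.normal(size=(D, D)), rng.normal(size=(D, D)), np.zeros(D)

def char_embed(ch):
    """Deterministic toy character embedding."""
    return np.random.default_rng(ord(ch)).normal(size=D)

def word_vector(word):
    """Character-based RNN: the last hidden state summarizes the word."""
    return rnn([char_embed(c) for c in word], Wc, Uc, bc)[-1]

def context_vectors(sentence):
    """Word-based bidirectional RNN: concatenate forward and backward
    hidden states to get one context vector per word."""
    wvs = [word_vector(w) for w in sentence]
    fwd = rnn(wvs, Wf, Uf, bf)
    bwd = rnn(wvs[::-1], Wb, Ub, bb)[::-1]
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

sent = ["Economic", "news", "had", "little", "effect"]
cvs = context_vectors(sent)
```

The key structural point survives the simplification: word vectors are built bottom-up from characters, while context vectors depend on the whole sentence around each word.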

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b) MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

[Figure: three dependency analyses of "Economic news had ..."]
Gold tree: arcs ATT (Economic) and SBJ (news); LAS = 1
Pred 1: arcs OBJ and PRED; both labels wrong, LAS = 0
Pred 2: arcs ATT and OBJ; one of two correct, LAS = (1/2) × 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
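The metric itself is easy to state in code. A minimal sketch, assuming each word is represented as a (head, label) pair; the function name `las` and the pair encoding are mine, not from the thesis:

```python
def las(gold, pred):
    """Labeled Attachment Score: the percentage of words whose predicted
    head AND dependency label both match the gold tree."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# (head, label) per word for "Economic" and "news" in "Economic news had":
gold = [("news", "ATT"), ("had", "SBJ")]
pred1 = [("news", "OBJ"), ("had", "PRED")]  # both labels wrong -> LAS 0
pred2 = [("news", "ATT"), ("had", "OBJ")]   # one of two correct -> LAS 50
```

This reproduces the three cases on the slide: the gold tree scores 100, Pred 1 scores 0, and Pred 2 scores 50.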

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p     | 63.6      | 76.6      | 55.9
v     | 73.5      | 75.9      | 63.0
c     | 72.2      | 76.0      | 63.5
v-c   | 76.0      | 79.0      | 67.6
p-c   | 78.0      | 82.5      | 70.6
p-v   | 76.6      | 80.8      | 67.7
p-fb  | 74.7      | 79.7      | 66.3
p-v-c | 79.3      | 83.2      | 74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats | Hungarian | En-ParTUT | Latvian
p     | 63.6      | 76.6      | 55.9
v     | 73.5      | 75.9      | 63.0
c     | 72.2      | 76.0      | 63.5
v-c   | 76.0      | 79.0      | 67.6
p-c   | 78.0      | 82.5      | 70.6
p-v   | 76.6      | 80.8      | 67.7
p-fb  | 74.7      | 79.7      | 66.3
p-v-c | 79.3      | 83.2      | 74.2

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p     | 63.6      | 76.6      | 55.9
v     | 73.5      | 75.9      | 63.0
c     | 72.2      | 76.0      | 63.5
v-c   | 76.0      | 79.0      | 67.6
p-c   | 78.0      | 82.5      | 70.6
p-v   | 76.6      | 80.8      | 67.7
p-fb  | 74.7      | 79.7      | 66.3
p-v-c | 79.3      | 83.2      | 74.2

Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats | Hungarian | En-ParTUT | Latvian
p     | 63.6      | 76.6      | 55.9
v     | 73.5      | 75.9      | 63.0
c     | 72.2      | 76.0      | 63.5
v-c   | 76.0      | 79.0      | 67.6
p-c   | 78.0      | 82.5      | 70.6
p-v   | 76.6      | 80.8      | 67.7
p-fb  | 74.7      | 79.7      | 66.3
p-v-c | 79.3      | 83.2      | 74.2

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct representation of the parser state still remains critical.

We are unable to represent the whole parsing history with hand-crafted feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
• Koc-University team with the MLP Parser using Context Embeddings

CoNLL18
• KParse team with the Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c) Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM architecture: the σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
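The resulting parser input for one word is then just one long vector. A trivial sketch with made-up toy dimensions (the real sizes come from the thesis setup):

```python
import numpy as np

# Toy sizes for the four embedding sources (assumptions, not thesis values).
WORD_D, CTX_D, POS_D, FEAT_D = 4, 6, 3, 3

def word_representation(word_vec, context_vec, pos_vec, feat_vec):
    """Parser input for one word: the concatenation of the four embedding
    sources, replacing the MLP parser's hand-crafted feature extractor."""
    return np.concatenate([word_vec, context_vec, pos_vec, feat_vec])

x = word_representation(
    np.ones(WORD_D), np.ones(CTX_D), np.ones(POS_D), np.ones(FEAT_D)
)
```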

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
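One simple way to turn such a FEATS string into a vector is to embed each `Key=Value` feature separately and combine them. This is a sketch under my own assumptions: the averaging and the hash-seeded toy embeddings are illustrative, not the thesis recipe:

```python
import zlib

import numpy as np

D = 8  # toy embedding size (assumption)

def feat_vec(pair):
    """Stable toy embedding for one 'Key=Value' morphological feature."""
    return np.random.default_rng(zlib.crc32(pair.encode())).normal(size=D)

def morph_feat_embedding(feats):
    """Embed a UD FEATS string such as
    'Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs' by averaging
    its per-feature vectors; '_' (no features) maps to the zero vector.
    Averaging is my assumption, not necessarily the thesis recipe."""
    if feats == "_":
        return np.zeros(D)
    return np.mean([feat_vec(p) for p in feats.split("|")], axis=0)

v = morph_feat_embedding("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```

Composing from individual features lets unseen feature combinations share statistics with seen ones, which is the usual motivation for embedding FEATS this way.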

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM architecture with the buffer's β-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM architecture with the stack's σ-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack entries s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM architecture with the Action-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN composes the head word, dependency relation, and dependent word embeddings]

$w_{head}^{new} = \tanh(W_{rnn} \cdot [w_{head}^{old};\ d_l;\ w_{dep}] + b_{rnn})$   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
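Equation (1) can be sketched directly (toy dimensions and random untrained weights; `d_l` is the dependency-relation embedding):

```python
import numpy as np

rng = np.random.default_rng(1)
D, R = 8, 4  # toy embedding and relation-embedding sizes (assumptions)
W_rnn = rng.normal(scale=0.1, size=(D, 2 * D + R))  # maps [head; rel; dep] -> D
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    """Eq. (1): the new head embedding is a tanh composition of the old
    head embedding, the relation embedding, and the dependent embedding."""
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

w_head_new = t_rnn(rng.normal(size=D), rng.normal(size=R), rng.normal(size=D))
```

Because the output has the same size as a head embedding, the composition can be applied recursively as the tree grows, which is what makes it a tree-structured RNN.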

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

$left_d(\sigma|s,\ b|\beta,\ A) = (\sigma,\ b|\beta,\ A \cup \{(b, d, s)\})$

[Figure: left transition]
Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

$left_d(\sigma|s,\ b|\beta,\ A) = (\sigma,\ b|\beta,\ A \cup \{(b, d, s)\})$

[Figure: left transition]
Figure: The stack's top LSTM is reduced.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

$left_d(\sigma|s,\ b|\beta,\ A) = (\sigma,\ b|\beta,\ A \cup \{(b, d, s)\})$

[Figure: left transition]
Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

$left_d(\sigma|s,\ b|\beta,\ A) = (\sigma,\ b|\beta,\ A \cup \{(b, d, s)\})$

[Figure: left transition]
Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

$left_d(\sigma|s,\ b|\beta,\ A) = (\sigma,\ b|\beta,\ A \cup \{(b, d, s)\})$

[Figure: left transition]
Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

$right_d(\sigma|s|t,\ \beta,\ A) = (\sigma|s,\ \beta,\ A \cup \{(s, d, t)\})$

[Figure: right transition]
Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

$right_d(\sigma|s|t,\ \beta,\ A) = (\sigma|s,\ \beta,\ A \cup \{(s, d, t)\})$

[Figure: right transition]
Figure: The stack's top LSTM is reduced.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

$right_d(\sigma|s|t,\ \beta,\ A) = (\sigma|s,\ \beta,\ A \cup \{(s, d, t)\})$

[Figure: right transition]
Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

$right_d(\sigma|s|t,\ \beta,\ A) = (\sigma|s,\ \beta,\ A \cup \{(s, d, t)\})$

[Figure: right transition]
Figure: σ-LSTM recalculates its hidden state from the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

$right_d(\sigma|s|t,\ \beta,\ A) = (\sigma|s,\ \beta,\ A \cup \{(s, d, t)\})$

[Figure: right transition]
Figure: Tree-stack LSTM is ready for the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
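The two transitions above (plus shift) can be sketched as pure functions on the parser configuration (stack σ, buffer β, arc set A). This is an illustrative sketch with integer word ids, not the thesis implementation:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): the stack top s
    becomes a d-dependent of the first buffer word b."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): the stack top t
    becomes a d-dependent of the word s below it."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

def shift(stack, buffer, arcs):
    """shift: move the first buffer word onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

# Toy run over word ids [1, 2]: attach 1 under 2 with label SBJ.
state = ([], [1, 2], set())
state = shift(*state)            # σ = [1], β = [2]
state = left_arc(*state, "SBJ")  # adds the arc (2, 'SBJ', 1)
stack, buffer, arcs = state
```

In the full model, each such transition also triggers the corresponding LSTM/t-RNN updates shown on the preceding slides.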

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: the σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing
2. Related Work: Linear Models and their Drawbacks; Neural Network Models
3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser
4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.
2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code       | MLP   | Tree-stack
ru taiga (10k)  | 58.89 | 60.55
hu szeged (20k) | 66.21 | 68.18
tr imst (50k)   | 56.78 | 58.75
ar padt (120k)  | 67.83 | 68.14
en ewt (205k)   | 74.87 | 75.77
cs cac (473k)   | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP   | Only Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87       | 66.94  | 67.03
sv lines  | 71.12 | 72.05       | 72.17  | 72.45
tr imst   | 57.12 | 56.87       | 57.02  | 57.12
ar padt   | 67.83 | 66.67       | 66.89  | 66.92
cs cac    | 83.89 | 82.23       | 83.13  | 83.17
en ewt    | 75.54 | 75.43       | 75.56  | 75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code          | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78         | 53.33
ru taiga (11k)     | 59.13         | 60.55
gl treegal (15k)   | 69.76         | 70.45
hu szeged (20k)    | 66.12         | 68.18
sv lines (49k)     | 74.04         | 75.46
tr imst (50k)      | 58.12         | 58.75
ar padt (120k)     | 68.04         | 68.14
en ewt (204k)      | 74.87         | 75.77
cs cac (473k)      | 82.89         | 83.57
cs pdt (1M)        | 81.17         | 81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang      | MLP   | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87  | 66.94  | 67.03  | 66.12     | 68.18
sv lines  | 71.12 | 72.05  | 72.17  | 74.04  | 72.17     | 75.46
tr imst   | 57.12 | 56.87  | 57.02  | 57.12  | 58.12     | 58.75
ar padt   | 67.83 | 66.67  | 66.89  | 66.92  | 68.04     | 68.14
cs cac    | 83.89 | 82.23  | 83.13  | 83.17  | 82.89     | 83.57
en ewt    | 75.54 | 75.43  | 75.56  | 75.67  | 74.87     | 75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123


Language Model (LM)

LM is used to obtain Context and Word embeddings with twocomponents

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 36 123

Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct representation of the parser state remains critical

We are unable to represent the whole parsing history with hand-crafted feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM; modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTM, σ-LSTM, Action-LSTM, Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor; we initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
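One way to realize morph-feat embeddings, shown as a hedged sketch: each Key=Value pair from the UD FEATS column gets its own vector, and the pairs are combined into one fixed-size vector (here by summing; the thesis may combine them differently). The vocabulary, dimension, and random initialization are illustrative assumptions:

```python
import random

random.seed(0)
DIM = 4
feat_vocab = {}  # "Key=Value" -> embedding vector (toy, untrained)

def feat_vector(pair):
    # Lazily assign a small random vector to each unseen Key=Value pair.
    if pair not in feat_vocab:
        feat_vocab[pair] = [random.uniform(-0.1, 0.1) for _ in range(DIM)]
    return feat_vocab[pair]

def morph_feat_embedding(feats):
    """feats: UD FEATS string like 'Case=Nom|Number=Sing' ('_' if empty)."""
    total = [0.0] * DIM
    if feats == "_":
        return total
    for pair in feats.split("|"):
        total = [t + v for t, v in zip(total, feat_vector(pair))]
    return total

emb = morph_feat_embedding(
    "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```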

Tree-stack LSTM

Model Components:
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123
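As a rough illustration of how the buffer's word vectors w_i, w_i+1, ... are summarized by a recurrence, here is a toy tanh RNN standing in for the LSTM cell; the scalar weights and the pooling of each word vector are made-up stand-ins for learned parameters, not the thesis configuration:

```python
import math

# Hedged sketch: fold the buffer's word vectors into one hidden summary.
def rnn_summary(word_vectors, w_in=0.5, w_rec=0.5):
    h = 0.0
    for vec in word_vectors:
        x = sum(vec) / len(vec)            # crude pooling of one word vector
        h = math.tanh(w_in * x + w_rec * h)  # simple tanh recurrence
    return h

h_buffer = rnn_summary([[0.2, 0.4], [0.1, -0.3], [0.5, 0.5]])
```

The same idea applies to the stack's σ-LSTM and the Action-LSTM, each reading its own sequence.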

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
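Eq. (1) can be transcribed directly with toy dimensions; W_rnn and b_rnn below are random stand-ins for the learned parameters, and the sizes are illustrative, not the thesis settings:

```python
import math
import random

random.seed(1)

# Sketch of the t-RNN composition: the new head embedding is
# tanh(W_rnn . [w_head_old; d_l; w_dep] + b_rnn).
def t_rnn(w_head, d_label, w_dep, W, b):
    x = w_head + d_label + w_dep             # vector concatenation
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]

DIM = 3                                       # head/dependent size (toy)
LDIM = 2                                      # dependency-label size (toy)
IN = 2 * DIM + LDIM
W = [[random.uniform(-0.5, 0.5) for _ in range(IN)] for _ in range(DIM)]
b = [0.0] * DIM
new_head = t_rnn([0.1, 0.2, 0.3], [1.0, 0.0], [0.3, 0.2, 0.1], W, b)
```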

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
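Ignoring the embedding updates, the effect of the two transitions above on the parser state (stack σ, buffer β, arc set A) can be sketched directly; the word indices and labels are illustrative, not from the thesis:

```python
# left_d:  pop dependent s from the stack, head is the buffer front b.
def left_arc(stack, buffer, arcs, d):
    s = stack.pop()                  # dependent: top of stack
    b = buffer[0]                    # head: front of buffer
    arcs.add((b, d, s))              # arc stored as (head, label, dependent)

# right_d: pop dependent t from the stack, head is the new stack top s.
def right_arc(stack, buffer, arcs, d):
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

stack, buffer, arcs = [0, 1], [2, 3], set()
left_arc(stack, buffer, arcs, "SBJ")     # dependent 1 attaches to head 2
stack += [buffer.pop(0), buffer.pop(0)]  # two shift transitions
right_arc(stack, buffer, arcs, "OBJ")    # dependent 3 attaches to head 2
```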

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank has been improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code (tokens)   MLP     Tree-stack
ru_taiga (10k)       58.89   60.55
hu_szeged (20k)      66.21   68.18
tr_imst (50k)        56.78   58.75
ar_padt (120k)       67.83   68.14
en_ewt (205k)        74.87   75.77
cs_cac (473k)        83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code    MLP    Only Action  Only-β  Only-σ
hu_szeged    66.21  66.87        66.94   67.03
sv_lines     71.12  72.05        72.17   72.45
tr_imst      57.12  56.87        57.02   57.12
ar_padt      67.83  66.67        66.89   66.92
cs_cac       83.89  82.23        83.13   83.17
en_ewt       75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code (tokens)   without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.16

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3583
ru_taiga        58.32        60.55           10479
sme_giella      52.78        53.39           16385
la_perseus      49.93        51.60           18184
ug_udt          52.78        53.39           19262
sl_sst          46.72        48.77           19473
hu_szeged       66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.70        75.80           75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121064
bg_btb      84.53        84.55           124336
en_ewt      75.77        75.682          204585
ar_padt     68.02        68.14           223881
de_gsd      71.59        71.32           263804
ca_ancora   85.89        85.874          417587
es_ancora   84.99        84.78           444617
cs_cac      83.57        83.63           472608
cs_pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the model's predicted moves

In both cases, log p of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
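The contrast between the two regimes can be sketched as follows; `predict` and the unit loss are toy stand-ins for the parser model and -log p(gold), so this is an assumption-laden illustration, not the thesis training code:

```python
# One training step: score the gold move, then advance the state either
# along the gold move (static oracle) or the predicted move (dynamic).
def train_step(state, gold_move, predict, follow_predicted):
    move = predict(state)
    loss = 0.0 if move == gold_move else 1.0   # stand-in for -log p(gold)
    next_move = move if follow_predicted else gold_move
    return loss, state + [next_move]

def predict(state):            # toy model that always predicts "shift"
    return "shift"

# Static oracle: the next state follows the gold move regardless.
loss_s, state_s = train_step([], "left", predict, follow_predicted=False)
# Dynamic oracle: the next state follows the model's own (wrong) move.
loss_d, state_d = train_step([], "left", predict, follow_predicted=True)
```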

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with fewer than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af_afribooms   not provided  75.46  77.43  78.12
kk_ktb         20.19         22.31  21.96  23.86
bxr_bdt        7.64          9.76   9.93   8.98
kmr_mg         20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
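A tree is projective exactly when no two of its arcs cross; a small checker makes this concrete. This is a generic sketch, not code from the thesis: `heads[i]` is assumed to give the head of 1-indexed word i, with 0 for the root.

```python
# Hedged sketch of a projectivity check via pairwise arc crossings.
def is_projective(heads):
    # Each arc is stored as an interval (min(head, dep), max(head, dep)).
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (a, b) in arcs:
        for (c, e) in arcs:
            if a < c < b < e:      # intervals interleave -> arcs cross
                return False
    return True

chain_ok = is_projective([2, 0, 2])       # word2 is root: projective
crossing = is_projective([3, 4, 0, 3])    # arcs (1,3) and (2,4) cross
```

The quadratic pairwise check is fine at sentence length; linear-time checks exist but are not needed for illustration.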

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language      Projectivity  Best (LAS)  Our (LAS)
grc_perseus   90.7          79.39       55.03 (20)
eu_bdt        95.13         84.22       74.13 (17)
hu_szeged     97.8          82.66       68.18 (14)
da_ddt        98.26         86.28       76.40 (17)
en_gum        99.6          85.05       76.44 (15)
gl_treegal    100           74.25       70.45 (10)
gl_ctg        100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion, we introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Language Model - Word vectors

Character based LSTM generates word Vectors

Figure Character LSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 37 123

Language Model - Context Vectors

Word based BiLSTM generates Context Vectors

Figure Word BiLSTM from Kırnap et al 2017

Omer Kırnap (Koc University) MSc Thesis September 27 2018 38 123

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages.
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Changes from CoNLL17 to CoNLL18: 1. Train/test split change. 2. Annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru_taiga (10k) | 58.89 | 60.55
hu_szeged (20k) | 66.21 | 68.18
tr_imst (50k) | 56.78 | 58.75
ar_padt (120k) | 67.83 | 68.14
en_ewt (205k) | 74.87 | 75.77
cs_cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu_szeged | 66.21 | 66.87 | 66.94 | 67.03
sv_lines | 71.12 | 72.05 | 72.17 | 72.45
tr_imst | 57.12 | 56.87 | 57.02 | 57.12
ar_padt | 67.83 | 66.67 | 66.89 | 66.92
cs_cac | 83.89 | 82.23 | 83.13 | 83.17
en_ewt | 75.54 | 75.43 | 75.56 | 75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture with the t-RNN component highlighted; σ-, β-, and action-LSTM outputs concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code | without t-RNN | with t-RNN
no_nynorsklia (3k) | 51.78 | 53.33
ru_taiga (11k) | 59.13 | 60.55
gl_treegal (15k) | 69.76 | 70.45
hu_szeged (20k) | 66.12 | 68.18
sv_lines (49k) | 74.04 | 75.46
tr_imst (50k) | 58.12 | 58.75
ar_padt (120k) | 68.04 | 68.14
en_ewt (204k) | 74.87 | 75.77
cs_cac (473k) | 82.89 | 83.57
cs_pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu_szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv_lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr_imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar_padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs_cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en_ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD (v2.2) dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens.

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no_nynorsklia | 51.13 | 53.33 | 3583
ru_taiga | 58.32 | 60.55 | 10479
sme_giella | 52.78 | 53.39 | 16385
la_perseus | 49.93 | 51.6 | 18184
ug_udt | 52.78 | 53.39 | 19262
sl_sst | 46.72 | 48.77 | 19473
hu_szeged | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens.

Lang code | Morph-Feats | no Morph-Feats | # of tokens
sv_lines | 72.18 | 74.81 | 48325
fr_sequoia | 84.36 | 82.17 | 50543
en_gum | 76.44 | 75.34 | 53686
ko_gsd | 73.74 | 72.54 | 56687
eu_bdt | 74.55 | 73.32 | 72974
nl_lassysmall | 76.7 | 75.8 | 75134
gl_ctg | 79.02 | 79.018 | 79327
lv_lvtb | 72.33 | 72.24 | 80666
id_gsd | 75.76 | 73.97 | 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens.

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa_seraji | 81.18 | 81.12 | 121064
bg_btb | 84.53 | 84.55 | 124336
en_ewt | 75.77 | 75.682 | 204585
ar_padt | 68.02 | 68.14 | 223881
de_gsd | 71.59 | 71.32 | 263804
ca_ancora | 85.89 | 85.874 | 417587
es_ancora | 84.99 | 84.78 | 444617
cs_cac | 83.57 | 83.63 | 472608
cs_pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, log p of the gold moves is maximized.

[Figure: Tree-stack LSTM architecture used in both training regimes; σ-, β-, and action-LSTM outputs and the t-RNN embeddings are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
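The difference between the two regimes is only in which transition advances the parser state; the loss always targets the gold move. A toy sketch where the "parser" is a dummy and `gold_move`/`predicted_move` are placeholder strings, not the thesis training code:

```python
import random

def run_training(dynamic, steps=5, p_explore=1.0, seed=0):
    """Return the sequence of transitions actually applied to the state.
    Both regimes maximize log p(gold); only the applied move differs."""
    random.seed(seed)
    followed = []
    for _ in range(steps):
        gold = "gold_move"              # what the oracle says
        predicted = "predicted_move"    # what the model would do
        # loss = -log p(gold) would be computed here in both regimes
        if dynamic and random.random() < p_explore:
            followed.append(predicted)  # dynamic: follow own prediction
        else:
            followed.append(gold)       # static: always follow gold
    return followed

print(run_training(dynamic=False))  # five 'gold_move' entries
print(run_training(dynamic=True))   # five 'predicted_move' entries
```

Following the model's own predictions exposes training to states the static oracle never reaches, which is the motivation for dynamic oracles.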

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch.

2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017].

3. Using my own word and context vectors, trained on a different language from the same language family.

4. Applying transfer learning with a pre-trained parser.

Language | (1) | (2) | (3) | (4)
af_afribooms | not provided | 75.46 | 77.43 | 78.12
kk_ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr_bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr_mg | 20.12 | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not bring useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees. [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
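A tree is projective iff no two dependency arcs cross when drawn above the sentence (treating the root as position 0). `is_projective` below is an illustrative check, not the thesis code:

```python
def is_projective(heads):
    """heads[i-1] is the head position of token i (0 denotes the root).
    Returns True iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # arc (l2, r2) crosses arc (l1, r1)
                return False
    return True

print(is_projective([2, 0, 2]))     # True: all arcs nest
print(is_projective([0, 1, 1, 2]))  # False: arcs (1,3) and (2,4) cross
```

Sentences with crossing arcs therefore bound the accuracy a purely projective transition system can reach on non-projective treebanks.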

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language | Projectivity | Best (LAS) | Our (LAS)
grc_perseus | 90.7 | 79.39 | 55.03 (20)
eu_bdt | 95.13 | 84.22 | 74.13 (17)
hu_szeged | 97.8 | 82.66 | 68.18 (14)
da_ddt | 98.26 | 86.28 | 76.40 (17)
en_gum | 99.6 | 85.05 | 76.44 (15)
gl_treegal | 100 | 74.25 | 70.45 (10)
gl_ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

[7] From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the states of the σ-LSTM, β-LSTM, or Action-LSTM may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

b MLP Parser (CoNLL17)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 39 123

MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure: Kırnap et al. 2017
Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)
The percentage of words correctly assigned both the correct syntactic head and the correct dependency label.

Figure: "Economic news had" — gold tree with arcs SBJ and ATT (LAS = 1); Pred 1 with arcs PRED and OBJ (LAS = 0); Pred 2 with arcs OBJ and ATT (LAS = ½ · 100 = 50%)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
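The LAS computation above can be sketched in a few lines. This is an illustrative Python sketch, not the shared task's evaluation script; the word dictionaries mirror the slide's "Economic news had" example.

```python
def las(gold, pred):
    # Labeled Attachment Score: fraction of words whose predicted head
    # AND dependency label both match the gold annotation.
    # gold / pred map word -> (head, label).
    correct = sum(1 for w in gold if pred.get(w) == gold[w])
    return correct / len(gold)

# The slide's example over the two dependents of "had":
gold  = {"Economic": ("news", "ATT"), "news": ("had", "SBJ")}
pred1 = {"Economic": ("news", "OBJ"), "news": ("had", "PRED")}  # both labels wrong
pred2 = {"Economic": ("news", "ATT"), "news": ("had", "OBJ")}   # one arc fully correct
```

Here `las(gold, gold)` is 1, `las(gold, pred1)` is 0, and `las(gold, pred2)` is 0.5, matching the slide.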

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct state representation of the parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM — the σ-LSTM, β-LSTM, and Action-LSTM outputs are concatenated and fed to an MLP; a t-RNN composes head word, dependent word, and dependency relation embeddings

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initiate the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
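As a sketch, the concatenation above might look like the following. All dimensions are invented for illustration; the slide does not give the model's actual sizes.

```python
import numpy as np

def word_representation(word_vec, context_vec, pos_vec, morph_vec):
    # The parser's input for one word: a single vector concatenating the
    # four sources listed above (char-LSTM word vector, BiLSTM context
    # vector, POS vector, morph-feat vector).
    return np.concatenate([word_vec, context_vec, pos_vec, morph_vec])

# Toy dimensions, chosen only for this sketch:
x = word_representation(np.zeros(350), np.zeros(300), np.zeros(128), np.zeros(128))
```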

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
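One simple way to realize morph-feat embeddings is to give each `key=value` feature its own vector and sum them per word. This is an illustrative sketch under that assumption, not necessarily the exact scheme used in the thesis.

```python
import numpy as np

DIM = 16                      # toy embedding size
rng = np.random.default_rng(0)
feat_table = {}               # "key=value" feature -> vector, grown lazily

def morph_feat_vector(feats):
    # Sum one embedding per key=value feature of the word; "_" marks a
    # word with no morphological annotation (CoNLL-U convention).
    vec = np.zeros(DIM)
    if feats == "_":
        return vec
    for f in feats.split("|"):
        if f not in feat_table:
            feat_table[f] = rng.normal(size=DIM)
        vec += feat_table[f]
    return vec

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```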

Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the buffer words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack entries s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word, and dependency relation embeddings

w_head^new = tanh(W_rnn · [w_head^old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
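Equation (1) can be sketched directly. The weights and dimensions below are random placeholders for illustration, not the trained parameters.

```python
import numpy as np

dim = 8                                  # toy embedding size
rng = np.random.default_rng(1)
W_rnn = rng.normal(size=(dim, 3 * dim))  # placeholder weights
b_rnn = rng.normal(size=dim)

def t_rnn(w_head_old, d_l, w_dep):
    # Eq. (1): squash a linear map of [old head; relation; dependent]
    # into the new head embedding.
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_head_new = t_rnn(np.ones(dim), np.zeros(dim), np.ones(dim))
```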

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
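The two transition formulas can be sketched on plain Python lists. This is a toy run with illustrative labels: `left_arc` attaches the stack top to the buffer front, and `right_arc` attaches the stack top to the element below it on the stack, matching the formulas.

```python
def shift(sigma, beta):
    sigma.append(beta.pop(0))

def left_arc(sigma, beta, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # stack top s becomes a d-dependent of buffer front b.
    s = sigma.pop()
    arcs.add((beta[0], d, s))

def right_arc(sigma, beta, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # stack top t becomes a d-dependent of s below it.
    t = sigma.pop()
    arcs.add((sigma[-1], d, t))

# Toy run over "news had" (labels chosen for illustration):
sigma, beta, arcs = ["ROOT"], ["news", "had"], set()
shift(sigma, beta)                    # σ = [ROOT, news], β = [had]
left_arc(sigma, beta, arcs, "SBJ")    # had --SBJ--> news
shift(sigma, beta)                    # σ = [ROOT, had], β = []
right_arc(sigma, beta, arcs, "ROOT")  # ROOT --ROOT--> had
```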

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial MLP model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens in between 50k and 100k

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, log p of the gold moves is maximized

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
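The difference between the two oracles can be sketched as a single training-step skeleton. The `gold_move_fn`/`model_move_fn` callables are hypothetical stand-ins; the real parser's loss and state objects are omitted.

```python
def train_step(state, gold_move_fn, model_move_fn, dynamic=False):
    # In both regimes the loss maximizes log p(gold move); they differ
    # only in which move is FOLLOWED to reach the next training state:
    # the gold move (static) or the model's own prediction (dynamic),
    # which exposes training to states the parser reaches at test time.
    gold = gold_move_fn(state)
    pred = model_move_fn(state)
    next_move = pred if dynamic else gold
    return gold, next_move

gold_fn = lambda state: "shift"    # hypothetical oracle
model_fn = lambda state: "left"    # hypothetical (wrong) model prediction
static_step = train_step(None, gold_fn, model_fn, dynamic=False)
dynamic_step = train_step(None, gold_fn, model_fn, dynamic=True)
```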

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
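Projectivity can be checked by testing whether any two arcs cross when drawn above the sentence. A small sketch, where `heads` is a 1-based head array with 0 as the artificial root:

```python
def is_projective(heads):
    # heads[i-1] is the head of word i (1-based); 0 is the artificial root.
    # A tree is projective iff no two arcs cross, i.e. there is no pair
    # of arcs (l1, r1), (l2, r2) with l1 < l2 < r1 < r2.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:
                return False
    return True

projective = is_projective([2, 0, 2])    # simple chain, no crossing arcs
crossing = is_projective([3, 4, 0, 1])   # arcs (1,3) and (2,4) cross
```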

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


MLP Parser

MLP Parser consists of 4 components

Character Based LSTM extracts word vectors

Word Based BiLSTM extracts context vectors

Feature extractor describes current state

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 40 123

MLP Parser - Feature Extraction

Feature extractor describes current state

Figure Kırnap et al 2017Omer Kırnap (Koc University) MSc Thesis September 27 2018 41 123

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings
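A UD FEATS string like the one above can be split into feature=value pairs before embedding lookup. A sketch; summing the pair embeddings is one plausible composition, as the slide does not specify the exact one:

```python
def parse_morph_feats(feats):
    # Split a UD FEATS string like 'Case=Nom|Number=Sing' into (name, value) pairs.
    if feats in ("", "_"):
        return []
    return [tuple(f.split("=", 1)) for f in feats.split("|")]

def morph_feat_vector(feats, table, dim):
    # Sum the embeddings of the individual pairs (illustrative choice).
    vec = [0.0] * dim
    for pair in parse_morph_feats(feats):
        vec = [a + b for a, b in zip(vec, table.get(pair, [0.0] * dim))]
    return vec

pairs = parse_morph_feats("Case=Nom|Gender=Neut|Number=Sing")
assert pairs[0] == ("Case", "Nom") and len(pairs) == 3
```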


Tree-stack LSTM

Model Components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN


β-LSTM

Figure: Tree-stack LSTM overview


β-LSTM

Figure: Buffer's β-LSTM over the buffer words wi, wi+1, wi+2


σ-LSTM

Figure: Tree-stack LSTM overview


σ-LSTM

Figure: Stack's σ-LSTM over the stack words si, si+1, si+2


Action-LSTM

Figure: Tree-stack LSTM overview


Action-LSTM

Figure: Action-LSTM over the sequence of past transitions


How are the components of the tree-stack LSTM connected?


Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependent word, and dependency relation embeddings

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)
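Equation (1) in runnable form; the toy dimensions and the zero-initialized `W`, `b` below are stand-in parameters, not trained values:

```python
import math

def t_rnn(w_head, d_label, w_dep, W, b):
    # Eq. (1): new head embedding = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)
    x = w_head + d_label + w_dep  # concatenation [head; label; dependent]
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# With zero weights and bias, every output unit is tanh(0) = 0.
assert t_rnn([1.0], [2.0], [3.0], [[0.0, 0.0, 0.0]], [0.0]) == [0.0]
```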


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Left transition. Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.
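The left transition can be sketched on a plain Python parser state; the t-RNN and LSTM hidden-state updates shown on the following slides are omitted here:

```python
# left_d: the stack top s becomes a dependent of the buffer front b,
# adding the arc (b, d, s); s is popped, b stays on the buffer.
def left(stack, buffer, arcs, label):
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, label, s))

stack, buffer, arcs = ["news"], ["had"], set()
left(stack, buffer, arcs, "nsubj")
assert arcs == {("had", "nsubj", "news")} and stack == []
```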


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding.


Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input


Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition


Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Right transition. Each embedding is initiated by concatenating POS, language, and morph-feat embeddings.
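The right transition, in the same plain-state sketch as the left transition (t-RNN and LSTM updates again omitted):

```python
# right_d: the stack top t becomes a dependent of the element s below it,
# adding the arc (s, d, t); t is popped and s remains the stack top.
def right(stack, buffer, arcs, label):
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, label, t))

stack, arcs = ["had", "effect"], set()
right(stack, [], arcs, "obj")
assert arcs == {("had", "obj", "effect")} and stack == ["had"]
```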


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition.


Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview


Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions


4. Results & Comparisons


Results amp Comparisons

Dataset

CoNLL17: Dependency parsing of 81 treebanks in 49 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18: Dependency parsing of 82 treebanks in 57 languages. All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations. Koc-University ranked 16th out of 30 participants (2nd among transition based parsers).

Differences between CoNLL17 and CoNLL18: 1. Train/test split change 2. Annotation


MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only)


Only Action LSTM

Figure: Only action LSTM


Only β-LSTM

Figure: Only β-LSTM


Only σ-LSTM

Figure: Only σ-LSTM


Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models


Ablation of t-RNN

Figure: Tree-stack LSTM overview


Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides comparative advantage for low-resourcelanguages


Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).


What does Morphological Feature Embedding provide


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k but less than 50k tokens

Languages having more than 50k but less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases the log-probability of the gold moves is maximized.

Figure: Tree-stack LSTM overview
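The contrast can be sketched as two training loops that differ only in which move advances the parser state; `gold_move` and `predict` below are hypothetical stand-ins, and the loss on gold moves is identical in both settings:

```python
# Static oracle: follow gold moves. Dynamic oracle: follow predicted moves.
# Either way, training maximizes log p(gold move | state).
def training_states(start, steps, gold_move, predict, dynamic):
    state, visited = start, []
    for _ in range(steps):
        visited.append(state)
        move = predict(state) if dynamic else gold_move(state)
        state = state + (move,)  # apply the chosen transition
    return visited

static = training_states((), 2, lambda s: "G", lambda s: "P", dynamic=False)
dynamic = training_states((), 2, lambda s: "G", lambda s: "P", dynamic=True)
assert static == [(), ("G",)] and dynamic == [(), ("P",)]
```

The dynamic setting exposes the model to states reachable only through its own mistakes, which is the point of dynamic-oracle training.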


Static vs Dynamic Oracle Training

Figure: Results are very close for fewer than 20k training tokens


Static vs Dynamic Oracle Training

Figure: Results are very close for between 20k and 50k training tokens


Static vs Dynamic Oracle Training

Figure: Results are very close for more than 50k training tokens


How about languages with fewer than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017].


Projectivity

Transition based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
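Projectivity can be checked by testing whether any two dependency arcs cross; a small sketch over a head-index encoding of a sentence:

```python
# heads[i-1] is the head of token i (0 denotes the artificial root).
# A tree is projective iff no two arc spans strictly interleave.
def is_projective(heads):
    spans = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in spans:
        for (l2, r2) in spans:
            if l1 < l2 < r1 < r2:  # crossing arcs
                return False
    return True

assert is_projective([2, 0, 2])          # simple projective tree
assert not is_projective([3, 4, 0, 3])   # contains crossing arcs
```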


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., a CRF loss) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions





Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

MLP Parser - Decision Module

Decision module (MLP) decides the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 42 123

Experiments & Dataset (MLP): CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotation:
• 17 universal part-of-speech tags
• 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS): the percentage of words correctly assigned both the correct syntactic head and the correct dependency label

Figure LAS example on "Economic news had ...":
Gold tree: arcs ATT and SBJ → LAS = 1
Pred 1: arcs PRED and OBJ, neither matching gold → LAS = 0
Pred 2: arcs ATT and OBJ, one of two correct → LAS = (1/2) · 100 = 50%

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123
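The LAS computation above can be sketched in a few lines. This is an illustration, not the official evaluation script; the (head, label) encoding and function name are assumptions.

```python
# LAS sketch: the fraction of words whose predicted head AND dependency
# label both match the gold annotation.

def las(gold, pred):
    """gold, pred: one (head_index, label) pair per word."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return correct / len(gold)

# "Economic news had": gold arcs ATT(news <- Economic), SBJ(had <- news)
gold = [(2, "ATT"), (3, "SBJ")]
pred1 = [(2, "PRED"), (3, "OBJ")]   # both arcs wrong
pred2 = [(2, "ATT"), (3, "OBJ")]    # one of two correct
print(las(gold, pred1), las(gold, pred2))  # 0.0 0.5
```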

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5 Source: CoNLL17 official results page
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v) and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v) and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63.0
c       72.2        76.0        63.5
v-c     76.0        79.0        67.6
p-c     78.0        82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state features still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize each word representation by concatenating:

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
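The concatenation listed above can be sketched as follows; the dimensions are illustrative placeholders, not the thesis's actual sizes.

```python
import numpy as np

# Sketch of the word representation: concatenation of a character-LSTM
# word vector, a BiLSTM context vector, a POS embedding and a morph-feat
# embedding. All dimensions here are assumptions for illustration.

def word_representation(char_vec, context_vec, pos_vec, morph_vec):
    return np.concatenate([char_vec, context_vec, pos_vec, morph_vec])

x = word_representation(np.zeros(350), np.zeros(300), np.zeros(50), np.zeros(50))
print(x.shape)  # (750,)
```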

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
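One simple way to realize the morph-feat embedding shown in the figure is to average one learned vector per key=value pair; this is a sketch under that assumption, with a random table standing in for learned embeddings.

```python
import numpy as np

# Sketch: embedding a UD feature string such as
# "Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs" by averaging
# one vector per key=value pair. Table and dimension are illustrative.

rng = np.random.default_rng(0)
table = {}

def morph_feat_vector(feats, dim=50):
    pairs = feats.split("|") if feats != "_" else []   # "_" = no features
    if not pairs:
        return np.zeros(dim)
    vecs = [table.setdefault(p, rng.normal(size=dim)) for p in pairs]
    return np.mean(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (50,)
```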

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
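Equation (1) can be sketched numerically as below; the weight shapes and the tiny dimension are illustrative assumptions.

```python
import numpy as np

# Numeric sketch of equation (1): the head word's vector is recomputed from
# the old head vector, the dependency-relation embedding d_l and the
# dependent's vector. Weight shapes are illustrative.

def trnn(w_head, d_rel, w_dep, W_rnn, b_rnn):
    x = np.concatenate([w_head, d_rel, w_dep])   # [w_head; d_l; w_dep]
    return np.tanh(W_rnn @ x + b_rnn)

d = 4
new_head = trnn(np.zeros(d), np.zeros(d), np.zeros(d),
                np.zeros((d, 3 * d)), np.zeros(d))
print(new_head.shape)  # (4,)
```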

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
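The two transitions walked through above can be sketched directly from their definitions; the state layout, sample words and relation label are illustrative.

```python
# Runnable sketch of the two transitions, written from
#   left_d(σ|s, b|β, A)  = (σ, b|β, A ∪ {(b, d, s)})
#   right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})
# Arcs are stored as (head, relation, dependent) triples.

def left(stack, buffer, arcs, d):
    s = stack.pop()               # dependent: top of the stack
    arcs.add((buffer[0], d, s))   # head: front word of the buffer

def right(stack, buffer, arcs, d):
    t = stack.pop()               # dependent: top of the stack
    arcs.add((stack[-1], d, t))   # head: next word down the stack

stack, buffer, arcs = ["news"], ["had"], set()
left(stack, buffer, arcs, "SBJ")
print(arcs)  # {('had', 'SBJ', 'news')}
```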

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
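The decision step in the diagram above (the Concat and MLP boxes) can be sketched as follows; all dimensions, including the number of candidate transitions, are illustrative assumptions.

```python
import numpy as np

# Sketch: the β-, σ- and Action-LSTM summaries are concatenated and an MLP
# scores the candidate transitions. Dimensions (128, 64, 73) are assumed.

def score_transitions(h_beta, h_sigma, h_action, W1, b1, W2, b2):
    h = np.concatenate([h_beta, h_sigma, h_action])  # "Concat" box
    hidden = np.tanh(W1 @ h + b1)                    # MLP hidden layer
    return W2 @ hidden + b2                          # one score per transition

rng = np.random.default_rng(0)
d = 128
scores = score_transitions(rng.normal(size=d), rng.normal(size=d),
                           rng.normal(size=d),
                           rng.normal(size=(64, 3 * d)), np.zeros(64),
                           rng.normal(size=(73, 64)), np.zeros(73))
print(scores.shape)  # (73,)
```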

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing
2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models
3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser
4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning
5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.60            18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code     Morph-Feats   no Morph-Feats   # of tokens
sv lines      72.18         74.81            48325
fr sequoia    84.36         82.17            50543
en gum        76.44         75.34            53686
ko gsd        73.74         72.54            56687
eu bdt        74.55         73.32            72974
nl lassymal   76.70         75.80            75134
gl ctg        79.02         79.018           79327
lv lvtb       72.33         72.24            80666
id gsd        75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow the model's predicted moves

In both cases, the log-probability of gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
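The two training regimes above can be sketched as one training step; the function and state handling are illustrative, and the exploration probability is an assumed hyperparameter.

```python
import random

# Sketch of static vs dynamic oracle training. Both regimes maximize
# log p(gold move); they differ only in which transition is executed
# to reach the next parser state.

def train_step(state, gold_move, log_probs, dynamic=False, explore=0.1):
    """log_probs: the model's log-probability for each candidate transition."""
    loss = -log_probs[gold_move]                 # maximize log p(gold)
    if dynamic and random.random() < explore:
        executed = max(range(len(log_probs)), key=log_probs.__getitem__)
    else:
        executed = gold_move                     # static: always follow gold
    return loss, executed

# Toy call: gold move is 2, the model currently prefers move 0.
loss, move = train_step(state=None, gold_move=2,
                        log_probs=[-0.1, -2.0, -3.0], dynamic=False)
print(loss, move)  # 3.0 2
```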

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
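Projectivity can be checked by testing whether any two arcs cross; a minimal sketch (1-indexed words, `heads[i-1]` giving the head of word i, 0 for the artificial root):

```python
# A tree is projective iff no two arcs cross, i.e. there is no pair of
# arcs (i, j) and (k, l) with i < k < j < l.

def is_projective(heads):
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, j in arcs:
        for k, l in arcs:
            if i < k < j < l:     # arc (i, j) crosses arc (k, l)
                return False
    return True

print(is_projective([2, 0, 2]))     # True  (all arcs nested)
print(is_projective([3, 4, 0, 3]))  # False (arcs (1,3) and (2,4) cross)
```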

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity   Best (LAS)   Ours (LAS)
grc perseus   90.70          79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.80          82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.60          85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Experiments amp Dataset (MLP) CoNLL17

CoNLL17 Dataset

Dependency parsing of 81 treebanks in 49 languages

All treebanks use standardized annotationbull 17 universal part-of-speech tagsbull 37 universal dependency relations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 43 123

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), context vector (c)

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Our BiLSTM language model word vectors perform better than FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state representation remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

(c) Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: Tree-stack LSTM: σ-, β-, and Action-LSTMs and a t-RNN over (head word, dependency relation, dependent word); their outputs are concatenated and fed to an MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs (morphological features of the word "It")

Figure: Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
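The morph-feat string on this slide can be embedded and combined with the other vectors roughly as follows. This is a pure-Python sketch under toy assumptions: DIM, the sum-of-feature-vectors scheme, and all function names are illustrative, not the thesis implementation.

```python
import random

random.seed(0)
DIM = 8  # toy embedding size (assumption; real sizes differ)

morph_vocab = {}  # one learned vector per feature=value pair

def _new_vec():
    return [random.gauss(0.0, 1.0) for _ in range(DIM)]

def morph_vec(feats):
    """Embed 'Case=Nom|Gender=Neut|...' by summing one vector per
    feature=value pair (one plausible scheme; an assumption here)."""
    total = [0.0] * DIM
    for f in feats.split("|"):
        vec = morph_vocab.setdefault(f, _new_vec())
        total = [t + x for t, x in zip(total, vec)]
    return total

def word_rep(char_vec, context_vec, pos_vec, feats):
    # no hand-crafted feature templates: just concatenate learned vectors
    return char_vec + context_vec + pos_vec + morph_vec(feats)
```

In the real model these vectors are learned jointly; here random vectors only illustrate the shapes and the concatenation.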

Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM overview (β-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM (inputs w_i, w_i+1, w_i+2)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM overview (σ-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM (inputs s_i, s_i+1, s_i+2)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM overview (Action-LSTM component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM (an LSTM over the sequence of past transitions)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependency relation, and dependent word embeddings

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
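Equation (1) can be sketched numerically. Toy dimensions throughout; `trnn_update`, the weight matrix, and bias values are illustrative stand-ins for the learned W_rnn and b_rnn:

```python
import math

def trnn_update(w_head, d_rel, w_dep, W, b):
    """New head embedding: tanh(W · [w_head; d_rel; w_dep] + b)  (Eq. 1)."""
    x = w_head + d_rel + w_dep  # list concatenation = vector concatenation
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]

# toy sizes: head/dependent embeddings of dim 2, relation embedding of dim 1
W = [[0.1] * 5, [0.2] * 5]   # shape (2, 5): output dim 2, input dim 2+1+2
b = [0.0, 0.1]
new_head = trnn_update([1.0, 0.5], [0.2], [0.3, 0.4], W, b)
```

The tanh keeps the new head embedding in the same range as the old one, so it can be pushed back into the stack LSTM unchanged.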

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recomputes its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123
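The left_d and right_d definitions on these slides can be sketched as list operations (word indices stand in for embeddings; the t-RNN head update and the LSTM recomputation are omitted, so this is only the transition bookkeeping, not the thesis code):

```python
def shift(stack, buffer):
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # pop s from the stack; the buffer front b becomes its head via d
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # pop t; the remaining stack top s becomes its head via d
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# toy run on word ids 1, 2 with the root (0) already on the stack
stack, buffer, arcs = [0], [1, 2], set()
shift(stack, buffer)                    # stack [0, 1], buffer [2]
left_arc(stack, buffer, arcs, "SBJ")    # word 2 heads word 1
shift(stack, buffer)                    # stack [0, 2], buffer []
right_arc(stack, buffer, arcs, "ROOT")  # root heads word 2
```

Note the asymmetry the formulas encode: a left arc takes its head from the buffer front, while a right arc takes it from the element below the stack top.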

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recomputes its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

Figure: Final Tree-stack LSTM: σ-, β-, and Action-LSTMs and the t-RNN (head word, dependency relation, dependent word); outputs are concatenated and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change, 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only the β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only the σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM overview (t-RNN component)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training set size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code    Morph-Feats  no Morph-Feats  # of tokens
sv lines     72.18        74.81           48325
fr sequoia   84.36        82.17           50543
en gum       76.44        75.34           53686
ko gsd       73.74        72.54           56687
eu bdt       74.55        73.32           72974
nl lassymal  76.7         75.8            75134
gl ctg       79.02        79.018          79327
lv lvtb      72.33        72.24           80666
id gsd       75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow the model's predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
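The static/dynamic distinction can be sketched with a toy softmax over three transitions; the loss is identical in both modes, only the followed move differs. This is an illustration, not the thesis training code:

```python
import math

def train_step(scores, gold_action, dynamic):
    """One oracle decision: the loss maximizes log p(gold) either way,
    but the parser follows the gold move (static oracle) or its own
    highest-scoring move (dynamic oracle) to reach the next state."""
    z = sum(math.exp(s) for s in scores.values())
    loss = -(scores[gold_action] - math.log(z))  # -log softmax(gold)
    followed = max(scores, key=scores.get) if dynamic else gold_action
    return loss, followed

scores = {"shift": 2.0, "left": 0.5, "right": -1.0}
loss_s, a_s = train_step(scores, "left", dynamic=False)  # follows gold
loss_d, a_d = train_step(scores, "left", dynamic=True)   # follows model
```

The dynamic oracle thus exposes the model to states reached by its own (possibly wrong) moves, while the training signal still points at the gold transition.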

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
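Projectivity can be checked directly: a tree is projective iff no two arcs cross. A sketch, assuming `heads[i]` gives the head of 1-indexed word i, with 0 as the root (a hypothetical helper, not part of the thesis):

```python
def is_projective(heads):
    """heads[i-1] = head of word i (0 = root), words numbered 1..n.
    The tree is projective iff no two arcs cross each other."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            # crossing: exactly one endpoint lies strictly inside the other span
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False
    return True
```

For example, heads [2, 3, 0] ("Economic" -> "news" -> "had" -> root) is projective, while heads [3, 4, 0, 3] contains the crossing arcs (1,3) and (2,4) and is not.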

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7. From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

The Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Experiments - Evaluation Metric

Labeled Attachment Score (LAS)The percentage of words correctly assigned both the correct syntactic headand the correct dependency label

Economic news hadGold Tree LAS 1

SBJATT

Economic news had

PREDOBJPred 1 LAS 0

Economic news had

OBJATTPred 2 LAS (frac12)100

Omer Kırnap (Koc University) MSc Thesis September 27 2018 44 123

Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers 5

5Source CoNLL17 official results pageOmer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between the MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM architecture with the t-RNN highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only-A  Only-β  Only-σ  w/o t-RNN  All
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: to better understand our contributions, we divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.

Figure: Tree-stack LSTM architecture

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
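The difference between the two training regimes can be sketched as follows. This is a toy illustration, not the thesis code: the parser state is reduced to the history of moves taken so far, and `predict` stands in for the trained model.

```python
def rollout(gold_moves, predict, dynamic=False):
    """Collect (state, gold) training pairs for one sentence.

    Static oracle: advance the parser with the gold move, so training only
    visits states on the gold path.  Dynamic oracle: advance with the
    model's predicted move, so training also visits states reached after
    mistakes.  Either way the stored target is the gold move, i.e. the
    loss maximizes log p(gold | state).
    """
    history, pairs = [], []
    for gold in gold_moves:
        pairs.append((tuple(history), gold))            # target is always gold
        taken = predict(history) if dynamic else gold   # dynamic follows model
        history.append(taken)
    return pairs
```

With a model that always predicts "shift", both regimes train on the same gold targets, but the dynamic rollout visits off-gold-path states.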

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees. 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
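Projectivity can be checked directly: a dependency tree is projective exactly when no two arcs cross. A small sketch, assuming a 1-based `heads` array where `heads[i-1]` is the head of word `i` and 0 denotes the artificial root:

```python
def is_projective(heads):
    """Return True iff the tree has no crossing arcs when drawn
    above the sentence (the standard projectivity criterion)."""
    arcs = [(min(dep, head), max(dep, head))
            for dep, head in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:     # the two arcs interleave
                return False
    return True
```

For example, heads `[2, 0, 2]` (a simple chain under word 2) is projective, while `[3, 4, 0, 3]` contains the crossing arcs (1,3) and (2,4) and is not.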

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion, we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the states of the σ-LSTM, β-LSTM, or Action-LSTM may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Experiments (MLP)

CoNLL 2017 Results (all treebanks LAS)

Ranked 1st among transition based parsers. 5

5 Source: CoNLL17 official results page.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 45 123

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Our BiLSTM language model word vectors perform better than the FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63
c      72.2       76         63.5
v-c    76         79         67.6
p-c    78         82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state features still remains critical.

We are unable to represent the whole parsing history with feature extraction.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and the stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17
- Koc-University team with an MLP Parser using Context Embeddings

CoNLL18
- KParse team with a Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

The hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al., 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

Figure: the full Tree-stack LSTM; the outputs of the β-, σ-, and Action-LSTMs are concatenated and fed to an MLP, and the t-RNN combines head word, dependent word, and dependency relation embeddings

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

- Character-based LSTM's word vectors
- Word-based BiLSTM's context vectors
- Part-of-speech (POS) vectors
- Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
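The figure maps a whole UD FEATS string to one vector. A minimal sketch of one plausible way to do this, looking up a vector per feature=value pair and summing; the summation composition and the random initialization are assumptions for illustration, since in the real parser these would be trained parameters:

```python
import random

def morph_feat_vector(feats, table, dim=4, seed=0):
    """Embed a UD FEATS string like 'Case=Nom|Number=Sing' by summing one
    vector per feature=value pair.  `table` caches the (here randomly
    initialized) vectors; UD marks 'no features' with an underscore."""
    rng = random.Random(seed)
    vec = [0.0] * dim
    if feats == "_":
        return vec
    for pair in feats.split("|"):
        if pair not in table:                       # lazily create an embedding
            table[pair] = [rng.uniform(-1, 1) for _ in range(dim)]
        vec = [a + b for a, b in zip(vec, table[pair])]
    return vec
```

Summation makes the representation order-invariant, which matches the fact that FEATS is an unordered set of attribute-value pairs.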

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure: Tree-stack LSTM architecture with the β-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: The buffer's β-LSTM running over the buffer words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM architecture with the σ-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: The stack's σ-LSTM running over the stack entries s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM architecture with the Action-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: The Action-LSTM running over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: The t-RNN combines the dependent word, dependency relation, and head word embeddings into a new head embedding:

w_head_new = tanh(W_rnn [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
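Equation (1) written out as a dependency-free sketch; `W` and `b` stand in for the trained parameters W_rnn and b_rnn, and the three input vectors are concatenated before the affine map:

```python
import math

def trnn_compose(w_head, d_rel, w_dep, W, b):
    """t-RNN update: new head = tanh(W · [w_head; d_rel; w_dep] + b),
    where [;] is vector concatenation and W has one row per output dim."""
    x = w_head + d_rel + w_dep            # list concatenation = [;]
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]
```

After a left or right transition, this composed vector replaces the head word's embedding, so the head carries information about its subtree.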

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The β-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
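The two transitions above can be sketched directly from their definitions; the usual shift move, which the slides do not spell out, is added here to complete a runnable toy example, and integer word indices stand in for the embeddings the real model manipulates:

```python
def shift(stack, buffer, arcs):
    """shift: move the buffer front onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}): pop the stack
    top s and attach it as a d-dependent of the buffer front b."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}): pop the stack
    top t and attach it as a d-dependent of the new stack top s."""
    t, s = stack[-1], stack[-2]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Parse "economic news" (words 1 and 2; 0 is the artificial root):
state = ([0], [1, 2], set())
state = shift(*state)            # stack [0, 1], buffer [2]
state = left(*state, "amod")     # word 1 becomes a dependent of word 2
state = shift(*state)            # stack [0, 2], buffer []
state = right(*state, "root")    # word 2 attaches under the root
```

Each transition returns a new (stack, buffer, arcs) triple, mirroring the functional notation of the definitions.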

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
  • Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
  • Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
  • Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
  • Conclusion
  • Future Work & Discussions

Contributions in CoNLL17

Omer Kırnap (Koc University) MSc Thesis September 27 2018 46 123

Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian  En-ParTUT  Latvian
p       63.6       76.6       55.9
v       73.5       75.9       63.0
c       72.2       76.0       63.5
v-c     76.0       79.0       67.6
p-c     78.0       82.5       70.6
p-v     76.6       80.8       67.7
p-fb    74.7       79.7       66.3
p-v-c   79.3       83.2       74.2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Context vectors provide an independent contribution on top of POS tags.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word Embeddings

Our BiLSTM language model word vectors perform better than FB vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word Embeddings

Both POS tags and context vectors have significant contributions on top of word vectors.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing the correct parser state features still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM

Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al., 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview. The σ-LSTM, β-LSTM, and Action-LSTM hidden states are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
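The FEATS string shown above is a pipe-separated list of key=value pairs; splitting it into pairs is the step that precedes looking up one embedding per feature. A minimal sketch in plain Python (the function name and the empty-FEATS conventions are illustrative assumptions, not the thesis code):

```python
def parse_morph_feats(feats):
    """Split a UD FEATS string like 'Case=Nom|Number=Sing' into (key, value) pairs."""
    if feats in ("", "_"):  # "_" marks an empty FEATS column in CoNLL-U
        return []
    return [tuple(kv.split("=", 1)) for kv in feats.split("|")]

pairs = parse_morph_feats("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```

Each resulting pair (e.g. `("Case", "Nom")`) can then index its own embedding table.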

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack items s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependency relation, and dependent word embeddings]

w_head_new = tanh(W_rnn [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
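Equation (1) can be sketched in plain Python; the dimensions, the random initialization, and the function name below are illustrative assumptions rather than the thesis settings:

```python
import math
import random

def trnn_compose(w_head, d_rel, w_dep, W, b):
    """tanh(W [w_head; d_rel; w_dep] + b): the new head embedding."""
    x = w_head + d_rel + w_dep  # concatenation, as in Eq. (1)
    return [math.tanh(sum(w * xj for w, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

random.seed(0)
dim, rel_dim = 4, 2
W = [[random.uniform(-0.1, 0.1) for _ in range(2 * dim + rel_dim)]
     for _ in range(dim)]
b = [0.0] * dim
head, dep, rel = [0.5] * dim, [-0.3] * dim, [0.1, 0.2]
new_head = trnn_compose(head, rel, dep, W, b)  # same size as the old head
```

Because the output has the same dimensionality as the old head embedding, the composition can be applied repeatedly as a head accumulates dependents.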

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
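The left and right transitions above can be sketched as list operations on the stack σ, the buffer β, and the arc set A. This is a schematic sketch only: the real parser additionally recomputes LSTM hidden states and runs the t-RNN composition on each new arc.

```python
def left_arc(sigma, beta, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})."""
    s = sigma.pop()       # dependent s leaves the stack
    b = beta[0]           # head b stays at the buffer front
    arcs.add((b, d, s))   # arc stored as (head, relation, dependent)

def right_arc(sigma, beta, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})."""
    t = sigma.pop()       # dependent t leaves the stack
    s = sigma[-1]         # head s is the new stack top
    arcs.add((s, d, t))

sigma, beta, arcs = [0, 1, 2], [3, 4], set()
left_arc(sigma, beta, arcs, "nsubj")   # adds (3, "nsubj", 2)
right_arc(sigma, beta, arcs, "obj")    # adds (0, "obj", 1)
```

Note that in the left transition the head comes from the buffer and in the right transition from the stack, exactly as the two formulas specify.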

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
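The decision layer in the overview, a concatenation of the component hidden states followed by an MLP over candidate transitions, can be sketched as follows (toy dimensions and random weights are illustrative assumptions):

```python
import math
import random

def mlp(x, W1, b1, W2, b2):
    """tanh hidden layer, then linear transition scores."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bi)
         for row, bi in zip(W1, b1)]
    return [sum(w * hi for w, hi in zip(row, h)) + bi
            for row, bi in zip(W2, b2)]

random.seed(2)
h_sigma, h_beta, h_action = [0.1] * 4, [0.2] * 4, [0.3] * 4  # component states
x = h_sigma + h_beta + h_action                              # concatenation
hidden, n_moves = 8, 3
W1 = [[random.uniform(-0.5, 0.5) for _ in range(len(x))] for _ in range(hidden)]
b1 = [0.0] * hidden
W2 = [[random.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(n_moves)]
b2 = [0.0] * n_moves
scores = mlp(x, W1, b1, W2, b2)  # one score per candidate transition
```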

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:

• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change; 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu_szeged  66.21  66.87        66.94   67.03
sv_lines   71.12  72.05        72.17   72.45
tr_imst    57.12  56.87        57.02   57.12
ar_padt    67.83  66.67        66.89   66.92
cs_cac     83.89  82.23        83.13   83.17
en_ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv_lines   71.12  72.05   72.17   74.04   72.17      75.46
tr_imst    57.12  56.87   57.02   57.12   58.12      58.75
ar_padt    67.83  66.67   66.89   66.92   68.04      68.14
cs_cac     83.89  82.23   83.13   83.17   82.89      83.57
en_ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.60           18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.70        75.80           75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa_seraji  81.18        81.12           121064
bg_btb     84.53        84.55           124336
en_ewt     75.77        75.682          204585
ar_padt    68.02        68.14           223881
de_gsd     71.59        71.32           263804
ca_ancora  85.89        85.874          417587
es_ancora  84.99        84.78           444617
cs_cac     83.57        83.63           472608
cs_pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log probability of the gold moves is maximized

[Figure: Tree-stack LSTM overview]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
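The two regimes can be sketched as a single training step that differs only in which move advances the parser: both compute the loss on the gold move, but the dynamic oracle follows the model's own prediction. `score` and `gold_move` below are illustrative stand-ins, not the thesis API:

```python
import math
import random

def train_step(state, score, gold_move, follow_model):
    """One oracle step: the loss is -log softmax(gold move); the next
    move follows the model's argmax (dynamic) or the gold move (static)."""
    scores = score(state)
    log_z = math.log(sum(math.exp(s) for s in scores.values()))
    gold = gold_move(state)
    loss = log_z - scores[gold]  # negative log-probability of the gold move
    move = max(scores, key=scores.get) if follow_model else gold
    return loss, move

random.seed(1)
score = lambda state: {"shift": random.random(), "left": random.random(), "right": random.random()}
gold_move = lambda state: "shift"

static_loss, static_next = train_step(0, score, gold_move, follow_model=False)
dynamic_loss, dynamic_next = train_step(0, score, gold_move, follow_model=True)
```

With the dynamic oracle the parser visits states its own mistakes produce, which is what makes training on predicted moves attractive for error recovery.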

Static vs Dynamic Oracle Training

Figure: Results are very close for fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
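Projectivity can be checked directly: a dependency tree is projective iff no two of its arcs cross. A small sketch, assuming `heads[i]` gives the head of word i+1 with 0 standing for the root (an assumption about the input format, not the thesis code):

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (0 = root); True iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:  # properly crossing spans
                return False
    return True
```

For example, `is_projective([2, 0, 2])` holds, while `is_projective([3, 0, 2])` does not: the arc from word 3 to word 1 crosses the root arc to word 2.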

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7          79.39       55.03 (20)
eu_bdt       95.13         84.22       74.13 (17)
hu_szeged    97.8          82.66       68.18 (14)
da_ddt       98.26         86.28       76.40 (17)
en_gum       99.6          85.05       76.44 (15)
gl_treegal   100           74.25       70.45 (10)
gl_ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

The Tree-stack LSTM performed better for low-resource languages

When the training dataset size increases, the tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Omer Kırnap (Koc University) MSc Thesis September 27 2018 47 123

Context and Word Embeddings

Relative contributions of part-of-speech (p) word vector (v)context vector (c)

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Context vectors provide independent contribution on top ofPOS tags

Omer Kırnap (Koc University) MSc Thesis September 27 2018 48 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677p-fb 747 797 663

p-v-c 793 832 742

Our BiLSTM language model word vectors perform betterthan FB vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats Hungarian En-ParTUT Latvianp 636 766 559

v 735 759 63

c 722 76 635

v-c 76 79 676

p-c 78 825 706

p-v 766 808 677

p-fb 747 797 663

p-v-c 793 832 742

Both POS tags and context vectors have significantcontributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent whole parsing history with featureextracting

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code (train tokens)   MLP     Tree-stack
ru_taiga (10k)             58.89   60.55
hu_szeged (20k)            66.21   68.18
tr_imst (50k)              56.78   58.75
ar_padt (120k)             67.83   68.14
en_ewt (205k)              74.87   75.77
cs_cac (473k)              83.39   83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only).

Only Action LSTM

Figure: Only the Action-LSTM.

Only β-LSTM

Figure: Only the β-LSTM.

Only σ-LSTM

Figure: Only the σ-LSTM.

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu_szeged   66.21   66.87         66.94    67.03
sv_lines    71.12   72.05         72.17    72.45
tr_imst     57.12   56.87         57.02    57.12
ar_padt     67.83   66.67         66.89    66.92
cs_cac      83.89   82.23         83.13    83.17
en_ewt      75.54   75.43         75.56    75.67

Table: Comparison between the MLP and the "Only" models.

Ablation of t-RNN

Figure: Full Tree-stack LSTM architecture; this ablation removes the t-RNN component.

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code (train tokens)   without t-RNN   with t-RNN
no_nynorsklia (3k)         51.78           53.33
ru_taiga (11k)             59.13           60.55
gl_treegal (15k)           69.76           70.45
hu_szeged (20k)            66.12           68.18
sv_lines (49k)             74.04           75.46
tr_imst (50k)              58.12           58.75
ar_padt (120k)             68.04           68.14
en_ewt (204k)              74.87           75.77
cs_cac (473k)              82.89           83.57
cs_pdt (1M)                81.17           81.164

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv_lines    71.12   72.05    72.17    74.04    72.17       75.46
tr_imst     57.12   56.87    57.02    57.12    58.12       58.75
ar_padt     67.83   66.67    66.89    66.92    68.04       68.14
cs_cac      83.89   82.23    83.13    83.17    82.89       83.57
en_ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with the t-RNN makes Tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does the Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3,583
ru_taiga        58.32         60.55            10,479
sme_giella      52.78         53.39            16,385
la_perseus      49.93         51.6             18,184
ug_udt          52.78         53.39            19,262
sl_sst          46.72         48.77            19,473
hu_szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48,325
fr_sequoia      84.36         82.17            50,543
en_gum          76.44         75.34            53,686
ko_gsd          73.74         72.54            56,687
eu_bdt          74.55         73.32            72,974
nl_lassysmall   76.7          75.8             75,134
gl_ctg          79.02         79.018           79,327
lv_lvtb         72.33         72.24            80,666
id_gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121,064
bg_btb      84.53         84.55            124,336
en_ewt      75.77         75.682           204,585
ar_padt     68.02         68.14            223,881
de_gsd      71.59         71.32            263,804
ca_ancora   85.89         85.874           417,587
es_ancora   84.99         84.78            444,617
cs_cac      83.57         83.63            472,608
cs_pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.
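The contrast can be sketched as a single training step; this is an illustrative stand-in (the functions `gold_moves_fn` and `scores_fn`, and the exploration scheme, are my own assumptions, not the thesis implementation):

```python
# Sketch: one training step under a static vs. dynamic oracle. The loss
# always targets a gold move; the two oracles differ only in which move the
# parser *follows* to reach the next state.
import random

def train_step(state, gold_moves_fn, scores_fn, dynamic=False, explore=1.0):
    gold = gold_moves_fn(state)                  # gold moves from this state
    scores = scores_fn(state)                    # model log-scores per move
    loss = -max(scores[m] for m in gold)         # maximize log p of a gold move
    if dynamic and random.random() < explore:
        follow = max(scores, key=scores.get)     # follow the model's prediction
    else:
        follow = max(gold, key=lambda m: scores[m])  # follow a gold move
    return loss, follow
```

With a dynamic oracle the parser is thus trained on states its own (possibly wrong) predictions reach, while the loss still pushes toward gold moves.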

Figure: Full Tree-stack LSTM architecture.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition-based parsers can only build projective trees.6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
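Projectivity can be checked directly from a head array. The helper below is my own sketch (assuming 1-indexed words with head 0 for the root): an arc (h, d) is projective iff every word strictly between h and d is a descendant of h.

```python
# Check whether a dependency tree, given as heads[i] = head of word i+1
# (0 = root), is projective.
def is_projective(heads):
    n = len(heads)
    for d in range(1, n + 1):
        h = heads[d - 1]
        lo, hi = min(h, d), max(h, d)
        for k in range(lo + 1, hi):       # every word strictly between h and d
            a = k
            while a != 0 and a != h:      # climb from k toward the root
                a = heads[a - 1]
            if a != h:                    # k is not dominated by h: crossing arc
                return False
    return True
```

For example, heads = [3, 4, 0, 3] encodes the crossing arcs 3→1 and 4→2, so it is non-projective.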


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)
grc_perseus   90.7           79.39        55.03 (20)
eu_bdt        95.13          84.22        74.13 (17)
hu_szeged     97.8           82.66        68.18 (14)
da_ddt        98.26          86.28        76.40 (17)
en_gum        99.6           85.05        76.44 (15)
gl_treegal    100            74.25        70.45 (10)
gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7 From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, Tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention across the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions



Context and Word Embeddings

Relative contributions of part-of-speech (p), word vector (v), and context vector (c):

Feats   Hungarian   En-ParTUT   Latvian
p       63.6        76.6        55.9
v       73.5        75.9        63
c       72.2        76          63.5
v-c     76          79          67.6
p-c     78          82.5        70.6
p-v     76.6        80.8        67.7
p-fb    74.7        79.7        66.3
p-v-c   79.3        83.2        74.2

Context vectors provide an independent contribution on top of POS tags.
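The "context vector" of a word (the hidden states of forward and backward language models around it, concatenated) can be sketched with a toy recurrent cell; a plain tanh RNN stands in for the thesis's BiLSTM here, and all weights are random placeholders:

```python
# Toy sketch: a word's context vector = forward RNN state over the words
# before it, concatenated with backward RNN state over the words after it.
import numpy as np

rng = np.random.default_rng(3)
D = 4
emb = {w: rng.normal(size=D) for w in ["economic", "news", "had", "effect"]}
Wf = rng.normal(size=(D, 2 * D))   # forward cell weights (placeholder)
Wb = rng.normal(size=(D, 2 * D))   # backward cell weights (placeholder)

def run(words, W):
    """Run a simple tanh RNN over words; return the list of hidden states."""
    h, out = np.zeros(D), []
    for w in words:
        h = np.tanh(W @ np.concatenate([h, emb[w]]))
        out.append(h)
    return out

def context_vector(words, i):
    """Forward state over words[:i] plus backward state over words[i+1:]."""
    fwd = run(words[:i], Wf)[-1] if i > 0 else np.zeros(D)
    bwd = run(words[i + 1:][::-1], Wb)[-1] if i < len(words) - 1 else np.zeros(D)
    return np.concatenate([fwd, bwd])
```

The key design point illustrated: the context vector deliberately excludes the word itself, so it is complementary to the word vector, which is what the p-v-c rows above measure.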

Context and Word embeddings

(Table repeated from the previous slide.)

Our BiLSTM language model word vectors perform better than the Facebook (fb) vectors.

Context and Word embeddings

(Table repeated from the previous slide.)

Both POS tags and context vectors make significant contributions on top of word vectors.

Issues with MLP

However:

Choosing the correct state of the parser still remains critical.

We are unable to represent the whole parsing history with feature extraction.

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack.

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies:

CoNLL17

• Koç University team, with the MLP Parser using Context Embeddings

CoNLL18

• KParse team, with the Tree-stack LSTM Parser using Context and Morph-feat Embeddings

(c) Tree-stack LSTM Parser (CoNLL18)

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al. 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al. 2013].

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy


Tree-stack LSTM Overview

Figure: Tree-stack LSTM overview.

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Tree-stack LSTM

Input Representation


Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

• The character-based LSTM's word vectors

• The word-based BiLSTM's context vectors

• Part-of-speech (POS) vectors

• Morph-feat vectors
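The concatenation above can be sketched in a few lines of numpy; all lookup tables and dimensions below are illustrative placeholders, not the thesis's actual sizes:

```python
# Minimal sketch of the concatenated input representation: word vector,
# context vector, POS vector, and morph-feat vector joined into one input.
import numpy as np

rng = np.random.default_rng(0)
char_lstm_vec = {"news": rng.normal(size=100)}        # character-LSTM word vector
context_vec   = {"news": rng.normal(size=200)}        # BiLSTM LM context vector
pos_vec       = {"NOUN": rng.normal(size=32)}         # part-of-speech embedding
feat_vec      = {"Number=Sing": rng.normal(size=32)}  # morph-feat embedding

def word_input(word, pos, feats):
    """Concatenate the four embeddings into one parser input vector."""
    return np.concatenate([char_lstm_vec[word], context_vec[word],
                           pos_vec[pos], feat_vec[feats]])
```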


Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  IT  It

Figure: Morph-feat Embeddings.
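One simple way to turn such a UD FEATS string into a single vector is to embed each key=value pair and sum; the summing choice is my assumption for illustration, since the slide only shows that the FEATS string maps to an embedding:

```python
# Sketch: embed each key=value pair of a UD FEATS string and sum them into
# one morph-feat vector.
import numpy as np

rng = np.random.default_rng(1)
DIM = 16
feat_emb = {}   # lazily grown table: one vector per key=value pair

def morph_feat_vector(feats):
    vec = np.zeros(DIM)
    if feats == "_":                  # UD writes "_" when a word has no features
        return vec
    for pair in feats.split("|"):     # e.g. "Case=Nom", "Number=Sing"
        if pair not in feat_emb:
            feat_emb[pair] = rng.normal(size=DIM)
        vec += feat_emb[pair]
    return vec
```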


Tree-stack LSTM

Model Components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

β-LSTM

Figure: The β-LSTM highlighted within the full architecture.

β-LSTM

Figure: Buffer's β-LSTM, running over the buffer words w_i, w_i+1, w_i+2.

σ-LSTM

Figure: The σ-LSTM highlighted within the full architecture.

σ-LSTM

Figure: Stack's σ-LSTM, running over the stack words s_i, s_i+1, s_i+2.

Action-LSTM

Figure: The Action-LSTM highlighted within the full architecture.

Action-LSTM

Figure: Action-LSTM, running over the sequence of past transitions.

How are the components of the Tree-stack LSTM connected?

Tree-RNN


Tree-RNN (t-RNN)

Figure: The t-RNN composes the head word, dependency relation, and dependent word vectors.

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn)   (1)
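Equation (1) renders directly in numpy; the dimension D and the random weights below are illustrative placeholders:

```python
# The t-RNN composition of Eq. (1): old head vector, dependency-label
# vector, and dependent vector are concatenated and passed through tanh.
import numpy as np

rng = np.random.default_rng(2)
D = 8
W_rnn = rng.normal(size=(D, 3 * D))
b_rnn = rng.normal(size=D)

def t_rnn(w_head_old, d_label, w_dep):
    """w_head_new = tanh(W_rnn · [w_head_old; d_label; w_dep] + b_rnn)"""
    x = np.concatenate([w_head_old, d_label, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)
```

Because the output has the same dimension as the head input, the new head vector can be composed again when the head later receives another dependent.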


Tree-RNN with

1. Left Transition
2. Right Transition

Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The t-RNN calculates the new head embedding.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The β-LSTM recalculates its hidden state based on the new input.

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition.

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The t-RNN calculates the new head embedding.

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

(p = POS tags, v = word vectors, c = context vectors, fb = Facebook vectors)

Our BiLSTM language model word vectors perform better than the Facebook (FB) vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 49 123

Context and Word embeddings

Feats  Hungarian  En-ParTUT  Latvian
p      63.6       76.6       55.9
v      73.5       75.9       63.0
c      72.2       76.0       63.5
v-c    76.0       79.0       67.6
p-c    78.0       82.5       70.6
p-v    76.6       80.8       67.7
p-fb   74.7       79.7       66.3
p-v-c  79.3       83.2       74.2

Both POS tags and context vectors have significant contributions on top of word vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 50 123

Issues with MLP

However

Choosing correct state of parser still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

Two Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM. Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123
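As a concrete illustration, such a lookup table might be built as below; the dimension (8) and the action/relation inventories shown are placeholder choices for this sketch, not the thesis settings:

```python
import random

random.seed(0)
ACTIONS = ["shift", "left-arc", "right-arc"]
DEPRELS = ["nsubj", "obj", "amod"]  # illustrative subset of the 37 UD relations

def make_table(names, dim=8):
    # one trainable dense vector per symbol, randomly initialized
    return {n: [random.uniform(-0.1, 0.1) for _ in range(dim)] for n in names}

action_emb = make_table(ACTIONS)
deprel_emb = make_table(DEPRELS)
```

In a full implementation these vectors would be parameters updated by backpropagation rather than fixed random values.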

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
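The concatenation above can be sketched as follows (the four sub-vectors and their sizes are toy placeholders; in the real model each comes from its own trained network):

```python
def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    # a word's input vector is the concatenation of the four sources
    return char_vec + context_vec + pos_vec + feat_vec

x = word_representation([0.1, 0.2], [0.3, 0.4], [0.5], [0.6])
# a 6-dimensional input vector in this toy setting
```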

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
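One plausible way to turn such a feature string into a single vector is to embed each key=value pair and sum; whether the model sums or concatenates the per-feature vectors is not shown on this slide, so summation and the lookup table here are assumptions of this sketch:

```python
def morph_feat_vector(feats, table, dim=4):
    # sum the embedding of every key=value morphological feature
    vec = [0.0] * dim
    for feat in feats.split("|"):
        emb = table.get(feat, [0.0] * dim)  # unseen features contribute zeros
        vec = [v + e for v, e in zip(vec, emb)]
    return vec

table = {"Case=Nom": [1, 0, 0, 0], "Number=Sing": [0, 1, 0, 0]}
v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs", table)
# v == [1.0, 1.0, 0.0, 0.0]: only the two known features contribute
```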

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
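Equation (1) can be sketched in plain Python as follows, with toy dimensions and weight values; W_rnn is stored as a list of rows:

```python
import math

def trnn_compose(w_head, d_label, w_dep, W_rnn, b_rnn):
    # new head embedding = tanh(W_rnn * [head; label; dep] + b_rnn)
    x = w_head + d_label + w_dep  # concatenation of the three vectors
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]

# head/label/dependent embeddings of size 2, output of size 2
W_rnn = [[0.1] * 6, [0.2] * 6]
b_rnn = [0.0, 0.0]
new_head = trnn_compose([1.0, 0.0], [0.0, 1.0], [1.0, 1.0], W_rnn, b_rnn)
```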

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head
Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
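The left-transition sequence above can be sketched with list-based stack and buffer; the t-RNN composition of the new head embedding is elided here:

```python
def left_arc(stack, buffer, arcs, label):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the stack top s is popped and attached as a dependent of the buffer front b
    s = stack.pop()
    b = buffer[0]
    arcs.append((b, label, s))

stack, buffer, arcs = ["news"], ["had", "effect"], []
left_arc(stack, buffer, arcs, "nsubj")
# stack == [], buffer unchanged, arcs == [("had", "nsubj", "news")]
```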

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
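The right transition mirrors the left one; a sketch in the same list-based style (t-RNN composition again elided):

```python
def right_arc(stack, buffer, arcs, label):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the stack top t is popped and attached as a dependent of s, the item below it
    t = stack.pop()
    s = stack[-1]
    arcs.append((s, label, t))

stack, buffer, arcs = ["had", "effect"], [], []
right_arc(stack, buffer, arcs, "obj")
# stack == ["had"], arcs == [("had", "obj", "effect")]
```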

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
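Putting it together, the component hidden states are concatenated and fed to an MLP that scores transitions. A minimal stand-in uses a single linear layer with toy weights and a hypothetical transition inventory:

```python
TRANSITIONS = ["shift", "left-arc", "right-arc"]

def predict_transition(sigma_h, beta_h, action_h, W, b):
    # concatenate the LSTM hidden states and score each transition
    x = sigma_h + beta_h + action_h
    scores = [sum(w * xi for w, xi in zip(row, x)) + bi
              for row, bi in zip(W, b)]
    return TRANSITIONS[scores.index(max(scores))]

# identity-like toy weights: each transition "reads" one hidden dimension
W = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
b = [0.0, 0.0, 0.0]
move = predict_transition([0.2], [0.9], [0.1], W, b)
# move == "left-arc" in this toy setting
```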

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only-A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
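The bucketing itself is straightforward; a small sketch, using token counts that appear in the experiment tables of this section:

```python
def bucket(n_tokens):
    # assign a language to one of the four training-size buckets above
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

assert bucket(10479) == "<20k"        # ru taiga
assert bucket(97531) == "50k-100k"    # id gsd
assert bucket(1173282) == ">=100k"    # cs pdt
```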

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, the log-probability of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
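The distinction can be sketched at the level of a single parser decision: the loss term is identical in both regimes; only the move actually executed differs (the probabilities below are hypothetical, for illustration):

```python
import math

def train_step(probs, gold, dynamic):
    # both regimes maximize log p(gold); they differ only in which move
    # is executed to reach the next parser state
    loss = -math.log(probs[gold])
    executed = max(probs, key=probs.get) if dynamic else gold
    return loss, executed

probs = {"shift": 0.6, "left-arc": 0.3, "right-arc": 0.1}
loss_s, move_s = train_step(probs, "left-arc", dynamic=False)  # follows gold
loss_d, move_d = train_step(probs, "left-arc", dynamic=True)   # follows prediction
# loss_s == loss_d, but move_s == "left-arc" while move_d == "shift"
```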

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
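Projectivity can be checked by testing whether any two arcs cross; an O(n²) sketch, where heads[i] is the head of token i, 0 denotes the root, and index 0 is unused:

```python
def is_projective(heads):
    # collect arcs as (left endpoint, right endpoint) pairs
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads) if d > 0]
    # two arcs cross iff their endpoints interleave: a < c < b < d
    return not any(a < c < b < d
                   for (a, b) in arcs for (c, d) in arcs)

assert is_projective([0, 2, 0, 2])          # 1<-2, 2<-root, 3<-2
assert not is_projective([0, 3, 4, 0, 3])   # arcs (1,3) and (2,4) cross
```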

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language     Projectivity %  Best (LAS)  Our (LAS)
grc perseus  90.7            79.39       55.03 (20)
eu bdt       95.13           84.22       74.13 (17)
hu szeged    97.8            82.66       68.18 (14)
da ddt       98.26           86.28       76.40 (17)
en gum       99.6            85.05       76.44 (15)
gl treegal   100             74.25       70.45 (10)
gl ctg       100             82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention over the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Issues with MLP

However

Choosing the correct parser state still remains critical

We are unable to represent the whole parsing history with feature extraction

Omer Kırnap (Koc University) MSc Thesis September 27 2018 51 123

Solution

Find a recurrent architecture that can summarize the parsing history as well as the word sequences in the buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17

• Koc-University team with MLP Parser using Context Embeddings

CoNLL18

• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123
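The stack LSTM's key property can be sketched as follows. This is a toy illustration, not Dyer et al.'s actual implementation: a simple tanh recurrence stands in for a full LSTM cell, and the point is that popping rewinds the summary to its previous state.

```python
import numpy as np

class StackRNN:
    """Toy stack RNN: push runs one recurrent update, pop rewinds it.
    A plain tanh update stands in for a full LSTM cell."""
    def __init__(self, dim, seed=0):
        rng = np.random.default_rng(seed)
        self.W = rng.normal(scale=0.1, size=(dim, 2 * dim))
        self.states = [np.zeros(dim)]          # h_0 at the bottom

    def push(self, x):
        prev = self.states[-1]
        self.states.append(np.tanh(self.W @ np.concatenate([prev, x])))

    def pop(self):
        self.states.pop()                      # restore the previous summary

    def summary(self):
        return self.states[-1]

s = StackRNN(8)
h0 = s.summary().copy()
s.push(np.ones(8))
s.pop()            # summary is back to h0
```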

Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of the LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only use word2vec embeddings [Mikolov et al., 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose the Tree-stack LSTM model with 4 components:

β-LSTM
σ-LSTM
Action-LSTM
Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
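The concatenation above can be sketched in a few lines. The dimension names and sizes here are illustrative assumptions, not the thesis's actual hyperparameters:

```python
import numpy as np

# Assumed (illustrative) embedding sizes for the four sources.
CHAR_DIM, CTX_DIM, POS_DIM, FEAT_DIM = 350, 300, 50, 50

def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four vector sources into one input vector."""
    assert char_vec.shape == (CHAR_DIM,)
    assert context_vec.shape == (CTX_DIM,)
    assert pos_vec.shape == (POS_DIM,)
    assert feat_vec.shape == (FEAT_DIM,)
    return np.concatenate([char_vec, context_vec, pos_vec, feat_vec])

vec = word_representation(np.zeros(CHAR_DIM), np.zeros(CTX_DIM),
                          np.zeros(POS_DIM), np.zeros(FEAT_DIM))
```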

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
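A UD feature string such as the one in the figure can be embedded by splitting it on `|` and combining per-feature vectors. Whether the thesis sums, averages, or concatenates these vectors is not stated here, so averaging is an assumption of this sketch, as is the embedding size:

```python
import numpy as np

rng = np.random.default_rng(0)
FEAT_DIM = 50          # assumed embedding size
feat_table = {}        # one vector per feature=value pair, created on first use

def morph_feat_vector(feats):
    """Embed a UD FEATS string like 'Case=Nom|Gender=Neut|Number=Sing'
    by averaging per-feature vectors (averaging is an assumption)."""
    if feats == "_":                      # word with no morphological features
        return np.zeros(FEAT_DIM)
    vecs = []
    for pair in feats.split("|"):
        if pair not in feat_table:
            feat_table[pair] = rng.normal(size=FEAT_DIM)
        vecs.append(feat_table[pair])
    return np.mean(vecs, axis=0)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
```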

Tree-stack LSTM

Model Components
1 β-LSTM
2 σ-LSTM
3 Action-LSTM
4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i+2 w_i+1 w_i

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old; d_l; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
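Equation (1) can be written out directly. The vector sizes below are illustrative assumptions; only the functional form comes from the slide:

```python
import numpy as np

D = 100    # assumed word-vector size
DL = 20    # assumed dependency-relation embedding size

rng = np.random.default_rng(0)
W_rnn = rng.normal(scale=0.1, size=(D, 2 * D + DL))
b_rnn = np.zeros(D)

def t_rnn(w_head_old, d_l, w_dep):
    """Equation (1): compose head, relation and dependent into a new head vector."""
    x = np.concatenate([w_head_old, d_l, w_dep])   # [w_head_old; d_l; w_dep]
    return np.tanh(W_rnn @ x + b_rnn)

w_new = t_rnn(rng.normal(size=D), rng.normal(size=DL), rng.normal(size=D))
```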

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

Head Dependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
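The left transition above can be sketched on plain Python lists (a minimal sketch of the transition rule only; the real model backs the stack and buffer with LSTMs and runs the t-RNN composition shown in the figures):

```python
def left(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    pop s from the stack and attach it to the buffer front b with label d."""
    assert stack and buffer, "left needs a non-empty stack and buffer"
    s = stack.pop()          # dependent
    b = buffer[0]            # head stays at the front of the buffer
    arcs.add((b, d, s))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1], [2, 3], set()
stack, buffer, arcs = left(stack, buffer, arcs, "nsubj")
```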

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
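The right transition has the same list-based sketch: it pops the top of the stack and attaches it to the item below it (again, only the transition rule; the LSTM and t-RNN updates are omitted):

```python
def right(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop t and attach it to s, the next item on the stack, with label d."""
    assert len(stack) >= 2, "right needs two items on the stack"
    t = stack.pop()          # dependent
    s = stack[-1]            # head
    arcs.add((s, d, t))
    return stack, buffer, arcs

stack, buffer, arcs = [0, 1, 2], [3], set()
stack, buffer, arcs = right(stack, buffer, arcs, "obj")
```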

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
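Putting the pieces together, the decision rule in the overview can be sketched as: the summaries of the three LSTMs are concatenated and fed to an MLP that scores the next transition. Hidden sizes, the unlabeled 3-way action set, and the use of final hidden states as summaries are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
H = 64               # assumed hidden size of each component LSTM
N_ACTIONS = 3        # shift, left, right (unlabeled sketch)

W1 = rng.normal(scale=0.1, size=(128, 3 * H)); b1 = np.zeros(128)
W2 = rng.normal(scale=0.1, size=(N_ACTIONS, 128)); b2 = np.zeros(N_ACTIONS)

def score_transitions(h_sigma, h_beta, h_action):
    """Concat the σ-, β- and action-LSTM summaries, score with an MLP."""
    x = np.concatenate([h_sigma, h_beta, h_action])
    h = np.tanh(W1 @ x + b1)
    logits = W2 @ h + b2
    probs = np.exp(logits - logits.max())       # stable softmax
    return probs / probs.sum()

p = score_transitions(rng.normal(size=H), rng.normal(size=H), rng.normal(size=H))
```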

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation
17 universal part-of-speech tags
37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences: 1 Train/test split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the treebank is improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
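The difference between the two regimes can be sketched in one training step: the loss term is -log p(gold move) either way; the regimes differ only in which move is executed to produce the next parser state. The `ToyModel`, its `predict` interface, and the exploration rate are assumptions of this sketch:

```python
import math
import random

class ToyModel:
    """Stand-in for the transition classifier (assumed interface)."""
    def predict(self, state):
        return {"shift": 0.5, "left": 0.3, "right": 0.2}

def train_step(state, model, gold_move, dynamic, rng, explore=0.5):
    """One oracle-training step: the loss is -log p(gold move) in both
    regimes; only the executed move differs."""
    probs = model.predict(state)
    loss = -math.log(probs[gold_move])
    if dynamic and rng.random() < explore:
        executed = max(probs, key=probs.get)   # follow the model's prediction
    else:
        executed = gold_move                   # follow the gold oracle
    return loss, executed

loss, move = train_step(None, ToyModel(), "left", dynamic=False,
                        rng=random.Random(0))
```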

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3 Using my own word and context vectors, trained on a different language but from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
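Projectivity is easy to test: a tree is projective iff no two dependency arcs cross. A compact O(n²) sketch, assuming 1-based token indices with 0 as the artificial root:

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens 1..n, 0 = root).
    A tree is projective iff no two dependency arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    return not any(l1 < l2 < r1 < r2        # strict interleaving = crossing
                   for l1, r1 in arcs
                   for l2, r2 in arcs)

ok = is_projective([2, 0, 2])       # tokens 1 and 3 both attach to token 2
bad = is_projective([3, 4, 0, 3])   # arcs (1,3) and (2,4) cross
```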

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work & Discussions

Solution

Find a recurrent architecture such that it can summarize the parsinghistory as well as word sequences in a buffer and stack

Omer Kırnap (Koc University) MSc Thesis September 27 2018 52 123

Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to UniversalDependencies

CoNLL17

bull Koc-University team with MLP Parser using Context Embeddings

CoNLL18

bull KParse team with Tree-stack LSTM Parser usingContext and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM over the buffer words w_i, w_{i+1}, w_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM over the stack words s_i, s_{i+1}, s_{i+2}

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123
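As the overview figure indicates, the component states are concatenated and scored by an MLP. A hand-rolled sketch with nested lists as matrices; the layer sizes and the tanh nonlinearity are assumptions:

```python
import math

def mlp(x, W1, b1, W2, b2):
    # scores = W2 @ tanh(W1 @ x + b1) + b2
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(W2, b2)]

# Concatenated component states: sigma, beta, action (2 toy dims each).
features = [0.1, 0.2] + [0.3, 0.4] + [0.5, 0.6]
```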

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN combining the head word, dependency relation and dependent word

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
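Equation (1) translates directly into code; a sketch with plain lists, where the concatenation order [w_head; d_l; w_dep] follows the equation and the dimensions are illustrative:

```python
import math

def t_rnn(w_head, d_rel, w_dep, W, b):
    # w_head_new = tanh(W_rnn @ [w_head; d_rel; w_dep] + b_rnn) -- eq. (1)
    x = w_head + d_rel + w_dep  # concatenation
    return [math.tanh(sum(wi * xi for wi, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]
```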

Tree-RNN with:

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ (b, d, s))

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ (s, d, t))

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
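The two transitions above can be sketched as destructive operations on a (stack, buffer, arcs) state, with arcs stored as (head, label, dependent) triples and integer indices standing in for words; this is an illustrative sketch, not the thesis code:

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A): pop stack top s, attach it under buffer front b.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A): pop stack top t, attach it under new stack top s.
    t = stack.pop()
    arcs.add((stack[-1], d, t))

stack, buffer, arcs = [0, 1, 2], [3, 4], set()
left_arc(stack, buffer, arcs, "nsubj")   # word 2 attaches to buffer front 3
right_arc(stack, buffer, arcs, "obj")    # word 1 attaches to word 0
```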

Final overview of Tree-stack LSTM

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 Introduction
Overview of Dependency Parsing
Transition Based Dependency Parsing

2 Related Work
Linear Models and their Drawbacks
Neural Network Models

3 Model
Language Model
MLP Parser
Tree-stack LSTM Parser

4 Results
MLP vs Tree-stack LSTM
Morphological Feature Embeddings
Static vs Dynamic Oracle Training
Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation (17 universal part-of-speech tags, 37 universal dependency relations)
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th of all and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having tokens in between 50k and 100k

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of the gold moves is maximized

Figure: Tree-stack LSTM overview

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
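The contrast can be sketched in a few lines (illustrative only; in the real parser `scores` is the MLP output and `apply_move` is a parser-state transition). The loss is the negative log-probability of the gold move in both regimes; only the move used to advance the state differs:

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    z = sum(es)
    return [e / z for e in es]

def train_step(state, scores, gold_move, apply_move, dynamic):
    loss = -math.log(softmax(scores)[gold_move])  # maximize log p(gold)
    # Static oracle advances with the gold move; dynamic with the prediction.
    move = max(range(len(scores)), key=scores.__getitem__) if dynamic else gold_move
    return apply_move(state, move), loss

advance = lambda st, m: st + [m]  # toy "state": the list of moves taken
static_state, l1 = train_step([], [2.0, 0.5], 1, advance, dynamic=False)
dynamic_state, l2 = train_step([], [2.0, 0.5], 1, advance, dynamic=True)
```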

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al 2017]

3. Using my own word and context vectors trained with a different language, but from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
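Projectivity can be tested directly: a tree is projective iff no two arcs cross when drawn above the sentence. A small sketch (not from the thesis), where heads[i] is the head of word i+1 and 0 denotes the root:

```python
def is_projective(heads):
    # Arc for dependent d (1-indexed) with head h, viewed as an interval.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False  # the two arcs cross
    return True
```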

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Model Overview

2 Shared Tasks for Multilingual Parsing from Raw Text to Universal Dependencies

CoNLL17
• Koc-University team with MLP Parser using Context Embeddings

CoNLL18
• KParse team with Tree-stack LSTM Parser using Context and Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 53 123

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ, β, A) with an LSTM
Modify the head word's embedding with the dependent's embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees. 6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
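The projectivity constraint can be checked directly: a tree is projective iff no two dependency arcs cross. A small sketch (the head-array encoding, with 0 standing for the artificial root, is an assumption for illustration):

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (1-based words, 0 = root).
    Projective iff no two arcs cross when drawn above the sentence."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # arc 2 starts strictly inside arc 1 and ends outside it
                return False
    return True
```

A transition-based parser as described here can only produce head arrays for which this check returns True.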

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language      Projectivity %   Best (LAS)   Ours (LAS)
grc perseus   90.7             79.39        55.03 (20)
eu bdt        95.13            84.22        74.13 (17)
hu szeged     97.8             82.66        68.18 (14)
da ddt        98.26            86.28        76.40 (17)
en gum        99.6             85.05        76.44 (15)
gl treegal    100              74.25        70.45 (10)
gl ctg        100              82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7. From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
  • Related Work
    • Linear Models and their Drawbacks
    • Neural Network Models
  • Model
    • Language Model
    • MLP Parser
    • Tree-stack LSTM Parser
  • Results
    • MLP vs Tree-stack LSTM
    • Morphological Feature Embeddings
    • Static vs Dynamic Oracle Training
    • Transfer Learning
  • Conclusion
  • Future Work & Discussions

c Tree-stack LSTM Parser (CoNLL18)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 54 123

Related Work - Stack LSTM

Figure: Stack LSTM [Dyer et al., 2015]

Represent each component (σ, β, A) with an LSTM.
Modify the head word's embedding with the dependent's embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify the stack's word embeddings.

Hidden states of the LSTMs are not updated unless a reduce occurs.

Actions are not explicitly represented.

They only used word2vec embeddings [Mikolov et al., 2013].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview — the t-RNN combines the head word, dependent word, and dependency relation; the β-, σ-, and Action-LSTM outputs are concatenated and fed to an MLP]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector.

Every dependency relation is represented with a continuous vector.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
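As a sketch, the concatenation step looks like this; the four input vectors and their tiny dimensions are made up for illustration — the actual embedding sizes are not given on this slide.

```python
def word_representation(char_vec, context_vec, pos_vec, feat_vec):
    """Concatenate the four sources into a single input vector.
    Plain Python lists stand in for the real embedding tensors."""
    return char_vec + context_vec + pos_vec + feat_vec

# Toy vectors: char-LSTM output, BiLSTM context, POS embedding, morph-feat embedding.
v = word_representation([0.1] * 4, [0.2] * 6, [0.3] * 2, [0.4] * 2)
```

The resulting vector is what the β- and σ-LSTMs consume for each word.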

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
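A UD FEATS string like the one above is a `|`-separated list of `Feature=Value` pairs. One plausible way to turn it into a vector is sketched below; the summing scheme and the toy embedding table are assumptions, not necessarily what the thesis does.

```python
def parse_feats(feats):
    """Split a UD FEATS string into a {feature: value} dict ('_' = none)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feat_vector(feats, emb, dim=3):
    """Sum the embeddings of each feature=value pair; unseen pairs add zeros."""
    vec = [0.0] * dim
    for pair in parse_feats(feats).items():
        for i, x in enumerate(emb.get(pair, [0.0] * dim)):
            vec[i] += x
    return vec
```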

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM overview (β-LSTM component)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

[Figure: Buffer's β-LSTM running over the words w_i, w_i+1, w_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM overview (σ-LSTM component)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

[Figure: Stack's σ-LSTM running over the stack items s_i, s_i+1, s_i+2]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM overview (Action-LSTM component)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

[Figure: Action-LSTM running over previous transitions]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

[Figure: t-RNN combining the head word, dependent word, and dependency relation]

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
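Equation (1) can be sketched directly; the dimensions and the weight layout (one row of W_rnn per output unit, plain lists for vectors) are toy assumptions.

```python
import math

def t_rnn(w_head, d_label, w_dep, W, b):
    """New head embedding = tanh(W_rnn · [head; label; dep] + b_rnn), as in Eq. (1).
    W is a list of rows; all vectors are plain Python lists."""
    x = w_head + d_label + w_dep                         # concatenation [w_head_old; d_l; w_dep]
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]
```

After a transition, the head's slot in the stack or buffer is overwritten with this output, so the LSTM above it sees the subtree rather than just the head word.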

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Figure: parser state before the left transition]

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123
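In list form, the two transitions (plus the usual shift) can be sketched as below. The state encoding — a stack and buffer of word indices with 0 as the artificial root, and arcs as (head, label, dependent) triples — is an assumed simplification of the slides' σ, β, A.

```python
def shift(stack, buffer, arcs):
    """shift: move the buffer front onto the stack."""
    stack.append(buffer.pop(0))

def left_arc(stack, buffer, arcs, label):
    """left_d(σ|s, b|β, A) -> (σ, b|β, A ∪ {(b, d, s)}):
    the stack top s becomes a dependent of the buffer front b."""
    s = stack.pop()
    arcs.append((buffer[0], label, s))

def right_arc(stack, buffer, arcs, label):
    """right_d(σ|s|t, β, A) -> (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t becomes a dependent of s, the word below it."""
    t = stack.pop()
    arcs.append((stack[-1], label, t))
```

For a two-word sentence headed by root 0, the sequence shift, left, shift, right yields the arcs (2, label, 1) and (0, label, 2).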

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

[Figure: parser state before the right transition]

Figure: Each embedding is initiated by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM overview — the t-RNN combines the head word, dependent word, and dependency relation; the β-, σ-, and Action-LSTM outputs are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

                       CoNLL17                           CoNLL18
Coverage               81 treebanks in 49 languages      82 treebanks in 57 languages
Annotation             standardized across treebanks     standardized across treebanks
POS tags               17 universal part-of-speech tags  17 universal part-of-speech tags
Relations              37 universal dependency relations 37 universal dependency relations
Koç University rank    7th of 33 participants (1st       16th of 30 participants (2nd
                       among transition-based parsers)   among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split, 2. annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
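The numbers throughout these tables are labeled attachment scores (LAS). A minimal sketch of the metric, with an assumed input format of one (head, label) pair per word:

```python
def las(gold, pred):
    """Labeled attachment score: percentage of words whose predicted
    head AND dependency label both match the gold annotation."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```

The unlabeled variant (UAS) would compare heads only and ignore the labels.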

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (MLP only)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between the MLP and the "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM overview (t-RNN component)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does the Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings:
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens per language, to better understand our contributions:

Languages having fewer than 20k tokens

Languages having more than 20k but fewer than 50k tokens

Languages having more than 50k but fewer than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having fewer than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.6             18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Not useful for languages having fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48,325
fr sequoia      84.36         82.17            50,543
en gum          76.44         75.34            53,686
ko gsd          73.74         72.54            56,687
eu bdt          74.55         73.32            72,974
nl lassysmall   76.7          75.8             75,134
gl ctg          79.02         79.018           79,327
lv lvtb         72.33         72.24            80,666
id gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Related Work - Stack LSTM

Figure Stack LSTM [Dyer et al 2015]

Represent each component (σ β A) with an LSTMModifying head wordrsquos embedding with dependent embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 55 123

Problems with Stack LSTM

They only modify stackrsquos word embeddings

Hidden states of LSTMS are not updated unless reduce

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture, with the t-RNN composing the head word, dependent word, and dependency relation]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no_nynorsklia (3k)  51.78          53.33
ru_taiga (11k)      59.13          60.55
gl_treegal (15k)    69.76          70.45
hu_szeged (20k)     66.12          68.18
sv_lines (49k)      74.04          75.46
tr_imst (50k)       58.12          58.75
ar_padt (120k)      68.04          68.14
en_ewt (204k)       74.87          75.77
cs_cac (473k)       82.89          83.57
cs_pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
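The four-way partition above can be sketched as a simple lookup. A minimal sketch with boundaries taken from the list; the function name and bucket labels are ours, not the thesis code:

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four experimental groups,
    by number of training tokens (boundaries from the slide)."""
    if n_tokens < 20_000:
        return "<20k"
    elif n_tokens < 50_000:
        return "20k-50k"
    elif n_tokens < 100_000:
        return "50k-100k"
    else:
        return ">=100k"
```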

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia  51.13        53.33           3583
ru_taiga       58.32        60.55           10479
sme_giella     52.78        53.39           16385
la_perseus     49.93        51.6            18184
ug_udt         52.78        53.39           19262
sl_sst         46.72        48.77           19473
hu_szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48325
fr_sequoia     84.36        82.17           50543
en_gum         76.44        75.34           53686
ko_gsd         73.74        72.54           56687
eu_bdt         74.55        73.32           72974
nl_lassysmall  76.7         75.8            75134
gl_ctg         79.02        79.018          79327
lv_lvtb        72.33        72.24           80666
id_gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121064
bg_btb      84.53        84.55           124336
en_ewt      75.77        75.682          204585
ar_padt     68.02        68.14           223881
de_gsd      71.59        71.32           263804
ca_ancora   85.89        85.874          417587
es_ancora   84.99        84.78           444617
cs_cac      83.57        83.63           472608
cs_pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.

[Figure: Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
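The two training regimes above differ only in which move advances the parser; the loss is the same. A minimal sketch (the function and action names are ours, not the thesis code):

```python
import math

def train_step(gold_action, probs, dynamic=False):
    """One oracle-training step (illustrative sketch).

    probs: dict mapping each transition to the model's probability
    in the current state. The loss always maximizes log p of the
    gold move; only the action used to advance the parser differs.
    """
    loss = -math.log(probs[gold_action])      # -log p(gold move)
    if dynamic:
        # dynamic oracle: follow the model's own (possibly wrong) prediction
        action_taken = max(probs, key=probs.get)
    else:
        # static oracle: always follow the gold move
        action_taken = gold_action
    return loss, action_taken
```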

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens above 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees.6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
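Projectivity, as used above, means no two dependency arcs cross. A small check over a CoNLL-style head array can make this concrete (a sketch; the function is ours, not the thesis code):

```python
def is_projective(heads):
    """Check projectivity of a dependency tree.

    heads[i] is the head of token i+1 (tokens are 1-based, 0 = root),
    the usual CoNLL convention. Two arcs cross iff exactly one endpoint
    of one lies strictly inside the other, i.e. l1 < l2 < r1 < r2 for
    some ordered pair of arcs.
    """
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:   # crossing arcs -> non-projective
                return False
    return True
```

For example, a left-to-right chain is projective, while a tree with arcs (1,3) and (2,4) is not.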

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc_perseus  90.7%         79.39       55.03 (20)
eu_bdt       95.13%        84.22       74.13 (17)
hu_szeged    97.8%         82.66       68.18 (14)
da_ddt       98.26%        86.28       76.40 (17)
en_gum       99.6%         85.05       76.44 (15)
gl_treegal   100%          74.25       70.45 (10)
gl_ctg       100%          82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.7

7. From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Problems with Stack LSTM

They only modify the stack's word embeddings

Hidden states of LSTMs are not updated unless a reduce occurs

Actions are not explicitly represented

They only used word2vec embeddings [Mikolov et al. 2013]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 56 123

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

[Figure: Tree-stack LSTM overview: σ-LSTM (stack), β-LSTM (buffer), and Action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation]

We propose the Tree-stack LSTM model with 4 components:

1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with a continuous vector

Every dependency relation is represented with a continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: Tree-stack LSTM with the β-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, an LSTM running over the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

[Figure: Tree-stack LSTM with the σ-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, an LSTM running over the stack entries s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: Tree-stack LSTM with the Action-LSTM highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM, an LSTM running over the transition history

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of the tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN, combining the head word, dependent word, and dependency relation

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
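Eq. (1) is a single tanh layer over the concatenation of the old head embedding, the relation embedding d_l, and the dependent embedding. A pure-Python sketch (the function name and toy dimensions are ours):

```python
import math

def trnn_compose(W, b, head, dep_rel, dep):
    """t-RNN update from Eq. (1): the new head embedding is
    tanh(W_rnn · [head ; rel ; dep] + b_rnn).

    W is a list of rows; head, dep_rel, dep are plain lists whose
    concatenation matches the row length of W."""
    x = head + dep_rel + dep                      # [w_head_old ; d_l ; w_dep]
    return [math.tanh(sum(wij * xj for wij, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]
```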

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: The stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
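The left and right transitions walked through above operate directly on the stack σ, buffer β, and arc set A. A sketch on plain Python lists; `shift` is the standard third transition, not shown on these slides, and the function names are ours:

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes head of the stack top s."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second-topmost s becomes head of the top t."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

def shift(stack, buffer):
    # standard shift: move the buffer front onto the stack
    stack.append(buffer.pop(0))
```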

Final overview of Tree-stack LSTM

[Figure: Final overview of the Tree-stack LSTM architecture]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Our solution

We propose

Context embeddings should improve parsing accuracy

Dependency relations should be explicitly represented

Morphological Features of a word may enhance parsing accuracy

Omer Kırnap (Koc University) MSc Thesis September 27 2018 57 123

Tree-stack LSTM Overview

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

We propose Tree-stack LSTM model with 4 components

β-LSTMσ-LSTMAction-LSTMTree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 58 123

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not include explicit feature extractor We initiated wordrepresentation by concatenating

Character Based LSTMrsquos word vectors

Word Based BiLSTMrsquos context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123

Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected


Tree-RNN


Tree-RNN (t-RNN)

Figure: t-RNN combines the head word, dependent word, and dependency relation embeddings into a new head embedding

w_head_new = tanh(W_rnn * [w_head_old ; d_l ; w_dep] + b_rnn)    (1)
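Eq. (1) can be written out directly; a sketch with numpy, where the dimension d and the random initialization are placeholders:

```python
import numpy as np

d = 64  # embedding size (hypothetical)
rng = np.random.default_rng(1)
W_rnn = rng.normal(scale=0.1, size=(d, 3 * d))  # learned projection
b_rnn = np.zeros(d)

def t_rnn(w_head_old, d_l, w_dep):
    """Eq. (1): new head embedding from the old head embedding,
    the dependency-label embedding d_l, and the dependent embedding."""
    return np.tanh(W_rnn @ np.concatenate([w_head_old, d_l, w_dep]) + b_rnn)

new_head = t_rnn(rng.normal(size=d), rng.normal(size=d), rng.normal(size=d))
print(new_head.shape)  # (64,)
```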


Tree-RNN with

1. Left Transition
2. Right Transition


Left Transition


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings
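The left transition above, and the analogous right transition shown later, can be sketched as pure functions on a configuration (stack, buffer, arc set); word indices stand in for the word vectors, and the LSTM/t-RNN updates are omitted:

```python
def left(config, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the stack top s becomes a d-dependent of the buffer front b and is popped."""
    stack, buffer, arcs = config
    s, b = stack[-1], buffer[0]
    return (stack[:-1], buffer, arcs | {(b, d, s)})

def right(config, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t becomes a d-dependent of the word s below it and is popped."""
    stack, buffer, arcs = config
    s, t = stack[-2], stack[-1]
    return (stack[:-1], buffer, arcs | {(s, d, t)})

# Toy configuration: stack [1, 2], buffer [3, 4], no arcs yet.
config = ([1, 2], [3, 4], frozenset())
stack, buffer, arcs = left(config, "nsubj")   # adds arc 3 -nsubj-> 2
print(stack, buffer, sorted(arcs))            # [1] [3, 4] [(3, 'nsubj', 2)]
```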


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready to predict the next transition

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Final overview of Tree-stack LSTM

[Figure: Tree-stack LSTM architecture: β-LSTM, σ-LSTM and Action-LSTM outputs are concatenated with the t-RNN head/dependent/relation embeddings and fed to an MLP]

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

4. Results & Comparisons


Results amp Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change; 2. Annotation

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code | MLP | Tree-stack
ru_taiga (10k) | 58.89 | 60.55
hu_szeged (20k) | 66.21 | 68.18
tr_imst (50k) | 56.78 | 58.75
ar_padt (120k) | 67.83 | 68.14
en_ewt (205k) | 74.87 | 75.77
cs_cac (473k) | 83.39 | 83.57

Tree-stack LSTM outperforms MLP


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP only)

Only Action LSTM

Figure: Only Action-LSTM

Only β-LSTM

Figure: Only β-LSTM

Only σ-LSTM

Figure: Only σ-LSTM

Ablation Analysis Results

Lang Code | MLP | Only Action | Only-β | Only-σ
hu_szeged | 66.21 | 66.87 | 66.94 | 67.03
sv_lines | 71.12 | 72.05 | 72.17 | 72.45
tr_imst | 57.12 | 56.87 | 57.02 | 57.12
ar_padt | 67.83 | 66.67 | 66.89 | 66.92
cs_cac | 83.89 | 82.23 | 83.13 | 83.17
en_ewt | 75.54 | 75.43 | 75.56 | 75.67

Table: Comparison between MLP and "Only" models

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture with the t-RNN component highlighted]

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code | without t-RNN | with t-RNN
no_nynorsklia (3k) | 51.78 | 53.33
ru_taiga (11k) | 59.13 | 60.55
gl_treegal (15k) | 69.76 | 70.45
hu_szeged (20k) | 66.12 | 68.18
sv_lines (49k) | 74.04 | 75.46
tr_imst (50k) | 58.12 | 58.75
ar_padt (120k) | 68.04 | 68.14
en_ewt (204k) | 74.87 | 75.77
cs_cac (473k) | 82.89 | 83.57
cs_pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis

Lang | MLP | Only A | Only-β | Only-σ | w/o t-RNN | all
hu_szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv_lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr_imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar_padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs_cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en_ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
no_nynorsklia | 51.13 | 53.33 | 3583
ru_taiga | 58.32 | 60.55 | 10479
sme_giella | 52.78 | 53.39 | 16385
la_perseus | 49.93 | 51.60 | 18184
ug_udt | 52.78 | 53.39 | 19262
sl_sst | 46.72 | 48.77 | 19473
hu_szeged | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
sv_lines | 72.18 | 74.81 | 48325
fr_sequoia | 84.36 | 82.17 | 50543
en_gum | 76.44 | 75.34 | 53686
ko_gsd | 73.74 | 72.54 | 56687
eu_bdt | 74.55 | 73.32 | 72974
nl_lassysmall | 76.7 | 75.8 | 75134
gl_ctg | 79.02 | 79.018 | 79327
lv_lvtb | 72.33 | 72.24 | 80666
id_gsd | 75.76 | 73.97 | 97531

Beneficial for languages with 50k-100k training tokens

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa_seraji | 81.18 | 81.12 | 121064
bg_btb | 84.53 | 84.55 | 124336
en_ewt | 75.77 | 75.682 | 204585
ar_padt | 68.02 | 68.14 | 223881
de_gsd | 71.59 | 71.32 | 263804
ca_ancora | 85.89 | 85.874 | 417587
es_ancora | 84.99 | 84.78 | 444617
cs_cac | 83.57 | 83.63 | 472608
cs_pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log probability of gold moves is maximized.
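The two regimes differ only in which move is followed after the loss is computed; a schematic sketch with a stub scorer, where the `explore` rate and the loss form are illustrative assumptions:

```python
import random

def train_epoch(configs, score, oracle_best, dynamic, explore=0.9):
    """Schematic oracle training loop. score(c) -> {action: score} comes from
    the parser (a stub below); oracle_best(c) -> gold-optimal action.
    Static oracle: always follow the gold move. Dynamic oracle: sometimes
    follow the model's own prediction, but still maximize the gold move's score."""
    loss = 0.0
    for c in configs:                    # stand-in for per-sentence transition loops
        gold = oracle_best(c)
        scores = score(c)
        loss += -scores[gold]            # maximize (log-)score of the gold move
        if dynamic and random.random() > explore:
            followed = max(scores, key=scores.get)   # model's predicted move
        else:
            followed = gold                          # gold move
        # ... `followed` would now be applied to reach the next configuration ...
    return loss

score = lambda c: {"shift": 0.2, "left": 0.5, "right": 0.3}   # stub parser scores
oracle = lambda c: "left"                                     # stub oracle
print(train_epoch(range(5), score, oracle, dynamic=False))    # -2.5
```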

[Figure: Tree-stack LSTM architecture]

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af_afribooms | not provided | 75.46 | 77.43 | 78.12
kk_ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr_bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr_mg | 20.12 | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

Transition based parsers can only build projective trees.

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
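Projectivity can be checked directly: a tree is projective iff no two dependency arcs cross when drawn above the sentence. A small sketch, using 1-based word indices and 0 for the root:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (1-based words; 0 = root).
    The tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (a, b) in arcs:
        for (c, e) in arcs:
            if a < c < b < e:      # arcs (a, b) and (c, e) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # True:  no crossing arcs
print(is_projective([3, 4, 0, 3]))  # False: arcs (1,3) and (2,4) cross
```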


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language | Projectivity | Best (LAS) | Our (LAS)
grc_perseus | 90.7 | 79.39 | 55.03 (20)
eu_bdt | 95.13 | 84.22 | 74.13 (17)
hu_szeged | 97.8 | 82.66 | 68.18 (14)
da_ddt | 98.26 | 86.28 | 76.40 (17)
en_gum | 99.6 | 85.05 | 76.44 (15)
gl_treegal | 100 | 74.25 | 70.45 (10)
gl_ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7. From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution to transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Thank you for your attention


Questions

            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Tree-stack LSTM

Input Representation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 59 123

Input Representation

Action and Dependency Relation Embeddings

Every action is represented with continuous vector

Every dependency relation is represented with continuous vector

Omer Kırnap (Koc University) MSc Thesis September 27 2018 60 123

Input Representation

We do not use an explicit feature extractor. We initialize each word representation by concatenating:

Character-based LSTM's word vectors

Word-based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
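The concatenation above can be sketched as follows. This is a minimal illustration only: the function name and all dimensions are hypothetical, not the thesis's actual sizes.

```python
import numpy as np

def word_representation(char_lstm_vec, context_vec, pos_vec, morph_feat_vec):
    """Concatenate the four component vectors into one word representation.
    The four inputs stand in for the character-LSTM word vector, the
    word-level BiLSTM context vector, the POS embedding, and the
    morph-feat embedding."""
    return np.concatenate([char_lstm_vec, context_vec, pos_vec, morph_feat_vec])

# Example with illustrative dimensions:
w = word_representation(np.zeros(350), np.zeros(300), np.zeros(128), np.zeros(128))
print(w.shape)  # (906,)
```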

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
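One way to read the figure above: each Feature=Value pair in the FEATS string gets its own embedding, and the per-feature vectors are combined into a single morph-feat vector. The sketch below is a hypothetical implementation (randomly initialized table, concatenation with zero padding); the thesis's actual combination scheme may differ.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 16          # illustrative per-feature embedding size
embeddings = {}   # hypothetical table, one vector per "Feature=Value" pair

def morph_feat_vector(feats: str, n_slots: int = 6) -> np.ndarray:
    """Embed each Feature=Value pair and concatenate, padding unused slots."""
    pairs = feats.split("|") if feats != "_" else []
    vecs = []
    for pair in pairs[:n_slots]:
        if pair not in embeddings:
            embeddings[pair] = rng.normal(size=DIM)
        vecs.append(embeddings[pair])
    while len(vecs) < n_slots:
        vecs.append(np.zeros(DIM))  # pad missing features with zeros
    return np.concatenate(vecs)

v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs")
print(v.shape)  # (96,)
```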

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i+2, w_i+1, w_i

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

s_i, s_i+1, s_i+2

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

w_head_new = tanh(W_rnn ∗ [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
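The head update of Eq. (1) can be sketched in NumPy; the weight initialization and the dimensions below are illustrative assumptions, not the trained parameters.

```python
import numpy as np

D, R = 64, 32  # illustrative sizes: word vector dim and relation vector dim
rng = np.random.default_rng(1)
W_rnn = rng.normal(scale=0.1, size=(D, 2 * D + R))
b_rnn = np.zeros(D)

def trnn_update(w_head_old, d_l, w_dep):
    """Eq. (1): fold a dependent and its relation label into the head vector."""
    x = np.concatenate([w_head_old, d_l, w_dep])
    return np.tanh(W_rnn @ x + b_rnn)

w_head_new = trnn_update(rng.normal(size=D), rng.normal(size=R), rng.normal(size=D))
print(w_head_new.shape)  # (64,)
```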

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
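The left_d and right_d transitions above can be sketched as plain list operations on a configuration (σ, β, A). The toy sentence and relation labels below are hypothetical; this shows only the bookkeeping, not the neural scoring.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    buffer front b becomes head of stack top s; s is popped."""
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    second stack item s becomes head of stack top t; t is popped."""
    t = stack.pop()
    arcs.add((stack[-1], d, t))

def shift(stack, buffer):
    stack.append(buffer.pop(0))

# Toy run on word indices 1..3 (0 = ROOT):
stack, buffer, arcs = [0], [1, 2, 3], set()
shift(stack, buffer)                    # σ=[0,1], β=[2,3]
left_arc(stack, buffer, arcs, "nsubj")  # word 2 heads word 1
shift(stack, buffer)
shift(stack, buffer)
right_arc(stack, buffer, arcs, "obj")   # word 2 heads word 3
right_arc(stack, buffer, arcs, "root")  # ROOT heads word 2
print(sorted(arcs))  # [(0, 'root', 2), (2, 'nsubj', 1), (2, 'obj', 3)]
```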

Final overview of Tree-stack LSTM

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru_taiga (10k)   58.89  60.55
hu_szeged (20k)  66.21  68.18
tr_imst (50k)    56.78  58.75
ar_padt (120k)   67.83  68.14
en_ewt (205k)    74.87  75.77
cs_cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3583
ru_taiga        58.32        60.55           10479
sme_giella      52.78        53.39           16385
la_perseus      49.93        51.6            18184
ug_udt          52.78        53.39           19262
sl_sst          46.72        48.77           19473
hu_szeged       66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
sv_lines        72.18        74.81           48325
fr_sequoia      84.36        82.17           50543
en_gum          76.44        75.34           53686
ko_gsd          73.74        72.54           56687
eu_bdt          74.55        73.32           72974
nl_lassysmall   76.7         75.8            75134
gl_ctg          79.02        79.018          79327
lv_lvtb         72.33        72.24           80666
id_gsd          75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121064
bg_btb      84.53        84.55           124336
en_ewt      75.77        75.682          204585
ar_padt     68.02        68.14           223881
de_gsd      71.59        71.32           263804
ca_ancora   85.89        85.874          417587
es_ancora   84.99        84.78           444617
cs_cac      83.57        83.63           472608
cs_pdt      81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases, log p of the gold moves is maximized

(Diagram: β-LSTM, σ-LSTM, and Action-LSTM outputs are concatenated with the t-RNN head and dependent representations and fed to an MLP that predicts the next transition.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
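The distinction can be sketched as a single training loop in which the loss always targets the gold move, while the followed move differs. `predict` and `apply_move` are hypothetical stand-ins for the parser's scoring and transition functions, and the exploration schedule is an assumption.

```python
import math
import random

def train_sentence(predict, apply_move, config, gold_moves,
                   dynamic=False, p_explore=0.1):
    """One sentence of oracle training.  `predict(config)` returns a dict
    move -> probability; `apply_move(config, move)` returns the next
    configuration.  In BOTH modes the loss maximizes log p of the gold
    move; the modes differ only in which move is actually followed."""
    loss = 0.0
    for gold in gold_moves:
        probs = predict(config)
        loss += -math.log(probs[gold])            # -log p(gold move)
        if dynamic and random.random() < p_explore:
            followed = max(probs, key=probs.get)  # follow model prediction
        else:
            followed = gold                       # follow gold (static path)
        config = apply_move(config, followed)
    return loss

# Toy stand-ins: a 'config' is just a step counter.
uniform = lambda cfg: {"shift": 0.5, "left": 0.25, "right": 0.25}
step = lambda cfg, move: cfg + 1
print(round(train_sentence(uniform, step, 0, ["shift", "left"]), 3))  # 2.079
```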

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible strategies for transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af_afribooms   not provided  75.46  77.43  78.12
kk_ktb         20.19         22.31  21.96  23.86
bxr_bdt        7.64          9.76   9.93   8.98
kmr_mg         20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
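Projectivity can be checked directly: a tree is projective iff no two arcs cross when drawn above the sentence. A simple O(n²) sketch (the head-array encoding is an assumption of this example, with 0 denoting ROOT):

```python
def is_projective(heads):
    """heads[i-1] = head of word i (words 1-based; 0 = ROOT).
    Returns True iff no two arcs strictly cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, (a, b) in enumerate(arcs):
        for c, e in arcs[i + 1:]:
            if a < c < b < e or c < a < e < b:  # strictly crossing spans
                return False
    return True

print(is_projective([2, 0, 2]))     # True: all arcs nested
print(is_projective([0, 4, 1, 1]))  # False: arc (2,4) crosses arc (1,3)
```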

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios

Language      Projectivity (%)  Best (LAS)  Our (LAS)
grc_perseus   90.7              79.39       55.03 (20)
eu_bdt        95.13             84.22       74.13 (17)
hu_szeged     97.8              82.66       68.18 (14)
da_ddt        98.26             86.28       76.40 (17)
en_gum        99.6              85.05       76.44 (15)
gl_treegal    100               74.25       70.45 (10)
gl_ctg        100               82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding different ways to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Input Representation

We do not include an explicit feature extractor. We initialize the word representation by concatenating:

Character based LSTM's word vectors

Word based BiLSTM's context vectors

Part-of-speech (POS) vectors

Morph-feat vectors

Omer Kırnap (Koc University) MSc Thesis September 27 2018 61 123
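The four-way concatenation above can be sketched directly; the function name and all dimensions below are illustrative assumptions, not the sizes used in the thesis:

```python
import numpy as np

def make_input_vector(char_word_vec, context_vec, pos_vec, morph_feat_vec):
    """Concatenate the four sources into one token representation:
    char-LSTM word vector, BiLSTM context vector, POS and morph-feat embeddings."""
    return np.concatenate([char_word_vec, context_vec, pos_vec, morph_feat_vec])

# Illustrative dimensions only
token = make_input_vector(np.zeros(350), np.zeros(300), np.zeros(128), np.zeros(128))
print(token.shape)  # (906,)
```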

Input Representation

Morph-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123
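A morph-feat string like the one above is a `|`-separated list of `Key=Value` features; one way to embed it is to look up and sum a vector per feature. This is a hedged sketch: whether features are summed or concatenated, and the embedding dimension, are assumptions rather than the thesis's exact design.

```python
import numpy as np

def morph_feat_vector(feats, table, dim=64, rng=np.random.default_rng(0)):
    """Sum one embedding per Key=Value morphological feature.
    `table` maps 'Key=Value' strings to vectors and grows on demand."""
    vec = np.zeros(dim)
    if feats == "_":                 # UD convention for "no features"
        return vec
    for feat in feats.split("|"):
        if feat not in table:
            table[feat] = rng.normal(size=dim)
        vec += table[feat]
    return vec

table = {}
v = morph_feat_vector("Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs", table)
print(len(table))  # 5
```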

Tree-stack LSTM

Model Components:
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

Figure Tree-stack LSTM architecture with the β-LSTM highlighted (component outputs are concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

w_i   w_{i+1}   w_{i+2}

Figure Buffer's β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

Figure Tree-stack LSTM architecture with the σ-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

s_i   s_{i+1}   s_{i+2}

Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

Figure Tree-stack LSTM architecture with the Action-LSTM highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure t-RNN: its inputs are the head word, dependent word, and dependency relation embeddings

w_head^new = tanh(W_rnn · [w_head^old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
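Equation (1) is a one-line composition; a minimal NumPy sketch (the weight scale and dimensions are illustrative assumptions):

```python
import numpy as np

def t_rnn(w_head, w_dep, d_label, W, b):
    """Equation (1): new head vector from the old head, dependency-label
    and dependent vectors, via one tanh layer."""
    x = np.concatenate([w_head, d_label, w_dep])
    return np.tanh(W @ x + b)

dim, lab = 100, 20
rng = np.random.default_rng(1)
W = rng.normal(scale=0.1, size=(dim, 2 * dim + lab))
b = np.zeros(dim)
new_head = t_rnn(rng.normal(size=dim), rng.normal(size=dim), rng.normal(size=lab), W, b)
print(new_head.shape)  # (100,)
```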

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Left transition: each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure Right transition: each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
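The left and right transitions defined above, plus the usual shift, can be sketched as pure functions on a (stack, buffer, arcs) state. The toy token indices and labels in the run below are illustrative only:

```python
def shift(stack, buffer, arcs):
    """shift: move the buffer front onto the stack."""
    return stack + [buffer[0]], buffer[1:], arcs

def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the buffer front b becomes head of the stack top s, which is popped."""
    s, b = stack[-1], buffer[0]
    return stack[:-1], buffer, arcs | {(b, d, s)}

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the second stack item s becomes head of the top t, which is popped."""
    s, t = stack[-2], stack[-1]
    return stack[:-1], buffer, arcs | {(s, d, t)}

# Toy run: 0 = ROOT, tokens 1..3
stack, buf, arcs = [0], [1, 2, 3], set()
stack, buf, arcs = shift(stack, buf, arcs)              # stack [0, 1]
stack, buf, arcs = left_arc(stack, buf, arcs, "amod")   # arc 2 -> 1
stack, buf, arcs = shift(stack, buf, arcs)              # stack [0, 2]
stack, buf, arcs = shift(stack, buf, arcs)              # stack [0, 2, 3]
stack, buf, arcs = right_arc(stack, buf, arcs, "obj")   # arc 2 -> 3
print(arcs)  # {(2, 'amod', 1), (2, 'obj', 3)}
```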

Final overview of Tree-stack LSTM

Figure Complete Tree-stack LSTM: σ-LSTM, β-LSTM and Action-LSTM outputs are concatenated with the t-RNN head/dependent representations and fed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table Comparison between MLP and the "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure Tree-stack LSTM architecture with the t-RNN component highlighted

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code             without t-RNN   with t-RNN
no nynorsklia (3k)    51.78           53.33
ru taiga (11k)        59.13           60.55
gl treegal (15k)      69.76           70.45
hu szeged (20k)       66.12           68.18
sv lines (49k)        74.04           75.46
tr imst (50k)         58.12           58.75
ar padt (120k)        68.04           68.14
en ewt (204k)         74.87           75.77
cs cac (473k)         82.89           83.57
cs pdt (1M)           81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training set size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset v2.2 into 4 parts based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.6             18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48,325
fr sequoia      84.36         82.17            50,543
en gum          76.44         75.34            53,686
ko gsd          73.74         72.54            56,687
eu bdt          74.55         73.32            72,974
nl lassysmall   76.7          75.8             75,134
gl ctg          79.02         79.018           79,327
lv lvtb         72.33         72.24            80,666
id gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of gold moves is maximized

Figure Tree-stack LSTM architecture (σ-LSTM, β-LSTM, Action-LSTM and t-RNN outputs concatenated and fed to an MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
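The two regimes differ only in which move advances the parser state; the loss always maximizes the log-probability of the gold move. A sketch with illustrative stand-ins for the model, oracle, and state (none of these names come from the thesis code):

```python
import math

def train_sentence(state, gold_oracle, model, dynamic):
    """Accumulate -log p(gold move); advance with the gold move (static)
    or with the model's argmax move (dynamic)."""
    loss = 0.0
    while not state.is_final():
        probs = model(state)                    # dict: move -> probability
        gold = gold_oracle(state)
        loss += -math.log(probs[gold])          # same objective in both regimes
        move = max(probs, key=probs.get) if dynamic else gold
        state = state.apply(move)
    return loss

class ToyState:                                  # illustrative 3-step "sentence"
    def __init__(self, n): self.n = n
    def is_final(self): return self.n == 0
    def apply(self, move): return ToyState(self.n - 1)

model = lambda s: {"shift": 0.6, "left": 0.4}
oracle = lambda s: "shift"
print(round(train_sentence(ToyState(3), oracle, model, dynamic=True), 3))  # 1.532
```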

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.6

6. Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
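A tree is projective iff no two arcs cross when drawn above the sentence. A small check with 1-based token indices and 0 for the root (a sketch, not the thesis's code):

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens 1..n, head 0 = root).
    The tree is projective iff no two dependency arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:    # arcs (l1,r1) and (l2,r2) cross
                return False
    return True

print(is_projective([2, 0, 2]))      # True
print(is_projective([3, 4, 0, 3]))   # False: arcs (1,3) and (2,4) cross
```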

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity %   Best (LAS)   Our (LAS)
grc perseus   90.7             79.39        55.03 (20)
eu bdt        95.13            84.22        74.13 (17)
hu szeged     97.8             82.66        68.18 (14)
da ddt        98.26            86.28        76.40 (17)
en gum        99.6             85.05        76.44 (15)
gl treegal    100              74.25        70.45 (10)
gl ctg        100              82.12        79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7. From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123
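The LAS figures throughout these tables count a token as correct only when both its predicted head and its dependency label match gold; a minimal sketch with illustrative data:

```python
def las(gold, pred):
    """Labeled attachment score over (head, label) pairs, in percent."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

gold = [(2, "amod"), (0, "root"), (2, "obj"), (3, "case")]
pred = [(2, "amod"), (0, "root"), (2, "nmod"), (3, "case")]
print(las(gold, pred))  # 75.0
```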

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low resource languages

As the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gomez-Rodriguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Input Representation

Morp-feat Vectors

Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs IT It

Figure Morph-feat Embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 62 123

Tree-stack LSTM

Model Components1 β-LSTM2 σ-LSTM3 Action-LSTM4 Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.6            18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.7         75.8            75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions using gold moves
Dynamic oracle: transitions using predicted moves

In both cases, the log-probability of the gold moves is maximized

[Figure: full tree-stack LSTM architecture (σ-LSTM, β-LSTM, action-LSTM, t-RNN, Concat, MLP)]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
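The distinction can be sketched as follows. `model_probs` is a hypothetical scorer, and this is a simplified view: both regimes minimize the negative log-probability of the gold move, but differ in which move is executed to reach the next configuration. (A true dynamic oracle also recomputes the optimal move for the current, possibly erroneous, configuration.)

```python
import math

def train_step(state, gold_moves, model_probs, dynamic):
    # Both regimes maximize log p(gold move); they differ only in which
    # move is *executed* to reach the next configuration.
    loss = 0.0
    for step, gold in enumerate(gold_moves):
        probs = model_probs(state, step)            # hypothetical scorer
        loss += -math.log(probs[gold])              # NLL of the gold move
        move = max(probs, key=probs.get) if dynamic else gold
        state = state + [move]                      # execute chosen move
    return state, loss
```

With a scorer that always prefers "shift", static training follows the gold sequence while dynamic training drifts to predicted configurations, even though both accumulate the same gold-move loss at each step.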

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language, but from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees 6

6: Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
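Projectivity can be checked directly from the head indices: a tree is projective iff no two arcs cross. A small illustrative checker, assuming `heads[i-1]` gives the head of 1-based token i (0 for the root):

```python
def is_projective(heads):
    # heads[i-1] = head index of token i (0 = artificial root)
    n = len(heads)
    for d1 in range(1, n + 1):
        h1 = heads[d1 - 1]
        lo1, hi1 = min(d1, h1), max(d1, h1)
        for d2 in range(1, n + 1):
            h2 = heads[d2 - 1]
            lo2, hi2 = min(d2, h2), max(d2, h2)
            # two arcs cross if exactly one endpoint of the second
            # span falls strictly inside the first span
            if lo1 < lo2 < hi1 < hi2:
                return False
    return True
```

For example, `[2, 0, 2]` (both outer tokens attached to the middle one) is projective, while `[3, 4, 0, 3]` contains crossing arcs.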

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7: From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing

Our Tree-stack LSTM outperformed MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better with low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Tree-stack LSTM

Model Components
1. β-LSTM
2. σ-LSTM
3. Action-LSTM
4. Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 63 123

β-LSTM

[Figure: full tree-stack LSTM architecture with the β-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

Figure: Buffer's β-LSTM, encoding the buffer words w_i, w_i+1, w_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123
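As a toy illustration of how the buffer words are folded into a single state, here is a one-dimensional recurrent stand-in for the β-LSTM (an actual LSTM adds gates and a cell state; the weights are illustrative):

```python
import math

def rnn_encode(xs, w_x=0.5, w_h=0.5, b=0.0):
    # h_t = tanh(w_x * x_t + w_h * h_{t-1} + b), folding the buffer
    # words w_i, w_{i+1}, ... into one hidden state.
    h = 0.0
    for x in xs:
        h = math.tanh(w_x * x + w_h * h + b)
    return h
```

The σ-LSTM and Action-LSTM summarize the stack and the transition history in the same recurrent fashion.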

σ-LSTM

[Figure: full tree-stack LSTM architecture with the σ-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

Figure: Stack's σ-LSTM, encoding the stack items s_i, s_i+1, s_i+2

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

[Figure: full tree-stack LSTM architecture with the Action-LSTM component highlighted]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

Figure: Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

Figure: t-RNN, combining the head word, dependent word, and dependency relation

w_head_new = tanh(W_rnn * [w_head_old; d_l; w_dep] + b_rnn)    (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
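Equation (1) can be sketched in plain Python with toy dimensions; `W` and `b` stand in for W_rnn and b_rnn, and lists stand in for vectors:

```python
import math

def t_rnn(w_head, w_dep, d_rel, W, b):
    # w_head_new = tanh(W_rnn * [w_head; d_rel; w_dep] + b_rnn)
    x = w_head + d_rel + w_dep          # list concatenation = vector concat
    return [math.tanh(sum(W[i][j] * x[j] for j in range(len(x))) + b[i])
            for i in range(len(b))]
```

The output replaces the head word's embedding, so repeated reductions accumulate the subtree's structure into the head representation.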

Tree-RNN with

1. Left Transition
2. Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
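The initialization described in the caption can be sketched as follows (hypothetical lookup tables and toy dimensions; the thesis embeddings are larger and learned):

```python
def word_embedding(pos, lang, feats, tables):
    # Concatenate POS, language, and morphological-feature embeddings
    # from illustrative lookup tables of small vectors (lists).
    vec = tables['pos'][pos] + tables['lang'][lang]
    for f in feats:
        vec = vec + tables['feat'][f]
    return vec
```

A word's vector therefore grows with each annotated morphological feature, keeping the sources of information in separate, fixed slots.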

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: β-LSTM recalculates its hidden state based on the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready for the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
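The left and right transitions can be sketched on a (stack, buffer, arcs) configuration; illustrative code following the formulas on the slides, not the thesis implementation (arcs are stored as (head, label, dependent) triples):

```python
def left_arc(stack, buffer, arcs, label):
    # left_d(sigma|s, b|beta, A) = (sigma, b|beta, A ∪ {(b, d, s)})
    s = stack.pop()          # dependent: top of the stack
    b = buffer[0]            # head: front of the buffer
    arcs.add((b, label, s))
    return stack, buffer, arcs

def right_arc(stack, buffer, arcs, label):
    # right_d(sigma|s|t, beta, A) = (sigma|s, beta, A ∪ {(s, d, t)})
    t = stack.pop()          # dependent: top of the stack
    s = stack[-1]            # head: second item on the stack
    arcs.add((s, label, t))
    return stack, buffer, arcs
```

After either transition the affected component (β-LSTM for left, σ-LSTM for right) recomputes its hidden state from the new head embedding produced by the t-RNN.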

Final overview of Tree-stack LSTM

[Figure: final overview of the tree-stack LSTM: σ-LSTM, β-LSTM, and Action-LSTM states and the t-RNN output (head word, dependent word, dependency relation) are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion
6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

β-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 64 123

β-LSTM

LSTM LSTM LSTM

wi+2wi+1wi

Figure Bufferrsquos β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 65 123

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru_taiga (10k)    58.89   60.55
hu_szeged (20k)   66.21   68.18
tr_imst (50k)     56.78   58.75
ar_padt (120k)    67.83   68.14
en_ewt (205k)     74.87   75.77
cs_cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial MLP model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only the action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only the β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only the σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code    MLP     Only Action   Only-β   Only-σ
hu_szeged    66.21   66.87         66.94    67.03
sv_lines     71.12   72.05         72.17    72.45
tr_imst      57.12   56.87         57.02    57.12
ar_padt      67.83   66.67         66.89    66.92
cs_cac       83.89   82.23         83.13    83.17
en_ewt       75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: The full tree-stack LSTM architecture, highlighting the t-RNN component (head word, dependent word, and dependency relation composed into a new head embedding)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang         MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu_szeged    66.21   66.87    66.94    67.03    66.12       68.18
sv_lines     71.12   72.05    72.17    74.04    72.17       75.46
tr_imst      57.12   56.87    57.02    57.12    58.12       58.75
ar_padt      67.83   66.67    66.89    66.92    68.04       68.14
cs_cac       83.89   82.23    83.13    83.17    82.89       83.57
en_ewt       75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: To better understand our contributions, we divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
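The grouping above can be sketched as a small helper. The exact handling of the 20k/50k/100k boundaries is my assumption (e.g., hu_szeged with ~20k tokens sits right on a boundary):

```python
def size_bucket(n_tokens):
    # Four training-size buckets used to group languages in the experiments.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```

For example, ru_taiga (10,479 tokens) lands in the smallest bucket and cs_pdt (over 1M tokens) in the largest.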

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no_nynorsklia   51.13         53.33            3,583
ru_taiga        58.32         60.55            10,479
sme_giella      52.78         53.39            16,385
la_perseus      49.93         51.60            18,184
ug_udt          52.78         53.39            19,262
sl_sst          46.72         48.77            19,473
hu_szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81            48,325
fr_sequoia      84.36         82.17            50,543
en_gum          76.44         75.34            53,686
ko_gsd          73.74         72.54            56,687
eu_bdt          74.55         73.32            72,974
nl_lassysmall   76.70         75.80            75,134
gl_ctg          79.02         79.018           79,327
lv_lvtb         72.33         72.24            80,666
id_gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12            121,064
bg_btb      84.53         84.55            124,336
en_ewt      75.77         75.682           204,585
ar_padt     68.02         68.14            223,881
de_gsd      71.59         71.32            263,804
ca_ancora   85.89         85.874           417,587
es_ancora   84.99         84.78            444,617
cs_cac      83.57         83.63            472,608
cs_pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions are generated from the gold moves.
Dynamic oracle: transitions are generated from the model's predicted moves.

In both cases the log probability of the gold moves is maximized.

Figure: Tree-stack LSTM architecture (σ-LSTM, β-LSTM, and action-LSTM outputs concatenated and fed to an MLP; t-RNN composes head word, dependent word, and dependency relation)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
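The two regimes differ only in which move is applied to advance the parser state; both accumulate the negative log probability of the gold move. A toy sketch with a hypothetical stand-in parser (`ToyParser`, `explore`, and all method names are assumptions, not the thesis code):

```python
import math, random

class ToyParser:
    """Hypothetical stand-in: uniform scores over two moves, gold = shift."""
    def score(self, state):
        return {"shift": 0.5, "reduce": 0.5}
    def gold_move(self, state):
        return "shift"
    def apply(self, state, move):
        return state + 1

def train_step(parser, state, n_steps=3, dynamic=False, explore=0.5, seed=0):
    rng = random.Random(seed)
    loss = 0.0
    for _ in range(n_steps):
        probs = parser.score(state)
        gold = parser.gold_move(state)
        loss -= math.log(probs[gold])          # log p of the gold move is
                                               # maximized in both regimes
        if dynamic and rng.random() < explore:
            move = max(probs, key=probs.get)   # dynamic: follow the prediction
        else:
            move = gold                        # static: follow the gold move
        state = parser.apply(state, move)
    return loss

static_loss = train_step(ToyParser(), 0, dynamic=False)
dynamic_loss = train_step(ToyParser(), 0, dynamic=True)
```

With real scores, the dynamic regime exposes the model to states it will actually reach at test time, which is the motivation for dynamic-oracle training.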

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt        7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
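Projectivity can be tested mechanically: a tree is non-projective iff two of its arcs cross. A small sketch assuming 1-based token indices with 0 marking the artificial root (illustrative only):

```python
def is_projective(heads):
    # heads[i-1] is the head of token i; 0 means the artificial root.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:   # arcs (l1, r1) and (l2, r2) cross
                return False
    return True

ok = is_projective([2, 0, 2])        # all arcs nested: projective
bad = is_projective([0, 4, 1, 1])    # arc 1-3 crosses arc 2-4: non-projective
```

The quadratic pairwise check is enough for sentence-length inputs; it is how one would measure the projectivity ratios reported on the next slide.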

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)
grc_perseus   90.7%          79.39        55.03 (20)
eu_bdt        95.13%         84.22        74.13 (17)
hu_szeged     97.8%          82.66        68.18 (14)
da_ddt        98.26%         86.28        76.40 (17)
en_gum        99.6%          85.05        76.44 (15)
gl_treegal    100%           74.25        70.45 (10)
gl_ctg        100%           82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

• Introduction
  • Overview of Dependency Parsing
  • Transition Based Dependency Parsing
• Related Work
  • Linear Models and their Drawbacks
  • Neural Network Models
• Model
  • Language Model
  • MLP Parser
  • Tree-stack LSTM Parser
• Results
  • MLP vs Tree-stack LSTM
  • Morphological Feature Embeddings
  • Static vs Dynamic Oracle Training
  • Transfer Learning
• Conclusion
• Future Work & Discussions


Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

σ-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 66 123

σ-LSTM

LSTM LSTM LSTM

si si+1 si+2

Figure Stackrsquos σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))


Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
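The left_d and right_d transitions shown on these slides can be sketched as operations on a parser state. This is a minimal illustration of the definitions above; the State layout, the shift transition (introduced earlier in the talk), and all names are my own, not the thesis implementation.

```python
class State:
    """Parser configuration (σ, β, A): stack, buffer, arc set."""
    def __init__(self, words):
        self.stack = []                        # σ: indices of partially processed words
        self.buffer = list(range(len(words)))  # β: indices of unread words
        self.arcs = set()                      # A: (head, label, dependent) triples

def shift(state):
    # Move the buffer front onto the stack.
    state.stack.append(state.buffer.pop(0))

def left(state, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes the head of the stack top s.
    s = state.stack.pop()
    b = state.buffer[0]
    state.arcs.add((b, d, s))

def right(state, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the second stack item s becomes the head of the stack top t.
    t = state.stack.pop()
    s = state.stack[-1]
    state.arcs.add((s, d, t))
```

Parsing a sentence is then a sequence of such transitions until the buffer is empty and a single word remains on the stack.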

Final overview of Tree-stack LSTM

[Diagram: t-RNN outputs over head word, dependent word and dependency relation, plus the LSTM component states, are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
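In the overview above, the component representations are concatenated and an MLP scores candidate transitions. A toy sketch of that final scoring step, with made-up dimensions and weights (not the trained model):

```python
import math

def mlp_scores(sigma_h, beta_h, action_h, W1, b1, W2, b2):
    # Concatenate the component states, apply one tanh hidden layer,
    # then a linear output layer: one score per candidate transition.
    x = sigma_h + beta_h + action_h
    hidden = [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
              for row, b in zip(W1, b1)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(W2, b2)]
```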

Overview

1 Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2 Related Work: Linear Models and their Drawbacks; Neural Network Models

3 Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4 Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
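The scores in these tables are LAS (labeled attachment score): the percentage of words whose predicted head and dependency label both match the gold tree. A minimal computation, with my own encoding of the trees as dicts:

```python
def las(gold, pred):
    # gold, pred: dicts mapping each dependent index to (head index, label).
    correct = sum(1 for i in gold if pred.get(i) == gold[i])
    return 100.0 * correct / len(gold)
```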

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser


Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN


Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv lines    71.12  72.05   72.17   74.04   72.17      75.46
tr imst     57.12  56.87   57.02   57.12   58.12      58.75
ar padt     67.83  66.67   66.89   66.92   68.04      68.14
cs cac      83.89  82.23   83.13   83.17   82.89      83.57
en ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
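The four groups above can be expressed as a simple bucketing function over training-set sizes; the function itself is just an illustration of the boundaries listed on this slide.

```python
def size_bucket(num_tokens):
    # Four experimental groups by number of training tokens.
    if num_tokens < 20_000:
        return "<20k"
    if num_tokens < 50_000:
        return "20k-50k"
    if num_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```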

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of gold moves is maximized.


Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
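The two regimes above differ only in which move is executed after the loss is computed; a minimal sketch of one training step (function and variable names are mine, not the thesis code):

```python
import math

def oracle_step(gold_move, probs, dynamic):
    # Loss is -log p(gold move) under both regimes; the regimes differ
    # only in which move the parser actually executes next.
    loss = -math.log(probs[gold_move])
    if dynamic:
        executed = max(probs, key=probs.get)  # follow the model's prediction
    else:
        executed = gold_move                  # follow the gold transition
    return loss, executed
```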

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors, trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees.6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
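A tree is projective when no two dependency arcs cross. A small check (the token indexing and the heads encoding are my own convention):

```python
def is_projective(heads):
    # heads: dict mapping each token index (1..n) to its head (0 = root).
    arcs = [(min(h, d), max(h, d)) for d, h in heads.items()]
    for a, b in arcs:
        for c, e in arcs:
            # Two arcs cross iff one starts strictly inside the other
            # and ends strictly outside it.
            if a < c < b < e:
                return False
    return True
```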

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


σ-LSTM


Figure Stack's σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 67 123

Action-LSTM


Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM


Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123
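The Action-LSTM summarizes the history of transitions taken so far. As a sketch of that idea, here is a plain tanh RNN standing in for the LSTM, with scalar states for brevity; all names and values are illustrative, not the thesis model.

```python
import math

def encode_actions(actions, emb, w_h, w_x, b):
    # h_t = tanh(w_h * h_{t-1} + w_x * emb[a_t] + b), folded over the
    # sequence of past transitions.
    h = 0.0
    for a in actions:
        h = math.tanh(w_h * h + w_x * emb[a] + b)
    return h
```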

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)


Figure t-RNN

w_head_new = tanh(W_rnn · [w_head_old ; d_l ; w_dep] + b_rnn)   (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123
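Equation (1) composes the old head vector, the dependency-label vector and the dependent vector into a new head vector. A direct transcription in plain Python (toy dimensions and illustrative weights, not the trained parameters):

```python
import math

def trnn_compose(w_head, d_label, w_dep, W_rnn, b_rnn):
    # Eq. (1): w_head_new = tanh(W_rnn · [w_head ; d_label ; w_dep] + b_rnn)
    x = w_head + d_label + w_dep  # vector concatenation
    return [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(W_rnn, b_rnn)]
```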

Tree-RNN with

1 Left Transition
2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))


Figure Each embedding is initialized by concatenating POS, language and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123
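The caption above describes how each stack and buffer element is initialized. A sketch of that lookup-and-concatenate step; summing the vectors of multiple morphological features before concatenation is my assumption, and all table contents are toy values.

```python
def init_embedding(pos, lang, morph_feats, pos_E, lang_E, morph_E):
    # Sum the vectors of all morphological features (assumption),
    # then concatenate [POS ; language ; morph-feat].
    dim = len(next(iter(morph_E.values())))
    morph = [0.0] * dim
    for f in morph_feats:
        morph = [m + v for m, v in zip(morph, morph_E[f])]
    return pos_E[pos] + lang_E[lang] + morph
```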

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))


Figure Stack's top LSTM is reduced
Omer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Action-LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 68 123

Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
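Symmetrically, the right transition can be sketched on the same toy state representation (an illustrative assumption, not the thesis code):

```python
def right_arc(stack, buffer, arcs, label):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t is reduced and becomes a dependent of s, the element below it."""
    t = stack.pop()          # t leaves the stack
    s = stack[-1]            # s stays on the stack as the head
    arcs.add((s, label, t))  # new arc: head s, relation label, dependent t
    return stack, buffer, arcs

stack, buffer, arcs = [0, 2, 5], [7], set()
stack, buffer, arcs = right_arc(stack, buffer, arcs, "obj")
print(stack, arcs)  # [0, 2] {(2, 'obj', 5)}
```

Note the asymmetry with the left transition: here the dependent comes from the stack top and the head stays on the stack, so the buffer is untouched.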

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
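The Concat + MLP step in the final overview can be sketched as follows; the hidden sizes, the number of transitions, and the two-layer MLP shape are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(1)
h = 8              # hidden size of each component LSTM (illustrative)
n_transitions = 5  # number of candidate transitions (illustrative)

# Hidden states of the σ-, β- and action-LSTMs at the current step
# (placeholders; in the parser these come from the running LSTMs).
h_sigma, h_beta, h_action = (rng.normal(size=h) for _ in range(3))

# Two-layer MLP over the concatenated features, as in the overview figure.
W1 = rng.normal(scale=0.1, size=(16, 3 * h)); b1 = np.zeros(16)
W2 = rng.normal(scale=0.1, size=(n_transitions, 16)); b2 = np.zeros(n_transitions)

x = np.concatenate([h_sigma, h_beta, h_action])
scores = W2 @ np.tanh(W1 @ x + b1) + b2
probs = np.exp(scores) / np.exp(scores).sum()  # softmax over transitions
print(probs.shape)  # (5,)
```

The transition with the highest probability is applied to the parser state, and the component LSTMs are then updated as shown in the preceding slides.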

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5 Conclusion
6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17
  Dependency parsing of 81 treebanks in 49 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18
  Dependency parsing of 82 treebanks in 57 languages
  All treebanks use standardized annotation
    17 universal part-of-speech tags
    37 universal dependency relations
  Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1 Train/test split 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1 If the annotation of the treebank has improved, the older parser is handicapped

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
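The four-way split above can be expressed as a small helper; the bucket labels are illustrative, and the boundary handling at exactly 20k/50k/100k is an assumption since the slides only give approximate ranges:

```python
def size_bucket(n_tokens):
    """Bucket a treebank by training-token count, mirroring the four
    experimental groups (boundaries from the slides)."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

print(size_bucket(3_583))      # <20k      (e.g. no nynorsklia)
print(size_bucket(97_531))     # 50k-100k  (e.g. id gsd)
print(size_bucket(1_173_282))  # >=100k    (e.g. cs pdt)
```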

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves
Dynamic oracle: transitions follow predicted moves

In both cases the log-probability of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
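The static/dynamic contrast above can be sketched as a toy training step; the `ToyState`/`ToyModel` API is invented for illustration, and the point is only that the two regimes differ in how the state advances while the loss always targets the gold move:

```python
import math
import random

class ToyState:
    """Minimal parser-state stand-in: only tracks how far we have advanced."""
    def __init__(self, step=0):
        self.step = step
    def apply(self, transition):
        return ToyState(self.step + 1)

class ToyModel:
    """Stub scorer: constant probability and a fixed argmax prediction."""
    def logp(self, state, transition):
        return math.log(0.5)
    def predict(self, state):
        return "shift"

def oracle(state):
    return "shift"  # stub gold oracle

def training_step(state, model, dynamic, explore=0.5, rng=random.Random(0)):
    gold = oracle(state)
    loss = -model.logp(state, gold)  # both regimes maximize log p(gold)
    if dynamic and rng.random() < explore:
        nxt = model.predict(state)   # dynamic oracle: may follow the model
    else:
        nxt = gold                   # static oracle: always follow gold
    return state.apply(nxt), loss

state, loss = training_step(ToyState(), ToyModel(), dynamic=True)
print(round(loss, 3))  # 0.693
```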

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3 Using my own word and context vectors trained on a different language from the same language family
4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch on very limited data does not yield useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
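Projectivity can be checked by testing for crossing arcs; a minimal sketch, assuming heads are given as a list indexed by token position with 0 for the artificial root:

```python
def is_projective(heads):
    """heads[i-1] = head index of token i (0 = artificial root).
    A dependency tree is projective iff no two arcs cross, i.e. no two
    arcs have strictly interleaved endpoints."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:  # interleaved endpoints: the arcs cross
                return False
    return True

print(is_projective([2, 0, 2]))     # simple chain, no crossing: True
print(is_projective([0, 4, 1, 3]))  # arcs (1,3) and (2,4) cross: False
```

An O(n^2) pairwise check like this is enough for diagnostics; computing the projectivity ratio of a treebank is then just the fraction of sentences for which it returns True.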

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table Our model's performance gap decreases as the projectivity ratio increases

7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement

Morphological Features

Finding a different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Piotr Bojanowski, Edouard Grave, Armand Joulin and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135-146.

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Action-LSTM

LSTM LSTM LSTM

Figure Action-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 69 123

How are the components of tree-stack LSTM connected?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123


How do components of tree-stack LSTM are connected

Omer Kırnap (Koc University) MSc Thesis September 27 2018 70 123

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure: Stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
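The two transition formulas above can be sketched on a minimal parser state. The function names and the toy word indices are illustrative; the arc set stores (head, label, dependent) triples exactly as in the formulas.

```python
def left_arc(stack, buffer, arcs, d):
    """left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    the stack top s becomes a dependent of the buffer front b."""
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, d, s))

def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    the stack top t becomes a dependent of the next stack item s."""
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

# Toy run with word indices on the stack/buffer and string labels.
stack, buffer, arcs = [0, 2], [3, 4], set()
left_arc(stack, buffer, arcs, "nsubj")   # word 2 attaches to buffer front 3
right_arc([0, 1], buffer, arcs, "obj")   # word 1 attaches to word 0
print(sorted(arcs))
```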

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
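The final architecture concatenates the summaries of the component LSTMs and scores transitions with an MLP. A minimal numpy sketch with illustrative dimensions (the real model's layer sizes and component set differ):

```python
import numpy as np

def score_transitions(h_sigma, h_beta, h_action, W1, b1, W2, b2):
    """Concatenate the σ-LSTM, β-LSTM, and action-LSTM summaries,
    then score transitions with a one-hidden-layer MLP (softmax output)."""
    x = np.concatenate([h_sigma, h_beta, h_action])
    h = np.tanh(W1 @ x + b1)
    logits = W2 @ h + b2
    e = np.exp(logits - logits.max())   # numerically stable softmax
    return e / e.sum()

rng = np.random.default_rng(1)
d, hidden, n_trans = 4, 8, 3            # illustrative sizes
W1 = rng.normal(size=(hidden, 3 * d)); b1 = np.zeros(hidden)
W2 = rng.normal(size=(n_trans, hidden)); b2 = np.zeros(n_trans)

p = score_transitions(rng.normal(size=d), rng.normal(size=d),
                      rng.normal(size=d), W1, b1, W2, b2)
print(p.sum())  # probabilities over the transition types sum to 1
```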

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP    Tree-stack
ru_taiga (10k)    58.89  60.55
hu_szeged (20k)   66.21  68.18
tr_imst (50k)     56.78  58.75
ar_padt (120k)    67.83  68.14
en_ewt (205k)     74.87  75.77
cs_cac (473k)     83.39  83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu_szeged   66.21  66.87        66.94   67.03
sv_lines    71.12  72.05        72.17   72.45
tr_imst     57.12  56.87        57.02   57.12
ar_padt     67.83  66.67        66.89   66.92
cs_cac      83.89  82.23        83.13   83.17
en_ewt      75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN  with t-RNN
no_nynorsklia (3k)   51.78          53.33
ru_taiga (11k)       59.13          60.55
gl_treegal (15k)     69.76          70.45
hu_szeged (20k)      66.12          68.18
sv_lines (49k)       74.04          75.46
tr_imst (50k)        58.12          58.75
ar_padt (120k)       68.04          68.14
en_ewt (204k)        74.87          75.77
cs_cac (473k)        82.89          83.57
cs_pdt (1M)          81.17          81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21  66.87   66.94   67.03   66.12      68.18
sv_lines    71.12  72.05   72.17   74.04   72.17      75.46
tr_imst     57.12  56.87   57.02   57.12   58.12      58.75
ar_padt     67.83  66.67   66.89   66.92   68.04      68.14
cs_cac      83.89  82.23   83.13   83.17   82.89      83.57
en_ewt      75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases

σ-LSTM provides more useful information, independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3,583
ru_taiga        58.32        60.55           10,479
sme_giella      52.78        53.39           16,385
la_perseus      49.93        51.60           18,184
ug_udt          52.78        53.39           19,262
sl_sst          46.72        48.77           19,473
hu_szeged       66.23        68.18           20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.70        75.80           75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121,064
bg_btb      84.53        84.55           124,336
en_ewt      75.77        75.682          204,585
ar_padt     68.02        68.14           223,881
de_gsd      71.59        71.32           263,804
ca_ancora   85.89        85.874          417,587
es_ancora   84.99        84.78           444,617
cs_cac      83.57        83.63           472,608
cs_pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves
Dynamic oracle: transitions follow the predicted moves

In both cases, log p of the gold moves is maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
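The difference between the two training regimes can be sketched schematically. The ToyModel, its fixed probabilities, and the exploration rate below are made-up stand-ins for the real parser; the point is only where the next state comes from, since both regimes maximize log p of the gold move.

```python
import math
import random

class ToyModel:
    """Stand-in scorer; the thesis uses the tree-stack LSTM here."""
    def log_prob(self, state, move):
        return math.log(0.5)   # pretend every move has probability 0.5
    def predict(self, state):
        return "shift"         # pretend the model always predicts shift

def oracle(state):
    return "left"              # pretend the gold move is always left

def train_step(state, model, dynamic, rng, explore_p=1.0):
    gold = oracle(state)
    loss = -model.log_prob(state, gold)   # both regimes maximize log p(gold)
    if dynamic and rng.random() < explore_p:
        nxt = model.predict(state)        # dynamic: continue from the predicted move
    else:
        nxt = gold                        # static: continue from the gold move
    return loss, nxt

model, rng = ToyModel(), random.Random(0)
print(train_step(None, model, dynamic=False, rng=rng))  # follows "left"
print(train_step(None, model, dynamic=True, rng=rng))   # follows "shift"
```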

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:
1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)           (2)    (3)    (4)
af_afribooms   not provided  75.46  77.43  78.12
kk_ktb         20.19         22.31  21.96  23.86
bxr_bdt        7.64          9.76   9.93   8.98
kmr_mg         20.12         22.57  22.78  23.39

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

Training an LM from scratch does not bring useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
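Projectivity means no two arcs cross when drawn above the sentence. A small checker (illustrative, not from the thesis) makes the constraint concrete:

```python
def is_projective(heads):
    """heads[i] is the head index of word i (the root's head is -1).
    Two arcs cross if exactly one endpoint of one arc lies strictly
    between the endpoints of the other."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads) if h >= 0]
    for i, (l1, r1) in enumerate(arcs):
        for l2, r2 in arcs[i + 1:]:
            if l1 < l2 < r1 < r2 or l2 < l1 < r2 < r1:
                return False
    return True

print(is_projective([1, -1, 1]))      # no crossing arcs -> projective
print(is_projective([2, 3, -1, 0]))   # crossing arcs -> non-projective
```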

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios

Language      Projectivity  Best (LAS)  Our (LAS)
grc_perseus   90.7          79.39       55.03 (20)
eu_bdt        95.13         84.22       74.13 (17)
hu_szeged     97.8          82.66       68.18 (14)
da_ddt        98.26         86.28       76.40 (17)
en_gum        99.6          85.05       76.44 (15)
gl_treegal    100           74.25       70.45 (10)
gl_ctg        100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7

7From the official results page and our projectivity table
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context", "Word" and "Morph-feat" embeddings and showed their contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Tree-RNN

Omer Kırnap (Koc University) MSc Thesis September 27 2018 71 123

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Tree-RNN (t-RNN)

t-RNN

Dependent word

Dependency Relation

Head word

Figure t-RNN

whead new = tanh(Wrnn lowast [whead old dl wdep] + brnn) (1)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 72 123

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition.
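The right transition right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}) can be sketched the same way. Again an illustrative helper with hypothetical names; arcs are (head, label, dependent) triples.

```python
def right_arc(stack, buffer, arcs, label):
    """right_label(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, label, t)}):
    pop the stack top t and attach it as a dependent of the new top s."""
    t = stack.pop()           # t is finished and leaves the stack
    s = stack[-1]             # s remains the stack top
    arcs.add((s, label, t))   # new arc: s --label--> t
    return stack, buffer, arcs
```

Note the asymmetry with the left transition: here both items come from the stack, and the buffer is untouched.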

Final overview of Tree-stack LSTM

(Figure: full architecture. The σ-, β-, and action-LSTM outputs are concatenated and fed to an MLP; the t-RNN composes the head word, dependent word, and dependency relation embeddings.)
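The final decision layer in the overview, concatenating the three LSTM summaries and scoring transitions with an MLP, might look like the sketch below. All dimensions and the ReLU hidden layer are assumptions for illustration, not details taken from the thesis.

```python
import numpy as np

def score_transitions(h_sigma, h_beta, h_action, W1, b1, W2, b2):
    """Concatenate the σ-, β-, and action-LSTM summaries and score
    candidate transitions with a one-hidden-layer MLP."""
    x = np.concatenate([h_sigma, h_beta, h_action])
    hidden = np.maximum(0.0, W1 @ x + b1)  # ReLU hidden layer (assumed)
    return W2 @ hidden + b2                # one logit per candidate transition
```

At parse time the transition with the highest logit (among the legal ones) would be taken.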

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

4. Results & Comparisons

Results & Comparisons

Dataset

CoNLL17:

• Dependency parsing of 81 treebanks in 49 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:

• Dependency parsing of 82 treebanks in 57 languages
• All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
• Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between the two: (1) train/test split change, (2) annotation.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code        | MLP   | Tree-stack
ru taiga (10k)   | 58.89 | 60.55
hu szeged (20k)  | 66.21 | 68.18
tr imst (50k)    | 56.78 | 58.75
ar padt (120k)   | 67.83 | 68.14
en ewt (205k)    | 74.87 | 75.77
cs cac (473k)    | 83.39 | 83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: Initial model (MLP).

Only Action LSTM

Figure: Only action LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang Code  | MLP   | Only Action | Only-β | Only-σ
hu szeged  | 66.21 | 66.87       | 66.94  | 67.03
sv lines   | 71.12 | 72.05       | 72.17  | 72.45
tr imst    | 57.12 | 56.87       | 57.02  | 57.12
ar padt    | 67.83 | 66.67       | 66.89  | 66.92
cs cac     | 83.89 | 82.23       | 83.13  | 83.17
en ewt     | 75.54 | 75.43       | 75.56  | 75.67

Table: Comparison between MLP and "Only" models.

Ablation of t-RNN

(Figure: full Tree-stack LSTM architecture.)

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           | without t-RNN | with t-RNN
no nynorsklia (3k)  | 51.78 | 53.33
ru taiga (11k)      | 59.13 | 60.55
gl treegal (15k)    | 69.76 | 70.45
hu szeged (20k)     | 66.12 | 68.18
sv lines (49k)      | 74.04 | 75.46
tr imst (50k)       | 58.12 | 58.75
ar padt (120k)      | 68.04 | 68.14
en ewt (204k)       | 74.87 | 75.77
cs cac (473k)       | 82.89 | 83.57
cs pdt (1M)         | 81.17 | 81.16

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of the ablation analysis:

Lang      | MLP   | Only A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87  | 66.94  | 67.03  | 66.12     | 68.18
sv lines  | 71.12 | 72.05  | 72.17  | 74.04  | 72.17     | 75.46
tr imst   | 57.12 | 56.87  | 57.02  | 57.12  | 58.12     | 58.75
ar padt   | 67.83 | 66.67  | 66.89  | 66.92  | 68.04     | 68.14
cs cac    | 83.89 | 82.23  | 83.13  | 83.17  | 82.89     | 83.57
en ewt    | 75.54 | 75.43  | 75.56  | 75.67  | 74.87     | 75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of the Ablation Experiments:

• t-RNN's performance contribution increases as the training size decreases.

• σ-LSTM provides more useful information, independent of dataset size.

• Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does the Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD v2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

• Languages having less than 20k tokens

• Languages having more than 20k, less than 50k tokens

• Languages having more than 50k, less than 100k tokens

• Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code     | Morph-Feats | no Morph-Feats | # of tokens
no nynorsklia | 51.13 | 53.33 | 3583
ru taiga      | 58.32 | 60.55 | 10479
sme giella    | 52.78 | 53.39 | 16385
la perseus    | 49.93 | 51.60 | 18184
ug udt        | 52.78 | 53.39 | 19262
sl sst        | 46.72 | 48.77 | 19473
hu szeged     | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code     | Morph-Feats | no Morph-Feats | # of tokens
sv lines      | 72.18 | 74.81  | 48325
fr sequoia    | 84.36 | 82.17  | 50543
en gum        | 76.44 | 75.34  | 53686
ko gsd        | 73.74 | 72.54  | 56687
eu bdt        | 74.55 | 73.32  | 72974
nl lassysmall | 76.7  | 75.8   | 75134
gl ctg        | 79.02 | 79.018 | 79327
lv lvtb       | 72.33 | 72.24  | 80666
id gsd        | 75.76 | 73.97  | 97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code | Morph-Feats | no Morph-Feats | # of tokens
fa seraji | 81.18 | 81.12  | 121064
bg btb    | 84.53 | 84.55  | 124336
en ewt    | 75.77 | 75.682 | 204585
ar padt   | 68.02 | 68.14  | 223881
de gsd    | 71.59 | 71.32  | 263804
ca ancora | 85.89 | 85.874 | 417587
es ancora | 84.99 | 84.78  | 444617
cs cac    | 83.57 | 83.63  | 472608
cs pdt    | 81.43 | 82.12  | 1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

(Figure: Tree-stack LSTM architecture.)
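The distinction between the two training regimes can be sketched in a few lines. This is a simplified illustration with hypothetical names, not the thesis training loop: the loss is -log p(gold) in both regimes, and only the move used to advance the parser state differs.

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    z = sum(es)
    return [e / z for e in es]

def oracle_step(scores, gold_move, dynamic):
    """One training step: the loss always maximizes log p(gold move);
    the state advances by the gold move (static oracle) or by the
    model's own argmax prediction (dynamic oracle)."""
    probs = softmax(scores)
    loss = -math.log(probs[gold_move])
    next_move = probs.index(max(probs)) if dynamic else gold_move
    return loss, next_move
```

With a dynamic oracle the parser visits states produced by its own (possibly wrong) predictions during training, so it learns to recover from errors it will actually make at test time.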

Static vs Dynamic Oracle Training

Figure: Results are very close for fewer than 20k training tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for between 20k and 50k training tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for more than 50k training tokens.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language     | (1)          | (2)   | (3)   | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb       | 20.19        | 22.31 | 21.96 | 23.86
bxr bdt      | 7.64         | 9.76  | 9.93  | 8.98
kmr mg       | 20.12        | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1)-(4).

Transfer Learning

Conclusions of Transfer Learning Experiments

• Applying transfer learning with a pre-trained parser is the most beneficial.

• From-scratch LM training does not bring useful word and context vectors.

• Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

Transition based parsers can only build projective trees.⁶

6. Figure from http://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

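Projectivity can be checked directly from the head assignments: a tree is projective iff no two arcs cross. A small sketch (hypothetical helper, not from the thesis) that tests the no-crossing condition pairwise:

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (tokens numbered 1..n, head 0 = root).
    A dependency tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # arc2 starts inside arc1 but ends outside
                return False
    return True
```

Non-projective gold trees cannot be produced by the transition system above, which is why the comparison below splits treebanks by projectivity ratio.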

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language    | Projectivity | Best (LAS) | Our (LAS)
grc perseus | 90.7         | 79.39      | 55.03 (20)
eu bdt      | 95.13        | 84.22      | 74.13 (17)
hu szeged   | 97.8         | 82.66      | 68.18 (14)
da ddt      | 98.26        | 86.28      | 76.40 (17)
en gum      | 99.6         | 85.05      | 76.44 (15)
gl treegal  | 100          | 74.25      | 70.45 (10)
gl ctg      | 100          | 82.12      | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.⁷

7. From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

• We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

• Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

• Tree-stack LSTM performed better on low-resource languages.

• When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Tree-RNN with

1 Left Transition2 Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 73 123

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Left Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 74 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
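The left and right transitions above can be sketched as operations on a parser state (stack σ, buffer β, arc set A). This is an illustrative sketch of the transition system, not the thesis implementation; the function names and the shift helper are mine:

```python
def left_arc(stack, buffer, arcs, d):
    # left_d(sigma|s, b|beta, A) => (sigma, b|beta, A ∪ {(b, d, s)}):
    # the stack top s becomes a dependent (label d) of the buffer front b.
    s = stack.pop()
    b = buffer[0]
    arcs.add((b, d, s))

def right_arc(stack, buffer, arcs, d):
    # right_d(sigma|s|t, beta, A) => (sigma|s, beta, A ∪ {(s, d, t)}):
    # the stack top t becomes a dependent (label d) of the element s below it.
    t = stack.pop()
    s = stack[-1]
    arcs.add((s, d, t))

def shift(stack, buffer):
    # Move the buffer front onto the stack.
    stack.append(buffer.pop(0))
```

A short run for a two-word sentence (0 is the artificial root): shift, left_arc, shift, right_arc leaves the stack holding only the root and both arcs in A.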

Final overview of Tree-stack LSTM

Figure: σ-, β- and Action-LSTM outputs are concatenated with the t-RNN composition (head word, dependent word, dependency relation) and passed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123
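The flow in the final overview can be sketched numerically with toy dimensions and randomly initialized weights. All names, sizes and weight shapes here are illustrative assumptions, not the thesis code: the t-RNN composes a new head embedding from head, dependent and relation vectors, and a softmax MLP scores transitions over the concatenated LSTM summaries.

```python
import math
import random

random.seed(0)
D = 8  # toy embedding size

def rand_matrix(rows, cols):
    return [[random.gauss(0, 0.1) for _ in range(cols)] for _ in range(rows)]

W_t = rand_matrix(D, 3 * D)  # t-RNN composition weights
W_o = rand_matrix(4, 3 * D)  # MLP output layer: 4 transition types

def matvec(W, x):
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def t_rnn(head, dep, rel):
    # Compose a new head embedding from head, dependent and relation vectors.
    return [math.tanh(v) for v in matvec(W_t, head + dep + rel)]

def score_transitions(sigma_h, beta_h, action_h):
    # Concatenate the sigma-, beta- and Action-LSTM summaries and apply
    # a softmax MLP to obtain transition probabilities.
    logits = matvec(W_o, sigma_h + beta_h + action_h)
    m = max(logits)
    e = [math.exp(v - m) for v in logits]
    z = sum(e)
    return [v / z for v in e]
```

In the real model the three summaries come from the σ-, β- and Action-LSTMs; here they are just random vectors of the same toy size.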

Overview

1 Introduction
  Overview of Dependency Parsing
  Transition Based Dependency Parsing

2 Related Work
  Linear Models and their Drawbacks
  Neural Network Models

3 Model
  Language Model
  MLP Parser
  Tree-stack LSTM Parser

4 Results
  MLP vs Tree-stack LSTM
  Morphological Feature Embeddings
  Static vs Dynamic Oracle Training
  Transfer Learning

5 Conclusion

6 Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc-University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split change, 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1 If the annotation of the treebank is improved, the older parser is handicapped.

2 If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code (tokens)   MLP    Tree-stack
ru taiga (10k)       58.89  60.55
hu szeged (20k)      66.21  68.18
tr imst (50k)        56.78  58.75
ar padt (120k)       67.83  68.14
en ewt (205k)        74.87  75.77
cs cac (473k)        83.39  83.57

Tree-stack LSTM outperforms MLP.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM


Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM


Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM


Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: σ-, β- and Action-LSTM outputs are concatenated with the t-RNN composition (head word, dependent word, dependency relation) and passed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings

We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
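The four buckets above can be expressed as a simple helper (boundaries taken from the slide; the function name is mine):

```python
def size_bucket(n_tokens):
    # Bucket a language by its number of training tokens.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```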

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.6            18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.7         75.8            75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: σ-, β- and Action-LSTM outputs are concatenated with the t-RNN composition (head word, dependent word, dependency relation) and passed to an MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
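A hedged sketch of the difference described above: both regimes compute the loss against the gold move; they differ only in which move is followed to reach the next parser state. Here `model` and `oracle` are hypothetical callables standing in for the parser's scorer and the oracle, and `explore` is an illustrative exploration rate, not a thesis hyperparameter.

```python
import math
import random

def training_step(state, model, oracle, dynamic, explore=0.9):
    gold = oracle(state)           # gold transition for this state
    probs = model(state)           # dict: transition -> probability
    loss = -math.log(probs[gold])  # -log p(gold) is maximized in both regimes
    if dynamic and random.random() < explore:
        follow = max(probs, key=probs.get)  # follow the model's own prediction
    else:
        follow = gold                       # static oracle: always follow gold
    return loss, follow
```

With a static oracle the parser only ever sees gold-derived states; with a dynamic oracle it is also trained on states reached by its own (possibly wrong) predictions.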

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1 Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2 Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3 Using my own word and context vectors trained on a different language from the same language family

4 Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition based parser can only build projective trees.6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
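Projectivity can be checked directly from head indices: a tree is projective iff no two arcs cross. A minimal sketch (0 denotes the artificial root; the function name is mine):

```python
def is_projective(heads):
    # heads[i] is the head index of token i+1 (tokens are numbered from 1,
    # 0 is the artificial root). Two arcs cross iff their spans strictly
    # interleave.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # strictly interleaved spans cross
                return False
    return True
```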

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM or Action-LSTM states may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Önder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM LSTM

Left transition

t-RNN

Dependency Relation

HeadDependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 75 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

Figure Stackrsquos top LSTM is reducedOmer Kırnap (Koc University) MSc Thesis September 27 2018 76 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM

LSTM

LSTM

Left transition

t-RNN

Dependency Relation

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure β-LSTM recalculates its hidden based on new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

leftd(σ|s b|βA) = (σ b|βA cup (b d s))

LSTM LSTM LSTM

Left transition

t-RNN New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Each embedding initiated by concatenating POS language andmorph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:

Dependency parsing of 81 treebanks in 49 languages.

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.

Koç University ranked 7th out of 33 participants (1st among transition based parsers).

CoNLL18:

Dependency parsing of 82 treebanks in 57 languages.

All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations.

Koç University ranked 16th out of 30 participants (2nd among transition based parsers).

Differences between CoNLL17 and CoNLL18: 1. train/test split change, 2. annotation.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru_taiga (10k)    58.89   60.55
hu_szeged (20k)   66.21   68.18
tr_imst (50k)     56.78   58.75
ar_padt (120k)    67.83   68.14
en_ewt (205k)     74.87   75.77
cs_cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

[Diagram: features fed to an MLP]

Figure: Initial model.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

[Diagram: action LSTM]

Figure: Only action LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

[Diagram: β-LSTM feeding an MLP]

Figure: Only β-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

[Diagram: σ-LSTM feeding an MLP]

Figure: Only σ-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code    MLP     Only Action  Only-β  Only-σ
hu_szeged    66.21   66.87        66.94   67.03
sv_lines     71.12   72.05        72.17   72.45
tr_imst      57.12   56.87        57.02   57.12
ar_padt      67.83   66.67        66.89   66.92
cs_cac       83.89   82.23        83.13   83.17
en_ewt       75.54   75.43        75.56   75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Diagram: tree-stack LSTM; the σ-, β-, and action-LSTM states plus the t-RNN outputs (head word, dependent word, dependency relation) are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no_nynorsklia (3k)   51.78           53.33
ru_taiga (11k)       59.13           60.55
gl_treegal (15k)     69.76           70.45
hu_szeged (20k)      66.12           68.18
sv_lines (49k)       74.04           75.46
tr_imst (50k)        58.12           58.75
ar_padt (120k)       68.04           68.14
en_ewt (204k)        74.87           75.77
cs_cac (473k)        82.89           83.57
cs_pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A  Only-β  Only-σ  w/o t-RNN  all
hu_szeged   66.21   66.87   66.94   67.03   66.12      68.18
sv_lines    71.12   72.05   72.17   74.04   72.17      75.46
tr_imst     57.12   56.87   57.02   57.12   58.12      58.75
ar_padt     67.83   66.67   66.89   66.92   68.04      68.14
cs_cac      83.89   82.23   83.13   83.17   82.89      83.57
en_ewt      75.54   75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: we divide the CoNLL18 UD v2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
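The four-way split above can be expressed as a small helper. The function name `size_bucket` is ours, and the token counts are taken from the tables in this section:

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four training-size groups."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Token counts taken from the morph-feat tables in this section.
treebanks = {"no_nynorsklia": 3_583, "sv_lines": 48_325,
             "id_gsd": 97_531, "cs_pdt": 1_173_282}
for name, n in treebanks.items():
    print(name, size_bucket(n))
```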

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code       Morph-Feats  no Morph-Feats  # of tokens
no_nynorsklia   51.13        53.33           3,583
ru_taiga        58.32        60.55           10,479
sme_giella      52.78        53.39           16,385
la_perseus      49.93        51.60           18,184
ug_udt          52.78        53.39           19,262
sl_sst          46.72        48.77           19,473
hu_szeged       66.23        68.18           20,166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv_lines       72.18        74.81           48,325
fr_sequoia     84.36        82.17           50,543
en_gum         76.44        75.34           53,686
ko_gsd         73.74        72.54           56,687
eu_bdt         74.55        73.32           72,974
nl_lassysmall  76.70        75.80           75,134
gl_ctg         79.02        79.018          79,327
lv_lvtb        72.33        72.24           80,666
id_gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats  no Morph-Feats  # of tokens
fa_seraji   81.18        81.12           121,064
bg_btb      84.53        84.55           124,336
en_ewt      75.77        75.682          204,585
ar_padt     68.02        68.14           223,881
de_gsd      71.59        71.32           263,804
ca_ancora   85.89        85.874          417,587
es_ancora   84.99        84.78           444,617
cs_cac      83.57        83.63           472,608
cs_pdt      81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves.
Dynamic oracle: transitions follow predicted moves.

In both cases, the log probability of gold moves is maximized.

[Diagram: tree-stack LSTM; the σ-, β-, and action-LSTM states plus the t-RNN outputs (head word, dependent word, dependency relation) are concatenated and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
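The difference between the two training regimes can be sketched with a toy scorer. Everything here (the scorer, the state update) is a stand-in; only the control flow, following gold moves versus predicted moves while always maximizing the log probability of the gold move, reflects the slide:

```python
import math
import random

random.seed(0)

ACTIONS = ["shift", "left", "right"]
# Toy scorer standing in for the parser network.
weights = {a: random.gauss(0, 1) for a in ACTIONS}

def log_probs(state):
    """Log-softmax over actions given a toy scalar state."""
    logits = {a: weights[a] + 0.1 * state for a in ACTIONS}
    z = math.log(sum(math.exp(v) for v in logits.values()))
    return {a: v - z for a, v in logits.items()}

def run(gold, dynamic):
    """Accumulate -log p(gold move); advance the state with the gold move
    (static oracle) or the model's argmax move (dynamic oracle)."""
    state, loss = 0, 0.0
    for g in gold:
        lp = log_probs(state)
        loss -= lp[g]                        # always maximize log p of gold
        move = max(lp, key=lp.get) if dynamic else g
        state += ACTIONS.index(move) + 1     # toy state update
    return loss

gold = ["shift", "shift", "right", "left"]
print(run(gold, dynamic=False), run(gold, dynamic=True))
```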

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets between 20k and 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch.

2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017].

3. Using my own word and context vectors, trained on a different language from the same language family.

4. Applying transfer learning with a pre-trained parser.

Language      (1)           (2)    (3)    (4)
af_afribooms  not provided  75.46  77.43  78.12
kk_ktb        20.19         22.31  21.96  23.86
bxr_bdt       7.64          9.76   9.93   8.98
kmr_mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
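Projectivity itself is easy to check: a tree is projective iff no two arcs cross. A minimal sketch follows (token indices start at 1, head 0 marks the root; the function name `is_projective` is ours):

```python
def is_projective(heads):
    """heads[i-1] = head of token i (0 = root).
    A tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, j in arcs:
        for k, l in arcs:
            if i < k < j < l:   # arcs (i, j) and (k, l) cross
                return False
    return True

print(is_projective([2, 0, 2]))     # no crossing arcs -> True
print(is_projective([3, 4, 0, 3]))  # arcs (1,3) and (2,4) cross -> False
```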

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language      Projectivity (%)  Best (LAS)  Our (LAS)
grc_perseus   90.7              79.39       55.03 (20)
eu_bdt        95.13             84.22       74.13 (17)
hu_szeged     97.8              82.66       68.18 (14)
da_ddt        98.26             86.28       76.40 (17)
en_gum        99.6              85.05       76.44 (15)
gl_treegal    100               74.25       70.45 (10)
gl_ctg        100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7 From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: we introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or in the β-LSTM or Action-LSTM, may bring performance improvements.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Diagram: left transition; the t-RNN combines Head, Dependent, and Dependency Relation into a New Head]

Figure: t-RNN calculates the new head embedding.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 77 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Diagram: the β-LSTM consumes the New Head produced by the t-RNN]

Figure: β-LSTM recalculates its hidden state based on the new input.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 78 123

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

[Diagram: updated tree-stack LSTM after the left transition]

Figure: Tree-stack LSTM is ready to predict the next transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 80 123


Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Left

left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)})

Figure: After the left transition, the t-RNN has produced the new head and the tree-stack LSTM is ready to give a new transition.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 79 123

Right Transition


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The t-RNN calculates the new head embedding.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The tree-stack LSTM is ready to give a new transition.

Final overview of Tree-stack LSTM

Figure: Full model. The σ-, β-, and action-LSTM outputs are concatenated and passed to an MLP, while the t-RNN composes the head word, dependent word, and dependency relation embeddings into a new head embedding.
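In code, the two transitions above amount to a pop plus an arc insertion. A minimal Python sketch (the `left`/`right` helpers and the toy tokens are illustrative, not the thesis implementation; an arc is a (head, label, dependent) triple):

```python
# Illustrative sketch of the left/right transitions from the slides.
# An arc is stored as a (head, label, dependent) triple.

def left(stack, buffer, arcs, d):
    # left_d(σ|s, b|β, A) = (σ, b|β, A ∪ {(b, d, s)}):
    # the buffer front b becomes the head of the popped stack top s.
    s = stack.pop()
    arcs.add((buffer[0], d, s))

def right(stack, buffer, arcs, d):
    # right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    # the element s below the stack top becomes the head of the popped top t.
    t = stack.pop()
    arcs.add((stack[-1], d, t))

# Toy run on placeholder tokens:
stack, buffer, arcs = ["a", "b"], ["c"], set()
left(stack, buffer, arcs, "L")        # adds arc (c, L, b), pops b
right(["x", "y"], buffer, arcs, "R")  # adds arc (x, R, y), pops y
```

In both cases only the dependent leaves the stack, which is why the tree-stack LSTM pops one stack LSTM state per transition.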

Overview

1 Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing
2 Related Work
   Linear Models and their Drawbacks
   Neural Network Models
3 Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser
4 Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning
5 Conclusion
6 Future Work & Discussions

4. Results & Comparisons

Results amp Comparisons

Dataset

CoNLL17:
- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
- Koç University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags and 37 universal dependency relations
- Koç University ranked 16th out of 30 participants (2nd among transition based parsers)

Differences between CoNLL17 and CoNLL18: 1. train/test split changes; 2. annotation changes.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank has been improved, the older parser is handicapped.

2. If the train/test split has changed and old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train/test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: The initial MLP model.

Only Action LSTM

Figure: Only the action LSTM.

Only β-LSTM

Figure: Only the β-LSTM.

Only σ-LSTM

Figure: Only the σ-LSTM.

Ablation Analysis Results

Lang Code   MLP    Only Action  Only-β  Only-σ
hu szeged   66.21  66.87        66.94   67.03
sv lines    71.12  72.05        72.17   72.45
tr imst     57.12  56.87        57.02   57.12
ar padt     67.83  66.67        66.89   66.92
cs cac      83.89  82.23        83.13   83.17
en ewt      75.54  75.43        75.56   75.67

Table: Comparison between the MLP and the "Only" models.

Ablation of t-RNN

Figure: Tree-stack LSTM with the t-RNN, which composes the head word, dependent word, and dependency relation embeddings; the LSTM outputs are concatenated and passed to an MLP.
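The t-RNN composition in the figure can be sketched as one recurrent step over the concatenated head, dependent, and relation embeddings. A hypothetical pure-Python sketch (in the real model, `W` and `b` are learned recurrent parameters, not hand-picked values):

```python
import math

def t_rnn_step(head, dep, rel, W, b):
    # Illustrative t-RNN step: the new head embedding is a tanh
    # nonlinearity over the concatenation of the head, dependent,
    # and dependency-relation embeddings.
    x = head + dep + rel  # list concatenation = vector concatenation
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

# 2-dim embeddings, so W is 2x6; toy weights that pick out x[0] and x[3]:
new_head = t_rnn_step([1.0, 0.0], [0.0, 1.0], [0.5, 0.5],
                      W=[[1, 0, 0, 0, 0, 0], [0, 0, 0, 1, 0, 0]],
                      b=[0.0, 0.0])
```

The output replaces the head's old embedding on the stack, which is how information from reduced dependents survives later transitions.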

Ablation of t-RNN

Comparison of the tree-stack LSTM with and without the t-RNN:

Lang Code            without t-RNN  with t-RNN
no nynorsklia (3k)   51.78          53.33
ru taiga (11k)       59.13          60.55
gl treegal (15k)     69.76          70.45
hu szeged (20k)      66.12          68.18
sv lines (49k)       74.04          75.46
tr imst (50k)        58.12          58.75
ar padt (120k)       68.04          68.14
en ewt (204k)        74.87          75.77
cs cac (473k)        82.89          83.57
cs pdt (1M)          81.17          81.16

The t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments

- The t-RNN's performance contribution increases as the training size decreases.

- The σ-LSTM provides more useful information independent of dataset size.

- Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens per language, to better understand our contributions:

- Languages having less than 20k tokens

- Languages having more than 20k and less than 50k tokens

- Languages having more than 50k and less than 100k tokens

- Languages having 100k tokens or more
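The four groups can be expressed as a trivial bucketing function (boundaries taken from the list above; the function name is illustrative):

```python
def size_bucket(n_train_tokens):
    # Four experimental groups by training-token count,
    # with boundaries as listed on the slide.
    if n_train_tokens < 20_000:
        return "<20k"
    if n_train_tokens < 50_000:
        return "20k-50k"
    if n_train_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```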

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.682          204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.874          417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log probability of the gold moves is maximized.

Figure: The tree-stack LSTM model used in the oracle-training experiments.
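The two regimes can be sketched as a toy training loop: both accumulate the negative log probability of the gold move, but the dynamic oracle executes the model's predicted move instead of the gold one. Everything here (the callbacks and the counter state) is illustrative, not the thesis code:

```python
import random

def oracle_train_step(score_moves, gold_move, apply_move, is_final, state,
                      dynamic=False, explore=1.0, rng=None):
    # Sketch of static vs dynamic oracle training. Both oracles maximize
    # log p(gold move); they differ only in which move is *executed*
    # to reach the next parser state.
    rng = rng or random.Random(0)
    loss, path = 0.0, []
    while not is_final(state):
        scores = score_moves(state)          # move -> log-probability
        g = gold_move(state)
        loss -= scores[g]                    # NLL of the gold move
        if dynamic and rng.random() < explore:
            m = max(scores, key=scores.get)  # follow the model's prediction
        else:
            m = g                            # follow the gold move (static)
        path.append(m)
        state = apply_move(state, m)
    return loss, path

# Toy setting: the state is a step counter, with two moves "L"/"R".
scores = lambda s: {"L": -1.0, "R": -0.5}
gold   = lambda s: "L" if s % 2 == 0 else "R"
step   = lambda s, m: s + 1
done   = lambda s: s >= 3

static_loss, static_path = oracle_train_step(scores, gold, step, done, 0)
dynamic_loss, dynamic_path = oracle_train_step(scores, gold, step, done, 0,
                                               dynamic=True, explore=1.0)
```

With full exploration the dynamic path follows the model's argmax moves, while the loss is still computed against the gold moves at every visited state.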

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of fewer than 20k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of between 20k and 50k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets of more than 50k tokens.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]

3. Using my own word and context vectors trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).
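Strategy (4) can be sketched as warm-starting shared parameters from the pre-trained parser before fine-tuning. A hypothetical dict-of-parameters sketch (real models copy LSTM/MLP weight matrices, not scalars; the names are illustrative):

```python
def warm_start(target_params, source_params, freeze=()):
    # Copy every parameter the two parsers share from the pre-trained
    # source into the target, then fine-tune everything not frozen.
    params = dict(target_params)
    for name, value in source_params.items():
        if name in params:
            params[name] = value        # warm-start shared weights
    trainable = [n for n in params if n not in freeze]
    return params, trainable

params, trainable = warm_start(
    {"embed": 0, "lstm": 0, "mlp": 0},   # randomly-initialized target
    {"lstm": 1, "mlp": 2, "tagger": 9},  # pre-trained, related-language source
    freeze=("lstm",))
```

Parameters the target does not have (here `"tagger"`) are simply ignored, and frozen parameters keep their transferred values during fine-tuning.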

Transfer Learning

Conclusions of Transfer Learning Experiments

- Applying transfer learning with a pre-trained parser is the most beneficial.

- Training an LM from scratch on very limited data does not yield useful word and context vectors.

- Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Projectivity

Transition based parsers can only build projective trees.⁶

⁶ Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
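Projectivity is easy to test: a tree is projective iff no two arcs cross. A small sketch, assuming `heads[i]` gives the head of 1-based token i (index 0 is a ROOT placeholder):

```python
def is_projective(heads):
    # An arc spans (min(h, d), max(h, d)); two arcs cross iff one
    # starts strictly inside the other span and ends strictly outside it.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads[1:], start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:
                return False
    return True
```

For example, `[0, 0, 1, 1, 2]` encodes crossing arcs (1,3) and (2,4), so it is non-projective; such trees are exactly the ones a plain transition system cannot produce.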

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus  90.7%         79.39       55.03 (20)
eu bdt       95.13%        84.22       74.13 (17)
hu szeged    97.8%         82.66       68.18 (14)
da ddt       98.26%        86.28       76.40 (17)
en gum       99.6%         85.05       76.44 (15)
gl treegal   100%          74.25       70.45 (10)
gl ctg       100%          82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.⁷

⁷ From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

- We introduced "context", "word", and "morph-feat" embeddings and showed their contribution to transition based dependency parsing.

- Our tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

- The tree-stack LSTM performed better on low-resource languages.

- As the training dataset size increases, the tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions



Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Each embedding is initialized by concatenating POS, language, and morph-feat embeddings

Omer Kırnap (Koc University) MSc Thesis September 27 2018 81 123
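The caption above describes the input layer: each item's initial vector is the concatenation of its POS, language, and morphological-feature embeddings. A minimal sketch with made-up tables and dimensions (the full model also adds word and context vectors from the LM):

```python
# Toy embedding tables; entries and dimensions are invented for the example.
pos_emb   = {"NOUN": [0.1] * 8, "VERB": [0.2] * 8}
lang_emb  = {"en": [0.3] * 4}
morph_emb = {"Number=Sing": [0.4] * 6}

def init_embedding(pos, lang, morph):
    """Initial item embedding: concatenation [POS ; language ; morph-feat]."""
    return pos_emb[pos] + lang_emb[lang] + morph_emb[morph]
```

With the dimensions above, the resulting vector has length 8 + 4 + 6 = 18.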

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: The stack's top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: Tree-stack LSTM is ready to predict the next transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123
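The right_d transition above can be sketched on a plain configuration of (stack, buffer, arcs). This is an illustrative reduction step under that formula, not the thesis implementation:

```python
def right_arc(stack, buffer, arcs, d):
    """right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
    pop the stack's top item t, keep s below it as the head,
    and record the labeled arc (s, d, t)."""
    t = stack.pop()        # dependent leaves the stack
    s = stack[-1]          # head stays on top of the stack
    arcs.add((s, d, t))
    return stack, buffer, arcs
```

In the neural model, this is the moment the t-RNN composes the head and dependent embeddings into a new head embedding before the σ-LSTM re-reads the stack.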

Final overview of Tree-stack LSTM

(Figure: the full Tree-stack LSTM: σ-, β-, and action LSTM outputs are concatenated and passed to an MLP; the t-RNN composes head word, dependent word, and dependency relation into a new head embedding.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1. Introduction
   Overview of Dependency Parsing
   Transition Based Dependency Parsing

2. Related Work
   Linear Models and their Drawbacks
   Neural Network Models

3. Model
   Language Model
   MLP Parser
   Tree-stack LSTM Parser

4. Results
   MLP vs Tree-stack LSTM
   Morphological Feature Embeddings
   Static vs Dynamic Oracle Training
   Transfer Learning

5. Conclusion

6. Future Work & Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4. Results & Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results & Comparisons

Dataset

CoNLL17:
Dependency parsing of 81 treebanks in 49 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 7th out of 33 participants (1st among transition based parsers)

CoNLL18:
Dependency parsing of 82 treebanks in 57 languages
All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
Koc University ranked 16th out of 30 participants (2nd among transition based parsers)

Changes from CoNLL17 to CoNLL18: 1. Train/test split, 2. Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested under the same test sets.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123
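All scores in these tables are LAS (labeled attachment score): the percentage of tokens whose predicted head and dependency label both match the gold annotation. A minimal sketch of the metric:

```python
def las(gold, pred):
    """Labeled attachment score.
    gold, pred: lists of (head, label) pairs, one per token."""
    assert len(gold) == len(pred)
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```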

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

Figure: Initial model (a bare MLP)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

Figure: Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

(Figure: the Tree-stack LSTM architecture, repeated to show the component being ablated: head word, dependent word, and dependency relation feed the t-RNN; σ-, β-, and action LSTM outputs are concatenated into an MLP.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.164

t-RNN provides a comparative advantage for low-resource languages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases

σ-LSTM provides more useful information independent of dataset size

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts, based on the number of training tokens for each language, to better understand our contributions.

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123
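The four size groups above can be expressed as a small helper; this is an illustrative sketch whose thresholds simply follow the grouping stated on this slide:

```python
def size_bucket(n_tokens):
    """Assign a treebank to one of the four training-size groups
    used in the morph-feat experiments (CoNLL18 UD 2.2 token counts)."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```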

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3,583
ru taiga        58.32         60.55            10,479
sme giella      52.78         53.39            16,385
la perseus      49.93         51.6             18,184
ug udt          52.78         53.39            19,262
sl sst          46.72         48.77            19,473
hu szeged       66.23         68.18            20,166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48,325
fr sequoia      84.36         82.17            50,543
en gum          76.44         75.34            53,686
ko gsd          73.74         72.54            56,687
eu bdt          74.55         73.32            72,974
nl lassysmall   76.7          75.8             75,134
gl ctg          79.02         79.018           79,327
lv lvtb         72.33         72.24            80,666
id gsd          75.76         73.97            97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121,064
bg btb      84.53         84.55            124,336
en ewt      75.77         75.682           204,585
ar padt     68.02         68.14            223,881
de gsd      71.59         71.32            263,804
ca ancora   85.89         85.874           417,587
es ancora   84.99         84.78            444,617
cs cac      83.57         83.63            472,608
cs pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves. Dynamic oracle: transitions follow the predicted moves.

In both cases, the log probability of the gold moves is maximized.

(Figure: the Tree-stack LSTM architecture used in both training regimes: t-RNN over head word, dependent word, and dependency relation; σ-, β-, and action LSTM outputs concatenated into an MLP.)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
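The difference between the two regimes can be sketched as a single training step. The `model.score` and `gold_moves` interfaces here are hypothetical stand-ins, not the thesis code:

```python
import random

def oracle_step(model, state, gold_moves, dynamic=False, explore=0.9):
    """One oracle training step (illustrative sketch).
    Static oracle: always follow the gold move.
    Dynamic oracle: with probability `explore`, follow the model's own
    (possibly wrong) prediction instead, but the loss still maximizes
    log p(gold move) for the state actually visited."""
    scores = model.score(state)           # assumed: dict move -> log-prob
    gold = gold_moves(state)              # assumed: gold move for this state
    loss = -scores[gold]                  # negative log-likelihood of gold
    if dynamic and random.random() < explore:
        move = max(scores, key=scores.get)   # parser explores its prediction
    else:
        move = gold                          # parser follows the gold move
    return loss, move
```

Either way the loss term is the same; only the sequence of states the parser visits during training changes.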

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
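Strategy (4) can be sketched as a warm start: copy shape-compatible parameters from a parser pre-trained on a related language, then fine-tune on the low-resource treebank. The `warm_start` helper and the flat parameter layout below are invented for the example:

```python
def warm_start(target_params, source_params):
    """Initialize a low-resource parser from a pre-trained one: copy
    weights wherever names and shapes match, and keep the target's own
    (e.g. vocabulary-specific) parameters otherwise."""
    merged = dict(target_params)
    for name, weights in source_params.items():
        if name in merged and len(merged[name]) == len(weights):
            merged[name] = list(weights)   # copy pre-trained weights
    return merged
```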

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial

From-scratch LM training does not produce useful word and context vectors

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition based parsers can only build projective trees 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
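Projectivity is easy to check mechanically: an arc (head, dep) is projective iff every token strictly between them is a descendant of that head. A minimal sketch, where `heads[i]` is the head index of token i+1, 0 denotes the root, and the input is assumed to be a well-formed tree:

```python
def is_projective(heads):
    """Return True iff the dependency tree encoded by `heads` is projective."""
    n = len(heads)
    for dep in range(1, n + 1):
        head = heads[dep - 1]
        lo, hi = min(head, dep), max(head, dep)
        for k in range(lo + 1, hi):
            # walk up from k; it must reach `head` before (or at) the root
            j = k
            while j != 0 and j != head:
                j = heads[j - 1]
            if j != head:
                return False   # k is inside the arc span but not a descendant
    return True
```

Trees that fail this check are exactly the ones a purely projective transition system cannot reproduce.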

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)
grc perseus   90.7           79.39        55.03 (20)
eu bdt        95.13          84.22        74.13 (17)
hu szeged     97.8           82.66        68.18 (14)
da ddt        98.26          86.28        76.40 (17)
en gum        99.6           85.05        76.44 (15)
gl treegal    100            74.25        70.45 (10)
gl ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases

7 From the official results page and our projectivity table

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion: We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition based dependency parsing.

Our Tree-stack LSTM outperformed MLP by removing hand-crafted feature engineering

Tree-stack LSTM performed better on low-resource languages

When the training dataset size increases, tree-stack LSTM loses its advantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

Figure Stackrsquos top LSTM is reduced

Omer Kırnap (Koc University) MSc Thesis September 27 2018 82 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM

LSTM

LSTM

t-RNN

Dependency Relation

Right Transition

Head

Dependent

New Head

Figure t-RNN calculates new head embedding

Omer Kırnap (Koc University) MSc Thesis September 27 2018 83 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: t-RNN calculates the new head embedding (inputs: head, dependent, and dependency relation).
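The right_d transition above can be sketched directly on a (stack, buffer, arcs) configuration. This is illustrative code, not the thesis implementation:

```python
# right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)}):
# pop the top of the stack t, attach it to the item below it (s)
# with dependency label d, and record the new arc.
def right_arc(config, d):
    stack, buffer, arcs = config
    *rest, s, t = stack                      # stack is σ|s|t
    return rest + [s], buffer, arcs | {(s, d, t)}
```

For example, `right_arc(([0, 1, 2], [3], set()), "obj")` yields `([0, 1], [3], {(1, "obj", 2)})`.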

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: σ-LSTM recalculates its hidden state from the new input.

Transitions - Right

right_d(σ|s|t, β, A) = (σ|s, β, A ∪ {(s, d, t)})

Figure: the Tree-stack LSTM is ready to produce a new transition.

Final overview of Tree-stack LSTM

[Figure: overall Tree-stack LSTM architecture — the t-RNN combines the head word, dependent word, and dependency relation; the LSTM outputs are concatenated and fed to an MLP.]

Overview

1. Introduction: Overview of Dependency Parsing; Transition Based Dependency Parsing

2. Related Work: Linear Models and their Drawbacks; Neural Network Models

3. Model: Language Model; MLP Parser; Tree-stack LSTM Parser

4. Results: MLP vs Tree-stack LSTM; Morphological Feature Embeddings; Static vs Dynamic Oracle Training; Transfer Learning

5. Conclusion

6. Future Work & Discussions

4. Results & Comparisons


Results & Comparisons

Dataset

CoNLL17:

- Dependency parsing of 81 treebanks in 49 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 7th out of 33 participants (1st among transition-based parsers)

CoNLL18:

- Dependency parsing of 82 treebanks in 57 languages
- All treebanks use standardized annotation: 17 universal part-of-speech tags, 37 universal dependency relations
- Koc University ranked 16th out of 30 participants (2nd among transition-based parsers)

Changes from CoNLL17 to CoNLL18: 1. train/test split change; 2. annotation.

MLP vs Tree-stack LSTM

The CoNLL 2018 committee released comparison results of CoNLL17 and CoNLL18 systems tested on the same test sets.


MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank was improved, the older parser is handicapped.

2. If the training-test split has changed and old training data are now in the test data, the old parser is favored undeservedly.


MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code         MLP     Tree-stack
ru taiga (10k)    58.89   60.55
hu szeged (20k)   66.21   68.18
tr imst (50k)     56.78   58.75
ar padt (120k)    67.83   68.14
en ewt (205k)     74.87   75.77
cs cac (473k)     83.39   83.57

Tree-stack LSTM outperforms MLP
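All scores in these tables are LAS (labelled attachment score). As a quick reference, it can be computed as follows (a minimal sketch, not the official evaluation script):

```python
# LAS: percentage of words whose predicted head AND dependency label
# are both correct.
def las(gold, pred):
    # gold, pred: lists of (head, label) pairs, one per word
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)
```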


Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM


MLP Parser

Figure: initial model (MLP).

Only Action LSTM

Figure: only the action LSTM.

Only β-LSTM

Figure: only the β-LSTM.

Only σ-LSTM

Figure: only the σ-LSTM.

Ablation Analysis Results

Lang Code   MLP     Only Action   Only-β   Only-σ
hu szeged   66.21   66.87         66.94    67.03
sv lines    71.12   72.05         72.17    72.45
tr imst     57.12   56.87         57.02    57.12
ar padt     67.83   66.67         66.89    66.92
cs cac      83.89   82.23         83.13    83.17
en ewt      75.54   75.43         75.56    75.67

Table: Comparison between the MLP and "Only" models.

Ablation of t-RNN

[Figure: Tree-stack LSTM architecture — the t-RNN combines the head word, dependent word, and dependency relation; the LSTM outputs are concatenated and fed to an MLP.]

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code            without t-RNN   with t-RNN
no nynorsklia (3k)   51.78           53.33
ru taiga (11k)       59.13           60.55
gl treegal (15k)     69.76           70.45
hu szeged (20k)      66.12           68.18
sv lines (49k)       74.04           75.46
tr imst (50k)        58.12           58.75
ar padt (120k)       68.04           68.14
en ewt (204k)        74.87           75.77
cs cac (473k)        82.89           83.57
cs pdt (1M)          81.17           81.16

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only A   Only-β   Only-σ   w/o t-RNN   all
hu szeged   66.21   66.87    66.94    67.03    66.12       68.18
sv lines    71.12   72.05    72.17    74.04    72.17       75.46
tr imst     57.12   56.87    57.02    57.12    58.12       58.75
ar padt     67.83   66.67    66.89    66.92    68.04       68.14
cs cac      83.89   82.23    83.13    83.17    82.89       83.57
en ewt      75.54   75.43    75.56    75.67    74.87       75.77

Tree-stack LSTM beats other model variations


Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes the Tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).


What does the morphological feature embedding provide?


Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

- Languages having fewer than 20k tokens
- Languages having more than 20k and fewer than 50k tokens
- Languages having more than 50k and fewer than 100k tokens
- Languages having 100k tokens or more
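The split above amounts to a simple bucketing by training-token count (thresholds from this slide; the helper name is illustrative):

```python
# Assign a language to one of the four experimental groups
# based on its number of training tokens.
def size_bucket(n_tokens):
    if n_tokens < 20_000:
        return "<20k"
    elif n_tokens < 50_000:
        return "20k-50k"
    elif n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"
```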


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having fewer than 20k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia   51.13         53.33            3583
ru taiga        58.32         60.55            10479
sme giella      52.78         53.39            16385
la perseus      49.93         51.60            18184
ug udt          52.78         53.39            19262
sl sst          46.72         48.77            19473
hu szeged       66.23         68.18            20166

Not useful for languages having less than 20k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines        72.18         74.81            48325
fr sequoia      84.36         82.17            50543
en gum          76.44         75.34            53686
ko gsd          73.74         72.54            56687
eu bdt          74.55         73.32            72974
nl lassysmall   76.7          75.8             75134
gl ctg          79.02         79.018           79327
lv lvtb         72.33         72.24            80666
id gsd          75.76         73.97            97531

Beneficial for languages with 50k-100k training tokens


Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa seraji   81.18         81.12            121064
bg btb      84.53         84.55            124336
en ewt      75.77         75.682           204585
ar padt     68.02         68.14            223881
de gsd      71.59         71.32            263804
ca ancora   85.89         85.874           417587
es ancora   84.99         84.78            444617
cs cac      83.57         83.63            472608
cs pdt      81.43         82.12            1173282

Neutral for languages having more than 100k training tokens


Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases, the log-probability of the gold moves is maximized.
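The difference can be sketched as a single training step (illustrative only; the names are placeholders, and in both regimes the loss term would be -log p(gold move | state)):

```python
import random

# A static oracle always executes the gold move to reach the next
# configuration; a dynamic oracle sometimes executes the model's own
# (possibly wrong) prediction, exposing training to parser errors.
def oracle_training_step(state, gold_move, model_move, dynamic, p_explore=0.9):
    if dynamic and random.random() < p_explore:
        executed = model_move   # follow the model's prediction
    else:
        executed = gold_move    # follow the gold transition
    return executed
```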

[Figure: Tree-stack LSTM architecture — the t-RNN combines the head word, dependent word, and dependency relation; the LSTM outputs are concatenated and fed to an MLP.]

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with between 20k and 50k tokens.


Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.


How about languages with fewer than 20k training tokens?


Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af afribooms   not provided   75.46   77.43   78.12
kk ktb         20.19          22.31   21.96   23.86
bxr bdt        7.64           9.76    9.93    8.98
kmr mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4).
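Strategy (4), initializing from a pre-trained parser, can be sketched as a parameter copy (hypothetical names; target-language-specific parameters such as the word-embedding table keep their fresh initialization):

```python
# Copy every parameter the two parsers share (e.g. MLP, LSTM, t-RNN
# weights); any name missing from the fresh parser is ignored.
def init_from_pretrained(fresh_params, pretrained_params):
    params = dict(fresh_params)
    for name, value in pretrained_params.items():
        if name in params:
            params[name] = value
    return params
```

The resulting parser is then fine-tuned on the low-resource language's own (small) treebank.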


Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure σ-LSTM recalculates its hidden from new input

Omer Kırnap (Koc University) MSc Thesis September 27 2018 84 123

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transitions - Right

rightd(σ|s|t βA) = (σ|s βA cup (s d t))

LSTM LSTM LSTM

t-RNN

Right Transition

New Head

Figure Tree-stack LSTM is ready to give new transition

Omer Kırnap (Koc University) MSc Thesis September 27 2018 85 123

Final overview of Tree-stack LSTM

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 86 123

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

Figure: Architecture with only the β-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

Figure: Architecture with only the σ-LSTM.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between MLP and "Only" models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

Figure: Tree-stack LSTM with t-RNN: head word, dependent word, and dependency relation vectors pass through LSTMs, are concatenated, and feed an MLP.

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?
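One common way to embed a morphological feature set is to sum learned vectors, one per feature=value pair. The sketch below is illustrative only (random initialization stands in for learned parameters; names are invented), not the thesis implementation:

```python
import random

random.seed(0)
DIM = 8
_table = {}  # feature=value pair -> vector (randomly initialized here)

def feat_vector(pair):
    """Look up (or lazily create) the vector for one feature=value pair."""
    if pair not in _table:
        _table[pair] = [random.uniform(-0.1, 0.1) for _ in range(DIM)]
    return _table[pair]

def morph_embedding(feats):
    """Embed a UD-style FEATS string (pairs joined by '|') as the sum
    of the per-pair vectors."""
    vec = [0.0] * DIM
    for pair in feats.split("|"):
        for i, x in enumerate(feat_vector(pair)):
            vec[i] += x
    return vec

v = morph_embedding("Case=Nom|Number=Sing|Person=3")
print(len(v))  # → 8
```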

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more
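The four-way split above can be expressed as a simple lookup; the function name is illustrative, and the handling of the exact 20k/50k/100k boundaries is an assumption:

```python
def size_bucket(n_tokens):
    """Map a language's training-token count to one of the four buckets."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Token counts quoted later in this section (ru_taiga, sv_lines, cs_pdt).
print(size_bucket(10_479))     # → <20k
print(size_bucket(48_325))     # → 20k-50k
print(size_bucket(1_173_282))  # → >=100k
```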

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3583
ru taiga       58.32        60.55           10479
sme giella     52.78        53.39           16385
la perseus     49.93        51.60           18184
ug udt         52.78        53.39           19262
sl sst         46.72        48.77           19473
hu szeged      66.23        68.18           20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48325
fr sequoia     84.36        82.17           50543
en gum         76.44        75.34           53686
ko gsd         73.74        72.54           56687
eu bdt         74.55        73.32           72974
nl lassysmall  76.70        75.80           75134
gl ctg         79.02        79.018          79327
lv lvtb        72.33        72.24           80666
id gsd         75.76        73.97           97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121064
bg btb     84.53        84.55           124336
en ewt     75.77        75.682          204585
ar padt    68.02        68.14           223881
de gsd     71.59        71.32           263804
ca ancora  85.89        85.874          417587
es ancora  84.99        84.78           444617
cs cac     83.57        83.63           472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, log p of the gold moves is maximized.

Figure: Tree-stack LSTM with t-RNN (head word, dependent word, and dependency relation LSTMs concatenated and fed to an MLP).
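The difference between the two regimes can be sketched as one training step; every class here is a toy stand-in to illustrate the control flow, not the thesis parser:

```python
import math

class ToyState:
    """Stand-in for a parser configuration; just counts remaining steps."""
    def __init__(self, n):
        self.n = n
    def is_final(self):
        return self.n == 0
    def apply(self, move):
        return ToyState(self.n - 1)

class ToyParser:
    """Toy scorer: a fixed distribution over two moves."""
    def initial_state(self, sentence):
        return ToyState(len(sentence))
    def predict(self, state):
        return {"shift": 0.8, "reduce": 0.2}

class ToyOracle:
    """Oracle that always names 'shift' as the gold (least-cost) move."""
    def best_move(self, state):
        return "shift"

def train_step(parser, sentence, oracle, dynamic=False):
    """One sentence of oracle training: the static oracle advances the
    state with the gold move, the dynamic oracle with the parser's own
    prediction; either way the loss is -sum log p(gold move)."""
    state = parser.initial_state(sentence)
    loss = 0.0
    while not state.is_final():
        probs = parser.predict(state)   # distribution over moves
        gold = oracle.best_move(state)  # gold move for this state
        loss -= math.log(probs[gold])   # maximize log p of gold moves
        move = max(probs, key=probs.get) if dynamic else gold
        state = state.apply(move)       # follow predicted vs gold move
    return loss

loss = train_step(ToyParser(), ["Economic", "news", "had"], ToyOracle())
print(round(loss, 4))  # → 0.6694  (3 steps of -log 0.8)
```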

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens between 20k and 50k.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training the LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
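A tree is projective when no two dependency arcs cross. A minimal check, using the "Economic news had little effect" example from the introduction; the function name and 1-based head encoding are illustrative:

```python
def is_projective(heads):
    """heads[i] is the head of word i+1 (1-based word indices, 0 = root).
    A tree is projective iff no two arcs cross when drawn above the words."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for l1, r1 in arcs:
        for l2, r2 in arcs:
            if l1 < l2 < r1 < r2:  # one endpoint inside the span, one outside
                return False
    return True

# Heads for "Economic news had little effect": all arcs nest -> projective.
print(is_projective([2, 3, 0, 5, 3]))  # → True
# Arcs (1,3) and (2,4) cross -> non-projective.
print(is_projective([3, 4, 0, 3]))     # → False
```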

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity (%)  Best (LAS)  Our (LAS)
grc perseus  90.7              79.39       55.03 (20)
eu bdt       95.13             84.22       74.13 (17)
hu szeged    97.8              82.66       68.18 (14)
da ddt       98.26             86.28       76.40 (17)
en gum       99.6              85.05       76.44 (15)
gl treegal   100               74.25       70.45 (10)
gl ctg       100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases 7

7 From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics, 5:135-146.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123



  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Overview

1 IntroductionOverview of Dependency ParsingTransition Based Dependency Parsing

2 Related WorkLinear Models and their DrawbacksNeural Network Models

3 ModelLanguage ModelMLP ParserTree-stack LSTM Parser

4 ResultsMLP vs Tree-stack LSTMMorphological Feature EmbeddingsStatic vs Dynamic Oracle TrainingTransfer Learning

5 Conclusion6 Future Work amp Discussions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 87 123

4 Results amp Comparisons

Omer Kırnap (Koc University) MSc Thesis September 27 2018 88 123

Results amp Comparisons

Dataset

Dependency parsing of 81

treebanks in 49 languages

All treebanks use standardized

annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 7th out

of 33 participants (1st among

transition based parsers)

Dependency parsing of 82

treebanks in 57 languages

All treebanks use

standardized annotation

17 universal

part-of-speech tags

37 universal dependency

relations

Koc-University ranked 16th

out of 30 participants (2nd

among transition based

parsers)

CoNLL17 CoNLL181 Traintest split change 2 Annotation

Omer Kırnap (Koc University) MSc Thesis September 27 2018 89 123

MLP vs Tree-stack LSTM

CoNLL 2018 committee released comparison results of CoNLL17 andCoNLL18 systems tested under the same test sets

Omer Kırnap (Koc University) MSc Thesis September 27 2018 90 123

MLP vs Tree-stack LSTM

2 possible problems of official comparison

1 If the annotation of the tree bank is improved the older parser ishandicapped

2 If the training-test split has changed and the old training data arenow in test data the old parser is favored undeservedly

Omer Kırnap (Koc University) MSc Thesis September 27 2018 91 123

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models

Lang Code MLP Tree-stackru taiga (10k) 5889 6055hu szeged (20k) 6621 6818tr imst (50k) 5678 5875ar padt (120k) 6783 6814en ewt (205k) 7487 7577cs cac (473k) 8339 8357

Tree-stack LSTM outperforms MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 92 123

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 93 123

MLP Parser

MLP

Figure Initial model

Omer Kırnap (Koc University) MSc Thesis September 27 2018 94 123

Only Action LSTM

LSTM LSTM

Figure Only action LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 95 123

Only β-LSTM

LSTM LSTM LSTM

LSTM LSTM MLP

Figure Only β-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 96 123

Only σ-LSTM

LSTM LSTM

LSTM LSTM MLP

Figure Only σ-LSTM

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang Code MLP Only Action Only-β Only-σhu szeged 6621 6687 6694 6703sv lines 7112 7205 7217 7245tr imst 5712 5687 5702 5712ar padt 6783 6667 6689 6692

cs cac 8389 8223 8313 8317

en ewt 7554 7543 7556 7567

Table Comparison between MLP and rdquoOnlyrdquo models

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang Code without t-RNN with t-RNNno nynorsklia (3k) 5178 5333ru taiga (11k) 5913 6055gl treegal (15k) 6976 7045hu szeged (20k) 6612 6818sv lines (49k) 7404 7546tr imst (50k) 5812 5875

ar padt (120k) 6804 6814

en ewt (204k) 7487 7577

cs cac (473k) 8289 8357

cs pdt (1M) 8117 81164

t-RNN provides comparative advantage for low-resourcelanguages

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity (%)   Best (LAS)   Ours (LAS)
grc_perseus   90.7               79.39        55.03 (20)
eu_bdt        95.13              84.22        74.13 (17)
hu_szeged     97.8               82.66        68.18 (14)
da_ddt        98.26              86.28        76.40 (17)
en_gum        99.6               85.05        76.44 (15)
gl_treegal    100                74.25        70.45 (10)
gl_ctg        100                82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7: From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

MLP vs Tree-stack LSTM

Two possible problems with the official comparison:

1. If the annotation of the treebank is improved, the older parser is handicapped.

2. If the training-test split has changed and the old training data are now in the test data, the old parser is favored undeservedly.

MLP vs Tree-stack LSTM

Experiments with the same train-test datasets to compare models:

Lang Code        MLP    Tree-stack
ru taiga (10k)   58.89  60.55
hu szeged (20k)  66.21  68.18
tr imst (50k)    56.78  58.75
ar padt (120k)   67.83  68.14
en ewt (205k)    74.87  75.77
cs cac (473k)    83.39  83.57

Tree-stack LSTM outperforms MLP.

Ablation Analysis of Tree-stack LSTM

An evolution from MLP to Tree-stack LSTM.

MLP Parser

Figure: Initial model (MLP).

Only Action LSTM

Figure: Only action LSTM.

Only β-LSTM

Figure: Only β-LSTM.

Only σ-LSTM

Figure: Only σ-LSTM.

Ablation Analysis Results

Lang Code  MLP    Only Action  Only-β  Only-σ
hu szeged  66.21  66.87        66.94   67.03
sv lines   71.12  72.05        72.17   72.45
tr imst    57.12  56.87        57.02   57.12
ar padt    67.83  66.67        66.89   66.92
cs cac     83.89  82.23        83.13   83.17
en ewt     75.54  75.43        75.56   75.67

Table: Comparison between the MLP and the "Only" models.

Ablation of t-RNN

Figure: t-RNN architecture (head word, dependent word, and dependency relation encoded by LSTMs, concatenated, and fed into the MLP).
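The t-RNN composes a head word, a dependent word, and their dependency relation into a single vector each time an arc is added, in the spirit of the composition function of Dyer et al. (2015). A minimal numpy sketch; the toy dimension, the tanh affine composition, and the parameter names are assumptions for illustration, not the thesis model's exact parameterization:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 8                                   # toy embedding size (assumption)
W = rng.normal(size=(D, 3 * D)) * 0.1   # hypothetical composition weights
b = np.zeros(D)

def compose(head_vec, dep_vec, rel_vec):
    """Merge head, dependent, and relation vectors into a new
    representation for the head after the arc is added."""
    x = np.concatenate([head_vec, dep_vec, rel_vec])
    return np.tanh(W @ x + b)

head, dep, rel = (rng.normal(size=D) for _ in range(3))
new_head = compose(head, dep, rel)      # replaces the head's old vector
assert new_head.shape == (D,)
```

The composed vector stands in for the head on the stack, so subtree structure accumulates recursively as more arcs are added.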

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.16

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of the ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of the ablation experiments:

t-RNN's performance contribution increases as the training set size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental settings: we divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having fewer than 20k tokens

Languages having more than 20k but fewer than 50k tokens

Languages having more than 50k but fewer than 100k tokens

Languages having 100k tokens or more
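The four-way split above can be expressed as a simple bucketing rule. A sketch (the bucket labels are ours, chosen to mirror the groups listed):

```python
def size_bucket(n_tokens: int) -> str:
    """Assign a treebank to one of the four training-size groups."""
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# e.g. no_nynorsklia has 3,583 training tokens, cs_pdt has 1,173,282
assert size_bucket(3_583) == "<20k"
assert size_bucket(1_173_282) == ">=100k"
```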

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having fewer than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33           3,583
ru taiga       58.32        60.55           10,479
sme giella     52.78        53.39           16,385
la perseus     49.93        51.60           18,184
ug udt         52.78        53.39           19,262
sl sst         46.72        48.77           19,473
hu szeged      66.23        68.18           20,166

Not useful for languages having fewer than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81           48,325
fr sequoia     84.36        82.17           50,543
en gum         76.44        75.34           53,686
ko gsd         73.74        72.54           56,687
eu bdt         74.55        73.32           72,974
nl lassysmall  76.70        75.80           75,134
gl ctg         79.02        79.018          79,327
lv lvtb        72.33        72.24           80,666
id gsd         75.76        73.97           97,531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12           121,064
bg btb     84.53        84.55           124,336
en ewt     75.77        75.68           204,585
ar padt    68.02        68.14           223,881
de gsd     71.59        71.32           263,804
ca ancora  85.89        85.87           417,587
es ancora  84.99        84.78           444,617
cs cac     83.57        83.63           472,608
cs pdt     81.43        82.12           1,173,282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

Figure: t-RNN architecture (head word, dependent word, and dependency relation encoded by LSTMs, concatenated, and fed into the MLP).
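A static oracle can be read deterministically off a projective gold tree. The sketch below uses the classic arc-standard transition system for illustration (the thesis parser's transition system may differ); `heads[i]` gives the gold head of token `i+1`, with `0` for the artificial root:

```python
def static_oracle(heads):
    """Gold arc-standard transition sequence for a projective tree.
    heads[i] = head of token i+1; 0 denotes the artificial root."""
    n = len(heads)
    remaining = [0] * (n + 1)       # dependents of each word not yet attached
    for h in heads:
        remaining[h] += 1
    stack, buf = [0], list(range(1, n + 1))
    moves, arcs = [], set()
    while buf or len(stack) > 1:
        if len(stack) >= 2:
            s1, s0 = stack[-2], stack[-1]
            # LEFT-ARC: s0 is the head of s1 and s1 has collected its dependents
            if s1 != 0 and heads[s1 - 1] == s0 and remaining[s1] == 0:
                moves.append("LEFT"); arcs.add((s0, s1))
                remaining[s0] -= 1; stack.pop(-2); continue
            # RIGHT-ARC: s1 is the head of s0 and s0 has collected its dependents
            if heads[s0 - 1] == s1 and remaining[s0] == 0:
                moves.append("RIGHT"); arcs.add((s1, s0))
                remaining[s1] -= 1; stack.pop(); continue
        if not buf:
            raise ValueError("tree is non-projective")
        moves.append("SHIFT"); stack.append(buf.pop(0))
    return moves, arcs

# "Economic news had little effect": news<-Economic, had<-news, root<-had, ...
moves, arcs = static_oracle([2, 3, 0, 5, 3])
assert arcs == {(2, 1), (3, 2), (0, 3), (5, 4), (3, 5)}
assert len(moves) == 10             # n shifts + n arc moves
```

During training, a static oracle always executes this gold move, while a dynamic oracle executes the model's predicted move and recomputes which moves remain optimal from the resulting, possibly off-gold, configuration.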

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with fewer than 20k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with between 20k and 50k tokens.

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

How about languages with fewer than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch

2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]

3. Using my own word and context vectors, trained on a different language from the same language family

4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt       7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Transfer Learning

Conclusions of the transfer learning experiments:

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition-based parsers can only build projective trees.

Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
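Projectivity is easy to test directly: an arc (h, d) is projective iff every token strictly between h and d is a descendant of h. A small sketch, assuming `heads` encodes a valid tree:

```python
def is_projective(heads):
    """heads[i] = head of token i+1 (0 = root).
    Returns True iff no arc is crossed by another."""
    n = len(heads)
    for d in range(1, n + 1):
        h = heads[d - 1]
        for k in range(min(h, d) + 1, max(h, d)):
            a = k
            while a != 0 and a != h:    # climb from k toward the root
                a = heads[a - 1]
            if a != h:                  # k does not descend from h
                return False
    return True

assert is_projective([2, 3, 0, 5, 3])   # "Economic news had little effect"
assert not is_projective([3, 0, 2])     # token 2 intervenes in the arc 3 -> 1
```

Trees failing this test cannot be produced by the transition systems above, which bounds the attainable LAS on treebanks with low projectivity ratios.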

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Ours (LAS)
grc perseus  90.7          79.39       55.03 (20)
eu bdt       95.13         84.22       74.13 (17)
hu szeged    97.8          82.66       68.18 (14)
da ddt       98.26         86.28       76.40 (17)
en gum       99.6          85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

From the official results page and our projectivity table.

Conclusions

Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function; other losses (e.g., CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention!

Questions?


Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

MLP Parser

Figure: Initial model (MLP parser)

Only Action LSTM

Figure: Only action LSTM

Only β-LSTM

Figure: Only β-LSTM

Only σ-LSTM

Figure: Only σ-LSTM

Ablation Analysis Results

Lang Code    MLP    Only Action  Only-β  Only-σ
hu szeged    66.21  66.87        66.94   67.03
sv lines     71.12  72.05        72.17   72.45
tr imst      57.12  56.87        57.02   57.12
ar padt      67.83  66.67        66.89   66.92
cs cac       83.89  82.23        83.13   83.17
en ewt       75.54  75.43        75.56   75.67

Table: Comparison between the MLP and "Only" models

Ablation of t-RNN

Figure: t-RNN — the head word, dependent word, and dependency relation embeddings pass through LSTM cells; the outputs are concatenated and fed to an MLP
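The composition in the figure can be sketched in miniature. This is an illustrative toy, not the thesis code: a single vanilla RNN step stands in for the LSTM cells, and `W`, `U`, `b` and the embedding vectors are hypothetical toy parameters.

```python
import math

def rnn_step(x, h, W, U, b):
    # One vanilla RNN step, h' = tanh(W·x + U·h + b); a stand-in for an LSTM cell.
    return [math.tanh(sum(w * xi for w, xi in zip(Wr, x)) +
                      sum(u * hi for u, hi in zip(Ur, h)) + bi)
            for Wr, Ur, bi in zip(W, U, b)]

def t_rnn(head_vec, dep_vec, rel_vec, h0, W, U, b):
    # Feed the head word, dependent word, and dependency relation embeddings
    # through the recurrent cell in sequence; the final state summarizes the arc.
    h = h0
    for vec in (head_vec, dep_vec, rel_vec):
        h = rnn_step(vec, h, W, U, b)
    return h

# Toy 2-dimensional parameters, purely for illustration.
W = [[0.1, 0.0], [0.0, 0.1]]
U = [[0.2, 0.0], [0.0, 0.2]]
b = [0.0, 0.0]
h = t_rnn([1.0, 0.0], [0.0, 1.0], [0.5, 0.5], [0.0, 0.0], W, U, b)
```

In the full model, such an arc summary would be concatenated with the σ-, β-, and action-LSTM outputs before the final MLP.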

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN:

Lang Code           without t-RNN  with t-RNN
no nynorsklia (3k)  51.78          53.33
ru taiga (11k)      59.13          60.55
gl treegal (15k)    69.76          70.45
hu szeged (20k)     66.12          68.18
sv lines (49k)      74.04          75.46
tr imst (50k)       58.12          58.75
ar padt (120k)      68.04          68.14
en ewt (204k)       74.87          75.77
cs cac (473k)       82.89          83.57
cs pdt (1M)         81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.

Ablation Analysis

Overall results of ablation analysis:

Lang       MLP    Only A  Only-β  Only-σ  w/o t-RNN  all
hu szeged  66.21  66.87   66.94   67.03   66.12      68.18
sv lines   71.12  72.05   72.17   74.04   72.17      75.46
tr imst    57.12  56.87   57.02   57.12   58.12      58.75
ar padt    67.83  66.67   66.89   66.92   68.04      68.14
cs cac     83.89  82.23   83.13   83.17   82.89      83.57
en ewt     75.54  75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments:

t-RNN's performance contribution increases as the training size decreases.

σ-LSTM provides more useful information independent of dataset size.

Interconnecting the model's components with t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).
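The ablation variants above amount to feeding the final MLP only a subset of the component outputs. A minimal sketch of that idea (component names and vector sizes are illustrative, not taken from the thesis implementation):

```python
def mlp_input(encodings, use=("sigma", "beta", "action", "tree")):
    """Concatenate only the selected component encodings, mimicking the
    Only-σ / Only-β / Only-Action / without-t-RNN ablation variants."""
    feats = []
    for name in ("sigma", "beta", "action", "tree"):
        if name in use:
            feats.extend(encodings[name])
    return feats

# Toy per-component encodings.
enc = {"sigma": [0.1, 0.2], "beta": [0.3], "action": [0.4], "tree": [0.5, 0.6]}
only_sigma = mlp_input(enc, use=("sigma",))                      # Only-σ
without_trnn = mlp_input(enc, use=("sigma", "beta", "action"))   # w/o t-RNN
full = mlp_input(enc)                                            # full model
```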

What does Morphological Feature Embedding provide?

Contribution of Morph-feat Embeddings

Experimental settings: we divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more
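The split can be expressed as a simple bucketing function over training-token counts (thresholds as above; the example counts are the ones reported in the result tables):

```python
def size_bucket(n_tokens):
    # The four groups used in the morph-feat experiments.
    if n_tokens < 20_000:
        return "<20k"
    if n_tokens < 50_000:
        return "20k-50k"
    if n_tokens < 100_000:
        return "50k-100k"
    return ">=100k"

# Token counts taken from the result tables.
buckets = {code: size_bucket(n) for code, n in
           [("ru_taiga", 10479), ("sv_lines", 48325),
            ("gl_ctg", 79327), ("en_ewt", 204585)]}
```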

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
no nynorsklia  51.13        53.33             3583
ru taiga       58.32        60.55            10479
sme giella     52.78        53.39            16385
la perseus     49.93        51.6             18184
ug udt         52.78        53.39            19262
sl sst         46.72        48.77            19473
hu szeged      66.23        68.18            20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code      Morph-Feats  no Morph-Feats  # of tokens
sv lines       72.18        74.81            48325
fr sequoia     84.36        82.17            50543
en gum         76.44        75.34            53686
ko gsd         73.74        72.54            56687
eu bdt         74.55        73.32            72974
nl lassysmall  76.7         75.8             75134
gl ctg         79.02        79.018           79327
lv lvtb        72.33        72.24            80666
id gsd         75.76        73.97            97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code  Morph-Feats  no Morph-Feats  # of tokens
fa seraji  81.18        81.12            121064
bg btb     84.53        84.55            124336
en ewt     75.77        75.682           204585
ar padt    68.02        68.14            223881
de gsd     71.59        71.32            263804
ca ancora  85.89        85.874           417587
es ancora  84.99        84.78            444617
cs cac     83.57        83.63            472608
cs pdt     81.43        82.12           1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow gold moves. Dynamic oracle: transitions follow predicted moves.

In both cases the log-probability of gold moves is maximized.
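A schematic of the difference: both regimes compute the loss against the gold move, but only the dynamic oracle advances the parser along the model's own predictions. This is a sketch, not the thesis training loop; `scores` is a hypothetical stand-in for the model's unnormalized move scores.

```python
import math
import random

def train_transition(gold_move, scores, dynamic, explore=1.0, rng=random):
    """Return (loss, move_to_apply) for one parser transition.
    scores: dict mapping each legal move to its unnormalized model score."""
    log_z = math.log(sum(math.exp(s) for s in scores.values()))
    loss = log_z - scores[gold_move]      # -log p(gold move), in both regimes
    predicted = max(scores, key=scores.get)
    if dynamic and rng.random() < explore:
        return loss, predicted            # dynamic oracle: follow the prediction
    return loss, gold_move                # static oracle: follow the gold move
```

With `explore=1.0` the dynamic branch is deterministic; in practice exploration is usually stochastic.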


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens fewer than 20k.

Figure: Results are very close for training tokens between 20k and 50k.

Figure: Results are very close for training tokens of more than 50k.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language      (1)           (2)    (3)    (4)
af afribooms  not provided  75.46  77.43  78.12
kk ktb        20.19         22.31  21.96  23.86
bxr bdt        7.64          9.76   9.93   8.98
kmr mg        20.12         22.57  22.78  23.39

Table: LAS values for strategies (1), (2), (3), and (4)
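Strategy (4) can be sketched as copying every compatible parameter from a parser pre-trained on a related language. Parameters here are plain lists and the parameter names are hypothetical, purely for illustration:

```python
def warm_start(target, source):
    """Copy parameters whose name and shape match; leave the rest
    (e.g. language-specific embedding tables) at their fresh initialization."""
    copied = []
    for name, value in source.items():
        if name in target and len(target[name]) == len(value):
            target[name] = list(value)
            copied.append(name)
    return copied

fresh = {"mlp.w": [0.0, 0.0, 0.0], "embed.words": [0.0] * 5}
pretrained = {"mlp.w": [0.3, -0.1, 0.2], "embed.words": [0.1] * 7}  # vocab differs
copied = warm_start(fresh, pretrained)
```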

Transfer Learning

Conclusions of Transfer Learning Experiments:

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition-based parsers can only build projective trees. [6]

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
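Projectivity can be checked directly: a tree is projective iff no two dependency arcs cross. A standard check (not code from the thesis), with `heads[i]` giving the head of token `i+1` and 0 denoting the root:

```python
def is_projective(heads):
    """True iff no two arcs (drawn above the sentence) cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for i, j in arcs:
        for k, l in arcs:
            if i < k < j < l:   # arcs (i, j) and (k, l) cross
                return False
    return True
```

For example, `heads = [2, 0, 2]` is projective, while `heads = [4, 0, 2, 2]` contains a crossing arc and is not.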

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios:

Language     Projectivity  Best (LAS)  Our (LAS)
grc perseus   90.7         79.39       55.03 (20)
eu bdt        95.13        84.22       74.13 (17)
hu szeged     97.8         82.66       68.18 (14)
da ddt        98.26        86.28       76.40 (17)
en gum        99.6         85.05       76.44 (15)
gl treegal   100           74.25       70.45 (10)
gl ctg       100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. [7]

[7] From the official results page and our projectivity table.
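LAS, the metric in these tables, is the percentage of tokens whose predicted head and dependency label both match the gold annotation:

```python
def las(gold, pred):
    """gold, pred: per-token lists of (head, label) pairs."""
    correct = sum(g == p for g, p in zip(gold, pred))
    return 100.0 * correct / len(gold)

gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (3, "amod")]
pred = [(2, "nsubj"), (0, "root"), (2, "iobj"), (2, "amod")]
score = las(gold, pred)   # 2 of 4 tokens fully correct
```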

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our tree-stack LSTM outperformed the MLP by removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM states, or within the β-LSTM or Action-LSTM, may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673–682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions



use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Only σ-LSTM

[Figure: the Only σ-LSTM model. The stack (σ) LSTM alone feeds the MLP classifier; the other components are ablated.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 97 123

Ablation Analysis Results

Lang code | MLP | Only-Action | Only-β | Only-σ
hu szeged | 66.21 | 66.87 | 66.94 | 67.03
sv lines | 71.12 | 72.05 | 72.17 | 72.45
tr imst | 57.12 | 56.87 | 57.02 | 57.12
ar padt | 67.83 | 66.67 | 66.89 | 66.92
cs cac | 83.89 | 82.23 | 83.13 | 83.17
en ewt | 75.54 | 75.43 | 75.56 | 75.67

Table: Comparison between MLP and "Only" models.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 98 123

Ablation of t-RNN

[Figure: the t-RNN unit. Head word, dependent word, and dependency relation vectors are concatenated and fed to an MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 99 123
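The composition step in the t-RNN figure can be sketched in a few lines. This is an illustrative NumPy sketch, not the thesis implementation: the dimensions and the single tanh dense layer standing in for the MLP are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions, chosen for illustration only.
D_WORD, D_REL, D_OUT = 8, 4, 16

# One dense layer composing (head, dependent, relation) into a single
# vector, mimicking the Concat -> MLP step of the t-RNN figure.
W = rng.normal(size=(D_OUT, 2 * D_WORD + D_REL))
b = np.zeros(D_OUT)

def t_rnn_compose(head_vec, dep_vec, rel_vec):
    x = np.concatenate([head_vec, dep_vec, rel_vec])  # Concat
    return np.tanh(W @ x + b)                          # dense layer

h = t_rnn_compose(rng.normal(size=D_WORD),
                  rng.normal(size=D_WORD),
                  rng.normal(size=D_REL))
print(h.shape)  # (16,)
```

The resulting vector can then replace the head word's representation in the stack, which is what lets reduced subtrees keep contributing features.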

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code (train tokens) | without t-RNN | with t-RNN
no nynorsklia (3k) | 51.78 | 53.33
ru taiga (11k) | 59.13 | 60.55
gl treegal (15k) | 69.76 | 70.45
hu szeged (20k) | 66.12 | 68.18
sv lines (49k) | 74.04 | 75.46
tr imst (50k) | 58.12 | 58.75
ar padt (120k) | 68.04 | 68.14
en ewt (204k) | 74.87 | 75.77
cs cac (473k) | 82.89 | 83.57
cs pdt (1M) | 81.17 | 81.164

t-RNN provides a comparative advantage for low-resource languages.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 100 123

Ablation Analysis

Overall results of ablation analysis

Lang | MLP | Only-A | Only-β | Only-σ | w/o t-RNN | all
hu szeged | 66.21 | 66.87 | 66.94 | 67.03 | 66.12 | 68.18
sv lines | 71.12 | 72.05 | 72.17 | 74.04 | 72.17 | 75.46
tr imst | 57.12 | 56.87 | 57.02 | 57.12 | 58.12 | 58.75
ar padt | 67.83 | 66.67 | 66.89 | 66.92 | 68.04 | 68.14
cs cac | 83.89 | 82.23 | 83.13 | 83.17 | 82.89 | 83.57
en ewt | 75.54 | 75.43 | 75.56 | 75.67 | 74.87 | 75.77

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases as the training set size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes the tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental Settings
We divide the CoNLL18 UD dataset (v2.2) into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k and less than 50k tokens

Languages having more than 50k and less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of training tokens
no nynorsklia | 51.13 | 53.33 | 3583
ru taiga | 58.32 | 60.55 | 10479
sme giella | 52.78 | 53.39 | 16385
la perseus | 49.93 | 51.6 | 18184
ug udt | 52.78 | 53.39 | 19262
sl sst | 46.72 | 48.77 | 19473
hu szeged | 66.23 | 68.18 | 20166

Not useful for languages having less than 20k training tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of training tokens
sv lines | 72.18 | 74.81 | 48325
fr sequoia | 84.36 | 82.17 | 50543
en gum | 76.44 | 75.34 | 53686
ko gsd | 73.74 | 72.54 | 56687
eu bdt | 74.55 | 73.32 | 72974
nl lassysmall | 76.7 | 75.8 | 75134
gl ctg | 79.02 | 79.018 | 79327
lv lvtb | 72.33 | 72.24 | 80666
id gsd | 75.76 | 73.97 | 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code | Morph-Feats | no Morph-Feats | # of training tokens
fa seraji | 81.18 | 81.12 | 121064
bg btb | 84.53 | 84.55 | 124336
en ewt | 75.77 | 75.682 | 204585
ar padt | 68.02 | 68.14 | 223881
de gsd | 71.59 | 71.32 | 263804
ca ancora | 85.89 | 85.874 | 417587
es ancora | 84.99 | 84.78 | 444617
cs cac | 83.57 | 83.63 | 472608
cs pdt | 81.43 | 82.12 | 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.
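The contrast between the two regimes can be shown with a toy move scorer. This is a minimal sketch, not the thesis parser: the scores and move set are made up, and only the control flow (which move the parser state follows after the update) distinguishes static from dynamic training.

```python
import math

MOVES = ["SHIFT", "LEFT-ARC", "RIGHT-ARC"]

def log_probs(scores):
    # Numerically stable log-softmax over move scores.
    m = max(scores)
    z = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - z for s in scores]

def oracle_step(scores, gold, dynamic):
    """One training step: the loss is -log p(gold) in BOTH regimes;
    they differ only in which move the parser state then follows."""
    lp = log_probs(scores)
    loss = -lp[MOVES.index(gold)]
    if dynamic:
        # Dynamic oracle: follow the model's own (possibly wrong)
        # prediction, so training also visits error states.
        followed = MOVES[max(range(len(lp)), key=lp.__getitem__)]
    else:
        # Static oracle: always follow the gold move.
        followed = gold
    return loss, followed

scores = [0.1, 2.0, 0.3]  # the model prefers LEFT-ARC here
print(oracle_step(scores, "SHIFT", dynamic=False))  # follows SHIFT
print(oracle_step(scores, "SHIFT", dynamic=True))   # follows LEFT-ARC
```

Because the dynamic oracle visits states the model actually reaches at test time, it can reduce error propagation, at the cost of a noisier training trajectory.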

[Figure: the t-RNN unit. Head word, dependent word, and dependency relation vectors are concatenated and fed to an MLP.]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with less than 20k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with between 20k and 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for training sets with more than 50k tokens.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language | (1) | (2) | (3) | (4)
af afribooms | not provided | 75.46 | 77.43 | 78.12
kk ktb | 20.19 | 22.31 | 21.96 | 23.86
bxr bdt | 7.64 | 9.76 | 9.93 | 8.98
kmr mg | 20.12 | 22.57 | 22.78 | 23.39

Table: LAS values for strategies (1), (2), (3), and (4).

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition-based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
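A tree is projective exactly when no two dependency arcs cross. A small self-contained check (head indices follow the CoNLL convention, 0 = artificial root; the example trees are hypothetical):

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens numbered 1..n,
    0 = artificial root). A tree is projective iff no two arcs
    cross when drawn above the sentence."""
    arcs = [tuple(sorted((h, d))) for d, h in enumerate(heads, start=1)]
    for lo1, hi1 in arcs:
        for lo2, hi2 in arcs:
            if lo1 < lo2 < hi1 < hi2:  # strictly interleaved endpoints: crossing
                return False
    return True

print(is_projective([2, 0, 2]))      # True: a simple projective tree
print(is_projective([3, 4, 0, 3]))   # False: arcs (1,3) and (2,4) cross
```

Non-projective trees in a treebank therefore put a hard ceiling on the attachment score a purely projective transition system can reach, which motivates the comparison by projectivity ratio below.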

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language | Projectivity % | Best (LAS) | Our (LAS)
grc perseus | 90.7 | 79.39 | 55.03 (20)
eu bdt | 95.13 | 84.22 | 74.13 (17)
hu szeged | 97.8 | 82.66 | 68.18 (14)
da ddt | 98.26 | 86.28 | 76.40 (17)
en gum | 99.6 | 85.05 | 76.44 (15)
gl treegal | 100 | 74.25 | 70.45 (10)
gl ctg | 100 | 82.12 | 79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases.

7 From the official results page and our projectivity table.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123


Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Ablation of t-RNN

Comparison of stack-LSTMs with and without t-RNN

Lang code            without t-RNN   with t-RNN
no nynorsklia (3k)       51.78          53.33
ru taiga (11k)           59.13          60.55
gl treegal (15k)         69.76          70.45
hu szeged (20k)          66.12          68.18
sv lines (49k)           74.04          75.46
tr imst (50k)            58.12          58.75
ar padt (120k)           68.04          68.14
en ewt (204k)            74.87          75.77
cs cac (473k)            82.89          83.57
cs pdt (1M)              81.17          81.164

t-RNN provides a comparative advantage for low-resource languages.
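The numbers in these tables are labeled attachment scores (LAS). As a reminder of what is being measured, here is a minimal sketch; the (head, label) pairs are a toy example, not thesis data:

```python
def las(gold, pred):
    """Labeled attachment score: percentage of tokens whose predicted
    head AND dependency label both match the gold annotation."""
    assert len(gold) == len(pred)
    correct = sum(1 for g, p in zip(gold, pred) if g == p)
    return 100.0 * correct / len(gold)

# Toy example: 3 of 4 tokens get both head and label right.
gold = [(2, "amod"), (3, "nsubj"), (0, "root"), (3, "obj")]
pred = [(2, "amod"), (3, "nsubj"), (0, "root"), (3, "obl")]
print(round(las(gold, pred), 2))  # → 75.0
```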

Ablation Analysis

Overall results of ablation analysis

Lang        MLP     Only-A  Only-β  Only-σ  w/o t-RNN  all
hu szeged   66.21   66.87   66.94   67.03   66.12      68.18
sv lines    71.12   72.05   72.17   74.04   72.17      75.46
tr imst     57.12   56.87   57.02   57.12   58.12      58.75
ar padt     67.83   66.67   66.89   66.92   68.04      68.14
cs cac      83.89   82.23   83.13   83.17   82.89      83.57
en ewt      75.54   75.43   75.56   75.67   74.87      75.77

Tree-stack LSTM beats the other model variations.

Ablation Analysis

Conclusions of Ablation Experiments

t-RNN's performance contribution increases when the training size decreases.

σ-LSTM provides more useful information, independent of dataset size.

Interconnecting the model's components with the t-RNN makes tree-stack LSTM more powerful for low-resource languages (ranked 10th overall and 2nd among transition-based parsers).

What does Morphological Feature Embedding provide?
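As a rough sketch of the idea (not the exact architecture from the thesis): each Feat=Value pair from the UD FEATS column gets its own learned vector, and a token's morph-feat embedding combines them, here by summing. The table initialization and dimension below are illustrative assumptions:

```python
import random

random.seed(0)
DIM = 8
feat_vectors = {}   # hypothetical learned table: one vector per Feat=Value pair

def morph_feat_embedding(feats):
    """Embed a UD FEATS string like 'Case=Nom|Number=Sing' by summing
    the vectors of its individual Feat=Value pairs ('_' = no features)."""
    vec = [0.0] * DIM
    if feats == "_":
        return vec
    for pair in feats.split("|"):
        if pair not in feat_vectors:           # lazily create unseen features
            feat_vectors[pair] = [random.gauss(0, 1) for _ in range(DIM)]
        vec = [a + b for a, b in zip(vec, feat_vectors[pair])]
    return vec

v = morph_feat_embedding("Case=Nom|Number=Sing")
print(len(v))  # → 8
```

Summing lets feature combinations unseen in training still receive a sensible vector, since each Feat=Value pair is embedded independently.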

Contribution of Morph-feat Embeddings

Experimental Settings: We divide the CoNLL18 UD 2.2 dataset into 4 parts based on the number of training tokens for each language, to better understand our contributions:

Languages having less than 20k tokens

Languages having more than 20k, less than 50k tokens

Languages having more than 50k, less than 100k tokens

Languages having 100k tokens or more

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having less than 20k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
no nynorsklia      51.13          53.33            3583
ru taiga           58.32          60.55           10479
sme giella         52.78          53.39           16385
la perseus         49.93          51.6            18184
ug udt             52.78          53.39           19262
sl sst             46.72          48.77           19473
hu szeged          66.23          68.18           20166

Not useful for languages having less than 20k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv lines           72.18          74.81           48325
fr sequoia         84.36          82.17           50543
en gum             76.44          75.34           53686
ko gsd             73.74          72.54           56687
eu bdt             74.55          73.32           72974
nl lassysmall      76.7           75.8            75134
gl ctg             79.02          79.018          79327
lv lvtb            72.33          72.24           80666
id gsd             75.76          73.97           97531

Beneficial for languages with 50k-100k training tokens.

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens

Lang code       Morph-Feats   no Morph-Feats   # of tokens
fa seraji          81.18          81.12          121064
bg btb             84.53          84.55          124336
en ewt             75.77          75.682         204585
ar padt            68.02          68.14          223881
de gsd             71.59          71.32          263804
ca ancora          85.89          85.874         417587
es ancora          84.99          84.78          444617
cs cac             83.57          83.63          472608
cs pdt             81.43          82.12         1173282

Neutral for languages having more than 100k training tokens.

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the parser's predicted moves.

In both cases, the log-probability of the gold moves is maximized.
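The only difference between the two regimes is which move is executed during training; the loss maximizes the log-probability of the oracle move either way. A toy sketch, with hypothetical state and oracle interfaces standing in for a real transition system:

```python
import math

class ToyState:
    """Minimal stand-in for a parser state: parsing finishes after n moves."""
    def __init__(self, n): self.n = n
    def is_final(self): return self.n == 0
    def apply(self, move): return ToyState(self.n - 1)

def move_probs(state):
    # Hypothetical model output: a fixed distribution over two moves.
    return {"SHIFT": 0.7, "REDUCE": 0.3}

def best_move(state):
    # Hypothetical oracle: the gold move is always SHIFT in this toy.
    return "SHIFT"

def train_sentence(n_moves, dynamic=False):
    """Static oracle: execute the gold move at every step.
    Dynamic oracle: execute the model's predicted move, while the loss
    still maximizes log p of the oracle move for the visited state."""
    loss, state = 0.0, ToyState(n_moves)
    while not state.is_final():
        probs = move_probs(state)
        gold = best_move(state)
        loss -= math.log(probs[gold])                       # -log p(gold move)
        move = max(probs, key=probs.get) if dynamic else gold
        state = state.apply(move)
    return loss

print(round(train_sentence(3), 2))  # → 1.07
```

The dynamic variant lets training visit states the parser actually reaches at test time, which is where it is expected to help.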

[Figure: the t-RNN combines the head word, the dependent word, and the dependency relation through LSTMs whose outputs are concatenated and fed to an MLP.]


Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens less than 20k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens in between 20k and 50k.

Static vs Dynamic Oracle Training

Figure: Results are very close for training tokens more than 50k.

How about languages with less than 20k training tokens?

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al. 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language        (1)            (2)     (3)     (4)
af afribooms    not provided   75.46   77.43   78.12
kk ktb          20.19          22.31   21.96   23.86
bxr bdt         7.64           9.76    9.93    8.98
kmr mg          20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)
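Strategy (4) can be pictured as warm-starting the low-resource parser from a related-language parser's parameters. The sketch below uses illustrative parameter names and a plain dict of weight lists, not the thesis implementation:

```python
def warm_start(target_params, source_params):
    """Copy every parameter whose name and size match the pre-trained
    parser; leave the rest (e.g. language-specific vocab rows) untouched."""
    copied = []
    for name, tensor in source_params.items():
        if name in target_params and len(target_params[name]) == len(tensor):
            target_params[name] = list(tensor)   # copy, don't alias
            copied.append(name)
    return copied

# Hypothetical parameters: shared LSTM weights transfer, embeddings don't.
src = {"lstm.w": [1.0, 2.0], "embed.kk": [0.5]}
tgt = {"lstm.w": [0.0, 0.0], "embed.tr": [0.0]}
print(warm_start(tgt, src))  # → ['lstm.w']
```

Training then continues on the low-resource treebank from these initial weights instead of from a random initialization.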

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

From-scratch LM training does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Projectivity

Transition-based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf
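A tree is projective when no two dependency arcs cross. A small check, with heads given 1-based and 0 for the root (the example trees below are illustrative, not from a treebank):

```python
def is_projective(heads):
    """heads[i] is the head of token i+1 (1-based; 0 = root).
    The tree is projective iff no two arcs cross."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    for (l1, r1) in arcs:
        for (l2, r2) in arcs:
            if l1 < l2 < r1 < r2:   # arcs (l1,r1) and (l2,r2) cross
                return False
    return True

def projectivity_ratio(treebank):
    """Percent of projective trees, as in the Projectivity column."""
    return 100.0 * sum(map(is_projective, treebank)) / len(treebank)

print(is_projective([2, 0, 2]))      # → True  (simple chain)
print(is_projective([3, 4, 0, 3]))   # → False (arcs 1→3 and 2→4 cross)
print(projectivity_ratio([[2, 0, 2], [3, 4, 0, 3]]))  # → 50.0
```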


Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language       Projectivity   Best (LAS)   Our (LAS)
grc perseus       90.7           79.39        55.03 (20)
eu bdt            95.13          84.22        74.13 (17)
hu szeged         97.8           82.66        68.18 (14)
da ddt            98.26          86.28        76.40 (17)
en gum            99.6           85.05        76.44 (15)
gl treegal       100             74.25        70.45 (10)
gl ctg           100             82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7 From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion:

We introduced "Context, Word, and Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

When the training dataset size increases, tree-stack LSTM loses its advantage.

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF) may solve this problem.

Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Thank you for your attention


Questions?

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Ablation Analysis

Overall results of ablation analysis

Lang MLP Only A Only-β Only-σ wot-RNN allhu szeged 6621 6687 6694 6703 6612 6818sv lines 7112 7205 7217 7404 7217 7546tr imst 5712 5687 5702 5712 5812 5875ar padt 6783 6667 6689 6692 6804 6814cs cac 8389 8223 8313 8317 8289 8357en ewt 7554 7543 7556 7567 7487 7577

Tree-stack LSTM beats other model variations

Omer Kırnap (Koc University) MSc Thesis September 27 2018 101 123

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Ablation Analysis

Conclusions of Ablation Experiments

t-RNNrsquos performance contribution increases when the training sizedecreases

σ-LSTM provides more useful information independent from datasetsize

Interconnecting modelrsquos component with t-RNN makes tree-stackLSTM more powerful for low-resource languages (ranked 10th of alland 2nd among transition based parsers)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 102 123

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

What does Morphological Feature Embedding provide

Omer Kırnap (Koc University) MSc Thesis September 27 2018 103 123

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having between 50k and 100k training tokens:

Lang code       Morph-Feats   no Morph-Feats   # of tokens
sv_lines        72.18         74.81             48,325
fr_sequoia      84.36         82.17             50,543
en_gum          76.44         75.34             53,686
ko_gsd          73.74         72.54             56,687
eu_bdt          74.55         73.32             72,974
nl_lassysmall   76.7          75.8              75,134
gl_ctg          79.02         79.018            79,327
lv_lvtb         72.33         72.24             80,666
id_gsd          75.76         73.97             97,531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morph-feat experiments for languages having more than 100k training tokens:

Lang code   Morph-Feats   no Morph-Feats   # of tokens
fa_seraji   81.18         81.12              121,064
bg_btb      84.53         84.55              124,336
en_ewt      75.77         75.682             204,585
ar_padt     68.02         68.14              223,881
de_gsd      71.59         71.32              263,804
ca_ancora   85.89         85.874             417,587
es_ancora   84.99         84.78              444,617
cs_cac      83.57         83.63              472,608
cs_pdt      81.43         82.12            1,173,282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle: transitions follow the gold moves.
Dynamic oracle: transitions follow the predicted moves.

In both cases, the log-probability of the gold moves is maximized.

[Figure: t-RNN — the head word, dependent word, and dependency relation are encoded by LSTMs, concatenated, and fed to an MLP]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123
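The two training regimes can be sketched as one loop (a toy stand-in, not the thesis code: `ToyParser` and its entire interface are invented for illustration, with integer states and a fixed gold move):

```python
import math
import random

class ToyParser:
    """Hypothetical stand-in for a transition system: states are
    integers 0..n, the gold move is always +1, and the model assigns
    fixed probabilities to moves."""
    def initial(self): return 0
    def is_final(self, s, n): return s >= n
    def gold_move(self, s): return +1
    def predict(self, s): return random.choice([+1, -1])  # model's own (noisy) choice
    def logp(self, s, move): return math.log(0.8 if move == +1 else 0.2)
    def apply(self, s, move): return max(0, s + move)

def train_sentence(parser, n, oracle="static", explore=0.5):
    """Accumulate -log p(gold move) over one 'sentence' of length n.
    Static oracle: always follow the gold move. Dynamic oracle:
    sometimes follow the model's own prediction, so training also
    visits states a static oracle never reaches. In both cases the
    loss maximizes log p of the gold moves."""
    loss, s = 0.0, parser.initial()
    while not parser.is_final(s, n):
        gold = parser.gold_move(s)
        loss -= parser.logp(s, gold)
        if oracle == "static":
            move = gold
        else:
            move = parser.predict(s) if random.random() < explore else gold
        s = parser.apply(s, move)
    return loss

random.seed(0)
static_loss = train_sentence(ToyParser(), 5, oracle="static")   # exactly 5 * -log(0.8)
dynamic_loss = train_sentence(ToyParser(), 5, oracle="dynamic")
```

Under the dynamic oracle the parser is also trained from states produced by its own mistakes; in this toy every state has the same gold move, so only the trajectory length differs.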

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with fewer than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with between 20k and 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure: Results are very close for languages with more than 50k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

What about languages with less than 20k training tokens?

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning:

1. Using very limited data to train an LM for word and context vectors, and using them to train a parser from scratch
2. Using Facebook's word vectors to train a parser [Bojanowski et al., 2017]
3. Using my own word and context vectors, trained on a different language from the same language family
4. Applying transfer learning with a pre-trained parser

Language       (1)            (2)     (3)     (4)
af_afribooms   not provided   75.46   77.43   78.12
kk_ktb         20.19          22.31   21.96   23.86
bxr_bdt         7.64           9.76    9.93    8.98
kmr_mg         20.12          22.57   22.78   23.39

Table: LAS values for strategies (1), (2), (3), and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123
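Strategy (4) can be sketched as shape-matched parameter copying (hypothetical parameter dictionaries and names, not the thesis code): parameters shared with the pre-trained parser are transferred, while language-specific ones, such as a differently sized vocabulary embedding, stay freshly initialized:

```python
import numpy as np

def init_from_pretrained(target, pretrained):
    """Copy every pre-trained parser parameter whose shape matches
    into the new model's parameter dict; leave the rest (e.g. the new
    language's vocabulary-sized embeddings) as initialized."""
    copied = []
    for name, w in target.items():
        if name in pretrained and pretrained[name].shape == w.shape:
            target[name] = pretrained[name].copy()
            copied.append(name)
    return copied

rng = np.random.default_rng(0)
pretrained = {"lstm.W": rng.normal(size=(4, 4)), "embed": rng.normal(size=(100, 8))}
target = {"lstm.W": np.zeros((4, 4)), "embed": np.zeros((50, 8))}  # smaller vocab
copied = init_from_pretrained(target, pretrained)
# only the shape-compatible LSTM weights are transferred
```

Fine-tuning then continues on the low-resource treebank from this warm start instead of from scratch.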

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on very limited data does not yield useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al., 2017].

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

A transition-based parser can only build projective trees. 6

6: Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123
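Projectivity can be checked directly from a sentence's head array: a tree is projective iff no two dependency arcs cross. A minimal sketch (the 1-indexed head-array input format is an assumption):

```python
def is_projective(heads):
    """True iff the dependency tree has no crossing arcs.
    `heads` is indexed by position: heads[i-1] is the head of
    word i, and 0 denotes the artificial root."""
    arcs = [tuple(sorted((h, d))) for d, h in enumerate(heads, start=1)]
    for a, b in arcs:
        for c, e in arcs:
            # two arcs cross iff exactly one endpoint of one arc
            # lies strictly inside the span of the other
            if a < c < b < e:
                return False
    return True

# 1 -> 2, 2 -> root, 3 -> 4, 4 -> 2: nested arcs, projective
assert is_projective([2, 0, 4, 2])
# arc 2 -> 4 crosses arc 3 -> 1: non-projective
assert not is_projective([0, 4, 1, 1])
```

A transition-based parser of the kind used here can only derive trees for which this check returns True, which is why the gap to the best systems narrows as the projectivity ratio approaches 100%.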

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language      Projectivity   Best (LAS)   Our (LAS)
grc_perseus   90.7           79.39        55.03 (20)
eu_bdt        95.13          84.22        74.13 (17)
hu_szeged     97.8           82.66        68.18 (14)
da_ddt        98.26          86.28        76.40 (17)
en_gum        99.6           85.05        76.44 (15)
gl_treegal    100            74.25        70.45 (10)
gl_ctg        100            82.12        79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7: From the official results page and our projectivity table.
Omer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusion:
We introduced "Context", "Word", and "Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the tree-stack LSTM loses its advantage.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between the σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.

Morphological Features

Finding a different way to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function; other losses (e.g., CRF) may solve this problem.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 673-682. Association for Computational Linguistics.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5:135-146.

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Contribution of Morph-feat Embeddings

Experimental SettingsWe divide Conll18 UD dataset 22 into 4 parts based on number oftraining tokens for each language to better understand our contributions

Languages having less than 20k tokens

Languages having more than 20k less than 50k tokens

Languages having more than 50k less than 100k tokens

Languages having 100k tokens or more

Omer Kırnap (Koc University) MSc Thesis September 27 2018 104 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having less than 20k training tokens

Lang code Morph-Feats no Morph-Feats of tokensno nynorsklia 5113 5333 3583

ru taiga 5832 6055 10479

sme giella 5278 5339 16385

la perseus 4993 516 18184

ug udt 5278 5339 19262

sl sst 4672 4877 19473

hu szeged 6623 6818 20166

Not useful for languages having less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 105 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having tokens in between 50k and100k

Lang code Morph-Feats no Morph-Feats of tokenssv lines 7218 7481 48325

fr sequoia 8436 8217 50543

en gum 7644 7534 53686

ko gsd 7374 7254 56687

eu bdt 7455 7332 72974

nl lassymal 767 758 75134

gl ctg 7902 79018 79327

lv lvtb 7233 7224 80666

id gsd 7576 7397 97531

Beneficial for languages with 50k-100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 106 123

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Contribution of Morph-feat embeddings

Morp-feat experiments for languages having more than 100k trainingtokens

Lang code Morph-Feats no Morph-Feats of tokensfa seraji 8118 8112 121064

bg btb 8453 8455 124336

en ewt 7577 75682 204585

ar padt 6802 6814 223881

de gsd 7159 7132 263804

ca ancora 8589 85874 417587

es ancora 8499 8478 444617

cs cac 8357 8363 472608

cs pdt 8143 8212 1173282

Neutral for languages having more than 100k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 107 123

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

[6] Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

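The projectivity constraint can be checked by testing whether any two arcs cross when drawn above the sentence. A minimal sketch, assuming heads are given in the usual CoNLL convention (token indices 1-based, 0 = ROOT):

```python
def is_projective(heads):
    """heads[i-1] is the head of token i (tokens 1-based, 0 is ROOT).

    A dependency tree is projective iff no two arcs cross when all
    arcs are drawn above the sentence.
    """
    # Each arc is represented by its span (left endpoint, right endpoint).
    spans = [(min(d, h), max(d, h)) for d, h in enumerate(heads, start=1)]
    for i, (a1, b1) in enumerate(spans):
        for a2, b2 in spans[i + 1:]:
            # Two arcs cross iff exactly one endpoint of one arc lies
            # strictly inside the span of the other.
            if a1 < a2 < b1 < b2 or a2 < a1 < b2 < b1:
                return False
    return True
```

Arcs that share an endpoint never "cross" under the strict inequalities, so attachment to a common head is handled correctly.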

Projective vs Non-projective

We compared our model with the best model at different projectivity ratios.

Language      Projectivity (%)  Best (LAS)  Our (LAS)
grc_perseus    90.7             79.39       55.03 (20)
eu_bdt         95.13            84.22       74.13 (17)
hu_szeged      97.8             82.66       68.18 (14)
da_ddt         98.26            86.28       76.40 (17)
en_gum         99.6             85.05       76.44 (15)
gl_treegal    100               74.25       70.45 (10)
gl_ctg        100               82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. [7]

[7] From the official results page and our projectivity table.

Conclusions


Conclusion

In conclusion: We introduced "Context", "Word", and "Morph-feat" embeddings and showed their contribution to transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the Tree-stack LSTM loses its advantage.


Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging, and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring performance improvements.

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. CRF-based) may solve this problem.


Publications

Omer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder, and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.


References

Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

S. Kübler, R. McDonald, and J. Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews, and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.


Thank you for your attention


Questions


  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Static vs Dynamic Oracle Training

Static oracle transitions using gold movesDynamic oracle transitions using predicted moves

In both cases logp of gold moves maximized

t-RNN

Head word

Dependent word Dependency Relation

LSTM LSTM LSTM LSTM LSTM

LSTM LSTM A

Concat

MLP

Omer Kırnap (Koc University) MSc Thesis September 27 2018 108 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens less than 20k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 109 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens in between 20k and 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 110 123

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Static vs Dynamic Oracle Training

Figure Results are very close for training tokens more than 50k

Omer Kırnap (Koc University) MSc Thesis September 27 2018 111 123

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

How about languages with less than 20k training tokens

Omer Kırnap (Koc University) MSc Thesis September 27 2018 112 123

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the mostbeneficial

From scratch LM training does not bring useful word and contextvectors

Our word and context vectors are still more useful than Facebookrsquos[Bojanowski et al 2017]

Omer Kırnap (Koc University) MSc Thesis September 27 2018 114 123

Projectivity

Transition Based Parser can only build projective trees 6

6Figure fromhttpstplingfiluuse sarakurser5LN455-2014lectures5LN455-F8pdf

Omer Kırnap (Koc University) MSc Thesis September 27 2018 115 123

Projective vs Non-projective

We compared our model with the best model for different projectivityratios

Language Projectiviy Best (LAS) Our (LAS)

grc perseus 907 7939 5503 (20)

eu bdt 9513 8422 7413 (17)

hu szeged 978 8266 6818 (14)

da ddt 9826 8628 7640 (17)

en gum 996 8505 7644 (15)

gl treegal 100 7425 7045 (10)

gl ctg 100 8212 7945 (14)

Table Our models performance gap decreases as the projectivity ratio increases

7

7From official results page and our projectivity tableOmer Kırnap (Koc University) MSc Thesis September 27 2018 116 123

Conclusions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 117 123

Conclusion

In conclusionWe introduced ldquoContext Word and Morph-featrdquo embeddings and showedtheir contribution in transition based dependency parsing

Our Tree-stack LSTM outperformed MLP by removing hand-craftedfeature engineering

Tree-stack LSTM performed better with low resource languages

When the training dataset size increases tree-stack LSTM losses itsadvantage

Omer Kırnap (Koc University) MSc Thesis September 27 2018 118 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization morphological taggingand dependency parsing performed better Some are also jointly trained alanguage model together with pre-trained embeddings

Attention Mechanism

Applying attention in between σ-LSTM states or β-LSTM orAction-LSTM may bring performance improvement

Morphological Features

Finding different way to represent morphological features

Dynamic Oracle vs Beam Training

Although I tried both of them I could not obtain performanceimprovement There may be convergence problems with our loss functionand another losses (CRF) may solve this problem

Omer Kırnap (Koc University) MSc Thesis September 27 2018 119 123

Publications

Omer Kırnap Erenay Dayanık and Deniz Yuret 2018 Tree-stackLSTM in Transition Based Dependency Parsing In Proceedings ofthe CoNLL 2018 Shared Task Multilingual Parsing from Raw Text toUniversal Dependencies

Omer Kırnap Berkay Furkan Onder and Deniz Yuret 2017 Parsingwith Context Embeddings In Proceedings of the CoNLL 2017 SharedTask Multilingual Parsing from Raw Text to Universal Dependencies

Omer Kırnap (Koc University) MSc Thesis September 27 2018 120 123

References

Marco Kuhlmann Carlos Gomez-Rodriguez and Giorgio Satta 2011Dynamic programming algorithms for transition-based dependencyparsers In Proceedings of the 49th Annual Meeting of theAssociation for Computational Linguistics Human LanguageTechnologies-Volume 1 Association for Computational Linguisticspages 673682

S Kbler R McDonald and J Nivre 2009 Dependency parsingMorgan amp Claypool US

Chris Dyer Miguel Ballesteros Wang Ling Austin Matthews andNoah A Smith 2015 Transition based dependency parsing withstack long-short term memory CoRR abs150508075

Omer Kırnap (Koc University) MSc Thesis September 27 2018 121 123

Thank you for your attention

Omer Kırnap (Koc University) MSc Thesis September 27 2018 122 123

Questions

Omer Kırnap (Koc University) MSc Thesis September 27 2018 123 123

  • Introduction
    • Overview of Dependency Parsing
    • Transition Based Dependency Parsing
      • Related Work
        • Linear Models and their Drawbacks
        • Neural Network Models
          • Model
            • Language Model
            • MLP Parser
            • Tree-stack LSTM Parser
              • Results
                • MLP vs Tree-stack LSTM
                • Morphological Feature Embeddings
                • Static vs Dynamic Oracle Training
                • Transfer Learning
                  • Conclusion
                  • Future Work amp Discussions

Transfer Learning

There are 4 possible types of transfer learning1 Using very limited data to train LM for word and context vectors and

use them to train a parser from scratch2 Using Facebookrsquos word vectors to train a parser [Bojanowski et al

2017]3 Using my own word and context vectors trained with different

language but from the same language family4 Applying transfer learning with a pre-trained parser

Language (1) (2) (3) (4)af afribooms not provided 7546 7743 7812kk ktb 2019 2231 2196 2386bxr bdt 764 976 993 898

kmr mg 2012 2257 2278 2339

Table LAS values for strategies (1) (2) (3) and (4)

Omer Kırnap (Koc University) MSc Thesis September 27 2018 113 123

Transfer Learning

Conclusions of Transfer Learning Experiments

Applying transfer learning with a pre-trained parser is the most beneficial.

Training an LM from scratch on limited data does not produce useful word and context vectors.

Our word and context vectors are still more useful than Facebook's [Bojanowski et al. 2017].

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 114 / 123

Projectivity

Transition-based parsers can only build projective trees. 6

6 Figure from https://stp.lingfil.uu.se/~sara/kurser/5LN455-2014/lectures/5LN455-F8.pdf

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 115 / 123
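Projectivity can be checked directly from the head indices: an arc (h, d) is projective iff every word strictly between h and d is a descendant of h, which rules out crossing arcs. A short sketch (function name hypothetical; `heads[i]` is the head of word i+1, with 0 for the root, and is assumed to encode a valid tree):

```python
def is_projective(heads):
    """Return True iff the dependency tree given by `heads` is projective.
    For each arc (h, d), every word strictly between h and d must reach
    h on its path to the root."""
    n = len(heads)
    for d in range(1, n + 1):
        h = heads[d - 1]
        lo, hi = min(h, d), max(h, d)
        for k in range(lo + 1, hi):
            a = k
            # climb from k toward the root; we must pass through h
            while a != 0 and a != h:
                a = heads[a - 1]
            if a != h:
                return False
    return True

# heads = [2, 0, 4, 2]: nested arcs, projective.
# heads = [3, 4, 0, 3]: arcs (3,1) and (4,2) cross, non-projective.
```

A transition-based parser with only shift/left-arc/right-arc actions can produce exactly the trees for which this check returns True; non-projective treebanks need pseudo-projective transformations or extra transitions such as swap.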

Projective vs Non-projective

We compared our model with the best model for different projectivity ratios.

Language       Projectivity  Best (LAS)  Our (LAS)
grc perseus    90.7          79.39       55.03 (20)
eu bdt         95.13         84.22       74.13 (17)
hu szeged      97.8          82.66       68.18 (14)
da ddt         98.26         86.28       76.40 (17)
en gum         99.6          85.05       76.44 (15)
gl treegal     100           74.25       70.45 (10)
gl ctg         100           82.12       79.45 (14)

Table: Our model's performance gap decreases as the projectivity ratio increases. 7

7 From the official results page and our projectivity table.

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 116 / 123

Conclusions

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 117 / 123

Conclusion

In conclusion:

We introduced "Context, Word and Morph-feat" embeddings and showed their contribution in transition-based dependency parsing.

Our Tree-stack LSTM outperformed the MLP while removing hand-crafted feature engineering.

The Tree-stack LSTM performed better on low-resource languages.

As the training dataset size increases, the Tree-stack LSTM loses its advantage.

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 118 / 123

Future Research Direction

End-to-End Training

Systems that are jointly trained for tokenization, morphological tagging and dependency parsing performed better. Some also jointly train a language model together with pre-trained embeddings.

Attention Mechanism

Applying attention between σ-LSTM, β-LSTM, or Action-LSTM states may bring a performance improvement.
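The core operation such an extension would add is standard dot-product attention: score each LSTM hidden state against a query, normalize with softmax, and take the weighted sum. A minimal NumPy sketch, not the thesis architecture (the query source and state matrix are hypothetical stand-ins):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())  # subtract max for numerical stability
    return e / e.sum()

def attend(query, states):
    """Dot-product attention over a (T, d) matrix of hidden states.
    Returns the context vector and the attention weights."""
    scores = states @ query        # (T,) similarity of each state to the query
    weights = softmax(scores)      # (T,) non-negative, sums to 1
    context = weights @ states     # (d,) weighted sum of states
    return context, weights

# Toy stand-ins: three sigma-LSTM states, buffer-side query.
states = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
query = np.array([1.0, 0.0])
context, weights = attend(query, states)
```

Here the context vector would be concatenated to the parser's feature representation before the action classifier; states aligned with the query receive higher weight.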

Morphological Features

Finding different ways to represent morphological features.

Dynamic Oracle vs Beam Training

Although I tried both of them, I could not obtain a performance improvement. There may be convergence problems with our loss function, and other losses (e.g. a CRF loss) may solve this problem.

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 119 / 123

Publications

Omer Kırnap, Erenay Dayanık and Deniz Yuret. 2018. Tree-stack LSTM in Transition Based Dependency Parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap, Berkay Furkan Onder and Deniz Yuret. 2017. Parsing with Context Embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies.

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 120 / 123

References

Marco Kuhlmann, Carlos Gómez-Rodríguez and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, pages 673-682.

Sandra Kübler, Ryan McDonald and Joakim Nivre. 2009. Dependency Parsing. Morgan & Claypool, US.

Chris Dyer, Miguel Ballesteros, Wang Ling, Austin Matthews and Noah A. Smith. 2015. Transition-based dependency parsing with stack long short-term memory. CoRR abs/1505.08075.

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 121 / 123

Thank you for your attention!

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 122 / 123

Questions?

Omer Kırnap (Koç University) MSc Thesis September 27, 2018 123 / 123
