Post on 17-Oct-2020

Information Structure with the Prague Arabic Dependency Treebank

Otakar Smrz Petr Zemanek Jakub Kracmar Viktor Bielicky

Institute of Formal and Applied Linguistics & Institute of Comparative Linguistics

Charles University in Prague

Conference on Communication and Information Structure in Spoken Arabic

CCISSA 2006 (University of Maryland) Information Structure with PADT June 10, 2006 1 / 13

Introduction

The issue of information structure in language has been studied extensively both in the Prague School of Linguistics and in the Functional Generative Description [Sgall et al., 1986, Hajicova and Sgall, 2003].

This theory of representation of linguistic meaning is the framework for a family of multi-level annotation projects, esp. the Prague Dependency Treebank for Czech [Hajic et al., 2001, 2006] and the Prague Arabic Dependency Treebank [Hajic et al., 2004, Smrz et al., 2006].

Outline

1 Introduction

2 Definitions

-- dependency / information structure

-- communicative dynamism / valency

-- topic–focus / contextual boundness

3 Examples

-- systemic ordering

-- topicalizers

-- rhematizers

4 Annotation

5 Discussion

Definitions

Dependency Description

Representation of structure in language built on dependency exhibits

-- Immediate Dominance relation

-- Linear Precedence relation

bi-tawfīri ḍarūrīyāti ’l-ḥayāti ’l-ʾasāsīyati li-šaʿbihā wa-min baynihā ar-riʿāyatu ’ṭ-ṭibbīyatu wa-ḫāṣṣatan ar-riʿāyatu ’ṣ-ṣiḥḥīyatu ’l-ʾasāsīyatu

tirgaʿ tgūl lī umm l-bnayya

umm l-bnayya tirgaʿ tgūl lī

tirgaʿ umm l-bnayya tgūl lī

[one further example in Arabic script, not recoverable from the extraction]

[Brustad, 2000, page 336]

Contrast dependency with constituency, i.e. phrase-structure syntax, where linear order and derivation are not independent, and which is thus less expressive (context-free vs. context-sensitive).

Distinction of different points of view and levels of abstraction

-- Surface/Analytical syntax level

-- Deep/Tectogrammatical syntax level

Description includes recovery of tree topology and syntactic functions.
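The independence of the two relations can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Node` type, the helper functions, and the placeholder labels (`predicate`, `subject`, `complement`) are hypothetical and do not reproduce the PADT data format or any actual annotation. The sketch stores the governor of each node (immediate dominance) separately from its surface position (linear precedence), so that word order can vary while the tree stays the same.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Node:
    form: str    # placeholder label, not a real treebank token
    parent: str  # form of the governing node, "" for the root
    order: int   # surface position of the node


def dominance(nodes):
    """Immediate dominance: the set of (governor, dependent) pairs."""
    return {(n.parent, n.form) for n in nodes if n.parent}


def surface(nodes):
    """Linear precedence: forms sorted by their surface position."""
    return [n.form for n in sorted(nodes, key=lambda n: n.order)]


def reorder(nodes, forms):
    """Return the same tree with a new word order given as a list of forms."""
    position = {form: i for i, form in enumerate(forms)}
    return [replace(n, order=position[n.form]) for n in nodes]


# A toy tree: one governor with two dependents.
tree = [
    Node("predicate", "", 0),
    Node("subject", "predicate", 1),
    Node("complement", "predicate", 2),
]

# A different surface order over the very same dominance relation.
fronted = reorder(tree, ["subject", "predicate", "complement"])
```

Here `dominance(tree)` and `dominance(fronted)` are equal while `surface(tree)` and `surface(fronted)` differ, which is the property a dependency description relies on when word order carries information structure.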

Information Structure

Information structure, the question of “the given” and “the new” in an utterance and how it is expressed, is considered to contribute to the linguistic meaning, and its annotation is thus in our interest.

How is communicative dynamism expressed and delivered/packaged?

-- word order variation with respect to systemic ordering

-- intonation centers, sentence stress

-- extra constructs in syntax or morphology

ʾan tanḫafiḍa qīmatu . . .

fī ’n-niṣfi . . .

ʾilā 400 . . . min 593 . . .

bi-sababi . . .

Topic / Focus

A dichotomy of aboutness that has inspired many similar concepts in modern linguistics [cf. Kruijff-Korbayova and Steedman, 2003, for an overview].

Topic (theme): the part of sentence structure linking the content of the utterance with the context of the discourse

Focus (rheme, comment): the part providing or modifying some information about the topic

Communicative dynamism as a measure of this linguistic property

-- Topic proper is the least dynamic part

-- Focus proper is the most dynamic part
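As a rough illustration of these two points, assume each item of an utterance carries a numeric degree of communicative dynamism. The `Item` type, the integer ranks, and the placeholder item names below are hypothetical and are not part of FGD or of the PADT annotation scheme. Under that assumption, the topic proper is the item with the lowest degree and the focus proper the item with the highest.

```python
from typing import NamedTuple


class Item(NamedTuple):
    form: str      # placeholder label for an item of the utterance
    dynamism: int  # hypothetical rank: a lower value means less dynamic


def topic_proper(items):
    """The least dynamic item of the utterance."""
    return min(items, key=lambda item: item.dynamism)


def focus_proper(items):
    """The most dynamic item of the utterance."""
    return max(items, key=lambda item: item.dynamism)


# Toy utterance with made-up ranks; surface order is deliberately not
# the same as the order of dynamism.
utterance = [
    Item("item_b", 2),
    Item("item_a", 1),
    Item("item_c", 3),
]
```

With these toy ranks, `topic_proper(utterance)` picks `item_a` and `focus_proper(utterance)` picks `item_c`, regardless of where the items stand in the list.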

Contextual Boundness

The elementary distinction, from which the topic–focus dichotomy is derived, is contextual boundness.

Context-Bound: a lexical reference to an already explicitly mentioned entity, or to an entity implicitly evoked in the context of the discourse

Non-Bound: a lexical item that is not contextually bound, i.e. not available in the interlocutor’s mind as a reference

The question test is used to identify the context-bound and non-bound items.

The relation of definiteness and boundness is not trivial [Kruijff-Korbayova, 1998, Brustad, 2000]. Boundness is not equated with the given/new concept.
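A deliberately simplified sketch of the question test, in Python, may help to fix the idea. Everything in it is an assumption made for illustration: the function names, the English toy sentence, and above all the rule that an item counts as context-bound exactly when it already occurs in the question. The actual derivation of topic and focus in FGD operates on the dependency structure and is more subtle than a flat partition.

```python
def mark_boundness(answer, question):
    """Pair each item of the answer with True if the question already
    mentions it (treated here as context-bound), otherwise False."""
    mentioned = set(question)
    return [(item, item in mentioned) for item in answer]


def split_topic_focus(marked):
    """Flat partition: context-bound items go to the topic part,
    non-bound items to the focus part."""
    topic = [item for item, bound in marked if bound]
    focus = [item for item, bound in marked if not bound]
    return topic, focus


# Hypothetical toy data: the question sets up the context for the answer.
question = ["what", "did", "Mary", "buy"]
answer = ["Mary", "buy", "a book"]

marked = mark_boundness(answer, question)
topic, focus = split_topic_focus(marked)
```

In this toy case `topic` holds the items repeated from the question and `focus` holds the one item that answers it. The sketch says nothing about definiteness, which, as noted above, does not map onto boundness in a trivial way.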

Examples

ʿalā ʿawdati ’š-šayḫi ʾilā ’l-kuwayti min landn

yatimmu ’l-qabḍu ʿalayhim min ḥīnin ʾilā ḥīnin

bi-maʿnan ʾāḫara ʾinna dawranā ’l-ʾasāsīya huwa ’l-ʿarḍu wa-laysa ’l-intiqādu

fīmā yalī ’l-ḥaqāʾiqu wa-’l-ʾarqāmu li-wāridāti ’ṣ-ṣīni ’r-raʾīsīyati

allatī kānat taqtaṣiru fī ’s-sābiqi ʿalā ’l-ḥizbi ’l-ḥākimi faqaṭ

allaḏī yasʿā faqaṭ ʾilā waqfi ’l-ʿunfi

Annotation

The Annotation

Discussion

Relevance Issue

Topic–focus articulation is relevant even semantically, since it affects the truth value of a proposition:

-- presupposition, allegation, meaning proper

-- scope of quantifiers, scope of negation

The applicability of the general approach to written as well as spoken Arabic becomes the main point of our account. In FGD, the description of information structure is also related to the notions of intonation center and stress, contrast, subjective word order, potential ellipsis, prosody, . . .

Annotated corpora for written and spoken Arabic are becoming available for quantitative evaluation of linguistic theories, large-scale analysis of linguistic material, computational processing and modeling.

Conclusion / Prospects

In PADT, which now consists of the morphological and the analytical levels of description of Arabic, the annotation of information structure and tectogrammatics is being established.

In our contribution, we have tried to give an overview of the theoretical concepts we work with, and to present our formal treatment of a number of corpus-based instances of linguistic phenomena that have a principal impact on the structure of information in Arabic.

Rich linguistic literature and interesting computational systems are available [cf. e.g. Hajicova et al., 1995, Kruijff-Korbayova, 1998, Hajicova and Sgall, 2004, Debusmann et al., 2005, Mikulova et al., 2006].

References

Kristen E. Brustad. The Syntax of Spoken Arabic: A Comparative Study of Moroccan, Egyptian, Syrian, and Kuwaiti Dialects. Georgetown University Press, 2000.

Ralph Debusmann, Oana Postolache, and Maarika Traat. A Modular Account of Information Structure in Extensible Dependency Grammar. In Proceedings of the CICLING 2005 Conference, 2005.

Jan Hajic, Eva Hajicova, Petr Pajas, Jarmila Panevova, Petr Sgall, and Barbora Vidova-Hladka. Prague Dependency Treebank 1.0. LDC catalog number LDC2001T10, ISBN 1-58563-212-0, 2001.

Jan Hajic, Eva Hajicova, Petr Pajas, Jarmila Panevova, Petr Sgall, and Barbora Vidova-Hladka. Prague Dependency Treebank 2.0. LDC catalog number LDC2006T??, ISBN 1-58563-???-?, 2006.

Jan Hajic, Otakar Smrz, Petr Zemanek, Petr Pajas, Jan Snaidauf, Emanuel Beska, Jakub Kracmar, and Kamila Hassanova. Prague Arabic Dependency Treebank 1.0. LDC catalog number LDC2004T23, ISBN 1-58563-319-4, 2004.

Eva Hajicova and Petr Sgall. Dependency Syntax in Functional Generative Description. In Dependenz und Valenz – Dependency and Valency, volume I, pages 570–592. Walter de Gruyter, 2003.

Eva Hajicova and Petr Sgall. Degrees of Contrast and the Topic–Focus Articulation. volume 1, pages 1–13. Walter de Gruyter, Berlin, 2004.

Eva Hajicova, Petr Sgall, and Hana Skoumalova. An Automatic Procedure for Topic–Focus Identification. Computational Linguistics, 21(1):81–94, 1995.

Ivana Kruijff-Korbayova. The Dynamic Potential of Topic and Focus: A Praguian Approach to Discourse Representation Theory. PhD thesis, Charles University in Prague, 1998.

Ivana Kruijff-Korbayova and Mark Steedman. Discourse and Information Structure. Journal of Logic, Language and Information, 12(3), 2003.

Marie Mikulova et al. A Manual for Tectogrammatical Layer Annotation of the Prague Dependency Treebank. Technical report, Charles University in Prague, 2006.

Petr Sgall, Eva Hajicova, and Jarmila Panevova. The Meaning of the Sentence in Its Semantic and Pragmatic Aspects. D. Reidel & Academia, 1986.

Otakar Smrz, Petr Pajas, Zdenek Zabokrtsky, Jan Hajic, Jiri Mirovsky, and Petr Nemec. Learning to Use the Prague Arabic Dependency Treebank. In Perspectives on Arabic Linguistics, volume XIX. John Benjamins, 2006.
