Information Structure with the Prague Arabic Dependency Treebank
Otakar Smrž, Petr Zemánek, Jakub Kráčmar, Viktor Bielický
Institute of Formal and Applied Linguistics & Institute of Comparative Linguistics
Charles University in Prague
Conference on Communication and Information Structure in Spoken Arabic
CCISSA 2006 (University of Maryland) Information Structure with PADT June 10, 2006 1 / 13
Introduction
The issue of information structure in language has been studied extensively both in the Prague School of Linguistics and in the Functional Generative Description (FGD) [Sgall et al., 1986, Hajičová and Sgall, 2003].
This theory of representation of linguistic meaning is the framework for a family of multi-level annotation projects, esp. the Prague Dependency Treebank for Czech [Hajič et al., 2001, 2006] and the Prague Arabic Dependency Treebank [Hajič et al., 2004, Smrž et al., 2006].
Outline
1 Introduction
2 Definitions
-- dependency / information structure
-- communicative dynamism / valency
-- topic–focus / contextual boundness
3 Examples
-- systemic ordering
-- topicalizers
-- rhematizers
4 Annotation
5 Discussion
Definitions
Dependency Description
Representation of structure in language built on dependency exhibits
-- Immediate Dominance relation
-- Linear Precedence relation
bi-tawfīri ḍarūrīyāti ’l-ḥayāti ’l-ʾasāsīyati li-šaʿbihā wa-min baynihā ar-riʿāyatu ’ṭ-ṭibbīyatu wa-ḫāṣṣatan ar-riʿāyatu ’ṣ-ṣiḥḥīyatu ’l-ʾasāsīyatu
tirga tgul li umm l-bnayya
umm l-bnayya tirga tgul li
tirga umm l-bnayya tgul li
[Brustad, 2000, page 336]
Contrast dependency with constituency, i.e. phrase-structure syntax, where linear order and derivation are not independent, and which is thus less expressive (context-free vs. context-sensitive).
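The independence of the two relations can be sketched in Python. The head assignments below are a hypothetical analysis chosen only for illustration, not the treebank annotation; the word forms are the three orderings of the spoken Arabic example cited from Brustad. The sketch shows one dominance relation paired with three different precedence relations.

```python
# Hypothetical head assignments (dependent -> head), for illustration only;
# this is not the analysis from the treebank. None marks the root.
HEADS = {
    "tirga": None,
    "tgul": "tirga",
    "li": "tgul",
    "umm": "tirga",
    "l-bnayya": "umm",
}

# Three surface orders of the same words.
ORDERS = [
    "tirga tgul li umm l-bnayya".split(),
    "umm l-bnayya tirga tgul li".split(),
    "tirga umm l-bnayya tgul li".split(),
]


def dominance(heads):
    """Immediate dominance: the set of (head, dependent) pairs."""
    return {(head, dep) for dep, head in heads.items() if head is not None}


def precedence(order):
    """Linear precedence: the set of (earlier, later) word pairs."""
    return {(a, b) for i, a in enumerate(order) for b in order[i + 1:]}


if __name__ == "__main__":
    # The dominance relation is stated once and shared by all orders.
    print(sorted(dominance(HEADS)))
    # Each order yields its own precedence relation over the same words.
    for order in ORDERS:
        print(" ".join(order), "->", len(precedence(order)), "precedence pairs")
```

Because `dominance` never looks at the word order and `precedence` never looks at the heads, either relation can be changed without touching the other, which is the property the contrast with phrase-structure syntax relies on.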
Distinction of different points of view and levels of abstraction
-- Surface/Analytical syntax level
-- Deep/Tectogrammatical syntax level
Description includes recovery of tree topology and syntactic functions.
Information Structure
Information structure, the question of “the given” and “the new” in an utterance and how it is expressed, is considered to contribute to the linguistic meaning, and its annotation is thus of interest to us.
How is communicative dynamism expressed and delivered/packaged?
-- word order variation with respect to systemic ordering
-- intonation centers, sentence stress
-- extra constructs in syntax or morphology
ʾan tanḫafiḍa qīmatu . . .
fī ’n-niṣfi . . .
ʾilā 400 . . . min 593 . . .
bi-sababi . . .
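One way to make "word order variation with respect to systemic ordering" concrete is to list the pairs of dependents whose surface order contradicts an assumed canonical ranking. The Python sketch below does only that. The role labels and the ranking are hypothetical placeholders, not the systemic ordering established for Arabic or for any other language.

```python
from itertools import combinations

# Hypothetical canonical ranking of dependent types (lower = earlier).
# These labels and their order are placeholders for illustration only.
SYSTEMIC_RANK = {"actor": 0, "time": 1, "location": 2, "cause": 3, "patient": 4}


def inversions(surface_roles, rank=SYSTEMIC_RANK):
    """Return the pairs (earlier, later) whose surface order contradicts rank.

    combinations() keeps the input order, so the first member of each pair
    always precedes the second member in the surface string.
    """
    return [(a, b) for a, b in combinations(surface_roles, 2) if rank[a] > rank[b]]


if __name__ == "__main__":
    print(inversions(["actor", "time", "patient"]))
    print(inversions(["patient", "actor", "time"]))
```

The sketch only detects deviations. Deciding what a deviation means, for instance whether the fronted item is contextually bound, is left to the annotator or to a richer model.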
Topic / Focus
A dichotomy of aboutness that has inspired many similar concepts in modern linguistics [cf. Kruijff-Korbayová and Steedman, 2003, for an overview].
Topic (theme): the part of sentence structure linking the content of the utterance with the context of the discourse
Focus (rheme, comment): the part providing or modifying some information about the topic
Communicative dynamism serves as a measure of this linguistic property:
-- Topic proper is the least dynamic part
-- Focus proper is the most dynamic part
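Read this way, topic proper and focus proper are the two extremes of an ordering. A minimal Python sketch, assuming each item already carries a numeric degree of communicative dynamism; the numbers and the English placeholder items are invented for illustration, and how such degrees would be obtained is outside the sketch.

```python
def topic_and_focus_proper(cd_degree):
    """Return (topic proper, focus proper) from a mapping item -> degree.

    The least dynamic item is taken as topic proper and the most dynamic
    item as focus proper. Ties are broken by insertion order.
    """
    if not cd_degree:
        raise ValueError("at least one item is required")
    ordered = sorted(cd_degree, key=cd_degree.get)
    return ordered[0], ordered[-1]


if __name__ == "__main__":
    # Invented degrees for a placeholder clause.
    degrees = {"Mary": 0, "wrote": 1, "a letter": 2}
    print(topic_and_focus_proper(degrees))
```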
Contextual Boundness
The elementary distinction, from which the topic–focus dichotomy is derived, is contextual boundness.
Context-Bound: a lexical reference to an already explicitly mentioned entity, or to an entity implicitly evoked in the context of the discourse
Non-Bound: a lexical item that is not contextually bound, i.e. not available in the interlocutor’s mind as a reference
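As a rough sketch of how the dichotomy can be derived from boundness, the Python below splits the items of a clause into topic and focus using only a per-item boundness flag. This is a deliberate simplification under assumed names (`Item`, `bound`, `split_topic_focus`): a fuller derivation would operate on the dependency tree rather than on a flat list, and the English placeholder items are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Item:
    form: str
    bound: bool  # True = context-bound, False = non-bound


def split_topic_focus(items):
    """Flat approximation: bound items form the topic, the rest the focus."""
    topic = [item.form for item in items if item.bound]
    focus = [item.form for item in items if not item.bound]
    return topic, focus


if __name__ == "__main__":
    clause = [
        Item("the sheikh", bound=True),   # assumed to be mentioned earlier
        Item("returned", bound=False),
        Item("to Kuwait", bound=False),
    ]
    print(split_topic_focus(clause))
```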
The question test is used to identify the context-bound and non-bound items.
The relation between definiteness and boundness is not trivial [Kruijff-Korbayová, 1998, Brustad, 2000]. Boundness is not equated with the given/new concept.
Examples
ʿalā ʿawdati ’š-šayḫi ʾilā ’l-kuwayti min landn
yatimmu ’l-qabḍu ʿalayhim min ḥīnin ʾilā ḥīnin
bi-maʿnan ʾāḫara ʾinna dawranā ’l-ʾasāsīya huwa ’l-ʿarḍu wa-laysa ’l-intiqādu
fīmā yalī ’l-ḥaqāʾiqu wa-’l-ʾarqāmu li-wāridāti ’ṣ-ṣīni ’r-raʾīsīyati
allatī kānat taqtaṣiru fī ’s-sābiqi ʿalā ’l-ḥizbi ’l-ḥākimi faqaṭ
allaḏī yasʿā faqaṭ ʾilā waqfi ’l-ʿunfi
Annotation
The Annotation
Discussion
Relevance Issue
Topic–focus articulation is relevant even semantically, since it affects the truth value of a proposition:
-- presupposition, allegation, meaning proper
-- scope of quantifiers, scope of negation
The applicability of the general approach to written as well as spoken Arabic becomes the main point of our account. In FGD, the description of information structure is also related to the notions of intonation center and stress, contrast, subjective word order, potential ellipsis, prosody, . . .
Annotated corpora for written and spoken Arabic are becoming available for quantitative evaluation of linguistic theories, large-scale analysis of linguistic material, computational processing and modeling.
Conclusion / Prospects
In PADT, which now consists of the morphological and the analytical levels of description of Arabic, the annotation of information structure and tectogrammatics is being established.
In our contribution, we have tried to give an overview of the theoretical concepts we work with, and to present our formal treatment of a number of corpus-based instances of linguistic phenomena that have a principal impact on the structure of information in Arabic.
Rich linguistic literature and interesting computational systems are available [cf. e.g. Hajičová et al., 1995, Kruijff-Korbayová, 1998, Hajičová and Sgall, 2004, Debusmann et al., 2005, Mikulová et al., 2006].
References
Kristen E. Brustad. The Syntax of Spoken Arabic: A Comparative Study of Moroccan, Egyptian, Syrian, and Kuwaiti Dialects. Georgetown University Press, 2000.
Ralph Debusmann, Oana Postolache, and Maarika Traat. A Modular Account of Information Structure in Extensible Dependency Grammar. In Proceedings of the CICLING 2005 Conference, 2005.
Jan Hajič, Eva Hajičová, Petr Pajas, Jarmila Panevová, Petr Sgall, and Barbora Vidová-Hladká. Prague Dependency Treebank 1.0. LDC catalog number LDC2001T10, ISBN 1-58563-212-0, 2001.
Jan Hajič, Eva Hajičová, Petr Pajas, Jarmila Panevová, Petr Sgall, and Barbora Vidová-Hladká. Prague Dependency Treebank 2.0. LDC catalog number LDC2006T??, ISBN 1-58563-???-?, 2006.
Jan Hajič, Otakar Smrž, Petr Zemánek, Petr Pajas, Jan Šnaidauf, Emanuel Beška, Jakub Kráčmar, and Kamila Hassanová. Prague Arabic Dependency Treebank 1.0. LDC catalog number LDC2004T23, ISBN 1-58563-319-4, 2004.
Eva Hajičová and Petr Sgall. Dependency Syntax in Functional Generative Description. In Dependenz und Valenz – Dependency and Valency, volume I, pages 570–592. Walter de Gruyter, 2003.
Eva Hajičová and Petr Sgall. Degrees of Contrast and the Topic–Focus Articulation. volume 1, pages 1–13. Walter de Gruyter, Berlin, 2004.
Eva Hajičová, Petr Sgall, and Hana Skoumalová. An Automatic Procedure for Topic–Focus Identification. Computational Linguistics, 21(1):81–94, 1995.
Ivana Kruijff-Korbayová. The Dynamic Potential of Topic and Focus: A Praguian Approach to Discourse Representation Theory. PhD thesis, Charles University in Prague, 1998.
Ivana Kruijff-Korbayová and Mark Steedman. Discourse and Information Structure. Journal of Logic, Language and Information, 12(3), 2003.
Marie Mikulová et al. A Manual for Tectogrammatical Layer Annotation of the Prague Dependency Treebank. Technical report, Charles University in Prague, 2006.
Petr Sgall, Eva Hajičová, and Jarmila Panevová. The Meaning of the Sentence in Its Semantic and Pragmatic Aspects. D. Reidel & Academia, 1986.
Otakar Smrž, Petr Pajas, Zdeněk Žabokrtský, Jan Hajič, Jiří Mírovský, and Petr Němec. Learning to Use the Prague Arabic Dependency Treebank. In Perspectives on Arabic Linguistics, volume XIX. John Benjamins, 2006.