kakia chatsiou greekgram: building a parallel grammar for modern greek 05.10.2007lac day 20071...

16
u GreekGram: Building a parallel grammar fo 05.10.2007 LAC day 2007 1 GreekGram Building a parallel grammar for Modern Greek Kakia Chatsiou [email protected] Dept of Language and Linguistics University of Essex Parts of this project have been funded with an ESRC Award PTA-2004-031-00112 , support which is gratefully acknowledged. Parts of this presentation have been presented in the LangUE 2007 International Postgraduate Conference, University of Essex.

Upload: charleen-floyd

Post on 18-Dec-2015

225 views

Category:

Documents


4 download

TRANSCRIPT

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 1

GreekGramBuilding a parallel grammar for Modern

Greek

Kakia [email protected]

Dept of Language and LinguisticsUniversity of Essex

Parts of this project have been funded with an ESRC Award PTA-2004-031-00112 , support which is gratefully acknowledged.

Parts of this presentation have been presented in the LangUE 2007 International Postgraduate Conference, University of Essex.

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 2

Outline

• Introduction – Overview of the ParGram Project– Project Objectives– Participating Members– The XLE Platform– Grammar Architecture

• The GreekGram Project– Overview– Assumptions– Coverage– Demonstration– Future Development Directions

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 3

Outline

• Introduction – Overview of the ParGram Project– Project Objectives– Participating Members– The XLE Platform– Grammar Architecture

• The GreekGram Project– Overview– Assumptions– Coverage– Demonstration– Future Development Directions

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 4

An Overview of the ParGram Project

Project Objectives

• Broad coverage grammars– Inclusion of important and frequently occurring constructions– Linguistically motivated analyses

• Parallel and crosslinguistic development of grammars between the participating members– All grammars are guided by a common set of linguistic principles

and a commonly agreed upon set of grammatical analyses and features

– Identical treatment of core crosslinguistic phenomena

• Methods in grammar engineering– Common test methods and evaluation strategies– Balance between efficiency, performance, reliability and

maintainability across grammars

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 5

Participating Members

An Overview of the ParGram Project

PARC, CAChinese, English,

French

EssexGreek, Welsh

ManchesterArabic

OxfordMalagasy

DCU, IrelandChinese, English, French, German,

Japanese, Spanish

Fuji XEROXJapanese

Bergen, NorwayGeorgian, Norwegian, Tigrinya

Ho Chi MinchVietnamese

DebrecenHungarian

IMS, StuttgartGerman

KonstanzUrdu

Sabanci, IstanbulTurkish

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 6

An Overview of the ParGram Project

The Xerox Linguistics Environment (XLE) Platform

• Under current development at PARC (Palo Alto Research Center, USA)

• An implementation of the Lexical Functional Grammar (LFG) Formalism

• Implemented in C; works in Unix, Linux, MacOS. MS Windows version under development.

• Integrates a morphological analyser employing Finite State Technology

• Can be used for both parsing and generation• Includes tools for various grammar development activities (such

as analysing performance, test-suites)• The core technology used in the consumer search engine based

on natural language processing which is currently under development by

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

An Overview of the ParGram Project

• a Silicon Valley Company, currently building a transformative consumer search enginesearch engine based on natural language processing:– It is based on technologies that take advantage of the structure

and nuances of natural language– It offers an innovative approach to searching:

• It breaks the confines of keyword search queries using natural language

• Makes search more natural and intuitive• Aims at fundamentally changing how we search the web and at the

same time delivering higher quality results

(source: http://www.powerset.com )

05.10.2007 LAC day 2007 7

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 8

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 9

An Overview of the ParGram Project

Basic Grammar Architecture

Other Finite State Tools

Tokenizer

Morpholo-gical

AnalyserFST

Lexicon(s)(Hand-written

OrAutomatically

Extracted)

LFG Grammar

(rules, templates)

Grammar Resources(parsing

and generating)

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 10

Outline

• Introduction – Overview of the ParGram Project– Project Objectives– Participating Members– The XLE Platform– Grammar Architecture

• The GreekGram Project– Overview– Assumptions– Coverage– Demonstration– Future Development Directions

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 11

The GreekGram Project

Overview

• A preliminary effort to develop a large-scale LFG computational Grammar for Modern Greek

• Shares the objectives and principles of similar ParGram projects (parallel and crosslinguistic; balancing of maintainability and achieving large coverage)

• Current main focus is on development of grammar rules; the lexicon is kept as minimal as possible

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 12

The GreekGram Project

Assumptions

• Focus on syntax (but in such a way that the representations produced could serve as direct and useful input for incorporation into semantic interpretations)

• All four possible Modern Greek Word orders are treated as equally acceptable in terms of markedness and acceptability

• Modern Greek word Order is represented non-configurationally (Tzanidaki, 1996; Alexopoulou, 1999)

• No treatment of morphology (using the Morphological analyzer or some other Finite state tool) – there is a separate lexical entry for each form.

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 13

The GreekGram Project

Coverage

• All 4 possible word orders (irrespective of markedness)• Pro-drop character of the language• Subcategorisation frames of intransitive, transitive and

ditransitive verbs• Number, case and gender agreement within the NP or PP;

subject-verb agreement• Basic Relative Clause structure• Coordination of relative clauses• Some punctuation

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 14

The GreekGram Project

Demonstration

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 15

The GreekGram Project

Future Development Directions

• Account for a greater variety of verb subcategorisation frames

• Enrich the lexicon adding more lexical entries• Account for the morphology within the lexicon, using the

XFST (or other finite state tools)• Expand the coverage of the grammar to other constructions

such as interrogatives, imperatives and negation• …

Kakia Chatsiou GreekGram: Building a parallel grammar for Modern Greek

05.10.2007 LAC day 2007 16

Thank you!

For more information and updates on the progress of the project visit

http://privatewww.essex.ac.uk/~achats/projects/greekgram/index.html