1 syntax sudeshna sarkar 25 aug 2008. 2 some fundamental questions what is language? how to define a...
TRANSCRIPT
![Page 1: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/1.jpg)
1
Syntax
Sudeshna Sarkar
25 Aug 2008
![Page 2: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/2.jpg)
2
Some Fundamental Questions
What is Language?
How to define a Language?
What makes a language different from another?
Is there anything common to all languages?
![Page 3: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/3.jpg)
3
Syntax
Syntax: from Greek syntaxis, “setting out together, arrangmenet’
Refers to the way words are arranged together, and the relationship between them.
Distinction:Prescriptive grammar: how people ought to talk
Descriptive grammar: how they do talk
Goal of syntax is to model the knowledge of that people unconsciously have about the grammar of their native language
![Page 4: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/4.jpg)
4
The Two Schools
RationalistsIt’s all hardcoded in our brains
Principle and Parameter Theory
Poverty of Stimulus
Recursion
EmpiricistsJust a special kind of pattern recognition
No different from other cognitive abilities like vision
Language is a stochastic phenomenon
![Page 5: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/5.jpg)
5
The Generative Grammar
“The grammatical principles underlying languages are innate and fixed, and the differences among the world's languages can be characterized in terms of parameter settings in the brain …”
- www.wikipedia.org
Noam Chomsky [1928-]
Courtesy www.chomsky.info
![Page 6: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/6.jpg)
6
I & E Languages
I – Language: Mentally represented system of rules (I – internal)E – Language: Observable external products of I-language (written text, utterances)Language: Collective E-language of a very large group of speakersSyntax: Study of the I-language from E-language
![Page 7: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/7.jpg)
7
The Chomsky Hierarchy
A → aBA → aFinite state automatonRegularType-3
A → γNon-deterministic
pushdown automatonContext-freeType-2
αAβ → αγβLinear-bounded non-deterministic Turing
machine
Context-sensitive
Type-1
No restrictions
Turing machineRecursively enumerable
Type-0
Production rules
AutomatonLanguagesGrammar
![Page 8: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/8.jpg)
8
From Formal to Natural Languages
Organizational Unit Complexity
Word Regular
Sounds Regular
Sentence Context-free
Discourse ??
![Page 9: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/9.jpg)
9
Some Observations on NLs
Constituency: A group of words acts as a single unit – phrases, clauses etc.
Grammatical Relations: Different words/ phrases are related to the main verb of the sentence – object, subject, instrument
Subcategorization and Dependency Relations: Not all verbs can take all type of arguments – transitive, intransitive etc.
![Page 10: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/10.jpg)
10
Syntax
Why should you care?Grammar checkers
Question answering
Information extraction
Machine translation
![Page 11: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/11.jpg)
11
Why NLP is difficult:Newspaper headlines
Iraqi Head Seeks Arms
Juvenile Court to Try Shooting Defendant
Teacher Strikes Idle Kids
Stolen Painting Found by Tree
Local High School Dropouts Cut in Half
Red Tape Holds Up New Bridges
Clinton Wins on Budget, but More Lies Ahead
Hospitals Are Sued by 7 Foot Doctors
Kids Make Nutritious Snacks
![Page 12: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/12.jpg)
12
Why is NLU difficult? The hidden structure of
language is hugely ambiguous
Tree for: Fed raises interest rates 0.5% in effort to control inflation (NYT headline 5/17/00)
![Page 13: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/13.jpg)
13
Where are the ambiguities?
![Page 14: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/14.jpg)
14
The bad effects of V/N ambiguities
![Page 15: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/15.jpg)
15
Context-Free Grammars
Capture constituency and orderingOrdering is easy
What are the rules that govern the ordering of words and bigger units in the language
What’s constituency?
How words group into units and how the various kinds of units behave wrt one another
![Page 16: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/16.jpg)
16
Constituency
We have NLP classes from 5:30 to 6:30 pm on Tuesday.
On Tuesday we have NLP classes from 5:30 – 6:30 pm.
From 5:30 to 6:30 pm on Tuesday we have NLP classes.
We have NLP on Tuesday from 5:30 to 6:30 pm classes.
On we have NLP classes from Tuesday 5:30 to 6:30 pm.
From 5:30 we have to 6:30 pm on Tuesday NLP classes.
![Page 17: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/17.jpg)
17
Constituency
We have NLP classes from 5:30 to 6:30 pm on Tuesday.
On Tuesday we have NLP classes from 5:30 – 6:30 pm.
From 5:30 to 6:30 pm on Tuesday we have NLP classes.
We have NLP on Tuesday from 5:30 to 6:30 pm classes.
On we have NLP classes from Tuesday 5:30 to 6:30 pm.
From 5:30 we have to 6:30 pm on Tuesday NLP classes.
![Page 18: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/18.jpg)
18
Phrases
Phrase: Group of words that act as a unitNoun Phrase NP
– A midsummer night’s dream, My experiments with truth, The man who knew infinity
Verb Phrase VP– Gone with the wind, Saving private Ryan
Prepositional Phrases PP– Of sons and lovers, to sir with love, Beyond the blue
mountains, Into the heart of the mind
![Page 19: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/19.jpg)
19
Modelling the Syntax of English
Let us try CFGsS NP VP I love India.
S VP Love your country.
S Aux NP VP Do you love your country?
S Wh-NP VP Who loves his country?
S Wh-NP Aux NP VP
Which country do you live in?
![Page 20: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/20.jpg)
20
Phrase Structure Grammar
Context Free Grammars are also called phrase structure grammars
Phrases are the building blocks of any PSG (i.e. CFG)
Phrases in turn are defined by CFG (PSG)
![Page 21: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/21.jpg)
21
Is CFG Necessary?
Can we model the syntax of English using Regular Grammar?
NO! we cannot model recursion in RG
S NP VP
VP Verb S
I think that Einstein thought that Newton said …
![Page 22: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/22.jpg)
22
CFG Examples
S -> NP VPNP -> Det NOMINALNOMINAL -> NounVP -> VerbDet -> aNoun -> flightVerb -> left
![Page 23: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/23.jpg)
23
CFGs
S -> NP VP
This says that there are units called S, NP, and VP in this language
That an S consists of an NP followed immediately by a VP
Doesn’t say that that’s the only kind of S
Nor does it say that this is the only place that NPs and VPs occur
![Page 24: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/24.jpg)
24
Context Free Grammars
A CFG consists of a tuple (N,T,S,P)N is a finite set of non-terminal symbols
T is a finite set of terminal symbols
S is the start symbol
P is a finite set of rules of the form X where X N and {N U T}*
![Page 25: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/25.jpg)
25
Phrase Structure Parsing
Phrase structure organizes words into phrases, often called constituents
This organization is hierarchical
For a given string there is often ambiguity about the correct phrase structure
This ambiguity often corresponds to semantic ambiguity
![Page 26: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/26.jpg)
26
![Page 27: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/27.jpg)
27
Simple examples of a CFG
Take the non-terminals = {S, NP, VP, V}
And the terminals {boys, study, play, books, cricket)
Let the start symbol be S
Let the rule set beS NP VP
VP V
VP V NP
NP boys
NP books
NP cricket
V study
V play
This CFG licenses a finite number of tree sentences
![Page 28: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/28.jpg)
28
Generativity
As with FSAs and FSTs you can view these rules as either analysis or synthesis machines
Generate strings in the language
Reject strings not in the language
Impose structures (trees) on strings in the language
![Page 29: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/29.jpg)
29
Derivations
A derivation is a sequence of rules applied to a string that accounts for that string
Covers all the elements in the string
Covers only the elements in the string
![Page 30: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/30.jpg)
30
Derivations as Trees
![Page 31: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/31.jpg)
31
Two views of linguistic structure: 1. Constituency (phrase structure)
Phrase structure organizes words into nested constituents.
How do we know what is a constituent? (Not that linguists don't argue about some cases.)
Distribution: a constituent behaves as a unit that can appear in different places:– John talked [to the children] [about drugs].– John talked [about drugs] [to the children].– *John talked drugs to the children about
Substitution/expansion/pro-forms:– I sat [on the box/right on top of the box/there].
Coordination, regular internal structure, no intrusion, fragments, semantics, …
![Page 32: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/32.jpg)
32
Two views of linguistic structure: 2. Dependency structure
Dependency structure shows which words depend on (modify or are arguments of) which other words.
Theboy put the tortoiseonthe rugrug
the
the
ontortoise
put
boy
The
![Page 33: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/33.jpg)
33
Parsing
Parsing is the process of taking a string and a grammar and returning a (many?) parse tree(s) for that string
It is completely analogous to running a finite-state transducer with a tape
It’s just more powerful– Remember this means that there are languages we can
capture with CFGs that we can’t capture with finite-state methods
![Page 34: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/34.jpg)
34
Other Options
Regular languages (expressions)Too weak
Context-sensitive or Turing equivToo powerful (maybe)
![Page 35: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/35.jpg)
35
Context?
The notion of context in CFGs has nothing to do with the ordinary meaning of the word context in language.
All it really means is that the non-terminal on the left-hand side of a rule is out there all by itself (free of context)A -> B C
Means that
I can rewrite an A as a B followed by a C regardless of the context in which A is found
Or when I see a B followed by a C I can infer an A regardless of the surrounding context
![Page 36: 1 Syntax Sudeshna Sarkar 25 Aug 2008. 2 Some Fundamental Questions What is Language? How to define a Language? What makes a language different from another?](https://reader035.vdocuments.us/reader035/viewer/2022062804/56649f4f5503460f94c70e63/html5/thumbnails/36.jpg)
36
Key Constituents (English)
Sentences
Noun phrases
Verb phrases
Prepositional phrases