csc 3130: automata theory and formal languages

20
CSC 3130: Automata theory and formal languages Andrej Bogdanov http://www.cse.cuhk.edu.hk/ ~andrejb/csc3130 The Chinese University of Hong Kong Limitations of context-free languages Fall 2009

Upload: jules

Post on 06-Jan-2016

31 views

Category:

Documents


2 download

DESCRIPTION

Fall 2009. The Chinese University of Hong Kong. CSC 3130: Automata theory and formal languages. Limitations of context-free languages. Andrej Bogdanov http://www.cse.cuhk.edu.hk/~andrejb/csc3130. Non context-free languages. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CSC 3130: Automata theory and formal languages

CSC 3130: Automata theory and formal languages

Andrej Bogdanov

http://www.cse.cuhk.edu.hk/~andrejb/csc3130

The Chinese University of Hong Kong

Limitations of context-free languages

Fall 2009

Page 2: CSC 3130: Automata theory and formal languages

Non context-free languages

• Recall the pumping lemma for regular languagesallows us to show some languages are not regular

Are these languages context-free?

L1 = {anbn: n ≥ 0}

L2 = {x: x has same number of as and bs}L3 = {1n: n is prime}

L4 = {anbncn: n ≥ 0}

L5 = {zzR: z ∈ {0, 1}*}

L6 = {zz: z ∈ {0, 1}*}

Page 3: CSC 3130: Automata theory and formal languages

Some intuition

• Let’s try to show this is context-free

L4 = {anbncn: n ≥ 0}

context-free grammar pushdown automaton

S → aBc B → ??

read a / push 1

read c / pop 1???

Page 4: CSC 3130: Automata theory and formal languages

More intuition

• Suppose we could construct some CFG for L4, e.g.

• We do some derivationsof “long” strings

S BCB CS | bC SB | a

. . .

S BC CSC aSC aBCC abCC abaC abaSB abaBCB ababCB ababaB ababab

Page 5: CSC 3130: Automata theory and formal languages

More intuition

• If derivation is long enough, some variable mustappear twice on same path in parse tree

S BC CSC aSC aBCC abCC abaC abaSB abaBCB ababCB ababaB ababab

S

B C

BC S S

B C B C

a b a b a b

Page 6: CSC 3130: Automata theory and formal languages

More intuition

• Then we can “cut and paste” part of parse tree

S

B C

BC S S

B C B C

a b a b a b

BS

B C

a

b

b

B C

BC SS

B CBa

b a b

bC

S

ababab ababbabb✗

Page 7: CSC 3130: Automata theory and formal languages

More intuition

• We can repeat this many times

• Every sufficiently large derivation will have a part that can be repeated indefinitely– This is caused by cycles in the grammar

ababab✗ ababbabb✗ ababbbabbb

ababnabnbb

Page 8: CSC 3130: Automata theory and formal languages

General picture

u u

v v

vw

x

yy

u

v

v

v

w

x

y

uvwxy uv3wx3y

A

A

A

A

A

A

A

A

A

x

x

x

x

wuvvwxxy

Page 9: CSC 3130: Automata theory and formal languages

Example

• If L4 has a context-free grammar G, then

• What happens for anbncn?

• No matter how it is split, uv2wx2y ∉ L4!

If uvwxy can be derived in G, so can uviwxiy for every i

L4 = {anbncn: n ≥ 0}

wu yxva a a ... a a b b b ... b b c c c ... c c

Page 10: CSC 3130: Automata theory and formal languages

Pumping lemma for context-free languages• Theorem: For every context-free language L

There exists a number n such that for every string z in L, we can write z = uvwxy where |vwx| ≤ n |vx| ≥ 1 For every i ≥ 0, the string uviwxiy is in L.

wu yxv

Page 11: CSC 3130: Automata theory and formal languages

Pumping lemma for context-free languages• So to prove L is not context-free, it is enough that

For every n there exists z in L, such that forevery way of writing z = uvwxy where |vwx| ≤ n and |vx| ≥ 1, the string uviwxiy isnot in L for some i ≥ 0.

wu yxv

Page 12: CSC 3130: Automata theory and formal languages

Proving language is not context-free• Just like for regular languages, need strategy that,

regardless of adversary, always wins you this game

adversary

choose nwrite z = uvwxy (|vwx| ≤ n,|vx| ≥ 1)

you

choose z Lchoose iyou win if uviwxiy L

1

2

Page 13: CSC 3130: Automata theory and formal languages

Example

adversary

choose nwrite z = uvwxy (|vwx| ≤ n,|vx| ≥ 1)

you

choose z Lchoose iyou win if uviwxiy L

1

2

1

2

L4 = {anbncn: n ≥ 0}

wu yxva a a ... a a b b b ... b b c c c ... c c

choose nwrite z = uvwxy

z = anbncni = ?

Page 14: CSC 3130: Automata theory and formal languages

Example

• Case 1: v or x contains two kinds of symbols

Then uv2wx2y not in L because pattern is wrong

• Case 2: v and x both contain one kind of symbol

Then uv2wx2y does not have same number of as, bs, cs

xva a a ... a a b b b ... b b c c c ... c c

xva a a ... a a b b b ... b b c c c ... c c

Page 15: CSC 3130: Automata theory and formal languages

More examples

• Which of these is context-free?

L1 = {anbn: n ≥ 0}

L2 = {x: x has same number of as and bs}L3 = {1n: n is prime}

L4 = {anbncn: n ≥ 0}

L5 = {zzR: z ∈ {a, b}*}

L6 = {zz: z ∈ {a, b}*}

Page 16: CSC 3130: Automata theory and formal languages

Example

1

2

L6 = {zz: z ∈ {a, b}*}

wu yxva a a a a a a a a b a a a a a a a a a b

choose nwrite z = uvwxy

z = anbi = ?

wxvu ya a a a a a a a a b a a a a a a a a a bWhat if:

Page 17: CSC 3130: Automata theory and formal languages

Example

1

2

L6 = {zz: z ∈ {a, b}*}

wu yxva a a a a a b b b b b b a a a a a a b b b b b b

choose nwrite z = uvwxy

z = anbn

i = ?

Recall that |vwx| ≤ n

Page 18: CSC 3130: Automata theory and formal languages

Example

Case 1: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

Three cases

Case 2: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

Case 3: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

vwx is in the first half of anbnanbn

vwx is in the middle part of anbnanbn

vwx is in the second half of anbnanbn

Page 19: CSC 3130: Automata theory and formal languages

Example

Case 1: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

Apply pumping with i = 0

Case 2: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

Case 3: w xv

a a a a a a b b b b b b a a a a a a b b b b b b

uv0wx0y looks like aibjanbn, where i < n or j < n

uv0wx0y looks like anbiajbn, where i < n or j < n

uv0wx0y looks like anbnaibj, where i < n or j < n

Page 20: CSC 3130: Automata theory and formal languages

Example

Case 1:

Apply pumping with i = 0

Case 2:

Case 3:

uv0wx0y looks like aibjanbn, where i < n or j < n

uv0wx0y looks like anbiajbn, where i < n or j < n

uv0wx0y looks like anbnaibj, where i < n or j < n

Not of the form zz

Not of the form zz

Not of the form zz

This covers all the cases, so L6 is not context-free