learning regular - university of pennsylvania...(deterministic finite automaton) and the syntactic...

LearningRegular

ω-Languages

The Goal

Device an algorithm for learning

an unknown regularω-language L

The Goal

set of infinite words (ω-words)

The Goal

accepted by a finite automaton

The Goal

Learner

The Goal

Learner

Teacher

The Goal

Learner

Teacher

membership queries

equivalencequeries

Coping with ω-words

Learner

Teacher

Isabcccccabdebbbaaaabcdaa…

Learner

Teacher

Isabcccccabdebbbaaaabcdaa…

Learner

Teacher

Iswingardium laviosaω in L?

Learner

Teacher

wingardium laviosa laviosa laviosa laviosa laviosa …

Learner

Teacher

wingardium laviosa laviosa laviosa laviosa laviosa …

Two regular ω-languages are equivalent iff

they agree on the set of ultimately periodic words

The Goal

Is uvω in L ?

Learner

Teacher

The Goal

Is uvω in L ?

Yes / No

Learner

Teacher

The Goal

Is uvω in L ?

Yes / No

Is H same as L ?

Learner

Teacher

The Goal

Is uvω in L ?

Yes / No

Is H same as L ?

Yes / No, c.e: uvω

Learner

Teacher

The Goal

Is uvω in L ?

Yes / No

Is H same as L ?

Yes / No, c.e: uvω

H s.t. H=L

Learner

Teacher

Motivation

Same problem for regular (finitary) languages is solved by the L* algorithm [Angluin]

L* has found applications in many areas including AI, neural networks, geometry, data mining, verificationand synthesis and many more.

Reactive systems concerns ω-languages, using L* for this limits application to safety (and excludes liveness, fairness)

Previous Work on Learning ω-Langs.

allregular

ω-languages

DP / DMDS / DR

DWCDWB

strictly less expressive

allregular

ω-languages

DP / DMDS / DR

DWCDWB

• [de la Higuera & Janodet, 2004]

• [Jayasrirani et al, 2012]

• [Saoudi & Yokomori, 1993]

From Prefixes

From Lassos

allregular

ω-languages

DP / DMDS / DR

DWCDWB

• [Maler & Pnueli ,1995]

From Prefixes

From Lassos

allregular

ω-languages

DP / DMDS / DR

DWCDWB

From Prefixes

From Lassos

allregular

ω-languages

DP / DMDS / DR

DWCDWB

From Prefixes

From Lassos

allregular

ω-languages

DP / DMDS / DR

DWCDWB

From Prefixes

From Lassos

However …

SyntacticFORC

RecurrentFDFA

[Calbrix, Nivat & Podelski ‘93]

[Maler & Staiger ‘93]

However …

SyntacticFORC

RecurrentFDFA

May be exp. bigger

However …

SyntacticFORC

RecurrentFDFA

May be exp. bigger

May be quadr.

bigger than

Main Contribution

Learner

Lω –

a learning algorithm for all 3 representations

(periodic, syntactic, recurrent)

Why is it difficult?

ω-aut.

states of the minimal DFA (deterministic finite automaton)

and the syntactic right congruence ~ of a language.

L* works due to the Myhill-NerodeTheorem, stating a 1-to-1 relationship between

ω-aut.

states of the minimal DFA (deterministic finite automaton)

and the syntactic right congruence ~ of a language.

L* works due to the Myhill-NerodeTheorem, stating a 1-to-1 relationship between

But for ω-langs, the syntactic right congruence is not informative enough!

ω-aut.

For finite words

The Syntactic Right Congruence

x »L y iff v2*. xv2L , yv2L

For finite words

Example:

L = { w w has an even number of a’s}

For finite words

Example:

L = { w w has an even number of a’s}

Then »L has two equivalence classes:

words with an odd number of a’s and

words with an even number of a’s

For finite words:

For ω-words:

x »L y iff w2!. xw2L, yw2L

For finite words:

For ω-words:

x »L y iff w2!. xw2L, yw2Lx »L y iff u,v2*. xuv!2L, yuv!2L

For ω-words:

x »L y iff u,v2*. xuv!2L, yuv!2L

Example:

L = { w | w has finitely many a’s}

For ω-words:

Example:

Then xuv!2L iff uv! has finitely may a’s.

For ω-words:

Example:

Regardless of x. Thus »L has just one equivalence class.

For ω-words:

Example:

For ω-words:

Example:

But clearly an ω-automaton for L requires at least two states.

Solution Foundation

Families of DFAs (FDFA)

A non-traditional representation of ω-automata

inspired by Maler & Staiger’s ’97 families of right congruences (FORC)

and their canonical representation of a Syntactic FORC

for which they have shown a Myhill-Nerode Theorem

Family of Right Congruences [MS97]

Leading Right Congruence

Plus some restriction (details ommited)

(»,¼1,¼2,¼3,¼4,¼5)

ProgressLeading

Family of DFAs (FDFA)

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

Leading DFAM

(M, P1, P2, P3, P4, P5 )

ProgressLeading

That restriction removed.

FDFA Exact Acceptance

(u,v)2 M, P1, P2, P3, P4, P5

P1P1P1

33 552

P4P4P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5

P1P1P1

33 552

P4P4P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5

P1P1P1

33 552

P4P4P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5

P1P1P1

33 552

P4P4P4 P5P5P5

P3P3P3

P1P1P1

FORC and FDFA – Example

No a’s

Some a’s

FORC FDFA

Leading

Progress

All prefixes are equally good

No a’s

Some a’s

FORC FDFA

Leading

Progress

No a’s

Some a’s

FORC FDFA

(abba,bbb)

(abba,bab)

Leading

Progress

4-state automaton

00,1,2

4-state automaton

00,1,2

Some periods of are easy to find:

100201

10020122

4-state automaton

00,1,2

100201

10020122

Some are hard:

4-state automaton

00,1,2

100201

10020122

Some are hard:

10121012 1012

1012 1012

4-state automaton

00,1,2

100201

10020122

Some are hard:

10121012 1012

1012 1012

The DFA for periods of has 23 states

=> L$ > 23

4-state automaton

00,1,2

All periods that loop back are easy to find:

100201

10020122

4-state automaton

00,1,2

All periods that loop back are easy to find:

100201

10020122

The DFA that accepts only periods of that loop back has only 5 states !

Saturation (Consistency)

How can we use this without losing saturation?

We say that a language is saturated if

uv!=xy! implies (u,v)2 L, (x,y)2 L

To achieve saturation, we change the definition of acceptance.

We say that a language is saturated if

uv!=xy! implies (u,v)2 L, (x,y)2 L

FDFA Normalized Acceptance

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5 ?

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

Normalization seeks for the smallest repetition

of the period that loops back

(u,v)2 M, P1, P2, P3, P4, P5 ?

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5 ?

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5 ?

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

(u,v)2 M, P1, P2, P3, P4, P5 ?

P1P1P1 22

33 552

P4 P5P5P5

P3P3P3

P1P1P1

We term Recurrent FDFA the FDFA where progress DFA recognize only periods that loop back. It is saturated under normalized acceptance!

(u,v)2 M, P1, P2, P3, P4, P5 ?

More generally

SyntacticFDFA

RecurrentFDFA

May be exp. bigger

Periodic FDFA (L$)

Generalizing to arbitrary n we get the following lower bound

The Learner Lω

Same scheme for all 3

canonical FDFAs

Minimality?

The Recurrent FDFA for a given L is not necessarilythe minimal FDFA for L.

According to the normalized acceptance criterion a progress DFA Pu should give correct results only to extensions that close a cycle to u. On other extensions it has freedom.

This has the flavor of minimization with unknowns, which is NP complete.

The Recurrent FDFA chooses to treat all don’t cares as rejecting.

Time Complexity

Interestingly, [Klarlund, 1994] has shown that choosing a leading automaton which is more refined (bigger) than the syntactic right congruence, may yield an overall smaller FDFA.

If we are given such a leading automaton, we can feedit to the learning algorithm, in which case it will yield the smaller FDFA.

However, the same phenomenon can cause the learning algorithm for the syntactic/recurrent FDFAs to work as hard as the one for the periodic FDFA

Time Complexity

The worst case time complexity for all 3 families is thus polynomial in the size of the periodic FDFA.

Some positive results on sizes obtained via recurrent FDFA learner vs. periodic FDFA learner on randomly generated Muller automata.

Future Direction

Find smaller canonical representations ?

Find smallest FDFA ?

Learning the leading first ?

L*-learning for (traditional) ω-automata ?

Other types of learning for ω-languages ?

THE ENDThank you for your

attention!

Comments or questions?

Learning

Regular

Languagesvia

Alternating

Automata

IJCAI’15

Time Complexity

The worst case time complexity for all 3 families is thus polynomial in the size of the periodic FDFA.

Some positive results on sizes obtained via recurrent FDFA learner vs. periodic FDFA learner on randomly generated Muller automata.

learning regular - university of pennsylvania...(deterministic finite automaton) and the syntactic...

Documents

goals congruence

similarity and congruence - intranet.cesc.vic.edu.au · ...

techcard automaton instructions

definition of an automaton:-an automaton...

congruence model

the automaton theater

ω automaton

automaton logic - tph.tuwien.ac.at

mathlinks grade 8 student packet 14 congruence and...

4-54-5triangle congruence: sss and sastriangle congruence...

cellular automaton modeling of microporosity formation...

1 lesson 3.4.6 congruence and similarity congruence and...

buildtrack apartment automaton

predicativity, the russell-myhill paradox, and church’s...

targets id reflections, translations, and rotations. verify...

becoming a designer: trajectories of linguistic development...

j. myhill - paradoxes

congruence transformations and triangle congruence

medieval dragon automaton

schreckenberg cellular automaton model