a counterexample to a conjecture concerning concatenation hierarchies

4
Information Processing Letters 110 (2009) 4–7 Contents lists available at ScienceDirect Information Processing Letters www.elsevier.com/locate/ipl A counterexample to a conjecture concerning concatenation hierarchies Jorge Almeida a,, Ondˇ rej Klíma b a Dep. Matemática Pura/CMUP, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugal b Department of Mathematics and Statistics, Faculty of Science, Masaryk University, Kotlᡠrská 2, Brno 61137, Czech Republic article info abstract Article history: Received 5 June 2009 Received in revised form 18 August 2009 Accepted 18 September 2009 Available online 22 September 2009 Communicated by A. Muscholl Keywords: Formal languages Regular languages Concatenation hierarchies Level two Star-free languages We give a counterexample to the conjecture which was originally formulated by Straubing in 1986 concerning a certain algebraic characterization of regular languages of level 2 in the Straubing–Thérien concatenation hierarchy of star-free languages. © 2009 Elsevier B.V. All rights reserved. 1. Introduction This paper contributes to one of the most interesting open problems in the theory of regular languages, namely the effective characterization of languages of the second level in the Straubing–Thérien concatenation hierarchy of star-free languages. For a detailed overview, missing defini- tions, and complete references we refer to the basic survey paper [5], where Section 8.1 is devoted to the Straubing– Thérien hierarchy and Section 8.5 contains a conjecture concerning the second level, which is the main topic of this paper. The individual levels of the Straubing–Thérien hierarchy are defined inductively by alternately taking polynomial and Boolean closures, starting from the trivial variety of languages. It is decidable whether a given regular language belongs to each of the levels 0, 1/2, 1 and 3/2, but no al- gorithm is known to decide the same problem for level 2 or higher. Since the class V 2 of all languages from the sec- ond level forms a variety of languages, one can consider * Corresponding author. E-mail addresses: [email protected] (J. Almeida), [email protected] (O. Klíma). the corresponding pseudovariety of monoids V 2 according to Eilenberg’s correspondence. Thus the main longstanding open problem in the topic is the membership problem for the pseudovariety of monoids V 2 , i.e. the problem of de- ciding whether a given finite monoid belongs to V 2 . It was proved in [6] that the languages from the sec- ond level V 2 are the finite Boolean combinations of the languages of the form A 0 a 1 A 1 a 2 ... a k A k , where the a i ’s are letters and the A j ’s are subsets of A. Also some al- gebraic characterizations of the class V 2 were established in [6]. Here we only recall that V 2 = PJ, where PJ is the pseudovariety of finite monoids generated by all power monoids P (M), where M is an arbitrary finite J -trivial monoid. Unfortunately, there is no general algorithm to compute the power operator [2], even though many com- putations on specific pseudovarieties have been carried out [1, Chapter 11]. Perhaps surprisingly, none of the men- tioned nontrivial results has so far led to any solution of the membership problem for V 2 . Straubing conjectured that V 2 is equal to a certain pseudovariety CJ which is given by an effective de- scription. Straubing’s conjecture was originally formulated in [12], while it was not repeated in the full version of that paper [13]. Later Straubing and Weil corrected a certain technical error contained in [13] and stated the equality 0020-0190/$ – see front matter © 2009 Elsevier B.V. All rights reserved. doi:10.1016/j.ipl.2009.09.011

Upload: jorge-almeida

Post on 26-Jun-2016

215 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: A counterexample to a conjecture concerning concatenation hierarchies

Information Processing Letters 110 (2009) 4–7

Contents lists available at ScienceDirect

Information Processing Letters

www.elsevier.com/locate/ipl

A counterexample to a conjecture concerning concatenation hierarchies

Jorge Almeida a,∗, Ondrej Klíma b

a Dep. Matemática Pura/CMUP, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugalb Department of Mathematics and Statistics, Faculty of Science, Masaryk University, Kotlárská 2, Brno 61137, Czech Republic

a r t i c l e i n f o a b s t r a c t

Article history:Received 5 June 2009Received in revised form 18 August 2009Accepted 18 September 2009Available online 22 September 2009Communicated by A. Muscholl

Keywords:Formal languagesRegular languagesConcatenation hierarchiesLevel twoStar-free languages

We give a counterexample to the conjecture which was originally formulated by Straubingin 1986 concerning a certain algebraic characterization of regular languages of level 2 inthe Straubing–Thérien concatenation hierarchy of star-free languages.

© 2009 Elsevier B.V. All rights reserved.

1. Introduction

This paper contributes to one of the most interestingopen problems in the theory of regular languages, namelythe effective characterization of languages of the secondlevel in the Straubing–Thérien concatenation hierarchy ofstar-free languages. For a detailed overview, missing defini-tions, and complete references we refer to the basic surveypaper [5], where Section 8.1 is devoted to the Straubing–Thérien hierarchy and Section 8.5 contains a conjectureconcerning the second level, which is the main topic ofthis paper.

The individual levels of the Straubing–Thérien hierarchyare defined inductively by alternately taking polynomialand Boolean closures, starting from the trivial variety oflanguages. It is decidable whether a given regular languagebelongs to each of the levels 0, 1/2, 1 and 3/2, but no al-gorithm is known to decide the same problem for level 2or higher. Since the class V2 of all languages from the sec-ond level forms a variety of languages, one can consider

* Corresponding author.E-mail addresses: [email protected] (J. Almeida), [email protected]

(O. Klíma).

0020-0190/$ – see front matter © 2009 Elsevier B.V. All rights reserved.doi:10.1016/j.ipl.2009.09.011

the corresponding pseudovariety of monoids V2 accordingto Eilenberg’s correspondence. Thus the main longstandingopen problem in the topic is the membership problem forthe pseudovariety of monoids V2, i.e. the problem of de-ciding whether a given finite monoid belongs to V2.

It was proved in [6] that the languages from the sec-ond level V2 are the finite Boolean combinations of thelanguages of the form A∗

0a1 A∗1a2 . . .ak A∗

k , where the ai ’sare letters and the A j ’s are subsets of A. Also some al-gebraic characterizations of the class V2 were establishedin [6]. Here we only recall that V2 = PJ, where PJ is thepseudovariety of finite monoids generated by all powermonoids P (M), where M is an arbitrary finite J -trivialmonoid. Unfortunately, there is no general algorithm tocompute the power operator [2], even though many com-putations on specific pseudovarieties have been carriedout [1, Chapter 11]. Perhaps surprisingly, none of the men-tioned nontrivial results has so far led to any solution ofthe membership problem for V2.

Straubing conjectured that V2 is equal to a certainpseudovariety CJ which is given by an effective de-scription. Straubing’s conjecture was originally formulatedin [12], while it was not repeated in the full version of thatpaper [13]. Later Straubing and Weil corrected a certaintechnical error contained in [13] and stated the equality

Page 2: A counterexample to a conjecture concerning concatenation hierarchies

J. Almeida, O. Klíma / Information Processing Letters 110 (2009) 4–7 5

V2 = CJ as Conjecture 2.5 in [14]. Straubing proved theinclusion V2 ⊆ CJ and that the classes V2 and CJ do notdiffer on monoids generated by two elements. It has alsobeen shown by Cowan [3,4] that the two classes containprecisely the same inverse monoids. The notation CJ is notused in [5], but it is used for example in [1, p. 400], wheredefining pseudoidentities for the variety CJ can be found.

We will use the alternative formulation of the Straubingconjecture based on the equality CJ = B1 ©m Sl, which fol-lows from nontrivial general results given by Pin and Weilin [7] (see also [5, Theorem 6.5]). Here ©m is the Mal’cevproduct [5, Section 6], B1 is the pseudovariety of finitesemigroups corresponding to the variety of languages ofdot-depth one, and Sl is the pseudovariety of finite semi-lattices. A general conjecture [5, Conjecture 8.1] concern-ing the Boolean-polynomial closure was formulated by Pinand Weil originally in [9]. Although the general conjecturewas corrected in [10], all these adaptations did not changethe original Straubing conjecture for the class V2, so thepresent-day conjecture is the following.

Conjecture. (See Pin, Straubing, Weil [9,10,12–14].) V2 =B1 ©m Sl.

In this paper we provide a counterexample to thisconjecture, and consequently also to the generalizationfrom [10].

Theorem. V2 �= B1 ©m Sl.

This of course does not solve the original problem ofeffectively characterizing V2 and in a sense it shows thatthe situation is rather more complicated than expected.Yet, our method does provide a tighter upper bound forV2 which opens new paths for attacking the problem.

In Section 2, we recall the few preliminaries that arerequired for the proof of the theorem, which is presentedin Section 3.

2. Preliminaries

There are two parallel theories of varieties of languages,namely ∗-varieties of languages and +-varieties of lan-guages. In the first case, languages are subsets of A∗ and∗-varieties of languages correspond to pseudovarieties ofmonoids according to Eilenberg’s correspondence, while, inthe second case, languages are subsets of A+ and +-va-rieties of languages correspond to pseudovarieties of semi-groups. See [5, Sections 1 and 4] for details. Here we noteonly that some operations have a different interpretationin these two theories, such as language complementation.Positive varieties of languages have been considered inboth cases [5, Section 4].

By Reiterman’s Theorem [11], pseudovarieties of alge-bras are defined by pseudoidentities. We will need thisconcept only for pseudovarieties of monoids. In this case,pseudoidentities are formal equalities of implicit opera-tions, which are operations whose interpretation in finitemonoids commutes with homomorphisms. Reiterman’sTheorem has been extended by Pin and Weil [8] in particu-lar to pseudovarieties of ordered monoids, the equality (of

implicit operations) being replaced by the order relation.A pseudovariety of monoids can be viewed as a pseudova-riety of ordered monoids by endowing its elements withall possible compatible partial orders.

Besides so-called explicit operations given by words,the most familiar example of implicit operation is the ω-power, which associates to each element m of a finitemonoid its unique idempotent power mω = mn (n > 0). Allimplicit operations over a fixed finite set A form a compactmonoid under a natural topology, namely the free profinitemonoid F A on the set A, in which the discrete submonoidgenerated by A, whose elements are the explicit opera-tions, is a free monoid and thus is identified with A∗ . Inparticular, if we let P A denote the monoid of all subsetsof A under the union operation, endowed with the dis-crete topology, then the natural mapping A → P A inducesa continuous homomorphism α : F A → P A . The restrictionof α to A∗ is the familiar content function and, more gen-erally, a letter a ∈ A belongs to α(u) for a given implicitoperation u ∈ F A if and only if there is a factorization ofthe form u = xay with x, y ∈ F A . Thus, α is still called thecontent function. See [1, Section 8.1] for further details.

Trivially, P A is a semilattice, i.e. a commutative andidempotent monoid. Moreover, since P A is a relatively freeprofinite monoid on the set A in the class Sl, for u, v ∈ F A ,we have Sl |� u = v if and only if α(u) = α(v). Note thatthe ∗-variety of languages corresponding to Sl is formedby finite Boolean combinations of languages of the formA∗aA∗ , where a is a letter from an alphabet A. The pseu-dovariety Sl is also often denoted J1, for example in [5].

A subset of A+ is a language of dot-depth one if it isa finite Boolean combination of languages of the formw0 A∗w1 A∗ . . . A∗wk−1 A∗wk , where k � 0, w0, . . . , wk ∈A∗ , and at least one wi ∈ A+ . The languages of dot-depthone constitute a +-variety of languages and the corre-sponding pseudovariety of semigroups is the B1 of Straub-ing’s conjecture. For the definition of the dot-depth hi-erarchy, connections to the Straubing–Thérien hierarchy,and details on the pseudovariety B1, we refer to [5, Sec-tion 8.2].

Finally, we recall the characterization of V3/2, the level3/2 of the Straubing–Thérien hierarchy. It consists of finiteunions of languages of the form A∗

0a1 A∗1a2 . . .ak A∗

k , whereeach Ai ⊆ A and each a j ∈ A, and it forms a positive ∗-va-riety of languages.

Proposition 1. (See [9, Theorem 8.7], [5, Theorem 8.9].) A lan-guage is of level 3/2 if and only if its ordered syntactic monoidsatisfies the pseudoidentity uωvuω � uω for all u, v such thatα(u) = α(v).

Note that the aperiodicity pseudoidentity xω+1 = xω isa consequence of pseudoidentities from Proposition 1. Thepseudovariety of ordered monoids corresponding to V3/2 isdenoted V3/2.

3. Proof of the theorem

For the proof of the theorem, we give a certain pseu-doidentity which is satisfied in V2 and a monoid M ∈

Page 3: A counterexample to a conjecture concerning concatenation hierarchies

6 J. Almeida, O. Klíma / Information Processing Letters 110 (2009) 4–7

Fig. 1.

B1 ©m Sl such that M does not satisfy this pseudoidentity.In other words we have M ∈ B1 ©m Sl and M /∈ V2.

The following proposition gives new pseudoidentitiesfor the pseudovariety V2.

Proposition 2. Let u and v be implicit operations such thatV3/2 |� u � v. Then V2 |� uω = uωvuω .

Proof. Since V2 is the Boolean closure of V3/2, it is clearthat V2 |� s = t if and only if V3/2 |� s = t , i.e. if and onlyif V3/2 |� s � t and V3/2 |� t � s.

From the assumption V3/2 |� u � v , we deduce thatα(u) = α(v) because Sl ⊆ V3/2. From Proposition 1, we ob-tain immediately V3/2 |� uωvuω � uω .

When we multiply u � v by uω from both sides, weobtain uωuuω � uωvuω . Since V3/2 |� xω+1 = xω , we con-clude that V3/2 |� uω � uωvuω . �

We consider the following implicit operations over theset of variables X = {x, y, z}:

π = (xy)ωx, ρ = zππ z, σ = zπ z.

Proposition 3. The pseudovariety of finite monoids V2 satisfiesthe following pseudoidentity

ρω = ρωσρω. (1)

Proof. Applying Proposition 1 to the pair of explicit oper-ations xy and xxy, we obtain that V3/2 satisfies the pseu-doidentity (xy)ωxxy(xy)ω � (xy)ω . If we multiply it by xon the right, then we deduce that V3/2 |� ππ � π . HenceV3/2 |� ρ � σ and the statement follows from Proposi-tion 2. �

In the sequel, we consider a monoid M which is thetransformation monoid of the automaton over the alphabetA = {a,b, c} given in Fig. 1.

Note that the automaton is incomplete since there isno action of the letter c on state 2. Hence the elements ofthe monoid M are partial transformations. In Fig. 2 we cansee the structure of the monoid M using the usual eggboxrepresentation of J -classes, where a ∗ marks a subgroupH-class. A crucial observation is that the partial transfor-mation c has incomplete domain and one-element range.Hence each transformation given by a word containing the

Fig. 2.

letter c has one-element range or it is the empty trans-formation, i.e. the element 0. An elementary calculationshows that all partial transformations from M which haveone-element ranges are J -related. Further, the ideal gen-erated by the element c, denoted by McM , consists of thetwo bottom J -classes of M and hence it is a completely0-simple semigroup.

Proposition 4. M ∈ B1 ©m Sl.

Proof. To prove the statement, we describe a relationalmorphism [5, Section 6] τ from M to the semilattice P A .Let ϕ : A∗ → M be the morphism identifying letters fromthe fixed alphabet A = {a,b, c} with the corresponding el-ements of M , i.e. with partial transformations on the threeelement set {1,2,3}.

Now we consider the relational morphism τ : M → P A ,given by the formula

τ (m) = {α(w)

∣∣ w ∈ A∗, ϕ(w) = m}, for m ∈ M.

It is clear that τ is indeed a relational morphism as τ =α ◦ ϕ−1. Since P A ∈ Sl, it is enough to prove that for eachB ∈ P A we have τ−1(B) ∈ B1.

For B = ∅, the subsemigroup τ−1(B) is a singleton. Nowassume that B �= ∅ and c /∈ B . The automaton obtainedfrom that in Fig. 1 by changing the action of the letterc to let it act like the letter b is the minimal automa-ton of the language A∗aaA∗ , which is of dot-depth one.The transformation semigroup of this automaton, i.e. thesyntactic semigroup of the language A∗aaA∗ , is the sub-semigroup of M generated by the letters a and b, which isusually denoted A2 in the literature. This syntactic semi-group belongs to B1, whence so does its subsemigroupτ−1(B).

If we assume that c ∈ B , then τ−1(B) is a subsemigroupof the completely 0-simple semigroup McM. Note that ev-ery aperiodic completely 0-simple semigroup is locally asemilattice. It is well known [5, Theorem 5.18] that thepseudovariety of semigroups consisting of all local semilat-tices LSl corresponds to the +-variety of all locally testablelanguages, which are Boolean combinations of languages of

Page 4: A counterexample to a conjecture concerning concatenation hierarchies

J. Almeida, O. Klíma / Information Processing Letters 110 (2009) 4–7 7

the form w A∗ , A∗w or A∗w A∗ with w ∈ A+ . Since everylocally testable language is of dot-depth one, we concludethat McM ∈ LSl ⊆ B1. The required property τ−1(B) ∈ B1follows. �

We finish the proof of the theorem with the followingobservation.

Proposition 5. The monoid M does not satisfy the pseudoiden-tity (1).

Proof. We consider the substitution ψ : F X → M given bythe rules ψ(x) = a, ψ(y) = b, ψ(z) = c. Then it is easy tocheck that ψ(π) = (ab)ωa = a, ψ(ρ) = caac = c, ψ(σ ) =cac = 0. Finally, we have ψ(ρω) = c �= 0 = ψ(ρωσ ρω). �Acknowledgements

The work of the first author was supported, in part,by Fundação para a Ciência e a Tecnologia (FCT) through theCentro de Matemática da Universidade do Porto and by theFCT project PTDC/MAT/65481/2006, which is partly fundedby the European Community Fund FEDER. The work ofthe second author was supported, in part, by the Min-istry of Education of the Czech Republic under the projectMSM 0021622409 and by the Grant No. 201/09/1313 ofthe Grant Agency of the Czech Republic. The work leadingto this paper has also been carried out within the frame-work of the ESF programme “Automata: from Mathematicsto Applications (AutoMathA)”, benefiting in particular from

an exchange grant for the visit of the second author to theUniversity of Porto.

References

[1] J. Almeida, Finite Semigroups and Universal Algebra, World Scientific,1994.

[2] K. Auinger, B. Steinberg, On the extension problem for partial per-mutations, Proc. Amer. Math. Soc. 131 (2003) 2693–2703.

[3] D. Cowan, Inverse monoids of dot-depth two, Int. J. Algebra Com-put. 3 (1993) 411–424.

[4] D. Cowan, A result on the dot-depth hierarchy for inverse monoids,in: J. Almeida, et al. (Eds.), Semigroups, Automata and Languages,World Scientific, 1996, pp. 41–57.

[5] J.-E. Pin, Syntactic semigroups, in: G. Rozenberg, A. Salomaa (Eds.),Handbook of Formal Languages, Springer, 1997 (Chapter 10).

[6] J.-E. Pin, H. Straubing, Monoids of upper triangular matrices, in:G. Pollák, et al. (Eds.), Semigroups. Structure and Universal AlgebraicProblems, in: Colloq. Math. Soc. János Bolyai, vol. 39, North-Holland,1985, pp. 259–272.

[7] J.-E. Pin, P. Weil, Profinite semigroups, Mal’cev products and identi-ties, J. Algebra 182 (1996) 604–626.

[8] J.-E. Pin, P. Weil, A Reiterman theorem for pseudovarieties of finitefirst-order structures, Algebra Universalis 35 (1996) 577–595.

[9] J.-E. Pin, P. Weil, Polynomial closure and unambiguous product, The-ory Comput. Systems 30 (1997) 383–422.

[10] J.-E. Pin, P. Weil, A conjecture on the concatenation product, Theoret.Inform. Appl. 35 (2001) 597–618.

[11] J. Reiterman, The Birkhoff theorem for finite algebras, Algebra Uni-versalis 14 (1982) 1–10.

[12] H. Straubing, Semigroups and languages of dot-depth 2, in: Proc.ICALP’86, in: Lecture Notes in Comput. Sci., vol. 226, Springer, 1986,pp. 416–423.

[13] H. Straubing, Semigroups and languages of dot-depth two, Theoret.Comput. Sci. 58 (1988) 361–378.

[14] H. Straubing, P. Weil, On a conjecture concerning dot-depth two lan-guages, Theoret. Comput. Sci. 104 (1992) 161–183.