on approximate solutions of systems of linear inequalities · on approximate solutions of systems...

3
Journa l of Resea rch 01 the Nationa l Bureau of Sta ndards Vol. 49, No.4, October 1952; Resea rch Paper 2362 On Approximate Solutions of Systems of Linear Inequalities * Al an J . HoHman Let b be a. consiste nt system of linp, ar inequali ti es. The principal resu lt is a quantitative formulatIOn of the fa ct t hat if x "almost" satisfi es the inequa li ties then x is "close" to a solut ion. It is fur ther shown h ow it is possible in certain cases to the $i ze of the v ector joini ng x to th e ne arest solution from the magnitude of the positive coordi na tes of Ax- b. l. Introduction In many computational schem es for solving a ystem of linear inequalities (1) (brie fl y, one arrives at a vector x th at "almost" s atisfi es (1). It is almost obvious geo - m etrically th at , if (1) is consistent, one can infer th at there is a solu tion Xo of (1) "close" to x. Th e purpose of this rep ort is to justify and formul at e precisely this asse rtion.l We shall use fairly general definitions of functions th at measur e the size of vectors, sin ce it m ay be possible to obtain b etter estimate of the cons ta nt c (whose important role is described in the s tatement of the theorem) for ome measuring f un ctions th an for others. We shall make a few rem ar ks on the estimation of c aft er completing the proof of the main th eo rem. 2. The Main Theorem v Ve require the following Definitions: For any real number a, we de fin e a if a a+= - o if a< O. For any vector y =( y" ., Yk), we de fi ne y += (yt , . . ., y:) . (2) .. A. positive homogeneou function Fk defined on k-spa ce is a r eal con tinu ous f un ction s atisfying (i) F k( x) 0, F k( x) = 0 if, and only if, X= 0 (ii) (3) 'Th is work was sponsored (in part) by t il e Offi ce of Scient ifi c Research, US AF . 1 A. M. Ost rowski has kindly pointed out that part of the res ults given below is im pli ed by the f act that if J( and L are two convex bodies each of which is in a small neighborhood of the ot her, then their associated gauge f nnctions d iffer slightly. Th eorem : L et ( 1) be a consiste nt system of ine- qualities and let Fn and F m each sati sfy (3 ). Then th ere exists a constant c >0 such that for any x there exists a so luti on Xo of (1) wi th Th e proof is essentially contained in two lemm a (2 and 3 below) given by Shmuel Agmon. 2 Le mma 1. If Fm satisfies (3), there exists an e> O su ch that for every y and every su bset S of the half spa ces (1 ) F ", (if ) where Y =(Y I, . .. ,y",), g = (y" . . ', Ym), and _ Yi if the i th half space belon gs to S Y i= o otherwi se . P roof. It is clear from (3) (i) th at any e will s uffi ce for y= O. B y (3) (ii ), we n ee d only consider the ratio Fm(if )/F m( Y) for Y such that F(y)= ], a. comp act set. H ence, for each subs et S, F m( y )/ F m( Y) has a maximum es . Se t e= max es . Le mma 2. L et Q be the se t of soluti ons of (1), let x be a poi nt exterior to Q, and let Y be the poin t in Q ne arest to x. L et S be the subse t of the half spaces (1), each of which contains y in its b ounding hyp er- pl an e, and let Q s be the i nterse ction of these half spaces. Th en x is exteri or to Qs and y is the nea re st point of Q s to x. - Le mma 3. Le tM be an mXn matrix obtained fr om A by substituting 0 f or some of the rows of A, and let Q be th e cone of solutions oj O. L et E be the set oj all poin ts x such that (i) x is exterior to Q, and (ii) the origin is the po int oj Q n eare st to x. Th en th re exists a d s> O such that xeE i mplie P rooj oj the theor em . L et x be any vector ext erior to the solu tions Q of (1), l et Xo be the poin t of Q near- est to x, and l et S be defined as in lemma 2. Let M be the m atrix ob tain ed from A by substitu t- 2 S. Agmon. T he relaxati on meth od for linear ineqnalities, Nat ional Applied M at hemat ics Laboratory Report 52-27, N BS (pre publi cation copy). 263

Upload: others

Post on 18-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: On approximate solutions of systems of linear inequalities · On Approximate Solutions of Systems of Linear Inequalities * Alan J . HoHman Let ~x ~ b be a. consistent system of linp,ar

Journa l of Resea rch 01 the Nationa l Bureau of Standards Vol. 49, No.4, October 1952; Resea rch Paper 2362

On Approximate Solutions of Systems of Linear Inequalities *

Alan J . HoHman

Let ~x ~ b be a. consis tent system of linp,ar inequal ities. The principal result is a quantitative formulatIOn of the fa ct t hat if x "almost " sa t isfi es the inequalit ies t hen x is "close" to a solut ion. It is fur ther shown how i t is possible in certain cases to e~timate the $ize of the vector joining x to the nearest solu t ion from the magn itude of t he posit ive coordi nates of A x - b.

l. Introduction

In many comp utational schemes for solving a ystem of linear inequali ties

(1)

(briefly, Ax ~ b ), one arrives at a vector x that " almost" satisfies (1). It is almost obvious geo­metrically that, if (1) is consisten t, one can infer that there is a solution Xo of (1) " close" to x. The purpose of this r eport is to justify and formulate precisely this assertion .l We shall use fairly general defini tions of functions that measure the size of vectors, since i t may be possible to obtain better estima te of the constan t c (whose importan t role is described in the statemen t of the theorem) for ome measuring functions than for others. We shall

make a few remarks on the estimation of c after completing the proof of the main theorem.

2 . The Main Theorem vVe require the following

Defini tions: For any r eal number a, we de fine

a if a ~ o a+= -

o if a< O.

For any vector y =(y" ., Yk), we define y+= (yt , . . . , y:) . (2)

.. A. posi tive homogeneou function F k defined on k-space is a r eal con tinuous function satisfying

(i) F k( x ) ~ 0, Fk(x ) = 0 if, and only if, X= 0

(ii) (3) 'This work was sponsored (in part) by t ile Office of Scient ific Research , USAF. 1 A . M. Ostrowski has kind ly poin ted out that part of the results given below

is im plied by the fact t hat if J( and L are two convex bodies each of which is in a small neighborhood of the other, then their associated gauge fnnctions d iffer slightly.

Theorem : L et (1) be a consistent system of ine­qualities and let Fn and F m each satisfy (3). Then th ere exists a constant c >0 such that for any x there exists a solution Xo of (1) with

The proof is essentially contained in two lemma (2 and 3 below) given by Shmuel Agmon.2

L emma 1. If Fm satisfies (3), there exists an e> O such that for every y and every su bset S of the half spaces (1 )

F", (if ) ~eF",(y)

where Y=(YI, . .. ,y",), g = (y" . . ',Ym), and

_ Yi if the i th half space belongs to S Yi=

o otherwise.

P roof. It is clear from (3) (i) that any e will suffice for y = O. B y (3) (ii), we need only consider the ratio Fm(if)/Fm(Y) for Y such that F (y)= ], a. compact se t . H ence, for each subset S, Fm(y )/Fm(Y) has a maximum es . Set e=max es .

L emma 2. L et Q be the set of soluti ons of (1), let x be a point exterior to Q, and let Y be the point in Q nearest to x . L et S be the subset of the half spaces (1), each of which contains y in i ts bounding hyper­plane, and let Qs be the intersection of these half spaces.

Then x is exterior to Qs and y is the nearest poin t of Qs to x . - L emma 3. L etM be an m X n matrix obtained from A by substituting 0 f or some of the rows of A , and let Q be th e cone of solutions oj ~Mz ~ O. L et E be the set oj all points x such that (i) x is exterior to Q ,

and (ii) the origin is the point oj Q nearest to x . Then th re exists a ds>O such that xeE i mplie

P rooj oj the theorem . Let x be any vector exterior to the solutions Q of (1), let Xo be the point of Q near­est to x, and let S be defined as in lemma 2.

Let M be the matrix ob tained from A by substi tut-2 S. Agmon. T he relaxation method for linear ineq nalities, National Applied

M athematics Laboratory Report 52-27, N BS (prepublication copy).

263

Page 2: On approximate solutions of systems of linear inequalities · On Approximate Solutions of Systems of Linear Inequalities * Alan J . HoHman Let ~x ~ b be a. consistent system of linp,ar

ing 0 for the rows not in S, and let ;; be the vector obtained from b by substituting 0 for the components not contained in S.

Then lemma 2 says that Xo satisfies Mz= b, x is exterior to the solutions of Mz ~ b, and Xo is the solution of Mz~ b nearest to x. Perform the trans­lation z' = Z- Xo. Then Mz ~ b if, and only if,

Mz' = Mz - M Xo= Mz - b ~ o. Thus X- Xo belongs to the set E of lemma 3, and

Fm((Mx- b )+)=Fm((M(x- xo»)+) ~dsFn(x-xo) ~

dFn(x- xo),

where d= min ds . s

Thus, 1 - e

Fn(x- xo) ~dF m(M x- b) ~dF m((Ax- b)+),

using lemma 1. Setting c= eld completes the proof of the theorem.

3. Estimates of c for various norms

None of the estimates to be obtained is satisfac­tory, since each requires an inordinate amount of computation except in special cases. It is worth remarking, however, that even without knowledge of the size of c, the theorem is of use in insuring that any computation scheme that makes (Ax- b)+ approach 0 will bring x close to the set of solutions of (1). This guarantees, for instance, that in Brown's method for solving games the computed strategy vector is approaching the set of optimal strategy vectors .

In what follows let

I'x l = maximum of the absolute values of the co Ol·dina tes of x;

Ilx ll = sum of the absolute values of the co­ordinates of x;

Illxlll= the square root of the sum of the squares of the coordinates of x.

Note that if Fm is anyone of these norms, then e = 1. We consider these cases:

Case I. Fn=111 III , Fm=1 I. If G=(Cii) IS a square matrix of rth order, let

where the Gi/s are the cofactors of the elements of CiJ' Using this notation, and assuming that the

n rows of (1) are normalized so that :8 ail = 1, Agmon

j=1

(see p . 9 of reference in footnote 2) has shown that if

A is of rank r, then

where iI, . .. , ir are r (fixed) linearly independent rows of (ati) ; A~:::::: {; is the r X r submatrix formed by the fixed rows and indicated columns, ancl where the summation is performed over all different combinations of thej's.

Case II . Fn=1 I,Fm=1 I· CaseIII. Fn=1 I, Fm=1 1 II·

For cases II and III, it is convenient to have a description of E alternative to that contained in lemma 3. We shall use the notation of lemma 3.

Lemma 4 . Let K ' = the cone spanned by the ruw vectors oj M, with the origin deleted. Then K' = E.

Proof. Let Mil . . ., A;[m be the row vectors of 1\1, and let X= AlA;[1 + .. . + AmM m, where x7"" 0, and A i~ O , i = l , . .. , m. Then x is exterior to Q, and the origin is the point of Q closes t to x ; that is, z E a implies (x- z ).(x- z )-x·x ;;;;O. For z· X= Z·(A1Ml + .. . + AmM m)= A1Z·Ml + .. . +Amz·Mm ~O. H ence (x- z) · (x - z) - x . x= z.z- 2z.x ~ O . This shows that K' C Eo

We now prove thatE C K' . Assume x eE. Then,

Z ea implies z·x ~ 0 (4)

(o therwise let w be a sufficiently small positive num­ber; then w z eQ and wz·wz-2wz·x< O). Consider z as the coordinates of a half space whose bounding plane contains the origin. Then (4) says that all half spaces (" through" the origin) containing the row vectors of M also contain X . It is a fundamental result in the theory of linear inequalities 3 that this later statement implies that x is in the cone gener­a ted by the rows of M . H ence E C K' .

4 . Case II

I t is clear from the proof of the theorem tha t all we need is to calculate min (IMx )+l/lx J) , for each .M

XEE

corresponding to a subset S of the vectors AI , .. " A m. Let All ... , A le (say) be the vectors of tho subset S. Then by lemma 4, xeE implies that there exist AI, ..• , Ale with Ai ~ 0 such that

H ence, k

Ix l ~ as :8 Ail j= l

(5)

(6)

where as is the largest absolute value of the cO-Ol'di­nates of AI, ... , A le.

It follows from the homogeneity of I(Mx )+l/l xl that

3 T. S, Motzkin , Beitl'llge ?ur TheOt'ie del' Linearen Ungleichungen. Jerusalem, 1936, with references to proofs by Minkowski a nd Weyl.

264

Page 3: On approximate solutions of systems of linear inequalities · On Approximate Solutions of Systems of Linear Inequalities * Alan J . HoHman Let ~x ~ b be a. consistent system of linp,ar

we need only consider x~E such tha t if x is expressed k

as in (5), ~ }. j= 1. j= 1

Then

I(M x)+I=max (A i· x)+=max (A i ' X)= i i

where gii= A i · A i> }. j ;:':; 0, ~}.j= 1.

H ence, k

min I(Mx)+I= minmax ~ 9ijAj=VS, (7) A ; j= 1

where Vs is the value of zero sum two person game whose matrix is gij.

Therefore, from (6) and (7)

min I(Mx)+I;;::: vs . x,E Ix l - as

Can vs= O? Clearly, if, and only if, the origin is in convex body spanned by A I, . . . , A k • But this would imply that the set E is the ent ire space (except for the origin). And it follows from the proof of the main theorem th at this can occur only for a subset

that would never arise in lemma 2. Therefore, using the language of the theorem

where as c=max - ·

• 8>0 v s

(8)

A pecial case occurs when all A i'A j>O. Lct v=min A r A i> a = max la ij/ . Then,

i, j i, j

/x - xo/ ~~ /(A x - b)+/. (9) v

5. Case III

Reasoning, along the lines of case II, we need only

est imate min /IC±9iiAj) +//, and it is possible to derive A J= 1

from it an expression analogous t o (8) , which un­fortunately does not seem to have a neat statement in terms of games or any other familiar object. An interesting special case occurs, however, if the matrix 91j (for S all the rows of A), has the proper ty that

Then

min II Cttg ijAj) +1/ ;:,:; n~in iti ~gijAj k k k

= min ~Aj~9ij;:':; mjn ~}.jW=W. A j=1 ;=1 A j=l

Then we obtain, with a having the ame meaning as in (9)

(10)

'W ASHING TON, J une 5, 1952 .

265