
Lie algebras, their representation theory and GLn

Minor Thesis

Greta Panova

June 20, 2008

Contents

1 Introduction
2 Lie Algebras
  2.1 Definitions and main theorems
  2.2 Representations of Lie algebras
  2.3 Representations of sl(2, F)
3 Lie Groups and their Lie algebras
4 Representations of semisimple Lie algebras
  4.1 Root space decomposition
  4.2 Roots and weights; the Weyl group
5 The special linear Lie algebra sl(n, C) and the general linear group GLn(C)
  5.1 Structure of sl(n, C)
  5.2 Representations of sl(n, C)
  5.3 Weyl's construction, tensor products and some combinatorics
  5.4 Representations of GLn(Cn)
  5.5 Some explicit construction, generators of SλV
6 Conclusion
A Schur functors, Weyl's construction and representations of GLn(C)

1 Introduction

The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups and their representation theory and explicitly determine the structure and representations of sln(C) and GLn(C). My interest in the representations of GL(V) comes from their strong connection to combinatorics as developed in Chapter 7 and its appendix in [3]. Even though the irreducible holomorphic representations of GL(V) can be derived without passing through the general theory of Lie algebras, as Schur did in his dissertation, any attempt to generalize the result or to obtain analogous results for the unitary or symplectic groups will pass through the theory of Lie algebras.

Here we will develop the basic theory of Lie algebras and their representations, focusing on semisimple Lie algebras. We will develop most of the necessary theory to show facts like complete reducibility of representations of semisimple Lie algebras, develop the theory necessary to decompose the Lie algebras into root spaces, use these root spaces to decompose representations into weight spaces, and list the irreducible


representations via weights. We will establish connections between Lie groups and Lie algebras, which will, for example, enable us to derive the irreducible representations of GL(V) through the ones for gl(V).

In our development of the basic theory of Lie algebras we will follow mostly [2], while for Lie groups, roots and weights, and sl(n, C) we will follow [1]. We will encounter some combinatorial facts which will be taken for granted and whose proofs are found in [3].

2 Lie Algebras

In this section we will follow [2]. We will develop the basic theory of Lie algebras, and later we will establish how they arise from Lie groups, which essentially motivates their existence.

2.1 Definitions and main theorems

We will, of course, start with the definition of a Lie algebra.

Definition 1. A Lie algebra is a vector space L over a field F endowed with a bracket operation L × L → L, denoted (x, y) → [xy], which satisfies the following axioms:

(L1) [xy] is bilinear,

(L2) [xx] = 0 for all x ∈ L,

(L3) [xy] satisfies the Jacobi identity [x[yz]] + [y[zx]] + [z[xy]] = 0 for all x, y, z ∈ L.

A homomorphism (isomorphism) of Lie algebras will be a vector space homomorphism (resp. isomorphism) that respects the bracket operation, and a subalgebra will be a subspace closed under the bracket operation.

An important example of a Lie algebra is the general linear algebra gl(V), which coincides as a vector space with EndV (or Mn, the space of n × n matrices) and has the bracket operation defined as [XY] = XY − YX (it is a straightforward check to verify the axioms). Next we can define the special linear algebra sl(V) as the subspace of EndV of trace 0 endomorphisms with the same bracket operation; it is clearly a subalgebra of gl(V). Similarly we can define other subalgebras of gl(V) by imposing conditions on the matrices, e.g. upper triangular, strictly upper triangular, diagonal.
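For instance, the only axiom that needs any computation is the Jacobi identity for the commutator bracket; expanding all the products, the terms cancel in pairs:
\[
[X,[Y,Z]] + [Y,[Z,X]] + [Z,[X,Y]]
= (XYZ - XZY - YZX + ZYX) + (YZX - YXZ - ZXY + XZY) + (ZXY - ZYX - XYZ + YXZ) = 0 .
\]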

Another important example of a subalgebra of gl(U), where U is a vector space endowed with a bilinear operation (denoted as multiplication), is the algebra of derivations Der U. It is, first of all, the vector space of linear maps δ : U → U such that δ(XY) = Xδ(Y) + δ(X)Y. In order to verify that this is a subalgebra we need to check that [δδ′] ∈ Der U. For the sake of exercise we will do that now. We need to check that [δδ′] = δδ′ − δ′δ satisfies the derivation condition (it is clearly a linear map), so for X, Y ∈ U we apply the derivation rule over and over 1. Now if we let U = L with the bilinear operation defined by the bracket of L, we can talk about Der L. Some of these derivations arise as the endomorphisms adx : L → L given by adx(y) = [xy]. Each adx is a derivation, since by the Jacobi identity adx([yz]) = [x[yz]] = −[y[zx]] − [z[xy]] = [y[xz]] + [[xy]z] = [y adx(z)] + [adx(y) z]. We can now make a

Definition 2. The map L → Der L given by x → adx, where adx(y) = [xy] for all y ∈ L, is called the adjoint representation of L.

Continuing with the definitions we need to define ideals of a Lie algebra. The notion is the same as for ring ideals, just that multiplication is given by the bracket; in other words, I is an ideal of L if for any x ∈ L and y ∈ I we have [xy] ∈ I. Two important ideals are the center Z(L) = {z ∈ L | [xz] = 0 for all x ∈ L} and the derived algebra [LL], consisting of all linear combinations of brackets [xy]. Sums and products (given by the bracket) of ideals are clearly also ideals. A Lie algebra with no ideals other than 0 and itself and with [LL] ≠ 0 is called simple; in this case [LL] ≠ 0 necessarily implies [LL] = L. We define the normalizer of a subalgebra K of

1 [δδ′](XY) = δ(δ′(XY)) − δ′(δ(XY)) = δ(δ′(X)Y + Xδ′(Y)) − δ′(δ(X)Y + Xδ(Y)) = δ(δ′(X)Y) + δ(Xδ′(Y)) − δ′(δ(X)Y) − δ′(Xδ(Y)) = δ(δ′(X))Y + δ′(X)δ(Y) + δ(X)δ′(Y) + Xδ(δ′(Y)) − δ′(δ(X))Y − δ(X)δ′(Y) − δ′(X)δ(Y) − Xδ′(δ(Y)) = X(δδ′(Y) − δ′δ(Y)) + (δδ′(X) − δ′δ(X))Y = X[δδ′](Y) + [δδ′](X)Y.


L as NL(K) = {x ∈ L | [xK] ⊂ K} and the centralizer of a subset X of L as CL(X) = {x ∈ L | [xX] = 0}; both of them turn out to be subalgebras of L by the Jacobi identity 2.

We need to define what we mean by a representation of a Lie algebra, since this is the ultimate goal of this paper. A representation of L is a homomorphism φ : L → gl(V). The most important example is the already mentioned adjoint representation ad : L → gl(L). Since linearity is clear, all we need to check is that it preserves the bracket, which again follows from the Jacobi identity 3.

We proceed now to some very important concepts in the theory of Lie algebras. We call a Lie algebra L solvable if the sequence of ideals L(0) = L, L(1) = [LL], . . . , L(n) = [L(n−1)L(n−1)] eventually equals 0. Note for example that simple algebras are not solvable, as L(n) = L for all n. Here is the place to introduce the subalgebra t(n, F) of gl(n, F) of upper triangular n × n matrices and the subalgebra n(n, F) of strictly upper triangular matrices. We notice that [t(n, F)t(n, F)] ⊂ n(n, F), and that taking the commutator of two matrices A and B with entries ai,j = bi,j = 0 for j − i ≤ k (k ≥ 0) results in C = AB − BA with ci,j = 0 for j − i ≤ k + 1; continuing this way we eventually get the 0 matrix (e.g. for k > n), so these two algebras are solvable.
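For concreteness, in the smallest case the derived series can be written out in full: for 2 × 2 upper triangular matrices a single bracket already lands in the strictly upper triangular matrices,
\[
\left[\begin{pmatrix} a & b \\ 0 & c \end{pmatrix},
      \begin{pmatrix} d & e \\ 0 & f \end{pmatrix}\right]
= \begin{pmatrix} 0 & ae + bf - bd - ce \\ 0 & 0 \end{pmatrix},
\]
so t(2, F)(1) ⊂ n(2, F), which is one dimensional and hence abelian, and therefore t(2, F)(2) = 0.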

We note here some easy facts. If L is solvable, so are all its subalgebras and homomorphic images. Also, if I is a solvable ideal of L and L/I is solvable, then so is L. This can be shown by considering the quotient map π : L → L/I. Since (L/I)(n) = 0, so is 0 = (π(L))(n) = π(L(n)), so L(n) ⊂ Ker π = I. Since I(k) = 0 for some k, then L(n+k) = (L(n))(k) = 0. As a consequence of this fact we have that if I and J are solvable ideals of L, then since (I + J)/J ≅ I/(I ∩ J) is solvable as a quotient of I, by what we just proved I + J is also solvable.

This allows us to make one of the most important definitions, that of semisimplicity. If S is a maximal(not contained in any other such) solvable ideal of L, then for any other solvable ideal I we would have S+ Isolvable, so by the maximality of S, S+ I = S and hence I ⊂ S, showing that S must be unique. We denoteS by RadL, the radical of L and make the

Definition 3. A Lie algebra L is called semisimple if its radical is 0, RadL = 0.

We now define another very important notion to play a role later. As we know from linear algebra, every matrix X can be written uniquely as X = Xs + Xn, where Xn is nilpotent, Xs is diagonalizable and Xs and Xn commute. We would like to do something similar with Lie algebras, i.e. find their nilpotent and semisimple elements. If X ∈ gl(V) is a nilpotent matrix, X^n = 0, then for any Y ∈ gl(V) we have (adX)^k(Y) = ∑_i ai X^i Y X^{k−i} for some ai ∈ F, by induction on k, and so for k ≥ 2n either i or k − i is ≥ n, so X^i = 0 or X^{k−i} = 0 and ultimately (adX)^k(Y) = 0, so adX is nilpotent. If x is semisimple, then there is a basis v1, . . . , vn of V relative to which x = diag(a1, . . . , an). Then for the basis elements eij of gl(V) (matrices with only nonzero entry equal to 1 at row i, column j) we have adx(eij) = xeij − eijx = (ai − aj)eij, so adx is a diagonal matrix with the ai − aj on the diagonal. Since the Jordan decomposition is unique and since the properties of nilpotency and semisimplicity are preserved, adx = adxs + adxn is the Jordan decomposition of adx.

We will consider now what it means for a Lie algebra to be nilpotent and semisimple.

Definition 4. A Lie algebra L is called nilpotent if the series L0 = L,L1 = [LL], . . . , Li = [LLi−1], . . .eventually becomes 0.

For example, any nilpotent algebra is also solvable, since L(i) ⊂ Li. If Ln = 0, this means that for any x0, . . . , xn ∈ L we have [xn[xn−1[. . . [x1x0] . . . ]]] = 0, or in operator notation adxn ∘ adxn−1 ∘ . . . ∘ adx1(x0) = 0; in particular this is true if xn = · · · = x1 = x, so (adx)n = 0 for all x ∈ L. It turns out that the converse is also true; we have

Theorem 1 (Engel's theorem). If all elements of L are ad-nilpotent (i.e. (adx)k = 0 for some k), then L is nilpotent.

2Linearity is clear, we need only check that they are closed under the bracket operation. If x, y ∈ NL(K) then 0 =[[xy]k] + [[yk]x] + [[kx]y] and since [yk] ∈ K, [kx] = −[xk] ∈ K we have [x[yk]] = −[[yk]x] ∈ K and [y[kx]] ∈ K, so [[xy]k] ∈ K.The case for CL follows from the second two terms being 0.

3For x, y, z ∈ L, we have [adx ad y](z) = adx(ad y(z))− ad y(adx(z)) = [x[yz]]− [y[xz]] = [[xy]z] = ad[xy](z).


By this theorem, for example, n(n, F) is a nilpotent Lie algebra. We will prove Engel's theorem, because it will illustrate some important techniques.

The proof requires a lemma, important on its own, which we will prove first.

Lemma 1. If L is a subalgebra of gl(V) (dimV < ∞) consisting of nilpotent endomorphisms, then L.v = 0 for some nonzero v ∈ V.

Proof of lemma. In order to prove this lemma we will proceed by induction on dimL (this is the first common trick in Lie algebras). The goal is to find a codimension 1 subalgebra of L and use the induction hypothesis on it. The dimension 0 and 1 cases are trivial. Now let K be a maximal proper subalgebra of L; it acts on L via the adjoint operator and hence on L/K. Since the elements of K are nilpotent, so are their ad-s by the preceding paragraph. We can use the induction hypothesis on K ⊂ gl(L/K), as clearly dimK < dimL, so there is a vector x + K ∈ L/K such that x ∉ K and [yx] ∈ K for all y ∈ K, in particular x ∈ NL(K) \ K. Since K ⊂ NL(K), NL(K) is a subalgebra of L as we showed, and K is a maximal proper subalgebra, we must have NL(K) = L, being a strictly larger subalgebra. But that means that [xK] ⊂ K for all x ∈ L, i.e. K is an ideal. If K is an ideal, then the vector space spanned by K and z ∈ L is a subalgebra for any z ∈ L (since [zK] ⊂ K ⊂ span{K, z}). So in particular if dimL/K > 1, then the space spanned by K and z ∈ L \ K is a proper subalgebra violating the maximality of K. Therefore dimL/K = 1 and L = span{K, z}. Now we can apply the induction hypothesis to K, so there is a nonzero subspace W = {w ∈ V | K.w = 0}. Note that z preserves W: for w ∈ W and y ∈ K we have y.(z.w) = z.(y.w) + [y, z].w = 0, since [y, z] ∈ K. The element z is nilpotent, and so is its restriction to W (if zk = 0 then (z|W)k = 0), so it has an eigenvector in W for the eigenvalue 0, z.v = 0; then L.v = span{K.v, z.v} = 0 and we are done.

This lemma in particular implies that we can find a basis of V for which the matrices of L are strictly upper triangular.

Proof of Engel’s theorem. The condition that the elements of L are ad-nilpotent, suggests that we look atthe algebra adL first, it acts on the vector space gl(L) and its elements are nilpotent, so we can use thelemma and find an x ∈ L, such that [Lx] = 0. The existence of this x enables us to find a proper subalgebraof L of smaller dimension and so use again induction on dimL. In particular, since x ∈ Z(L) (centralizerof L) then Z(L) is a nonempty subalgebra of L as we showed earlier. It is also an ideal of L 4, so we canform the quotient algebra L/Z(L), which will then have a smaller dimension than L, and will still consist ofad-nilpotent elements. By induction it is nilpotent, so there is n, such that (L/Z(L))n = 0, i.e. Ln ⊂ Z(L),but then Ln+1 = [LLn] ⊂ [LZ(L)] = 0 and so L is nilpotent.

Engel’s theorem essentially implies existence of a common eigenvector for 0, we would like to extend thisresult for any eigenvalue. Here we should assume that the underlying field F is algebraically closed and ofcharacteristic 0.

Theorem 2 (Lie's theorem). If L is a solvable subalgebra of gl(V), V of finite dimension, then L stabilizes a flag in V, i.e. there is a basis for V with respect to which the matrices in L are upper triangular.

Proof. We can proceed by induction on dimV; we just need to show that there is a common eigenvector v for all elements in L and then apply the induction to V/⟨v⟩. In order to prove the existence of such a vector, we again apply the usual trick: induction on dimL. Again we need to find an ideal of codimension one. Since L is solvable we know that [LL] ≠ L (otherwise L(n) = L); it is clearly an ideal, and any subspace K′ of the quotient L/[LL] is an ideal of L/[LL], since [K′x] ⊂ [LL] goes to 0. So we take a codimension one subspace of L/[LL]; its inverse image under the quotient map is then a codimension one ideal in L, call it K, so L = K + Fz. As K(n) ⊂ L(n), K is solvable, so we apply the induction hypothesis and find a common eigenvector v for K, i.e. for any x ∈ K there is a λ(x) ∈ F such that x.v = λ(x)v. Here is the place to note that the map λ : K → F is actually linear 5; it will play important roles further in our discussion. Now instead of fixing v, we can fix λ and consider the possibly larger subspace W = {w ∈ V | x.w = λ(x)w for all x ∈ K}.

4 If y ∈ Z(L) and z ∈ L, then for any u ∈ L, [u[zy]] = −[z[yu]] − [y[uz]] = 0.
5 Since (ax + by).v = a(x.v) + b(y.v) = aλ(x)v + bλ(y)v.


Now the goal is to show that z(W) ⊂ W, so that we can find an eigenvector for z|W, which will then be an eigenvector for L.

We have to introduce now another important trick, namely the fundamental equality xz.w = zx.w + [x, z].w. Now if x ∈ K, then we would have x.(z.w) = λ(x)(z.w) + λ([x, z])w, and if λ([x, z]) = 0 then z.w ∈ W, which is what we need. Consider the spaces Wi = span{w, z.w, z2.w, . . . , zi.w}; we have zWi ⊂ Wi+1. Employing our fundamental trick again and using the fact that K is an ideal, we have for any x ∈ K that x.(ziw) = z.(x.zi−1w) + [x, z].(zi−1w), so by induction, if K.Wi−1 ⊂ Wi−1 then, since [x, z] ∈ K, we get x.(ziw) ∈ zWi−1 + Wi−1 ⊂ Wi, so K.Wi ⊂ Wi for all i. Moreover we get from the same induction that x.ziw − λ(x)ziw ∈ Wi−1: if x.zi−1w = λ(x)zi−1w + wi−2 for wi−2 ∈ Wi−2, then x.ziw = z.(x.zi−1w) + [x, z].zi−1w = z.(λ(x)zi−1w + wi−2) + wi−1 = λ(x)ziw + z.wi−2 + wi−1 and the last two terms are in Wi−1. Since the ziw form a basis for Wn (where Wn−1 ⊊ Wn = Wn+1), every x ∈ K acts as an upper triangular matrix on Wn in this basis, so TrWn(x) = (n + 1)λ(x). In particular, since [x, z] ∈ K, TrWn([x, z]) = (n + 1)λ([x, z]). But we know from linear algebra that Tr(AB) = Tr(BA), so in particular Tr(xz − zx) = Tr(xz) − Tr(zx) = 0, so in characteristic 0 this forces λ([x, z]) = 0 and we are done.

As a corollary of this fact, if we consider the adjoint representation of L in gl(L), we get that L fixes a flag of subspaces L0 ⊂ L1 ⊂ · · · ⊂ Ln = L, which are necessarily ideals. If we choose a basis adapted to this flag, then with respect to it adL consists of upper triangular matrices, hence the elements of [adL, adL] = ad[LL] are strictly upper triangular and hence nilpotent, i.e. adx is nilpotent for x ∈ [LL], and so by Engel's theorem [LL] is nilpotent.

As we know from linear algebra, every matrix X can be written uniquely as X = Xs + Xn, where Xn is nilpotent, Xs is diagonalizable, and Xs and Xn commute. By a theorem from linear algebra, this Jordan decomposition into the sum of commuting semisimple (diagonalizable) and nilpotent parts can be achieved concretely by giving these parts as polynomials in the original matrix: given X there exist polynomials p and q with 0 constant terms such that Xs = p(X) is the semisimple part of X and Xn = q(X) the nilpotent part 6. This fact enables us to give an easily checkable criterion for solvability.

Theorem 3 (Cartan’s cirterion). Let L be a subalgebra of gl(V ), V -finite dimensional. If Tr(xy) = 0 forall x ∈ [LL] and y ∈ L, then L is solvable.

Proof. If we show that [LL] is nilpotent, then L(i) ⊂ [LL](i−1) shows that L is solvable. By Engel's theorem it suffices to show that the elements of [LL] are nilpotent, so we need to figure out a trace criterion for nilpotency.

Suppose first that A ⊂ B ⊂ gl(V) are subspaces and M = {x ∈ gl(V) | [x, B] ⊂ A}. Suppose there is an x ∈ M such that Tr(xy) = 0 for all y ∈ M; we will show that x is nilpotent.

We can find a basis relative to which the semisimple part xs of x is diagonal, so fix this basis and write x = diag(a1, . . . , an) + xn. We want to show that all ai = 0. Consider the Q-vector space E spanned by a1, . . . , an; we will show that E = 0 by showing that its dual E∗ = {f : E → Q | f linear} is 0.

For any f ∈ E∗, let y = diag(f(a1), . . . , f(an)). If eij is the corresponding basis of gl(V) of matrices with only nonzero entry, equal to 1, at (i, j), then adxs(eij) = xseij − eijxs = (ai − aj)eij 7 and ad y(eij) = (f(ai) − f(aj))eij. By Lagrange interpolation we can find a polynomial r(T) ∈ F[T] with 0 constant term such that r(ai − aj) = f(ai) − f(aj) for all i, j. We then have r(adxs)eij = r(ai − aj)eij = (f(ai) − f(aj))eij = ad y.eij for all i, j, and since the eij are a basis for gl(V), we must have ad y = r(adxs). Since adxs is the semisimple part of adx, as we remarked earlier, it is a polynomial in adx without constant term, and so ad y = r(adxs) is also a polynomial in adx without constant term. Since by hypothesis adx(B) = [x, B] ⊂ A, the map ad y, being a polynomial in adx, also maps B into A, so y ∈ M. Hence Tr(xy) = 0; on the other hand Tr(xy) = ∑_i ai f(ai), so ∑_i ai f(ai) = 0.

6 One can do this explicitly: if ∏_i (T − ai)^{mi} is the characteristic polynomial of X, then by the Chinese remainder theorem there is a polynomial p(T) with p(T) ≡ ai (mod (T − ai)^{mi}) and p(T) ≡ 0 (mod T), and so on each generalized eigenspace Vi of X we have p(X) − ai·I = 0, so p(X) acts diagonally and clearly fixes the eigenspaces, being a polynomial in X; so it is the semisimple part, and Xn = X − p(X) is the nilpotent part.
7 So in particular adxs is semisimple, and since adxn is still nilpotent as we showed earlier, adxs is the semisimple part of adx.


We can apply f to this linear combination of the ai's and use the fact that f(ai) ∈ Q to get ∑_i f(ai)2 = 0. Since the f(ai) are rationals, they must all be 0, so f = 0 and E = 0, so every ai = 0, xs = 0 and x = xn is nilpotent.

Having shown that, we can apply it to A = [LL], B = L, so M = {x ∈ gl(V) | [x, L] ⊂ [LL]}. We then have L ⊂ M, but the hypothesis of the theorem is only that Tr(xy) = 0 for y ∈ L, not y ∈ M. We will show now that Tr(xy) = 0 for x ∈ [LL], y ∈ L implies Tr(xy) = 0 for x ∈ [LL] and y ∈ M, so using the above paragraph we will have x nilpotent. Let y ∈ M and let x = [a, b] ∈ [LL] for a, b ∈ L (by linearity of the trace it suffices to consider single brackets). Then Tr([a, b]y) = Tr(aby − bay) = Tr(aby) − Tr(bay) = Tr(bya) − Tr(bay) = Tr(b[y, a]), since Tr(AB) = Tr(BA) for any two matrices. But [y, a] ∈ [LL] by the definition of M and b ∈ L, so by the hypothesis of the theorem Tr(xy) = Tr(b[y, a]) = 0, and we can apply the preceding paragraph to get that x is nilpotent.

As we already mentioned, a semisimple algebra is one for which RadL = 0, but this is a difficult condition to check. We need other criteria, and for that we will introduce the Killing form.

Definition 5. Let L be a Lie algebra. For x, y ∈ L we define κ(x, y) = Tr(adx ad y). Then κ is a symmetric bilinear form on L, called the Killing form.
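For example, for L = sl(2, F) with the standard basis x, y, h used later in Section 2.3 (satisfying [x, y] = h, [h, x] = 2x, [h, y] = −2y), a direct computation with the adjoint matrices gives
\[
\kappa(h,h) = 8, \qquad \kappa(x,y) = \kappa(y,x) = 4,
\]
with all other values on basis elements equal to 0; the matrix of κ in this basis has determinant −128 ≠ 0, so κ is nondegenerate, in accordance with the criterion below.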

We can now state the criterion for semisimplicity as

Theorem 4. A Lie algebra L is semisimple if and only if its Killing form is nondegenerate.

Proof. The proof naturally involves Cartan's criterion, which we just showed. For any Lie algebra, adL is a subalgebra of gl(L), and if Tr(adx ad y) = 0 for all x, y then adL is solvable by Cartan's criterion; since adL ≅ L/Z(L) and Z(L) is solvable, L is then solvable. Now let S = {x ∈ L | κ(x, y) = 0 for all y ∈ L}. Applying the above to S, we get that S is solvable, so S ⊂ RadL = 0, i.e. κ is nondegenerate.

If now κ is nondegenerate, we have S = 0. To conclude that RadL = 0 it is enough to show that every abelian ideal I of L satisfies I ⊂ S (indeed, if RadL ≠ 0 then the last nonzero term of its derived series is a nonzero abelian ideal of L). If x ∈ I and y ∈ L, then for z ∈ L we have (adx ad y)(z) = [x[yz]] ∈ I, then ad y([x[yz]]) ∈ I, and so (adx ad y)2(z) ∈ [II] = 0. So adx ad y is nilpotent and hence has trace 0 for all y ∈ L, which makes x ∈ S. So every abelian ideal is contained in S = 0, hence RadL = 0 and L is semisimple.

We say that a Lie algebra is a direct sum of ideals if there are ideals I1, . . . , Ik such that L = I1 ⊕ · · · ⊕ Ik as a vector space. Semisimple algebras have the nice property of being such direct sums; in other words,

Theorem 5. Let L be semisimple. Then there exist simple ideals (as Lie algebras) L1, . . . , Lk of L, suchthat L = L1⊕ · · · ⊕Lk and this decomposition is unique in the sense that any simple ideal is one of the Lis.

Proof. The Killing form has the nice property of associativity, i.e. κ([xy], z) = κ(x, [yz]), and so if we take the orthogonal subspace to an ideal I, i.e. I⊥ = {x ∈ L | κ(x, y) = 0 for all y ∈ I}, then I⊥ is also an ideal (for z ∈ L, x ∈ I⊥, y ∈ I we have κ([xz], y) = κ(x, [zy]) = 0 since [zy] ∈ I). Moreover I ∩ I⊥ is 0: the Killing form vanishes identically on it, so it is solvable by Cartan's criterion (applied as in the previous theorem), and L is semisimple. We must then have L = I ⊕ I⊥.

Now let L1 be a minimal nonzero ideal of L; we have L = L1 ⊕ L1⊥. If I ⊂ L1 is an ideal of L1, then I is in fact an ideal of L, for the following reason. Let x ∈ I; any element of L can be expressed as y + z, where y ∈ L1 and z ∈ L1⊥, and then [x(y + z)] = [xy] + [xz]. We already have [xy] ∈ I; we will now use the fact that L semisimple implies that the Killing form is nondegenerate to show that [xz] = 0 for any x ∈ L1 and z ∈ L1⊥. Let u ∈ L; then by associativity κ([zx], u) = κ(z, [xu]) = 0, since [xu] ∈ L1. As this holds for all u ∈ L, we must have [zx] = 0 for κ to be nondegenerate. So [x(y + z)] = [xy] ∈ I and I is an ideal of L. By the minimality of L1 it follows that I = 0 or L1, so L1 has no nontrivial ideals and is simple (it is not abelian, as RadL = 0). Similarly any ideal of L1⊥ is an ideal of L, so L1⊥ cannot have any solvable ideals and is semisimple. We then reduce to the case of L1⊥, and by induction it is a direct sum of simple ideals.

In order to show uniqueness, suppose I is a simple ideal of L. Then [IL] is an ideal of I, and since L is semisimple it cannot be 0 (otherwise I would lie in the center of L, which is 0), so [IL] = I. But L = L1 ⊕ · · · ⊕ Lk, so I = [IL] = [IL1] ⊕ · · · ⊕ [ILk] (note that [ILi] ∩ [ILj] ⊂ Li ∩ Lj = 0), and therefore all but one summand must be 0, I = [ILi] ⊂ Li. So I is an ideal of Li, and since Li is simple we must have I = Li.


Since every simple Lie algebra L is isomorphic to the image of its adjoint representation (the kernel being the center, i.e. 0), it is isomorphic to a subalgebra of gl(L). If L is semisimple it is the direct sum of simple ideals L1 ⊕ · · · ⊕ Lk, so L is isomorphic to a subalgebra of gl(L1 ⊕ · · · ⊕ Lk), i.e. we have the

Corollary 1. Every semisimple Lie algebra is isomorphic to a subalgebra of gl(V ) for some V .

2.2 Representations of Lie algebras

Here we will develop some of the basic representation theory of semisimple Lie algebras. We start off with a

Definition 6. An F-vector space V endowed with an operation L × V → V, (x, v) → x.v, is called an L-module if

1. (ax+ by).v = a(x.v) + b(y.v),

2. x.(av + bw) = a(x.v) + b(x.w),

3. [xy]v = x.(y.v)− y.(x.v),

for all x, y ∈ L, a, b ∈ F and v, w ∈ V.

Homomorphisms are defined as usual; an L-module is irreducible if it has no nontrivial L-submodules, and it is completely reducible if it is a direct sum of irreducible L-submodules. Given an L-module V, its dual V∗ is also an L-module as follows: if f ∈ V∗, x ∈ L and v ∈ V, then (x.f)(v) = −f(x.v). The reason for the minus sign is to satisfy the third axiom in the definition, since ([xy].f)(v) = −f([xy].v) = −f(x.(y.v) − y.(x.v)) = −f(x.(y.v)) + f(y.(x.v)) = (x.f)(y.v) − (y.f)(x.v) = −(y.(x.f))(v) + (x.(y.f))(v) = ((xy − yx).f)(v). Another peculiarity comes with the action of L on a tensor product V ⊗ W, where we define x.(v ⊗ w) = x.v ⊗ w + v ⊗ x.w; one checks again that it respects the third axiom.
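The check for the tensor product, which is left to the reader above, is the analogous two-line computation:
\begin{align*}
x.(y.(v \otimes w)) - y.(x.(v \otimes w))
&= x.(y.v \otimes w + v \otimes y.w) - y.(x.v \otimes w + v \otimes x.w) \\
&= (xy - yx).v \otimes w + v \otimes (xy - yx).w = [xy].(v \otimes w),
\end{align*}
since the mixed terms y.v ⊗ x.w and x.v ⊗ y.w each appear once with either sign and cancel.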

No discussion of representation theory could be complete without mentioning

Lemma 2 (Schur's lemma). If φ : L → gl(V) is irreducible, then the only endomorphisms of V commuting with all φ(x) are the scalars.

We will now exhibit a commuting endomorphism and apply Schur's lemma. We can extend our discussion about the Killing form by defining a symmetric bilinear form on a semisimple L, for any faithful representation φ : L → gl(V), as β(x, y) = Tr(φ(x)φ(y)); it has the same properties as the Killing form, namely associativity and nondegeneracy. Now if (x1, . . . , xn) is a basis for L, let (y1, . . . , yn) be its dual basis with respect to β, i.e. β(xi, yj) = δij, and define the Casimir element of φ as cφ(β) = ∑_{i=1}^{n} φ(xi)φ(yi). Since φ(xi), φ(yi) ∈ End(V), so is cφ; our goal now is to show that it commutes with every φ(x).

We need to use the associativity of β. If z ∈ L, we should consider its bracket, rather than its product, with the basis, since the bracket is preserved under φ. We can write [zxi] = ∑_j aij xj and [zyi] = ∑_j bij yj for some coefficients aij, bij and use the orthogonality with respect to β: β([zxi], yj) = β(−[xiz], yj) = −β(xi, [zyj]) = −∑_l bjl β(xi, yl) = −bji, while β([zxi], yj) = ∑_l ail β(xl, yj) = aij, so aij = −bji. Now we can consider

[φ(z), cφ(β)] = ∑_j [φ(z), φ(xj)φ(yj)] = ∑_j ( φ(z)φ(xj)φ(yj) − φ(xj)φ(yj)φ(z) )
  = ∑_j ( φ(z)φ(xj)φ(yj) − φ(xj)φ(z)φ(yj) + φ(xj)φ(z)φ(yj) − φ(xj)φ(yj)φ(z) )
  = ∑_j ( [φ(z), φ(xj)]φ(yj) + φ(xj)[φ(z), φ(yj)] ) = ∑_j ( φ([zxj])φ(yj) + φ(xj)φ([zyj]) )
  = ∑_j ∑_k ajk φ(xk)φ(yj) − ∑_j ∑_l alj φ(xj)φ(yl) = 0.


So cφ(β) commutes with any φ(z), and if φ is irreducible then by Schur's lemma cφ(β) must be scalar multiplication; the scalar is (1/dimV) Tr(cφ) = (1/dimV) ∑_i Tr(φ(xi)φ(yi)) = (1/dimV) ∑_{i=1}^{dimL} β(xi, yi) = dimL/dimV, so in particular it does not depend on the bases we choose.

The Casimir element is important in the sense that it exhibits an endomorphism commuting with the action of L and with nonzero trace (equal to dimL); it will be used in the proof of our main theorem as a projection onto an irreducible module.
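As a concrete illustration, take L = sl(2, F) and φ the standard two-dimensional representation. With the basis (x, y, h) of Section 2.3 one finds β(x, y) = 1, β(h, h) = 2 and all other pairings of basis elements zero, so the dual basis is (y, x, h/2) and
\[
c_\varphi = xy + yx + \tfrac{1}{2}h^2
= \begin{pmatrix}1&0\\0&0\end{pmatrix} + \begin{pmatrix}0&0\\0&1\end{pmatrix} + \tfrac{1}{2}\begin{pmatrix}1&0\\0&1\end{pmatrix}
= \tfrac{3}{2}\, I = \frac{\dim L}{\dim V}\, I ,
\]
in agreement with the general formula.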

We now proceed to prove the main theorem of this section, namely

Theorem 6 (Weyl). Let L be semisimple and let φ : L → gl(V) be a finite dimensional representation of L. Then φ is completely reducible.

Proof. First of all, since L is semisimple we can write it as a direct sum of simple ideals L = I1 ⊕ · · · ⊕ Ik, and then [LL] = [I1L] ⊕ · · · ⊕ [IkL]. Since Ij is simple we have Ij ⊃ [IjL] ⊃ [IjIj] = Ij, so [LL] = I1 ⊕ · · · ⊕ Ik = L. As a consequence, L acts trivially on any one-dimensional module: there the operators φ(x) and φ(y) are scalars and hence commute, so φ([x, y]) = [φ(x), φ(y)] = 0, and since L = [LL] this means φ(L) = 0.

This nice fact suggests that we look first at the case where W is an irreducible L-submodule of V of codimension 1. Since, as we observed, L acts trivially on V/W, we have φ(L)(V) ⊂ W. Since cφ is an L-module endomorphism and is a sum of products of elements of φ(L), we get cφ(V) ⊂ φ(L)(V) ⊂ W. We have thus exhibited something playing the role of the averaging (projection) operators from the representation theory of finite groups. The goal is to show that V = W ⊕ Ker cφ. Consider Ker cφ ∩ W, an L-submodule of W; since W is irreducible it must be either 0 or W. If it is 0 we are done: cφ(V) ⊂ W ⊊ V forces Ker cφ ≠ 0, and then V = W ⊕ Ker cφ by counting dimensions. If it is W, then cφ^2(V) ⊂ cφ(W) = 0, so cφ is nilpotent and in particular Tr(cφ) = 0; but we already showed that this trace is dimL, and since charF = 0 we reach a contradiction.

We required that W be irreducible, but we can reduce to this case easily by induction on dimW; again dimV/W = 1. Since L.(V/W) = 0 we can write an exact sequence of L-module homomorphisms 0 → W → V → F → 0. If W′ is a proper nonzero submodule of W, we can quotient by it and get 0 → W/W′ → V/W′ → F → 0, and since dimW/W′ < dimW, by induction this sequence splits, i.e. V/W′ = W/W′ ⊕ W̃/W′ for some submodule W̃ ⊂ V containing W′ with dim W̃/W′ = 1. Looking now at W′ and W̃ we again have an exact sequence 0 → W′ → W̃ → F → 0, and again by induction W̃ = W′ ⊕ W′′ for a one-dimensional submodule W′′. Finally V/W′ = W/W′ ⊕ W̃/W′. Since W ∩ W̃ ⊂ W′ we have W ∩ W′′ = 0, and comparing dimensions, V = W ⊕ W′′.

Now consider the general case where W is a submodule of V. The space of linear maps Hom(V, W) can be viewed as an L-module, since Hom(V, W) = V∗ ⊗ W via (f ⊗ w)(v) = f(v)w for v ∈ V and w ∈ W. The action of L on Hom(V, W) can be described using the tensor product rule: if x ∈ L, then (x.(f ⊗ w))(v) = (x.f ⊗ w)(v) + (f ⊗ x.w)(v) = (x.f)(v)w + f(v) x.w = −f(x.v)w + x.(f(v)w), so if g ∈ Hom(V, W) then (x.g)(v) = −g(x.v) + x.g(v). Now let 𝒱 be the subspace of Hom(V, W) of maps whose restriction to W is scalar multiplication. This space is an L-submodule of Hom(V, W): if f ∈ Hom(V, W) and f|W = a·1|W, then for w ∈ W we have (x.f)(w) = −f(x.w) + x.f(w) = −a x.w + x.(aw) = 0, so in particular x.f ∈ 𝒱. In fact, if 𝒲 ⊂ 𝒱 is the subspace consisting of the maps with f|W = 0, then 𝒲 is again an L-submodule and L(𝒱) ⊂ 𝒲. Moreover dim 𝒱/𝒲 = 1, since a map in 𝒱 is determined modulo 𝒲 by the scalar a. So we end up with the familiar situation: 𝒱 has a codimension one submodule 𝒲, and by the special case we get 𝒱 = 𝒲 ⊕ ⟨f⟩, where f spans the complementary one-dimensional submodule; we may assume f|W = 1W. Since L acts trivially on the one-dimensional module ⟨f⟩, we have x.f = 0, i.e. 0 = (x.f)(v) = −f(x.v) + x.(f(v)), so f(x.v) = x.f(v) on V and f is an L-homomorphism. Since clearly W ∩ Ker f = 0 and f maps V onto W, we have V = W ⊕ Ker f as L-modules.

As a result, using this theorem we can state another important theorem which will later help us analyze the action of L on a vector space.

Theorem 7. Let L ⊂ gl(V ) be a semisimple Lie algebra, then L contains also the nilpotent and semisimpleparts of its elements.

In particular, in any representation φ, the diagonalizable (semisimple) elements of L act as diagonalizable operators. This will help us later present any module V as a direct sum of eigenspaces for these elements.


Proof. We recall the result from linear algebra that the semisimple and nilpotent parts xs and xn of x can be expressed as polynomials in x with 0 constant coefficient. This in particular shows that if x maps a subspace A into a subspace B with B ⊂ A, then xs and xn also map A into B. In order to use this fact we will create subspaces A and B such that conditions of the form xA ⊂ B single out exactly the elements of L.

For any L-submodule W of V, let LW = {y ∈ gl(V) | y(W) ⊂ W, Tr(y|W) = 0}. We have L ⊂ LW: since [LL] = L (a consequence of the direct sum decomposition of L), every element of L is a sum of brackets [x, y] with x, y ∈ L, and since W is L-invariant each such bracket has trace 0 on W. Moreover xs and xn are also in LW: since x(W) ⊂ W we have xs(W) ⊂ W and xn(W) ⊂ W, and Tr(xn|W) = 0 since xn is nilpotent, hence Tr(xs|W) = Tr(x|W) = 0 as well. So L, xs, xn ⊂ LW for all L-submodules W. Let N ⊂ gl(V) be the space of y ∈ gl(V) such that [yL] ⊂ L. We also have xs, xn ∈ N: since adxn and adxs are the nilpotent and semisimple parts of adx : gl(V) → gl(V), and adx maps L into L, we have adxn(L) ⊂ L and adxs(L) ⊂ L, i.e. [xnL] ⊂ L and [xsL] ⊂ L. So if we show that the intersection L′ = N ∩ (∩W LW) is contained in L, we will be done.

Observe that L′ is stable under the bracket action of L: for every x ∈ L and y ∈ LW we have [x, y](W) ⊂ x(y(W)) + y(x(W)) ⊂ W, since y(W), x(W) ⊂ W, and the trace of [x, y] on W is 0 since it is a commutator of operators preserving W; N is clearly stable as well, since [LN] ⊂ L ⊂ N. Now comes the big moment of applying Weyl's theorem. Since L′ is an L-module (under the bracket) containing L, we can write L′ = L ⊕ M, where M is an L-submodule of L′ with L ∩ M = 0. But since L′ ⊂ N, we have [L′L] ⊂ [NL] ⊂ L, so [LM] ⊂ [L′L] ⊂ L; on the other hand, M being an L-submodule, [LM] ⊂ M, so [LM] ⊂ L ∩ M = 0. In particular the elements of M are endomorphisms commuting with L. So if y ∈ M and W is an irreducible L-submodule of V, then since M ⊂ LW we have y(W) ⊂ W and Tr(y|W) = 0; by Schur's lemma y acts as a scalar on W, and since its trace is 0 it acts as 0. Since this is true for any irreducible W, and V is a direct sum of such W's by Weyl's theorem (again!), y acts as 0 on all of V, so y = 0. Therefore M = 0, L′ = L and xs, xn ∈ L.

2.3 Representations of sl(2, F )

The special linear Lie algebra sl(2, F) is not just an easy example to start with. As we have already noticed, induction on dimension has been a key method in proving facts about Lie algebras, so it will not be a surprise that the general case of sl(n, F) involves the special case of sl(2, F).

Definition 7. The Lie algebra sl(n, F ) is defined as the vector space of n × n matrices over F of trace 0with a bracket operator given by [X,Y ] = XY − Y X.

In particular we see that dim sl(n, F) = n^2 − 1, and that sl(n, F) is indeed closed under the bracket, since Tr(XY) = Tr(YX) for any n × n matrices X, Y implies Tr([X, Y]) = 0. We will go back to the general case later, but for now let n = 2. Clearly sl(2, F) is spanned as a vector space by

x = ( 0 1 ),   y = ( 0 0 ),   h = ( 1  0 ).
    ( 0 0 )         ( 1 0 )         ( 0 −1 )

We can determine sl(2, F ) completely by listing the bracket relations among these generators. We have that

[x, y] = h, [h, x] = 2x, [y, h] = 2y, (1)

so in particular x and y together already generate sl(2, F) as a Lie algebra, since [x, y] = h.

It would have been pointless to discuss semisimple Lie algebras if our main example weren't one, so we will show briefly that sl(2, F) is not only semisimple but in fact simple. For any element z = ax + by + ch ∈ sl(2, F) with a, b, c ∈ F we have [z, x] = a·0 + b[y, x] + c[h, x] = −bh + 2cx and then [h[z, x]] = −b·0 + 2c[h, x] = 4cx. Similarly [h[z, y]] = [h, ah − 2cy] = 4cy, [x[z, h]] = [x, −2ax + 2by] = 2bh and [y[z, h]] = 2ah. So we see that if charF = 0 and at least one of a, b, c is not 0, i.e. z ≠ 0, we can get one of h, x or y by applying brackets to z, and from that element further brackets with x, y, h produce all the others; so the ideal generated by a nonzero z is all of sl(2, F), and sl(2, F) is simple.
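The relations in (1) are immediate matrix computations; for instance
\[
[h,x] = hx - xh
= \begin{pmatrix}1&0\\0&-1\end{pmatrix}\begin{pmatrix}0&1\\0&0\end{pmatrix}
- \begin{pmatrix}0&1\\0&0\end{pmatrix}\begin{pmatrix}1&0\\0&-1\end{pmatrix}
= \begin{pmatrix}0&1\\0&0\end{pmatrix} - \begin{pmatrix}0&-1\\0&0\end{pmatrix} = 2x,
\]
and the other two relations are checked in the same way.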

Now that we know the structure of sl(2, F) so explicitly, we can consider its irreducible representations. Let V be any sl(2, F)-module, φ : sl(2, F) → gl(V). Since h is semisimple (diagonal), φ(h) is diagonal


with respect to some basis of V, i.e. for every eigenvalue λ of φ(h) there is a subspace Vλ = {v ∈ V | h.v = λv} and V = ⊕λ Vλ. The eigenvalues λ are called weights and the corresponding spaces Vλ weight spaces. These agree with a more general definition which we will give when we consider representations of semisimple Lie algebras in general.

Now we should see how L acts on these Vλ. If v ∈ Vλ, then we employ our fundamental equality to get h.(x.v) = [h, x].v + x.(h.v) = 2x.v + λ x.v = (λ + 2)(x.v), so x.v is an eigenvector of h with eigenvalue λ + 2 and xVλ ⊂ Vλ+2. Similarly h.(y.v) = (λ − 2) y.v, so y(Vλ) ⊂ Vλ−2. Since V is finite dimensional and V = ⊕λ Vλ as a vector space, there are only finitely many λ's; in particular there must be a λ with Vλ ≠ 0 but Vλ+2 = 0 (otherwise all of λ + 2Z would be weights), and then x(Vλ) = 0. We call any v ∈ Vλ with x.v = 0 a maximal vector of weight λ.

Now let V be irreducible. Choose a maximal vector v0 ∈ Vλ. Since x, y, h generate L both as a vector space and as an algebra, it is natural to expect that the vector space generated by successive applications of x, y, h to v0 will be stable under L. We have h.v0 = λv0, x.v0 = 0, and we let vi = (1/i!) yi.v0. By induction we will show

Lemma 3. 1. h.vi = (λ− 2i)vi,

2. y.vi = (i+ 1)vi+1,

3. x.vi = (λ− i+ 1)vi−1.

Proof. We have that y.v0 = v1 by definition, and in fact the second part of the lemma follows straight from the definition of vi: vi+1 = (1/(i + 1)!) yi+1.v0 = (1/(i + 1)) y.((1/i!) yi.v0) = (1/(i + 1)) y.vi. As for the rest, if (1) holds for vi, then vi ∈ Vλ−2i and so, as we remarked, y.vi ∈ Vλ−2i−2, so h.(y.vi) = (λ − 2(i + 1)) y.vi, i.e. (i + 1) h.vi+1 = (i + 1)(λ − 2i − 2) vi+1 by (2), and so (1) follows by induction. We use (2) and our fundamental equality again to show (3):

x.vi+1 = x.((1/(i + 1)) y.vi) = (1/(i + 1)) x.y.vi = (1/(i + 1)) ([x, y].vi + y.x.vi)
       = (1/(i + 1)) (h.vi + y.(λ − i + 1)vi−1) = (1/(i + 1)) ((λ − 2i)vi + (λ − i + 1) y.vi−1)
       = (1/(i + 1)) ((λ − 2i)vi + (λ − i + 1) i vi) = (1/(i + 1)) (λ(i + 1) − i − i^2) vi = (λ − i)vi,

which is what we needed to prove.

From these recurrences we can prove the following main theorem, which describes the irreducible representations of sl(2, F) completely.

Theorem 8. If V is an irreducible sl(2, F)-module, then

1. V is a direct sum of weight spaces Vµ for µ = m, m − 2, . . . , −(m − 2), −m (where dimV = m + 1), such that each Vµ is one dimensional and is spanned by an eigenvector of h of eigenvalue µ.

2. V has a unique maximal vector, whose weight is m, called the highest weight of V .

3. The action of L on V is determined by the recurrences in the lemma, so up to isomorphism there is a unique irreducible sl(2, F)-module of dimension m + 1.

Proof. From part (1) of Lemma 3 we have that the vi are linearly independent, being eigenvectors for different eigenvalues. Since dimV < ∞, only finitely many vi can be nonzero, so there is a minimal m such that vm+1 = 0 and vm ≠ 0. By recurrence (2) we then have vi = 0 for all i > m and vi ≠ 0 for i = 0, . . . , m. The subspace of V spanned by (v0, v1, . . . , vm) is an sl(2, F)-submodule of V, since it is fixed by x, y, h, which span sl(2, F) as a vector space. Since V is irreducible we must have V = span{v0, v1, . . . , vm}, and then dimV = m + 1.

Moreover, since vm+1 = 0, relation (3) of Lemma 3 shows that 0 = x.vm+1 = (λ − m)vm, and since vm ≠ 0 we must have λ = m. From relation (1) we then have h.vi = (m − 2i)vi, so the eigenvalues are precisely m, m − 2, . . . , m − 2m = −m, in particular symmetric around 0. The maximal vector is naturally


v0, and it is unique up to scalar since Vm is one dimensional. The uniqueness of V follows from the fact that the choice of m determines the module completely via the recurrence relations of Lemma 3, together with the fact that, again, x, y, h span sl(2, F) (so these relations determine the action of sl(2, F)).

By Weyl’s theorem we have that any sl(2, F )−module W is completely reducible and so every eigenvectorof h would be in some irreducible module and then by the theorem it will be in one of the Vµ-s, in particularits eigenvalue will be an integer. We also have from the theorem that in any irreducible V we have either 0or 1 as an eigenvalue (if m is even or odd respectively). If W = ⊕V i for some irreducible V is, then we haveWµ = ⊕V iµ, in particular W0 = ⊕V i0 and W1 = ⊕V i1 and so dimW0 =

∑i|V i0 6=0 1 and dimW1 =

∑i|V i1 6=0 1

and since each V i falls in exactly one of these categories, then dimW0 + dimW1 is the number of V is in W ,i.e. number of summands.
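As a small example of the theorem, take m = 2: the unique three-dimensional irreducible module is (isomorphic to) the adjoint representation of sl(2, F) itself, with weight vectors x, h, y of weights 2, 0, −2, since
\[
[h,x] = 2x, \qquad [h,h] = 0, \qquad [h,y] = -2y,
\]
and x is the maximal vector (here [x, x] = 0 plays the role of x.v0 = 0).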

Apart from existence, we have essentially described the irreducible representations of sl(2, F). We will soon treat the general case of sl(n, F).

3 Lie Groups and their Lie algebras

So far we have considered Lie algebras from the axiomatic point of view, but that motivated neither their existence nor our current interest in them. To motivate their existence we will introduce a much more natural structure.

Definition 8. A Lie group G is a C∞ manifold with differentiable maps multiplication × : G×G→ G andinverse ι : G→ G which turn G into a group.

Naturally a morphism between Lie groups would be a map that is both differentiable and a grouphomomorphism. A Lie subgroup would be a subgroup that is also a closed submanifold.

The main example of a Lie group for our purposes is the general linear group GLn(R) (or over C) of invertible n × n matrices. The map × : GLn × GLn → GLn is given by the usual matrix multiplication and is clearly differentiable; the inverse map can be given by Cramer's rule and so is also differentiable. Other Lie groups of interest happen to be subgroups of GLn, for example the special linear group SLn, the group Bn of invertible upper triangular matrices, and the special orthogonal group SOn of transformations of Rn preserving a symmetric bilinear form and having determinant 1.

A representation of a Lie group G is defined as the usual representations of groups, i.e. a morphismG→ GL(V ).

It turns out that the tangent spaces to Lie groups at the identity are Lie algebras. To see how this works we consider maps ρ : G → H and aim to show that they are completely described by their differentials at the identity, i.e. by dρe : TeG → TeH. This will reduce our studies to the study of Lie algebras.

So let ρ : G → H be a morphism of Lie groups. For any g ∈ G, its right action on G is respected by ρ; however it does not fix the identity, so we go further and consider conjugation by g. Define Ψg : G → G by Ψg(x) = g.x.g−1 for x ∈ G. Then Ψρ(g)(ρ(x)) = ρ(g)ρ(x)ρ(g)−1 = ρ(g.x.g−1) = (ρ ∘ Ψg)(x), i.e. the following diagram commutes:

    G  ----ρ---->  H
    |Ψg            |Ψρ(g)
    v              v
    G  ----ρ---->  H.

In particular we have that Ψ : G→ Aut(G) and Ψg(e) = e. Ψg is also a Lie group morphism, in particulardifferentiable, so we can consider its differential at e.

Definition 9. The representation Ad : G→ Aut(TeG) given by Ad(g) = (dΨg)e : TeG→ TeG is called theadjoint representation of G.

11

Page 12: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Since ρ ∘ Ψg = Ψρ(g) ∘ ρ, for any arc γ(t) in G starting at e with dγ/dt|t=0 = v we have ρ(Ψg(γ(t))) = Ψρ(g)(ρ(γ(t))); differentiating with respect to t gives dρ(dΨg(dγ(t)/dt)) = dΨρ(g)(dρ(dγ(t)/dt)), and evaluating at t = 0, dρ(dΨg(v)) = dΨρ(g)(dρ(v)). In other words the following diagram commutes:

    TeG  --(dρ)e-->  TeH
     |Ad(g)           |Ad(ρ(g))
     v                v
    TeG  --(dρ)e-->  TeH.

We want to get rid of the map ρ and leave only dρ in order to consider only maps between tangent spaces,so we take again a differential. We define

ad = dAd : TeG→ End(TeG),

since the tangent space of Aut(TeG) ⊂ End(TeG) is just the space of endomorphisms of TeG. Then we havethat ad(X) ∈ End(TeG), so for any Y ∈ TeG, we can define

[X,Y ] := ad(X)(Y ),

which is a bilinear map. Since ρ and (dρ)e respect Ad, we have that dρe(ad(X)(Y )) = ad(dρe(X))(dρe(Y )),i.e. the following diagram commutes

TeG

ad(X)

(dρ)e // TeH

ad(ρ(X))

TeG

(dρ)e

// TeH,

so the differential dρe respects the bracket and moreover we have arrived at a situation involving onlydifferentials and tangent spaces and in particular no actual map ρ.

The notation [X, Y] is not coincidental with the previous section: it is in fact the same bracket, and TeG is a Lie algebra. To make things explicit we will consider the case when G is a subgroup of GLn(C), in which case we know exactly what Ad(g) is. Let β(t) and γ(t) be two arcs in G with β(0) = γ(0) = e, dβ/dt|t=0 = X and dγ/dt|t=0 = Y. Then Ψg(γ(t)) = gγ(t)g−1, so Ad(g)(Y) = d(gγ(t)g−1)/dt|t=0 = gYg−1. Then ad(X) = (d Ad(β(t))/dt)|t=0, so

ad(X)(Y) = d/dt|t=0 Ad(β(t))(Y) = d/dt|t=0 (β(t) Y β(t)−1)                      (2)
         = (dβ/dt|t=0) Y β(0)−1 + β(0) Y (dβ−1/dt|t=0) = XYe + eY(−X)           (3)
         = XY − YX.                                                             (4)

This definition of the bracket explains why we defined the action of a Lie algebra on a tensor product the way we did. Let V and W be representations of a Lie group G; the usual definition of the representation V ⊗ W is given by g(v ⊗ w) = g(v) ⊗ g(w). If β(t) is an arc in G with β(0) = e and dβ/dt(0) = X, then the action of X ∈ TeG on any space V is given by X(v) = d/dt|t=0 β(t).v, so the action on V ⊗ W is naturally given by differentiating by parts, i.e.

X(v ⊗ w) = d/dt|t=0 (β(t)v ⊗ β(t)w) = (d/dt|t=0 β(t)v) ⊗ w + v ⊗ (d/dt|t=0 β(t)w) = X(v) ⊗ w + v ⊗ X(w).

Definition 10. We define the Lie algebra associated to a Lie group G to be the tangent space to G at theorigin, i.e. TeG.


To make our discussion complete and justify our study of Lie algebras (as opposed to Lie groups) we will prove the following two principles.

Theorem 9 (First Principle). Let G and H be Lie groups, G connected. A map ρ : G → H is uniquely determined by its differential dρe : TeG → TeH at the identity e.

Theorem 10 (Second Principle). Let G and H be Lie groups with G connected and simply connected. A linear map δ : TeG → TeH is the differential of a homomorphism ρ : G → H if and only if it preserves the bracket operation, in the sense that δ([X, Y]) = [δ(X), δ(Y)].

We are going to prove these theorems later, after we establish the relationship between a Lie group Gand its Lie algebra. For that we will consider one parameter subgroups.

Let g ∈ G and let mg : G → G be the map given by left multiplication by g. For X ∈ L = TeG we define a vector field vX on G by

vX(g) = (mg)∗(X). (5)

We can integrate this field, i.e. there is a unique differentiable map ("the integral") φ : I → G, where I is open, 0 ∈ I and φ(0) is any given point, such that φ′(t) = vX(φ(t)). If α(t) = φ(s).φ(t) for s, t ∈ I and β(t) = φ(s + t), then β′(t) = φ′(s + t) = vX(φ(s + t)) = vX(β(t)), and on the other hand α′(t) = d/dt (mφ(s)φ(t)) = (mφ(s))∗(φ′(t)) = (mφ(s))∗ vX(φ(t)) = (mφ(s))∗(mφ(t))∗(X) = (mφ(s).φ(t))∗X = vX(φ(s).φ(t)) = vX(α(t)). Since α(0) = β(0), the uniqueness of such maps implies α(t) = β(t), in other words φ(s + t) = φ(s).φ(t); thus, taking φ(0) = e, we have defined a homomorphism φX : R → G. This Lie group map φX is called the one-parameter subgroup of G with tangent vector X.

If we had a homomorphism φ : R → G such that φ′(0) = X, then this would uniquely determine φ (Exercise 8.31 in [1]). This follows because if we fix s and differentiate φ(s + t) = φ(s)φ(t) in t, the left-hand side gives φ′(s + t) and the right-hand side gives (mφ(s))∗(φ′(t)); taking t = 0 we get φ′(s) = (mφ(s))∗(φ′(0)) = (mφ(s))∗(X) = vX(φ(s)), which as we mentioned determines φ uniquely. If now ψ : G → H is a map of Lie groups, then ψ ∘ φX : R → H is a homomorphism. We have that ψ∗(X) is a tangent vector of H at the identity, so φψ∗X is the unique homomorphism with that tangent vector. However the homomorphism ψ ∘ φX has tangent vector at 0 equal to d/dt ψ(φX(t))|t=0 = ψ∗(φ′X(0)) = ψ∗(X), so the two must coincide, i.e.

φψ∗X = ψ ∘ φX. (6)

Let L be the Lie algebra of G, i.e. L = TeG.

Definition 11. The exponential map exp : L→ G is given by exp(X) = φX(1).

We have that φλX is the unique homomorphism such that φ′(0) = λX. Let ψ(t) = φX(λt). Then ψ is still a homomorphism and ψ′(0) = λφ′X(0) = λX by the chain rule, so necessarily φλX(t) = φX(λt); thus exp restricted to the lines through the origin gives exactly the one-parameter subgroups of G.

We can apply (6) to the exponential map exp : L → G and get that exp is the unique map from L to G taking 0 to the identity e of G, with differential at the origin the identity, and whose restrictions to lines through the origin are the one-parameter subgroups of G. Again thanks to (6) we have, for any map ψ : G → H, that φψ∗X = ψ ∘ φX, i.e. exp(ψ∗(X)) = ψ(exp(X)) by evaluating at t = 1.

Since the differential of exp at the origin is an isomorphism, in particular invertible, the inverse mapping theorem tells us that exp is invertible on a neighborhood of 0 in L, so its image contains a neighborhood U of e. Now let G be connected and let H be the subgroup of G generated by the elements of a neighborhood U of e (Exercise 8.1 in [1]); then H is an open subset of G. Suppose H ≠ G. All of its cosets aH, being translates of H, are also open and pairwise disjoint, so (∪aH≠H aH) ∩ H = ∅ and H ∪ (∪aH≠H aH) = G, hence G is the union of two disjoint nonempty open sets. These sets are also closed in G, being complements of open sets, so G is the disjoint union of two closed sets and hence not connected, a contradiction. So G is generated as a group by the elements of U, and in particular G is generated by exp(L). This proves the first principle: for any map ψ : G → H of Lie groups we have the commutative diagram


    TeG  --ψ∗-->  TeH
     |exp          |exp
     v             v
     G   --ψ--->   H,

and since G is connected, it is generated by exp(TeG) (and likewise its image in H, the corresponding connected subgroup, is generated by the exponentials of ψ∗(TeG)); hence the map ψ is determined uniquely by the differential ψ∗ = (dψ)e.

The reason the exponential map is called that becomes apparent when we consider the case of GLn(R). For any X ∈ End(V) we can set

exp(X) = 1 + X + X^2/2 + X^3/6 + . . . ,

which converges because if a is the maximal absolute value of an entry of X, then by induction the entries of X^k are at most (n|a|)^k in absolute value (n = dimV). This map has an inverse, exp(−X). Since the differential of this map at X = 0 is the identity and the map φX(t) = exp(tX) satisfies φ′X(t) = d/dt exp(tX) = exp(tX)X = (mφX(t))∗X = vX(φX(t)), we see that exp restricted to any line through the origin gives the one-parameter subgroups, so it is the same exp we defined above.
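For instance, for a nilpotent X the series terminates and the one-parameter subgroup can be written down explicitly:
\[
X = \begin{pmatrix}0&1\\0&0\end{pmatrix}, \qquad
\exp(tX) = I + tX = \begin{pmatrix}1&t\\0&1\end{pmatrix},
\]
since X^2 = 0; the homomorphism property exp((s + t)X) = exp(sX) exp(tX) is immediate, and this is a one-parameter subgroup of SL2(R) with tangent vector X at the identity.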

We want to show now something stronger than that the exponentials generate G, namely that the group structure of G is encoded in its Lie algebra. We want to express exp(X).exp(Y) as exp(Z) with Z ∈ L. The difficulty arises from the fact that exp(X) exp(Y) contains products X.Y, which need not belong to L. However the bracket [X, Y] is in L, so we aim to express exp(X).exp(Y) via sums of bracket operations. We have that log(g) = (g − I) − (g − I)^2/2 + (g − I)^3/3 − . . . is the formal power series inverse of exp. We then define

X ∗ Y = log(exp(X).exp(Y)) ∈ L, i.e. exp(X ∗ Y) = exp(X).exp(Y),

and we have X ∗ Y = log(I + (X + Y) + (X^2/2 + X.Y + Y^2/2) + . . . ).

Theorem 11 (Campbell–Hausdorff formula). The quantity log(exp(X) exp(Y)) = X + Y + (1/2)[X, Y] + . . . can be expressed as a convergent sum of X, Y and iterated bracket operations. In particular it depends only on the Lie algebra.
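The first few terms, which already exhibit the pattern of iterated brackets, are
\[
X * Y = X + Y + \tfrac{1}{2}[X,Y] + \tfrac{1}{12}\bigl([X,[X,Y]] + [Y,[Y,X]]\bigr) + \cdots,
\]
all higher-order terms likewise being rational multiples of iterated brackets of X and Y.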

Using this theorem, the assumption that every Lie group may be realized as a subgroup of the general linear group (which rests on the fact that every Lie algebra L is a subalgebra of some gln(R); we showed this for semisimple Lie algebras), and the fact that G is generated by exp(L), we can show that exp(X).exp(Y) ∈ exp(L) for X, Y ∈ L sufficiently close to 0. So if we have a Lie subalgebra Lh of L, then its image under the exponential map is locally closed, and hence the group H generated by exp(Lh) is an immersed subgroup of G with tangent space TeH = Lh.

Now let H and G be Lie groups with G simply connected, and let Lh and Lg be their Lie algebras. Consider the product G × H. Its Lie algebra is Lg ⊕ Lh. If α : Lg → Lh is a map of Lie algebras, then we can consider LJ ⊂ Lg ⊕ Lh, the graph of α (LJ = {(X, α(X)) | X ∈ Lg}). Then LJ is a Lie subalgebra of Lg ⊕ Lh, since it is clearly a vector space and [(X, α(X)), (Y, α(Y))] = ([X, Y], [α(X), α(Y)]) = ([X, Y], α([X, Y])). Then by what we just noticed the group J generated by exp(LJ) is an immersed Lie subgroup of G × H with tangent space LJ. Now we have the projection map π : J → G, whose differential dπ : LJ → Lg agrees with the restriction to LJ of the projection Lg ⊕ Lh → Lg, i.e. it is the map (X, α(X)) → X; so dπ is an isomorphism, and again by the inverse function theorem π must be an isogeny; since G is simply connected the image of π must be G, so π is an isomorphism. The projection on the second factor composed with the inverse of this isomorphism, η = π2 ∘ π−1 : G → H, is a map of Lie groups whose differential is dη = dπ2 ∘ dπ−1 : X → (X, α(X)) → α(X), i.e. dη = α. This proves the second principle: a linear map of Lie algebras is the differential of a map of the corresponding Lie groups if and only if it is a map of Lie algebras.

14

Page 15: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

4 Representations of semisimple Lie algebras

Returning back to our study of representations, lets take a look at what we did with sl(2, F )). We foundelements x, y, h, such that x and y where eigenvectors with respect to ad(h), h was semisimple and sl(2, F ) =h⊕x⊕ y. Thanks to these elements we could decompose an irreducible representation into eigenspaces withrespect to h with x and y sending one to another. We would like to do something similar for any semisimpleLie algebra L. We shouldn’t expect to find elements corresponding to h,x and y, but rather subalgebras,and if we want to decompose L into a direct sum of subalgebras the most natural approach is to look at theadjoint action of certain elements of L on L.

The crucial fact about h is that it is semisimple, acts diagonally. Suppose we can find a subalgebra H ⊂ Lof semisimple elements, which is abelian and acts diagonally on L. Then by the action of H on L we candecompose L into eigenspaces, i.e. for α a linear functional on H let Lα = x ∈ L|[h, x] = α(h)x, h ∈ H,then L = H ⊕ (⊕α∈H∗Lα), where we will have H = L0. Since L is finite dimensional we will have afinite collection of αs, the weights of the adjoint representation, we call them the roots of L, and thecorresponding spaces Lα are called the root spaces of L. The set of roots will be denoted R. We will showthat the adjoint action of Lα carries Lβ into Lα+β and so the roots will form a lattice, ΛR ⊂ H∗. It will beof rank the dimension of H. Each Lα will be one dimensional and R will be symmetric about 0.

Now if V is any irreducible representation of L, then similarly to the above decomposition we candecompose it into a direct sum V = ⊕Vα, where Vα = v ∈ V |h(v) = α(h)v, h ∈ H are the eigenspaceswith respect to H. These αs in H∗ are called the weights of V and Vα - the weight spaces, dimVα -the multiplicity of the weight α. We will have that for any root β, Lβ send Vα into Vα+β . Then thesubspaces ⊕β∈ΛRVα+β will be an invariant subspace of V and since V is irreducible the weights should allbe translates of one another by ΛR.

4.1 Root space decomposition

We will consider the case of any representation later, for the moment we will focus on the root spacedecomposition. Our presentation here will follow [2].

We call a subalgebra toral if it consists of semisimple elements. L has such subalgebras, as otherwiseall its elements would be nilpotent and so L will be nilpotent by Engel’s theorem, hence solvable andhence not semisimple. Any toral subalgebra is abelian by the following reasoning. Let T be toral, x ∈ T ,so adx is semisimple and so over an algebraically closed F it is diagonalizable. So if adx has only 0eigenvalues, then adx = 0. Suppose it has an eigenvalue a 6= 0, i.e. there is a y ∈ T , such that [x, y] = ay.Since y is also semisimple, so is ad y and it has linearly independent eigenvectors y1 = y, . . . , yn (sincead(y)(y) = 0) of eigenvalues 0, b2, . . . , bn, we can write x in this basis as x = a1y1 + . . . anyn. Then−ay = ad y(x) = 0.y + b2a2y2 + . . . , i.e. y is a linear combination of the other eigenvectors, which isimpossible. So a = 0 and adx = 0 for all x ∈ T , i.e. [x, y] = 0 for all x, y ∈ T .

Let H be a maximal toral subalgebra of L, i.e. not included in any other. For any h1, h2 ∈ H, wehave adh1 adh2(x) = [h1, [h2, x]] = −[h2, [x, h1]] − [x, [h1, h2]] = [h2, [h1, x]] = adh2 adh1(x) by theJacobi identity, so adLH consists of commuting semisimple endomorphisms and by a standard theorem inlinear algebra these are simultaneously diagonalizable 8. So we can find eigenspaces Lα = x ∈ L|[hx] =α(h)x for all h ∈ H for α ∈ H∗, such that they form a basis of L, i.e. we can write the

Root space decomposition: L = CL(H)⊕∐α∈Φ

Lα, (7)

where Φ is the set of α ∈ H∗ for which Lα 6= 0. Here CL(H) is just L0. Our goal is to prove that CL(H) = H.We will show first that

8Let A and B be two diagonalizable commuting matrices, then for any eigenvalue α of B and v such that Bv = αv we haveα(Av) = ABv = B(Av), i.e. A fixes eigenspaces Bα of B. Since then A|Bα is still semisimple we can diagonalize and obtainan eigenbasis which is necessarily an eigenbasis for B also.

15

Page 16: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Lemma 4. For all α, β ∈ H∗ we have [LαLβ ] ⊂ Lα+β . For any x ∈ Lα of α 6= 0, we have that adx isnilpotent and if β 6= −α then the spaces Lα, Lβ are orthogonal with respect to the Killing form κ.

Proof. Let x ∈ Lα, y ∈ Lβ and h ∈ H, then by the Jacobi identity we have adh([x, y]) = [h, [x, y]] =[x, [h, y]] + [[h, x], y] = [x, β(h)y] + [α(h)x, y] = (α+ β)(h)[x, y], so [x, y] ∈ Lα+β . Next if adx has a nonzeroeigenvalue, i.e. there is a y, such that adx(y) = ay we necessarily must have y ∈ CL(H) since otherwisey ∈ Lβ ∩ Lα+β = 0. But for any h ∈ H, [y, h] = 0, [h, x] = α(h)x, so 0 = [y, [h, x]] + [h, [x, y]] + [x, [y, h]] =−a.α(h)y + 0 + 0, so α = 0.

Next, let h ∈ H, such that (α+β)(h) 6= 0. Since the Killing form is associative, we have for x ∈ Lα, y ∈ Lβthat κ([x, y], h) = κ(x, [h, y]) = β(h)κ(x, y) from one hand and from another κ([x, y], h) = −κ([y, x], h) =−κ(y, [x, h]) = −α(h)κ(x, y), so since β(h) 6= −α(h) we have κ(x, y) = 0.

As a corollary to this lemma we have that the restriction of κ to CL(H) is nondegenerate. Since κ isnondegenerate on L and L0 = CL(H) is orthogonal to Lα for all α 6= 0, then there is no z ∈ L0, such thatκ(z, L0) = 0 as otherwise κ(z, L) = 0. This will help us prove the next theorem.

Theorem 12. Let H be a maximal toral subalgebra of L. Then H = CL(H).

Proof. Our goal is to show that H contains all semisimple elements of CL(H) and that CL(H) cannot haveany nilpotent ones.

(1) First of all since CL(H) maps H via ad to 0, then the nilpotent and semisimple parts of each adx,x ∈ CL(H) being polynomials in adx map H into 0 also. But (adx)s = adxs and (adx)n = adxn, soxs, xn ∈ CL(H) too. Now suppose x ∈ CL(H) is semisimple. Then the subalgebra generated by H andx is still toral, as x commutes with everything in H and sum of semisimple elements is still semisimplewhich follows for example from the simultaneous diagonalizability. Since H is maximal with respect to thisproperty we must have x ∈ H.

(2)If x ∈ CL(H) is nilpotent then adx is nilpotent, [x,H] = 0 and if y ∈ H, then adxad y = ad y adx,so (adx ad y)k = (ad y)k (adx)k for any k and finally adx ad y is nilpotent, i.e. κ(x, y) = 0. Since κ isnondegenerate on CL(H) we must then have κ(h,H) 6= 0 for h semisimple, i.e. in H. So κ is nondegenerateon H also.

(3) We also have that CL(H) is nilpotent by Engel’s theorem as follows. For any x ∈ CL(H), x = xs+xnand adxn is clearly nilpotent and adxs = 0 on CL(H) since by point (1) xs ∈ H. Since again by xs ∈ Hand xn ∈ CL(H) [xs, xn] = 0, we have that adx is the sum of two commuting nilpotents and so is nilpotent.

(4)If x ∈ CL(H) is nilpotent then adx is nilpotent, [x,H] = 0 and if y ∈ H, then adxad y = ad y adx,so (adx ad y)k = (ad y)k (adx)k for any k and finally adx ad y is nilpotent, i.e. κ(x, y) = 0. Since κ isnondegenerate on CL(H) we must then have κ(h,H) 6= 0 for h semisimple, i.e. in H. So κ is nondegenerateon H also.

(5) Point (4) tells us that H ∩ [CL(H)CL(H)] = 0 as follows. For h ∈ H, x, y ∈ CL(H) we haveκ(h, [x, y]) = κ([h, x], y) = 0. If [x, y] ∈ H this contradicts nondegeneracy, so [x, y] 6∈ H and hence H ∩[CL(H)CL(H)] = 0.

(6) Suppose [CL(H)CL(H)] 6= 0, by lemma 1 we have that CL(H) acts on [CL(H), CL(H)] via the adjointrepresentation, so there is a common eigenvector of eigenvalue 0, i.e. [CL(H), CL(H)] ∩ Z(CL(H)) 6= 0. Letz be such element, by (2) and then (5) z cannot be semisimple, so zn 6= 0 and zn ∈ CL(H) by (1) and since[zn + zs, c] = 0 for all c ∈ CL(H) and [zs, c] = 0 since zs ∈ H we must have [zn, c] = 0 for all c ∈ CL(H).Then ad zn ad c is nilpotent by commutativity and so κ(zn, CL(H)) = 0 contradicting nondegeneracy. So[CL(H)CL(H)] = 0 and is abelian.

(7) Finally, since CL(H) is abelian for any nilpotent element xn of it we will have κ(xn, CL(H)) = 0 asxn, c commute for all c ∈ CL(H). This contradicts the nondegeneracy of κ over CL(H), so CL(H) has nonilpotent elements. But then it must have only semisimple ones by (1) and then by (2) they are all in H, soH = CL(H).

Together with the previous arguments this shows

16

Page 17: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Theorem 13 (Cartan decomponsition). Let L be semisimple. If H is a maximal toral subalgebra of L thenthere are eigenspaces Lα = x ∈ L|hx = α(h)x for all h ∈ H for α ∈ H∗, such that

L = H ⊕ (⊕α∈H∗Lα).

From the above proof we have in particular that κ is nondegenerate on H, then we can find an orthonormalbasis with respect to κ, h1, . . . , hl. Then every φ ∈ H∗ is uniquely determined by its values on hi, sayφ(hi) = φi. If tφ = φ1h1 + . . . φlhl, then κ(tφ, hi) = φi, so by linearity κ(tφ, h) = φ(h) for every h ∈ H.

We will prove some useful facts about the Cartan decomposition.

Theorem 14. 1. The set Φ of roots of L relative to H spans H∗.

2. If α ∈ Φ, then −α ∈ Φ.

3. Let α ∈ Φ and let tα be the element of H as defined above, i.e. α(h) = κ(tα, h). If x ∈ Lα, y ∈ L−α,then [x, y] = κ(x, y)tα.

4. If α ∈ Φ then [LαL−α] is one dimensional with basis tα.

5. α(tα) 6= 0 for α ∈ Φ.

6. If α ∈ Φ and xα 6= 0 is in Lα, then there is a yα ∈ L−α, such that the elements xα, yα and hα = [xαyα]span a three dimensional subalgebra of L isomorphic to sl(2, F ) via xα → x, yα → y and hα → h.

7. [xαyα] = hα = 2tακ(tα,tα) and hα = −h−α.

Proof. (1) If Φ does not span H∗ then there is an h ∈ H such that α(h) = 0 for all α ∈ Φ, i.e. for any x ∈ Lα,adh(x) = 0, i.e. [h, Lα] = 0. But since [hH] = 0 by H being abelian we have [hL] = 0, i.e. h ∈ Z(L). ButZ(L) = 0 by L - semisimple, so we reach a contradiction.

(2) We have from lemma 4 that Lα and Lβ are orthogonal unless β = −α. If −α 6∈ Φ, then by the samelemma κ(Lα, L) = 0 contradicting the nondegeneracy.

(3) Consider for any h ∈ H κ([x, y] − κ(x, y)tα, h) = κ([x, y], h) − κ(x, y)κ(tα, h) = κ(x, [y, h]) −α(h)κ(x, y) = κ(x,−(−α(h)y)) − α(h)κ(x, y) = 0 by the associativity of κ. Since κ is nondegenerate onH we must have [x, y]− κ(x, y)tα = 0 (note that [x, y] ∈ H by lemma 4).

(4) Is a direct corollary of (3), provided that [LαL−α] 6= 0. If it where, then κ(Lα, L) = 0 by lemma 4,so since Lα 6= 0 this contradicts the nondegeneracy of κ on L.

(5) Suppose α(tα) = 0. Then if x ∈ Lα and y ∈ L−α, such that [x, y] 6= 0 (by (4) it is possible), wewill have that [tαx] = α(tα)x = 0 and similarly [tαy] = 0, so the subspace S spanned by x, y and tα is asubalgebra of L. It acts on L via the adjoint representation and we see that S ' adL S ⊂ gl(L). Since ad sis nilpotent for all s ∈ S (easy check that S is nilpotent), then we check that tα ∈ [SS] is nilpotent and sinceS is solvable we’ve shown that adL tα is nilpotent. But since tα is semisimple, ad tα is also, so adL tα = 0,i.e. tα ∈ Z(L) = 0, contradiction.

(6) Take xα ∈ Lα and let yα ∈ L−α be such that κ(xα, yα) = 2κ(tα,tα) = 2

α(tα) , possible because of (4) andthe nondegeneracy of κ. We set hα = 2tα

κ(tα,tα) , then [xα, yα] = hα by (3) and [hαxα] = α( 2tαα(tα) )xα = 2xα by

linearity of α and similarly since [tαyα] = −α(tα)yα we have [hαyα] = −2, the same relations as in equation(1) for sl(2, F ), so the three-dimensional algebra spanned by xα, yα, hα is isomorphic to sl(2, F ).

(7) By definition of tα we have that κ(tα, h) = α(h), so also κ(t−α, h) = −α(h), so by linearity of κκ(tα + t−α, h) = 0 and by its nondegeneracy on H we must have tα = −t−α which shows (7).

In view of what we proved in the (6)th part of this proposition, let Sα ' sl(2, F ) be a subalgebra of L,generated by xα, yα and hα. For a fixed α ∈ Φ, let M = spanH1, Lcα| for all cα ∈ Φ, where H1 ⊂ H isgenerated by hα and Kerα ⊂ H. Lemma 4 shows that ad(Sα)(M) ⊂M , i.e. it is a Sα submodule of L, i.e.it is a representation of Sα.

17

Page 18: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

By our analysis about sl(2, F ) we have that all weights should be integers, so the ones occurring herebeing among 0 and 2c (as α(hα) = 2), where cs are among the ones for which Lcα 6= 0. So in particularwe must have c an integral multiple of 1/2. Now M is not actually irreducible. In fact, Sα is an Sα−submodule of M (it is a subspace already) and Kerα is also a Sα submodule, since for every h ∈ Kerα wehave adxα(h) = −α(h)x = 0, same for yα and adhα(h) = [hα, h] since H is abelian. So M = Kerα⊕Sα⊕M1

as an Sα module. Note that a weight 0 of hα could not occur in M1, as the weights there are the cα(hα)for c 6= 0. So we have a total of 2 occurrences of 0, exhausted by Kerα and Sα and so the even weightsare only in Kerα and Sα and these are respectively 0 and -2,0,+2. In particular then for c = 2 we cannothave 2α as a root (as it will give weight 4, not accounted for). So if twice a root is not a root, then1/2α cannot be a root either (as else 2.1/2α won’t be). So 1 is not a weight for hα and so there areonly even weights, namely the 0 and -2,0,+2, so M = Kerα ⊕ Sα as an Sα submodule. But Lα ⊂ M , soLα = Lα∩ (Kerα+Sα) = Lα∩Sα = spanxα as a vector space. In particular dimLα = 1. Thus we provedthe following theorem.

Theorem 15. For any α ∈ Φ we have that dimLα = 1 and the only roots multiples of α are ±α.

We will show now some more facts revealing the structure of the root space.

Theorem 16. Let α, β ∈ Φ, then

1. β(hα) ∈ Z and β − β(hα)α ∈ Φ, the numbers β(hα) are called the Cartan integers.

2. if also α+ β ∈ Φ, then [LαLβ ] = Lα+β .

3. if β 6= ±α, then if r and q are the largest integers for which β−rα, β+qα are roots, then all β+pα ∈ Φfor p = −r,−r + 1, . . . , q − 1, q and β(hα) = r − q.

4. L is generated as a Lie algebra by the root spaces Lα.

Proof. Consider the action of Sα on Lβ for β 6= ±α. Let K = ⊕i∈ZLβ+iα. We have that Sα acts on K asad(xα)(Lβ+iα) ⊂ Lβ+(i+1)α, similarly for yα and hα fixes them. We also have that since iα is not a rootunless i = 0,±1 and so β 6= −iα being a root, i.e. β + iα 6= 0. Anyway, K is an Sα submodule of L,so its weights must be integral. On the other hand, the weights are given by the action of hα on the onedimensional (from the previous theorem) spaces Lβ+iα, if these are spanned by zβ+iαs respectively, thenadhα(zβ+iα) = (β+ iα)(hα)(zβ+iα, so the weights are β(hα) + iα(hα) = β(hα) + 2i, in particular β(hα) ∈ Z.We also see that these integers have at most one of 0 or 1 among them (once), so K must be irreducible bytheorem 8. By the same theorem then if β(hα) + 2q and β(hα)− 2r are the highest, resp lowest weights, allβ(hα) + 2p in between must occur. These are weights if and only if Lβ+pα 6= 0, i.e. we have β + pα is a rootfor p = −r, . . . , q, i.e. they form a string called the α−string through β. Again by the same theorem theweights should be symmetric about 0, so β(hα)− 2r = −(β(hα) + 2q), i.e. β(hα) = r− q. In particular thensince −r ≤ −r + q ≤ q, the weight β + (q − r)α = β − β(hα)α also appears.

Last, if α + β ∈ Φ, then since ad(xα)(Lβ) ⊂ Lα+β and since we showed that since α + β is a root thenLα+β it is a summand of K - irreducible, so adxα(Lβ) 6= 0 as there is no other way to reach Lα+β throughLβ by the action of Sα. Since Lα+β is one dimensional then we must have [LαLβ ] = Lα+β .

Finally, let α1, . . . , αl ∈ Φ be a basis for H∗. For any β ∈ Φ, we can write it as β =∑i ciαi and we

want to see what the cis look like. We can obtain an inner product on H∗ from the Killing for, namely ifδ, γ ∈ H∗, define (γ, δ) = κ(tγ , tδ). We have (γ, γ) = κ(tγ , tγ) = γ(tγ) 6= 0 for γ ∈ Φ as we already provedand since Φ spans H∗, then we have (u, u) 6= 0 for all 0 6= u ∈ H∗, i.e. this inner product is nondegenerate.This inner product, apart from arising naturally from the Killing form, is useful because by theorem 16, part(1) we have γ(hδ) ∈ Z, so by definition of hδ this amount to γ( 2tδ

δ(tδ)) = 2γ(tδ)

δ(tδ)= 2κ(tδ,tγ)

κ(tδ,tδ)= 2 (δ,γ)

(δ,δ) ∈ Z. Sonow using this inner product we can write (β, αj) =

∑i ci(αi, αj) and we see that if we consider this as a

system with variables ci, the coefficients are all in Q, solving this system using simple linear algebra we seethat the solutions are rational functions in the coefficients, so ci ∈ Q.

18

Page 19: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Moreover, we will show that (α, β) ∈ Q. From linear algebra we have that Tr(AB) =∑C Tr(AC)Tr(CB)

(C span the space of matrices, e.g. Eij), so in particular we have κ(tγ , tδ) = Tr(ad tγ ad tδ) =∑α κ(tγ , tα)κ(tδ, tα),

in particular then (β, β) =∑α(β, α)2. We know already (β,α)

(β,β) ∈ Q, so dividing both sides by (β, β)2 we get

that 1(β,β) =

∑α( (α,β)

(β,β) )2 ∈ Q, so (β, β) ∈ Q and then also (α, β) ∈ Q. The last equality also shows that(β, β) > 0, i.e. our inner product is positive definite. Thus we have shown the following theorem.

Theorem 17. If α1, . . . , αl span Φ, then every β ∈ Φ can be written as∑i ciαi with ci ∈ Q and the inner

product on H∗ defined as the extension of

(γ, δ) = κ(tγ , tδ)

is positive definite and is rational restricted to Φ, i.e. (β, α) ∈ Q for all α, β ∈ Φ.

4.2 Roots and weights; the Weyl group

Let ρ : L → gl(V ) be a representation of L. We know that ρ(H) would still be an abelian subalgebra ofsemisimple matrices, i.e. we can write a decomposition V = ⊕Vα where α ∈ H∗ and Vα = v ∈ V |h.v =α(h)v for all h ∈ H . We will drop the ρ as it will be clear what we mean by h.v = ρ(h).v. These eigenvaluesα are called weights, so for example the weights of the adjoint representation are the roots.

If now β is a root of L we can consider the action of Lβ over Vα. Let y ∈ Lβ and v ∈ Vα. Then by ourfundamental computations we have h.(y.v) = y.(h.v) + [h, y]v = y.(α(h)v) + β(h)y.v = (α + β)(h)(y.v) forany h ∈ H, so Lβ : Vα → Vα+β . In particular, if V is irreducible then the weights must be congruent to oneanother module the root lattice ΛR = ΛΦ as we showed in the beginning of the Root space decompositionsection9.

We showed that for any roots β and α that β(hα) = 0 from theorem 16. Now suppose β is a weight ofV , we claim that the same is true. It follows in the same way we proved it for the roots. Again consider thesubalgebra Sα ' sl(2, F ) and its action of Vβ , let K = ⊕iVβ+iα and assume β 6= mα for any m ∈ Z, thennone of the β + iα would be 0. Then K is a Sα submodule of V and again by what we know about therepresentations of sl(2, F ), we get an α−string through β and the weights must be integral. The weights aregiven by the action of hα on Vβ+iα, i.e. iα(hα) + β(hα), α has still the same meaning, i.e. it is a root, soα(hα) = 2 (by definition of hα) and so β(hα) ∈ Z. So we have just shown a lemma.

Lemma 5. All weights of all representations assume integer values on the hαs.

Now let ΛW be the set of linear functionals on H, which are integer valued on the hαs. Then ΛW is alattice and all weights will lie in it, in particular the roots also.

We already noticed that the set of roots is invariant under addition, so we are going to explore furtherwhat group action keeps the weights invariant.

Definition 12. The Weyl group W is the group generated by the involutions Wα given by

Wα(β) = β − β(hα)α = β − 2(β, α)(α, α)

α,

where the inner product is the extension of the one given by (α, β) = κ(tα, tβ) when α, β are roots.

The Wα is in fact a reflection and the plane it fixes is given by β(hα) = 0, i.e. it is Ωα = β ∈ H∗|β(hα) =0. With respect to the inner product then we will have for every β ∈ Ωα that (β, α) = κ(tα, tβ) proportionalto κ(tβ , hα) = β(hα) = 0, so Ωα ⊥ α. We also have Wα = α − 2α = −α, so the Wαs are indeed reflections.We can say then that the Weyl group is generated by the reflections in the planes perpendicular to the roots.

As a consequence of theorem 16 we have that Wα(Φ) ⊂ Φ for all α ∈ Φ, so

Lemma 6. The set of weights of a representation is invariant under the Weyl group.9We will denote the same object - set of roots by both R and Φ as the notations in [2] and [1] differ.

19

Page 20: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Again from theorem 16, we have that β−rα, . . . , β+qα appear, if we let γ = β−rα, then the weights willbe γ, . . . , γ+ (q+ r)α and under hα the corresponding weights in sl(2, F ) will be γ(hα), . . . , γ(hα) + 2(p+ q),which should be symmetric about 0, so we have γ(hα) = p + q, i.e. the length of the α−string is equal tothe value of the first weight evaluated at hα.

We also have that the multiplicities of the weights are invariant under the Weyl group, again restrictingour analysis to any string of weights, i.e. to the action of Sα and applying theorem 8.

In our study of sl(2, F ) we started off with choosing the ”farthest” nonzero Vλ, but this was possiblesince the integers have natural ordering. We would like to apply similar analysis and for this purpose weneed to introduce an ordering on the roots.

Now let l : RΦ → R be any linear functional and let R+ = α ∈ Φ|l(α) > 0 be the set of positive rootswith respect to l and R− = β ∈ Φ|l(β) < 0 - the negative ones. Note that we can choose l so that no rootlies on its kernel. So we have Φ = R+ ∪ R−. Note that the positivity/negativity depends on the choice offunctional.

Definition 13. Let V be a representation of L. A nonzero vector v ∈ V , which is both an eigenvector ofthe action of H and is in the kernel of all Lα for α ∈ R+ (for some choice of a functional l : H∗ → R), iscalled a highest weight vector of V . If V is also irreducible then we call the weight α the highest weight ofthe representation.

We will prove now one of the main facts.

Theorem 18. Let L be a semisimple Lie algebra. Then

1. Every finite dimensional representation possesses a highest weight vector.

2. The subspace W of V generated by the images of a highest weight vector v under successive applicationsof the root spaces Lβ for β ∈ R− is an irreducible subrepresentation.

3. Given the choice of l, i.e. fixed positive roots, the highest weight vector of an irreducible representationis unique up to scalar.

Proof. Let β be the weight of V which maximizes l. Let then v ∈ Vβ . If γ ∈ R+ is a positive root, thenl(γ + β) = l(β) + l(γ) > l(β), so by our choice γ + β is not a weight, i.e. Vγ+β = 0. But LγVβ ⊂ Vγ+β = 0,so Lγ kills v for all positive roots γ, proving the first part.

We will show part (2) by induction on the number of applications of negative root spaces. Namely, let Wn

be the subspace spanned by all wn.v, such that wn is a word of length at most n of elements of negative rootspaces. Let x ∈ Lα for α ∈ R+ and consider any wn.v. We have that wn = y.wn−1, where wn−1 ∈Wn−1 andy ∈ Lβ , for β ∈ R−. Then we have x.(wn.v) = x.y.(wn−1.v) = y.x.(wn−1.v) + [x, y](wn−1.v). By inductionx.wn−1.v ∈ Wn−1, so y.x.wn−1.v ∈ Wn. Also we know that if β 6= −α, then [x, y] = 0, so we are done inthis case. In the case β = −α, we have [x, y] = utα from theorem 14, where u is some scalar. But then[x, y] ∈ H, i.e. it fixes the weight spaces, so [x, y](wn−1v) ∈ Wn−1. Finally we have that x(wn.v) ∈ Wn.This shows that the space W =

∑nWn, generated by successive applications of only negative roots to v, is

fixed by positive roots. As it is also fixed by negative roots, we have that it is a L−submodule of V . It isclearly irreducible since it is a space generated by L.v.

From here it follows in particular that if V is irreducible then dimVα = 1. If otherwise there were twovectors v, w ∈ Vα not scalar multiples of each other, then we cannot possibly obtain w from v by applyingonly negative roots, as Lβ1(Lβ2 . . . v) ⊂ Vα+β1+... 6= Vα fore l(α + β1 + . . . ) < l(α). So w is not in theirreducible subrepresentation generated by v as in part (2) and so we must have dimVα = 1. Uniquenessthen follows from the uniqueness of α.

As a corollary to this theorem, or more precisely - to the proof of part (2), we get exercise 14.15 from[?Fh] by considering the adjoint representation. If α1, . . . , αk are roots of L, then by induction we can showthat the subalgebra L′ generated by H,Lα1 , . . . , Lαk is in fact H ⊕ (⊕Lα) = L′′ for all αs which are inNα1, . . . , αk ∩ Φ. The containment L′ ⊂ L′′ follows similarly to the proof of theorem 18 part (2) by

20

Page 21: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

induction on the number of summands αi, or it is simply obvious and any Lαi sends L′′ into itself. Thecontainment L′′ ⊂ L′ follows from theorem 16 since [LαLβ ] = Lα+β if α+ β is a root.

In view of this result we can make the following definition.

Definition 14. We call a root α simple if it cannot be expressed as a sum of two positive or two negativeroots.

We see that every negative root is a sum of simple negative roots and in view of the preceding paragraphthe subalgebra generated by H and Lβ for β running over the negative simple roots is in fact the direct sumof H ⊕ (⊕β∈R−Lβ). So we can rephrase the last theorem by replacing negative roots with primitive negativeroots and get that any irreducible representation is generated as a vector space by the images of a highestweight vector under successive applications of negative simple roots.

We are now going to prove a theorem that ties together the Weyl group and weights of irreduciblerepresentations.

Theorem 19. Let V be an irreducible representation and α a highest weight. The vertices of the convexhull of the weights of V are conjugate to α under the Weyl group. The set of weights of V are exactly theweights that are congruent to α module the root lattice ΛR and that lie in the convex hull of the images ofα under the Weyl group W.

Proof. In view of our observations so far we have that since V is generated by successive applications ofnegative roots to the highest weight vector v, then all the weights appearing in V are in fact of the formα + β1 + β2 + . . . for βi ∈ R−, so all weight lie in the positive cone α + C−, where C− is the positive conespanned by R−. Note that by our definitions, C− is entirely in contained in one half-plane (the one where lis negative).

Since α is a weight, we showed that if α + β is also a weight, then so must be the whole string α, α +β, . . . , α−α(hβ)β. Since α is maximal, it is necessarily the case that β ∈ R−. It also follows that α−α(hα)β =Wβ(α) is a weight. Any vertex of the convex hull of the weights of V will lie on a line passing through αand α+ β for some β ∈ R−. We then must necessarily have that this vertex is Wβ(α) as we showed earlier(it is the other end of the string, geometrically - line, containing weights). As we also know the weights ofthe form γ + nδ for γ -weight and δ ∈ R must form a string, then for any two weights of V the intersectionof the line segment connecting them and α + ΛR (i.e. all possible weights) must be the segment itself. Sothe set of weights is convex, i.e. it is the intersection of ΛR and its convex hull.

Note that if α − α(hβ)β is weight then its ordering must be ”lower” than α, i.e. we must have l(α) >l(α− α(hβ)β) = l(α)− α(hα)l(β) and since l(β) < 0 we must have α(hβ) < 0 or alternatively α(−h−β) < 0i.e. α(h−β) > 0 where now by the symmetry of roots −β runs over R+.

Definition 15. A Weyl chamber is the locus W of points in H∗, such that γ(hβ) ≥ 0 for all β ∈ R+.Alternatively, it is a connected region (polytope) with faces the planes Ωα.

The Weyl group acts transitively on the Weyl chambers. The importance of the Weyl chamber is revealedin the following theorem.

Theorem 20. Let W be the Weyl chamber associated to some ordering of the roots. Let ΛW be the weightlattice. Then for any α ∈ W ∩ ΛW , there exists a unique irreducible finite-dimensional representation Γα ofL with highest weight α.

This theorem gives a bijection between the points inW∩ΛW and the irreducible representation. In viewof the previous theorem we have that the weights of Γα will be the weighs in the polytope whose vertices arethe images of α under the Weyl group W. Note moreover that these are all the irreducible representations -for every irreducible representation its highest weight will lie in the Weyl chamber.

Proof. We will show uniqueness first. If V and W are two irreducible finite-dimensional representations of Lwith highest weight vectors v and w of weight α, then consider the vector (v, w) ∈ V ⊕W . Its weight is again

21

Page 22: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

α and in general the weight space decomposition is still ⊕(Vβ ⊕Wβ), so the weights are the same and α isagain highest (note that the weights are the same because they are solely determined by α from the previoustheorem). Let U ⊂ V ⊕W be the irreducible subrepresentation generated by (v, w) and consider the nonzeroprojection maps π1 : U → V and π2 : U →W , nonzero since they map (v, w) to v and w respectively. Then=πi is a nonzero submodule of the irreducible V (or W ) and Kerπi a submodule of U , so we must have bothπi surjective and injective, i.e. isomorphism, showing in particular that π2 π−1

1 : V →W is an isomorphismand so the uniqueness is proved.

Existence is much harder and involves concepts we have not introduced. We will try to describe it herethough. It involves the concept of a universal enveloping algebra, that is an associative algebra U over F ,such that there is a linear map i : L→ U, such that i([x, y]) = i(x)i(y)−i(y)i(x), and which has the universalproperty of every other such algebra arising from a homomorphism with U. Such algebra can be explicitlyconstructed as U(L) = T(L)/J , where T(L) =

∐T iL is the tensor algebra and J is the ideal generated by

x⊗ y − y ⊗ x− [xy] for x, y ∈ L. Now we can construct a so called standard cyclic module Z(α) of weightα as follows. If we choose nonzero xβ ∈ Lβ for β ∈ R+ and let I(α) be the left ideal of U(L) generated byall these xβ and all hγ − α(hγ).1 for all γ ∈ Φ = R. The choice of such ideal makes sense as its elementsannihilate a highest weight vector of weight α, so we can choose Z(α) = U(L)/I(α), where the coset of 1corresponds to the line through our highest weight vector v, i.e. Z(α) ' Uv, where v is our highest weightvector. Next by some facts about standard cyclic modules Z(α) would have a unique maximal submodule Yand the V (α) = Z(α)/Y will be our irreducible representation of weight α. To summarize, the point was toconstruct a module of highest weight α explicitly and take its irreducible submodule containing the highestweight vector we started with.

Just like with the definition of simple roots, we can define the fundamental weights ω1, . . . , ωn -the weights such that any weight in the Weyl chamber can be expressed uniquely as nonnegative integralcombination of these fundamental weights. These are in fact the first weights along the edges of the Weylchamber, i.e. ωi(hαj ) = δij , where αis are the simple roots. Then every weight α in the Weyl chamber (alsocalled dominant weight) is an integral nonegative sum α =

∑aiωi and then we can write the corresponding

irreducible representation as Γa1,...,an = Γa1ω1+···+anωn .

5 The special linear Lie algebra sl(n, C) and the general lineargroup GLn(C)

It is time now that we apply all the developed theory to our special case of interest - the linear group andcorresponding Lie algebra. We will first reveal the structure of L = (n,C), i.e. find its Cartan subalgebraH, roots and Lα and then we will proceed to finding the weights, the Weyl chamber and constructing theirreducible representations. We will also see at the end how all this helps in finding the representations ofGLn(C).

5.1 Structure of sl(n, C)

We will first find (a) maximal toral subalgebra of L = sl(n,C). The obvious choice would be the to considerthe set of all diagonal matrices. It is clearly a toral subalgebra, the only question is whether it is maximal.Suppose that H is not maximal and H ⊂6= T , for some other toral subalgebra. Then since T still must beabelian by what we showed in the beginning of the section on root space decomposition, so we have thatfor every t ∈ T and h ∈ H, [th] = 0, i.e. T ⊂ CL(H). The question then is to show that for our particularchoice the matrices that commute with any diagonal matrix of trace 0 are only the diagonal matrices, i.e.H ⊂ T ⊂ CL(H) = H so T = H. Well, let eij be the matrix with only nonzero entry at (i, j) and this entryis equal to 1. If h = diag(a1, . . . , an), then [heij ] = aieij − ajeij = (ai − aj)eij . Let x =

∑xijeij , i.e. x is a

matrix with entries xij . Then [hx] =∑i,j(ai − aj)xijeij and x ∈ CL(H) iff (ai − aj)xij = 0 for all i, j and

all a1, . . . , an such that a1 + · · ·+an = 0. So fixing x for every i 6= j we can take ai = 1, aj = −1 and ak = 0otherwise and force xij = 0, showing that x must be diagonal.

22

Page 23: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Now that we have our maximal toral subalgebra H we need to find the root spaces Lα and for that we firstneed to find the roots. We just saw that if h ∈ H, i.e. h = diag(a1, . . . , an), then [heij ] = (ai−aj)eij , so thematrices eij are eigenvectors for H. If we then let αij : H → C be given by αij(diag(a1, . . . , an)) = ai − aj ,we see that Lαij = x ∈ L|[hx] = αij(h)x for all h ∈ H for i 6= j is nonempty as it contains Ceij . In factcounting dimensions, the nilpotent part of L, i.e. the complementary to H in the Cartan decomposition, isspanned as a vector space by the n(n−1) vectors eij for i 6= j and on the other hand it contains the n(n−1)disjoint nonempty spaces Lαij , so we must have

L = H ⊕ (⊕i 6=jLαij )

and clearly αij are the roots of L = sl(n,C). Either from theorems 14 and 16 or simply from dimension countagain, since dimH = n−1 (codimension 1 in Cn) and so dim(⊕ijLαij ) ≤ dimL−dimH = n2−1− (n−1) =n(n− 1) , we must have dimLαij = 1, so Lαij = Ceij .

For the sake of simplicity, define the functionals on the space of diagonal matrices li(diag(b1, . . . , bn)) = bi,then we see αij = li − lj and so we express the roots as pairwise differences of n functionals. On the spaceH since the sum of diagonal entries is 0, these n linear functionals are not independent, but we havel1 + · · ·+ ln = 0. This is the only relation they need to satisfy, so we can picture them in n− 1-dimensionalspace.

In order to draw a realistic picture we need to determine the inner product, i.e. the Killing form onsl(n,C). We will determine it from the definition as κ(x, y) = Tr(adx ad y).

Since κ is linear we can determine the values κ(eii, ejj) and extend them over H by linearity. We have thatκ(eii, ejj) = Tr(ad eii ad ejj) and for any basis element of gln epr we have ad eii ad ejj(epr) = [eii, [ejj , epr]] =[eii, (δpj − δrj)epr] = (δpj − δrj)[eii, epr] = (δpj − δrj)(δpi− δri)epr. So epr are eigenvectors and by dimensioncount are all possible eigenvectors, so

Tr(ad eii ad ejj) =∑p

∑r

(δpj − δrj)(δpi − δri) = (8)−2 for i 6= j ,∑p

∑r(δpi − δri)2 =

∑p 6=i 1 +

∑r(1− δri)2 = 2(n− 1) for i = j

(9)

Now let h = diag(a1, . . . , an) and g = diag(b1, . . . , bn) with h, g ∈ H, i.e.∑ai = 0 and

∑bi = 0.

Then κ(h, g) =∑i,j aibjκ(eii, ejj) =

∑i 6=j(−2)aibj +

∑i=j 2(n − 1)aibj =

∑i,j −2aibj +

∑i 2naibi =

−2(∑i ai)(

∑j bj) + 2n

∑aibi = 2n

∑i aibi.

Having figured out κ we need to find the inner product on H∗ given by (α, β) = κ(tα, tβ) and forthat we need to find tα. For h = diag(a1, . . . , an) ∈ H we have li(h) = ai, and if tli = diag(t1, . . . , tn),we must have by definition of tα that ai = li(h) = κ(tli , h) = 2n

∑k tkak and that for all ais with sum

0, so the conditions are just enough to determine tli uniquely as 12neii. Then we can say that the inner

product on H∗ is determined by (li, lj) = κ(tli , tlj ) = 12nδij . In particular the roots given by αij = li − lj

will have inner products (αij , αkl) = 12n (δik − δil − δjk + δjl). And in general the inner product on H∗ =

Cl1, . . . , ln/(l1 + · · ·+ ln) = a1l1 + · · ·+ anln|a1 + · · ·+ an = 0 is given by

(∑

aili,∑

bili) =1

2n(∑

aibi).

We see that our inner product makes the li orthogonal, if we want to picture the root lattice then wecan consider the space Cn with unit vectors given by the lis (e.g. the usual coordinates ei = (0, . . . , 1, 0, . . . )with 1 at position i), then H∗ will be the plane given by

∑xi = 0 and the roots αij = li− lj are the vertices

of a certain kind of polytope (if n = 3 this is a planar hexagon for example), called very creatively the rootpolytope. The root lattice then will simply be

ΛR = ∑

aili|ai ∈ Z,∑i

ai = 0/(∑

li = 0),

where the quotient is just to emphasize we are in H∗, i.e. the plane defined by∑li = 0.

23

Page 24: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Now we will look at the weight lattice ΛW . It is the set of linear functionals β ∈ H∗ which assume integervalues on hαij . So we need to find the hαij s. But we already showed that hα = 2tα

κ(tα,tα) = 2tαα(tα) and we

found tαij = eii − ejj , so αij(tαij ) = 1 − (−1) = 2 and so hαij = tαij = eii − ejj . Then if β =∑bili ∈ H∗

is a weight, we must have that β(hαij ) = β(eii − ejj) = bi − bj ∈ Z, so all coefficients are congruent oneanother modulo Z, so since we have

∑li = 0 we can assume they are in Z, so

ΛW = Zl1, . . . , ln/(∑

li = 0).

We can now determine the Weyl group also, it is given by reflections Wαij (β) = β−β(hαij )α =∑bklk−

(bi − bj)(li − lj) =∑bklk + (bj − bi)li + (bi − bj)lj = b1l1 + . . . bj li + · · ·+ bilj + . . . , i.e. it exchanges li with

lj , fixing everything else. So we have that W ' Sn -the symmetric group on the n elements li.We can now pick a direction, divide the roots into positive and negative and find a Weyl chamber. For

β ∈ H∗, i.e. β =∑aili a linear functional will be determined by its values on li (plus the condition that

these values sum to 0), so if φ : H∗ → R is our linear functional with φ(li) = ci, we have φ(β) =∑bici and∑

ci = 0. On the roots we have φ(αij) = ci − cj , which suggest a natural ordering given that c1 > c2 >· · · > cn, then φ(αij) = ci − cj > 0 iff i < j. So R+ = li − lj |i < j and R− = li − lj |i > j. We can easilydetermine the simple roots also, as for any i < j we have li− lj = (li− li+1)+(li+1− li+2)+ · · ·+(lj−1− lj−1)is a sum of positive roots if i+ 1 < j. So the simple (primitive) positive roots must be among li − li+1 andit is easy to see that these cannot be epxressed as sums of other positive roots.

The Weyl chamber associated to this ordering will be given by the β ∈ H∗, such that β(hγ) ≥ 0 forevery γ ∈ R+, i.e. if β =

∑bili, then we will have β(hαij ) = bi − bj ≥ 0 for all i < j, i.e. then the Weyl

chamber is W = ∑bili|b1 ≥ b2 ≥ . . . bn. So we can see how choosing a different order we will get a

disjoint and geometrically equal Weyl chamber, as we expect by the action of W. If we let ai = bi − bi+1

with an = bn, we see that β ∈ W is equivalent to ai ≥ 0 for all i. Also bi =∑j≥i aj and so we have

that β =∑i(∑j≥i aj li) =

∑j aj(

∑i≤j li), so in particular the Weyl chamber is the cone between the rays

given by l1, l1 + l2, . . . , l1 + · · ·+ ln−1. The faces of W are always given by the hyperplanes Ωα, in this caseΩαij =

∑bili|bi = bj, perpendicular to the roots αij , but we see that given our ordering b1 ≥ · · · ≥ bn

in Ωαij ∩W we must have bi = bi+1 = . . . bj . This means that the codimension 1 faces of W must be theΩαi,i+1 =

∑bili|bi = bi+1, the ones perpendicular to the simple roots.

We see here explicitly what the fundamental weights are, namely ωi = l1 + · · ·+ li, which are the first onthe edges of the Weyl chamber. Then the irreducible representations will correspond to any (n − 1)−tupleof nonnegative integers (a1, . . . , an−1), corresponding to the weight α =

∑aiωi = (a1 + · · · + an−1)l1 +

. . . an−1ln−1, Γa1,...,an−1 .

5.2 Representations of sl(n, C)

We saw that α = (a1 + · · ·+an−1)l1 + . . . an−1ln−1 is in the Weyl chamberW and for ai ∈ N it is also clearlyin ΛW , so it is a weight. We want to show that the representation corresponding to it, Γα does in fact exist.

Let V = Cn be the standard representation of sl(n,C). What are its weights? For h = diag(b1, . . . , bn)with

∑bi = 0, its eigenvalues as a matrix are simply its diagonal entries b1, . . . , bn. So if ei are the usual

basis vectors of V , then h.ei = bi.ei, so for every h we have h.ei = li(h)ei, so ei ∈ Vli - weight space forweight li and clearly V = ⊕iVli . This shows that the weights of this representation are the lis. It is clearlyirreducible, for example by checking that the matrix with 1s above the main diagonal and 1 at the lowerleft corner and all other entries 0 sends en into en−1, then en−2, e1 to en, i.e. acts transitively on the basisvectors.

Now consider∧i

V , the ith exterior power of V . Its elements are v1∧· · ·∧vi and the action of an elementu ∈ L is given by the usual action on a tensor product. We have u(v1 ⊗ . . . vi) =

∑j v1 ⊗ . . . u(vj)⊗ . . . vi,

with the wedge product we just replace ⊗ with ∧. Now∧i

V has basis induced from the tensor product basisof i−tuples of unordered basis vectors, i.e. ej1,...,ji = ej1 ∧ · · · ∧ eji such that j1 < j2 · · · < ji. Consider againthe action of h = diag(b1, . . . , bn) on a basis vector, we have h(ej1,...,eji ) =

∑k ej1 ∧ . . . (bjkejk) ∧ . . . eji =

(∑k bjk)ej1,...,ji . So we see that h(ej1,...,ji) = (lj1 + · · · + lji)(h)ej1,...,ji and so every basis vector of

∧iV

24

Page 25: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

spans a one-dimensional weight space Vlj1+···+ji. The weights of this representation are given by all possible

sums of i different ljs, i.e. from l1 + · · · + li to ln−i+1 + · · · + ln. In particular we see that every weight isin the image of the Weyl group acting on l1 + · · · + li, namely if lj1 + · · · + lji is a weight than if σ is thepermutation sending σ(k) = jk for k = 1, . . . , i and everything else in what remains, then since σ ∈ Sn 'Wwe have σ(l1 + · · · + li) = lj1 + · · · + lji as the elements of the Weyl group are all possible transpositionslr ↔ lp. So since we saw how l1 + · · · + li ∈ W, the Weyl chamber, we then have that

∧iV = Γl1+···+li as

follows. First of all, l1 + · · ·+ li is certainly the highest weight of∧i

V and if it had any subrepresentations,then one of them would have to account for the highest weight and so be Γl1+···+li . But then Γl1+···+li wouldalso account for the weights that are images of l1 + · · · + li under W, so all the weights of

∧iV , counting

multiplicities as these are all 1, and so it must be equal to∧i

V .By theorem 20 we must have an irreducible representation Γα for every α = (a1 + · · ·+ an−1)l1 + · · ·+

an−1ln−1, where ai ∈ N. If we rewrite α as α = a1l1 +a2(l1 + l2)+ · · ·+an−1(l1 + · · ·+ ln−1), this can suggestwhere to look for the desired representation in view of the preceding paragraph. Namely, since

∧iV has

highest weight l1+· · ·+li and a highest weight vector v, then Symk∧i V will have highest weight k(l1+· · ·+li)and highest weight vector v . . . v︸ ︷︷ ︸

k

. We will show this explicitly. For I = j1, . . . , jk, let eI = ej1 ∧· · ·∧eji and

lI =∑p∈I lp, then Symk∧i V is spanned by the commutative products eI1 . . . . eIk and again by the action

of a Lie algebra on tensor products we have that for h ∈ H, h(eI1 . . . eIk) =∑kr=1 eI1 . . . h(eIr ) . . . eIk =∑

r lIr (eI1 . . . eIk) = (∑r lIr )eI1 . . . eIk . So each of these elements eI1 . . . eIk span a one dimensional weight

space for Symk∧i V for the weight∑r

∑p∈Ir lp and we readily see by the arrangement of weights li > lj as

long as i < j that the maximal weight is k.(l1 + · · ·+ li) (by maximizing each Ir to be 1, . . . , i, keeping inmind that IR should contain i different numbers). Now since Symk∧i V has maximal weight k(l1 + · · ·+ li),it must also contain the unique irreducible representation Γk(l1+···+li) (i.e. it must contain an irreduciblerepresentation generated by a highest weight vector for the given highest weight and since there is only oneirreducible representation of a given highest weight, it must be our Γ).

Now the last step is to show that the representation S = Syma1 V ⊗Syma2∧2

V ⊗· · ·⊗Syman−1(∧n−1

V )has highest weight α = a1l1 + a2(l1 + l2) + · · · + an−1(l1 + · · · + ln−1) and by the discussion at the end ofthe preceding paragraph we will necessarily have that it contains the irreducible representation Γα. Butthis actually follows by similar reasoning as in the previous paragraph. Again our representation S isgenerated by all tensor products v1 ⊗ . . . vn−1, where vi run through the generators of Symai

∧iV as in

the preceding paragraph. By it we have h(vi) = mi(h)vi, where if vi = eI1 . . . eIk then mi =∑kr=1 lIr . So

h(v1 ⊗ · · · ⊗ vn−1) =∑n−1p=1 v1 ⊗ . . . h(vp)⊗ · · · ⊗ vn−1 = (

∑pmp)(h)(v1 ⊗ · · · ⊗ vn−1) and the weights of S

must be exactly these sums∑n−1p=1 mp. In order to maximize this sum, since the mps are independent, we

maximize each mp, which by the preceding paragraph gives us exactly ap(l1 + · · · + lp), so at the end themaximal weight is

∑p ap(l1 + · · ·+ lp) = α, which is what we needed to show. Since the αs run through all

possible maximal weights we have exhausted all possible irreducible representations and so we’ve proven thefollowing theorem.

Theorem 21. The irreducible representations of sl(n,C) are the Γa1,a2,...,an−1 of highest weight (a1 +. . . an−1)l1+. . . an−1ln−1 and it appears as a subrepresentation of Syma1 V⊗Syma2

∧2V⊗· · ·⊗Syman−1(

∧n−1V ).

5.3 Weyl’s construction, tensor products and some combinatorics

Our goal here is to exhibit the irreducible representations Γa1,...,an−1 as explicitly as possible as subspaces ofthe d−th tensor power V ⊗d, where d is the dimension of Syma1 V ⊗Syma2

∧2V ⊗· · ·⊗Syman−1(

∧n−1V ) ⊂

V ⊗d, i.e. d =∑i aii.

Observe that since sl(n,C) acts on V , and Sd acts on V ⊗d by permuting coordinates, then the twoactions commute. Moreover, we know from both [1] and [3] what the representations of Sd are, in particularthey are nicely indexed by partitions λ ` d λ1 ≥ λ2 ≥ . . . λn, and in fact so are the Γa1,...,an−1 if we writethe partition of d as (

∑n−1i=1 ai,

∑n−1i=2 ai, . . . , an−1, 0) in bijection with any sequence of nonnegative integers

25

Page 26: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

(a1, . . . , an−1). We can then use the representations of Sn to study the representations of sl(n,C) and forthat we will digress a little bit in reviewing representations of Sn.

A standard way of studying representations of Sn is as shown in [1] the use of Young symmetrizers.Consider the group algebra CSd and let T be a Young tableau of shape λ, where λ ` d. Then let Pλ =g ∈ Sd : g preserves each row of T and Qλ = g ∈ Sd : g preserves each column of T. The choice of Tdoes not matter as long as we are consistent (use the same one) as the sets would be conjugate. Now letaλ =

∑g∈Pλ eg and bλ =

∑g∈Qλ sgn(g).eg, where eg is the element in the group algebra corresponding to g.

The define the Young symmetrizer cλ = aλ.bλ ∈ CSd. The main theorem in the representaiton theoryof Sd states that c2λ = nλcλ for nλ ∈ N and the image of cλ by right multiplication on CSd is an irreduciblerepresentation Vλ and every irreducible representation is obtained in this way.

Going back to V ⊗d we have the action of Sd on it given by (v1⊗· · ·⊗vd).σ = (vσ(1)⊗· · ·⊗vσ(d)) for σ ∈ Sdand so we can consider the image of cλ on V ⊗d and denote it by SλV = Im(cλ|V ⊗d) = V ⊗d.cλ = V ⊗d⊗CSdVλ,which will be a subrepresentation of V ⊗d for GLn and hence for SLn and taking the differential of therepresentation map we obtain a corresponding irreducible representation of sl(n,C).

Theorem 22. The representations Sλ(Cn) is the irreducible representation of sl(n,C) with highest weightλ1l1 + · · ·+ λnln. It is in fact the representation Γa1,...,an−1 given by ai = λi − λi−1.

Proof. We need to see how the representation of SLn relates to a corresponding representation of sl(n,C).There is no natural map from a Lie group to its Lie algebra, but there is certainly a map from a Lie algebrato a Lie group, the exponential. If ρ : SL(V )→ SL(W ) is a representation, the induced representation willbe dρ : sl(V )→ sl(W ) and the following diagram will commute

sl(V )dρ //

exp

sl(V ′)

exp

SL(V )

ρ// SL(V )

. (10)

So in particular if V ′ = ⊕V ′α is a weight space decomposition, i.e. for every w ∈ V ′α and h ∈ Hwe have ρ(h)w = α(h).w, then by commutativity we will have that ρ(exp(h)) = exp((dρ)(h)). Since weare dealing with matrices we have exp((dρ)(h)) = 1 + (dρ)(h) + (dρ)(h)2

2 + . . . and in particular we have

exp((dρ)(h))w =∑i

(dρ(h))i

i! w =∑iα(h)i

i! w = eα(h)w, so its eigenvalues are eα(h) with the same eigenvectors.Therefore we have that if A ∈ SL(V ) with A = exp(h), then Tr(ρ(A)) =

∑α∈W eα(h).

Now if h = diag(y1, . . . , yn) ∈ sl(n,C) then A = diag(ey1 , . . . , eyn), then we have for β = b1l1+· · ·+bnln ∈W (i.e. a weight of V’, so bi ∈ Z), we have β(h) = b1y1 + · · ·+ bnyn, so

Tr(A) =∑β∈W

eβ(h) =∑β∈W

eb1y1+b2y2+···+bnyn =∑β∈W

(ey1)b1(ey2)b2 . . . (eyn)bn . (11)

Now if we assume for a moment some of the representation theory of GLn, which will be shown later inappendix A, we can use a theorem from the theory of Schur functors giving us a formula for the trace ofA when V ′ = SλV . The theorem states that for any A ∈ GL(V ), the trace of A on SλV , which is now arepresentation of GL(V ), is the value of the Schur polynomials (see ??) on the eigenvalues x1, . . . , xn of Aon V , i.e. Tr|SλV (A) = sλ(x1, . . . , xn).

We know from ?? that sλ = mλ +∑µ<λKλµmµ, where mµ are the monomial symmetric functions, i.e.

mµ =∑i1,i2,...,in distinct x

µ1i1xµ2i2. . . xµnin , and Kλµ are the Kostka coefficients, whose combinatorial represen-

tation is given by the number of semi-standard Young tableaux of shape λ and type µ. So for any diagonalmatrix A ∈ GL(V ), which we can always write as A = diag(ey1 , . . . , eyn), we have that

Tr(A) = sλ(ey1 , . . . , eyn) = mλ(ey1 , . . . , eyn) +∑µ<λ

mµ(ey1 , . . . , eyn). (12)

26

Page 27: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

Now the last equation (12) can be expanded as a sum of monomials in ey1 , . . . , eyn , but so is the formula (11).As these must be equal for all y1, . . . , yn, we must have that the monomials coincide, so the weights β ∈Wmust be of the kind µi1 l1 + · · · + µin ln for some permutation i1, . . . , in of (µ1, . . . , µn) for some µ ≤ λ andmust occur with multiplicity Kλµ. In particular we have that the weight λ1l1 + · · · + λnln has multiplicityexactly one and is highest among all appearing weights10. By the uniqueness of irreducible representationswith given highest weight it follows that the representation Sλ(Cn) is the irreducible representation of highestweight λ1l1 + · · ·+ λnln.

As a consequence to this theorem we are enabled thanks to combinatorics rules to determine the decom-position of any tensor power into irreducible representations. Specifically, if V1, . . . , Vk are representations,then the trace of a matrix A ∈ GLn on V1 ⊗ V2 ⊗ · · · ⊗ Vk is Tr|V1⊗···⊗Vk(A) = Tr|V1(A). . . . T r|Vk(A). Wenow know that the traces are polynomials (necessarily symmetric!) in the eigenvalues and the traces of theirreducible representations are the Schur polynomials. So if f(x1, . . . , xn) is the trace of our representationof interest, if we write f(x1, . . . , xn) =

∑λ cλsλ(x1, . . . , xn), then we will have that cλ is the multiplicity of

the irreducible representation Sλ(Cn).An important example of this is the decomposition of Sµ(Cn)⊗ Sν(Cn) into irreducible representations.

The trace polynomial will be sµ(x1, . . . , xn)sν(x1, . . . , xn) and we know, e.g. by [3], that sµsν =∑λ c

λµνsλ,

where the coefficients cλµν are called the Littlewood-Richardson coefficients and since they necessarily areequal to the multiplicity of an irreducible representation they must be nonnegative integers. They can alsobe computed by the Littlewood-Richardson rule, saying that cλµν is the number of skew SSYTs of shape λ µ,type ν, and such that their reverse reading word is a lattice permutation.

5.4 Representations of GLn(Cn)

We certainly have that the SλCns are irreducible representations of GLn as shown in the appendix, thequestion is whether these are all the irreducible representations.

The commutative diagram (??), which comes from the theory we developed about maps between Liegroups and their Lie algebras, shows that when the Lie group is simply connected then there is one-to-onecorrespondence between the irreducible representations of the Lie group and the irreps of its Lie algebra.One can show this by arguing as follows: if V1 ⊂ V is a subrepresentation of the group / algebra then thesame is true for the algebra/group by the commutativity of exp and the representation map.

Since SLn(C) is simply connected we have such one-to-one correspondence with the irreducible represen-tations Γa1,...,an−1 of sl(n,C). GLn however is not simply connected and gln is not even semisimple, so wewill try to build the irreducible representations from the special groups.

Denote Cn − V and let Dk be the one-dimensional representation of GLnC given by the kth powerof the determinant, so if k ≥ 0 we have Dk = (

∧nV )⊗k. Given a representation of SLn we can lift

it to CLn by tensoring with Dk. If for any a = (a1, . . . , an) we denot by Φa the subrepresentation ofSyma1 V ⊗ · · · ⊗ Syman(

∧nV ), spanned by v = (e1)a1 .(e1 ∧ e2)a2 . . . (e1 ∧ · · · ∧ en)an , we see that passing to

the Lie algebra corresponds to the irrep of highest weight a1l1 + · · · + (a1 + · · · + an)ln, and in particularrestricting to SLn gives Γ(a1,...,an−1). Or we can set λ = (a1 + · · · + an, a2 + · · · + an, . . . , an) and considerSλV as a representation Ψλ of GLn. We see that in the first case we have Φa1,...,an+k = Φa1,...,an⊗Dk and inthe second Psiλ1+k,...,λn+k = Ψλ⊗Dk. Moreover these two representations are actually isomorphic, as theirrestrictions to SLn agree and their restriction to the center C∗ ⊂ GLn are both Dan , so agree everywhere.We will show finally that these are all the irreducible representations of GLn.

Theorem 23. Every irreducible complex representation of GLnC is isomorphic to Ψλ for a unique indexλ = (λ1, . . . , λn) with λ1 ≥ λ2 ≥ · · · ≥ λn.

Proof. Consider the corresponding Lie algebras. We have that glnC = slnC × C, where C is the idealformed by matrices a.I with a ∈ C and I -the identity matrix. In particular C is the radical of gln and n is

10Follows by the ordering of the weights li > li+1 and by the fact that any other weight appearing is of the form µ1l1+· · ·+µnlnwith µ < λ

27

Page 28: Lie algebras, their representation theory and GL Minor Thesispanova/minorthesis.pdf · The goal of this minor thesis is to develop the necessary theory of Lie algebras, Lie groups

the semisimple part. The following reasoning shows that every irreducible representation of gln is a tensorproduct of an irreducible representation of sln and a one-dimensional representation.

Let U be an irreducible representation of gln. We have that by Lie’s theorem since Rad(gln) = C issolvable and since Rad(gln) ⊂ gl(U) then U contains a common eigenvector for all elements in Rad(gln) andso there is a linear functional α, such that W = w ∈ U |x.w = α(x)v for every x ∈ Rad(L) is nonempty.But then since every element of gln can be written as x + y with x ∈ Rad(gln) and y ∈ sln we have thatfor w ∈ W x.y.w = y.x.w + [x, y](w) = α(x)(y.w) since in our case Rad(gln) commutes with every element.Then we must have y.w ∈W , so gln : W →W and hence W is a subrepresentation of U , so must be all of U .Now if we extend α to gln and let D be the one-dimensional representation given by α, i.e. for every z ∈ Dand x ∈ gln we have x.z = α(x)z and so U ⊗ D∗ is trivial on Rad(gln) and so is a necessarily irreduciblerepresentation of sln.

So if Wλ = Sλ(C^n) is the irreducible representation of sln(C) given by the partition (λ1, . . . , λn), we can extend it to gln = sln × C by letting the second factor act trivially. By the above consideration, every irreducible representation of gln is isomorphic to Wλ ⊗ L(w), where L(w) is the one-dimensional representation on which the second factor of slnC × C acts by multiplication by w ∈ C.

Passing back to Lie groups, the same is true for the simply connected group SLnC × C (we take the simple connectedness for granted here). We now need to pass from SLnC × C to GLnC, and for that there is a natural map ρ : SLnC × C → GLnC given by ρ(g, z) = e^z·g. The kernel of this map is the infinite cyclic group generated by (e^s·I, −s) with s = 2πi/n (this value of s is what is needed for e^s·I to lie in SLnC), so GLnC ≅ (SLnC × C)/ker ρ. Now an irreducible representation of GLnC can be lifted to an irreducible representation of SLnC × C which is trivial on ker ρ, and this defines a one-to-one correspondence of irreducible representations; so we need to find which of the Wλ ⊗ L(w) are trivial on ker ρ. The element e^s·I acts on Sλ by multiplication by e^{sd}, where d = Σ λi, since that is how it acts on V^{⊗d}, and −s acts on L(w) by multiplication by e^{−sw}; so the generator of ker ρ acts by multiplication by e^{sd−sw}, which is trivial iff s·(d − w) ∈ 2πiZ. Since s = 2πi/n, this is equivalent to w = Σ λi + kn for some k ∈ Z.

Then the representations Wλ ⊗ L(Σ λi + kn) are exactly the ones that descend to GLnC, the representation descending from Wλ ⊗ L(Σ λi + kn) is Ψ_{λ1+k,...,λn+k}, and conversely the pullback of Ψ_{λ1+k,...,λn+k} is Wλ ⊗ L(Σ λi + kn). So we have exhausted all irreducible representations of GLnC, and we are done.
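As a concrete check (an added illustration): for n = 2 we have s = πi, so ker ρ is generated by (e^{πi}I, −πi) = (−I, −πi). Take λ = (1, 0), so Wλ = C² and d = 1; the condition above reads w = 1 + 2k, and Wλ ⊗ L(1 + 2k) descends to Ψ_{(1+k, k)} ≅ C² ⊗ D^k, the standard representation twisted by a power of the determinant.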

5.5 Some explicit construction, generators of SλV

For a sequence of nonnegative integers a = (a1, . . . , ak), let A^a = Sym^{a_k}(∧^k V) ⊗ · · · ⊗ Sym^{a_1}V; as we saw, the irreducible representation Γ_a sits inside A^a. Here we will describe A^a and exhibit a basis for Sλ. A way to describe A^a is to look at the right action of the symmetric group on V^{⊗d} and note that on one hand we have

Sλ(V) = V^{⊗d}·cλ ⊂ A^a ⊂ (∧^k V)^{⊗a_k} ⊗ · · · ⊗ V^{⊗a_1} = V^{⊗d}·bλ.

These inclusions suggest that we can find an element a′ ∈ CSd such that A^a = V^{⊗d}·a′·bλ. Finding this a′ is not hard once we note that (k, . . . , k, k−1, . . . , k−1, . . . , 1, . . . , 1), in which each l appears a_l times, is in fact the conjugate partition of λ, denoted λ′, i.e. the list of the lengths of the columns of λ. The wedges ∧^l V are just V^{⊗l}·b_{(1^l)}, where b_{(1^l)} = Σ_σ sgn(σ)σ, the sum over the permutations σ of the entries of a column of length l (i.e. bλ for λ a single column of length l). On the other hand, Sym^{a_i}(∧^i V) amounts to taking all the columns of length i and symmetrizing them, i.e. permuting them among each other in all possible ways; that is, Sym^{a_i}(∧^i V) = (∧^i V)^{⊗a_i}·a_{(a_i)}, where a_{(a_i)} is the element aλ for λ = (a_i) (do not confuse the two different kinds of a's!). So wedging over the columns of different lengths gives

A^a = (V^{⊗k}·b_{(1^k)})^{⊗a_k}·a_{(a_k)} ⊗ · · · ⊗ V^{⊗a_1}·a_{(a_1)} = V^{⊗d}·a′·bλ,

where a′ = Σ_{r ∈ S_{a_k}×···×S_{a_1}} e_r is the sum over the permutations r which permute the columns of the same length among each other, i.e. over the group S_{a_k} × · · · × S_{a_1}. Note also that a′ and bλ commute, for the same reason that aλ and bλ do. It is also clear that aλ = a″·a′, where a″ = Σ_p e_p, the sum over coset representatives p of (S_{λ1} × · · · × S_{λk})/(S_{a_k} × · · · × S_{a_1}).
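A small example may help (added here for illustration): take λ = (2, 1), so d = 3, λ′ = (2, 1) and a = (a1, a2) = (1, 1) (one column of length 1 and one of length 2), hence A^a = ∧²V ⊗ V. Filling the boxes of λ row by row (1, 2 in the first row, 3 in the second), the column group gives bλ = e − e_{(13)} and the row group gives aλ = e + e_{(12)}; since no two columns have the same length, a′ = e and a″ = aλ, consistent with aλ = a″·a′. Here V^{⊗3}·bλ ≅ ∧²V ⊗ V (the wedge taken in positions 1 and 3), and Sλ(V) = V^{⊗3}·cλ sits inside it.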


Set A•(V) = ⊕_a A^a(V) = Sym•(V ⊕ ∧²V ⊕ · · · ⊕ ∧^n V), which makes a graded ring out of the A^a(V)'s. Then we can define a graded ring S• as the quotient of A•(V) by the two-sided ideal I• generated, for all p ≥ q ≥ r ≥ 1, by all elements of the form (the "Plücker relations")

(v1 ∧ · · · ∧ vp)·(w1 ∧ · · · ∧ wq) − Σ (v1 ∧ · · · ∧ w1 ∧ · · · ∧ wr ∧ · · · ∧ vp)·(v_{i1} ∧ v_{i2} ∧ · · · ∧ v_{ir} ∧ w_{r+1} ∧ · · · ∧ wq),

where the sum is over all 1 ≤ i1 < i2 < · · · < ir ≤ p and the w1, . . . , wr are inserted in the places of v_{i1}, . . . , v_{ir}.

We note that S• is in fact the graded algebra of the Sλ's. A way to see this is to notice that Sλ = (V^{⊗d}·a″)·a′·bλ, which amounts to averaging those elements of A^a that are obtained from one another via a coset representative p ∈ (S_{λ1} × · · · × S_{λk})/(S_{a_k} × · · · × S_{a_1}). In particular the relations involved restrict to the case of only two columns, represented by (v1 ∧ · · · ∧ vp) and (w1 ∧ · · · ∧ wq), in which we average over all r and over all possible choices of rows i1 < · · · < ir, exchanging the elements in rows i1, . . . , ir between the two columns so that the first column becomes (v1 ∧ · · · ∧ w_{i1} ∧ · · · ∧ w_{ir} ∧ · · · ∧ vp) and the second (w1 ∧ · · · ∧ v_{i1} ∧ · · · ∧ v_{ir} ∧ · · · ∧ wq). In the original relations we move elements from positions (i1, . . . , ir) to (1, . . . , r) two times, i.e. we conjugate by a permutation preserving the columns, and since we apply it twice we do not change the sign (think of the permutations appearing in bλ). In any case we have A•/I• = S•.
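For orientation (an added remark): the smallest instance of these relations is p = 2, q = r = 1, where the generator reads (v1 ∧ v2)·w1 − (w1 ∧ v2)·v1 − (v1 ∧ w1)·v2, i.e. in S• one has (v1 ∧ v2)·w1 = (w1 ∧ v2)·v1 + (v1 ∧ w1)·v2, the classical three-term exchange relation between a column of length two and a column of length one.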

We are now going to exhibit explicit elements of A• whose images form a basis of S•. Let e1, . . . , en be a basis for V and let T be a semistandard Young tableau of shape λ filled with the integers 1, . . . , n, i.e. the rows are nondecreasing and the columns strictly increasing. Let T(i, j) denote the entry of T in row i and column j, and define

e_T = ∏_{j=1}^{λ1} e_{T(1,j)} ∧ e_{T(2,j)} ∧ · · · ∧ e_{T(λ′_j, j)},

i.e. we wedge together the entries in each column and multiply the results in S•. We are going to prove the following theorem.
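To fix ideas (an added example): for λ = (2, 1) and the tableau T with first row (1, 1) and second row (2), the columns are (1, 2) and (1), so e_T = (e1 ∧ e2)·e1 in S•; this T is exactly the tableau T0 with T0(i, j) = i that appears at the end of the proof below.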

Theorem 24. The e_T, for T a semistandard tableau of shape λ, form a basis for S^a(V), the image of A^a(V) in S•. The projection from A^a(V) to S^a(V) maps the subspace Sλ isomorphically onto S^a(V).

Proof. We first show that the elements e_T generate S^a. To see that the e_T span S^a, note that S^a is clearly spanned by the e_T for all tableaux T with strictly increasing columns (not just the semistandard ones), as these span A^a itself. Introduce a reverse lexicographic ordering on such T by listing their entries column by column, within each column from top to bottom, and comparing two listings starting from the last entry and moving backwards until we find two differing entries. For example, the tableau T0 with T0(i, j) = i for all i, j is the smallest one. Suppose now that a tableau T is not semistandard, i.e. there are two adjacent columns, indexed j and j + 1, and a row r with T(r, j) > T(r, j + 1); let r be the smallest such row. Let v1, . . . , vp be the column of index j and w1, . . . , wq the one of index j + 1 (i.e. vi = T(i, j), wi = T(i, j + 1)). For the r we just picked, the Plücker relations in I• give

(v1 ∧ · · · ∧ vp)·(w1 ∧ · · · ∧ wq) = Σ (v1 ∧ · · · ∧ w1 ∧ · · · ∧ wr ∧ · · · ∧ vp)·(v_{i1} ∧ · · · ∧ v_{ir} ∧ w_{r+1} ∧ · · · ∧ wq).

Multiplying by the remaining columns of T, we get e_T = Σ e_{T′}, where the T′ are obtained from T by interchanging the first r elements of column j + 1 of T with some r elements of column j of T. The listing of the entries of T′ then agrees with that of T, starting from the back, until we reach the position (r, j + 1), where the entry of T′ is v_{ir} = T(i_r, j) ≥ T(r, j) > T(r, j + 1); so in the reverse lexicographic ordering we have T′ > T. In this way we see that, by the Plücker relations, e_T = Σ e_{T′} with T′ > T whenever T is not semistandard. Iterating, we can express e_T for every non-semistandard tableau T as a sum of e_{T′} with T′ > T, so we only move up in the ordering. Since there are finitely many tableaux of shape λ with entries in {1, . . . , n}, the process terminates, and e_T is eventually written as a sum of e_{T′} with T′ semistandard (otherwise we could keep expressing terms as sums of strictly larger ones). This shows that the e_T for T semistandard generate S^a(V).
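A small worked case (added for illustration): for shape (2, 1), take the column-strict but non-semistandard tableau T with columns (2, 3) and (1). The relation with p = 2, q = r = 1 gives (e2 ∧ e3)·e1 = (e1 ∧ e3)·e2 + (e2 ∧ e1)·e3 = e_{T′} − e_{T″}, where T′ has columns (1, 3), (2) and T″ has columns (1, 2), (3); both are semistandard and both are larger than T in the reverse lexicographic order (the sign comes from sorting the entries within a column).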

We now need to show that the e_T are linearly independent. We will show this indirectly, by showing that Sλ(V) ⊂ S^a. If the projection of Sλ into S^a is not zero, then their intersection is not 0, and since both Sλ and S^a are representations of GL(V) with Sλ irreducible, Sλ must be a submodule of S^a. For every representation, its dimension is the trace (character) at the identity of the group, so in our particular case dim Sλ = Tr(I) = sλ(1, . . . , 1) (from Appendix A). On the other hand, from the combinatorial interpretation of Schur functions ([3]) we have sλ(x1, . . . , xn) = Σ_{T: shape(T)=λ} x^T, so sλ(1, . . . , 1) = Σ_{T: shape(T)=λ} 1 is the number of semistandard tableaux T of shape λ filled with the numbers 1, 2, . . . , n. But this is exactly the number of generators e_T of S^a exhibited above (the semistandard tableaux T of shape λ filled with 1, . . . , n), so counting dimensions we get

#{e_T generating S^a, i.e. T semistandard} ≥ dim S^a ≥ dim Sλ = #{T semistandard},

so all the inequalities are equalities, showing that the e_T are linearly independent and that S^a ≅ Sλ.
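As a concrete cross-check of this dimension count, here is a short brute-force computation (an added sketch; the function name is mine and not from the text) that enumerates the semistandard tableaux of a given shape with entries in {1, . . . , n}, i.e. evaluates sλ(1, . . . , 1):

    from itertools import product

    def ssyt_count(shape, n):
        # Count semistandard Young tableaux of the given shape (a weakly
        # decreasing list of row lengths) with entries in 1..n, i.e. evaluate
        # the Schur polynomial s_shape(1, ..., 1) = dim S_shape(C^n).
        cells = [(i, j) for i, row in enumerate(shape) for j in range(row)]
        count = 0
        for entries in product(range(1, n + 1), repeat=len(cells)):
            T = dict(zip(cells, entries))
            rows_weakly_increasing = all(T[(i, j)] <= T[(i, j + 1)]
                                         for (i, j) in cells if (i, j + 1) in T)
            cols_strictly_increasing = all(T[(i, j)] < T[(i + 1, j)]
                                           for (i, j) in cells if (i + 1, j) in T)
            if rows_weakly_increasing and cols_strictly_increasing:
                count += 1
        return count

    # Shape (2,1): 8 tableaux with entries in {1,2,3} and 2 with entries in
    # {1,2}, matching dim S_(2,1)(C^3) = 8 and dim S_(2,1)(C^2) = 2.
    print(ssyt_count([2, 1], 3), ssyt_count([2, 1], 2))

The enumeration is exponential in the number of boxes, so it is only meant as a check for small shapes.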

So it remains only to show that the image of Sλ in S• is nonzero. First of all, each e_T is in fact in Sλ (one can show this by checking that e_T·cλ = c·e_T for some scalar c). Next we show that at least one of them has a nonzero image in S•. We check this for e_{T0}, where T0(i, j) = i for all i, j. Now T0 is certainly the smallest tableau in the reverse lexicographic order, and moreover applying the Plücker relations to any two of its columns gives back the same tableau: here w1 = 1, . . . , wr = r and v_{i1} = i1, . . . , v_{ir} = ir, and the wedge v1 ∧ · · · ∧ w1 ∧ · · · ∧ vp vanishes for having two equal entries unless we replaced w1, . . . , wr exactly with v1, . . . , vr, which leads to the trivial relation e_{T0} − e_{T0}. In particular e_{T0} cannot appear as a summand in any Plücker relation, since reversing the exchange of r elements between two columns of T0 cannot result in anything but T0. So e_{T0} ∉ I•, showing that its image under the projection is not 0, and hence that e_{T0} ∈ Sλ ∩ S^a.

6 Conclusion

In this paper we developed the basic theory of Lie algebras, classifying them through properties like solvability, nilpotency and simplicity, and giving criteria for establishing such properties. We studied the structure of semisimple Lie algebras via roots and the root space decomposition, and used that to study the irreducible representations of semisimple Lie algebras. We showed in particular a theorem that establishes a bijection between irreducible representations and highest weights (i.e. points in the intersection of the Weyl chamber with the weight lattice), which gave us a convenient way of listing all irreducible representations. We put the theory into practice by studying the structure and representations of sl(n, C). We established connections between Lie groups and Lie algebras, which enabled us in particular to control the irreducible representations of the Lie group GL(V) by the irreducible representations of its Lie algebra gl(V), thereby proving that the irreducible representations Sλ are in fact all the irreducible representations of GL(V). Finally, we presented some explicit constructions at the intersection of combinatorics and algebra.

A Schur functors, Weyl’s construction and representations of GLn(C)

Consider the right action of Sd on V^{⊗d}, for any vector space V, given by (v1 ⊗ · · · ⊗ vd).σ = v_{σ(1)} ⊗ · · · ⊗ v_{σ(d)}. This action commutes with the left action of GL(V), which acts coordinatewise on the tensor product. Let λ be a partition of d and let cλ be the Young symmetrizer in the group algebra of Sd, defined as cλ = aλ·bλ, where aλ = Σ_{g∈P} e_g and bλ = Σ_{g∈Q} sgn(g)·e_g; here P is the group of permutations preserving the rows of a Young tableau of shape λ filled with the elements 1, . . . , d, and Q is the group of permutations preserving the columns of the same tableau. Let SλV = V^{⊗d}·cλ be the so-called Schur functor (it is a functor on vector spaces V and linear maps between them). It is clearly a representation of GL(V). We are going to prove the following theorem.
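Before the theorem, the smallest nontrivial example (added for concreteness): for d = 3 and λ = (2, 1), numbering the tableau with 1, 2 in the first row and 3 in the second, we get P = {e, (12)} and Q = {e, (13)}, so cλ = (e + e_{(12)})(e − e_{(13)}) = e + e_{(12)} − e_{(13)} − e_{(12)}e_{(13)}; one can check that cλ² = 3·cλ.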

Theorem 25. For any g ∈ GL(V) the trace of g on Sλ(V) is the value of the Schur polynomial at the eigenvalues x1, . . . , xn of g on V, that is, Tr|_{SλV}(g) = sλ(x1, . . . , xn). Each SλV is an irreducible representation of GL(V).

In order to prove this theorem we first prove some general lemmas. Let G be any finite group (in our case its role will be played by Sd) and let A = CG be its group algebra. For U a right module over A, let B = Hom_G(U, U) = {φ : U → U | φ(v.g) = φ(v).g for all v ∈ U and g ∈ G}. If U decomposes into a direct sum of irreducible modules as U = ⊕_i U_i^{⊕n_i}, then, since by Schur's lemma a map between two non-isomorphic irreducible modules is 0 and a map from an irreducible module to itself is scalar multiplication, we have Hom_G(U_i, U_j) = 0 for i ≠ j and Hom_G(U_i^{⊕n_i}, U_i^{⊕n_i}) = M_{n_i}(C), the n_i × n_i matrices. If now W is a left A-module, then the tensor product U ⊗_A W is a left B-module, since for φ ∈ B and a ∈ A we have φ.(u.a ⊗ w) = φ(u.a) ⊗ w = φ(u).a ⊗ w = φ(u) ⊗ a.w, so the action φ.(u ⊗ w) = φ(u) ⊗ w is well defined.


Lemma 7. Let U be a finite-dimensional right A-module. Then:
(i) For any c ∈ A the map U ⊗_A Ac → Uc is an isomorphism of left B-modules.
(ii) If W = Ac is an irreducible left A-module, then U ⊗_A W = Uc is an irreducible left B-module.
(iii) If the Wi = Aci are the distinct irreducible left A-modules, with mi = dim Wi, then U ≅ ⊕_i (Uci)^{⊕mi} is the decomposition of U into irreducible B-modules.

Proof. First of all, the map U ⊗_A Ac → Uc is clearly surjective, as Uc is the image of the elements u ⊗ c. So we need to show it is injective. Now Ac is a submodule of A, and A is a representation of the finite group G, so A is completely reducible; in particular A = Ac ⊕ A′ for some submodule A′. So U = U ⊗_A A = U ⊗_A (Ac ⊕ A′) = (U ⊗_A Ac) ⊕ (U ⊗_A A′). If the map φ : U ⊗_A Ac → Uc were not injective, then neither would be the composition U ⊗_A Ac → Uc ⊂ U, contradicting the fact that U ⊗_A Ac is a direct summand of U (it maps isomorphically onto a direct summand under U ≅ U ⊗_A A). This proves part (i).

For part (ii), suppose first that U is irreducible. From the representation theory of finite groups we know that if the Wi are all the irreducible representations of G, then on the one hand Σ_i (dim Wi)² = |G|, and on the other hand, mapping the group algebra into End(⊕_i Wi) = ⊕_i End(Wi) = ⊕_i M_{dim Wi}(C) (the elements of G act as automorphisms of ⊕_i Wi), a dimension count shows that this map is an isomorphism. So we can view A = ⊕_i M_{n_i}(C), where the n_i are the dimensions of the irreducible representations. Since W = Ac is a left ideal of A and is irreducible as a left A-module, it must be a minimal left ideal of A (every left ideal is an A-module), and so a minimal left ideal in the matrix ring M_{n_i} for some i. Now a matrix whose only nonzero column has index j, when multiplied on the left by any matrix, again gives a matrix whose only nonzero column has index j; and every matrix can be multiplied on the left by a suitable matrix (with its only nonzero entry at (j, j)) to produce such a single-column matrix. Hence that minimal left ideal consists of the matrices whose only nonzero column is some fixed column j. Similarly, since U is an irreducible right A-module, it is realized as a minimal right ideal, consisting of the matrices with only one nonzero row, say row i, inside M_{dim W_k} for some k. Finally, for every element of U ⊗_A W, its part in M_{dim W_i} is either 0 (if one of U or W is 0 there) or of the form C ⊗_A D for matrices C, D; writing C = E·C, where E ∈ U is the matrix whose only nonzero entry is a 1 in position (i, i), we get C ⊗_A D = E ⊗ C·D. But for every matrix C with only nonzero row i and every D with only nonzero column j, the product C·D has its only nonzero entry at (i, j). This means that U ⊗_A W is one-dimensional and hence necessarily irreducible.

Now suppose U = ⊕_i U_i^{⊕n_i} with the U_i irreducible; then U ⊗_A W = ⊕_i (U_i ⊗_A W)^{⊕n_i}. Each U_i, being irreducible, lives by the above paragraph in a single matrix block M_{dim W_j}, and for U_i ⊗_A W to be nonzero we only need to consider those U_i lying in the block M_{dim W_k} containing W. However, all irreducible right modules over a matrix algebra are isomorphic, so finally U ⊗_A W = (U_k ⊗_A W)^{⊕n_k} = C^{⊕n_k}, which is clearly irreducible over B = ⊕_j M_{n_j}(C), proving part (ii).

For part (iii), since A = ⊕_i W_i^{⊕m_i} (by, for example, the beginning of the second paragraph of this proof), we have, for U as an A-module, U ≅ U ⊗_A A = U ⊗_A (⊕_i W_i^{⊕m_i}) = ⊕_i (U ⊗_A W_i)^{⊕m_i}. Since by (ii) U ⊗_A W_i is irreducible over B and by (i) it is isomorphic to Uc_i, we get U ≅ ⊕_i (Uc_i)^{⊕m_i}, and by the uniqueness of the decomposition into irreducible submodules this must be the decomposition of U into irreducible B-modules.

To prove Theorem 25 we apply this lemma with A = CSd and U = V^{⊗d}, viewed as a right A-module. Since we know, from for example [1] or [3], what the irreducible representations of Sd are, namely Vλ = Acλ with λ ⊢ d and cλ the Young symmetrizer, we can decompose U ≅ ⊕_λ (Ucλ)^{⊕mλ} (with mλ = dim Vλ) as a left B-module. The question now is to show that the algebra B is spanned by End(V).
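A quick illustration of the lemma in the smallest case (added, not from the original): for d = 2 we have A = CS2, with the two Young symmetrizers c_{(2)} = e + e_{(12)} and c_{(1,1)} = e − e_{(12)}, and U = V ⊗ V. Then Uc_{(2)} = Sym²V and Uc_{(1,1)} = ∧²V, recovering the familiar decomposition V ⊗ V = Sym²V ⊕ ∧²V into GL(V)-modules.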

Lemma 8. The algebra B is spanned, as a linear subspace of End(V^{⊗d}), by the elements φ^{⊗d} for φ ∈ End(V). A subspace of V^{⊗d} is a B-submodule if and only if it is invariant under GL(V).

Proof. One can show that Sym^d W is spanned by the elements w^{⊗d} = w ⊗ · · · ⊗ w as w runs through W. One can see this, for example, by induction on d, starting from the fact that v ⊗ w + w ⊗ v, the generic element of Sym²W, can be written as (v + w) ⊗ (v + w) − (v ⊗ v) − (w ⊗ w), and then proceeding as when dealing with the elementary symmetric polynomials. Considering End(V) = V* ⊗ V, we have End(V^{⊗d}) = (V^{⊗d})* ⊗ V^{⊗d} = (V*)^{⊗d} ⊗ V^{⊗d} = (V* ⊗ V)^{⊗d} = (End(V))^{⊗d}. Now B = Hom_{Sd}(V^{⊗d}, V^{⊗d}) ⊂ End(V^{⊗d}) = End(V)^{⊗d} consists of the Sd-invariant elements, so it lies in Sym^d(End(V)), which by the above is spanned by the elements φ^{⊗d} with φ ∈ End(V). But GL(V) is dense in End(V), so B is even spanned by the g^{⊗d} with g ∈ GL(V), which implies the second statement of the lemma.


This lemma now shows that, in our case, a B-module decomposition is the same as a decomposition into GL(V)-modules. Hence V^{⊗d} ≅ ⊕_λ (V^{⊗d}cλ)^{⊕mλ} is the decomposition into irreducibles over GL(V), with SλV = V^{⊗d}cλ the irreducible summands; this proves the second part of Theorem 25.

For the first part we will use the fact that both the complete homogeneous symmetric polynomials hλ and the Schur polynomials sλ form bases of the space of symmetric polynomials. We do not know explicitly what V^{⊗d}cλ is, but we do know V^{⊗d}aλ: it is isomorphic to Sym^{λ1}V ⊗ Sym^{λ2}V ⊗ · · · ⊗ Sym^{λk}V ≅ V^{⊗d} ⊗_A A·aλ, as aλ symmetrizes the rows of the Young tableau of shape λ. From the representation theory of Sd we have Aaλ ≅ ⊕_{µ⊢d} (Acµ)^{⊕Kµλ}, the decomposition into irreducible representations of Sd (the Kµλ are the Kostka numbers). From our lemma, then, as GL(V)-modules,

Sym^{λ1}V ⊗ Sym^{λ2}V ⊗ · · · ⊗ Sym^{λk}V = V^{⊗d}aλ ≅ ⊕_µ (V^{⊗d}cµ)^{⊕Kµλ}.    (13)

Let g ∈ GL(V) be a matrix with eigenvalues x1, . . . , xn. The trace of g on the left-hand side can be computed in the standard way for tensor products, i.e.

Tr|_{V^{⊗d}aλ}(g) = ∏_i Tr|_{Sym^{λi}V}(g),

and we can compute the trace of g on Sym^p V by observing that if g is diagonal (or diagonalizable) with eigenvectors v1, . . . , vn, then g(v_{i1} · · · v_{ip}) = x_{i1} · · · x_{ip}·(v_{i1} · · · v_{ip}) in Sym^p V, and by a dimension count, running over all i1 ≤ · · · ≤ ip, these are all the eigenvectors of g acting on Sym^p V. So Tr|_{Sym^p V}(g) = Σ_{i1≤···≤ip} x_{i1} · · · x_{ip} = h_p(x1, . . . , xn), the complete homogeneous symmetric polynomial. Using the fact that the semisimple matrices are dense in GL(V), we can extend this result to any g with the given eigenvalues. We then have Tr|_{V^{⊗d}aλ}(g) = ∏_i h_{λi}(x1, . . . , xn) = hλ(x1, . . . , xn), by the definition of hλ. Now taking Tr(g) on both sides of equation (13) gives

hλ(x1, . . . , xn) = Σ_µ Kµλ·Tr|_{V^{⊗d}cµ}(g),    (14)

which coincides with the expression for hλ in terms of the sλ. Since the matrix with entries Kµλ is invertible, after writing the system of equations (14) for all λ ⊢ d we can multiply by the inverse of the Kostka matrix and obtain that the vector with entries Tr|_{SµV}(g), for µ ⊢ d, equals K^{−1}·h, where h is the vector with entries hλ. Since we also know that s = K^{−1}·h, where s is the vector whose entries are the Schur functions sµ, we must have

Tr|_{SλV}(g) = sλ(x1, . . . , xn),

as desired.
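A worked instance of this inversion (added for illustration), for d = 2: here h_{(2)} = s_{(2)} and h_{(1,1)} = h_1·h_1 = s_{(2)} + s_{(1,1)}, so the Kostka matrix is unitriangular and inverting gives s_{(2)} = h_{(2)} and s_{(1,1)} = h_{(1,1)} − h_{(2)}. On the representation side this matches V ⊗ V = Sym²V ⊕ ∧²V: Tr|_{Sym²V}(g) = h_2(x) and Tr|_{V⊗V}(g) = h_1(x)², so Tr|_{∧²V}(g) = h_1² − h_2 = Σ_{i<j} x_i x_j = s_{(1,1)}(x), consistent with S_{(2)}V = Sym²V and S_{(1,1)}V = ∧²V.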

References

[1] William Fulton and Joe Harris, Representation theory, Graduate Texts in Mathematics, vol. 129, Springer-Verlag, New York, 1991. A first course; Readings in Mathematics. MR1153249 (93a:20069)

[2] James E. Humphreys, Introduction to Lie algebras and representation theory, Graduate Texts in Mathematics, vol. 9, Springer-Verlag, New York, 1978. Second printing, revised. MR499562 (81b:17007)

[3] Richard P. Stanley, Enumerative combinatorics. Vol. 2, Cambridge Studies in Advanced Mathematics, vol. 62, Cambridge University Press, Cambridge, 1999. With a foreword by Gian-Carlo Rota and appendix 1 by Sergey Fomin. MR1676282 (2000k:05026)
