11.9 Zero-Mean Gaussian Noise is the Worst Additive Noise

Worst Additive Noise

• We will show that in terms of the capacity of the system, the zero-mean Gaussian noise is the worst additive noise given that the noise vector has a fixed correlation matrix.

• The diagonal elements of the correlation matrix specify the power of the individual noise variables.

• The other elements in the matrix give a characterization of the correlation between the noise variables.

Zero-Mean Gaussian System

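1. Consider a system of correlated Gaussian channels with noise vector $\mathbf{Z}^* \sim \mathcal{N}(0, K)$, and so $\tilde{K}_{\mathbf{Z}^*} = K_{\mathbf{Z}^*} = K$.

2. Let $C^*$ be the capacity of the system.

3. Let $\mathbf{X}^*$ be the zero-mean Gaussian input vector that achieves the capacity.

4. Let $\mathbf{Y}^*$ be the output of the system with $\mathbf{X}^*$ as input, i.e., $\mathbf{Y}^* = \mathbf{X}^* + \mathbf{Z}^*$.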

Alternative System

1. Consider a system exactly the same as the Zero-Mean Gaussian System except that the noise vector $\mathbf{Z}$, which has the same correlation matrix as $\mathbf{Z}^*$, may neither be zero-mean nor Gaussian.

2. Assume that the joint pdf of $\mathbf{Z}$ exists.

3. Let $C$ be the capacity of the system.

4. Let $\mathbf{Y}$ be the output of the system with $\mathbf{X}^*$ as input, i.e., $\mathbf{Y} = \mathbf{X}^* + \mathbf{Z}$.

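Main Idea

[Figure: two block diagrams. Zero-Mean Gaussian System: $\mathbf{Y}^* = \mathbf{X}^* + \mathbf{Z}^*$ with $\mathbf{Z}^* \sim \mathcal{N}(0, K)$. Alternative System: $\mathbf{Y} = \mathbf{X}^* + \mathbf{Z}$ with $\tilde{K}_{\mathbf{Z}} = K$.]

• For the Zero-Mean Gaussian System, $C^* = I(\mathbf{X}^*; \mathbf{Y}^*)$.

• For the alternative system, $I(\mathbf{X}^*; \mathbf{Y}) \le C$.

• We will show that $I(\mathbf{X}^*; \mathbf{Y}^*) \le I(\mathbf{X}^*; \mathbf{Y})$.

• Hence, $C^* = I(\mathbf{X}^*; \mathbf{Y}^*) \le I(\mathbf{X}^*; \mathbf{Y}) \le C$.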
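The inequality $I(\mathbf{X}^*; \mathbf{Y}^*) \le I(\mathbf{X}^*; \mathbf{Y})$ can be checked numerically in a simple special case. The sketch below assumes a scalar channel with input $X^* \sim \mathcal{N}(0, P)$ and compares Gaussian noise of second moment $N$ with zero-mean uniform noise of the same second moment. The function names, the values of `P` and `N`, and the choice of uniform noise are illustrative assumptions, not part of the slides; mutual information is in nats.

```python
import math

def gaussian_cdf(t):
    """Standard normal cdf."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def mi_gaussian_noise(P, N):
    """I(X*;Y*) for Y* = X* + Z*, X* ~ N(0,P), Z* ~ N(0,N), independent."""
    return 0.5 * math.log(1.0 + P / N)

def mi_uniform_noise(P, N, steps=20000):
    """I(X*;Y) for Y = X* + Z, X* ~ N(0,P), Z uniform on [-a, a] with E[Z^2] = N.

    Uses I(X*;Y) = h(Y) - h(Z), with h(Y) computed by a Riemann sum.
    """
    a = math.sqrt(3.0 * N)      # uniform on [-a, a] has second moment a^2 / 3
    sigma = math.sqrt(P)
    lim = a + 10.0 * sigma      # the density of Y is negligible beyond this point
    dy = 2.0 * lim / steps
    h_y = 0.0
    for k in range(steps + 1):
        y = -lim + k * dy
        # Density of Y: convolution of the N(0,P) pdf with the uniform pdf.
        f = (gaussian_cdf((y + a) / sigma) - gaussian_cdf((y - a) / sigma)) / (2.0 * a)
        if f > 0.0:
            h_y -= f * math.log(f) * dy
    h_z = math.log(2.0 * a)     # differential entropy of the uniform noise
    return h_y - h_z

if __name__ == "__main__":
    P, N = 1.0, 1.0
    print("Gaussian noise:", mi_gaussian_noise(P, N))
    print("Uniform noise: ", mi_uniform_noise(P, N))
```

With the same input and the same noise power, the uniform-noise value should come out larger than the Gaussian-noise value, which is the direction the main idea predicts.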

Theorem 11.32 For a fixed zero-mean Gaussian random vector $\mathbf{X}^*$, let

\[ \mathbf{Y} = \mathbf{X}^* + \mathbf{Z}, \]

where the joint pdf of $\mathbf{Z}$ exists and $\mathbf{Z}$ is independent of $\mathbf{X}^*$. Under the constraint that the correlation matrix of $\mathbf{Z}$ is equal to $K$, where $K$ is any symmetric positive definite matrix, $I(\mathbf{X}^*; \mathbf{Y})$ is minimized if and only if $\mathbf{Z} = \mathbf{Z}^* \sim \mathcal{N}(0, K)$.

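Lemma 11.33 Let $\mathbf{X}$ be a zero-mean random vector and

\[ \mathbf{Y} = \mathbf{X} + \mathbf{Z}, \]

where $\mathbf{Z}$ is independent of $\mathbf{X}$. Then

\[ \tilde{K}_{\mathbf{Y}} = \tilde{K}_{\mathbf{X}} + \tilde{K}_{\mathbf{Z}}. \]

Remark The scalar case has been proved in the proof of Theorem 11.21.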
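The vector case follows from a short expansion, sketched here under the definition $\tilde{K}_{\mathbf{X}} = E[\mathbf{X}\mathbf{X}^\top]$ of the correlation matrix. The cross terms vanish because $\mathbf{X}$ and $\mathbf{Z}$ are independent and $E\mathbf{X} = 0$.

```latex
\begin{align*}
\tilde{K}_{\mathbf{Y}}
  &= E\big[(\mathbf{X} + \mathbf{Z})(\mathbf{X} + \mathbf{Z})^\top\big] \\
  &= E[\mathbf{X}\mathbf{X}^\top] + E[\mathbf{X}\mathbf{Z}^\top]
     + E[\mathbf{Z}\mathbf{X}^\top] + E[\mathbf{Z}\mathbf{Z}^\top] \\
  &= \tilde{K}_{\mathbf{X}} + (E\mathbf{X})(E\mathbf{Z})^\top
     + (E\mathbf{Z})(E\mathbf{X})^\top + \tilde{K}_{\mathbf{Z}} \\
  &= \tilde{K}_{\mathbf{X}} + \tilde{K}_{\mathbf{Z}}.
\end{align*}
```

Note that $\mathbf{Z}$ itself need not have zero mean; only $E\mathbf{X} = 0$ is used.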

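Lemma 11.34 Let $\mathbf{Y}^* \sim \mathcal{N}(0, K)$ and $\mathbf{Y}$ be any random vector with correlation matrix $K$. Then

\[ \int f_{\mathbf{Y}^*}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y} = \int_{S_{\mathbf{Y}}} f_{\mathbf{Y}}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y}, \]

where $S_{\mathbf{Y}}$ is the support of $\mathbf{Y}$.

Remark A similar technique has been used in proving Theorems 2.50 and 10.41 (maximum entropy distributions).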
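One way to see why the lemma holds is to write out $\ln f_{\mathbf{Y}^*}$ explicitly. The sketch below uses natural logarithms and assumes $\mathbf{Y}$ is $n$-dimensional; the expectation of $\ln f_{\mathbf{Y}^*}(\mathbf{Y})$ depends on the distribution of $\mathbf{Y}$ only through its correlation matrix.

```latex
\begin{align*}
\ln f_{\mathbf{Y}^*}(\mathbf{y})
  &= -\tfrac{1}{2} \ln\big((2\pi)^n |K|\big)
     - \tfrac{1}{2}\, \mathbf{y}^\top K^{-1} \mathbf{y}, \\
E\big[\ln f_{\mathbf{Y}^*}(\mathbf{Y})\big]
  &= -\tfrac{1}{2} \ln\big((2\pi)^n |K|\big)
     - \tfrac{1}{2}\, \mathrm{tr}\big(K^{-1} E[\mathbf{Y}\mathbf{Y}^\top]\big) \\
  &= -\tfrac{1}{2} \ln\big((2\pi)^n |K|\big) - \tfrac{n}{2}
  \qquad \text{whenever } E[\mathbf{Y}\mathbf{Y}^\top] = K.
\end{align*}
```

Since $\mathbf{Y}^*$ also has correlation matrix $K$, both sides of the lemma equal this same value.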

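Theorem 2.50 Let

\[ p^*(x) = e^{-\lambda_0 - \sum_{i=1}^{m} \lambda_i r_i(x)} \]

for all $x \in S$, where $\lambda_0, \cdots, \lambda_m$ are chosen such that $p^*$ satisfies the constraints

\[ \sum_{x \in S_p} p(x) r_i(x) = a_i \quad \text{for } 1 \le i \le m. \tag{1} \]

Then $p^*$ maximizes $H(p)$ over all probability distributions $p$ on $S$ subject to (1).

Sketch of Proof

\[
\begin{aligned}
H(p^*) - H(p)
  &= -\sum_{x \in S} p^*(x) \ln p^*(x) + \sum_{x \in S_p} p(x) \ln p(x) \\
  &= -\sum_{x \in S_p} p(x) \ln p^*(x) + \sum_{x \in S_p} p(x) \ln p(x) \\
  &= \sum_{x \in S_p} p(x) \ln \frac{p(x)}{p^*(x)} \\
  &= D(p \| p^*) \\
  &\ge 0.
\end{aligned}
\]

Remark The key step is to establish that

\[ \sum_{x \in S} p^*(x) \ln p^*(x) = \sum_{x \in S_p} p(x) \ln p^*(x). \]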
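The key step can be checked on a small example. The sketch below assumes the alphabet $S = \{0, 1, 2\}$ and a single constraint function $r_1(x) = x$ (a mean constraint); the value $\lambda_1 = \ln 2$ and the competing distribution `p` are illustrative choices, not taken from the slides. Because $\ln p^*(x)$ is affine in $r_1(x)$, any `p` with the same mean satisfies $H(p^*) - H(p) = D(p \| p^*)$.

```python
import math

def entropy(p):
    """Shannon entropy in nats."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0.0)

def divergence(p, q):
    """Informational divergence D(p||q) in nats (q must be positive where p is)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def max_entropy_pmf(lam1, support):
    """p*(x) = exp(-lam0 - lam1 * x), with lam0 fixed by normalization."""
    weights = [math.exp(-lam1 * x) for x in support]
    total = sum(weights)                 # equals exp(lam0)
    return [w / total for w in weights]

def mean(p, support):
    return sum(pi * x for pi, x in zip(p, support))

support = [0, 1, 2]
p_star = max_entropy_pmf(math.log(2.0), support)   # [4/7, 2/7, 1/7], mean 4/7
p = [1.0 / 2.0, 3.0 / 7.0, 1.0 / 14.0]             # another pmf with mean 4/7

if __name__ == "__main__":
    print("means:", mean(p_star, support), mean(p, support))
    print("H(p*) - H(p):", entropy(p_star) - entropy(p))
    print("D(p || p*):  ", divergence(p, p_star))
```

The two printed quantities on the last two lines should agree up to rounding, and both are nonnegative.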

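Theorem 10.41 Let

\[ f^*(\mathbf{x}) = e^{-\lambda_0 - \sum_{i=1}^{m} \lambda_i r_i(\mathbf{x})} \]

for all $\mathbf{x} \in S$, where $\lambda_0, \cdots, \lambda_m$ are chosen such that $f^*$ satisfies the constraints

\[ \int_{S_f} r_i(\mathbf{x}) f(\mathbf{x}) \, d\mathbf{x} = a_i \quad \text{for } 1 \le i \le m. \tag{2} \]

Then $f^*$ maximizes $h(f)$ over all pdf $f$ defined on $S$, subject to the constraints in (2).

Remark The key step is to establish that

\[ \int_{S} f^*(\mathbf{x}) \ln f^*(\mathbf{x}) \, d\mathbf{x} = \int_{S_f} f(\mathbf{x}) \ln f^*(\mathbf{x}) \, d\mathbf{x}. \tag{3} \]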

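Theorem 10.45 Let $\mathbf{X}$ be a vector of $n$ continuous random variables with correlation matrix $\tilde{K}$. Then

\[ h(\mathbf{X}) \le \frac{1}{2} \log \left[ (2\pi e)^n |\tilde{K}| \right] \]

with equality if and only if $\mathbf{X} \sim \mathcal{N}(0, \tilde{K})$.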
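A small scalar check of this bound, as a sketch: the helper names below are hypothetical, entropies are in nats, and the uniform distribution is only an illustrative choice of a non-Gaussian pdf. For $n = 1$ the determinant $|\tilde{K}|$ is just the second moment $E[X^2]$.

```python
import math

def entropy_upper_bound(n, det_corr):
    """Right-hand side of the bound: (1/2) ln[(2*pi*e)^n |K~|], in nats."""
    return 0.5 * math.log((2.0 * math.pi * math.e) ** n * det_corr)

def uniform_entropy(lo, hi):
    """Differential entropy of the uniform distribution on [lo, hi], in nats."""
    return math.log(hi - lo)

def uniform_second_moment(lo, hi):
    """E[X^2] for X uniform on [lo, hi]."""
    return (hi ** 3 - lo ** 3) / (3.0 * (hi - lo))

def gaussian_entropy(variance):
    """Differential entropy of N(0, variance), in nats."""
    return 0.5 * math.log(2.0 * math.pi * math.e * variance)

if __name__ == "__main__":
    for lo, hi in [(-1.0, 1.0), (0.0, 2.0)]:
        bound = entropy_upper_bound(1, uniform_second_moment(lo, hi))
        print((lo, hi), uniform_entropy(lo, hi), "<=", bound)
    # Equality case: a zero-mean Gaussian meets the bound.
    print(gaussian_entropy(2.0), "==", entropy_upper_bound(1, 2.0))
```

The second interval is not centered at zero, which illustrates that the bound is stated in terms of the correlation matrix rather than the covariance matrix.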

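Then (3) and Theorem 10.45 together imply Lemma 11.34: for $\mathbf{Y}^* \sim \mathcal{N}(0, K)$ and any random vector $\mathbf{Y}$ with correlation matrix $K$,

\[ \int f_{\mathbf{Y}^*}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y} = \int_{S_{\mathbf{Y}}} f_{\mathbf{Y}}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y}. \]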

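Data Processing Inequality for Informational Divergence

Theorem Let $X$, $X'$, $Y$, and $Y'$ be real random variables such that $f_{XY}$ and $f_{X'Y'}$ exist, and $f_{Y|X} = f_{Y'|X'}$. Then

\[ D(f_X \| f_{X'}) \ge D(f_Y \| f_{Y'}). \]

Proof Exercise.

[Figure: $X$ and $X'$ are each passed through the same channel $f_{Y|X}$, producing $Y$ and $Y'$ respectively.]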
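To get a feel for the statement, here is a sketch of the discrete analogue, with sums in place of integrals and a transition matrix `W` in place of $f_{Y|X}$. The theorem above is stated for real random variables with pdfs, so this is an illustration under that substitution, not a proof; the distributions `p`, `q` and the matrix `W` are made-up values.

```python
import math

def divergence(p, q):
    """Informational divergence D(p||q) in nats (q must be positive where p is)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def output_pmf(p, W):
    """Distribution of Y when X ~ p and W[i][j] = Pr{Y = j | X = i}."""
    return [sum(p[i] * W[i][j] for i in range(len(p))) for j in range(len(W[0]))]

p = [0.7, 0.2, 0.1]          # distribution of X
q = [0.2, 0.3, 0.5]          # distribution of X'
W = [[0.9, 0.1],             # the same channel is applied to X and to X'
     [0.5, 0.5],
     [0.2, 0.8]]

if __name__ == "__main__":
    d_in = divergence(p, q)
    d_out = divergence(output_pmf(p, W), output_pmf(q, W))
    print("D(p_X || p_X'):", d_in)
    print("D(p_Y || p_Y'):", d_out)
```

Passing both inputs through the same channel should not increase the divergence, so the second printed value should not exceed the first.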

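Proof of Theorem 11.32

1. Since $E\mathbf{Z}^* = 0$, $\tilde{K}_{\mathbf{Z}^*} = K_{\mathbf{Z}^*} = K$. Therefore, $\mathbf{Z}^*$ and $\mathbf{Z}$ have the same correlation matrix.

2. By noting that $\mathbf{X}^*$ has zero mean, we apply Lemma 11.33 to see that $\mathbf{Y}^*$ and $\mathbf{Y}$ have the same correlation matrix.

3. The theorem is proved by considering

\[
\begin{aligned}
& I(\mathbf{X}^*; \mathbf{Y}^*) - I(\mathbf{X}^*; \mathbf{Y}) \\
&= h(\mathbf{Y}^*) - h(\mathbf{Z}^*) - h(\mathbf{Y}) + h(\mathbf{Z}) \\
&= -\int f_{\mathbf{Y}^*}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y} + \int f_{\mathbf{Z}^*}(\mathbf{z}) \log f_{\mathbf{Z}^*}(\mathbf{z}) \, d\mathbf{z} \\
&\quad + \int f_{\mathbf{Y}}(\mathbf{y}) \log f_{\mathbf{Y}}(\mathbf{y}) \, d\mathbf{y} - \int f_{\mathbf{Z}}(\mathbf{z}) \log f_{\mathbf{Z}}(\mathbf{z}) \, d\mathbf{z} \\
&= -\int f_{\mathbf{Y}}(\mathbf{y}) \log f_{\mathbf{Y}^*}(\mathbf{y}) \, d\mathbf{y} + \int f_{\mathbf{Z}}(\mathbf{z}) \log f_{\mathbf{Z}^*}(\mathbf{z}) \, d\mathbf{z} \\
&\quad + \int f_{\mathbf{Y}}(\mathbf{y}) \log f_{\mathbf{Y}}(\mathbf{y}) \, d\mathbf{y} - \int f_{\mathbf{Z}}(\mathbf{z}) \log f_{\mathbf{Z}}(\mathbf{z}) \, d\mathbf{z} \\
&= \int \log \left( \frac{f_{\mathbf{Y}}(\mathbf{y})}{f_{\mathbf{Y}^*}(\mathbf{y})} \right) f_{\mathbf{Y}}(\mathbf{y}) \, d\mathbf{y} - \int \log \left( \frac{f_{\mathbf{Z}}(\mathbf{z})}{f_{\mathbf{Z}^*}(\mathbf{z})} \right) f_{\mathbf{Z}}(\mathbf{z}) \, d\mathbf{z} \\
&= D(f_{\mathbf{Y}} \| f_{\mathbf{Y}^*}) - D(f_{\mathbf{Z}} \| f_{\mathbf{Z}^*}) \\
&\le 0.
\end{aligned}
\]

The third equality follows from Lemma 11.34, applied to $\mathbf{Y}$ (by step 2) and to $\mathbf{Z}$ (by step 1). The last step follows from the data processing inequality for informational divergence: both systems add the same independent input $\mathbf{X}^*$ to the noise, so $f_{\mathbf{Y}|\mathbf{Z}} = f_{\mathbf{Y}^*|\mathbf{Z}^*}$.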
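The chain of equalities in step 3 can be verified in closed form for one simple alternative system. The sketch below assumes a scalar channel with $X^* \sim \mathcal{N}(0, P)$ and noise $Z \sim \mathcal{N}(m, K - m^2)$, which is Gaussian but not zero-mean and has the same second moment $K$ as $Z^* \sim \mathcal{N}(0, K)$. The function names and parameter values are illustrative assumptions; all quantities are in nats.

```python
import math

def kl_gauss(m1, v1, m0, v0):
    """D(N(m1, v1) || N(m0, v0)) for scalar Gaussians, in nats."""
    return 0.5 * (math.log(v0 / v1) + (v1 + (m1 - m0) ** 2) / v0 - 1.0)

def compare(P, K, m):
    """Return (I(X*;Y*) - I(X*;Y), D(f_Y||f_Y*) - D(f_Z||f_Z*))."""
    s2 = K - m * m                      # variance of Z, so that E[Z^2] = K
    if s2 <= 0.0:
        raise ValueError("need m^2 < K")
    i_star = 0.5 * math.log(1.0 + P / K)    # Z* ~ N(0, K)
    i_alt = 0.5 * math.log(1.0 + P / s2)    # Z  ~ N(m, s2)
    d_y = kl_gauss(m, P + s2, 0.0, P + K)   # Y ~ N(m, P+s2), Y* ~ N(0, P+K)
    d_z = kl_gauss(m, s2, 0.0, K)
    return i_star - i_alt, d_y - d_z

if __name__ == "__main__":
    lhs, rhs = compare(P=2.0, K=1.0, m=0.6)
    print("I(X*;Y*) - I(X*;Y)        :", lhs)
    print("D(f_Y||f_Y*) - D(f_Z||f_Z*):", rhs)
```

Both printed values should coincide and be negative, in line with the conclusion that the zero-mean Gaussian noise gives the smaller mutual information.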
