
How to Solve Gaussian Interference Channel

WPI, HKU, 2019

Fan Cheng

Shanghai Jiao Tong University

chengfan@sjtu.edu.cn

โ„Ž ๐‘‹ + ๐‘ก๐‘

๐œ•2

2๐œ•๐‘ฅ2๐‘“ ๐‘ฅ, ๐‘ก =

๐œ•

๐œ•๐‘ก๐‘“(๐‘ฅ, ๐‘ก)

โ„Ž ๐‘‹ = โˆ’โˆซ ๐‘ฅlog๐‘ฅ d๐‘ฅ๐‘Œ = ๐‘‹ + ๐‘ก๐‘๐‘ โˆผ ๐’ฉ(0,1)

□ A new mathematical theory on the Gaussian distribution
□ Its application to the Gaussian interference channel
□ History, progress, and future

□ History of the "Super-H" Theorem
□ Boltzmann equation, heat equation
□ Shannon Entropy Power Inequality
□ Complete Monotonicity Conjecture
□ How to Solve Gaussian Interference Channel

Outline

Fire and Civilization

Drill, steam engine

James Watt

Myth: west and east

Independence of the US

The Wealth of Nations

1776

Study of Heat

Heat transfer
□ The history begins with the work of Joseph Fourier around 1807
□ In a remarkable memoir, Fourier invented both the heat equation and the method of Fourier analysis for its solution

$\frac{\partial}{\partial t} f(x,t) = \frac{1}{2}\frac{\partial^2}{\partial x^2} f(x,t)$

Information Age

Gaussian Channel: $Z_t \sim \mathcal{N}(0, t)$

$X$ and $Z_t$ are mutually independent. The p.d.f. of $X$ is $g(x)$.

$Y_t := X + Z_t$; the p.d.f. of $Y_t$ is the convolution of the p.d.f.s of $X$ and $Z_t$.

The probability density function (p.d.f.) of $Y_t$:

$f(y;t) = \int g(x)\,\frac{1}{\sqrt{2\pi t}}\,e^{-\frac{(y-x)^2}{2t}}\,\mathrm{d}x$

$\frac{\partial}{\partial t} f(y;t) = \frac{1}{2}\frac{\partial^2}{\partial y^2} f(y;t)$

The p.d.f. of Y is the solution to the heat equation, and vice versa.

Gaussian channel and heat equation are identical in mathematics.
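The equivalence can be checked numerically. Below is a minimal sketch (not from the slides): the input density g, the grid, and the helper smoothed_pdf are illustrative choices; it convolves g with the $\mathcal{N}(0,t)$ kernel and compares a finite-difference $\partial f/\partial t$ against $\frac{1}{2}\,\partial^2 f/\partial y^2$.

```python
# Sanity check: the p.d.f. of Y_t = X + Z_t, Z_t ~ N(0, t), satisfies the
# heat equation  df/dt = (1/2) d^2 f / dy^2  (illustrative parameters only).
import numpy as np

def smoothed_pdf(g, y, t):
    """Approximate f(y; t) = integral of g(x) * N(y - x; 0, t) dx on the grid y."""
    dx = y[1] - y[0]
    kernel = np.exp(-(y[:, None] - y[None, :])**2 / (2 * t)) / np.sqrt(2 * np.pi * t)
    return kernel @ (g(y) * dx)

# An arbitrary input density g(x): an equal mixture of N(-2, 1) and N(2, 1).
g = lambda x: 0.5 * (np.exp(-(x - 2)**2 / 2) + np.exp(-(x + 2)**2 / 2)) / np.sqrt(2 * np.pi)

y = np.linspace(-15, 15, 2001)
dy, t, dt = y[1] - y[0], 1.0, 1e-4

f0, f1 = smoothed_pdf(g, y, t), smoothed_pdf(g, y, t + dt)
lhs = (f1 - f0) / dt                              # d f / d t  (forward difference)
rhs = 0.5 * np.gradient(np.gradient(f0, dy), dy)  # (1/2) d^2 f / d y^2
print(np.max(np.abs(lhs - rhs)))  # small (limited only by the grid resolution)
```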

C. E. Shannon, "A Mathematical Theory of Communication," Bell System Technical Journal, 1948.

Ludwig Boltzmann

Boltzmann formula: $S = k_B \ln W$

Gibbs formula: $S = -k_B \sum_i p_i \ln p_i$

Boltzmann equation: $\frac{\mathrm{d}f}{\mathrm{d}t} = \left(\frac{\partial f}{\partial t}\right)_{\mathrm{force}} + \left(\frac{\partial f}{\partial t}\right)_{\mathrm{diff}} + \left(\frac{\partial f}{\partial t}\right)_{\mathrm{coll}}$

H-theorem: $H(f(t))$ is non-decreasing

Ludwig Eduard Boltzmann
1844-1906
Vienna, Austrian Empire

□ McKean's problem on the Boltzmann equation (1966): $H(f(t))$ is CM in $t$ when $f(t)$ satisfies the Boltzmann equation
□ False, disproved by E. Lieb in the 1970s
□ For the particular Bobylev-Krook-Wu explicit solutions, this "theorem" holds true for $n \le 101$ and breaks down afterwards

"Super H-theorem" for Boltzmann Equation

H. P. McKean, NYU.

National Academy of Sciences

A function is completely monotone (CM) iff the signs of its derivatives alternate: +, -, +, -, ... (e.g., $1/t$, $e^{-t}$)
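A tiny symbolic check of this definition (not from the slides; requires sympy, and the evaluation point $t=3$ is arbitrary):

```python
# The two textbook examples 1/t and exp(-t) are completely monotone on t > 0:
# their successive derivatives alternate in sign, +, -, +, -, ...
import sympy as sp

t = sp.symbols('t', positive=True)
for f in (1 / t, sp.exp(-t)):
    signs = [sp.sign(sp.diff(f, t, n).subs(t, 3)) for n in range(6)]
    print(f, signs)   # both print [1, -1, 1, -1, 1, -1]
```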

□ Heat equation: is $H(f(t))$ CM in $t$ if $f(t)$ satisfies the heat equation?
□ Equivalently, is $h(X + \sqrt{t}\,Z)$ CM in $t$?
□ The signs of the first two derivatives were obtained
□ Failed to obtain the 3rd and 4th (it is easy to compute the derivatives; it is hard to obtain their signs)

"Super H-theorem" for Heat Equation

"This suggests that......, etc., but I could not prove it"

-- H. P. McKean

C. Villani, 2010 Fields Medalist

Claude E. Shannon and EPI

□ Entropy power inequality (Shannon 1948): for any two independent continuous random variables $X$ and $Y$,

$e^{2h(X+Y)} \ge e^{2h(X)} + e^{2h(Y)}$

Equality holds iff $X$ and $Y$ are Gaussian
□ Motivation: Gaussian noise is the worst noise
□ Impact: a new characterization of the Gaussian distribution in information theory
□ Comments: most profound! (Kolmogorov)
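A small numerical illustration of the EPI and its equality case (not from the slides; the helper epi_terms and the chosen distributions are illustrative, entropies in nats):

```python
import math

def epi_terms(h_x, h_y, h_sum):
    """Return (exp(2 h(X+Y)), exp(2 h(X)) + exp(2 h(Y))); EPI says left >= right."""
    return math.exp(2 * h_sum), math.exp(2 * h_x) + math.exp(2 * h_y)

# 1) X ~ N(0, 1), Y ~ N(0, 4): h = 0.5*ln(2*pi*e*var), and X + Y ~ N(0, 5).
h = lambda var: 0.5 * math.log(2 * math.pi * math.e * var)
print(epi_terms(h(1), h(4), h(5)))   # two sides equal: EPI is tight for Gaussians

# 2) X, Y ~ Uniform(0, 1): h = 0; X + Y is triangular on [0, 2] with h = 1/2.
print(epi_terms(0.0, 0.0, 0.5))      # (e, 2): strict inequality for non-Gaussians
```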

Central limit theorem
Capacity region of the Gaussian broadcast channel
Capacity region of the Gaussian Multiple-Input Multiple-Output broadcast channel
Uncertainty principle

All of them can be proved by the Entropy Power Inequality (EPI).

□ Shannon himself didn't give a proof but an explanation, which turned out to be wrong

□ The first proofs were given by A. J. Stam (1959) and N. M. Blachman (1966)

□ Research on EPI: generalizations, new proofs, new connections. E.g., the Gaussian interference channel is open; some stronger "EPI" should exist

□ Stanford information theory school: Thomas Cover and his students: A. El Gamal, M. H. Costa, A. Dembo, A. Barron (1980-1990)

□ After 2000, Princeton and UC Berkeley

Entropy Power Inequality

Heart of Shannon theory

Ramification of EPI

Shannon EPI

Gaussian perturbation: $h(X + \sqrt{t}\,Z)$

Fisher information: $I(X + \sqrt{t}\,Z) = 2\,\frac{\partial}{\partial t} h(X + \sqrt{t}\,Z)$

Fisher information is decreasing in $t$

$e^{2h(X + \sqrt{t}\,Z)}$ is concave in $t$

Fisher information inequality (FII): $\frac{1}{I(X+Y)} \ge \frac{1}{I(X)} + \frac{1}{I(Y)}$

Tight Young's inequality: $\|X+Y\|_r \ge c\,\|X\|_p\,\|Y\|_q$

Status quo: FII can imply EPI and all its generalizations.

Many network information problems remain open even when the noise is Gaussian.

-- EPI alone is not sufficient

Where our journey begins

Shannon entropy power inequality

Fisher information inequality

$h(X + \sqrt{t}\,Z)$

$H(f(t))$ is CM:

When $f(t)$ satisfies the Boltzmann equation: disproved

When $f(t)$ satisfies the heat equation: unknown

We don't even know what CM is!

Mathematicians ignored it

Raymond introduced this paper to me in 2008

I made some progress with Chandra Nair in 2011 (MGL)

Complete monotonicity (CM) was discovered in 2012

The third derivative in 2013 (Key breakthrough)

The fourth order in 2014

Recently, CM && GIC

Motivation

Motivation: to find some inequalities to obtain a better rate region; e.g., the convexity of $h(X + e^{-t}Z)$, the concavity of $\frac{I(X + \sqrt{t}\,Z)}{t}$, etc.

"Any progress?"

"Nope..."

It is widely believed that there should be no

new EPI except Shannon EPI and FII.

Observation: $I(X + \sqrt{t}\,Z)$ is convex in $t$

$I(X + \sqrt{t}\,Z) = 2\,\frac{\partial}{\partial t} h(X + \sqrt{t}\,Z) \ge 0$ (de Bruijn, 1958)

$I^{(1)} = \frac{\partial}{\partial t} I(X + \sqrt{t}\,Z) \le 0$ (McKean 1966, Costa 1985)

Could the third one be determined?
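The convexity can at least be probed numerically. A minimal sketch (not from the slides; the Gaussian-mixture input, grid, and step in $t$ are my own choices) computes $I(Y_t)$ for $Y_t = X + \sqrt{t}\,Z$ and checks the first three signs $+, -, +$ by finite differences:

```python
# For a Gaussian-mixture X, I(Y_t) should be positive, decreasing, and convex in t.
import numpy as np

y = np.linspace(-20, 20, 4001)
dy = y[1] - y[0]

def pdf(t):
    """p.d.f. of Y_t when X is an equal mixture of N(-3, 1) and N(3, 1)."""
    s2 = 1.0 + t  # each component of X becomes N(+/-3, 1 + t) after perturbation
    return 0.5 * (np.exp(-(y - 3)**2 / (2 * s2)) + np.exp(-(y + 3)**2 / (2 * s2))) \
           / np.sqrt(2 * np.pi * s2)

def fisher(t):
    f = pdf(t)
    fp = np.gradient(f, dy)
    return np.sum(fp**2 / f) * dy   # I(Y_t) = integral of (f')^2 / f

ts = np.arange(0.5, 3.0, 0.5)
I = np.array([fisher(t) for t in ts])
print(I)                    # positive
print(np.diff(I))           # negative: I is decreasing in t
print(np.diff(I, n=2))      # positive: I is convex in t
```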

Discovery

Observation: $I(X + \sqrt{t}\,Z)$ is convex in $t$

โ„Ž ๐‘‹ + ๐‘ก๐‘ =1

2ln 2๐œ‹๐‘’๐‘ก, ๐ผ ๐‘‹ + ๐‘ก๐‘ =

1

๐‘ก. ๐ผ is CM: +, -, +, -โ€ฆ

If the observation is true, the first three derivatives are: +, -, +

Q: Is the 4th-order derivative negative? Because $Z$ is Gaussian! If so, then...

The signs of the derivatives of $h(X + \sqrt{t}\,Z)$ are independent of $X$. Invariant!

Exactly the same problem in McKeanโ€™s 1966 paper

To convince people, must prove its convexity

My own opinion:

• A new fundamental result on the Gaussian distribution

• Invariants are very important in mathematics

• In mathematics, the more beautiful, the more powerful

• Very hard to make any progress

Challenge

Let ๐‘‹ โˆผ ๐‘”(๐‘ฅ)

โ„Ž ๐‘Œ๐‘ก = โˆ’โˆซ ๐‘“(๐‘ฆ, ๐‘ก) ln ๐‘“(๐‘ฆ, ๐‘ก) ๐‘‘๐‘ฆ: no closed-form expression

except for some special ๐‘” ๐‘ฅ . ๐‘“(๐‘ฆ, ๐‘ก) satisfies heat equation.

๐ผ ๐‘Œ๐‘ก = โˆซ๐‘“12

๐‘“๐‘‘๐‘ฆ

๐ผ 1 ๐‘Œ๐‘ก = โˆ’โˆซ๐‘“2

๐‘“โˆ’

๐‘“12

๐‘“2

2

๐‘‘๐‘ฆ

So what is ๐ผ(2)? (Heat equation, integration by parts)
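A quick closed-form sanity check of these formulas (not from the slides): take $X \sim \mathcal{N}(0,\sigma^2)$, so $f(y,t)$ is the $\mathcal{N}(0,\sigma^2+t)$ density. Then

$I(Y_t) = \int \frac{f_1^2}{f}\,\mathrm{d}y = \frac{1}{\sigma^2+t}, \qquad I^{(1)}(Y_t) = -\int f\left(\frac{f_2}{f} - \frac{f_1^2}{f^2}\right)^2 \mathrm{d}y = -\frac{1}{(\sigma^2+t)^2},$

which is exactly $\frac{\mathrm{d}}{\mathrm{d}t}\,\frac{1}{\sigma^2+t}$. For a general $g(x)$ no such closed form is available, which is the whole difficulty.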

Challenge (contโ€™d)

It is trivial to calculate derivatives.

It is not generally obvious to prove their signs.


Breakthrough

Integration by parts: $\int u\,\mathrm{d}v = uv - \int v\,\mathrm{d}u$

First breakthrough since

McKean 1966

GCMC

Gaussian complete monotonicity conjecture (GCMC): $I(X + \sqrt{t}\,Z)$ is CM in $t$

A general form of the derivatives involves number partitions; it is hard to determine the coefficients.

Conjecture 2: $\log I(X + \sqrt{t}\,Z)$ is convex in $t$

Hard to find the coefficients $\beta_{k,j}$!

Moreover

C. Villani showed the work of H. P. McKean to us.

G. Toscani cited our work within two weeks: the evolution of the entropy and of its subsequent derivatives along the solution to the heat equation has important consequences; indeed, McKean's argument about the signs of the first two derivatives is equivalent to a proof of the logarithmic Sobolev inequality.

Gaussian optimality for derivatives of differential entropy using linear matrix inequalities

X. Zhang, V. Anantharam, Y. Geng - Entropy, 2018 - mdpi.com

• A new method to prove the signs by LMI

• Verified the first four derivatives

• For the fifth-order derivative, current methods cannot find a solution

Complete monotone function

Herbert R. Stahl, 2013

$f(t) = \int_0^\infty e^{-tx}\,\mathrm{d}\mu(x)$ (Bernstein's theorem: every CM function is the Laplace transform of a nonnegative measure $\mu$)

A new expression for entropy involving special functions in mathematical physics

How to construct $\mu(x)$?
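For the Gaussian case the measure is explicit (a worked example, not on the slides): with $X \sim \mathcal{N}(0,\sigma^2)$,

$I(X + \sqrt{t}\,Z) = \frac{1}{\sigma^2+t} = \int_0^\infty e^{-tx}\,e^{-\sigma^2 x}\,\mathrm{d}x, \qquad \text{i.e., } \mathrm{d}\mu(x) = e^{-\sigma^2 x}\,\mathrm{d}x.$

For a general input $X$, constructing $\mu$ is exactly the open problem.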


Complete monotone function

Theorem: if a function $f(t)$ is CM in $t$, then $\log f(t)$ is also convex in $t$. Hence if $I(Y_t)$ is CM in $t$, then $\log I(Y_t)$ is convex in $t$ (Conjecture 1 implies Conjecture 2).

If $f(t)$ is CM, a Schur-convex function can be obtained from $f(t)$. Schur-convexity → majorization theory.
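A one-line argument for the first claim (standard, not on the slides): by the Bernstein representation $f(t) = \int_0^\infty e^{-tx}\,\mathrm{d}\mu(x)$, the Cauchy-Schwarz inequality gives

$f'(t)^2 = \left(\int_0^\infty x\,e^{-tx}\,\mathrm{d}\mu(x)\right)^2 \le \int_0^\infty e^{-tx}\,\mathrm{d}\mu(x)\int_0^\infty x^2 e^{-tx}\,\mathrm{d}\mu(x) = f(t)\,f''(t),$

so $(\log f)'' = \frac{f'' f - (f')^2}{f^2} \ge 0$, i.e., $\log f$ is convex.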

Remarks: The current tools in information theory don't work. More sophisticated tools should be built to attack this problem.

A new mathematical foundation of

information theory

1946

True Vs. False

If GCMC is true:
• A fundamental breakthrough in mathematical physics, information theory, and any discipline related to the Gaussian distribution
• A new expression for Fisher information
• Derivatives are an invariant
• Though $h(X + \sqrt{t}\,Z)$ looks very messy, certain regularity exists
• Application: Gaussian interference channel?

If GCMC is false:
• No failure, as the heat equation is a physical phenomenon
• A "Gauss constant" (e.g., 2019) where the Gaussian distribution fails. Painful!

Complete Monotonicity:

How to Solve Gaussian Interference Channel

Two fundamental channel coding problems: BC and GIC

$h(aX_1 + bX_2 + Z_1)$ and $h(cX_1 + dX_2 + Z_2)$ exceed the capability of EPI

Han-Kobayashi inner bound

Many researchers have contributed to this model

Foundation of wireless communication

The Thick Shell over $h(X + \sqrt{t}\,Z)$

โ„Ž(๐‘‹ + ๐‘ก๐‘) is hard to estimate:

The p.d.f of ๐‘‹ + ๐‘ก๐‘ is messy

๐‘“ ๐‘ฅ log ๐‘“(๐‘ฅ) โˆซ ๐‘“ ๐‘ฅ log๐‘“(๐‘ฅ)No generally useful lower or upper bounds

--The thick shell over ๐‘‹ + ๐‘ก๐‘

Analysis: alternating is the worst

If the CM property of $h(X + \sqrt{t}\,Z)$ is not true: take 5 for example; if CM breaks down after $n = 5$ and we just take the 5th derivative, there may be nothing special (so GIC won't be so hard). CM affects the rate region of GIC.

Prof. Siu, Yum-Tong: "Alternating is the worst thing in analysis as the integral is hard to converge, though CM is very beautiful." It is not strange that the Gaussian distribution is the worst in information theory.

Common viewpoint: information theory is about information inequalities: EPI, MGL, etc.

CM is a class of inequalities. We should regard it as a whole in applications, and shift our viewpoint away from individual inequalities.

Information Decomposition

The lesson learned from complete monotonicity

๐ผ ๐‘‹ + ๐‘ก๐‘ = เถฑ0

โˆž

๐‘’โˆ’๐‘ก๐‘ฅ๐‘‘๐œ‡(๐‘ฅ)

Two independent components: $e^{-tx}$ stands for complete monotonicity; $\mathrm{d}\mu(x)$ serves as the identity of $I(X + \sqrt{t}\,Z)$

Information decomposition: Fisher information = complete monotonicity + Borel measure

CM is the thick shell. It can be used for estimates via majorization theory; very useful in analysis and geometry

$\mathrm{d}\mu(x)$ involves only $x$; $t$ is removed. The thick shell is removed from Fisher information

$\mathrm{d}\mu(x)$ is relatively easier to study than Fisher information, yet we know very little about $\mathrm{d}\mu(x)$

CM alone is useless for (network) information theory: the current constraints on $\mathrm{d}\mu(x)$ are too loose

Only the "special one" is useful; otherwise every CM function would have the same meaning in information theory

CM && GIC

A fundamental problem should have a nice and clean solution.

To understand complete monotonicity is not an easy job (10 years).

Top players are ready, but the football is missingโ€ฆ

Thanks!

Guangyue

Raymond, Chandra, Venkat, Vincent...
