mmse vector precoding

11
Signal Processing 87 (2007) 2823–2833 MMSE vector precoding with joint transmitter and receiver design for MIMO systems $ Feng Liu , Lingge Jiang, Chen He Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China Received 23 November 2006; received in revised form 28 May 2007; accepted 29 May 2007 Available online 7 June 2007 Abstract The transmitter side (Tx) vector precoding (VP) shows excellent performance. However, joint transmitter and receiver (Tx–Rx) design might achieve further improvement. In this paper, we propose a novel joint Tx–Rx VP for multiple input multiple output (MIMO) systems. A unitary matrix is used at the receiver to partially equalize the channel. We design the joint Tx–Rx VP with the minimum mean square error (MMSE) criterion and deduce the general closed-form solution. Then we apply several methods including the singular value decomposition, QR decomposition and geometric mean decomposition (GMD) to specify the general joint Tx–Rx VP design. Moreover, by using the extended channel, we achieve an improved scheme to obtain further performance gain. Simulation results show that the specification with GMD method outperforms the other specifications and the Tx MMSE VP. The improved joint Tx–Rx MMSE VP with GMD is superior to other MMSE VP schemes. r 2007 Elsevier B.V. All rights reserved. Keywords: MIMO systems; MMSE vector precoding; Joint transmitter and receiver design; Geometric mean decomposition 1. Introduction It has been shown that the multiple input multiple output (MIMO) systems achieve high capacity and better performance than the single-antenna systems [1–3]. If the channel state information (CSI) is available at the transmitter, precoding technique can be exploited to improve the system performance. The ‘‘vector perturbation technique’’ [4,5] is a new transmitter side (Tx) technique which gener- alizes the Tomlinson–Harashima precoding (THP) [6] for MIMO systems. It is also called as vector precoding (VP) by the following researchers. The key idea is to add a perturbation component to the data symbols, which shapes the symbols to satisfy a special goal. As in the THP, the perturbation symbol in VP is an integer scaled by a factor and can be eliminated by a modulo operation at the receiver. In contrast with the THP, the perturbation symbol in VP can be purposively designed. In [4,5] Tx zero forcing (ZF) VP and an improved regularized VP are proposed which minimizes the transmit energy. In [7] Tx minimum mean square error (MMSE) VP is designed to minimize the mean square error (MSE) instead of the transmit energy and achieves better performance ARTICLE IN PRESS www.elsevier.com/locate/sigpro 0165-1684/$ - see front matter r 2007 Elsevier B.V. All rights reserved. doi:10.1016/j.sigpro.2007.05.025 $ This research is supported by the Science and Technology Committee of Shanghai Municipality (under Grant No. 06DZ15013). Corresponding author. Tel.: +86 21 34204578 603. E-mail address: [email protected] (F. Liu).

Upload: cristian-camilo

Post on 25-Oct-2014

39 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: MMSE Vector Precoding

ARTICLE IN PRESS

0165-1684/$ - se

doi:10.1016/j.si

$This resear

Committee of

06DZ15013).�CorrespondE-mail addr

Signal Processing 87 (2007) 2823–2833

www.elsevier.com/locate/sigpro

MMSE vector precoding with joint transmitter and receiverdesign for MIMO systems$

Feng Liu�, Lingge Jiang, Chen He

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

Received 23 November 2006; received in revised form 28 May 2007; accepted 29 May 2007

Available online 7 June 2007

Abstract

The transmitter side (Tx) vector precoding (VP) shows excellent performance. However, joint transmitter and receiver

(Tx–Rx) design might achieve further improvement. In this paper, we propose a novel joint Tx–Rx VP for multiple input

multiple output (MIMO) systems. A unitary matrix is used at the receiver to partially equalize the channel. We design the

joint Tx–Rx VP with the minimum mean square error (MMSE) criterion and deduce the general closed-form solution.

Then we apply several methods including the singular value decomposition, QR decomposition and geometric mean

decomposition (GMD) to specify the general joint Tx–Rx VP design. Moreover, by using the extended channel, we achieve

an improved scheme to obtain further performance gain. Simulation results show that the specification with GMD method

outperforms the other specifications and the Tx MMSE VP. The improved joint Tx–Rx MMSE VP with GMD is superior

to other MMSE VP schemes.

r 2007 Elsevier B.V. All rights reserved.

Keywords: MIMO systems; MMSE vector precoding; Joint transmitter and receiver design; Geometric mean decomposition

1. Introduction

It has been shown that the multiple input multipleoutput (MIMO) systems achieve high capacity andbetter performance than the single-antenna systems[1–3]. If the channel state information (CSI) isavailable at the transmitter, precoding techniquecan be exploited to improve the system performance.

The ‘‘vector perturbation technique’’ [4,5] is anew transmitter side (Tx) technique which gener-

e front matter r 2007 Elsevier B.V. All rights reserved

gpro.2007.05.025

ch is supported by the Science and Technology

Shanghai Municipality (under Grant No.

ing author. Tel.: +8621 34204578 603.

ess: [email protected] (F. Liu).

alizes the Tomlinson–Harashima precoding (THP)[6] for MIMO systems. It is also called as vectorprecoding (VP) by the following researchers. Thekey idea is to add a perturbation component to thedata symbols, which shapes the symbols to satisfy aspecial goal. As in the THP, the perturbationsymbol in VP is an integer scaled by a factorand can be eliminated by a modulo operation atthe receiver. In contrast with the THP, theperturbation symbol in VP can be purposivelydesigned. In [4,5] Tx zero forcing (ZF) VP and animproved regularized VP are proposed whichminimizes the transmit energy. In [7] Tx minimummean square error (MMSE) VP is designed tominimize the mean square error (MSE) instead ofthe transmit energy and achieves better performance

.

Page 2: MMSE Vector Precoding

ARTICLE IN PRESSF. Liu et al. / Signal Processing 87 (2007) 2823–28332824

than the Tx ZF VP and the Tx regularized VP.All these Tx VP schemes significantly outperformconventional THP. Without the cooperation amongthe receiver antennas, these Tx VP schemes areadaptable to broadcast channels and point-to-pointchannel.

However, joint transmitter and receiver (Tx–Rx)design for VP might achieve further performanceimprovement. In this paper, we consider a simplescenario where only a unitary matrix is used at thereceiver. This design adapts to the point-to-pointchannel. With MMSE criterion, we deduce a closed-form solution of the joint Tx–Rx VP design. Wefind it generalizes the Tx MMSE VP. Then somespecifications of the general joint Tx–Rx VP areconsidered: by singular value decomposition (SVD),by QR decomposition (QRD), and by geometricmean decomposition (GMD) [8]. Then we proposean improved version of the joint Tx–Rx VP by usingthe extended channel. We compare the performanceof these specifications by computer simulations andfind that the specifications by GMD methodsignificantly outperform the Tx VP and the speci-fications by SVD and QRD methods. The improvedjoint Tx–Rx MMSE GMD VP is shown to besuperior to the other VP schemes and achievefurther diversity gain.

Notation: Throughout the paper, we will denotevectors and matrices by lower and upper case boldletters, respectively. We use E( � ), ( � )H, ( � )�1, andtr( � ) for expectation, conjugate transposition, theinverse, and the trace of a matrix, respectively. Theidentity matrix is denoted by I. The norm of avector or matrix is denoted as || � ||.

The rest of this paper is organized as follows.Section 2 introduces the system model. Subse-quently, we propose the joint Tx–Rx MMSE VPin Section 3. Simulation results are provided inSection 4. Finally, Section 5 concludes the paper.

2. System model

We consider a MIMO system with MT transmitand MR receive antennas in a frequency flat fadingchannel. A block of data symbols with length NB istransmitted at each channel instance. There is atotal power constraint PT for the transmitted block.Throughout the paper, we assume that perfect CSIis known at both the transmitter and receiver.

The signal model is expressed as

y½n� ¼ Hx½n� þ n½n�; n ¼ 1; 2; � � � ;NB, (1)

where y[n] is the nth MR� 1 symbol vector amongthe received block, x[n] the nth MT� 1 symbolvector among the transmitted block, n[n] thecorresponding nth MR� 1 vector of independentadditive white Gaussian noise with zero-mean andcovariance Rn ¼ s2nI, and H the MR�MT indepen-dent zero-mean unit-variance complex Gaussianchannel. We note that the more general frequency-selective channel can be represented by a spatial–temporal channel with a larger dimensionality.Hence, (1) is rather general.

VP schemes generate transmitted symbols x[n]as a function f (which will be derived later) of themix of data symbols s[n] and perturbation symbolsp[n] as

x½n� ¼ f d½n�ð Þ; n ¼ 1; 2; � � � ;NB, (2)

where d½n�9s½n� þ p½n� is the nth MR� 1 mixedsymbol vector. The perturbation vector p[n] ischosen from an integer lattice aZMR with an integerscalar a. At the receiver, a modulo operation(mod a) is used to remove the perturbation vectorp[n]. If square M-point Gray coded quadratureamplitude modulation (QAM) constellation

�1

2; � � � ;�

ffiffiffiffiffiffiMp� 1

2

� �2

with variance s2s ¼ ðM � 1Þ=6 is used, the scalar isset to be a ¼

ffiffiffiffiffiffiMp

.In this paper, we consider a simple scenario for

joint Tx–Rx design by using a unitary matrix AH atthe receiver to partially equalize the channel as

d½n� ¼ gAHy½n�; n ¼ 1; 2; � � � ;NB, (3)

where g is a power control factor. Finally, thesymbol is estimated after the modulo operation

s½n� ¼ d½n� mod a; n ¼ 1; 2; � � � ;NB. (4)

The proposed joint Tx–Rx VP design is demon-strated by Fig. 1.

3. Proposed schemes

In this section, we consider the problem of jointTx–Rx VP design with the MMSE criterion for theabove model. We first deduce the generalizedsolution of the problem. Then specializations basedon different methods are given. And then animproved version of the joint Tx–Rx VP designis proposed with the extended channel. Finally,we provide the performance analysis and somediscussion.

Page 3: MMSE Vector Precoding

ARTICLE IN PRESS

s[n]( )f ⋅

x[n]d[n]H gAH

n[n]

ˆ [n]d ˆ[n]s

Sphere

encoding

mod

MRpopt[n]∈�

Fig. 1. Demonstration of the proposed joint Tx–Rx VP design for MIMO systems.

F. Liu et al. / Signal Processing 87 (2007) 2823–2833 2825

3.1. Problem formulation and solution

Our goal is to find the optimal solution todetermine the perturbation vector p[n] AND thefunction f (i.e. the relationship between x[n] andd[n]). The power control factor g can also beincluded into this question. We choose the MMSEcriterion to design the joint Tx–Rx VP. Themeasurement is the total MSE of the differencebetween the mixed symbol block and its estimation,which is defined as

� p½n�;x½n�; gð Þ9XNB

n¼1

E d½n� � d½n��� ��2n o

, (5)

which can be computed as

�ðp½n�;x½n�; gÞ ¼XNB

n¼1

E gAHðHx½n� þ n½n�Þ � d½n�

�� ��2n o

¼XNB

n¼1

E gAHðHx½n� þ n½n�Þ � d½n�

� �Hn� gAH

ðHx½n� þ n½n�Þ � d½n�� ��

¼XNB

n¼1

g2xH½n�HHHx½n�

� gdH½n�AHHx½n� þ g2trðRnÞ

�gxH½n�HHAd½n� þ dH½n�d½n��. ð6Þ

Now the optimizing problem reads as

p½n�; x½n�; g �

¼ argmin � p½n�;x½n�; gð Þ

s:t:XNB

n¼1

xH½n�x½n� ¼ PT. ð7Þ

Theorem 1. The solution of the problem (7) is

given by

popt½n� ¼ argminp½n�2aZMR

LA s½n� þ p½n�ð Þ�� ��2,

n ¼ 1; 2; � � � ;NB, ð8Þ

x½n� ¼ g�1ðHHHþ xIÞ�1HHAd½n�

¼ g�1HHðHHH þ xIÞ�1Ad½n�; n ¼ 1; 2; � � � ;NB,

ð9Þ

g ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

dH½n�AHH HHH þ xI ��2

HHAd½n�

vuut

¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

dH½n�AHðHHH þ xIÞ�1HHHðHHH þ xIÞ�1Ad½n�

vuut ,

ð10Þ

where x ¼ NB trðRnÞ=PT, and we use the Cholesky

factorization ðHHH þ xIÞ�1 ¼ LHL to obtain the

triangular matrix L. The minimal MSE obtained by

the above optimal solution is given by

� ¼ xXNB

n¼1

dH½n�AHLHLAd½n�

¼ xXNB

n¼1

LA s½n� þ p½n�ð Þ�� ��2. ð11Þ

Proof. : see Appendix A.On comparing the above results with the Tx

MMSE VP [7], we can see if we set A ¼ I, the abovejoint Tx–Rx design turns into the Tx MMSE VP.Therefore, our joint Tx–Rx MMSE VP generalizesthe Tx MMSE VP. Furthermore, if the goal is tominimize the power control factor g, it turns intothe regularized VP discussed in [4,5], which isobviously inferior to the MMSE design.

3.2. Specifications of the general solution

In the former subsection, the matrix A at thereceiver can be any unitary. Thus, it gives a generalform of the joint Tx–Rx MMSE VP with a unitarymatrix at the receiver. However, it is difficult todetermine the optimal unitary matrix A. Here wespecify the general form by some existing methodsfor extra performance improvement. We consider

Page 4: MMSE Vector Precoding

ARTICLE IN PRESSF. Liu et al. / Signal Processing 87 (2007) 2823–28332826

the SVD, QRD, and GMD [8] methods. Thesemethods decompose the channel H into the follow-ing general form:

H ¼ QRPH, (12)

where Q and P are unitary matrices. For the SVDmethod, R is a real diagonal matrix with singularvalues of H. For the QRD method, P ¼ I, and R isan upper triangular matrix. For the GMD method,R is a real upper triangular matrix with diagonalelements all equal to the geometric mean of thepositive singular values of H. We note the TxMMSE VP can be viewed as a special case of theabove general form where Q ¼ I, R ¼ RH

0 , andP ¼ Q0 using the QRD: HH ¼ Q0R0.

We choose the unitary matrix A ¼ Q to partiallyequalize the channel. Now, the joint Tx–Rx MMSEVP can be specified as

x½n� ¼ g�1PðRHRþ xIÞ�1RHd½n�

¼ g�1PRHðRRH þ xIÞ�1d½n�; n ¼ 1; 2; � � � ;NB,

ð13Þ

g ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

dH½n�ðRRH þ xIÞ�1RRHðRRH þ xIÞ�1d½n�

vuut

¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

RHðRRH þ xIÞ�1d½n��� ��2

vuut , ð14Þ

� ¼ xXNB

n¼1

dH½n�ðRRH þ xIÞ�1d½n�. (15)

By the Cholesky factorization

ðRRH þ xIÞ�1 ¼ BHB (16)

we can further simplify Eq. (15) as

� ¼ xXNB

n¼1

dH½n�BHBd½n� ¼ xXNB

n¼1

B s½n� þ p½n�ð Þ�� ��2.

(17)

In order to minimize the total MSE, eachindependent item in Eq. (17) needs to be minimized.Now the optimal perturbation vector is chosen by

popt½n� ¼ argminp½n�2aZMR

B s½n� þ p½n�ð Þ�� ��2; n ¼ 1; 2; � � � ;NB,

(18)

which can be resolved by the closest-point searchalgorithm (sphere encoding).

3.3. Improved joint Tx– Rx MMSE VP with the

extended channel

The specification schemes in the previous subsec-tion is based on the SVD/QRD/GMD methods ofchannel H. From the literature, e.g. [9], we knowusing the extended channel He9½H sn=ssI� forprecoding can further improve the performance.Here, we develop an improved version of jointTx–Rx MMSE VP based on the extended channel.

Now the mentioned methods decompose theextended channel He as

He ¼ QeRePHe . (19)

The following steps are done as in subsection 3.2withQ,R, P replaced byQe,Re, Pe. We note that sinceHe has a dimension expansion, dimension reduction isneeded for correct matrix computation. Thus, only theproper upper-left parts of Qe, Re, Pe are used in theimproved joint Tx–Rx MMSE VP schemes.

3.4. Performance analysis

For the specifications with SVD, QRD, and GMDmethods, we provide some brief discussion. The SVDmethod gives a diagonal matrix R, while the QRDand GMD methods give a triangular matrix R. Sincethe sphere encoding considers interference cancella-tion to find the optimal perturbation vector, weknow from Eq. (18) that the sphere encoding cannotbenefit from a diagonal channel B. Therefore, theQRD and GMD specifications obviously outperformthe SVD specification. Moreover, the GMD methodgives a triangular matrix with equal diagonalelements, which eliminates the imbalance amongthe subchannel gains. Thus, the GMD specification issuperior to the QRD specification.

The authors of [8] propose two schemes (namedGMD and UCD) based on the GMD method forMIMO systems [10,11]. In fact, the GMD scheme [10]is a ZF THP based on the GMD of channel H, whilethe UCD scheme [11] is a MMSE THP with water-filling power allocation based on the GMD of theextended channel He. In [11], the authors prove thatthe GMD scheme has a diversity of order dGMD ¼

ðM �mþ 1Þm and the UCD scheme has a diversityof order dUCD ¼Mm, where M ¼ maxðMR;MTÞ

and m ¼ minðMR;MTÞ. The authors also point outthat the water filling power allocation does not helpimprove diversity gains. Therefore, we can expect ourjoint Tx–Rx MMSE VP with GMD has equaldiversity order as that of the GMD scheme and the

Page 5: MMSE Vector Precoding

ARTICLE IN PRESSF. Liu et al. / Signal Processing 87 (2007) 2823–2833 2827

improved joint Tx–RxMMSE VP with GMD has thesame diversity order as that of the UCD scheme.

Since the exact performance is difficult to analyze,we will use computer simulations to compare theseschemes in the next section.

3.5. Further remarks

In [12], the authors propose a joint precodingmatrix and perturbation vector design. From (12),we know that it can be regarded as a special case ofour design. Let Q ¼ I and the QRD: HH

¼ Q0R0,similar processing can be done. The difference is theoptimization of [12] includes the QRD-M algo-rithm. Compared with [12], our design even doesnot constrain the precoding matrix to be linear,although the linear structure is proved optimal. Wepoint out the QRD-M algorithm discussed in [12]can be further considered in our design to reduce thecomplexity of the sphere encoding, which is not thefocus of this paper.

4. Simulation results

The performance is measured in terms of theuncoded bit error rate (BER) over the equivalent

0 2 4 6 8 10 12 1

10-7

10-6

10-5

10-4

10-3

10-2

10-1

100

Eb/N

BE

R

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP

Tx-Rx MMSE VP by SVD

Improved Tx-Rx MMSE VP

Tx-Rx MMSE VP by QRD

Improved Tx-Rx MMSE VP

Fig. 2. Average uncoded BER performance comparison for the specifica

4-QAM for (2,2) MIMO systems.

received SNR which is defined as: Eb=N0 ¼

MRs2s=Rms2n, where Rm is the modulation rate (i.e.bits per modulated symbol).

4.1. Performance comparison for the different

specification schemes

We show the performance of the differentspecifications based on different methods in Figs.2 and 3 for a 2-by-2 MIMO system with 4-QAMand 16-QAM, respectively. We see that the specifi-cations with SVD and QRD methods performworse than the Tx MMSE VP. The specificationwith GMD method significantly outperformsthe Tx MMSE VP. In detail, there is about 2 and3 dB SNR gain for the Tx–Rx MMSE VP by GMDin comparison with the Tx MMSE VP atBER ¼ 10�5 for 4-QAM and 16-QAM, respec-tively. We attribute this to the property of GMDmethod, which eliminates the imbalance betweenthe gains of subchannels. By using the extendedchannel, the improved Tx–Rx MMSE VP by GMDhas further processing gain of about 4 dB comparedwith the Tx MMSE VP. Therefore, we only considerthe specification with GMD in the followingsimulation.

4 16 18 20 22 24 26 28

0 (dB)

by GMD

by SVD

by QRD

tions of the joint Tx–Rx MMSE VP with different methods using

Page 6: MMSE Vector Precoding

ARTICLE IN PRESS

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30

Eb/N

0 (dB)

10-5

10-4

10-3

10-2

10-1

100

BE

R

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP by GMD

Tx-Rx MMSE VP by SVD

Improved Tx-Rx MMSE VP by SVD

Tx-Rx MMSE VP by QRD

Improved Tx-Rx MMSE VP by QRD

Fig. 3. Average uncoded BER performance comparison for the specifications of the joint Tx–Rx MMSE VP with different methods using

16-QAM for (2,2) MIMO systems.

F. Liu et al. / Signal Processing 87 (2007) 2823–28332828

4.2. The performance comparison for the GMD-

based schemes

In this subsection, we compare the performanceof the GMD-based schemes, including the GMD,UCD, the joint Tx–Rx MMSE VP by GMD, andthe improved joint Tx–Rx MMSE VP by GMD. Wealso provide the Tx ZF VP in [5] and Tx MMSE VP[7] for comparison. Since the Tx regularized VP [5]is inferior to the Tx MMSE VP, we omit it in oursimulation.

Figs. 4 and 5 show the performance comparisonfor a 2-by-2 MIMO system with 4-QAM and16-QAM, respectively. We see the GMD schemeonly outperforms the Tx ZF VP, while the UCDscheme is superior to all others. The joint Tx–RxMMSE VP by GMD significantly outperforms theGMD scheme. Compared with the UCD scheme,the improved joint Tx–Rx MMSE VP by GMDshows little performance loss. As analyzed in theabove section, our joint Tx–Rx MMSE VP designdoes not consider power allocation, which leads tosome performance loss in comparison with theUCD scheme.

We give more simulation data to compare therelated schemes. In Figs. 6 and 7, the performance is

compared for a 3-by-3 MIMO system with 4-QAMand 16-QAM, respectively. Similar results hold asthe 2-by-2 MIMO systems. We see the specificationwith GMD still outperforms the Tx MMSE VP.The improved joint Tx–Rx MMSE VP by GMDhas about 2 dB SNR gain compared with the TxMMSE VP.

Furthermore, the slopes of these curves in highSNR region indicating the diversity order verify ouranalysis about the diversity order. From Figs. 4–7,we see the diversity orders of the GMD and jointTx–Rx MMSE VP with GMD schemes are equal,while the UCD and improved joint Tx–Rx MMSEVP by GMD schemes share the same diversityorder. Moreover, the Tx MMSE VP shows the samediversity order as the GMD, while the Tx ZF VPobtains low diversity order.

4.3. Explanation for the performance improvement of

the GMD-based schemes

The performance of VP schemes is difficult toanalyze theoretically. In this subsection, we try toexplain the performance improvement of theproposed GMD-based VP schemes by simulation.As mentioned before, the GMD method eliminates

Page 7: MMSE Vector Precoding

ARTICLE IN PRESS

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28

Eb/N

0 (dB)

GMD

UCD

Tx ZF VP

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP by GMD10-7

10-6

10-5

10-4

10-3

10-2

10-1

100

BE

R

Fig. 4. Average uncoded BER performance comparison for different schemes using 4-QAM for (2,2) MIMO systems.

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30

Eb/N

0 (dB)

GMD

UCD

Tx ZF VP

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP by GMD

10-7

10-6

10-5

10-4

10-3

10-2

10-1

100

BE

R

Fig. 5. Average uncoded BER performance comparison for different schemes using 16-QAM for (2,2) MIMO systems.

F. Liu et al. / Signal Processing 87 (2007) 2823–2833 2829

the difference of subchannel gains, which affects thesearching of the perturbation vectors. Looking atthe triangular matrices used in Eqs. (8) and (18) for

sphere encoding, we find that the difference of thediagonal elements of this triangular matrix playsan important role in searching the perturbation

Page 8: MMSE Vector Precoding

ARTICLE IN PRESS

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30

Eb/N

0 (dB)

GMD

UCD

Tx ZF VP

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP by GMD

10-5

10-6

10-4

10-3

10-2

10-1

100

BE

R

Fig. 6. Average uncoded BER performance comparison for different schemes using 4-QAM for (3,3) MIMO systems.

0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30

Eb/N

0 (dB)

GMD

UCD

Tx ZF VP

Tx MMSE VP

Tx-Rx MMSE VP by GMD

Improved Tx-Rx MMSE VP by GMD

10-5

10-6

10-7

10-4

10-3

10-2

10-1

100

BE

R

Fig. 7. Average uncoded BER performance comparison for different schemes using 16-QAM for (3,3) MIMO systems.

F. Liu et al. / Signal Processing 87 (2007) 2823–28332830

vectors. With computer simulation, we can see itexplicitly. Fig. 8 shows the average variance of thediagonal elements of the triangular matrix used forsphere encoding with 4-QAM, where A, B, and C

represent the Tx MMSE VP, the Tx–Rx MMSE VPby GMD, and the improved Tx–Rx MMSE VP byGMD, respectively. We compare the (2,2), (5,5),and (10,10) MIMO systems. From the left part of

Page 9: MMSE Vector Precoding

ARTICLE IN PRESS

0 10 20 30

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

Eb/N

0 (dB)

Variance

0 10 20 30

10-5

10-4

10-3

10-2

10-1

100

101

Eb/N

0 (dB)

Variance

A. (2,2)

B. (2,2)

C. (2,2)

A. (5,5)

B. (5,5)

C. (5,5)

A. (10,10)

B. (10,10)

C. (10,10)

Fig. 8. Comparison of average variance of the diagonal elements of the triangular matrix used for sphere encoding with 4-QAM. A

represents the Tx MMSE VP scheme, B represents the Tx–Rx MMSE VP by GMD scheme, and C represents the improved Tx–Rx MMSE

VP by GMD scheme. (2,2), (5,5), and (10,10) MIMO systems are considered. Left: linear Y-axis; right: logarithmic Y-axis.

F. Liu et al. / Signal Processing 87 (2007) 2823–2833 2831

Fig. 8 (using linear scale for Y-axis) we can see thatthe average variance of Tx MMSE VP significantlyincreases with the SNR, while the GMD-based VPschemes keep a very low level of average variance.Bigger variance means higher imbalance among thediagonal elements. This helps explain why theGMD-based VP schemes significantly outperformthe Tx MMSE VP in high SNR region. However, ifthe antenna number increases, the average varianceof Tx MMSE VP reduces. This can explain why theperformance improvement of the GMD-based VPschemes decreases as the number of antennasincreases compared with the Tx MMSE VP.Furthermore, we change the Y-axis into logarithmicscale to see it in detail as the right part of Fig. 8. Wefind that the average variance of the improvedTx–Rx MMSE VP by GMD is the smallest amongthe three schemes, which explains why the improvedTx–Rx MMSE VP by GMD has a superiorperformance.

5. Conclusion

We presented the joint transmitter and receiverdesign of VP with MMSE criterion for MIMO

systems, while the receiver is multiplied with aunitary matrix. This joint Tx–Rx VP MMSE designgeneralizes the transmitter side MMSE VP. Wededuced a closed-form solution with any unitarymatrix at the receiver. Then we used severalmethods to specify the general solution. We foundthe specification with GMD method improves theperformance due to the elimination of gain imbal-ance among subchannels of MIMO systems incomparison with the Tx MMSE VP. Moreover,we exploited the extended channel to obtain animproved version of the joint Tx–Rx MMSE VP.The improved joint Tx–Rx MMSE VP with GMDachieves further diversity gain and processing gain.Although our design still has little performance losscompared with the UCD scheme, it significantlyoutperforms the Tx MMSE VP. The simulationresults verified our design and analysis.

Acknowledgment

The authors are grateful to the anonymousreviewers for their helpful suggestions for improvingthe submitted manuscript.

Page 10: MMSE Vector Precoding

ARTICLE IN PRESSF. Liu et al. / Signal Processing 87 (2007) 2823–28332832

Appendix A. Proof of Theorem 1:

Proof. We use the Lagrangian multiplier method tosolve the question. The cost function is

L p½n�; x½n�; g; lð Þ ¼ � p½n�;x½n�; gð Þ

þ lXNB

n¼1

xH½n�x½n� � PT

!, ð20Þ

where l is the Lagrangian multiplier to be deter-mined. By setting the derivatives of Lðp½n�;x½n�; g; lÞto zeros with respect to x[n], g, and l, respectively,we get

qL

qx½n�¼ 0) g2xH½n�HHH� gdH½n�AHHþ lxH½n� ¼ 0,

(21)

qL

qg¼ 0)

XNB

n¼1

2gxH½n�HHHx½n� � dH½n�AHHx½n�

þ2g trðRnÞ � xH½n�HHAd½n��¼ 0, ð22Þ

qL

ql¼ 0)

XNB

n¼1

xH½n�x½n� � PT ¼ 0. (23)

Rearrange Eq. (21) to obtain

x½n� ¼ g�1 HHHþlg2

I

� �1HHAd½n�

¼ g�1HH HHH þlg2

I

� �1Ad½n�, ð24Þ

� ¼XNB

n¼1

dH½n�AHðHHH þ xIÞ�1HHHHHHðHHH þ xIÞ�1Ad½n� þ dH½n�d½n�

�dH½n�AHHHHðHHH þ xIÞ�1Ad½n� � dH½n�AHðHHH þ xIÞ�1HHHAd½n�

þxdH½n�AHðHHH þ xIÞ�1HHHðHHH þ xIÞ�1Ad½n�

8>><>>:

9>>=>>;

¼XNB

n¼1

dH½n�AH

ðHHH þ xIÞ�1HHHHHH HHH þ xI ��1

þ I

�HHHðHHH þ xIÞ�1 � ðHHH þ xIÞ�1HHH

þxðHHH þ xIÞ�1HHHðHHH þ xIÞ�1

8>>><>>>:

9>>>=>>>;Ad½n�

¼ xXNB

n¼1

dH½n�AHðHHH þ xIÞ�1Ad½n�, ð30Þ

and

gxH½n�HHHx½n� � dH½n�AHHx½n� þlgxH½n�x½n� ¼ 0,

gxH½n�HHHx½n� � xH½n�HHAd½n� þlgxH½n�x½n� ¼ 0.

ð25Þ

From Eq. (24) we prove that the optimal relation-ship between x[n] and d[n] is linear. Bring Eq. (25)into Eq. (22):

XNB

n¼1

�2lgxH½n�x½n� þ 2g trðRnÞ

� �¼ 0. (26)

Then we get

lg2¼

NB � trðRnÞPNB

n¼1

xH½n�x½n�

¼NB � trðRnÞ

PT9x. (27)

Thus Eq. (24) can be written as

x½n� ¼ g�1ðHHHþ xIÞ�1HHAd½n�

¼ g�1HHðHHH þ xIÞ�1Ad½n�. ð28Þ

Now by taking Eq. (28) into Eq. (23), we get

g ¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

dH½n�AHHðHHH þ xIÞ�2HHAd½n�

vuut

¼

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi1

PT

XNB

n¼1

dH½n�AHðHHH þ xIÞ�1HHHðHHH þ xIÞ�1Ad½n�

vuut .

ð29Þ

With Eqs. (28) and (29), the MSE expression (6) canbe computed

where the last step uses the fact

ðHHH þ xIÞ�1HHHHHHðHHH þ xIÞ�1

� Iþ I�HHHðHHH þ xIÞ�1

þ I� ðHHH þ xIÞ�1HHH

Page 11: MMSE Vector Precoding

ARTICLE IN PRESSF. Liu et al. / Signal Processing 87 (2007) 2823–2833 2833

þ xðHHH þ xIÞ�1HHHðHHH þ xIÞ�1

¼ ðHHH þ xIÞ�1HHHHHHðHHH þ xIÞ�1

� Iþ 2xðHHH þ xIÞ�1

þ xðHHH þ xIÞ�1HHHðHHH þ xIÞ�1

¼ ðHHH þ xIÞ�1HHHHHH��ðHHH þ xIÞ þ 2xI

�ðHHH þ xIÞ�1

þ xðHHH þ xIÞ�1HHHðHHH þ xIÞ�1

¼ ðHHH þ xIÞ�1HHH � I� �

HHH þ xI �� ðHHH þ xIÞ�1 þ xðHHH þ xIÞ�1

�HHHðHHH þ xIÞ�1

¼ x �ðHHH þ xIÞ�1HHH þ I �

ðHHH þ xIÞ�1

þ xðHHH þ xIÞ�1HHHðHHH þ xIÞ�1

¼ xðHHH þ xIÞ�1, ð31Þ

where the matrix inversion lemma is applied for.Then we use the Cholesky factorization

ðHHH þ xIÞ�1 ¼ LHL, (32)

so that the MSE expression (30) can be furthersimplified:

� ¼ xXNB

n¼1

dH½n�AHLHLAd½n�

¼ xXNB

n¼1

LA s½n� þ p½n�ð Þ�� ��2. ð33Þ

From Eq. (33), the optimal perturbation vectorminimizing the MSE can be effectively found by theclosest-point search algorithm (sphere encoding) as

popt½n� ¼ argminp½n�2aZMR

LA s½n� þ p½n�ð Þ�� ��2,

n ¼ 1; 2; � � � ;NB. ð34Þ

Thus, we complete the proof. &

References

[1] I.E. Teletar, Capacity of multi-antenna Gaussian channels,

AT&T Bell Labs Internal Tech. Memo, June 1995.

[2] A.J. Goldsmith, S.A. Jafar, N. Jindal, S. Vishwanath,

Capacity limits of MIMO channels, IEEE J. Select. Areas

Comm. 21 (June 2003) 684–702.

[3] P. Viswanath, D. Tse, Sum capacity of the vector

Gaussian broadcast and uplink–downlink duality, IEEE

Trans. Inform. Theory (August 2003) 1912–1921.

[4] C.B. Peel, B.M. Hochwald, A.L. Swindlehurst, A vector-

perturbation technique for near-capacity multiantenna

multiuser communication-part I: channel inversion and

regularization, IEEE Trans. Comm. 53 (1) (January 2005)

195–202.

[5] B.M. Hochwald, C.B. PEEL, A.L. Swindlehurst, A vector-

perturbation technique for near-capacity multiantenna

multiuser communication-part II: perturbation, IEEE

Trans. Comm. 53 (3) (March 2005) 537–544.

[6] M. Tomlinson, New automatic equaliser employing modulo

arithmetic, Electron. Lett. 7 (March 1971) 138–139.

[7] D. Schmidt, M. Joham, W. Utschick, Minimum mean

square error vector precoding, in: Proceedings of PIMRC

2005, September 2005, pp. 107–111.

[8] Y. Jiang, W.W. Hager, J. Li, The Geometric Mean

Decomposition, Linear Algebra and Its Applications,

February 2005, pp. 373–384.

[9] D. Wubben, R. Bohnke, V. Kuhn, K.D. Kammeyer, MMSE

extension of V-BLAST based on sorted QR decomposition,

in: Proceedings of IEEE VTC2003-Fall, vol. 1, Orlando, FL,

USA, October 6–9, 2003, pp. 508–512.

[10] Y. Jiang, J. Li, W.W. Hager, Joint transceiver design for

MIMO communications using geometric mean decomposi-

tion, IEEE Trans. Signal Processing 53 (10, part. 1) (October

2005) 3791–3803.

[11] Y. Jiang, J. Li, W.W. Hager, Uniform channel decomposi-

tion for MIMO communications, IEEE Trans. Signal

Processing 53 (November 2005) 4283–4294.

[12] J. Zhang, K.J. Kim, Near-capacity MIMO multiuser

precoding with QRD-M algorithm, in: Proceedings of the

36th Asilomar Conference on Signals, Systems and Compu-

ters, 2005, pp. 1498–1502.