group detection
TRANSCRIPT
-
8/9/2019 Group Detection
1/15
1
Reduced Complexity Demodulation of MIMO
Bit-Interleaved Coded Modulation using IQ
Group DetectionZak Levi , Dan Raphaeli
Abstract In this paper we propose a novel reduced
complexity technique for the decoding of multiple-input
multiple-output bit interleaved coded modulation (MIMO-
BICM) using IQ Group Detection (GD). It is well known
that the decoding complexity of the MAP detector for
MIMO-BICM increases exponentially in the product of
the number of transmit antennas and number of bits
per modulation symbol, and becomes prohibitive even
for simple schemes. We propose to reduce complexity
by partitioning the signal into disjoint groups at the
receiver and then detecting each group using a MAP
detector. Complexity and performance can be traded off
by the selection of the group size. Group separation and
partitioning is performed such as to maximize the Mutual
Information between the transmitted and received signal.
It is shown that for moderate to high SNR IQ Group
Detection outperforms conventional MMSE per antenna
detection schemes by 1-2[dB] under a fast Rayleigh fading
channel, and by 3-4[dB] under a Quasi static Rayleigh
fading channel with practically no increase in decoding
complexity. It is also shown that higher gains can beachieved with some increase in complexity. We further
propose an ad hoc Iterative Group Cancelation scheme
using hard decision feedback to enhance performance.
Index Terms: Mutual Information, Group Detection, Max-
imum A posteriori, Minimum Mean Square Error, Log
Likelihood Ratio.
I. INTRODUCTION
Bit interleaved coded modulation (BICM) is a well
known transmission technique widely used in practical
single-input single-output (SISO) systems. In SISO sys-tems the BICM approach received a lot of attention due
to its ability to exploit diversity under fading channels
in a simple way [1]. Inspired by such an approach
BICM was proposed as a transmission technique for
multi carrier MIMO systems [7], [2].MAP decoding
of BICM transmission involves the computation of the
Log Likelihood Ratio (LLR) for each transmitted bit.
The LLR computation is performed using a detector,
the detector complexity is exponential in the product
of the number of transmit antennas and number of bits
per modulation symbol. Even for simple scenarios this
complexity becomes overwhelming. Various techniques
have been suggested to reduce the computational burdenof the MAP detector. Most of them can be classified into
either list sphere detector based techniques, or Interfer-
ence Suppression and Cancelation based techniques.
The list sphere detector was proposed by [15], [8]
and is a modification of the sphere decoder. The sphere
decoder is an efficient algorithm for finding the nearest
lattice point to a noisy lattice observation. The vector
space spanned by the MIMO channel matrix is regarded
as a lattice and the received signal as its perturbed
version. The algorithm searches for the nearest latticepoint in a sphere around the perturbed received lattice
point. The list sphere detector is a modification of the
sphere decoder and enables the production of approx-
imate LLR values for coded bits. The complexity and
performance of the list sphere detector greatly depend
on the selection of the sphere radius and the list size.
These depend on the channel matrix and the SNR. An
iterative scheme using the list sphere detector and a
turbo code was reported to closely approach capacity of
the multi antenna fast Rayleigh fading channel [15]. The
complexity of the list sphere detector is generally higher
then that of decoding techniques employing Interference
Suppression and Cancelation.
For decoding techniques employing Interference
Suppression and Cancelation, the MAP detector is ap-
proximated by linear processing of the MIMO channel
outputs followed a per antenna LLR computer. In [10]
the authors proposed a Zero Forcing (ZF) detector
followed by a per antenna LLR computer. The authors
made the simplifying assumption of white post equal-
ization interference. In [11] a Minimum Mean Squared
Error (MMSE) based detector was derived followed by
a per antenna LLR computer without the assumption
of white post equalization interference. It was shown
that such a receiver has the same complexity as the onein [10] but offers superior performance. The authors
in [16] proposed an iterative scheme comprising of
a soft Interference Canceler (IC), an adaptive MMSE
detector followed by a per antenna LLR computer and
a soft output decoder. Soft outputs from the decoder
were used to both reconstruct estimates of the channel
output and to adapt the MMSE detector. The recon-
structed channel output estimates were subtracted from
the true received signal (IC) and the resulting signal was
detected via the adaptive MMSE detector and the LLR
computer. It was shown that when bit reliability at the
output of the soft decoder was high the adaptive MMSE
detector coincided with the Matched Filter. A reducedcomplexity approximation to [16] was proposed in [9]
-
8/9/2019 Group Detection
2/15
2
where the soft output decoder was replaced by a hard
output Viterbi decoder and the soft Interference Can-
celer by a hard one. After the first decoding stage ,due
to the lack of soft information from the decoder, correct
decisions were assumed. After the initial decoding stage
the MMSE detector was replaced by a Matched Filter.
In this paper we propose a Group Detection (GD) In-
terference Suppression based technique. GD was widely
studied in the context of Multi User Detection (MUD) in
CDMA systems [3]. The idea is to jointly detect a subset
of the transmitted information while treating the rest
of the transmission as noise. Many existing detection
techniques can be regarded as Group Detection based
techniques, namely the per antenna detection techniques
where each antenna can be identified as a single group.
The authors in [4] used Group Detection in the context
of V-BLAST decoding ( [6]) as a remade for error
propagation. In their work a group of the worst p
sub-channels was jointly detected using ML decoding.
A DFE was then used to detect the rest of the sub-channels. In [5] a GD scheme was proposed as a trade
off between diversity gain and spatial multiplexing gain
by partitioning the signal at the transmitter into groups.
Each group was encoded separately and per group rate
adaptation was performed.
In our work group detection was employed only
at the receiver side with no special treatment at the
transmitter. Unlike [4], [5], [11], [9], [16] where a
group was defined as a collection of antennas/sub-
channels, we define a group as a collection of In Phase
and Quadrature (I/Q) components of the transmitted
symbols possibly from different antennas. The smallest
group is defined as a single (I/Q) component of thea transmitted symbol. The GD scheme consists of
group partitioning, group separation and detection. The
proposed GD scheme was derived from an information
theoretic point of view. Group separation was performed
by linear detection. Under a Gaussian assumption on the
transmitted signal, the MMSE detector was identified
as a canonical (information lossless) detector for group
detection. A group partitioning scheme was derived
such as to maximize the sum rate. The selection of
the group size allowed us to tradeoff performance with
complexity. At one end when the number of groups was
set to one, the entire transmission was jointly detected
and the scheme coincided with full MAP, while at the
other end each dimension was decoded separately. An
Iterative group interference canceling technique using
hard outputs from the decoder similar to [9] was also
investigated. Finally, performance was evaluated via
simulations using a rate 1/2 64-state convolutional codewith octal generators (133,171) and random interleav-
ing. The proposed GD scheme was compared to the full
MAP detection scheme and the standard MMSE scheme
[11], [9] for both fast Rayleigh fading and quasi static
Rayleigh fading channels.
The organization of this paper is as follows. In
Section II the system model is presented along witha review of MIMO-BICM MAP detection. Section III
bi
ciRandom
Interleaver
Binary
SourceEncoder
Symbol
Mapper (Gray)
NTX
a
1a
S/P
TX
H
Detector
(Bit LLR)
1y
NRX
y
De
InterleaverDecoder
bi
RX
Fig. 1. MIMO-BICM NRX x NTX System Model.
introduces the concept of Group Detection and deals
with group separation and detection. Group partitioning
is addressed in Section IV. Iterative Group Interfer-ence Cancelation is discussed in Section V. Simulation
results for fast and quasi static Rayleigh fading are
presented in Section VI, and Section VII concludes the
paper.
I I . SYSTEM MODEL AND NOTATIONS
We consider a MIMO-BICM system with NT Xtransmit and NRX receive antennas as illustrated inFig. 1. The information bit sequence b = [b1,...,bNb ] isencoded into coded bits which are then interleaved by
a random interleaver. The interleaved bits, denoted by
c = [c1,...,cNc ], are mapped onto an 2m
QAM signalconstellation using independent I&Q gray mapping. The
block of Nc/m symbols is split into sub-blocks oflength NT X . At each instant n a sub-block a(n) =[a1(n),...,aNTX (n)]
T is transmitted simultaneously by
the NT X antennas. We assume that the transmitted sym-bols are independent with a covariance matrix Raa =2a INTXNTX . The NRX 1 received signal is denotedby y(n) = [y1(n),...,yNRX (n)]
T and is given by
y(n) = H(n) a(n) + z(n) (1)Where H(n) is the NRXNT X complex channel
matrix [hi,j(n)]i=1..NRX ,j=1..NTX and is assumed tobe perfectly known at the receiver (full CSI at thereceiver). z(n) is an NRX1 additive white complexGaussian noise vector z(n) = [z1(n),...,zNRX (n)]
T
with a covariance matrix of Rzz = 2z INRXNRX .
For simplicity we consider the case where
NT X =NRX NT. The extension to an arbitrarynumber of transmit and receive antennas satisfying
NT X NRX is strait forward. The above systemmodel can be used to describe a multi carrier MIMO
system by interpreting the instant index n as afrequency (subcarrier) index. Each block of Nc/msymbols would correspond to a single multi carrier
symbol with NC/(mNT X ) sub carriers,and H(n)would correspond to MIMO channel experienced by
-
8/9/2019 Group Detection
3/15
3
the nth subcarrier. For the reminder of this paper we
omit the instant index n for clarity of notation.
A. Review of MIMO-BICM MAP detection
The decoding scheme is shown in Fig. 1,the MAP
detector performs soft de-mapping by computing the
conditional Log Likelihood Ratio(LLR) for each codedbit. The conditional LLR of the kth coded bit is the
logarithm of the ratio of the likelihood that the bit
was a one, conditioned by the received signal and
channel state, to the likelihood that the bit was a zero,
conditioned by the received signal and channel state
(full CSI at the receiver). The conditional LLR for the
kth coded bit is given by
LLR( ck| y, H) = log
Pr
ck = 1| y, H
Pr
ck = 0| y, H (2)
For clarity and ease of notation from here on we omit
conditioning on the channel matrix H from our notation.Using Bayes rule and the ideal interleaving assumption,
the conditional LLR of the kth coded bit is given by
LLR
ck| y
= log
aSk,r1
e
12z
yHa2
aSk,r0
e
12z
yHa2
(3)
Where Sk,rl CNT is the set of all complex QAMsymbol vectors whose kth bit in the rth symbol is l {0, 1}. The complexity of the above LLR computationis 2
mNT1
and is thus exponential in the product of theM-QAM constellation size and the number of antennas.
III. GROUP DETECTION
Before presenting the group detection scheme let us
reformulate the system model using real signals only.
Eq. (1) is transformed intoy
Ry
I
=
HR HIHI HR
aRaI
+
zRzI
(4)
The subscripts R or I imply taking the Real
or Imaginary part of the Vector or Matrix it is
associated with. For clarity of notation for therest of this paper all real vectors and matrices
derived from the complex ones described in
Sec. II will inherit the names of their complex
versions. For example from here on the vector a =[real {a1} , ...,real {aNT} , imag {a1} , ...,imag{aNT}]T.Denote the number of real dimensions as N = 2 NT
The transmitted signal a is partitioned into Ng dis- joint groups of equal size M, where M = N/Ng.The extension to non equal size groups is trivial. Let
= {1, 2, . . . N } be the set of indexes of entries in thetransmitted vector a. Define the group partitioning of into disjoint groups Gi such that:Gi ,|Gi| = M andNgi=1
Gi = . For any a N, the group aGi |Gi|
y
1Ga
LLR
LLR
LLR
P/S Deinterleaver Decoder
b
Group Separation Group Detection
2Ga
NgG
a
M x 1
M x 1
M x 1
N x 1
N x 1
N x 1
1GW
2GW
NgGW
Fig. 2. Group Detection Scheme.
is obtained from a by striking out ak, k / Gi. Thechannel experienced by the group Gi namely HGi isobtained by striking out the kth column of H k / Gi.
At the receiver the groups are separated by an NNseparation matrix. The sub-matrix WGi of size M N,that filters out the ith group out of the received vector
y, is obtained from W by striking out the kth row ofW
k / Gi. The separate detection of real and imaginaryparts of the transmitted symbols is possible due to the
independent I&Q mapping rule. The separation scheme
is depicted in Fig. 2
A. Group Separation Matrix
Given a group partitioning scheme, we propose to
optimize the separation matrix W such as to maximizethe sum rate
Ng
i=iIWGiy; aGi (5)
Denote the output of the separation matrix corre-
sponding to the group Gi by
aGi WGiy (6)Eq. (5) is maximized by choosing the group sep-
aration matrix WGi such as to maximize the MutualInformation between each transmitted group and the
output of the separation matrix
MN
WoptGi = arg maxWGi{I(aGi; aGi)} (7)
From the data processing inequality [12] follows that
I
y; aGi IWGiy; aGi = IaGi; aGi (8)
Thus if exists a separation matrix WGi that achievesthe equality in Eq. (8) then it clearly maximizes the
mutual information in Eq. (7) and in Eq. (5). It is well
known [22], [13] that the sub-matrix of the MMSE
separation matrix Wmmse
Wmmse = RayR1yy = H
T
HHT +
2z2a
INN
1(9)
corresponding to the group Gi achieves the equalityin Eq. (8). The sub-matrix for group Gi is given by
-
8/9/2019 Group Detection
4/15
4
WmmseGi = HTGi
HHT +
2z2a
INN
1(10)
The estimation error covariance matrix is given by
Ree = Raa
RayR1
yy
Rya (11)
and the estimation error covariance matrix for the
group Gi is a sub-matrix of Ree and is denoted byRe
GieGi
B. Group LLR Computation
The output of the MMSE separation matrix for group
G is given by
aG = Wmmse
G HGaG + WmmseG
HGaG + z
vG
(12)
Where HG is the N |G| sub matrix of H corre-sponding to group G and HG is the N
G sub matrixofH corresponding to group G,and GG = . Denoteby vG the noise experienced by group G, the covariancematrix of vG is given by
RvGvG = WmmseG
2a2
HGHTG
+2z2
INN
(WmmseG )
T(13)
The conditional LLR for the kth coded bit, where kbelongs to the set of coded bits mapped into one of the
symbols belonging to group G is given by
LLR( ck|
aG) = logPr {ck = 1| aG}Pr {ck = 0| aG} (14)To compute Eq. (14) we need the conditional pdf
f( aG| aG). From Eq. (12) and under the Gaussianassumption on the inter group interference [21] follows
aG| aG N
WmmseG HGaG, RvGvG
(15)
The fact the the noise term in Eq. (12) is colored
complicates the evaluation of Eq. (15). We propose
to whiten the noise in Eq. (12). The noise covariance
matrix is symmetric positive semi-definite and thus has
the following eigen value decomposition
RvGvG = UGGUTG (16)
Where UG is a |G| |G| unitary matrix and G is a|G| |G| diagonal matrix of the eigen values of RvGvG.
UG (UG)T
= I|G||G|
G = diag
G1 , . . . G|G|
(17)The noise whitening matrix is given by
FG = 12G U
TG (18)
The output of the group whitening separation matrixfor group G is given by
aG = FGWmmse
G HGaG + vG (19)
Where
RvG
vG
= I|G||G|
The conditional LLR is then derived by using Bayeslaw and the ideal interleaving assumption. The condi-
tional LLR is given by
LLR( ck| aG) = log
aGS
k,rG,1
e12aGFGWmmseG HGaG2
aGS
k,rG,0
e12aGFGWmmseG HGaG2
(20
Sk,rG,l R|G| is the set of all real |G| dimensionalPAM symbol vectors whose kth bit in the rth symbol
is l {0, 1}. The complexity of the LLR computation
for all groups is Ng2
m2 |G|
and is exponential in theproduct of the group size and the number of bits per real
dimension. We are then able to trade off performance
with complexity by the selection of group size.
C. Simplified LLR Computation for group size of 2
For a group size of Ng = 2 it is possible to derive asimplified closed form approximation for the LLR with-
out computing the noise whitening matrix in Eq. (18).
The derivation is in the spirit of [11] and can be done for
an arbitrary group size. The LLR will be derived given
a zero forcing separation matrix. The MMSE structure
will then emerge from the derivation. As in [11] byusing the log max approximation [1] the conditional
LLR can be expressed as
LLR
ck| y max
dSk,rG,1
{logf ( aZF| aG = d)}max
dSk,rG,0
{log f( aZF| aG = d)} (21)
Where
aZF = H#y =
HTH
1HTy = a + w (22)
H# is the ZF separation matrix given by the Moore
Penrose pseudo inverse of H. The noise covariancematrix is given by
Rww =12
2z
HTH1
(23)
Let the group G = {i, j}. Under the Gaussianassumption on the inter group interference [21] follows
that
f( aZF| aG) = e12Q(aG)
(2)M |G|
Q (aG) =
aZF GT
1G
aZF G
(24)To find the mean and covariance in Eq. (24) we note
that
-
8/9/2019 Group Detection
5/15
5
aZF = aiei + aj ej +
k /{i,j}
akek + w (25)
Where ei is the N 1 ith unit vector. The mean andvariance are given by
G
= E{ aZF| aG} = aiei + aj ejG = E
aZF G
aZF G
TaG
= 122a
INN VGVTG
+ Rww
VG =
eiej (26)
Substituting Eq. (26) into Eq. (24) gives
Q (aG) = aTZF
1G aZF 2aTZF1G
aiei + aj ej
+
a2i eTi
1G ei + a
2j e
Tj
1G ej
(27)
Substituting Eq. (27) and Eq. (25) into Eq. (21) and
noting that the first term in Eq. (27) is not a function
of aG and thus cancels out we arrive at
LLR
ck| y max
dSk,rG,0
Q (d)
max
dSk,rG,1
Q (d)
(28)
Where
Q (aG) = 2aTZF1G
aiei + aj ej
+ a2i eTi
1G ei
+a2j eTj
1G ej
(29)
In Appendix. (A) we show that
Q (aG) =12
2a
(1pjj )pii(1pii)(1pjj )p2ij
aMMSEipii
ai2
+122a
(1pii)pjj(1pii)(1pjj )p2ij
aMMSEjpjj
aj2
+
12
2a1
(1pii)(1pjj )p2ij
aMMSEi pij aj
2+
12
2a
1(1pii)(1pjj )p2ij
aMMSEj pij ai
2+C
(30)
Where C is not a function of aG and will cancel outin Eq. (28) and pi,j is the i,jth element of P
P = WmmseH (31)
In Appendix. (B) we show that 1/pii is the MMSE
bias compensation factor from [14] and that the bestunbiased linear estimate of aMMSEi from aj is pij aj .Thus in Eq. (30) we identify four euclidian distance
terms. We note that when when i, j are taken as the Iand Q of the same transmit antenna follows that pij = 0and pii = pjj . In Appendix. (B) we show that
pii1 pii = SN RMMSE U,i
By denoting the unbiased euclidian distance by
ij =
aMMSEipii
d1+j
aMMSEj
pii d2
(32)
Eq. (30) coincides with the LLR in [11] namely:
LLR
ck| y SN RMMSE U,i
max
dSk,r{i,j},0
2ij (d)
maxdSk,r
{i,j},1
2ij (d)
(33)
IV. GROUP PARTITIONING
The number of ways to partition the transmitted
signal into groups is a function of the transmitted signal
size N and the groups size M. For example when bothN and M are powers of 2 the number of partitioningpossibilities is given by Eq. (34)
NP =12
N
N/2
12
N/2N/4
. . .
12
2MM
=
12
log2(N/M) N!
M!
log2(N/M)i=1 (N2i)!(34)
Table II summarizes the number of partitioning pos-
sibilities for several values of N and M.
Scheme M NP2 2 (N=4) 2 34 4 (N=8) 2 1054 4 (N=8) 4 35
8 8 (N=16) 2 6756758 8 (N=16) 4 2252258 8 (N=16) 8 6435
TABLE I
NUMBER OF PARTITIONING POSSIBILITIES
We are faced with the problem of choosing a parti-
tioning scheme from amongst all the partitioning pos-
sibilities. A natural selection would be to choose the
partitioning scheme that minimizes some probability
of error measure, this although very intuitive is very
difficult to trace analytically. Instead we propose to
select the partitioning scheme that maximizes the sum
rate of the groups.
By using the chain rule of mutual information the
mutual information of the MIMO channel can be written
as Eq. (35)
Iy; a = Iy; aG1 + Iy; aG2 aG1++I
y; aGNg
aG1 , . . . , aGNg1 (35)When using the GD scheme information is not ex-
changed between groups, and so the mutual information
of Eq. (35) cannot in general be realized. The mutual
information (sum rate) given the GD scheme is given
by Eq. (36)
Ngi=1
I
y; aGi Iy; a (36)
The sum rate is simply the sum of mutual information
since the groups are disjoint. The mutual information inEq. (36) can be written as
-
8/9/2019 Group Detection
6/15
6
Ngi=1
I
y; aGi
=
Ngi=1
h
aGi Ng
i=1
h
aGi y (37)
Using the fact that the covariance of the MMSE
estimator of aGi from y equals the covariance of theestimation error, namely R
aGiy aGiy = ReGieGi [13]and assuming that the Inter Group Interference is Gaus-
sian [21] follows that aGi y is also Gaussian and that
Ngi=1
I
y; aGi
=
Ngi=1
h
aGi 1
2
Ngi=1
logReGieGi (38)
Under the assumption that transmitted symbols are
i.i.d the first term in Eq. (38) does not depend on the
partitioning scheme, the determinants of the error co-
variance matrices
ReGi
eGi
, i = 1 . . . N g are a functionthe partitioning scheme. Given a partitioning scheme
{G1, G2, . . . GNg } the error covariance matrix for theith group ReGieGi is obtained from the covariancematrix Ree by striking out the kth rows and the kthcolumns k / Gi. Thus finding the partitioning schemethat maximizes the mutual information in Eq. (38)
is equivalent to finding the partitioning scheme that
minimizes the product of the the determinants of the
group error covariance matrices. Thus we need to solve
the following optimization problem
{Gopt1 , Gopt2 , . . . GoptNg } =arg min
{G1, G2, . . . GNg }s.tGi |Gi| = MGi Gj = i = j
Ngi=1
ReGieGi
(39)
The complexity of the above search quickly becomes
overwhelming (Eq. (34)). To reduce the complexity of
Eq. (39) we turn to the structure of Ree. Since Ree is areal covariance matrix it is obviously symmetric positive
semi-definite, however its structure goes deeper due to
the symmetry in the real channel matrix H (Eq. 4). Thisstructure can be utilized to greatly simplify Eq. (39)
for the 2 2 scheme. Intuition form the simplifiedexpressions for the 2
2 scheme will then lead us to
develop simple suboptimal ad hoc approximations to(Eq. (39).
A. Simplified Partitioning for2 2 schemeFor the 2 2 antenna scenario the MMSE error
covariance matrix (Eq. (11)) is given by:
Ree =12
2a
I4x4 HT
HHT +
2z2a
I4x4
1H
(40)
In Appendix. (C) we prove that
Ree =2
2a 1
ad b2 c2
d bb a
0 cc 0
0 cc 0 d bb a
(41)
Where hi denotes the ith column ofH and the scalars,a,b,c,d are given by
=2z2a
a = 1 + hT1 h1 d = 1 + hT2 h2
b = hT1 h2 c = hT1 h4
(42)
For simplicity of notation we drop the scalar mul-
tiplying the matrix in Eq. (41) since it will not affectthe minimization of Eq. (39). For the 2 2 scenariothere are 3 ways to partition the transmitted signal intogroups of size 2,namely
Scheme G1 G21 {1, 2} {3, 4}2 {1, 3} {2, 4}3 {1, 4} {2, 3}
TABLE II
PARTITIONING POSSIBILITIES FOR 2 2 SCENARIO
The group error covariance matrices corresponding
to each partitioning possibility are given in Table. (III).
Scheme ReG1eG1
ReG2eG2
1
d b
b a
d b
b a
2
d 00 d
a 00 a
3
d c
c a
a cc d
TABLE III
ERROR COVARIANCE MATRICES FOR ALL PARTITIONINGPOSSIBILITIES OF 2 2 SCENARIO
Denote the product of the determinants of the group
error covariance matrices of the ith partitioning scheme
by Di, since all the error covariance matrices are posi-tive semi-definite their determinants are nonnegative.
D1=
ad b22 = (ad)2 b2 (2ad 1) 0D2
= (ad)
2 0D3
=
ad c2
2
= (ad)2 c2 (2ad 1) 0(43)
Substituting Eq. (43) into Eq. (39) we obtain
Scheme =Arg mini
(Di) (44)
From Eq. (43) and Eq. (44) follows that partitioning
scheme 2 will be chosen only in cases where 2ad hT1 h4 choose scheme 1 else chosescheme 3.
The above result is simple to compute and very
intuitive. Each one of the columns of the channel matrix
H can be regarded as the channel experienced by asingle element of the transmitted signal. Elements are
thus grouped together based on the correlation between
their corresponding channels. This is intuitive since
grouping correlated elements will result in less noise
enhancement in the group separation process.
B. Ad hoc Simplified Partitioning
In general for NT > 2 obtaining a closed formexpression for the noise covariance matrix as was done
for NT = 2 is hard to tackle. Instead we draw intuitionfrom the 2 2 scenario and propose ad hoc algorithmsfor the partitioning of a general NT NT system intogroups of 2 and 4.
1) Simplified Partitioning into groups of size 2 :The algorithm is a greedy one that partitions the trans-
mitted signal into groups of size 2 such as to maximize
the correlation at the output of the channel. The algo-
rithm stats off with a candidate list consisting of all
the transmitted elements. At each stage, the algorithm
finds the two maximally correlated elements from the
candidate list, groups them together and then erases
them from the candidate list.
Simplified partitioning algorithm for a group size of 2
1) n = 1, n = {(i, j) : i < j}
2) hi,j = hTi hj , (i, j) n3) [in, jn] = arg max(i,j)n (hi,j )
4) Gn = {in, jn}5) n = {(k, jn), (in, k), (jn, k), (k, in) : k,
(k, jn), (in, k), (jn, k), (k, in) n}6)n+1 = n \ n7) if (+ + n) Ng goto 3 else end
Since matrix HTH is a byproduct from the computationof Wmmse (see App ()) it needs not to be recomputedin stage 2. The above algorithm is very simple and its
complexity is that of finding the maximum entry from
a list. At each one of the Ng stages of the algorithmthe list size decreases drastically. This is a tremendous
reduction compared the combinatorial complexity
in Eq. (12). In Sec (IV-C) we show that under a
Gaussian alphabet and Rayleigh channel assumptions,
for NT = 4 the loss of the simplified partitioningalgorithm with respect to optimal partitioning increases
with the SNR. When transmitting 16bit/ChannelUsethe loss is about 0.3[dB] and when transmitting
30bit/ChannelUse the loss is about 0.45[dB]
2) Simplified Partitioning into groups of size 4 :
The algorithm is a greedy one that partitions the
transmitted signal into groups of size 4 namely
Gn = {in, jn, kn, ln}such as to maximize the following heuristical corre-
lation measure
[in, jn] = arg maxi,j hTi hjkn = arg maxk hTinhk + hTjnhk
ln = arg maxlhTinhl + hTjnhl + hTknhl
(47)
The correlation measure is built in the following
fashion. First the pair of elements with maximal cor-
relation is found then the third element is selected
such that maximizes the sum of correlations with the
already selected pair. The fourth element is found using
the same procedure thus selecting the element that
maximizes the sum of correlations with respect to the
pre selected triplet. Using the same notations as in the
previous chapter the partitioning algorithm is given by:
Simplified partitioning algorithm for a group size of 4
1) n = 1, n = {(i, j) : i < j}2) hi,j =
hTi hj , (i, j) n3) [in, jn] = arg max(i,j)n (hi,j)4) kn = arg maxk:(in,k)&(jn,k)n&k=in,jn
(hin,k + hjn,k)5) ln = arg maxl:(in,k)&(jn,k)&(kn,l)n&l=in,jn,kn
(hin,l + hjn,l + hkn,l)6) Gn = {in, jn, kn, ln}7) n = {(t, jn), (jn, t), (t, in), (in, t), (t, kn),
(kn, t), (t, ln), (ln, t) :
-
8/9/2019 Group Detection
8/15
8
{t : (t, jn), (jn, t), (t, in), (in, t), (t, kn), (kn, t), (t, ln), (ln, t) n}
8)n+1 = n \ n9) if + + n Ng goto 3 else end
Since matrix HTH is a byproduct from the com-putation of Wmmse (see App ())it needs not to berecomputed in stage 2. The above algorithm is very
simple and its complexity is that of finding the max-
imum entry from a list. At each one of the Ng stagesof the algorithm the list size decreases drastically. This
is a tremendous reduction compared the combinatorial
complexity in Eq. (12).
In Sec (IV-C) we show that under a Gaussian
alphabet and Rayleigh channel assumptions, for NT = 4the loss of the simplified partitioning algorithm with
respect to optimal partitioning increases with the SNR.
When transmitting 16bit/ChannelUse the loss is about0.25[dB] and when transmitting 30bit/ChannelUsethe loss is about 0.35[dB]
C. Mutual Information for Rayleigh fading channel
The capacity loss resulting from group detection
was computed for the Rayleigh fading channel, thus
the entries of the complex matrix H were indepen-dent Gaussian random variables (Rayleigh Amplitude,
uniform phase) with a variance of 1/NT generatedindependently at each instant. The expectation of the
capacity is given by
C = E
{Iy; a}= E{ 12 log2 2a2z HHT + INN} (48)
The expectation of the sum rate when using group
detection is computed using Eq. (38) and given by:
E{Ngi=1
I
y; aGi} = NgM2 log 122a
12Ngi=1
E{log ReGieGi }(49)
To probe the loss in capacity incurred by using group
detection we turn to simulations. Since the capacity we
are interested in is ergodic we can approximate the
expectation in Eq. (48) and Eq. (49) by the instantaverage. We consider the 2 2 and 4 4 systemsand group sizes of 2 and 4 with Optimal Search (OS)
partitioning, Simplified Search (SS) partitioning and
simple Per Antenna (PA) partitioning.
Fig. 3 summarizes results for a 2 2 system andshows that for medium to high SNR group detection
has a gain of around 1.5[dB] over the simple perantenna partitioning and is only around 0.6[dB] fromthe capacity.
Fig. 5 summarizes results for a 4 4 system forgroups of size 4 and 2.
For medium to high SNR, partitioning into groups
of size 4 with optimal group partitioning losses roughly2[dB] from capacity. Using the simplified partitioning
0 5 10 15 20 25 300
2
4
6
8
10
12
14
16
18Rayleigh Fading Capacity (Ntx=Nrx=2)
SNR [dB]
Bits/ChannelU
se
Full Capacity
GD
PA
12 13 14 15 16 17 18 19 20
6.5
7
7.5
8
8.5
9
9.5
SNR[dB]
Bits/ChannelUse
1 6 1 8 2 0 22 24 2 6 2 8
10.5
1 1
11.5
1 2
12.5
1 3
S N R [d B ]
Bits/ChannelUse
0.54[dB]
2[dB]
0.64[dB]
2.09[dB]
Fig. 3. 22 Capacity Loss Per Antenna Vs Group Detection.
0 5 10 15 20 25 300
5
10
15
20
25
30
35
40Rayleigh Fading Capacity (Ntx=Nrx=4)
SNR [dB]
Bits/ChannelUse
Full Capacity
2G_OS_GD
2G_PA_GD
4G_OS_GD
4G_PA_GD
8 10 12 14 16 18 20 22
11
12
13
14
15
16
17
18
19
20
21
Rayleigh Fading Capacity (Ntx=Nrx=4)
SNR [dB]
Bits/ChannelUse
16 18 20 22 24 26 28 30
20
21
22
23
24
25
26
27
28
Rayle igh Fad ing Capacity (Ntx=Nrx=4)
SNR [dB]
Bits/ChannelUse
1.7[dB]
2.9[dB]
4.8[dB]
2.2[dB]
3.6[dB]
5.5[dB]
Fig. 4. 44 Capacity Loss Per Antenna Vs Group Detection.
algorithm losses roughly an extra 0.3[dB]. The sim-ple antenna partitioning scheme for a group size of
4 (two antennas per group) losses roughly 3.5[dB]from capacity, thus smart group partitioning shows a
gain of roughly 1 1.5[dB] over simple per antennapartitioning.
For medium to high SNR, partitioning into groups
of size 2 with optimal group partitioning losses roughly3.5[dB] from capacity, using the simplified partitioningalgorithm losses roughly an extra 0.4[dB]. The simpleantenna partitioning scheme (two antennas per group)
losses roughly 5 5.5[dB] from capacity, thus smartgroup partitioning shows a gain of roughly 1 2[dB]over simple per antenna partitioning. It is interesting
to note that partitioning into groups of size 2 withoptimal partitioning achieves just about the same mutual
information as partitioning into groups of size 4 withsimple per antenna partitioning.
-
8/9/2019 Group Detection
9/15
9
0 5 10 15 20 25 300
5
10
15
20
25
30
35
40Rayleigh Fading Capacity (Ntx=Nrx=4)
SNR [dB]
Bits/Channel
Use
Full Capacity
2G_OS_GD
2G_SS_GD
4G_OS_GD
4G_SS_GD
20 21 22 23 24 25 26
21
22
23
24
25
26
SNR[dB]
Bits/ChannelUse
13 1 4 1 5 1 6 17 18 19
13
1 3 . 5
14
1 4 . 5
15
1 5 . 5
16
1 6 . 5
17
1 7 . 5
18
SN R [ d B]
Bits/ChannelUse
0.26[dB]
0.3[dB]
0.35[dB]
0.45[dB]
Fig. 5. 4 4 Capacity Loss GD Optimal Search Vs SimpleSearch.
V. ITERATIVE GROUP INTERFERENCE
CANCELATION
The detector in Eq. (3) does not exploit dependencies
between coded bits which leads to degraded perfor-
mance. The detector in Eq. (20) is an approximation
to Eq. (3) and is even more information lossy since
information is not exchanged between groups. An opti-
mal decoder would regard the channel code and MIMO
channel as serially concatenated codes and would de-
code them jointly, such a decoder would have extraor-
dinary complexity. Many authors [18], [15], [16], [17],[9] propose to use iterative schemes since it has been
shown that such schemes are very effective and com-
putationally efficient in other joint detection/decoding
problems [19], [20]. The iterative scheme proposed here
uses hard decisions from the decoder. Using soft outputs
would result in superior performance however hard
output decoders are commonly implemented in many
practical systems and are less complex then soft output
decoders. The iterative scheme proposed here is similar
to the one in [9].
For each group namely group G, hard decoded bitsfrom the decoder are re-encoded, re-interleaved and
used to reconstruct a version of the transmitted MIMOsymbol from all symbols but the ones corresponding
to group G. This reconstructed signal is then passedthrough the effective MIMO channel. Group Interfer-
ence Canceling is performed by subtracting the filtered
reconstructed signal from the true received signal. The
signal after Interference Cancelation is given by:
yiG
= HGaG + HG
aG aiG
eG
+z (50)
The superscript i in Eq. (50) denotes the iteration
number. Assuming correct decisions ai
G = aG theabove expression is further simplified.
yiG
= HGaG + z (51)
The noise after Interference Canceling (assuming
correct decisions) is white and thus a canonical front
end matrix is the Matched Filter HTG
aiG = HTGHGaG + H
TGz (52)
The group noise covariance matrix after matched
filtering is no longer white and is given by
RGG =12
2z HTG HG (53)
A. Group Partitioning For Iterative Group Detection
The partitioning into groups for the iterative stage
introduces a new trade off with respect to the original
group partitioning. In the first part of the decoding
process we traded off decoding complexity with per-formance, where larger groups resulted in better perfor-
mance and higher complexity. After the first decoding
pass we have hard estimates for all bits. If one partitions
the signal into large groups then one is using less
new information and at the extreme not using any
new information when no partitioning is done thus
only one group (MAP decoding). On the other hand
if one partitions the signal into very small groups (at
the extreme groups of 1 bit each) one may be more
susceptible to error propagation since one only has
hard estimates of the decoded bits with no reliability
measure. We propose to partition into groups of size
2. Note that the partitioning scheme in Sec. IV is nolonger relevant since it does not take into account the
new information from the initial stage. We thus propose
to use the simple antenna partitioning. The LLR for
group G can be efficiently computed by Eq. (33) andby setting
P = HTG
HGH
TG +
2z2a
INN
1HG (54)
At the end of each iteration one obtains hard decoded
bits that can be used by the next iteration. Simulation
results in chapter VI suggest that performing two itera-
tions achieves most of the performance gain.
V I . SIMULATION RESULTS
The performance of the GD scheme for MIMO-
BICM was evaluated via Monte-Carlo simulations. At
the transmitter blocks (packets) of 2000 information
bits were encoded and interleaved using a rate 1/264 state convolutional encoder with octal generators
(133, 171) followed by a random per packet inter-leaver. Two antenna configurations were considered,
a 2x2 configuration and a 4x4 configuration. For the
2x2 configuration the detection schemes considered
were full MAP detection, Per Antenna group detec-
tion (conventional MMSE) and optimal search GroupDetection all with zero, one and two hard iterations.
-
8/9/2019 Group Detection
10/15
10
Most of the performance gain due to iterations was
achieved after two iterations. For the 4x4 configuration
two partitioning schemes were considered namely the
partitioning into Ng = 4 groups of size 2 each andthe partitioning into Ng = 2 groups of size 4 each.For both partitioning schemes the detection schemes
considered were full MAP detection (only for fast
fading), Per Antenna group detection (PA GD - con-
ventional MMSE), Optimal Search Group Detection
(OS GD),Simplified Search Group Detection (SS GD)
all with zero,one and two iterations.The complex MIMO
channel matrix entries were drawn from a zero mean
complex Gaussian distribution with variance 1/NT inan iid fashion. Simulation results were summarized via
average Bit Error Rate (BER) and average Packet Error
Rate (PER) versus SNR1 plots.
6 8 10 12 14 16 18 20 22 24 26
10-4
10-3
10-2
10-1
Snr
Ber
Ber(Snr) Mimo (2,2) 64QAM ConvK =7 ,Rate = 1/2
16QAM Coded PA Detection
16QAM Coded GD Detection
16QAM Coded Map Detection
16QAM Coded IPA Detection
16QAM Coded IGD Detection
16QAM Coded IMAP Detection
16QAM Coded I2PA Detection
16QAM Coded I2GD Detection
16QAM Coded I2MAP Detection
64QAM Coded PA Detection
64QAM Coded GD Detection
64QAM Coded Map Detection
64QAM Coded IPA Detection
64QAM Coded IGD Detection
64QAM Coded IMAP Detection
64QAM Coded I2PA Detection
64QAM Coded I2GD Detection
64QAM Coded I2MAP Detection
16QAM
64QAM
2 Iteration
0 Iteration
1 Iteration
2 Iteration
0 Iteration
1 Iteration
Fig. 6. 2 2 16QAM,64QAM Fast Fading Rayleigh.
Gain [dB] @BER MAP/GD GD/PA
104 105
No Iter 0.2-0.3 0.8-1.7
1 Iter 0.1 0.4-0.8
2 Iter 0.1-0.3 0.35
M AP Iter Gain GD It er Gain
1 Iter 1-1.3 1-1.5
2 Iter 1 0.8-1
PA Iter Gain
1 Iter 2-2.5
2 Iter 1-1.5
TABLE IV
FAS T RAYLEIGH FADING MAP,GD,PA COMPARISON AT
BE R 104 - 105 , 2 2 SCENARIO
A. Fast Fading
For fast fading the MIMO channel was independently
generated at each instant. Fig 6 presents simulation re-
sults for the 2x2 configuration for both 16 and 64QAM.
Table IV summarizes the gain of the MAP scheme
over GD, the gain of GD over PA and the gain due
to iterations for each one of the schemes. The gain
1The SNR is defined as E
Ha2
E
z2
= 1
2z
was measured at a BER of 104 - 105. The gainsin Table IV correspond to both 16 and 64QAM since
they were found to be similar. Performing more than
two iterations did not show much gain. Fig 6 suggests
that the GD gain over PA increases with the SNR.
Without iterations GD shows a substantial gain over
PA. Performing iterations closes the gap between GD
and full MAP as well as the gain of GD over PA.
7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 2210
-5
10-4
10-3
10-2
10-1
Snr
Ber
Ber(Snr) Mimo (4,4) 16QAM Conv K=7 ,Rate = 1/2
16QAM Coded Map Detection16QAM Coded 4G_PA_GD Detection16QAM Coded 2G_PA_GD Detection16QAM Coded 4G_OS_GD Detection16QAM Coded 2G_OS_GD Detection16QAM Coded IMAP Detection16QAM Coded 4G_PA_IGD Detection16QAM Coded 2G_PA_IGD Detection16QAM Coded 4G_OS_IGD Detection16QAM Coded 2G_OS_IGD Detection16QAM Coded 4G_PA_I2GD Detection16QAM Coded 2G_PA_I2GD Detection16QAM Coded 4G_OS_I2GD Detection16QAM Coded 2G_OS_I2GD Detection16QAM Coded I2MAP Detection
0 Iteration
1 Iteration
2 Iteration
Fig. 7. 4 4 16QAM Fast Fading Rayleigh.
5 10 15 2010
-5
10-4
10-3
10-2
10-1
Snr
Ber
Ber(Snr) Mimo (4,4) 16QAM Conv K=7 ,Rate = 1/2
16QAM Coded 4G_SS_GD Detection
16QAM Coded 4G_OS_GD Detection
16QAM Coded 2G_SS_GD Detection
16QAM Coded 2G_OS_GD Detection
16QAM Coded 4G_SS_IGD Detection
16QAM Coded 4G_OS_IGD Detection
16QAM Coded 2G_SS_IGD Detection
16QAM Coded 2G_OS_IGD Detection
0 Iteration
1 Iteration
Fig. 8. 4 4 16QAM Fast Fading Rayleigh Optimal SearchVs Simple Search.
Fig 7 summarizes simulation results for the 4x4 con-
figuration for 16QAM. Table V presents a comparison
between the various GD schemes and the MAP scheme
as well as a comparison between GD with group size
of 2 to that of GD with a group size of 4, the gain due
to iteration is also included.
It is interesting to note that GD with a group size
of 2 outperforms PA with a group size of 4, especially
since the detection complexity of the former is much
lower then the later. The results show that performing
iterations reduces the gap between MAP, the various GD
schemes and PA.Fig 8 presents simulation results for the 16QAM
-
8/9/2019 Group Detection
11/15
11
Gain [dB] @BER |G|=2 MAP/GD |G|=2 GD/PA104 105
No Iter 1.5-2 1-2
1 Iter 1 0.5
2 Iter 0.6 0.3
|G|=4 MAP/GD |G|=4 GD/PANo Iter 1 1.5-2
1 Iter 0.4-0.8 0.7
2 Iter 0.3-0.5 0.5
GD,PA |G|=4/2 Iter GainNo Iter 1-2
1 Iter 0.4-0.7 1.5-4.5
2 Iter 0.1-0.3 1-2
TABLE V
FAS T RAYLEIGH FADING MAP,GD,PA COMPARISON AT
BE R 104 - 105 , 4 4 SCENARIO
4x4 configuration for the various GD schemes with
the simplified group partitioning (SS GD) algorithms.
Results were compared to those of the Optimal Search
partitioning (OS GD). The simplified partitioning intogroups of size 2 (See IV-B.1) showed a loss of no
more then 0.2[dB] with respect to optimal partitioning,the loss after one iteration dropped to 0.1[dB]. Thesimplified partitioning into groups of size 4 (See IV-
B.2) showed a loss of no more then 0.35[dB] withrespect to the optimal partitioning, the loss after one
iteration remained around 0.35[dB].
B. Quasi Static Fading
For quasi static fading the MIMO channel remained
constat over a duration of a block and changed indepen-
dently from block to block. Fig 6 presents simulation
results for 16QAM 2x2 configuration. Table VI summa-
rizes the gain of MAP scheme over GD,the gain of GD
scheme with respect to PA scheme as well as the gain
due to iterations for each one of the schemes all at a
PER of 102-103.
Fig 10 and Fig 11 presents simulation results for
16QAM 4x4 configuration. Fig 10 summarizes results
for the partitioning into 4 groups of size 2 each using
the simplified search algorithm, while Fig 11 present
simulation results for partitioning into 2 groups of size
4 each using the simplified search algorithm. Table VII
presents a comparison between the various GD schemesat a PER of102-103, as well as gain due to iterations.
Gain [dB] @PER MAP/GD GD/PA
102 103
No Iter 4-8 3-4
1 Iter 4-5 2-4
2 Iter 2-3 2
MAP Iter Gain GD,PA Iter Gain
1 Iter 1-2 3-4
2 Iter 0.5-1 1
TABLE VI
FAS T RAYLEIGH FADING MAP,GD,PA COMPARISON AT
BE R 104 - 105 , 4 4 SCENARIO
5 7 9 11 13 15 17 19 21 23 25 27 29 31 3310
-3
10-2
10-1
100
Snr
Ber
Ber(Snr) Mimo (2,2) 16QAM Conv K=7 ,Rate = 1/2
16QAM Coded PA Detection
16QAM Coded GD Detection
16QAM Coded Map Detection
16QAM Coded IPA Detection
16QAM Coded IGD Detection
16QAM Coded IMAP Detection
16QAM Coded I2PA Detection
16QAM Coded I2GD Detection
16QAM Coded I2MAP Detection
Fig. 9. 2 2 16QAM Quasi Static Fading Rayleigh.
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 3510
-3
10-2
10-1
100
Snr
PER
PER(Snr) Mimo(4,4) Packet size 2000 infromation bits 16QAM Conv K=7 R=1/2
16QAM Coded 4G_PA_GD Detection
16QAM Coded 4G_SS_GD Detection
16QAM Coded 4G_PA_IGD Detection
16QAM Coded 4G_SS_IGD Detection
16QAM Coded 4G_PA_I2GD Detection
16QAM Coded 4G_SS_I2GD Detection
Fig. 10. 4 4 16QAM 4 Groups Simple Search Quasi StaticFading Rayleigh.
5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 3510
-3
10-2
10
-1
100
Snr
PER
PER(Snr) Mimo(4,4) Packet size 2000 infromation bits 16QAM Conv K=7 R=1/2
16QAM Coded 2G_PA_GD Detection
16QAM Coded 2G_SS_GD Detection
16QAM Coded 2G_PA_IGD Detection
16QAM Coded 2G_SS_IGD Detection
16QAM Coded 2G_PA_I2GD Detection
16QAM Coded 2G_SS_I2GD Detection
Fig. 11. 4 4 16QAM 2 Groups Simple Search Quasi Static
Fading Rayleigh.
-
8/9/2019 Group Detection
12/15
12
Gain [dB] @BER |G|=2 GD/PA |G|=4 GD/PA104 105
No Iter 3-4 1.5-2
1 Iter 3.5-4 1.5-2
2 Iter 2.5-3 1-1.5
PA |G|=4/2 SS |G|=4/2No Iter 7-10 5-8
1 Iter 4-7 3-5
2 Iter 3-4 2-3
|G|=2 Iter Gain |G|=4 Iter Gain1 Iter 5-6 2.5-3.5
2 Iter 2.5-3.5 1.5-2
TABLE VII
QUASI STATIC RAYLEIGH FADING GD,PA COMPARISON AT
PE R 102 103 , 4 4 SCENARIO
C. Simulation Summary
Simulation results suggest that under a fast Rayleigh
fading at low BER GD achieves gains of 1-2[dB] with
respect to PA with practically no increase in complexity.
An extra gain of 1-2[dB] can be achieved by choosing
larger group sizes with a complexity price. Under for
Quasi static Rayleigh fading at low BER GD achieves
gains of 3-4[dB] with respect to PA with practically
no increase in complexity. An extra gain of 5-10[dB]
can be achieved by choosing larger group sizes with
a complexity price. For both Quasi static and fast
Rayleigh fading performing hard iterations improved
performance of all the schemes as well as reduced the
gaps between them.
VII. CONCLUSIONS
In this paper we proposed a scalable reduced com-plexity detection algorithm for MIMO-BICM. Com-
plexity reduction was achieved by performing detec-
tion in groups instead of joint detection of the entire
MIMO signal. A simple group partitioning algorithm
was derived as well as a approximate expression for
the LLR for group size of 2. Performance and com-
plexity were shown to be traded off by the selection
of the group size. Computer simulations showed that
GD achieves gains of 1-4[dB] with respect to PA with
practically no increase in complexity. Gains of up to
10[dB] were achieved by using larger group size. A
simple hard iterative interference canceling scheme was
further proposed to enhance performance. Performing
hard iterations improved performance of all the schemes
as well as reduced the gaps between them.
APPENDIX
APPENDIX A - CALCULATION OF Q (aG)
To compute Q (aG) we first derive a closed formexpression for 1G . Define
P = INN +2z2a H
TH1
1
(A-1)
And note that
G =12
2a
INN + 2z2a HTH1
P1
+VG (I2x2) VTG
(A-2)
Then make use of the matrix inversion lemma 2
1G =12
2aP
INN + VG I2x2 VTG P VG1
T
VTG P
(A-3)
Noting that
T =
I2x2 VTG P VG1
=
1 pjj pij
pij 1 pii
(1 pii) (1 pjj ) p2ij(A-4)
Where pij is the i,jth element of P in Eq. (A-1).To compute the first term in Eq. (29) we evaluate
aTZF1G ei =
12
2aaTZFP
INN + VGT V
TG P
ei (A-5)
Noting that P converts ZF estimation into MMSEestimation
aTZFP =
PTaZFT
=INN +
2z2a
HTH
11 HTH
1HTy
T=
HTH+
2z
2aIN
1HTy
T=
WmmseyT
= aTMMSE
(A-6)
The forth equality in Eq. (A-6) follows from the
following Eq. (A-7)HTH+
2z2a
I1
HT =2a2z
2a2z
HTH+ I1
HT
=2a2z
I HT
HHT +
2z2a
I1
H
HT
=2a2z
HT
I
HHT +2z2a
I1
HHT
=2a2z
HT
2a2z
HHT + I1
= HT
HHT +2z2a
I1
= Wmmse
(A-7)
The second and forth equalities in Eq. (A-7) follow
from the matrix inversion lemma while the rest are
trivial. Substituting Eq. (A-6) into Eq. (A-5) yields
aTZF1G eiai =
12
2aa
TMMSE
ei + VGT V
TG P ei
ai
= 122a(1pjj )a
MMSEi +pij a
MMSEj
(1pii)(1pjj )p2ijai
aTZF1G ej aj =
12
2aaTMMSE
ej + VGT V
TG P ej
aj
= 122a
pij aMMSEi +(1pii)a
MMSEj
(1pii)(1pjj )p2ijaj
(A-8)
The last two terms in Eq. (29) evaluate to
2
A1
A1
B D1 + CTA1B1 CTA1 =A + BDCT
1
-
8/9/2019 Group Detection
13/15
13
a2i eTi
1G ei =
12
2aeTi P ei +
12
2aeTi P VGT V
TG P ei
= 122a
pii(1pjj )+p2ij
(1pii)(1pjj )p2ija2i
a2j eTj
1G ej =
12
2ae
Tj P ej +
12
2ae
Tj P VGT V
TG P ej
= 122a
pjj (1pii)+p2ij
(1pii)(1pjj )p2ija2j
(A-9)
Substituting Eq. (A-6,A-8) into Eq. (A-3) yields
Q (aG) =12
2a
(1pjj )pii(1pii)(1pjj )p2ij
aMMSEi
pii ai
2+12
2a
(1pii)pjj(1pii)(1pjj )p2ij
aMMSEj
pjj aj
2+
12
2a
1(1pii)(1pjj )p2ij
aMMSEi pij aj
2+
12
2a
1(1pii)(1pjj )p2ij
aMMSEj pij ai
2
+C
(A-10)
APPENDIX B - CONNECTING P WITH MSE
From the matrix inversion lemma follows that
P =
INN +2z2a
HTH
11=
I
2a2z
HTH+ INN
1 (B-1)Again from the matrix inversion lemma follows that
2a2z
HTH+ INN1
=
INN HT HHT + 2z2a INN1 H (B-2)Substituting Eq. (B-2) into Eq. (B-1) gives
INN +
2z2a
HTH
11=
HT
HHT +2z2a
INN
1H
(B-3)
From Eq. (11) and Eq. (A-1) then follows that
Ree =2a2 (INN P) (B-4)
Since Ree is the MMSE error covariance matrix itsdiagonal elements are the MSEMMSE in the estimationof each element of a. The unbiased SNR (See [14]) ofthe ith element ai is given by
SN RMMSE U,i =2a2
MS EMMSE,i 1 = pii1pii (B-5)
The bias compensation scaling factor is given by
2a2
2a2 M SEMMSE,i
=1
pii(B-6)
We next prove that the best unbiased linear estimate
of aMMSEi from aj is pij aj . From Eq. (B-4) follows
that
pij = 22a E
ai aMMSEi
aj aMMSEj
=2
2aE
aMMSEj
ai aMMSEi 22a E{aiaj}
+ 22aE
aMMSEi aj (B-7)
The first term in the second equality is zero from
the orthogonality principle and the second term in the
second equality is zero since transmitted symbols are
statistically independent. Thus
pij =2
2aE
aMMSEi aj
= 22aE
aMMSEj ai
(B-8)
From linear estimation theory follows that
aMMSEi (aj ) =E{aMMSEi aj}
E{a2j} aj =2
2aE
aMMSEi aj
aj
= pijaj(B-9)
Since both aj and aMMSEi are zero mean follows
that the best unbiased linear estimate of aMMSEi from
aj is pij aj . The same proof can be repeated foraMMSEj .
APPENDIX C - COMPUTING Ree FOR THE 2 2SCHEME
Using the matrix inversion lemma follows that
Ree =12
2a
I4x4 HT
HHT +
2z2a
I4x4
1H
=
12
2a
I4x4 +
2a2z
HTH1 (C-1)
Denoting the ith row of H by hTi follows that
HTH =
hT1 h1 hT1 h2 0 h
T1 h4
hT1 h2 hT2 h2 hT1 h4 0
0 hT1 h4 hT1 h1 hT1 h2hT1 h4 0 h
T1 h2 h
T2 h2
(C-2)
Denoting =2z2a
and substituting Eq. (C-2) into
Eq. (C-1)
Ree = 2A
1 0 0 00 1 0 00 0 1 00 0 0 1
+
hT
1 h1 hT
1 h2 0 hT
1 h4hT1 h2 h
T2 h2 hT1 h4 0
0 hT1 h4 hT1 h1 hT1 h2hT1 h4 0 h
T1 h2 h
T2 h2
1 (C-3)
Denote the scalars a,b,c,d
a = 1 + hT1 h1 d = 1 + hT2 h2
b = hT1 h2 c = hT1 h4
(C-4)
The inverse of the matrix in Eq. (C-3) can be
computed in closed form by noting that the matrix in
Eq. (C-3) has the following block symmetry
Ree = 122a A B
BT A1 (C-5)
-
8/9/2019 Group Detection
14/15
14
And then using the block matrix inversion lemma 3
follows
A B
BT A
1=
E1 E1BA1A1BTE1 A1 + A1BTE1BA1 E = A BA1BT
(C-6)
Each one of the sub-matrices can now be computed
in close form thus:
A1 = 1adb2
d bb a
E = A c2adb2 0 1
1 0
d b
b a
0 11 0 =
1 c2adb2
A
E1 =
1adb2c2
d bb a
E1BA1 =c
d bb a
0 1
1 0
d bb a
(adb2)(c2ad+b2)
= cc2ad+b2
0 1
1 0
A1BTE1 = E1BA1T = cc2ad+b2
0 11 0
(C-7)
A1 + A1BT
E1BA1
=
1(adb2)
d b
b a
+
c2
d bb a
0 11 0
0 1
1 0
(adb2)(adb2c2) =
1(adb2) + c2(adb2)(adb2c2) d bb a =1
(adb2c2)
d bb a
= E1
(C-8)
Substituting Eq. (C-7) and Eq. (C-8) into Eq. (C-5)
Ree =2
2a
1
ad b2 c2
d bb a
0 cc 0
0 cc 0
d bb a
(C-9)
3 A BCT D 1 = E1 E1BD1D1BTE1 D1 + D1CTE1BD1 E = A BD1CT
REFERENCES
[1] G.Caire, G. Taricco and E. Biglieri, Bit-interlived codedmodulation , IEEE Trans. Inf Theory, vol. 44, pp. 927-946, May 1998.
[2] S. H. Muller-Weinfurtner, Coding approaches for mul-tiple antenna transmission in fast fading and OFDM,IEEE Trans. Signal Proces., vol.50, pp 2442-2450, Oct.
2002.
[3] M. K. Varanasi, Group detection for synchronousGaussian code-division multiple-access channels, IEEETrans. Inf. Theory, vol. 41, no4, pp 1083-1096, Jul. 1995.
[4] Won-Joon Choi, Rohit Negi and John M. Cioffi, Com-bined ML and DFE Decoding for the V-BLAST Sys-tems, ????? Star Laboratory Stanford Uni.
[5] Sana Sfar, Lin Dai, Khaled B.Letaief, Optimal Diversity-Multiplexing Tradeoff With Group Detection for MIMOSystems, IEEE TRans Commun, Vol. 53, No 7, July2005.
[6] G.J. Foschini, Layered space-time architecture for wire-less communications in a fading enviroment when usingmulti-element antennas, Bell Labs. J, vol. 1, no 2,pp 41-59, Autumn 1996.
[7] J. Boutrus, F. Boixadera, and C. Lamy, Bit-Interleavedcoded modulations for multiple-input multiple-outputchannels, Proc. of the IEEE 6th ISSSTA00, New Jersey,Sept. 2000.
[8] J. Boutros, N. Gresset, L. Brunel and M. FossorierSoft-input, soft-output lattice sphere decoder for linearchannels IEEE GLOBECOM 2003, San Francisco, USA
[9] K.B Song, and S.A Mujtaba,A Low Complexity Space-Frequency BICM MIMO-OFDM System for Next-Generation WLANs, Globecom 2003.
[10] Michael R.G Butler, and Iain B. Collings, A Zero-Forcing Approximate Log-Likelihood Receiver forMIMO Bit-Interleaved Coded Modulation, IEEE Com-mun Letters. Vol 8, no 2, pp 105-107, Feb 2004.
[11] D. Seethaler, G. Matz, and F. Hlawatch, An Efficient
MMSE-Based Demodulator for MIMO Bit-InterleavedCoded Modulation, IEEE Globecom 2004 Dallas, Texas,Nov/Dec 2004.
[12] Thomas M. Cover and Joy A. Thomas , Elements ofInformation Theory. John Wiley, 1991.
[13] W. Yu and J. Cioffi, The sum capacity of a Gaussianvectorbroadcast channel, IEEE Trans. Inform. Theory,submitted for publication
[14] J.M. Cioffi, G.P. Dudevoir, M.V. Eyuboglu and G.D. For-ney, , MMSE decision-feedback equalizers and coding- Part I:Equalization results IEEE Trans. Commun, vol43, no 10, pp. 2582-2594, Oct 1995.
[15] B. Hochwald and S. ten Brink, Achieving near-capacityon a multipleantenna channel, submitted to the IEEETransactions on Communications, July 2001.
[16] M. Witzke, S. Baro, F. Schreckenbach, and J. Hagenauer,Iterative. detection of MIMO signals with linear detec-tors, in Proc. Asilomar Conference on Signals, Systemsand Computers (ACSSC), Pacific Grove, CA USE, IEEE,Nov. 2002, pp. 289 -293.
[17] Boronka, A. Rankl, T. Speidel, J.Iterative Nonlinear De-tection of MIMO Signals using an MMSE-OSIC Detectorand Adaptive Cancellation, ITG Conference on Sourceand Channel Coding (SCC), Erlangen, Germany, January2004, pp. 17-24. IEEE Trans Infomation Theory, Vol 43,No 3, May. 1997.
[18] W. J. Choi, K. W. Cheong, and J. Cioffi Iterative softinterference cancellation for multiple antenna systems,
IEEE Wireless Communications and Networking Confer-ence, 2000
[19] Xiadong Li and J.A. Ritcey, Bit Interleaved CodedModulation with Iterative Decoding, IEEE Communi-
-
8/9/2019 Group Detection
15/15
15
cations Letters, vol. 1, No. 6, Novermber 1997, pp. 169-171.
[20] S. Ten Brink, J. Speidel and R. H. Yan, Iterative demap-ping and decoding for multilevel modulation, Proc. of the
IEEEGlobal Telecommunications Conference, vol. 1, pp.579-584, November 1998
[21] H. Vincent Poor and Sergio VerdU Probability of Errorin MMSE Multiuser Detection, IEEE Trans Infomation
Theory, Vol 43, No 3, May. 1997.[22] G.D. Forney, Jr. Shannon meets Wiener II: On MMSE
estimation in successive decoding schemes, To appear inProc. 2004 Allerton Conf. (Monticello, IL), Sept. 2004 ,Vol 43, No 3, May. 1997.