chapter 2 soundfield modelling
TRANSCRIPT
Chapter 2
Soundfield Modelling
2.1 Introduction
Acoustic signal processing techniques have been studied now for many years. These
techniques perform well under ideal conditions but still face difficulty in real envi-
ronments, where captured signals are corrupted by reverberation. In the interest
of developing algorithms more robust to reverberation, it is important to have a
simple analytical technique to quantify algorithmic performance. Researchers often
resort to computer simulation and real-world experiment for testing of algorithms.
However, the results may not promote deep insights or generality. In this chapter,
we describe general soundfield models useful for analyzing the performance of sig-
nal processing algorithms in reverberant rooms. These models provide the tools
that are used throughout the rest of the thesis.
Acoustic waves are well described by the wave equation. Any soundfield model
of reverberation must be consistent with it. One way of representing the wave
equation solutions in a source free region is with the single layer potential [18,51].
The single layer potential has proven useful for exploring the statistical properties
of wireless communication channels in different scattering environments [51, 100].
In a wide range of room environments, the soundfield can be described with
statistical room acoustics [83,84]. Here the sound pressure at each point is thought
of as a random variable, with predictable statistical properties. An important
model for describing these statistical properties of sound pressure is the diffuse
field model [57], where sound is evenly diffused throughout the room. A diffuse
soundfield can be thought of as the superposition of a large number of plane waves,
arriving evenly from all directions. With this understanding, one can derive the cor-
relation of sound pressure between different positions (and different time instances)
in a diffuse field [79].
In other disciplines, wave field models have benefitted by recognizing that plane
waves do not always arrive evenly from all directions. Often they are physically
constrained to a limited range of directions. Examples are ambient sea noise [20,22]
25
z
j
φ
θx
r
yx
k
i
(a)
xφ
r
i
j
xy
(b)
Figure 2.1: Representation of a position vector x in (a) the 3-D spherical coordi-nates (r, θ, φ) and (b) the 2-D polar coordinates (r, φ).
and the multipath of telecommunications systems [76, 100]. Accounting for such
directionality here has resulted in more accurate system models.
In this chapter, we derive a statistical description of the field pressure based
on a generalization of the diffuse field model. This so called generalized diffuse
field model, extends the classical diffuse field model to soundfields of a directional
nature. We also explore some general representations of the reverberation: in
particular, the wave equation solutions describing the soundfield in a source-free
region, and the geometric interpretation of such a soundfield as a single layer po-
tential. Relationships between these representations are established. The models
are used in subsequent chapters to quantify the performance of acoustic systems
and to motivate new system designs.
Previewing the chapter’s content, Section 2.2 establishes basic notation used in
this thesis. Section 2.3 provides general solution to the Wave equation. Section
2.4 develops the generalized diffuse field model of a reverberant field. Section 2.5
presents the geometric interpretation of a reverberant field. Section 2.6 describes
sample geometric configurations. Finally, Section 2.7 explores the 2-D case.
2.2 Coordinate Systems and Notation
Before describing the soundfield models, we define the notation for representation
of geometric features: positions, directions, surfaces and volumes in space.
In 3-D space, it is convenient to represent coordinates in the spherical coordinate
system. As shown in Figure 2.1(a), the spherical coordinates (r, θ, φ) of a position
vector x ∈ R3 are related to the Cartesian coordinates (x, y, z) through:
(x, y, z) = (r sin θ cos φ, r sin θ sin φ, r cos θ),
where r , ‖x‖ is the distance from the origin, θ is the polar angle, φ is the azimuthal
angle and the operator ‖·‖ denotes Euclidean distance. Direction is specified either
through a pair of polar and azimuthal angles (θ, φ) or through the unit vector φ =
[sin θ cosφ, sin θ sinφ, cos θ]T pointing in the direction of interest. Representations
of position and direction are used interchangeably in parameterizations of functions.
That is, f(r, θ, φ) ≡ f(x) and g(θ, φ) ≡ g(φ).
In 2-D space, it is convenient to represent coordinates with the polar coordinate
system shown in Figure 2.1(b). The polar coordinates (r, φ) of a position vector
x ∈ R2 are related to Cartesian coordinates (x, y) through:
(x, y) = (r cos φ, r sin φ),
where r is the distance from the origin and φ is the polar angle. Direction is
specified through either the angle φ or the unit vector φ , [cos φ, sinφ]T .
We utilize notation which covers the 2-D and 3-D cases simultaneously. Position
is represented with a vector x ∈ R` and direction with a unit vector φ ∈ R
` where
` = 2 for 2-D and ` = 3 for 3-D. We refer to the unit surface S`−1 , {x ∈ R
` :
‖x‖ = 1}, which represents a unit circle S = {(r, φ) : r ≡ 1, 0 ≤ φ < 2π} for ` = 2
and unit spherical shell S2 = {(r, θ, φ) : r ≡ 1, 0 ≤ θ ≤ π, 0 ≤ φ < 2π} for ` = 3.
The integral:∫
S`−1
f(φ)ds(φ),
for ` = 2 represents a line integral over a circle where ds(φ) = dφ is the differential
curve element at φ. For ` = 3, it represents a surface integral over a spherical shell
where ds(φ) = sin θ dθ dφ is the differential surface element at φ.
2.3 Wave Equation Solution
Following [114], we derive the sets of solutions to the Helmholtz wave equation.
These solutions describe the sound pressure of any soundfield inside a source-free
region. They provide an efficient parametrization of the sound pressure in terms
of orthogonal functions, useful for describing the reverberation in signal processing
problems.
We first summarize the frequency domain approach of solving the wave equation
(Section 2.3.1). We derive the general 3-D solution (Section 2.3.2). We then restrict
attention to the soundfield inside a source-free spherical region (Section 2.3.3).
2.3.1 Helmholtz Equation
In this thesis, we seek to study the propagation of sound waves, desiring to know
the pressure p(x, t) at any point x and time t. For an homogeneous medium un-
dergoing inviscid fluid flow, one can linearize the equations governing the dynamic
behavior to the fluid, namely the Euler’s equation, the continuity equation, the
state equation and the adiabatic hypothesis [18], to obtain the wave equation,
∇2p =1
c2∂2p
∂t2, (2.1)
where ∇2 is the Laplacian operator and c is the speed of sound. The wave equation
provides a good description of the propagation of sound waves of small amplitude in
air. It accurately describes the pressure in the soundfield provided |p(x, t)| � ρoc2
where ρ0 is the density of the propagation medium at equilibrium [49, p. 11].
It is convenient to consider (2.1) in the frequency domain [114, p. 18]. The
Fourier transform operator, represented by calligraphic letter F , is defined with
the analysis equation
F{p(x, t)}(ω) =
∫ ∞
−∞
p(x, t)e−iωtdt. (2.2)
Writing the Fourier transform of (2.1) and utilizing the derivative property,
F{
∂p(x, t)
∂t
}
(ω) = −iωP (x;ω),
where P (x;ω) , F{p(x, t)}(ω), (2.1) implies
∇2P (x;ω) + k2P (x;ω) = 0, (2.3)
where k is called the wave number, and is related to angular frequency ω and
wavelength λ through
k =ω
c=
2π
λ.
Equation 2.3 is known as the Helmholtz equation and underlies the soundfield
models of this thesis.
In this thesis we present a frequency domain formulation where the angular
frequency ω is arbitrary but fixed. This formulation applies directly to narrow-band
problems but extends naturally to broadband problems. Because the Helmholtz
equation is linear and time invariant, the steady-state dependence of pressure on
time is obtained through the inverse Fourier transform of P (x;ω),
p(x, t) =1
2π
∫ ∞
−∞
P (x;ω)eiωtdω. (2.4)
Quantities in the frequency domain are written with capital letters and an explicit
dependence on ω.
2.3.2 General Solution
The Helmholtz equation can be solved for sound pressure P (x;ω) in spherical
coordinates using the following separation of variables:
P (x;ω) = X(r;ω)Θ(θ;ω)Φ(φ;ω), (2.5)
where θ, φ and r are the polar angle, azimuthal angle and length of x respectively.
Substituting (2.5) and the expression for the Laplacian operator in spherical coor-
dinates,
∇2(·) =1
r2
∂
∂r
[
r2 ∂
∂r(·)
]
+1
r2 sin θ
∂
∂θ
[
sin θ∂
∂θ(·)
]
+1
r2 sin2 θ
∂2
∂φ2(·), (2.6)
into (2.3) leads to separation into three ordinary differential equations [87, pp. 379-
380]:d2Φ
dφ2+m2Φ = 0, (2.7)
d
dθ
(
sin θdΘ
dθ
)
+
[
n(n + 1) − m2
sin2 θ
]
Θ = 0, (2.8)
1
r2
d
dr
(
r2dX
dr
)
+ k2X − n(n + 1)
r2X = 0, (2.9)
where m and n are integers. The solutions to (2.7) are Φ+(φ) = eimφ and Φ−(φ) =
e−imφ. The finite solutions of (2.8) are Θ(θ) = Pmn (cos θ), the associated Legendre
functions of degree n and order m. Now Pmn (cos θ) = 0 for m > n so m can only
take values −n, . . . , n. From (2.9), the solutions for the radial component X(r) are
the spherical Bessel functions of the first and second kinds of order n, jn(kr) and
yn(kr), with r = ‖x‖. These functions are related to the ordinary Bessel functions
of first and second kind1 respectively through:
jn(x) =
√
π
2xJn+1/2(x),
yn(x) =
√
π
2xNn+1/2(x).
The solutions of the angular functions are conveniently expressed in the spher-
ical harmonic functions Y mn (·) [114]:
Y mn (φ) , Λm
n Pmn (cos θ)eimφ,
where Λmn ,
√
2n+14π
(n−m)!(n+m)!
is a normalization term. To avoid confusion, please
1The functions of second kinds Nn(·) and yn(·) are also known as the Neumann and sphericalNeumann functions respectively.
note that this definition differs from that mentioned in several sources of literature
[18, 49]2. Spherical harmonic functions satisfy the orthogonality property,
∫
S2
Y mn (φ)[Y p
q (φ)]∗ ds(φ) = δmpδnq, (2.10)
where [·]∗ is the complex conjugation operator and δnm is the Kronecker delta
function
δnm ,
{
1, m = n
0, m 6= n.
Spherical harmonic functions form an orthonormal set spanning the unit spherical
shell S2.
Noting that jn(k‖x‖)Y mn (x) and yn(k‖x‖)Y m
n (x) are both solutions to (2.3),
the complete solution is written as [114, p. 186],
P (x;ω) =
∞∑
n=0
n∑
m=−n
[
β(j)nm(ω)jn(k‖x‖) + β(y)
nm(ω)yn(k‖x‖)]
Y mn (x). (2.11)
The sound pressure in any soundfield is fully represented by the set of coefficients
{β(j)nm(ω)}n∈Z∗
⋃
{β(y)nm(ω)}n∈Z∗ where Z
∗ = {0, 1, 2, . . .} is the set of non-negative
integers. Conversely any function P (x;ω) that cannot be written in this form does
not satisfy the Helmholtz equation and is not a valid pressure field3.
In the next section, we present a set of solutions for the case where no sound
sources lie in the region of interest. We shall see that the spherical Neumann
function coefficients are all zero for this case.
2.3.3 Interior Field Solution
The soundfield inside a source free spherical region is known as the interior field.
Define a spherical ball centered about the origin of radius R, B` = {x ∈ R
` : ‖x‖ ≤R}, as shown in Figure 2.2, excludes all sound sources. In this section we write the
general solution in the interior of B` for the case ` = 3.
The interior field solution consists of the solutions of the Helmholtz equation
(2.11) that are finite at the origin. Since the spherical Neumann function yn(k‖x‖)becomes infinite at the origin (limx→0+ yn(x) = −∞), each β
(y)nm(ω) must be zero4.
The general interior field solution is hence:
P (x;ω) =∞
∑
n=0
n∑
m=−n
βnm(ω)jn(k‖x‖)Y mn (x). (2.12)
2The definition in this thesis has the property that Y −mn (φ) = (−1)m[Y m
n (φ)]∗.3One such example of an invalid pressure field is P (x; ω) ≡ 1 for x ∈ R
3.4The spherical Bessel function jn(‖x‖) remains finite for x ∈ R
3.
O
Sources
Region B`
R
Sources
x
Figure 2.2: Illustration of the interior field problem, where the pressure at a pointx in the source-free ball B
`, with ` = 2, 3, is determined.
The coefficients βnm(ω) shall be called the soundfield coefficients and functions
jn(k‖x‖)Y mn (x) the modes of the soundfield5. The radial dependence of the modes
is illustrated in Figure 2.3, where several of the low order Bessel functions are
shown. The soundfield coefficients characterize the soundfield completely. All
information about the reverberant field within B3 is specified through coefficients
βnm(ω).
The soundfield coefficients can be determined from the sound pressure P (x;ω).
If the soundfield pressure function P (x;ω) is known, the associated soundfield
coefficients can be determined from P (x;ω). Multiplying both sides of (2.12) by
[Y pq (x)]∗, integrating over S
2 and applying the orthogonality property (2.10) yields:
βqp(ω) =1
jq(k‖x‖)
∫
S2
P (x;ω)[Y pq (x)]∗ds(x), (2.13)
provided jq(k‖x‖) 6= 0.
Some of the zeros of the spherical Bessel functions are shown in Figure 2.3.
For each spherical Bessel function the zeros are spaced approximately 2π apart
[114, p. 118]. For frequencies at which jq(k‖x‖) = 0, the associated soundfield
coefficients {βqp(ω)}qp=−q can be found with an equation derived by performing the
integration over B3 rather than S
2 [51]. Alternative is to perform integration over
a pair of concentric spherical shells, and is presented in Chapter 5.
The usefulness of the modal expansion (2.12) originates from the following
properties:
(i) An efficient parametrization of a soundfield over B3. As is exploited in Chap-
5The term mode contrasts with use of the term in the audio field [66], where it refers tofrequency resonances. In this thesis, each mode represents a particular standing wave.
0 2 4 6 8 10 12 14 16 18 20−0.4
−0.2
0
0.2
0.4
0.6
0.8
1
x
j n(x)
j0(x)
j1(x)
j2(x)
j5(x)
j10
(x)
Figure 2.3: Plot of the spherical Bessel function jn(x) for n ∈ {0, 1, 2, 5, 10}.
ter 5, the modal expansion allows representation of the soundfield in B3 with
a small number of terms. In fact, (2.12) can be truncated to n ≤ N = dkRewhere operator d·e is the integer ceiling function, creating an expansion with
(N + 1)2 terms [48, 106].
(ii) The orthogonality of the spherical harmonic functions. It ensures simple ex-
pressions when integrating linear and certain quadratic functions of (2.12)
over the spherical shell S2.
In the next section, we describe the principal model of a reverberant field used
in this thesis.
2.4 Diffuse Field Model of Reverberation
A general model of soundfield pressure, largely free from the geometric details of
the room, is an important part of this thesis. We describe such a model in this
section.
We differentiate in the model between direct and reverberant components. The
direct component Pd(x;ω) is the part of the sound that arrives from a sound source
directly, without reflection or diffraction (shown in blue in Figure 1.3). The rever-
berant component Pr(x;ω) is what remains after subtraction of the direct compo-
O Reverb
ReverbSouce
Reverb −φ2
B3
R
−φ1
−φ3
x
Figure 2.4: Representation of a soundfield resulting from a component comingdirectly from the sound source and generalized diffuse component consisting ofplane waves originating from directions φn for n = 1, . . . , 3. Each plane wave hasamplitude ξr(φn;ω).
nent6. Complex sound pressure P (x;ω) can then be expressed
P (x;ω) = Pd(x;ω) + Pr(x;ω). (2.14)
The direct component Pd(x;ω) is given by free-field expressions for sound pressure
(1.3) and (1.4). We shall model the reverberant component of sound pressure with
a generalized diffuse field, described below. In this model we use statistical room
acoustics, where the pressure Pr(x;ω) at each position x modelled as a random
variable.
The relative size of direct and reverberant components at any point x in a
soundfield is described by the direct-to-reverberant ratio γP (x;ω) [57, 67]. The
direct-to-reverberant ratio is the ratio of the energy density of the direct component
to the energy density of the reverberant component. We can write:
γP (x;ω) ,|Pd(x;ω)|2
E{|Pr(x;ω)|2} , (2.15)
where E{·} is the expectation operator. The expectation is used to obtain an
ensemble average of reverberant energy density.
6The utility of the distinction arises from the ease of predicting the direct component. Thedirect component is a function of only the distance to the sound source and source strength.Further, for many signal processing algorithms to be practical, the direct signal must comprisea significant proportion of the reverberated signal energy [13, 99, 108]. For example, data inde-pendent beamforming, which operates by aligning direct components, is ineffective if the directenergy is negligible.
2.4.1 Basic Description
We start with a concise mathematical description of the reverberant sound pressure
in the diffuse field, and extend the definition to the generalization of the diffuse
field. The key to this description is in modelling the sound pressure at each point
in a room as a random variable.
In a diffuse field, the sound pressure at any point results from superposition of
planar waves. We can represent this fact in equation form as [18, p. 53]:
Pr(x;ω) =
∫
S`−1
ξr(φ;ω)e−ikx·φds(φ), (2.16)
where the reverberation geometry function ξr(φ;ω) describes the complex ampli-
tude of the planar wave propagating from each direction φ and e−ikx·φ describes
the spatial variation of the plane wave with position. The basic idea is illustrated
in Figure 2.4. |ξr(φ;ω)| represents the amplitude and ∠ξr(φ;ω) the phase of each
plane wave. ξr(φ;ω) is written as a spatial random process in the variable φ. This
equation is applicable to both 2-D (` = 2) and 3-D (` = 3) spaces.
The diffuse field is usually defined in terms of plane waves. To quote some
textbook definitions of a diffuse soundfield: “plane waves are incident from all
directions with equal probability and random phase” [28, p. 157] and “the sound
energy arriving at any point inside the enclosure is uniformly distributed over all
possible directions of incidence” [57, p. 1110].
Such textbook definitions are substantiated by the literature where these prop-
erties are more precisely defined [19, 57, 59, 102]. The field is only diffuse if the
amplitudes of plane waves from different angles of incidence are statistically inde-
pendent [59]. The relative phases of the plane waves are uniformly distributed [60]
and the amplitudes often modelled with equal intensity [57,59].
These properties of the plane waves comprising the diffuse field are neatly sum-
marized into the following definition.
Definition 2.4.1 (The Diffuse Field) In the diffuse field, the pressure at each
position x is described by (2.16) where the reverberant geometry function ξr(θ;ω) =
|ξr(θ;ω)|ei∠ξr(φ;ω) is a random variable possessing the following properties:
(i) The phase ∠ξr(φ;ω) of each plane wave is uniformly distributed over [0, 2π]
and independent of the plane wave amplitude |ξr(φ;ω)|.
(ii) The random variables ξr(φ;ω) and ξ
r(ϕ;ω) are independent for all φ 6= ϕ.
(iii) The distribution of ξr(φ;ω) is invariant with φ.
We extend and generalize the diffuse field to encompass fields with a directional
character, so that distribution of plane waves in such a field can be non-uniform.
We make this generalization by relaxing the property (c) of Definition 2.4.1:
Definition 2.4.2 (The Generalized Diffuse Field) In the generalized diffuse
field, the pressure at each position x is described by (2.16) where the reverberant
geometry function ξr(θ;ω) possesses the following properties:
(i) The phase ∠ξr(φ;ω) of each plane wave is uniformly distributed over [0, 2π]
and independent of the plane wave amplitude |ξr(φ;ω)|.
(ii) The random variables ξr(φ;ω) and ξ
r(ϕ;ω) are independent for all φ 6= ϕ.
The distribution of ξr(φ;ω) is no longer invariant with φ. In particular the variance
σ2ξ (φ;ω) , E{|ξr(φ;ω)|2} can be an arbitrary function of φ. In the next section
we derive the statistical properties of ξr(φ;ω) common to both definitions.
2.4.2 Statistical Properties
We now statistically describe the relationship of the sound pressure in a general-
ized diffuse soundfield. These statistical properties are utilized in the performance
analyses of Chapters 3 and 4. We start by establishing the underlying statistical
properties of the reverberant geometry function.
The Reverberant Geometry Function
Theorem 2.4.1 below summarizes some properties implied for the reverberant ge-
ometry function ξr(φ;ω) by the common property (i) of Definitions 2.4.1 and 2.4.2.
Theorem 2.4.1 For a random variable ξr= |ξ
r|ei∠ξr whose phase ∠ξ
ris uniformly
distributed over [0, 2π] and independent of amplitude |ξr|:
(i) ξris zero mean.
(ii) The real and imaginary parts of ξrare uncorrelated.
(iii) The variances of the real and imaginary parts of ξrare identical.
The proof is in the appendix at the end of this chapter.
We next write the covariance of the reverberant geometry functions between
two directions φ and ϕ. Before we do so, we define the following Dirac delta
functions. Let the delta functions δ`−1(φ − ϕ), ` = 2, 3 satisfy the sifting property
of the surface integral:
∫
S`−1
f(φ)δ`−1(φ − ϕ)ds(φ) = f(ϕ), ` = 2, 3. (2.17)
Delta functions that satisfy the sifting property (2.17) are, for ` = 2 (φ, ϕ ∈ R2):
δ(φ − ϕ) , δ(φ− ϕ), (2.18)
where φ and ϕ represent polar angles φ and ϕ respectively, and for ` = 3 (φ, ϕ ∈R
3):
δ2(φ − ϕ) , δ(φ− ϕ)δ(cos θ − cos ϑ), (2.19)
where φ and ϕ represent the polar and azimuthal angle pairs (θ, φ) and (ϑ, ϕ)
respectively.
From property (b) of Definitions 2.4.1 and 2.4.2, functions ξr(φ;ω) and ξr(ϕ;ω),
φ 6= ϕ, are independent. The implication for the covariance is summarized as
follows.
Observation 2.4.1 For the random process ξr(φ;ω) of the generalized diffuse field
defined in Definition 2.4.2, the covariance can be written
E{ξ∗r(φ;ω)ξ
r(ϕ;ω)} = σ2
ξ (φ;ω)δ`−1(φ − ϕ), (2.20)
where σ2ξ (φ;ω) = E{|ξ
r(φ;ω)|2} and the Dirac delta function δ`−1(φ− ϕ), ` = 2, 3
satisfies sifting property (2.17).
This statistical relation is used to derive the covariance of the reverberant field
pressure Pr(x;ω) between two points.
Generalized Diffuse Field Pressure
We now identify statistical properties of the sound pressure in the generalized
diffuse soundfield model.
It can readily be proved that complex sound pressure in a diffuse field has a
complex Gaussian distribution [110]. Provided certain conditions on the reverber-
ant geometry function are upheld, sound pressure in the generalized diffuse field
can also be seen to be complex Gaussian:
Theorem 2.4.2 (Gaussianity of Complex Pressure) The sound pressure Pr
(x;ω) in a generalized diffuse field, defined by (2.16), is a complex Gaussian random
variable provided:
(i) both real and imaginary parts of the integrand ξr(φ;ω)e−ikx·φ in (2.16) satisfy
the Lindeberg condition and
(ii) the domain of angles over which ξr(φ;ω) is nonzero is a region with nonzero
measure.
The Gaussianity property follows from application of the Lindeberg-Feller Central
Limit Theorem [21, p. 58]. The Lindeberg condition is a prerequisite for application
of this theorem. It requires the variances of individual random variables are small
compared to their integral sum so that no single angle φ has a variance σ2ξ (φ;ω)
that dominates. Condition (ii) ensures the integral (2.16) represents an infinite
sum of random variables, another prerequisite for application7.
The mean and covariance of sound pressure, are summarized as follows.
Theorem 2.4.3 (Statistics of Complex Pressure) The sound pressure Pr(x;
ω) in a generalized diffuse field possesses a zero mean E{Pr(x;ω)} = 0, a covari-
ance of:
E{P ∗r(x;ω)P
r(x + r;ω)} =
∫
S`−1
σ2ξ (φ;ω)e−ikr·φds(φ), (2.21)
and the circularity property:
E{Pr(x;ω)P
r(x + r;ω)} = 0, (2.22)
where σ2ξ (φ;ω) , E{|ξ
r(φ;ω)|2} and the reverberant geometry function ξ
r(φ;ω)
obeys the properties of Definition 2.4.2.
The proof is in the appendix at the end of this chapter. An important observation
is that the covariance is independent of the reference position x. The covariance is
dependent only on the displacement r between points. The diffuse property of the
diffuse soundfield [85], that mean square pressure E{|Pr(x;ω)|2} is independent of
position, is seen from (2.21):
E{|Pr(x;ω)|2} =
∫
S`−1
σ2ξ (φ;ω)ds(φ). (2.23)
We are able to reveal more insight into (2.21) by considering the spherical
harmonic expansion of σ2ξ (φ;ω). In the 3-D case:
σ2ξ (φ;ω) =
∞∑
n=0
n∑
m=−n
χnm(ω)Y mn (φ), (2.24a)
χnm(ω) =
∫
S2
σ2ξ (φ;ω)[Y m
n (φ)]∗ds(φ), (2.24b)
where χnm(ω) shall be called the reverberant geometry variance coefficients. Since
σ2ξ (φ;ω) is real, χ∗
nm(ω) = (−1)mχn(−m)(ω). The covariance can be expressed in
terms of these coefficients. Substituting (2.24a) and the Jacobi-Anger expansion
[18],
e−ikr·φ = 4π
∞∑
n=0
n∑
m=−n
(−i)njn(k‖r‖)Y mn (r)[Y m
n (φ)]∗, (2.25)
where r , r/‖r‖, into (2.21) and applying the orthogonality property (2.10), the
7For example, the soundfield in Figure 2.4 does not possess Gaussian pressure, since here thedomain over which ξr(φ; ω) is nonzero, namely {φ1, φ2, φ3}, is a region with zero measure.
covariance reduces to
E{P ∗r (x;ω)Pr(x + r;ω)} = 4π
∞∑
n=0
n∑
m=−n
(−i)nχnm(ω)jn(k‖r‖)Y mn (r). (2.26)
Note the similarity of this covariance relation to the interior field solution (2.44) for
sound pressure. The covariance is a linear combination of the modes {jn(k‖r‖)Y mn
(r) : n ∈ Z∗, m = −n, . . . , n} of the soundfield.
The generalized diffuse field has application to room acoustics modelling. It
preserves the diffuse property of a diffuse soundfield, while extending it to describe
fields with a directional character. Fields possessing a directional character are
quite common. In concert halls, sound absorbers are often strategically placed,
and the room geometry selected to limit range of the directions of propagation of
reverberant reflections. In an office with a heavy curtain covering one wall, the
curtain, acting as a sound absorber, modifies the directional distribution of the
reverberation. The generalized diffuse field could provide a first order model for
such room inhomogeneities.
Summarizing the results of this section, sound pressure in a generalized dif-
fuse field is circularly complex Gaussian, with a covariance given by (2.21). The
main use of these results is in Chapters 3 and 4, where we study the reverber-
ant performance of combined beamforming and equalization, and combined source
localization and tracking.
2.5 Geometric Representation of Reverberation
One way to represent a reverberant field is geometrically, by specifying the positions
of a set of sound sources that can generate it. This was already done in the
generalized diffuse field model, by defining the field in terms of plane waves, which
can be viewed as originating from farfield sound sources. Actually, by appropriately
arranging sound sources over a sphere, any soundfield can be constructed. We
present such a geometric representation of a soundfield here.
Consider the soundfield created by distributing sources, which we call reverber-
ant sources, over a spherical shell of fixed radius R′ > R concentric with the region
of interest B3. We describe the complex amplitude of the reverberant source at
each point R′φ on the shell with the reverberant geometry function ξr(φ;ω). The
sound pressure Pr(x;ω) can then be calculated as:
Pr(x;ω) =
∫
S2
ξr(φ;ω)e−ik‖R′φ−x‖
4π‖R′φ − x‖ds(φ). (2.27)
For ξr(φ;ω) appropriately chosen, any reverberant soundfield can be represented
by (2.27). This fact is shown by demonstrating the equivalence of (2.27) to the
general modal form (2.12), as will now be shown.
Perform a spherical harmonic expansion on the reverberant geometry function:
ξr(φ;ω) =∞
∑
n=0
n∑
m=−n
ψnm(ω)Y mn (φ), (2.28a)
ψnm(ω) =
∫
S2
ξr(φ;ω)[Y mn (φ)]∗ ds(φ), (2.28b)
where ψnm(ω) are the spherical harmonic coefficients of ξr(φ;ω). These coefficients
shall be called the reverberant geometry coefficients. These coefficients are closely
related to the reverberant field coefficients of modal form (2.12). Substituting
(2.28a) and the spherical harmonic expansion [18, p. 30],
e−ik‖y−x‖
4π‖y − x‖ = −ik∞
∑
n=0
n∑
m=−n
h(2)n (k‖y‖)[Y m
n (y)]∗ jn(k‖x‖)Y mn (x), ‖y‖ > ‖x‖,
(2.29)
where y = y/‖y‖ into (2.27), and applying the orthogonality property (2.10) yields:
Pr(x;ω) = −ik∞
∑
n=0
n∑
m=−n
ψnm(ω)h(2)n (kR′)jn(k‖x‖)Y m
n (x). (2.30)
Equation 2.30 is of the form of (2.12) with the reverberant soundfield coefficients
given by:
β(r)nm(ω) = −ikh(2)
n (kR′)ψnm(ω). (2.31)
Since spherical Hankel functions h(2)n (x) have no zeros, this equation can always
be solved for ψnm(ω). For any given field inside B3 specified through reverberant
soundfield coefficients β(r)nm(ω), we can always choose a reverberant source geometry
in (2.27) that will generate that field:
ξr(φ;ω) =∞
∑
n=0
n∑
m=−n
i
k
β(r)nm(ω)
h(2)n (kR′)
Y mn (φ). (2.32)
The relationships between sound pressure Pr(x;ω), geometric parameter ξr(φ;ω)
and modal parameters β(r)nm(ω) and ψnm(ω) are summarized in Figure 2.5.
In summary, the spatial variation of sound pressure in a reverberant soundfield
can equivalently be interpreted modally (as the sum of orthogonal functions) or
geometrically (as reverberant sources arranged over a sphere). For the interested
reader, Appendix A at the end of this thesis explores the modal interpretation for
the generalized diffuse field. There we determine the statistical properties of the
modal coefficients implied by the generalized diffuse field model.
Pressure{
Pr(x) : x ∈ S2}
Sound FieldCoefficients
{
β(r)nm : n = 0, 1, . . . ,
∞, m = n, . . . , n}
{
ξr(φ) : φ ∈ S2}
∫
S2
ξr(φ)e−ik‖x−Rφ‖
4π‖x −Rφ‖ds(φ)
∑
n,m
ψnmYmn (φ)
Reverberant Geometry
∑
n,m
β(r)nmjn(k‖x‖)Y m
n (x)
∫
S2
ξr(φ)[Y mn (φ)]∗ds(φ)
1
jn(k‖x‖)
∫
S2
Pr(x)[Y mn (x)]∗ds(x)
{
ψnm : n = 0, 1, . . . ,
∞, m = −n, . . . , n}
Coefficients
Sound Field Reverberant GeometryFunction
iβ(r)nm
kh(2)n (kR)
−ikh(2)n (kR)ψnm
[*]
Figure 2.5: Transformation map of parameters specifying a soundfield ina source-free region. Dependence on angular frequency ω is suppressedfor brevity. The sum
∑
n,m is short for∑∞
n=0
∑nm=−n. Equation [*] is
−ik∞
∑
n=0
n∑
m=−n
[h(2)n (kR′)/jn(k‖x‖)]
∫
S2
Pr(x)[Y mn (x)]∗ds(x)Y m
n (φ).
2.6 Geometric Configurations of Reverberation
In this section we describe several geometric configurations of reverberant sources,
appropriate for studying the relationship between the geometric parameters ξr(φ;ω)
and σ2ξ (φ;ω) and properties of the resultant soundfields. Although these config-
urations do not describe real reverberant environments exactly, they are useful
idealizations.
We investigate the reverberant source configurations shown in Figure 2.6. These
configurations have in common the geometric parameter set equal to 1 over the
subset D2 of the unit sphere:
U(φ) =
{
1, φ ∈ D2
0, otherwise,(2.33)
where U(φ) = σ2ξ (φ;ω) for studying the generalized diffuse field and U(φ) =
ξr(φ;ω) for studying general soundfield properties. The configurations are de-
scribed below, along with acoustic conditions under which each geometry can be
applied.
(e)(a) (b) (c) (d)
θc
Figure 2.6: Configurations of reverberant sources around a sphere. (a) Isotropicshell. (b) Conical sector with half cone angle θc. (c) Spherical slice. (d) Circularring (e) Circular arc.
By Theorem 2.4.2, provided D2 is a region of nonzero measure, the sound pres-
sure in this generalized diffuse field is complex Gaussian. Since the variance of
ξr(φ;ω) is constant over D2, the set of random variables {ξr(φ;ω)e−ikx·φ : φ ∈ D
2}obeys the Lindeberg condition8.
We now calculate the associated soundfield and geometry variance coefficients
for these configurations. Substituting (2.33) into (2.24a) or (2.28b), the equation
for the spherical harmonic coefficient in either case reduces to:
υnm =
∫
D2
[Y mn (φ)]∗ds(φ), (2.34)
where υnm = ψnm(ω) or υnm = χnm(ω). The resulting expressions for the coef-
ficients are summarized in Table 2.1. Using these coefficient expressions we can
study:
(i) the sound pressure (2.27) in an arbitrary soundfield and
(ii) the sound pressure covariance (2.26) in a generalized diffuse field,
as a function of the reverberant source configuration. The configurations themselves
are now described.
Spherical Shell
In this configuration we evenly distribute a continuum of reverberant sources over
a spherical shell (Figure 2.6(a)). The spherical shell yields the simplest field prop-
erties of all the source geometries. Setting D2 = S
2 in (2.34), we see the soundfield
given by (2.27) is composed of only one mode, j0(kx)Y00 (x). The generalized diffuse
field reduces to the classical diffuse field.
Spherical Sector
Here, we uniformly distribute a continuum of reverberant sources over the sector
of the sphere. Two spherical sectors considered are the conical sector D2cone =
8The Lindeberg condition in general holds when no random variable has a dominating variance.Here they all have the variance σ2
ξ (φ; ω).
{(θ, φ) : 0 < θ < θc, 0 < φ < 2π} (Fig. 2.6(b)) and the spherical slice D2slice =
{(θ, φ) : 0 < θ < π, φ0 − ∆φ < φ < φ0 + ∆φ}. (Fig. 2.6(c)). The equations
for the soundfield parameters are summarized in Table 2.1.9 The spherical sector
is useful for describing non-spherically symmetrical fields where the reverberation
comes only from a certain range of directions.
Circular Ring and Circular Arc
For this configuration, we uniformly distribute a continuum of reverberant sources
over the circular ring D2ring = {(θ, φ) : θ = π
2, 0 < φ < 2π} (Fig. 2.6(d)) and circular
arc D2arc = {(θ, φ) : θ = π
2, φ0 − ∆φ < φ < φ0 + ∆φ}, both centered about origin
in the x-y plane. These geometries are the 2-D analogs of the conical sector and
spherical slice respectively. The circular ring generates a diffuse-like field in the x-y
plane. We expect these geometries to describe the reverberant field best in rooms
with a highly sound-absorbing floor and ceiling.
As is the case when reverberant source geometries are axisymmetric about the
z-axis (including the conical sector geometry), we see from Table 2.1 that the
coefficients χnm(ω) and ψnm(ω) for which m 6= 0 are zero.
Image-Source Model
It is noteworthy to contrast the above-mentioned reverberant source geometries
with the image-source model [4] described in Chapter 1. In the image-source model,
the reverberant field is written as the sum of the soundfield produced by each
image-source,
Pr(x;ω) =∞
∑
`=1
ζ`(ω)e−ik‖y`−x‖
4π‖y` − x‖ , (2.35)
where ζ`(ω) and y` denote the accumulated reflection coefficient and position for
the `th image-source, respectively.
Here the reverberant geometry coefficients ψnm(ω) are found from β(r)nm(ω) using
(2.31), where β(r)nm(ω) are obtained by substituting (2.29) into (2.35) and comparing
with (2.44). The result is:
ψnm(ω) =
∞∑
`=1
ζ`(ω)h
(2)n (k‖y`‖)h
(2)n (kR′)
[Y mn (y`)]
∗. (2.36)
This spherical harmonic expansion is utilized in Chapter 3.
9The following expression was used to derive the coefficients for the spherical slice case:
∫ φ0+∆φ
φ0−∆φ
eimφdφ = 2∆φeimφ0j0(m∆φ).
Configuration χnm(ω) or ψnm(ω)
Spherical shell√
4πδm0δn0
Conical sector 2πΛ0n
∫ 1
cos θc
Pn(u)du δm0
Spherical slice 2∆φΛmn e
−imφ0j0(m∆φ)
∫ 1
−1
Pmn (u)du
Circular ring 2πΛ0nPn(0) δm0
Circular arc 2∆φΛmn e
−imφ0j0(m∆φ)Pn(0)
Arbitrary D2
∫
D2
[Y mn (φ)]∗ds(φ)
Table 2.1: Reverberant geometry variance coefficients for various geometric config-urations in a generalized diffuse field.
Equation (2.36) describes the geometric mapping of the image-sources onto a
sphere of radius R′. In contrast to the simple geometries mentioned above, ψnm(ω)
and hence ξr(φ;ω) depend nonlinearly10 on the angular frequency ω.
2.7 Two Dimensional Case
A simpler reverberant performance analysis is offered with a two dimensional sound-
field model. For that reason we sometimes make use of them in this thesis. We
shall see that the relationships and properties closely correspond to those of the
3-D case.
We start this section by outlining the derivation of the interior field solution
to the wave equation (Section 2.7.1). We describe the differences from the 3-D
case in the generalized diffuse model (Section 2.7.2). We then apply the geometric
interpretation to the 2-D case and define reverberant source geometries (Sections
2.7.3 and 2.7.4).
10If all image-sources lie in the farfield however, the dependence on angular frequency is signif-icantly simpler. Rewriting (2.35) for farfield image-sources:
Pr(x; ω) =
∞∑
`=1
ζ`(ω)e−ikx·y` .
Using the sifting property (2.17), this equation is rewritten:
Pr(x; ω) =
∫
S2
∞∑
`=1
ζ`(ω)δ2(φ − y`)e−ikx·φds(φ). (2.37)
Comparing (2.37) with (2.16), we can identify the reverberant geometry function as ξr(φ; ω) =∑
∞
`=1ζ`(ω)δ2(φ − y`). In cases where the wall reflection coefficients ζ`(ω) are independent of
angular frequency, so is ξr(φ; ω).
2.7.1 Wave Equation Solution
Following [114], we solve the Helmholtz equation (2.3) for sound pressure P (x;ω)
using the separation of variables
P (x;ω) = X(r;ω)Φ(φ;ω). (2.38)
Substituting (2.38) and the expression for the Laplacian operator in polar coordi-
nates:
∇2(·) =∂2
∂r2+
1
r
∂
∂r+
1
r2
∂2
∂φ2(2.39)
into (2.3) leads to separation into two ordinary differential equations [114, pp. 117]:
d2Φ
dφ2+ n2Φ = 0, (2.40)
∂2X
∂r2+
1
r
∂X
∂r+
(
k2 − n2
r2
)
X = 0, (2.41)
where φ and r are the polar angle and length of x respectively. Solutions to (2.40)
are again Φ+(φ) = einφ and Φ−(φ) = e−inφ with n integer. Solutions to (2.41) are
the Bessel functions of the first and second kinds with order n, Jn(kr) and Nn(kr).
The exponential functions summarize the angular dependence of the solution. They
possess the orthogonality property,
∫ 2π
0
e−inφeimφdφ = 2πδnm. (2.42)
Writing the complete solution to the Helmholtz equation in R2,
P (x;ω) =∞
∑
n=−∞
[
β(J)n (ω)Jn(k‖x‖) + β(N)
n (ω)Nn(k‖x‖)]
einφ. (2.43)
Sound pressure is fully represented by the set of coefficients {β(J)n (ω)}n∈Z
⋃
{β(N)n (ω)}n∈Z
where Z is the set of integers. For the interior field problem, the field in a region
B2 = {x ∈ R
2 : x ≤ R} absent of any sources reduces to:
P (x;ω) =∞
∑
n=−∞
βn(ω)Jn(k‖x‖)einφ, (2.44)
where {βn(ω)}n∈Z are the 2-D soundfield coefficients. Here sound pressure is ex-
pressed in terms of the 2-D modes {Jn(k‖x‖)einφ}n∈Z of the soundfield. Similar to
the 3-D case in (2.13), each βn(ω) can be determined with the relation:
βn(ω) =1
2πJn(kx)
∫ 2π
0
P (x;ω)e−inφdφ, (2.45)
provided Jn(kx) 6= 0. The advantage of the modal expansion (2.44) lies in its
efficient parametrization of the soundfield (c.f. Chapter 5) and the functional de-
pendence of terms on orthogonal functions.
2.7.2 Generalized Diffuse Field
The generalized diffuse field, as defined in Definition 2.4.2 encompasses both 2-D
and 3-D cases. Field properties are still governed by Theorems 2.4.2 and 2.4.3 but
the Fourier expansion of the spatial covariance is different from the 3-D case.
For 2-D plane waves, (2.16) reduces to:
Pr(x;ω) =
∫ 2π
0
ξr(φ;ω)e−ikx·φdφ. (2.46)
The spatial covariance property (2.21) becomes:
E{P ∗r (x;ω)Pr(x + r;ω)} =
∫ 2π
0
σ2ξ (φ;ω)e−ikr·φdφ, (2.47)
where σ2ξ (φ;ω) , E{|ξr(φ;ω)|2}. The Fourier expansion of σ2
ξ (φ;ω) is written as:
σ2ξ (φ;ω) =
∞∑
n=−∞
χn(ω)einφ, (2.48a)
χn(ω) =1
2π
∫ 2π
0
σ2ξ (φ;ω)e−inφdφ. (2.48b)
Substituting the Jacobi-Anger expression [18],
e−ikx·y =∞
∑
n=−∞
(−i)ne−inφyJn(kx)einφx , (2.49)
into (2.47) and applying (2.48b) we show that:
E{P ∗r (x;ω)Pr(x + r;ω)} = 2π
∞∑
n=−∞
(−i)nχn(ω)Jn(kr)einφr , (2.50)
where φr is the polar angle of r.
2.7.3 Geometric Representation of Reverberation
We now present the geometric interpretation of the reverberant field in the 2-D
case. Any soundfield in the circular region B2 can be interpreted as resulting from
a continuum of reverberant sources arranged on a circle of radius R′ > R concentric
with B2:
Pr(x;ω) =
∫ 2π
0
ξr(φ;ω)H(2)0 (k‖R′φ − x‖) dφ, (2.51)
where H(2)n (·) is the Hankel function of the second kind and order n. The term
H(2)0 (k‖R′φ − x‖) is the fundamental solution of the Helmholtz equation in R
2.
Performing a Fourier expansion on ξr(φ;ω):
ξr(φ;ω) =
∞∑
n=−∞
ψn(ω)einφ, (2.52a)
ψn(ω) =1
2π
∫ 2π
0
ξr(φ;ω)e−inφ dφ, (2.52b)
and applying the addition property of the Hankel function [18, p. 67]:
H(2)0 (k‖x − y‖) =
∞∑
n=−∞
H(2)n (ky)e−inφyJn(kx)einφx ,
where φx and φy are the polar angles of x and y respectively, the field representation
in (2.51) is shown equivalent to the 2-D modal representation (2.44) with
β(r)n (ω) = 2πH(2)
n (kR′)ψn(ω). (2.53)
2.7.4 Geometric Configurations of Reverberation
For soundfield analysis performed in 2-D in this thesis, we only use one reverberant
source configuration - the circular arc. Here we set geometric parameters equal to
1 over the subset of angles φ ∈ [φ0 − ∆φ, φ0 + ∆φ]:
U(φ) =
{
1, φ0 − ∆φ ≤ φ < φ0 + ∆φ
0, otherwise,(2.54)
where U(φ) = σ2ξ (φ;ω) for a generalized diffuse field and U(φ) = ξr(φ;ω) for
studying general soundfield properties.
By Theorem 2.4.2, provided ∆φ 6= 0, the sound pressure in this generalized
diffuse field is complex Gaussian. Because the variance of ξr(φ;ω) is constant over
[φ0 − ∆φ, φ0 + ∆φ], the Lindeberg condition holds for the set of random variables
{ξr(φ;ω)e−ikx·φ : φ ∈ [φ0 − ∆φ, φ0 + ∆φ]}.Substituting (2.54) into (2.48b) or (2.52b), the equation for the spherical har-
monic coefficients in either case reduces to:
υn =
∫ φ0+∆φ
φ0−∆φ
e−inφdφ = 2∆φe−inφ0j0(n∆φ).
for υn = ψn(ω) or υn = χn(ω). When ∆φ = π, the reverberant source configuration
is a circular ring, in which case the generalized diffuse field reduces to the classical
2-D diffuse field described in [59].
2.8 Summary and Contribution
In this chapter we have described different models for the sound pressure in a
reverberant room. We have developed a generalized diffuse field model, and a
representation of a sound field as a single layer potential.
We now itemize the contributions made in this chapter:
i. A generalization of the diffuse field has been made to describe soundfields
where the sound propagates in as plane waves from a subset of directions.
Here the complex sound pressure is shown to be a circularly complex Gaussian
distribution and an expression for spatial covariance determined.
ii. Application of the single layer potential to create a new interpretation where
a soundfield results from nearfield reverberant sources lying on a shell. We
explored the relationship to the modal expansion of sound pressure for the
interior field problem.
iii. A classification of basic reverberant source geometries has been made, which
can be used to explore the statistical properties of the generalized diffuse field
and properties of soundfields in general.
In the coming chapters, we will utilize the soundfield models to analyze the
performance of different signal processing algorithms. In Chapters 3 and 4, these
models will be used to explore equalization and source localization algorithms re-
spectively. In Chapter 5, the interior field solution is exploited to measure the
soundfield within a portion of the room, in order to facilitate soundfield reproduc-
tion in a reverberant room.
2.9 Appendices
2.9.1 Proof of Theorem 2.4.1
We start by calculating some relevant expectations for the phase ∠ξr. Since ∠ξr
is uniformly distributed over [0, 2π], the expectation of any function f(∠ξr) of the
phase is:
E{f(∠ξr)} =1
2π
∫ 2π
0
f(∠ξr)d(∠ξr).
A simple calculation yields:
E{ei∠ξr} = 0, (2.55)
E{cos(∠ξr) sin(∠ξr)} = 0, (2.56)
E{cos2(∠ξr)} = E{sin2(∠ξr)} =1
2. (2.57)
Property (i) is seen from the independence of the amplitude and phase of ξr and
(2.55), E{ξr} = E{|ξr|}E{ei∠ξr} = 0. The real and imaginary parts of ξr can be ex-
panded as E{Re[ξr]Im[ξr]} = E{|ξr| cos(∠ξr)|ξr| sin(∠ξr)}. From the independence
property and (2.56):
E{Re[ξr]Im[ξr]} = E{|ξr|2}E{cos(∠ξr) sin(∠ξr)} = 0.
Finally, the independence property implies:
E{Re2[ξr]} = E{|ξr|2 cos2(∠ξr)} = E{|ξr|2}E{cos2(∠ξr)},
and
E{Im2[ξr]} = E{|ξr|2 sin2(∠ξr)} = E{|ξr|2}E{sin2(∠ξr)}.
Thus by (2.57), E{Re2[ξr]} = E{Im2[ξr]} = 12E{|ξr|2}. �
2.9.2 Proof of Theorem 2.4.3
The zero mean property follows from the linearity of (2.16) and zero mean property
of ξr(φ;ω) in Theorem 2.4.1. From (2.16), the covariance is calculated as:
E{P ∗r (x;ω)Pr(x + r;ω)} =
∫
S`−1
∫
S`−1
E{ξ∗r (φ;ω)ξr(ϕ;ω)}
× eikx·φe−ik(x+r)·ϕds(φ)ds(ϕ).
Applying the reverberant geometry function covariance property (2.20):
E{P ∗r (x;ω)Pr(x + r;ω)} =
∫
S`−1
∫
S`−1
σ2ξ (φ;ω)δ`−1(φ − ϕ)
× eikx·(φ−ϕ)e−ikr·ϕds(φ)ds(ϕ).
Equation 2.21 then follows from applying the sifting property (2.17).
To show (2.22), note that E{ξ2r (φ;ω)} = 0. This property can be seen by
expanding E{ξ2r (φ;ω)} into real and imaginary parts:
E{ξ2r (φ;ω)} = E{Re2[ξr(φ;ω)]} − E{Im2[ξr(φ;ω)]}
+ 2iE{Re[ξr(φ;ω)]Im[ξr(φ;ω)]}. (2.58)
Because their variances are identical, the first and second terms are equal. The
third term of (2.58) is zero by independence of real and imaginary parts of ξr(φ;ω).
Equation 2.22 is then proved in a manner identical to (2.21), by noting that
E{ξr(φ;ω)ξr(ϕ;ω)} = E{ξ2r (φ;ω)}δ`−1(φ − ϕ) = 0.
�