fast variance prediction for iteratively reconstructed ct...

4
1 Fast Variance Prediction for Iteratively Reconstructed CT with Arbitrary Geometries Stephen M. Schmitt and Jeffrey A. Fessler Abstract —Fast variance prediction for iteratively recon- structed CT images is useful for the analysis of reconstruction algorithms and potentially for automatic tube current mod- ulation. Prior methods are either computationally intractable or require impractical computation times to produce a map of the reconstructed image variance. In this paper we present the extension of prior work for fast variance prediction, which was specific to limited classes of CT geometries, to arbitrary CT geometries. We compare the results of our method to an empirical variance map produced from repeated axial CT scans of a chest phantom. I. I NTRODUCTION Iterative reconstruction methods for CT offer improved resolution and noise properties compared to FBP-like re- construction methods [1]. However, the statistical proper- ties of iteratively reconstructed images are more difficult to analyze than those of FBP-like images. Prior work has provided closed form but computationally intractable expressions for the covariance of an iteratively reconstructed CT image [2]. Other work has made evalu- ating this closed form more tractable by using frequency- domain approximations [3]. These methods require com- puting a projection and back-projection of each voxel of interest and are impractical for producing a map of the variance for an entire 3D volume. In this paper, we apply further approximations to the frequency-domain approximation that significantly acceler- ate these methods, allowing us to produce a variance map in less time than methods that require a projection and back-projection. We compare the resulting prediction to an empirical variance map produced from repeated CT scans of the same object. II. METHODS A. Problem Domain In this work, we consider statistically reconstructed im- ages of the form ˆ x = argmin x L(Y; x) + αR (x) (1) Here, L is the negative log-likelihood of the vectorized observations Y given an image vector x. The function R (x) is a regularization penalty. We assume: 1) The covariance of Y is diagonal, and can be estimated from the data and knowledge of the instrumentation. This work was supported in part by NIH grant R01 HL-098686 and by equipment donations from Intel. Stephen M. Schmitt and Jeffrey A. Fessler are with the Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109 USA ([email protected], [email protected]) 2) Given an image x, the elements of Y are statistically independent, and the likelihood of a particular ob- servation Y i is modeled in terms of the projection [Ax] i j a ij x j , such that L(Y; x) = N d i =1 L i (Y i ;[Ax] i ); (2) the matrix A is a projection matrix with elements a ij representing the projection of voxel j onto observa- tion i . We denote the number of observations N d . 3) The regularizer takes the general form R (x) = N C d =1 r d k ψ([C d x] k ). (3) In the common case of a regularizer that penal- izes first differences between neighboring voxels, d indexes the directions over which we take the dif- ferences, C d is a first differencing matrix between voxels in that direction, and r d is the relative strength of the regularizer in that direction. We assume the regularizer penalty ψ is twice-differentiable at 0, and scaled such that ψ (0) = 1. 4) Let ˘ x denote the reconstruction using noise-free data ¯ Y. We assume that the Hessian of the regularizer, evaluated at ˘ x, can be approximated by P: 2 R x) P N C d =1 r d C T d C d . (4) This approximation is accurate except near edges. B. Methods Previous work has computed variance predictions using local frequency domain expressions for A T WA, where W is a diagonal statistical weighting matrix, and for P, using an approximation of local shift-invariance. The local impulse response (LIR) of A T WA for the voxel j is defined by h W j A T WAe j , (5) where e j is defined as the unit vector with a single 1 at position j . This LIR can be written exactly as the impulse e j operated on by a local frequency-domain filter H W j ( ν): h W j = F * D H W j F e j , (6) where D is a “diagonalization” operator: (D {H } X )( ν) = H ( ν) X ( ν), and F is the DSFT with the spatial extent limited by the image support. We refer to H W j as a local frequency response (LFR). In the region near voxel j , A T WA The 13th International Meeting on Fully Three-Dimensional Image Reconstruction in Radiology and Nuclear Medicine 228

Upload: others

Post on 18-Mar-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Fast Variance Prediction for Iteratively Reconstructed CT ...web.eecs.umich.edu/~fessler/papers/files/proc/15/web/schmitt-15-fvp.pdf · In this case, ½ max Æ 1/(2max {jcos ©j,

1

Fast Variance Prediction for IterativelyReconstructed CT with Arbitrary Geometries

Stephen M. Schmitt and Jeffrey A. Fessler

Abstract—Fast variance prediction for iteratively recon-structed CT images is useful for the analysis of reconstructionalgorithms and potentially for automatic tube current mod-ulation. Prior methods are either computationally intractableor require impractical computation times to produce a mapof the reconstructed image variance. In this paper we presentthe extension of prior work for fast variance prediction, whichwas specific to limited classes of CT geometries, to arbitraryCT geometries. We compare the results of our method to anempirical variance map produced from repeated axial CT scansof a chest phantom.

I. INTRODUCTION

Iterative reconstruction methods for CT offer improvedresolution and noise properties compared to FBP-like re-construction methods [1]. However, the statistical proper-ties of iteratively reconstructed images are more difficult toanalyze than those of FBP-like images.

Prior work has provided closed form but computationallyintractable expressions for the covariance of an iterativelyreconstructed CT image [2]. Other work has made evalu-ating this closed form more tractable by using frequency-domain approximations [3]. These methods require com-puting a projection and back-projection of each voxel ofinterest and are impractical for producing a map of thevariance for an entire 3D volume.

In this paper, we apply further approximations to thefrequency-domain approximation that significantly acceler-ate these methods, allowing us to produce a variance mapin less time than methods that require a projection andback-projection. We compare the resulting prediction to anempirical variance map produced from repeated CT scansof the same object.

II. METHODS

A. Problem Domain

In this work, we consider statistically reconstructed im-ages of the form

x = argminx L(Y;x)+αR(x) (1)

Here, L is the negative log-likelihood of the vectorizedobservations Y given an image vector x. The function R(x)is a regularization penalty. We assume:

1) The covariance of Y is diagonal, and can be estimatedfrom the data and knowledge of the instrumentation.

This work was supported in part by NIH grant R01 HL-098686 and byequipment donations from Intel.

Stephen M. Schmitt and Jeffrey A. Fessler are with the Department ofElectrical Engineering and Computer Science, University of Michigan, AnnArbor, MI 48109 USA ([email protected], [email protected])

2) Given an image x, the elements of Y are statisticallyindependent, and the likelihood of a particular ob-servation Yi is modeled in terms of the projection[Ax]i ,

∑j ai j x j , such that

L(Y;x) =Nd∑

i=1Li (Yi ; [Ax]i ); (2)

the matrix A is a projection matrix with elements ai j

representing the projection of voxel j onto observa-tion i . We denote the number of observations Nd .

3) The regularizer takes the general form

R(x) =NC∑

d=1rd

kψ([Cd x]k ). (3)

In the common case of a regularizer that penal-izes first differences between neighboring voxels, dindexes the directions over which we take the dif-ferences, Cd is a first differencing matrix betweenvoxels in that direction, and rd is the relative strengthof the regularizer in that direction. We assume theregularizer penalty ψ is twice-differentiable at 0, andscaled such that ψ′′(0) = 1.

4) Let x denote the reconstruction using noise-free dataY. We assume that the Hessian of the regularizer,evaluated at x, can be approximated by P:

∇2R(x) ≈ P,NC∑

d=1rd CT

d Cd . (4)

This approximation is accurate except near edges.

B. Methods

Previous work has computed variance predictions usinglocal frequency domain expressions for ATWA, where W isa diagonal statistical weighting matrix, and for P, using anapproximation of local shift-invariance.

The local impulse response (LIR) of ATWA for the voxelj is defined by

hWj ,ATWAe j , (5)

where e j is defined as the unit vector with a single 1 atposition j . This LIR can be written exactly as the impulsee j operated on by a local frequency-domain filter HW

j (~ν):

hWj =F∗D

{HW

j

}Fe j , (6)

where D is a “diagonalization” operator: (D {H } X ) (~ν) =H(~ν)X (~ν), and F is the DSFT with the spatial extentlimited by the image support. We refer to HW

j as a local

frequency response (LFR). In the region near voxel j , ATWA

The 13th International Meeting on Fully Three-Dimensional Image Reconstruction in Radiology and Nuclear Medicine

228

Page 2: Fast Variance Prediction for Iteratively Reconstructed CT ...web.eecs.umich.edu/~fessler/papers/files/proc/15/web/schmitt-15-fvp.pdf · In this case, ½ max Æ 1/(2max {jcos ©j,

is typically approximately spatially shift-invariant, leadingto an approximation

[ATWA]k j ≈ eTkF∗D

{HW

j

}Fe j , (7)

for voxel k near voxel j , which is suggested by (5) and (6).Except at the edges of the reconstructed image, P can berepresented in terms of its frequency response R(~ν):

P =F∗D {R}F . (8)

In [4], we develop a separable approximation to HWj :

HWj (~ν) ≈ J (~ν)EW

j (~Θ), (9)

where ~Θ , ~ν/||~ν|| is the angle of ~ν. The utility of thisfactorization is that EW

j is the only term dependent on the

weighting W and voxel location j , but as a function of ~Θrather than ~ν, EW

j is a function of one fewer dimension

than HWj . The J (~ν) term does not depend on the weighting

and voxel location.We describe applying this factorization to accelerate com-

puting variance maps and predicting numerical observerSNRs using frequency-domain methods.

C. Variance Prediction

For reconstructions made under the assumptions de-scribed in Section II-A, the covariance of the reconstruction,denoted Σx, is given approximately as [2]:

Σx ≈ (ATWA+α∇2R(x))−1ATWA·(ATWA+α∇2R(x))−1, (10)

where the diagonal matrices W and W are defined as:

[W]i i ,∂2

∂y2 Li (Yi ; y)

∣∣∣∣y=[Ax]i

(11)

[W]i i , var(Yi ) · ∂2

∂y∂YiLi (Yi ; y)

∣∣∣∣y=[Ax]i

. (12)

Expression (10) is impractical in CT due to the inversionof a large matrix. Approximating the Hessian using (4) andusing the frequency-domain approximations of (7) and (8),finding one element of (10) simplifies to

var(x j ) ≈∫[− 1

2 , 12

]n

HWj (~ν)

(HWj (~ν)+αR(~ν))2

d~ν. (13)

Using the factorization of (9) in (13), we can reduce thisintegral by a dimension:

var(x j ) ≈α−1∫

Sn

EWj (~Θ)

EWj (~Θ)

G(α−1EWj (~Θ),~Θ)d~Θ, (14)

where G(γ,~Θ) is a function defined as

G(γ,~Θ),∫ %max(~Θ)

0

γJ (%,~Θ)

(γJ (%,~Θ)+R(%,~Θ))2%n−1 d%, (15)

and where %max(~Θ) = 1/(2||~Θ||∞) is the maximum extent of% in [−1/2,1/2]n . In general, G cannot be computed in a

closed form, but it is well-behaved and depends only onvoxel shape (which determines J (~ν)) and regularizer (whichdetermines R(~ν)). We precompute a single table of valuesof G and use that table to predict variance maps via (14)for multiple voxels, any regularization parameter α, anyweighting W, any voxel spacing or scan geometry.

Specific to small cone angle 3DCT geometries, in [5] weproposed another factorization like (9). This factorizationis separable not in spherical coordinates (%,~Θ) but incylindrical coordinates (ρ,Φ,ν3):

HWj (~ν) ≈ Jcyl(~ν)EW

j ,cyl(Φ). (16)

Like (9), Jcyl does not depend on the voxel location orweighting; EW

j ,cyl does, but is a function of only the cylinderangle Φ. Like (14), we simplify (13) using (16):

var(x j ) ≈α−1∫ 2π

0

EWj ,cyl(Φ)

EWj ,cyl(Φ)

Gcyl(Φ,α−1EWj ,cyl(Φ))dΦ, (17)

where we define another object-independent function Gcyl:

Gcyl(Φ,γ),∫ ρmax(Φ)

0

∫ 12

− 12

γJcyl(~ν)

(γJcyl(~ν)+R(~ν))2 ρdν3dρ. (18)

In this case, ρmax = 1/(2max{|cosΦ|, |sinΦ|}). Again, Gcyl hasno closed form but is a well-behaved function of only twoparameters that we precompute and tabulate. We computethis table only once for a given regularizer and voxel shape.Using the table, variance prediction via (17) simply requireslooking up values of Gcyl and numerically integrating themin 1D. This integration can use a coarse discretizationof Φ with reasonably accurate predicted variance. Whilethe derivation differs, (17) is the form for fast varianceprediction given in [4], [5], which also reduces to theform given in [6] for quadratic regularization and an axialgeometry.

III. RESULTS

To evaluate our fast variance prediction approach (17), wecompared it to an empirical variance map. We scanned athorax phantom with added spherical nodules 10 times witha GE Discovery CT750 HD scanner and reconstructed eachof the 10 sinograms separately to produce the empiricalvariance map of the reconstruction. Each scan was a one-rotation axial scan—since we could not ensure that eachscan began at the same starting angle, using multiple real-izations of the same helical scan to produce an empiricalvariance was not possible with our physical CT scanner.With the axial scans, we used a projection matrix A thatwas correctly aligned to the starting angle of each scan sothat each reconstruction was aligned to the same voxel grid.We used a 40mA tube current and 120 kVp tube voltage. Thescan time was 0.5 seconds.

We reconstructed each of the 10 sinograms using statisti-cal reconstruction methods. The size of the reconstructionwas 512×512×32 voxels with voxel size ∆x ×∆z = 0.9764×0.625mm. Each reconstruction used 100 iterations of anordered-subset method [7] using 64 subsets. We performed

The 13th International Meeting on Fully Three-Dimensional Image Reconstruction in Radiology and Nuclear Medicine

229

Page 3: Fast Variance Prediction for Iteratively Reconstructed CT ...web.eecs.umich.edu/~fessler/papers/files/proc/15/web/schmitt-15-fvp.pdf · In this case, ½ max Æ 1/(2max {jcos ©j,

the reconstructions using two different regularizers. In thefirst case, the regularization used a quadratic penalty andwas spatially varying strength using the method of [8] toproduce uniform spatial resolution. In the second case, thepenalty function used the Huber potential with a thresholdδ of 10 Hounsfield units but was not spatially varying inregularization strength. In both cases, the elements of theweighting matrix W were chosen to correspond to the CTscanner’s estimate of the inverse of the variance of each raygiven the scanner-specific corrections used [9].

Figures 1(a) (with spatially varying, quadratic regular-ization) and 2(a) (uniform, Huber-penalized regularization)show axial, sagittal, and coronal slices of the 3D map ofthe empirical standard deviation from our simulated recon-structions. As in the simulated empirical standard deviationmaps, the empirical maps were noisy, so we blurred theempirical variance maps with a 2D gaussian kernel witha FWHM of 5 voxels each in each direction. Figures 1(b)and 2(b) show the corresponding slices through the 3Dpredicted standard deviation map from (17). We computedthe standard deviation once per 4× 4× 1 block and usednearest-neighbor interpolation to fill in the rest. Figures1(c) and 2(c) show the absolute magnitude of the errorof our approximated standard deviation compared to theempirical results. Figures 1(d) and 2(d) show the empiricaland predicted standard deviation along a one-dimensionalcoronal profile through the center of the image, along withthe standard deviation as computed from (13) using theDSFT of hW

j as the LFR (labelled ‘DFT-based’).Table I compares the computation time required to find

the empirical variance with the computation time requiredto predict the variance for the entire volume using the DFT-based method and using our methods. We used the DFT-based method only to produce the one-dimensional profilesshown in Figure 1(d), and 2(d); since the computation timeis large, we extrapolate to predict the DFT computation timefor the entire volume for Table I.

Empirical DFT-based Proposed3.63 ·105 1.07 ·108 6.73 ·102

(10 realizations)

TABLE I: Computation time of variance prediction methods(CPU seconds)

IV. DISCUSSION

We demonstrated a method that is fairly accurate in thecase of quadratic regularization, and accurate away fromimage edges in the case of edge-preserving regularization.From the profiles in Figures 1(d) and 2(d), we can seethat the majority of the error in our method is incurredin the step of approximating (10) with (13), and not inapproximating (13) with (17). However, we can compute(17) faster than (13) by a factor of over 105.

V. FUTURE WORK

One main area of future work is using the separableapproximation (9) in fast prediction of the performance of a

linear image observer for binary classification of iterativelyreconstructed images. For example, the squared SNR ofthe ideal non-prewhitened image observer for detectingwhether a feature f has been added to a background imagex is given by:

SNR2 = (fT f)2

fTΣx f, (19)

where f is the difference in the mean reconstructionswith and without the feature present. While this is noteasily computable due to the presence of Σx and f, wecan make frequency-domain approximations specific to thenumerator and denominator and then accelerate them byusing our factorization.

Empirical DFT-based ProposedSNR2 4.0 5.0 5.3

Time (CPU sec.) 3.00 ·106 16.4 4.32 ·10−4

TABLE II: Comparison of SNR prediction methods.

Table II shows preliminary results of this observer perfor-mance prediction for a simulated 2D problem. These resultsare similar to those for variance prediction, namely, thatthere is a notable speed up using our method at the costof decreased accuracy.

REFERENCES

[1] J.-B. Thibault, K. Sauer, C. Bouman, and J. Hsieh, “A three-dimensionalstatistical approach to improved image quality for multi-slice helicalCT,” Med. Phys., vol. 34, pp. 4526–44, Nov. 2007.

[2] J. A. Fessler, “Mean and variance of implicitly defined biased estimators(such as penalized maximum likelihood): Applications to tomography,”IEEE Trans. Im. Proc., vol. 5, pp. 493–506, Mar. 1996.

[3] J. Qi and R. M. Leahy, “A theoretical study of the contrast recovery andvariance of MAP reconstructions from PET data,” IEEE Trans. Med.Imag., vol. 18, pp. 293–305, Apr. 1999.

[4] S. M. Schmitt and J. A. Fessler, “Fast variance prediction for iterativelyreconstructed CT images,” IEEE Trans. Med. Imag., 2015. In revsion.

[5] S. M. Schmitt and J. A. Fessler, “Fast variance computation for iterativereconstruction of 3D helical CT images,” in Proc. Intl. Mtg. on Fully3D Image Recon. in Rad. and Nuc. Med, pp. 162–5, 2013.

[6] S. Schmitt and J. A. Fessler, “Fast variance computation for quadrati-cally penalized iterative reconstruction of 3D axial CT images,” in Proc.IEEE Nuc. Sci. Symp. Med. Im. Conf., pp. 3287–92, 2012.

[7] H. Erdogan and J. A. Fessler, “Ordered subsets algorithms for trans-mission tomography,” Phys. Med. Biol., vol. 44, pp. 2835–51, Nov. 1999.

[8] J. A. Fessler and W. L. Rogers, “Spatial resolution properties ofpenalized-likelihood image reconstruction methods: Space-invarianttomographs,” IEEE Trans. Im. Proc., vol. 5, pp. 1346–58, Sept. 1996.

[9] Z. Chang, J.-B. Thibault, K. Sauer, and C. Bouman, “Statistical X-raycomputed tomography from photon-starved measurements,” in Proc.SPIE 9020 Computational Imaging XII, p. 90200G, 2014.

The 13th International Meeting on Fully Three-Dimensional Image Reconstruction in Radiology and Nuclear Medicine

230

Page 4: Fast Variance Prediction for Iteratively Reconstructed CT ...web.eecs.umich.edu/~fessler/papers/files/proc/15/web/schmitt-15-fvp.pdf · In this case, ½ max Æ 1/(2max {jcos ©j,

0

20

40

60

80

0

20

40

60

80

(a) Empirical (b) Predicted

0

2

4

6

8

10

−200 −100 0 100 2000

20

40

60

80

x coordinate (mm)

SD

(H

U)

empirical

predicted

DFT−based

(c) Absolute Error (d) Coronal Profile

Fig. 1: Three slices of standard deviation maps using spatially varying, quadratic regularization (Hounsfield units). Coronaland sagittal slices were stretched in the trans-axial direction by a factor of two for visualization.

0

5

10

15

20

0

5

10

15

20

(a) Empirical (b) Predicted

0

2

4

6

8

10

−200 −100 0 100 2000

5

10

15

20

x coordinate (mm)

SD

(H

U)

empirical

predicted

DFT−based

(c) Absolute Error (d) Coronal Profile

Fig. 2: Three slices of standard deviation maps using spatially uniform, Huber-penalized regularization (Hounsfield units).Coronal and sagittal slices were stretched in the trans-axial direction by a factor of two for visualization.

The 13th International Meeting on Fully Three-Dimensional Image Reconstruction in Radiology and Nuclear Medicine

231