wenxiang zuo, qiang li, xianming liu* harbin institute of ... · especially for serious depth...

5
DEPTH RESTORATION: A FAST LOW-RANK MATRIX COMPLETION VIA DUAL-GRAPH REGULARIZATION Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of Technology ABSTRACT As a real scenes sensing approach, depth information obtains the widespread applications. However, resulting from the re- striction of depth sensing technology, the depth map captured in practice usually suffers terrible noise and missing values at plenty of pixels. In this paper, we propose a fast low-rank matrix completion via dual-graph regularization for depth restoration. Specifically, the depth restoration can be trans- formed into a low-rank matrix completion problem. In order to complete the low-rank matrix and restore it to the depth map, the proposed dual-graph method containing the local and non-local graph regularizations exploits the local similar- ity of depth maps and the gradient consistency of depth-color counterparts respectively. In addition, the proposed approach achieves the high speed depth restoration due to closed-form solution. Experimental results demonstrate that the proposed method outperforms the state-of-the-art methods with re- spect to both objective and subjective quality evaluations, especially for serious depth degeneration. Index TermsDepth restoration, Low-rank matrix com- pletion, Dual-graph regularization 1. INTRODUCTION Depth maps play a rudimentary role in many computer vision and computation photography applications, such as robot vi- sion, augmented reality, and 3D reconstruction ect. Contem- porarily, affordable RGB-D cameras are popular and have en- abled all kinds of applications, such as Time of Flight (ToF) camera [1] and Microsoft Kinect [2]. However, depth maps captured by the equipment may suffer various quality degra- dations with terrible noise and missing values at plenty of pixels, resulting from the restrictions of current depth sens- ing technologies. For instance, the viewpoint differences of multiple sensors in the depth camera results in depth-missing around object boundaries. Moreover, the depth cameras of structured-light provide incorrect depth frequently due to dark surfaces light absorption [3]. To facilitate the use of depth information, tremendous ef- forts have been spent on the depth restoration, which exploits additional information for the ill-posed problem. In the liter- ature, many works follow multi-depth fusion way to trans- form multiple displaced degraded depth maps into a com- plete, clear and high-quality depth map. For example, Kim et al.[4] combined images from multiple color camera and depth sensors into 3D reconstruction. However, these meth- ods are limited to static scenes rather than the dynamic in many scenarios. Another potential depth restoration method is to fuse captured depth maps and the corresponding color images, in order to exploit the high-quality color image for the depth degeneration offset. For instance, the conventional filter method joint bilateral filter (JBF) modeled the relation- ship between depth maps and their color image counterparts [5], however they have intrinsic different characteristics. Dis- tinctly, depth discontinuities with blurring artifacts are pro- duced by color discontinuities [6]. In another direction, re- searchers attempt to recover depth in an optimizaiton-based approach with designed prior models similar to natural im- age restoration. For instance, the autoregressive (AR) model has been successfully applied to denoise in other image pro- cessing tasks [7] with color information guidance. However, this method is subject to the local smoothness of the depth only, so the large-scale edges are not well preserved. Dong et al.[8] combine AR, the total-variation (TV) and low-rank models jointly into an optimization scheme of depth restora- tion. However, additional blurred edges of the restored depth result from the conflicting priors model. In additional, above- mentioned methods contains high computational complexity. Most of approaches combine depth channels with additional color channels and obtain an iterative solution. In this paper, we propose a fast low-rank matrix comple- tion via dual-graph regularization for depth restoration. The technical contributions of this work are multi-fold: Low-rank Matrix Completion: We utilize the low- rank matrix completion for depth restoration motivated by the low-rank subspace constraint [3] and fast robust PCA on graphs [9]. Consequently, the depth restora- tion can be transformed into a low-rank signal recovery problem, i.e., the low-rank matrix completion. Dual-graph Regularization: The dual-graph consists of local and non-local graph regularizations with mu- tual complementation: The local graph regularization takes advantage of the local similarity of depth maps; The non-local graph regularization exploits the gradi- arXiv:1907.02841v1 [cs.CV] 5 Jul 2019

Upload: others

Post on 27-Sep-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of ... · especially for serious depth degeneration. IndexTerms— Depth restoration, Low-rank matrix com-pletion, Dual-graph

DEPTH RESTORATION: A FAST LOW-RANK MATRIX COMPLETION VIA DUAL-GRAPHREGULARIZATION

Wenxiang Zuo, Qiang Li, Xianming Liu*

Harbin Institute of Technology

ABSTRACT

As a real scenes sensing approach, depth information obtainsthe widespread applications. However, resulting from the re-striction of depth sensing technology, the depth map capturedin practice usually suffers terrible noise and missing valuesat plenty of pixels. In this paper, we propose a fast low-rankmatrix completion via dual-graph regularization for depthrestoration. Specifically, the depth restoration can be trans-formed into a low-rank matrix completion problem. In orderto complete the low-rank matrix and restore it to the depthmap, the proposed dual-graph method containing the localand non-local graph regularizations exploits the local similar-ity of depth maps and the gradient consistency of depth-colorcounterparts respectively. In addition, the proposed approachachieves the high speed depth restoration due to closed-formsolution. Experimental results demonstrate that the proposedmethod outperforms the state-of-the-art methods with re-spect to both objective and subjective quality evaluations,especially for serious depth degeneration.

Index Terms— Depth restoration, Low-rank matrix com-pletion, Dual-graph regularization

1. INTRODUCTION

Depth maps play a rudimentary role in many computer visionand computation photography applications, such as robot vi-sion, augmented reality, and 3D reconstruction ect. Contem-porarily, affordable RGB-D cameras are popular and have en-abled all kinds of applications, such as Time of Flight (ToF)camera [1] and Microsoft Kinect [2]. However, depth mapscaptured by the equipment may suffer various quality degra-dations with terrible noise and missing values at plenty ofpixels, resulting from the restrictions of current depth sens-ing technologies. For instance, the viewpoint differences ofmultiple sensors in the depth camera results in depth-missingaround object boundaries. Moreover, the depth cameras ofstructured-light provide incorrect depth frequently due to darksurfaces light absorption [3].

To facilitate the use of depth information, tremendous ef-forts have been spent on the depth restoration, which exploitsadditional information for the ill-posed problem. In the liter-ature, many works follow multi-depth fusion way to trans-

form multiple displaced degraded depth maps into a com-plete, clear and high-quality depth map. For example, Kimet al.[4] combined images from multiple color camera anddepth sensors into 3D reconstruction. However, these meth-ods are limited to static scenes rather than the dynamic inmany scenarios. Another potential depth restoration methodis to fuse captured depth maps and the corresponding colorimages, in order to exploit the high-quality color image forthe depth degeneration offset. For instance, the conventionalfilter method joint bilateral filter (JBF) modeled the relation-ship between depth maps and their color image counterparts[5], however they have intrinsic different characteristics. Dis-tinctly, depth discontinuities with blurring artifacts are pro-duced by color discontinuities [6]. In another direction, re-searchers attempt to recover depth in an optimizaiton-basedapproach with designed prior models similar to natural im-age restoration. For instance, the autoregressive (AR) modelhas been successfully applied to denoise in other image pro-cessing tasks [7] with color information guidance. However,this method is subject to the local smoothness of the depthonly, so the large-scale edges are not well preserved. Donget al.[8] combine AR, the total-variation (TV) and low-rankmodels jointly into an optimization scheme of depth restora-tion. However, additional blurred edges of the restored depthresult from the conflicting priors model. In additional, above-mentioned methods contains high computational complexity.Most of approaches combine depth channels with additionalcolor channels and obtain an iterative solution.

In this paper, we propose a fast low-rank matrix comple-tion via dual-graph regularization for depth restoration. Thetechnical contributions of this work are multi-fold:

• Low-rank Matrix Completion: We utilize the low-rank matrix completion for depth restoration motivatedby the low-rank subspace constraint [3] and fast robustPCA on graphs [9]. Consequently, the depth restora-tion can be transformed into a low-rank signal recoveryproblem, i.e., the low-rank matrix completion.

• Dual-graph Regularization: The dual-graph consistsof local and non-local graph regularizations with mu-tual complementation: The local graph regularizationtakes advantage of the local similarity of depth maps;The non-local graph regularization exploits the gradi-

arX

iv:1

907.

0284

1v1

[cs

.CV

] 5

Jul

201

9

Page 2: Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of ... · especially for serious depth degeneration. IndexTerms— Depth restoration, Low-rank matrix com-pletion, Dual-graph

ent consistency between depth blocks in the positionswhere the corresponding color blocks are similar. Bycombining them jointly, we achieve the best of bothworlds and better performance than state-of-the-artcolor-guided depth restoration methods.

• Higher Speed: Majority of methods address optimiza-tion of depth restoration with time-consuming iterationform. In contrast, we can achieve the high speedrestoration resulting from the closed-form solution ofthe proposed objective function.

2. METHODOLOGY

2.1. Outline of the Proposed Method

The outline of the proposed method is shown in Fig. 1.Firstly, we pre-inpaint the depth map and obtain the cor-responding low-rank matrix via block-matching; Next, weutilize the low-rank matrix to construct the dual-graph as theobjective function regular term and optimize objective func-tion to get the completed low-rank matrix; Finally, we restorethe completed low-rank matrix to the corresponding depthmap and achieve depth restoration.

2.2. Pre-inpainting and Block-matching

Depth map should be pre-inpainted before block-matchingdue to depth-missing. We use a variant of NLM for depthpre-inpainting [3] and assign an initial depth value Dpre(a)for each depth-missing pixel a,

Dpre(a) =1∑

b∈Ωaωa,b

∑b∈Ωa

ωa,bD(b), (1)

where Ωa is the neighborhood of pixel a, D(b) is the depthvalue in the pixel b, and ωa,b stands for the weight consistingof two components,

ωa,b = exp

(−‖G (a− b)‖2

σp

)

× exp

(−∑c∈ch ‖G (Ica − Icb )‖2

3σI

),

(2)

whereG is a mask matrix, a and b represent pixel locations, Icaand Icb stand for the values of pixel a and b in the color channelc with ch ∈ R,G,B. The parameters σp and σI control thesensitivity of the weights of distances. As indicated in Eq.(2),we can exploit the local similarity of depth-color pairs.

It has been confirmed that the matrix obtained by block-matching is low-rank [3], thus we introduce the block-matching to the proposed method. Block-matching is a wayof locating matching blocks within a searching window in the

Fig. 1. Outline of the proposed depth restoration method.

Fig. 2. Illustration of the gradient consistency of the depthblocks in the positions where the corresponding color blocksare similar.

given image. A metric for matching blocks is based on a costfunction Eq.(3),

d(p, q) =αd ‖PD(p)− PD(q)‖2F+ αI

∑c∈ch

‖PIc(p)− PIc(q)‖2F ,(3)

where p and q are block index, PD(p) is the block p in depthmap, and PIc(p) is the block p in color channel c. We setαd = 2, αI = 100 and the block size m × m. We utilizethe top k − 1 most similar blocks and stretch each block intoa vector. Then we combine vectors into a m2 × k matrix,which is the degradation low-rank matrix of the depth map tobe completed.

Specifically, we get three similar blocks from the adja-cent area of the color image in Fig. 2-(a), three blocks inthe same locations from the depth map in Fig. 2-(b) and thecorresponding detailed images in the first two rows in Fig. 2-(c). From left to right, the larger the depth value, the darkerthe depth block. We perform histogram equalization on thethree depth blocks and display the results in the third row ofFig. 2-(c), which proves the gradient consistency of the threedepth blocks at the corresponding positions of the three simi-lar color blocks.

2.3. Dual-graph Regularization

Depth restoration is an ill-posed inverse problem, whichneeds to design an additional prior to make inverse problemtractable. The proposed dual-graph regularization consists oflocal and non-local graph regularizations.

Page 3: Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of ... · especially for serious depth degeneration. IndexTerms— Depth restoration, Low-rank matrix com-pletion, Dual-graph

2.3.1. Local Graph Regularization

As indicated by [10], most of pixels have gradient magnitude0 and a non-negligible part magnitude 1. Consequently, wedesign the low-gradient prior by local similarity of the blocksto restore the depth.

We transform each row of the low-rank matrix got fromblock-matching to a vertex in the row graph, and constitute afull-connection graph Gr with those vertices. The weight ofthe edge connecting the vertices i and j in Gr is:

W ri,j = exp

(−‖Dr(i)−Dr(j)‖2F

σrd

)

× exp

(−∑c∈ch ‖Icr(i)− Icr(j)‖2F

3σrI

),

(4)

where Dr(i) is the i-th row of the low-rank matrix for thedepth map, Icr(i) is the i-th row of the low-rank matrix for thec channel of color image. The local graph regularization canthus be expressed as:∑

i,j

‖Xi −Xj‖2F Wri,j = tr

(XTLrX

), (5)

where Lr is a m2 × m2 row graph Laplacian matrix, Lr =Dr −W r [11], where W r is a adjacent weight matrix, Dr isdegree matrix where Dii =

∑jWij and X is the low-rank

matrix.Specifically, the form of the weighted total variation

(WTV) [12] is ETV =∑i,j |∇xi,j W |, where ∇xi,j

is the gradient of the image x in pixel (i, j), stands forelement-wise multiplication, W is the anistropic weightwhere W = exp (−

∑c (∇Ic)α/θ) and ∇Ic is the gradi-

ent of channel c. Obviously, E.q.(5) is the relaxtion of WTV.Consequently, we demonstrate the denoising effectiveness ofthe local graph regularization theoretically.

2.3.2. Non-local Graph Regularization

Since the gradient consistency of depth blocks is guaranteedin the positions where color blocks are similar, we utilize theblock gradient consistency regularization to eliminate the de-viation of pre-inpainted depth image.

Specifically, considering the gradient similarity of thedepth map, we subtract the column mean from each columnof the low-rank matrix and compensate the column meanback after the low-rank matrix completion.

In the process of constructing a non-local graph, we con-sider each column of the low-rank matrix after mean sub-traction as a vertex in the column graph and construct a full-connection graph Gc. The weight of the edge connecting ver-tex i and j in Gc is:

W ci,j = exp (−

‖Dc(i)−Dc(j)‖2Fσcd

), (6)

where Dc(i) is the i-th column of the low-rank matrix. Thenon-local graph regularization is formulated as:∑

i,j

∥∥Xci −Xc

j

∥∥2

FW ci,j = tr

(XLcXT

), (7)

where Lc is a k × k column graph Laplacian matrix with thesame expression as Lr.

Depth maps exist large spatial redundancy, namely, thereare many similar blocks with approximative gradient. Thenon-local graph regularization with block gradient consis-tency, which utilizes spatial redundancy adequately, can re-store the depth map theoretically.

2.4. Depth Restoration Optimization

We design a dual graph regulatization scheme combining lo-cal graph with non-local graph as the regularization for depthrestoration by addressing the following minimization prob-lem:

arg minX

∥∥X −X0∥∥2

F+ αrtr(XTLrX) + αctr(XLcXT ),

(8)where X0 is degeneration low-rank matrix after block-matching, αr and αc are the weight of local and non-localgraph regularizations respectively. The terms in E.q. (8) areall 2nd-order form, thus the analytic solution can be obtainedin linear time.

We assume that we have given tr(XTLrX

)and intro-

duce M as a temporary variable. The objective function canbe rewritten as:

arg minM

∥∥X0 −M∥∥2

F+ αctr(MLcMT ),

and the corresponding closed form solution is:

M = X0(I + αcLc)−1.

Similarly, considering αr tr(NTLrN

)we rewritten the

objective function again:

arg minN

‖M −N‖2F + αrtr(NTLrN),

and the corresponding closed form solution is:

N = (I + αrLr)−1M.

Therefore, the optimization low-rank matrix can be repre-sented as:

X = (I + αrLr)−1X0(I + αcL

c)−1. (9)

We choose X0 with small size in order to avoid high timecomplexity. According to the Fig. 1, We restore each valueof the completed low-rank matrix back to the correspondingpixel location and average all values in each pixel location toobtain the restored depth map.

Page 4: Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of ... · especially for serious depth degeneration. IndexTerms— Depth restoration, Low-rank matrix com-pletion, Dual-graph

Fig. 3. Visual quality comparison on RGBZ dataset: (a) clean color image, (b) degradation depth image, (c) ground truth, (d)JBF, (e) BM3D, (f) LMC, (g)NNM, (h)AR, (i)RCG, (j)LN, (k) JARTM, (l) local graph, (m) non-local graph, (n) proposed.

3. RESULTS

In this section, we provide extensive experimental results todemonstrate the superior performance of the proposed depthrestoration method. The proposed method is compared withmany methods, including 1) joint bilateral filter (JBF) [5], 2)BM3D, 3) low-rank matrix completion (LMC) [3], 3) nuclearnorm minimization (NNM) [13], 4) the color-guided auto-regressive (AR) [7], 5) the robust color-guided model (RCG)[6], 6) joint local structure and non-local low-rank regulati-zation (LN) [8], 7) joint adaptive regularization and thresh-olding on manifolds (JARTM) [14]. To illustrate the effec-tiveness of the dual-graph regularization, we also comparethe results of local and non-local graph regularization sepa-rately. The test RGB-D images in our experiments are se-lected from public RGBZ dataset [15] and artificial degrada-tion of structured-light method are the same as [3].

Necessary parameters setting as follows: In the pre-completion, we set σp = 10 and σI = 700; In the block-matching, we set block size 6 × 6, the searching windows10× 10, the search step 3 pixel, the number of similar blocksk = 12; In the dual-graph regularization, we set σrd = 2,σrI = 40 and σcd = 2.

Table 1 presents the quantitative results in terms of PSNR,SSIM and time with best results highlighted. Evidently, pro-posed method achieves the highest SSIM in state-of-the-artcases, and the best PSNR except BM3D. Moreover, the pro-posed method still possesses fast speed compared with mostof approaches. For the sake of fairness, the comparisonschemes of JBF, BM3D (filtering method) and RCG (con-structed by C++) are not considered in terms of speed.

Fig. 3 shows the visual comparison of tableA in dataset.To clearly show the details, two detailed patches are dis-played at the bottom of the corresponding images. It can

Table 1. Quantitative performance comparison in PSNR(dB), SSIM and TIME (s) for eleven methods

Method PSNR(dB) SSIM TIME(s)

JBF 34.68 0.9795 1.31BM3D 36.42 0.9855 1.05LMC 35.27 0.9388 140.42NNM 35.37 0.9447 137.32AR 31.25 0.7608 48.03

RCG 34.06 0.9727 6.43LN 34.91 0.9634 63.11

JARTM 32.06 0.7634 13.12LGraph 34.90 0.9534 17.14

NLGraph 36.25 0.9843 9.43Dual-graph 36.35 0.9866 19.56

be observed that JBF has artifact around edges. The BM3Dproduces the worst over-smoothed results, although it getsthe best PSNR. The images of LMC, NNM and JARTM arestill occupied by noise. The AR suffer jagged artifact aroundthe edges. The RCG exhibits over-sharpen edges. The LNproduces over-smoothed and noisy results. In addition, thelocal graph method works well in areas with small gradi-ent changes, while the noise is obvious in areas with greatgradient changes. In contrast, the non-local method exhibitsa good restoration effect, but the wrong features appear inthe flat area. Among the eleven compared methods, the pro-posed method achieves the best visual results with sharperand clearer edges, lower noise and more details than othermethods. Therefore, we experimentally demonstrate the ben-efit mainly from the proposed low-rank matrix completion

Page 5: Wenxiang Zuo, Qiang Li, Xianming Liu* Harbin Institute of ... · especially for serious depth degeneration. IndexTerms— Depth restoration, Low-rank matrix com-pletion, Dual-graph

via dual-graph regularization, which combines local withnon-local graph to promote mutual complementation.

4. CONCLUSIONS

In this paper, we propose a fast color-guided depth restora-tion method based on the dual-graph regularization of thelow-rank matrix completion. Initially, we pre-inpaint depth-missing pixels by NLM and obtain the low-rank matrix byblock-matching with the corresponding color-guided images;Then, we construct the dual-graph regularization based on thelow-rank matrix and complete the low-rank matrix by a fastoptimization scheme. Finally, we restore the completed low-rank matrix back to the corresponding position in the depthmap and average the values in the pixel to obtain the restoreddepth map. Experimental results exhibit the proposed methodachieves better objective and subjective quality compared tostate-of-the-art methods.

5. REFERENCES

[1] Robert Lange and Peter Seitz, “Solid-state time-of-flightrange camera,” IEEE Journal of quantum electronics,vol. 37, no. 3, pp. 390–397, 2001.

[2] Daniel Herrera, Juho Kannala, and Janne Heikkila,“Joint depth and color camera calibration with distortioncorrection,” IEEE Transactions on Pattern Analysis andMachine Intelligence, vol. 34, no. 10, pp. 2058–2064,2012.

[3] Si Lu, Xiaofeng Ren, and Feng Liu, “Depth enhance-ment via low-rank matrix completion,” in Proceedingsof the IEEE conference on computer vision and patternrecognition, 2014, pp. 3390–3397.

[4] Young Min Kim, Christian Theobalt, James Diebel,Jana Kosecka, Branislav Miscusik, and Sebastian Thrun,“Multi-view image and tof sensor fusion for dense 3d re-construction,” in 2009 IEEE 12th International Confer-ence on Computer Vision Workshops, ICCV Workshops.IEEE, 2009, pp. 1542–1549.

[5] Johannes Kopf, Michael F Cohen, Dani Lischinski, andMatt Uyttendaele, “Joint bilateral upsampling,” in ACMTransactions on Graphics (ToG). ACM, 2007, vol. 26,p. 96.

[6] Wei Liu, Xiaogang Chen, Jie Yang, and Qiang Wu, “Ro-bust color guided depth map restoration,” IEEE Trans-actions on Image Processing, vol. 26, no. 1, pp. 315–327, 2016.

[7] Jingyu Yang, Xinchen Ye, Kun Li, Chunping Hou, andYao Wang, “Color-guided depth recovery from rgb-ddata using an adaptive autoregressive model,” IEEE

transactions on image processing, vol. 23, no. 8, pp.3443–3458, 2014.

[8] Weisheng Dong, Guangming Shi, Xin Li, Kefan Peng,Jinjian Wu, and Zhenhua Guo, “Color-guided depth re-covery via joint local structural and nonlocal low-rankregularization,” IEEE Transactions on Multimedia, vol.19, no. 2, pp. 293–301, 2016.

[9] Nauman Shahid, Nathanael Perraudin, Vassilis Kalofo-lias, Gilles Puy, and Pierre Vandergheynst, “Fast robustpca on graphs,” IEEE Journal of Selected Topics in Sig-nal Processing, vol. 10, no. 4, pp. 740–756, 2016.

[10] Hongyang Xue, Shengming Zhang, and Deng Cai,“Depth image inpainting: Improving low rank matrixcompletion with low gradient regularization,” IEEETransactions on Image Processing, vol. 26, no. 9, pp.4311–4320, 2017.

[11] Mikhail Belkin and Partha Niyogi, “Laplacian eigen-maps and spectral techniques for embedding and clus-tering,” in Advances in neural information processingsystems, 2002, pp. 585–591.

[12] Markus Unger, Thomas Mauthner, Thomas Pock, andHorst Bischof, “Tracking as segmentation of spatial-temporal volumes by anisotropic weighted tv,” in In-ternational Workshop on Energy Minimization Methodsin Computer Vision and Pattern Recognition. Springer,2009, pp. 193–206.

[13] Lei Zhang and Wangmeng Zuo, “Image restoration:From sparse and low-rank priors to deep priors [lecturenotes],” IEEE Signal Processing Magazine, vol. 34, no.5, pp. 172–179, 2017.

[14] Xianming Liu, Deming Zhai, Rong Chen, Xiangyang Ji,Debin Zhao, and Wen Gao, “Depth restoration from rgb-d data via joint adaptive regularization and thresholdingon manifolds,” IEEE Transactions on Image Processing,vol. 28, no. 3, pp. 1068–1079, 2018.

[15] Christian Richardt, Carsten Stoll, Neil A Dodgson,Hans-Peter Seidel, and Christian Theobalt, “Coherentspatiotemporal filtering, upsampling and rendering ofrgbz videos,” in Computer Graphics Forum. Wiley On-line Library, 2012, vol. 31, pp. 247–256.