ieee transactions on visualization and ...orca.cf.ac.uk/47333/1/rosin registration of 3d...

20
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 1 Registration of 3D Point Clouds and Meshes: A Survey From Rigid to Non-Rigid Gary K.L. Tam 1 , Zhi-Quan Cheng 2 , Yu-Kun Lai 1 , Frank C. Langbein 1 , Yonghuai Liu 3 , David Marshall 1 , Ralph R. Martin 1 , Xian-Fang Sun 1 and Paul L. Rosin 1 Abstract—3D surface registration transforms multiple 3D datasets into the same coordinate system so as to align overlapping components of these sets. Recent surveys have covered different aspects of either rigid or non-rigid registration, but seldom discuss them as a whole. Our study serves two purposes: (i) to give a comprehensive survey of both types of registration, focusing on 3D point clouds and meshes, and (ii) to provide a better understanding of registration from the perspective of data fitting. Registration is closely related to data fitting in that it comprises three core interwoven components: model selection, correspondences & constraints and optimization. Study of these components (i) provides a basis for comparison of the novelties of different techniques, (ii) reveals the similarity of rigid and non-rigid registration in terms of problem representations, and (iii) shows how over-fitting arises in non-rigid registration and the reasons for increasing interest in intrinsic techniques. We further summarise some practical issues of registration which include initializations and evaluations, and discuss some of our own observations, insights and foreseeable research trends. Index Terms—Deformation modeling, digital geometry processing, surface registration, point clouds, meshes, 3D scanning 1 I NTRODUCTION S URFACE registration transforms multiple 3D datasets into the same coordinate system so as to align overlapping components of these sets. The datasets comprise measured points representing sur- faces of 3D objects or scenes. Due to limitations of 3D scanning technology, typically multiple datasets must be captured from different viewpoints, each is asso- ciated with a different coordinate system. To allow them to be recombined to reconstruct the surfaces that represent the original objects or scenes [1], these data must be registered. Surface registration is thus an es- sential component of the 3D acquisition pipeline and is fundamental to computer vision, computer graphics and reverse engineering. Registering templates to a set of deforming surfaces provides cross-parametrization, and facilitates texture and skeleton transfer, shape in- terpolation, and statistical shape analysis. Numerous applications also benefit from the continual research on correspondences and registration (e.g. features and saliency), including symmetry detection and articu- lated object matching, finding object correspondences, fractured object reassembly, sub-part identification, and skeleton and pose construction. Surface registration may consider rigid or non-rigid shapes. The former assumes that two (or more) sur- faces are related by a rigid transformation. The lat- ter allows deformation (e.g. morphing, articulation) Gary K.L. Tam, E-mail: [email protected] 1 School of Computer Science and Informatics, Cardiff University, UK 2 Computer School, National University of Defense Technology, China 3 Department of Computer Science, Aberystwyth University, UK between them. Rigid registration is a challenging problem. Firstly, the data itself poses many difficul- ties, which may include noise, outliers, and limited amounts of overlap. Noise may take the form of perturbations of points, or unwanted points close to a 3D surface. Outliers are unwanted points far from the surface, which can seriously affect results if not dis- carded. Limited overlap arises due to different parts of the object being in view in each scan; typically the number of scans is kept low for efficiency, with few points in common between successive scans. Further problems may arise due to self-occlusion when the object is scanned from certain viewing angles. While such problems can be mitigated by careful scanning, they are hard to avoid completely. Secondly, varia- tions in initial positions and orientations (and what is known about them), as well as resolutions of data, can also affect algorithm performance, and must be taken into account when comparing rates of conver- gence, methods of correspondence determination, and approaches to optimization. Non-rigid registration is even more difficult, as it not only faces the above challenges but also needs to account for deformation, so the solution space is much larger. Unlike the rigid case, where a few corre- spondences are sufficient to define one candidate rigid transformation for hypothesis testing, both deforma- tion and alignment in the non-rigid case, without strong prior assumptions, often require a lot more reliable correspondences to define. Establishing mean- ingful and natural correspondences, however, is a challenging problem in its own right. Choice of appro- priate representation for the deformation, and suitable

Upload: others

Post on 23-Jan-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 1

Registration of 3D Point Clouds and Meshes:A Survey From Rigid to Non-Rigid

Gary K.L. Tam1, Zhi-Quan Cheng2, Yu-Kun Lai1, Frank C. Langbein1, Yonghuai Liu3, David Marshall1,Ralph R. Martin1, Xian-Fang Sun1 and Paul L. Rosin1

Abstract—3D surface registration transforms multiple 3D datasets into the same coordinate system so as to align overlappingcomponents of these sets. Recent surveys have covered different aspects of either rigid or non-rigid registration, but seldomdiscuss them as a whole. Our study serves two purposes: (i) to give a comprehensive survey of both types of registration,focusing on 3D point clouds and meshes, and (ii) to provide a better understanding of registration from the perspective of datafitting. Registration is closely related to data fitting in that it comprises three core interwoven components: model selection,correspondences & constraints and optimization. Study of these components (i) provides a basis for comparison of the noveltiesof different techniques, (ii) reveals the similarity of rigid and non-rigid registration in terms of problem representations, and (iii)shows how over-fitting arises in non-rigid registration and the reasons for increasing interest in intrinsic techniques. We furthersummarise some practical issues of registration which include initializations and evaluations, and discuss some of our ownobservations, insights and foreseeable research trends.

Index Terms—Deformation modeling, digital geometry processing, surface registration, point clouds, meshes, 3D scanning

F

1 INTRODUCTION

SURFACE registration transforms multiple 3Ddatasets into the same coordinate system so as

to align overlapping components of these sets. Thedatasets comprise measured points representing sur-faces of 3D objects or scenes. Due to limitations of 3Dscanning technology, typically multiple datasets mustbe captured from different viewpoints, each is asso-ciated with a different coordinate system. To allowthem to be recombined to reconstruct the surfaces thatrepresent the original objects or scenes [1], these datamust be registered. Surface registration is thus an es-sential component of the 3D acquisition pipeline andis fundamental to computer vision, computer graphicsand reverse engineering. Registering templates to a setof deforming surfaces provides cross-parametrization,and facilitates texture and skeleton transfer, shape in-terpolation, and statistical shape analysis. Numerousapplications also benefit from the continual researchon correspondences and registration (e.g. features andsaliency), including symmetry detection and articu-lated object matching, finding object correspondences,fractured object reassembly, sub-part identification,and skeleton and pose construction.

Surface registration may consider rigid or non-rigidshapes. The former assumes that two (or more) sur-faces are related by a rigid transformation. The lat-ter allows deformation (e.g. morphing, articulation)

• Gary K.L. Tam, E-mail: [email protected] School of Computer Science and Informatics, Cardiff University, UK2 Computer School, National University of Defense Technology, China3 Department of Computer Science, Aberystwyth University, UK

between them. Rigid registration is a challengingproblem. Firstly, the data itself poses many difficul-ties, which may include noise, outliers, and limitedamounts of overlap. Noise may take the form ofperturbations of points, or unwanted points close to a3D surface. Outliers are unwanted points far from thesurface, which can seriously affect results if not dis-carded. Limited overlap arises due to different partsof the object being in view in each scan; typically thenumber of scans is kept low for efficiency, with fewpoints in common between successive scans. Furtherproblems may arise due to self-occlusion when theobject is scanned from certain viewing angles. Whilesuch problems can be mitigated by careful scanning,they are hard to avoid completely. Secondly, varia-tions in initial positions and orientations (and whatis known about them), as well as resolutions of data,can also affect algorithm performance, and must betaken into account when comparing rates of conver-gence, methods of correspondence determination, andapproaches to optimization.

Non-rigid registration is even more difficult, as itnot only faces the above challenges but also needsto account for deformation, so the solution space ismuch larger. Unlike the rigid case, where a few corre-spondences are sufficient to define one candidate rigidtransformation for hypothesis testing, both deforma-tion and alignment in the non-rigid case, withoutstrong prior assumptions, often require a lot morereliable correspondences to define. Establishing mean-ingful and natural correspondences, however, is achallenging problem in its own right. Choice of appro-priate representation for the deformation, and suitable

Page 2: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 2

tools for evaluation of non-rigid registration methodsare two difficult problems. Recent success in rigidsurface registration, coupled with the developmentof scanning devices that can capture time varyingsurfaces, have brought non-rigid surface registrationinto focus.

Over the past two decades, many effective rigidregistration techniques have been developed. Manyof those aforementioned challenges are being ad-dressed [2]. These include [3] which can handle upto 35% noise, and part-in-whole problem, and [4]which can handle up to 40% outliers, and down to40% overlap as demonstrated in their experiments.Comparatively, non-rigid registration techniques arestill in their infancy. Yet many useful techniques havebeen developed. These include techniques that targetarticulation, like [5] that can handle large gaps in4D sequences, reduce high dimensionality of defor-mations through automatic construction of consensusskeletons and [6] that can register 4D surfaces andproduce an urshape template; [7] that supports facialcapture and animation of avatar heads in real-time;and [8], [9] that can handle near- to approximate-isometric deformation. All these have changed thelandscape of digital geometry processing, migratingthe focuses to dynamic scenes and motions.

The goal of this survey is to overview significantwork for new researchers and potential users of reg-istration, to analyse the similarities and differences ofthese methods, and to provide up-to-date referencesto this field. While other surveys exist, our specificcontributions include the following:Rigid and Non-Rigid Registration: Our survey cov-ers registrations over the past two decades. We discuss1) the research development from rigid to non-rigid,2) the recent advance of intrinsic techniques, and 3)some reasons of such research progress.Data-fitting and Registration Components: The con-nection of registration to data fitting and the implica-tions may not be at all obvious to new researchers.We first point out the connection, leading to thediscussion of three core registration components inthe light of the overfitting problem: model selection,correspondences & constraints and optimization.Constraints: We carefully study the constraints usedin various registration techniques. These properties ortechniques limit the search space or improve registra-tion results.Practicalities and Future work: We discuss variouspractical issues for potential users such as the choiceof functional models, initialization, weight settings,evaluations and finally, consider future directions andtrends guided by our analysis of the core componentsand data fitting.

1.1 Comparsion to other surveysAmong all the existing surveys (Table 1), [11]–[15]focus on rigid registration only. Our survey tries

TABLE 1Existing Surveys of Surface Registration

Ref Rigid Non-Rigid Focus

[10] X X surface registration for medical imaging[11] X systematic breakdown of ICP and its variants[12] X registration and fusion of range images[13] X comparison of several Improved ICPs[14] X comparison of quadratic approximants and ICP[15] X coarse vs fine, pairwise vs multi-view alignments[16] X X techniques for shape correspondence

to connect rigid and non-rigid registration, compareand contrast the novel techniques used, and discussthe underlying background to help readers appre-ciate the developments in the field. [10], [16] coverboth rigid and non-rigid registration. [10] focused onapplications in medical imaging. Their classificationis similar to ours, but they discussed optimizationand correspondence together. We particularly sepa-rate constraints from optimization to enable a cleareranalysis of recent work and the research focuses ofthe past decade. [16] discussed registration in themore general context of shape correspondence. Oursurvey, on the other hand, discusses registration fromthe viewpoint of data fitting. This allows us to focuson components that are more specific to registrationtechniques. In particular, we highlight the noveltiesin different components and their uses. We furtherdiscuss the problem of over-fitting and provides up-to-date references beyond those in existing surveys.

1.2 Scope

This study focuses on registration techniques for3D point clouds, meshes (representing surfaces) andsparse 3D point data. To limit our scope, variousrelated techniques that require additional information(e.g. silhouettes extracted from images or video se-quences, or assumptions concerning viewpoints forthe input data) are excluded, as are those whichrequire a specific representation (e.g. range images,2D arrays of depth points). However, we do in-clude techniques originally designed for range imageswhere they can be applied to point clouds withoutmodification or extra assumptions.

The literature contains many techniques for theclosely related problem of 3D volume image registration,particularly for medical data. 2D image registrationis also widely studied. Many kinds of features orlandmarks have been proposed, often tailored to thenature of the images, and have inspired the devel-opment of surface registration methods (e.g. basedon SIFT [17]). Recently, non-parametric image regis-tration techniques have been developed for medicalimages. These take into account physical properties(e.g. elasticity or viscosity) of tissues, and may alsoprove useful in surface registration. Readers are re-ferred to [18]–[20] for surveys of such techniques.

Page 3: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 3

1.3 OrganizationOur survey is organized as follows. Section 2 estab-lishes the connection between data fitting and regis-tration, and its implications, leading to the discussionof the three key components of registration: modelselection, correspondences & constraints and optimization.Section 3 discusses various transformation models.Section 4 classifies various constraints for rigid andnon-rigid registration. Section 5 discusses how regis-tration and correspondence determination can be castin terms of optimization. Section 6 discusses evalua-tions of registration. Finally, Section 7 concludes withdiscussions and future possibilities.

1.4 NotationThe following notation is used in the paper. P =p1 . . .pN1

and Q = q1 . . .qN2 are two point

sets representing overlapping surface portions of ascanned 3D object. Σ ⊂ p1 . . .pN1×q1 . . .qN2 is amatching relation that denotes the set of all correspon-dences. We use aij = pi,qj to denote a correspond-ing pair of points. Sometimes, for illustration purpose,we simplify the notation to ai = pi,qi to denote aone-to-one correspondence, and N is the number ofpoints. pi = χ(a,pi) describes a functional model χwith parameter vector a that maps a point pi to pi.T = (R, t) is a rigid transformation comprising a rigidrotation and translation, while A is an affine trans-formation. u is a deformation vector. E(P,Q,Σ, χ,a)is an energy functional to be optimized. p(·) is aprobability distribution of a random variable. τ istime. DoF means the degree of freedom. χ can befurther classified into extrinsic f and intrinsic g−1hgtransformations, which will be defined and discussedin the next section.

2 REGISTRATION AND DATA FITTING

Registration has much in common with data fitting.Here, we first provide a general overview of both rigidand non-rigid registration techniques as a whole, interms of a generalized version of their objective func-tions. Then we discuss their similarities and differ-ences to data fitting, which leads to our classificationscheme and the structure of this survey.Registration is very often cast as an optimizationproblem with the objective function of the form:

E = Edata + Ereg (1)

The data term Edata measures the alignment error, andis related to the transformation function χ:

Edata =∑i

‖qi − χ(a,pi)‖2 s.t. pi,qi ∈ Σ (2)

where pi belongs to one surface and qi to another.Here, we assume χ transforms pi to pi ∈ P such thatthe residual sum of squares ‖qi − pi‖2 is minimal;pi,qi ∈ Σ is the set of all correspondences. The

P Q Extrinsic domain S

P ′ Q′ Intrinsic domain Ψ

f ∈ F

g ∈ G

h ∈ H

g ∈ G

Fig. 1. Registration can be classified into extrinsic (χ =f ) and intrinsic (χ = g−1 h g) technique based ontheir transformation f, g and mapping function h, seeSection 2.

goal of registration is to obtain the best transfor-mation parameters a. Ereg is a regularization termthat provides additional information to constrain anarbitrary transformation so that the solution a is areasonable one. Registration can be further classifiedinto extrinsic and intrinsic techniques (Figure 1).

Extrinsic techniques consider a surface lying inEuclidean space. They make use of external proper-ties (e.g. transformations: rotation(s) and/or transla-tion(s)) that can be applied onto the surface. Often,these techniques look for a function χ = f ∈ F :S → S such that surface P ∈ S can be transformed toalign and correspond with surface Q ∈ S, where S isthe space of all possible surfaces. We refer to f andS as the extrinsic transformation and domain in ourcontext.

Intrinsic techniques, in contrast, consider a surfaceas free standing. They make use of properties likesurface distances and angles which are internal withina surface. Often, these techniques transform/embedthe two surfaces P and Q into their canonical formsP ′ ∈ Ψ and Q′ ∈ Ψ, where Ψ denotes the space of allpossible canonical forms, with a function g ∈ G : S →Ψ, and look for a mapping function h ∈ H : Ψ → Ψso that the surface point pi = g−1 h g(pi) matchesclosely to qi. We refer to g and Ψ as the intrinsic trans-formation and domain in our context. The objectivefunction may be seen as letting χ = g−1 h g:

Edata =∑i

‖qi−g−1 hg(a,pi)‖2 s.t. pi,qi ∈ Σ (3)

Some techniques may by-pass the embedding processand optimize an objective function which is formu-lated using intrinsic measures for correspondence es-tablishment. The intuition is similar.Data Fitting: Given a set of N 3D data pointsp1, ...,pN , where pi = (xi, yi, zi), various data fittingproblems may be posed. The simplest is to find thevector of parameters a of a functional model z =f(a, x, y) that best describes the data. To measure thefitting error, the usual formulation considers the sumof squared residuals E:

E =

N∑i=1

‖zi − χ(a, xi, yi)‖2 + Ereg. (4)

In general, data fitting involves both selection ofχ, and optimization to find the best a. Ereg is a

Page 4: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 4

regularization term to avoid overfitting.

Connections and Implications: Comparison ofEqns. 1–4 allows us to further analyze registration asthree core components:Model selection (Section 3): The model χ in both casesis often guessed or provided by the users with a prioriknowledge of the data. In data-fitting, specific modelsare given depending on the data, e.g., polynomialequations or splines. In registration, χ is usually basedon an assumed transformation model, and is highlydependent on the data and applications. We willdiscuss the choice of these model χ. These cover rigidtransformation (Section 3.1), and both extrinsic (Sec-tions 3.2-3.4) and intrinsic non-rigid transformations(Section 3.5).

One of the major concerns in data-fitting is toavoid overfitting, when χ fits noise rather than thedata. The same applies to registration where the mod-els f of extrinsic techniques usually have high de-grees of freedom. This potentially leads to over-fittingproblems (in which there are too many parametersto describe the transformation). In these techniques,regularization becomes highly essential, not only toobtain a smooth surface, but also to constrain arbitrarytransformation to obtain a reasonable solution. Inintrinsic techniques, both P and Q are transformedinto a canonical form and the mapping function h isusually low-dimensional. The intrinsic transformationg provides the most important constraint to avoidarbitrary transformation. This is one reason why thesetechniques have become the recent research focus.Correspondences and constraints (Section 4): In datafitting, the correspondences are incorporated withinthe data point (xi, yi, zi), but in registration, corre-spondence information pi,qi ∈ Σ must be de-termined. We will discuss different constraints andtechniques to restrict the solution space (a,Σ). Theymay be induced by the functional model χ (Sec-tions 4.1.1, 4.2.3) or related to the regularizations(Sections 4.1.4, 4.2.6), correspondences Σ (like featuresand saliency, Sections 4.1.2, 4.1.3, 4.2.4, 4.2.5), assump-tions on data (like template or space-time surface,Section 4.2.2, 4.2.7) and optimization (like search con-straint) (Sections 4.1.5, 4.2.8).Optimization (Section 5): Both registration anddata fitting are formulated as optimization problemswhich can be solved in a similar fashion. Wewill discuss techniques to find the best parametera and possibly Σ. When registration is cast asa continuous optimization problem, (a,Σ) aredetermined iteratively and simultaneously. Somemethods focus on correspondence determinationonly, and Σ is estimated iteratively. The implicationhere is that many techniques which were originallydesigned for data fitting (like information-basedcriteria) may be applicable in registration.

TABLE 2Transformation Models

AssumedTransformations DoF Model Formulation Examples

RigidTransformation 6 Euclidean

Transformation all rigid cases

Rigid Alignmentwith Non-RigidCorrection

7-18

Displacement Field +Space Time Surface [21]

Rigid ICP+ Thin Plate Splines [22]

Piecewise RigidTransformation N Predefined Skeleton [23]

Bones and Joints [5], [24]–[26]

GeneralDeformation &Fine Details

3N-12N

Displacement Fields [6], [27]Local RigidTransformation [2], [28]

Local AffineTransformation [29]–[35]

(Nearly) IsometricDeformation N Intrinsic

Transformation [8], [36]–[41]

Our classification is based on these three compo-nents. The focus on constraints, in particular, allowsus to compare the novelty of various approaches.Since each component interacts with the others, effortshave been made to avoid repeating information, butwe do so where necessary for clarity. While not everyregistration technique fits well into this frameworkderived from data fitting, these components and con-straints do exist in most surveyed methods.

3 FUNCTIONAL MODELSWe classify the models χ used in registration fromthe viewpoint of the assumed transformations. Table 2provides a quick look-up for users who need toregister surfaces undergoing a specific type of trans-formations and their DoFs. Four main types are com-mon: rigid transformation, piecewise rigid deformation,general deformation and (nearly) isometric deformation.Rigid alignment with non-rigid correction is a transitionalcase between the rigid and non-rigid models.

3.1 Rigid Transformation (6 DoF)This is the most important assumption of rigid reg-istration: the surfaces are assumed to be aligned bya Euclidean transformation involving a rotation andtranslation (R, t): f(pi) = Rpi + t. This transforma-tion (i) is global—the same for every point pi, (ii)can be uniquely defined by three non-collinear pairsof correspondences, and (iii) has low dimension (6DoF). Using the first two properties, if both point setscontain at least one pair of triplets that match exactlyand provide the correct transformation, it takes O(N6)time to test all possible transformations for sets of Npoints. By considering all three properties, registrationcan be cast as a voting problem [17].

3.2 Rigid Alignment with Non-Rigid CorrectionDisplacement Field of Space-Time Surfaces (7 DoF):To allow non-rigid correction due to subtle move-ments of objects during acquisition, [21] proposes a

Page 5: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 5

space-time displacement model of the form f(pji ) =(Rjp

ji + tj , τ

j). The transformation consists of theunknown (Rj , tj) that aligns surface in consecutiveframe j in the space domain, and a translation τ j

along the time axis. This technique requires densesampling in both the spatial and temporal domain.Rigid ICP + Thin Plate Spline (18 DoF): Calibrationerrors or device non-linearity can lead to imprecisealignment of rigid scans. [22] uses a locally weightedICP to establish reliable correspondences across mul-tiple scans. Then they optimize the locations of thesecorrespondences across multiple scans in a globalmanner. These correspondences define the anchorssuch that those misalignments can be corrected bythin-plate spline (TPS) with 3n + 12 DoF, where nis the number of anchor correspondences, resultingin a sharper final model. Since n is fixed once therigid transformation is computed, it requires overall18 DoF.

3.3 Piecewise Rigid TransformationSome methods, e.g. for human modelling, assumemainly articulated changes, where bones undergolarge rigid transformations and local non-rigid surfacedeformation (bending or stretching) occurs near joints.Predefined Skeleton: [23] uses a skeleton as a basisfor modeling deformation. Each joint is given someDoFs (e.g. joint angles) and is related to other jointsby rigid transformations (e.g. a translation).Bones and Joints: More recent techniques do not usean explicit skeleton. [24] uses predefined bone infor-mation to track bone transformations. [25] searchesin a finite set of plausible clustered rigid transforms.The small deformations of joints are obtained byblending the transformations of two adjacent bones inthe overlap regions [25]: f(pi) =

∑j wj(pi)(Rjpi+tj)

where wj(pi) is the weight for bone j. Recently, [5]constructs a consensus skeleton automatically from4D sequence and [26] recovers a dynamic graph—aposeable skeleton with bones, ball and hinge joints.

The advantage of skeletal representation is the lowDoF ( N ), which depends on the available bones orjoints. It also allow new poses to be created.

3.4 General Deformation and Fine Surface DetailsFurther techniques consider more generic deforma-tions which include articulation, non-rigid deforma-tion and fine surface details.Local Displacement Field (3N DoF): [27] allowspoints to displace freely, and encourages the dis-placement vectors to point in similar directions byregularizing high-frequency components of the dis-placement field. [6] uses meshless finite element defor-mation: f(pi, τ) = pi +

∑j ψj(pi)uj,τ which depends

on certain nodes j scattered near (but not on) thesurface, where uj,τ are displacement vectors; ψj(pi)is the influence of node j on pi. This technique

expects as input a sequence of frames, allows adap-tive sampling, reduces DoF to 3n, where n is thenumber of nodes. [35] uses thin plate spline (TPS)to define a transformation model (DoF 12 + 3N):f(pi) = pi ·a+w ·ω(pi), where a represents the affinetransformation parameters and w is the parameter forthe TPS smoothness kernel ω(pi).Local Rigid Transformation (6N DoF): [2] defines amodel: f(pi) = Ripi + ti using local rigid transfor-mations (Ri, ti) at each point. Nearby points with thesame (Ri, ti) are clustered into patches and are up-dated every few optimization steps. It is adaptive tothe deformation; the DoF depends on the number ofclusters. [28] uses local rigid transformations, an iter-ative as-rigid-as-possible deformation technique [42]and space-time surface constraints to align surfacesfrom densely temporally sampled sequences.Local Affine Transformation (12N DoF): Local affinetransformations are frequently used in non-rigid reg-istration [29], [31]. Higher DoFs allow more free-dom to capture fine surface detail changes (e.g. bodyfat [43], wrinkles [32]). [34] defines a model: f(pi) =Aipi and uses stiffness to ensure adjacent transforma-tions are similar. [33] uses differential coordinates andcan be considered as a local affine transformation witha smoothness constraint. [32] uses an embedded graphnode model: f(pi) =

∑xjw(pi,xj)[Aj(pi,xj)+xj+bj ]

where bj is the translation vector of certain nodes xj ,with weighting function w. These xj lie on the surface.

3.5 (Nearly) Isometric DeformationIntrinsic Transformation: Recent techniques empha-size intrinsic models, which assume (nearly-) isometric(distance-preserving) deformation. Intrinsic geometryconcerns properties like surface distance and angle.Instead of studying pi = f(pi), they study pi =g−1hg(pi) so that pi matches closely to qi (Figure 1).Here g is the assumed intrinsic transformation and his often a low-dimensional ( N ) mapping functionthat establishes the point correspondences (Section 2).

Notable and pioneering works include: [36] seeksa low dimensional embedding that preserves all pair-wise geodesic distances, [37] uses generalized multidimen-sional scaling to embed one mesh in another for partialmatching, [38] uses diffusion distance and Gromov-Hausdorff distance to handle topological noise, and[40] shows that a single correspondence can establishcorrespondences for all points using the heat kernel.[39] pioneers the use of the Mobius transformation(isometry is a subgroup of the Mobius group). Thishas stimulated much subsequent work including [8],[41].

3.6 Observations and DiscussionExtrinsic Models: A general observation made by allextrinsic models of non-rigid techniques is that themore high frequency information on the surface, the

Page 6: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 6

more DoF required. In the extreme case, non-rigidregistration methods that handle large deformationinvolving articulation or fine detail preservation, use12 DoF (an affine transformation) per point or node.

It is well-known that using too many DoF can over-fit noise. In rigid registration, the overfitting problemis generally ignored because there are 6 DoF only.When non-rigid registrations assume high DoFs, over-fitting can become a problem (though not frequentlydiscussed). To alleviate it, [2], [6], [32] reduce theDoF explicitly by computing local transformationswith respect to clusters or nodes, and most non-rigidregistration methods use regularization to further con-strain the solutions.

Although many of these non-rigid deformationmodels are different in terms of formulation, they allhave one common characteristic. Whether they useadjacency similarity, gradient or Laplacian fields [28](first-order), thin-plate splines [35] (second-order),or various regularizers (Section 4.2.6), they all tryto preserve some kind of smoothness. In term ofdata fitting, this ensures that the registered surfacerepresents the surface details but not noise.

Intrinsic Models: Intrinsic models assume (nearly)isometric deformation. Fine surface details are usuallyignored when establishing correspondences, due totheir intrinsic nature. Many methods of this kindrequire mesh data with consistent topology, and maynot work on point cloud data. Large topologicalerrors and holes affect these techniques.

This section has considered many transformations.The appropriate model to use depends on the dataitself. Users should determine the kinds of deforma-tions and other properties of their data to select asuitable model. This point may easily be overlooked.

We exclude the discussion of similarity transfor-mation, which concerns an additional scale factorcompared to a rigid transformation. The major reasonis that, in registration, most of the 3D scanned dataare assumed to be multiple views of the same objectwhich share the same scale. When registration isrequired for establishing correspondences across a setof 3D models from heterogeneous data source, scalebecomes an important factor. One example includesthe registration of sets of faces from different per-sons [44]. Procrustes analysis [45] is one such impor-tant technique.

4 CONSTRAINTS FOR RIGID AND NON-RIGID REGISTRATION

We next classify various constraints, and roughly or-dered them in decreasing strength of priors: see Ta-bles 3 (rigid) and 4 (non-rigid), and subsequent sub-sections. Constraints are the properties that limit thesearch space for transformation and correspondences.

TABLE 3Rigid Registration Constraints. T: Transformation;

SC / DC: Sparse / Dense Correspondence.

Constraints Classifications Type Examples

Transformation-InducedConstraints

Distance and AnglePreserving Properties SC [46], [47]

Lower-Bounding DC [48]Affine Ratio T [4]Principal Axes T [49]Closest Point Criterion DC [14], [50], [51]

Features

Spin Images SC [52]Curvature SC [53], [54]Moments & Spherical SC [54]Integral Descriptors SC [46], [48], [55]DCT-DFT Coefficients SC [17]Cluster Signature SC [46]

Saliency

Differential Properties SC [56]Rareness SC [48]GeometryScale-Space SC [17]

Feature Scale-Space SC [48]Size & Curvature SC [53]Multi-Scale Slippage SC [57]MSER SC [58]

RegularizationEqualizingCorrespondences DC [59], [60]

MaximizingCorrespondences DC [13], [61]

Search ConstraintLocalization DC [62]HierarchicalApproaches DC [56], [62]

In rigid registration, correspondences assist in furtherpruning the transformation search space, whilst inthe non-rigid case, establishing correspondences is theessential step that drives alignment.

4.1 Constraints for Rigid Registration

We consider transformation-induced constraints, fea-tures, saliency, search constraints and regularization.

4.1.1 Transformation-Induced ConstraintsTransformation-induced constraints are properties re-lated to rigid transformation that can be used to prunethe transformation or correspondence search space.Distance and Angle Preserving Properties: Distanceand angle preserving properties of solids can beused for pruning unreliable sparse correspondencesestablished by other constraints. [46] assumes that theEuclidean distance between the centers of a pair offeatures on a surface, and the angles between theirprincipal directions and their connecting line, are pre-served under a rigid transformation. RANSAC-basedDARCES [47] uses distance-preserving constraint toenumerate and test three pairs of correspondences ina sphere to limit the transformation search space.Lower-Bounding: [48] uses (Euclidean) distance-based lower-bounding and the branch-and-bound to re-duce the correspondence search space (Section 5.2.1).Affine Ratio: Given two surfaces P and Q in arbitraryinitial poses, [4] randomly selects a 4-point tuple from

Page 7: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 7

P and another 4-point tuple from Q, and comparesthe affine ratios. If they form a correspondence, theiraffine ratio will match. They further speed up theenumerate-and-test process using a fast range querysearch of points that satisfy the affine ratios.

Principal Axes: Principal axes, computed throughprincipal component analysis (PCA), are three orthogo-nal directions (eigenvectors correspond to the largesteigenvalues) where the greatest variance of the data-points lie. A good review, including their use forcoarse registration, is given in [15]. A recent advance[49] combines PCA with the least median of squares,providing resistance to noise and outliers. PCA meth-ods are most useful for whole object registration.

Closest Point Criterion (CPC): CPC constrains thepotential dense correspondences. Assuming a rigidtransformation, it chooses the closest point qi ∈ Qas the match for pi. It is used in the iterative closestpoint (standard ICP) algorithm [50], which minimizes:

E(R, t) = minR,t

∑i

‖qi − (Rpi + t)‖2 (5)

This iterative algorithm must be carefully initialized.At each step a new set of parameters (CPC corre-spondences, rotation R and translation t) are com-puted and updated. [63] recently provided a formaltreatment of CPC showing that it guarantees the es-tablished point correspondences cannot be too wrongat each step: it is impossible that some correspon-dences are of high quality, while their neighbors areof significantly lower quality, because the relationshipof correspondences are implicitly captured by therigid transformation. Many modifications to ICP havebeen proposed, to improve speed, range and rate ofconvergence, and robustness—see the earlier surveyin [15]. Recent improvements explicitly model inliersand outliers [64], and confidence in correspondencesusing graduated assignment [59], [60], [65].

There are alternatives of CPC. The first of thesewas point-to-plane ICP. [51] minimizes the shortestdistance between a point and the tangent plane of theclosest point on another surface. This allows fastertangential movement of the surfaces. [66] studiesmotion of one surface within the squared distancefield of another, and derives a local quadratic approxi-mant for the squared distance function; standard ICPuses a linear approximation. They reveal that thislocal quadratic approximant gives a hybrid betweenstandard and point-to-plane ICP; [14] derives a reg-istration method with quadratic convergence. Thesemethods give faster and more stable convergencewith accuracy similar to that of standard ICP. In-dependently, [67] proposed a similar method basedon a generalized Gauss-Markov model which uses astatistical framework to model noise.

4.1.2 Features

Features are quantities (e.g. principal curvatures) thatdescribe a point. Multiple features can be concate-nated to form a feature vector. When features of twopoints (or clusters) match, a meaningful correspon-dence may exist. Higher-dimensional features offergreater pruning power because of the low probabilityof matching all components of the feature, but oftentake longer to compute. Low-dimensional features arequick to compute and is usually used in conjunctionwith other constraints like saliency (Section 4.1.3).Features are sometimes referred as point signaturesif they have high descriptive and pruning power.Such features can be defined in a transformationinvariant way, and help to establish sparse corre-spondences. These features include spin images [52],mean curvature [56], Gaussian curvature [53], mo-ments and spherical harmonics [54], integral descrip-tors [46], [48], [55], FFT and DCT coefficients of distri-butions of normals [17], cluster signatures of size andanisotropy [46] and SHOT signature [68].

4.1.3 Saliency

Saliency is a measure of local significance in a surface:salient points/regions are those whose propertiesare unlike most of their neighbours. They are usedfor key point/region detection, often in conjunctionwith features (Section 4.1.2). They reduce the sizeof correspondence space, potential mismatches andobtain more reliable coarse correspondences. Thesesaliency measures include: differential properties [56],rareness [48], scale space analysis on geometry [17]and features [48], size and curvature based on visualsaliency [53], multi-scale slippage [57] and maximallystable extremal regions [58].

4.1.4 Regularization Constraints

Regularization involves adding penalty terms to theobjective function. It incorporates a priori information,improves search, and avoids local minima duringoptimization. Two regularization terms are used inrigid registration; details of the optimization processesinvolved are given in Sections 5.2.3 and 5.2.2.Equalization: [65] casts registration as a continuousoptimization problem that handles both rigid trans-formation and correspondence determination simulta-neously. It uses CPC, and formulates a regularizationterm based on entropy: −

∑i

∑jMij logMij where

M is the matching probability matrix of all corre-spondences. When all point matches are equally likely,the entropy reaches a maximum. This term can beinterpreted as equalizing the importances of pointmatches, forcing them to the same distance and thuscausing the surfaces to run parallel, rather than inter-secting one another [59]. From an optimization pointof view, this helps to convexify the error landscape of

Page 8: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 8

the objective function. [13] uses a similar argument in-volving equalization of confidence weightings basedon mean field variables.Maximizing Correspondences: [65] uses the term−∑i

∑jMij to maximize correspondences and bias

the objective function towards more matches (i.e. itfavors Mij = 1 over Mij = 0). This maximizes theoverlapping area.

4.1.5 Search ConstraintsSearch constraints are for efficiency. These include:Localization: [62] gives a fast ICP algorithm (Sec-tion 4.1.1) based on localization. They restrict the searchfor a new CPC at each iteration to the local neighbor-hood of the correspondence in the previous iteration,avoiding a global exhaustive search.Hierarchical Approaches: [62] further combines theabove with a coarse-to-fine hierarchical search tech-nique. Use of down-sampled data speeds up thefirst few iterations of ICP; the resolution is graduallyincreased to obtain more reliable alignment. [56] usesa hierarchy of surface features in the form of points,curves and finally surfaces. They estimate a Euclideantransformation using a few salient points. The trans-formation is then refined using curves, and surfaces.

4.2 Constraints for Non-Rigid Registration

This section discusses constraints for non-rigid reg-istration, assigning them to categories of markers,templates, deformation-induced constraints, features,saliency, regularization, envelopes of motion andsearch constraints (Table 4).

4.2.1 Markers3D markers provide sparse correspondences. For ex-ample, early commercial systems used simple refer-ence objects (e.g. spheres) clamped to a surface beforedata acquisition, which are then identified by fittingmethods [69]. Other techniques identify markers froman accompanying video sequence using e.g. Laplacianconvolution filters [23] or SIFT [33]. Some approachessimply assume correspondences are provided [34].

4.2.2 TemplatesTemplates simplify the part-to-part alignment prob-lem to part-to-whole alignment, by providing strongstructural information including priors for shape, con-straints on arbitrary deformation, and connectivity.These greatly help to handle topological noise andmissing data. [6], [28], [79] constructs a template,an urshape, by accumulating and filling in missingdata from other scans in the same sequence. Theyhave been used extensively in non-rigid registration toconstrain dense correspondences [23], [29], [32], [34],[80]. This works for objects with well defined forms,such as the human face and body.

TABLE 4Non-Rigid Registration Constraints.

Constraints Classification Type Examples

Markers 3D Objects SC [69]Optical/Manual SC [23], [33], [34]

Template DC, D [6], [23], [28], [34]

Deformation-InducedConstraints

IsometryConsistency SC, D [2], [8], [9], [36]–[40],

[70]–[72]Self-DeformationDistortion SC, D [70]

Features

Spin Images SC [25], [72]Curvature SC [71], [72]IntegralDescriptors SC [46]

Heat KernelSignature SC [73]–[78]

Saliency Extremities SC [70]Slippage SC [57], [71]

Regularization

Orthonormality DC,D [32]Handling Holes DC,D [31]GeometricCoherence DC,D [6], [25], [27], [29],

[30], [32], [35]Articulation DC,D [24]TemporalCoherence DC,D [6], [79]

Envelope of Motion DC [21], [28]SearchConstraints

Closest PointCriteria DC [2], [6], [25], [29], [34],

[46], [79]

T: Transformation; D: Deformation;SC, DC: Sparse / Dense Correspondence

4.2.3 Deformation-Induced ConstraintsThe deformation model itself may induce constraints.Isometry Consistency: Generalising the distance pre-serving constraint in the rigid case, some methodsassume inelastic or isometric deformation. We firstdiscuss the various definitions of surface distance andthen the use of isometry for correspondence.

Definition of surface distances: Surface distancesare intrinsic measures defined solely by a metric ten-sor. Let (M, g) be a Riemannian manifold with a met-ric tensor g. The intrinsic Laplace-Beltrami Operator∆ is determined by g, and induces a family of surfacedistances of the form: d(x, y)2 =

∑k Ω(λk)(Φk(x) −

Φk(y))2 where λk and Φk are the eigenvalues andeigenfunctions of ∆. These include the diffusion dis-tance [38] (Ω(λk) = e−2tλk ) which measures the degreeof connectivity of two points by paths of length t;the commute-time distance [81] (Ω(λk) = λ−1k ) whichis a scale-invariant version of the diffusion distance;and the biharmonic distance [82] (Ω(λk) = λ−2k ) whichbalances the local and global properties of diffusiondistance. The biharmonic distance is more shape-aware and insensitive to topology.

The geodesic distance can be defined via the lengthof curve γ on M : d(x, y) = minγ

∫ 1

0‖γ′(r)‖dr, γ(0) = x

and γ(1) = y where g provides the inner product tomeasure length. Geodesic distance is sensitive to noiseand holes. [83] proposes fuzzy geodesic distanceswhich trades off between precision and stability. [84]approximates geodesic distances by unfolding bound-ary of holes. The geodesic distance is often found by

Page 9: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 9

solving the eikonal equation and wavefront propa-gation [85]. [86] recognizes the connection betweenheat and distance, and develops an efficient scheme tofind geodesic distance, which is less sensitive to noiseand holes: d2(x, y) = limt→0−4t log ht(x, y) whereht(x, y) =

∑e−tλkΦk(x)Φk(y) is the heat kernel.

[75] extends the diffusion distance on 2D manifoldto 3D volume. It targets volume-preserving deforma-tion and is found useful in medical imaging. DifferentRiemannian metrics can be induced by appropriatelyselecting the arc length. [76], for example, developsan affine-invariant version of diffusion distance byconstructing a new Riemannian metric tensor. It isuseful for equi-affine-invariant deformation.

Use of isometry: There are many ways to use theisometry constraint, which indicates that the surfacedistance between two points is the same before andafter non-rigid deformation. The direct use of geodesicdistance can be seen in [70], where they define geodesicdistortion in terms of maximum differences of geodesicdistances, and in [2], where they define isometric con-sistency in terms of ratios of geodesic distances. [5]extends the latter to work on 1D skeletons for space-time surface registration. There are techniques thatformulate correspondences as a minimization of over-all geodesic distortion [9]. Isometry can also be formu-lated as a transformation model (Section 3.5).Self-deformation Distortion: [70] uses a self-deformation distortion measure to avoid unrealisticsparse correspondences arising due to large stretching(e.g. a limb point being mapped to the torso)or repeated structures (e.g. a left leg point beingmapped to the right leg). This approach is based onmesh differential deformation techniques and the useof salient points (Section 4.2.5). This work discussesthe establishment of meaningful correspondencesacross deforming objects of quite different kinds.

4.2.4 Features & SignatureFeatures designed for rigid registration like spin im-ages, mean curvature and integral descriptors (Ta-ble 4) have been used to find correspondences in non-rigid registration.

There are isometry invariant features. [73] proposesthe heat kernel signature (HKS), which is based onheat diffusion at a point. It is intrinsic (invariant toisometric deformation), multi-scale, informative (con-tains all information about the intrinsic geometry of ashape) and stable. It is used in [40] to define isomet-ric mapping. [87] relates and reinterprets HKS withother spectral shape matching techniques through theGromov-Wasserstein distance.

Over the past years, variants of HKS have beenproposed. [74] uses Fourier transform to avoid scaledifferences and defines a scale-invariant version ofHKS. [75] extends HKS to volumetric data, using vol-umetric distance. [76] defines a new first fundamentalform and uses a finite-element technique to define

an affine-invariant version of HKS. [77] proposed theWave Heat Kernel Signature (WKS) which uses aquantum mechanical approach to capture multi-scaledetails. WKS is shown to be more descriptive thanHKS. There is an emerging approach [78] to automat-ically learn the optimal feature descriptor, in terms oflocalization, discriminativity and various invariances.

4.2.5 SaliencySlippage features designed for rigid registration areused in [57] to identify correspondences between non-rigid shapes. [8], [70] use extrema of a deformationinvariant function, the integral geodesic distance G(p) =∫s∈P dg(p, s)dP , to locate salient points. G(p) is large

if p is far away from the rest of the mesh (e.g. at thetip of a finger for a whole body model). This integralis stable under changes of surface detail.

The heat kernel signature (HKS, Section 4.2.4) pro-vides the basis for the definition of salient pointsand important components for non-rigid shapes. [88]defines ”Persistent Heat Signature” for points usingpersistent homology. The idea is to track and identifycritical points (maxima, minima) when the topologyof HKS changes over the surface. [89], [90] applythe concept of maximally stable extremal regions(MSER) from computer vision to detect stable re-gions/components in non-rigid shapes.

4.2.6 RegularizationIn Section 3.6, we pointed out the importance ofregularization for extrinsic non-rigid registration tech-niques. Here we survey and classify these techniques.Orthonormality: [32] uses an affine transformationA = [a1a2a3] for each node to model fine surfacedetails. In order to encourage the overall deformationmodel to respect articulation, they maximize localrigidity:∑

xi

((aT1 a2)2 + (aT1 a3)2 + (aT2 a3)2+

(1− aT1 a1)2 +(1− aT2 a2)2 +(1− aT3 a3)2

)Handling Holes: [31] avoids the effects of holes due toocclusion by minimizing

∑i ω

2i ‖p

ji−z

ji‖22+

∑i(1−ω2

i )2

where zji are depth values. Holes usually have largedepth values, causing a high error. The optimizergives holes small weights ωi in the first term. Thesecond term prevents a trivial solution with all ωi = 0.Geometric Coherence: Various methods encouragegeometric properties to vary as smoothly as possible,or non-rigid transformation fields to be coherent. [28],[33] use differential coordinates (first order) and [35]uses thin plate splines (TPS) (second order) in non-rigid deformation to provide smoothness. [27] en-courages displacement vectors to point in similardirections by analyzing the low-frequency compo-nents of the displacement field. [30] regularizes thelocal affine transformation Ai of each triangle i using∑i

∑j∈N (i)‖Ai − Aj‖2F +

∑i‖Ai − I‖2F where N (i)

Page 10: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 10

is the set of neighbouring triangles to i, and ‖.‖F isthe Frobenius norm. The first term ensures adjacenttransformations are as similar as possible [29], whilstthe second term penalizes over-smoothing whichleads to unwanted changes in shape. [6] encouragesthe normals of adjacent points to vary as smoothlyas possible using

∑i

∑j∈N (pi)

((pi − p′j)

Tni)2

+∑i

∑j∈N (pi)

(n′i − n′j

)2 where N (pi) and n′ are theneighborhood and the normal of p′ on the recon-structed shape. The first term encourages points to liein the tangent plane of their neighbors and the secondterm aligns adjacent normals.Articulation Structure: In articulation, points belong-ing to the same bone have the same transformationbecause of the bone’s rigidity, except near joints. Toensure smooth transition of geometry of a point p atjoint or to fill in missing data between two bones bi, bj ,a regularization term

∑p‖Tbi(p) − Tbj (p)‖2 would

be used. [24] further assumes skeletal information isavailable and [25] finds bone transformations fromclustering and uses geodesic consistency.Temporal Coherence: This is used for regularizingnon-rigid registration when capturing sequences of3D scans. [6], [79] employ conservation of momen-tum to avoid high frequency tangential deformationnoise and remove temporal jittering by minimizing∑τ

∑i(u(pi, τ + 1)− 2u(pi, τ) + u(pi, τ − 1))2 where

u(pi, τ) is the deformation field of a point pi attime τ . They further suggest two regularization terms,∑τ

∑i(det(∇u(pi, τ))−1)2 and

∑τ

∑i(u(pi, τ + 1)−

u(pi, τ))2 to preserve volume and temporal smoothness.A deformation field is locally volume preserving ifdet(∇u) = 1. The temporal term imposes a smallpenalty on velocity in order to avoid fluttering arti-facts in regions that are not well-constrained by datapoints, such as boundaries.

4.2.7 Envelope of MotionCorrespondences can be computed from space-timesurfaces: the envelope of motion is given by trajectoriesof points perpendicular to their normal field. [21] wasthe first to consider space-time surfaces for both rigidregistration and non-rigid correction. Instantaneouskinematics are used to define and minimize the dotproduct of the velocity field and the normal field. Fornon-rigid correction, they sample points, compute lo-cal instantaneous space-time velocities, and propagateresults to neighboring points using regularization.[28] extends the method to larger deformations byextracting an urshape [79], and using a non-linearas-rigid-as-possible model [42] for deformation. Space-time surfaces require very dense spatial and temporalsampling of data.

4.2.8 Search ConstraintsThe Closest Point Criterion (CPC) has been usedextensively in non-rigid registration, often in an opti-mization process similar to that of rigid registration.

CPC is applied to find tentative correspondences ateach step. Practical optimization issues are discussedlater in Sections 5.1.1, 5.5.

4.3 Observations and Discussion

Our classification (Tables 3 and 4) shows that severalsimilar high-level constraints are used in both rigidand non-rigid registration, namely transformation-(deformation-) induced constraints, features, saliency,regularization and search constraints. In particular,many features and saliency have been reused in non-rigid registration. They are usually defined on lo-cal surface geometry and are (at least, assumed tobe) invariant under transformation. Some constraintshave been adopted by non-rigid registration, e.g., [46]defines an isometry constraint based on Euclideandistance, which is replaced by geodesic distance in [2].

A significant difference is that while no rigid regis-tration methods use templates, many non-rigid meth-ods do. The template helps to control the extra DoFin the non-rigid case. This works because non-rigidregistration is typically applied to specific objects witha well-known form, such as human faces and bodies.Rigid registration is often applied to unknown ormuch more general objects and scenes, and templatesare less useful.

Isometry is a very powerful constraint which issufficient to define a class of intrinsic transformationmodels (Section 3.5), and is used in optimization invarious ways. From a high level viewpoint, recentintrinsic techniques enforce isometric consistency. Ex-trinsic non-rigid registration allows fairly arbitrarydeformation (due to the high dimensionality of themodels (an over-fitting problem)) but encourages isom-etry through the heavy use of regularization. In gen-eral, most non-rigid constraints (both extrinsic andintrinsic) focus on modelling the relationship betweencorrespondences, whilst this relationship is implicitlycaptured by the rigid transformation in rigid registra-tion.

For rigid registration, the closest point criterion(CPC) is effective, as the rigidity constraint is imposedby the assumption of rigid transformation. In non-rigid registration, most deformation models allowarbitrary local transformation, and the results maynot adequately respect the material properties of theunderlying surfaces. CPC alone is not likely to bevery effective in the non-rigid case. That is why weclassify CPC as transformation-induced constraint inrigid case but search constraint in non-rigid case.

Choice of features for feature matching is notwell discussed in the literature. Our own experiencesand those reported elsewhere can be summarised asfollows. Curvatures are fast to compute, but easilyaffected by noise and incomplete surfaces. Integralinvariants are robust to noise and give similar in-formation to curvature. Users must define a suitable

Page 11: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 11

ball and grid size for feature matching to get goodperformance. Slippage analysis is unsuited to smoothsurfaces due to its definition. SHOT signatures per-form better than spin images [68]. A comparison ofseveral recent features and saliency measures for non-rigid shapes can be found in [91].Discussion on priors and space-time registration:Here we discuss the constraints of non-rigid regis-trations in terms of the strength of prior. Markersand templates are explicit prior knowledge providedby users. Transformation (deformation)-induced con-straints and reliable features & saliency measures areimportant priors. The transformation-induced con-straints are stronger and can often be used for eval-uating and pruning correspondences established bythe latter. These constraints allow registration to becarried out on highly sparse data, or data from asingle view, and often in a pairwise manner.

The focus of space-time surface registration is theuse of weaker priors, e.g. various regularizations andenvelop of motion. The idea is that more temporaldata helps constrain an arbitrary deformation. Thesetechniques usually target sequences, require densertemporal sampling, assume small deformation andlarge overlap of regions for feature correspondencesbetween consecutive frames. Perhaps the weakestprior is the envelop of motion which requires bothdense spatial and temporal sampling. (Search con-straints are mostly for efficiency.)

Recent space-time registrations focus on reduc-ing the dimension of the deformation (alleviatingthe over-fitting problem) by constructing a low-dimensional representation (e.g. skeletons [5], bonesand joints [26]). These enable registration techniquesto handle sparser data and multiple or all frames ata time, and generate more accurate results. There aremethods that focus on global registration in terms ofthe alignment order of (multiple) sequences to reduceaccumulation errors [92].

5 OPTIMIZATION METHODS

Rigid and non-rigid registration techniques are typ-ically cast as non-linear optimization (see Table 5).Many techniques optimize both transformations andcorrespondences whilst some optimize transforma-tions or find correspondences only.

5.1 Local Deterministic Optimization

Local deterministic methods look for a solution thatmaximizes/minimizes an objective function locally.These techniques are efficient but also depend heavilyon initialization and often converge to local minimum.

5.1.1 Gradient Descent, Newton and related methodsGradient-descent, Newton, (damped) Gauss-Newton,quasi-Newton and Levenberg-Marquardt (L-M) are the

TABLE 5Optimization Methods.

Formulations

RigidRegistration &CorrespondenceExamples

Non-RigidRegistration &CorrespondenceExamples

Loca

lD

eter

min

istic

Gradient Descent &related methods

[11], [50]–[52],[54], [62], [93]

[24], [28], [30],[34]

Newton,Gauss-Newton, L-Mmethod

[14], [46], [66],[94]

[2], [31], [32],[79], [80]

Quasi-Newton [6], [23], [29], [30]ExpectationMaximization [27] [9], [27]

Glo

bal

Det

erm

inis

tic

Branch and Bound &Tree Search [48] [70]

GraduatedAssignment [59], [60] [35]

Mean Field Annealing [13]Gaussian FieldFramework [95]

Game TheoreticalFramework [96]

Spectral Method [2], [5], [97]S

toch

astic

Genetic Algorithm &Simulated Annealing [15]

Particle Filtering [3]HoughTransform [17] [39]

RANSAC [39] [71], [98]Belief Propagation [72]

Con

-st

rain

edS

earc

h Prune and Search Sections 4.1.2, 4.1.3Geometric Hashing [53]EmbeddingTechniques [36]–[38], [99]

most frequently used optimization techniques inrigid and non-rigid registration.Rigid Registration: The optimal transformation a ateach iteration is updated by (a) taking derivatives ofthe objective function, (b) linearizing the rigid motion(rotation and translation) [94] or kinematic (helicalmotion) parameters [66], (c) setting the derivativesto zero and (d) solving the resulting system of equa-tions. [14] points out that standard ICP is similar toa gradient-descent method because it always findsthe best transformation locally at each step. In rigidtransformation estimation, a closed-form least-squaressolution (preferable for efficiency and robustness) ex-ists if three or more distinct non-collinear pairs ofcorrespondences are given [100], which can be solvedby SVD, quaternions, orthonormal matrices or dualquaternions.Non-Rigid Registration: Most methods based on CPCformulate an energy functional with data and reg-ularization terms. We summarize some key imple-mentation issues here. [29] suggests a multi-resolutionapproach, optimizing a low resolution mesh to findcorrespondences, then optimizing again on a high res-olution mesh. Methods handling large deformationsgenerally start the optimization with small emphasison the data term because CPC tends to be invalid.

Page 12: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 12

After a few steps, more emphasis is put on the dataterm because the original shape has been deformedto lie close enough to the aligning shape [29], [30].This is similar to graduated convexity (Section 5.2.2)except that weights are assigned empirically.

5.1.2 Expectation Maximization (EM)Some methods solve for groups of parameters inalternate steps, e.g. first fixing correspondences andoptimizing transformation parameters, then fixingtransformation parameters and optimizing the corre-spondences. These are EM approaches.Rigid Registration: [27] represent the centroids of onepoint set using a Gaussian mixture model (GMM) andalign them with another set of data points. The EMalgorithm alternates between two steps: (1) it refinesthe parameters (the transformations and the covari-ances of GMM), and computes the posterior prob-ability distributions of the GMM centroids throughBayes theorem, (2) computes the rigid transformationparameters that maximize the likelihood.Non-Rigid Registration: [27] extends the abovemethod to non-rigid registration by regularizing thedisplacement fields using coherence.Non-Rigid Correspondence: [9] formulates the prob-lem of shape correspondences as a combinatorialoptimization, solved using EM approach, with thegoal of minimizing the overall isometric distortion ofcorrespondences all over the surface.

5.2 Global Deterministic Optimization

Local optimization can become stuck in local minimaif the initial solution is not close to a global solution.Global optimization tries to find a global solutionand to avoid local minima. Registration, when castas global optimization, does a complete search withbounds (exact global solution), or approximates /relaxes the problem into continuous settings so thata near global solution can be found.

5.2.1 Branch and Bound & Tree SearchA decision tree can be formed where each node is apossible correspondence pi,qj. The root is ∅, ∅;the set of nodes forming a path from root to a leafnode is one set of possible correspondences betweentwo surfaces. Branch-and-bound is a complete searchtechnique, which is based on a lower bound on thecost function. Its efficiency depends on how tight thebound is, and how quickly it can be computed.Rigid Registration: [48] uses distance root meansquared error:

dRMS2(P,Q) =1

n2

∑i

∑j

(‖pi − pj‖ − ‖qi − qj‖)2

as a bound on CPC error (Eqn 5); it is both tight andfast to compute.

Non-Rigid Correspondence: [70] represents potentialcorrespondences in a tree, and uses a self-deformationdistortion measure for a set of correspondences toprune whole branches of a tree search. This methodallows natural correspondences to be found betweentwo non-rigid 3D shapes of quite different kinds.

5.2.2 Graduated AssignmentThe graduated non-convexity, which is a deterministicannealing technique, seeks a convex function E =ED + βES to approximate the non-linear objectivefunction ED by adding a regularization term ES . βcontrols convergence of the system in a similar wayto temperature in simulated annealing.Rigid Registration: [65] initiated the application ofgraduated non-convexity to registration. They cast thecombinatorial search as a constrained continuous non-linear optimization, with cost function:

E(a) =∑j

∑i

Mij‖qj −Rpi − t‖2

+1

β

∑j

∑i

Mij logMij − α∑j

∑i

Mij ,(6)

where α, β are weights, M is the matching proba-bility matrix, and (R, t) is a rigid transformation.The second and third terms are regularization terms(see Section 4.1.4). β is the inverse computationaltemperature of the system. This method can be spedup by k nearest neighbors [59] and CPC [60].Non-Rigid Registration: [35] defines a transformationmodel using thin plate spline, and uses graduated as-signment for non-rigid registration and optimization.

5.2.3 Mean Field AnnealingMean field annealing is a deterministic approximationto the simulated annealing technique.Rigid Registration: [13] minimizes the cost function:

E(M,R, t) =∑i

mi‖qi −Rpi − t‖2

− 1

β

∏i

mi −1

β

γ − 1

γ

∑i

mγi

where mi is the mean field variable and is a vector inM (the matching probability matrix). The first term isthe data term and the others are regularization terms(see Section 4.1.4).

5.2.4 Gaussian FieldsThe Gaussian field framework replaces a non-linear,non-smooth error function by a convex smooth ap-proximation, with the help of mollification: a summa-tion of a large number of Gaussian functions which,by the Central Limit Theorem, leads to a single Gaus-sian. Tracking the global extremum is much easier asthe error is differentiable even for noisy input data.Rigid Registration: [95] uses the objective function:

Eσ(a) = maxR,t

∑i

∑j

exp

(−d

2(pi,qj)

σ2

)

Page 13: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 13

where d(pi,qj) = ‖qi − Rpj − t‖. σ works likethe temperature in annealing technique. This methodshows robustness in the presence of strong noise.

5.2.5 Game Theoretical FrameworkFinding a large set of correspondences that express ahigh level of mutual compatibility can be formulatedas an inlier selection process of a non-cooperativegame [101]. The inlier selection is modelled as twoplayers competing with one another.Rigid Correspondences: [96] uses game theory torecognize rigid objects in a cluttered scene. They usethe consistency of Euclidean distances between pointpairs to model a competitive game. To obtain reliablecorrespondences, they define two steps: 1) to run thecompetitive game to obtain a set of reliable sparsecorrespondences, 2) to propagate the sparse set to adense set using neighbouring information.

5.2.6 Spectral CorrespondenceA spectral method [97] takes into account the rela-tionship between points and correspondences, andformulates it as an eigen-decomposition problem.Non-Rigid Correspondence: [2] extends the conceptto the non-rigid case. Two relationships are consid-ered: (i) the similarities of two points (as differenceof principal curvatures), and (ii) the isometric consis-tency (as difference of geodesic distances) of two pairsof correspondences. Having defined an affinity matrixof correspondences, they compute its principal eigen-vector. The eigenvector entries define the confidencevalues of correspondences that belong to a coherentcluster. This helps to prune unreliable ones.

5.3 Stochastic OptimizationMany techniques model registration and the corre-spondence determination problem with statistics andprobabilistic approaches in order to handle datasetswith noise, outliers and missing data.

5.3.1 Genetic Algorithm and Simulated AnnealingGenetic algorithm adapts the idea of evolution andnatural selection [102]. Simulated annealing is inspiredby thermodynamics. These rigid techniques are dis-cussed in [15].

5.3.2 Particle FilteringParticle filtering is a recursive Bayesian estimationtechnique to estimate the posterior distribution ofa state given past observations. Its strength lies inits ability to handle non-linear functions and non-Gaussian noise. It is often used in object tracking.Rigid Registration: [3] sees the optimization of rigidregistration as a sequence of steps to arrive at aglobal solution. They use particle filtering to track theoptimal state (the rigid transformation). The methodhandles noise, arbitrary initial estimates, missing data,

and both sparse and dense point sets. At each iterativestep, the algorithm draws Ns particles (here, initialtransformations) from the proposed distribution, andfeeds them into a local optimizer (e.g. ICP) to producecandidate solutions. Importance weights are adjusted.New particles are predicted, and the process repeats.Multiple particles help to avoid local minima.

5.3.3 Hough Transform (Voting)The Hough transform was originally developed for linedetection in images, and was later developed to detectother shapes and find transformation parameters. Itoperates on a quantized parameter space using anaccumulator.Rigid Registration: [17] identifies salient points, gen-erates candidate transformations, votes for their pa-rameters in the accumulator space, and picks thepeaks after all votes have been cast. The accuracy ofthe method depends on the accumulator resolution,so ICP is subsequently applied for fine alignment.Non-Rigid Correspondence: [39] uses voting for non-rigid correspondences. The key novelty is to project agenus-zero topology mesh in a mid-edge parametriza-tion so that a global Mobius transformation can beused, as for a rigid transformation. The method re-peatedly samples three points, applies the Mobiustransformation and tests if the transformation is validor not using isometric consistency. This method votesfor correspondences.

5.3.4 RANSACRandom sample consensus (RANSAC) is a robust al-gorithm for fitting models in the presence of manyoutliers. It follows a hypothesis-and-test paradigm.Rigid Registration: [4] considers rigid registrationas a largest common pointset (LCP) problem. Giventwo point sets P and Q, RANSAC finds the largestsubset P ′ ⊂ P such that the error between T(P ′) andQ is less than some predefined threshold, where Tis a rigid transformation. The affine ratio constraintreduces the number of possible correspondences.Non-Rigid Correspondence: [71] introduces impor-tance sampling into RANSAC. Potential correspon-dences are drawn from a probability density function.Isometric consistency is formulated as a likelihoodto adjust the posterior probability such that morereliable correspondences are determined in the nextiteration. Results are improved using a tangent-spaceoptimization technique that moves points along themesh. [98] uses an entropy-based planning strategy toselect important matches and reduce sampling cost.

5.3.5 Message Passing and Belief PropagationRandom variables, which have a Markov property,can be represented as a graph in a Markov random field(MRF), where the undirected edges model the inter-dependencies between random variables.

Page 14: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 14

Non-Rigid Correspondence: [72] uses MRF to modelthe inter-dependency of correspondences, aiming tofind correspondences that maximize a certain jointprobability distribution. Two probability potentialsare considered: (i) discrepancies between the com-pressed spin images of two points, and (ii) isometricconsistency between two pairs of correspondences.The simplifying assumption of MRF is that all prob-abilities are independent of others and can be mul-tiplied together. They apply loopy belief propagation tofind a (local) maximum of probability.

5.4 Constrained SearchConstrained search establishes special constraints tolimit the search space. We summarize them as follows.

5.4.1 Enumerate and PruneEnumerate and prune is the usual strategy to obtainsparse (or initial) correspondences. In general, it is im-possible to enumerate all correspondences. Saliency,features (Sections 4.1.2, 4.1.3) and other constraints areusually employed to prune the huge search space.

5.4.2 Geometric HashingGeometric hashing pre-computes and stores a trans-formation invariant representation of a patch in ahash structure. The hash bins are used to restrict thenumber of potential matches.Patch Correspondences: [53] extracts patches usingsalient geometric features and hashes the transforma-tion invariant representations. The representation hascomplexity

(|Ps|3

)as it takes every point-triple from

the patch Ps containing |Ps| points. Given a new patchQs, a new representation is formed and found in thehash structure in

(|Qs|3

)time. The key is to break the

correspondence problem of complexity(|Ps|

3

)·(|Qs|

3

)into two sub-problems; this allows simultaneous pro-cessing of all hashed patches. The method has hugememory requirements.

5.4.3 Embedding TechniquesNon-rigid Correspondence: A class of techniquesuse embedding to establish correspondences. [36] com-putes and aligns eigenvectors of a geodesic kernel.Correspondence is established through exhaustivelypairing (sign-flip, eigen-mode switch) and non-rigidalignment of the leading 6 eigenvectors. These eigen-vectors capture the low-frequency structure of themeshes. [99] extends the concept further by defin-ing histograms for these eigenvectors for quick pair-ing. [37], [38] embed a mesh directly in another, byusing generalized multi-dimensional scaling (GMDS) andGromov-Hausdorff distance for partial matching [103].In general, this class of work is inspired by [104]which embeds non-rigid models into some canonicalspace for surface matching. Once the intrinsic geom-etry is embedded (through g ∈ G, see Figure 1),

the correspondence problem (h ∈ H) reduces to astandard rigid or low-dimensional non-rigid match-ing and alignment problem (f ∈ F ) [36], [103].

5.5 InitializationGlobal deterministic optimization, stochastic methodsand constrained search generally do not require ini-tialization. Here we focus on local iterative optimizersthat do need an initial estimate.Rigid Registration: [67] interactively selects threecommon points to define an initial rigid transforma-tion. [46], [52] use reliable correspondences to au-tomate this process. [14], [54], [59], [60], [62], [64],[94] test their methods on a set of predefined initialtransformations within a certain range, and evaluatethe range of convergence. In general, the broader andmore stable the convergence funnel, the better themethod [94]. Certain methods are designed to obtaincoarse alignments (e.g. [17], [49]; see [15]).Non-Rigid Registration: [22], [32] use rigid regis-tration to find an initial estimate. [2] tries multipleinitializations if the error of the first is too high. [25]finds the initial transformation of a rigid sub-part bylooking for clusters in transformation space definedby all possible correspondences. For a sequence of3D scans, adjacent frames are roughly aligned, andcorrespondences can be tracked [6], [21], [28], [33],[79]. Other approaches [23], [24], [29], [30], [43] as-sume markers or specific segmentation informationare provided.

5.6 Observations and DiscussionRigid and non-rigid registration are typically cast asoptimization problems. The error landscape depends onthe type of data being registered, outliers, noise, andmissing data. We summarize some observations andexperiences of our own and from the literature.Rigid Registration: If the surfaces are relatively cleanand there is a good initial estimate of alignment,local optimizers (e.g. ICP or gradient descent, Newton’smethod) are the most efficient choice. However, ifthere is significant noise, or the initialization is poor,these methods may not converge [94]. Global deter-ministic optimization (e.g. the Gaussian field frame-work [95]) or stochastic techniques (e.g. particle filter-ing [3]) are more reliable in these situations. Stochastictechniques do not guarantee a globally optimal solu-tion, but the solutions are usually good. In practice,using a fast stochastic technique (e.g. RANSAC [4])to obtain an initial coarse alignment, and a localoptimizer for subsequent fine alignment is a goodapproach. Features and saliency are important for fastpruning and establishing coarse alignment. Hierarchi-cal methods can also find an initial alignment, andoptimize it during fine alignment.Non-Rigid Registration: Extrinsic non-rigid registra-tion usually uses local optimization for efficiency. In

Page 15: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 15

particular, the quasi-Newton method can avoid theneed to invert a large Jacobian or Hessian matrix.However, local optimization typically requires goodinitialization. Three techniques are used to make lo-cal methods work well (i) regularization, (ii) corre-spondences and (iii) pre-registration. Regularizationis useful because it convexifies the error landscapeof the objective function. It is often used with de-creasing weights in successive steps to avoid pre-mature convergence at a local minimum. Reliablecorrespondences drive optimization directly and pre-registration or coarse alignment are used to providegood initialization (Section 5.5).

Recent advances in intrinsic techniques studyisometric consistency and intrinsic geometry. Manyof them use probabilistic techniques, and the solutionsare usually good. Over-fitting is less of an issue inthese techniques, but they mainly target (nearly-)isometric deformations. Computing surface distanceson a large set of points, and handling topologicalnoise and holes are important issues. Sub-samplingof the surfaces may be required.

Quadratic Assignment Problem: Some optimizationtechniques are related to the intractable quadraticassignment problem (QAP). We summarize them be-low. QAP considers allocating a set of facilities toa set of locations, with the cost being a function ofthe distances d and flows f between the facilities,together with the costs b associated with a facilitybeing placed at a certain location. The objective is tofind the mapping φ which assigns each facility i to alocation φ(i) such that the total cost is minimised:

arg minφ

∑i

∑j

fijdφ(i)φ(j) +∑i

biφ(i)

The intuition is to arrange φ so that facilities i, jwith high flow fij are assigned to locations φ(i) andφ(j) which have small distance dφ(i)φ(j). QAP is NP-hard, and all existing techniques approximate or relaxthe solution one way or another. [65] approximatesthe QAP by linearising the cost function via Taylorseries and using two-way constraints to convert theQAP into a continuous search. Inspired by this, grad-uated assignment (e.g., [35], [59], [60]) and mean fieldannealing [13] techniques are further developmentswith additional constraints. These techniques use arigid transformation instead of the quadratic termfijdφ(i)φ(j) to model the pairwise relationship of corre-spondences (facilities & locations in QAP). To explic-itly model such a relationship, spectral techniques [2],[97] and a game theoretical framework [96] considerthe objective function: xTMx where M summarizesthe QAP cost above. Both techniques relax x, the bi-nary assignment, to a continuous one. The differenceis that the former constrains ‖x‖2 = 1 (due to theuse of Rayleigh quotient); whilst the latter ensuresthat ‖x‖1 = 1. Markov random field approaches use

probability potentials to model the inter-dependencesof correspondences using geodesic distance. Inspiredby these pairwise techniques, higher-order matchingtechniques (e.g. [105], [106] which consider ternaryrelationships) are emerging.

6 EVALUATION

Here, we discuss common evaluation approaches andsurvey those datasets that are commonly used.Rigid Registration: The most common way to evalu-ate rigid registration is to compare the deviation of therotation and translation parameters directly becausethe rigid transformation is low-dimensional (6 DOFs).These parameters can be obtained from syntheticdatasets, where additional noise, outliers and differentlevel of overlaps may be added/adjusted. There areseveral publicly available datasets with ground truth(e.g. [107], [108]). The ground truth parameters aredefined during the scanning. These datasets are usu-ally small in size. For large scale datasets (e.g. [109]),ground truth are not available, and visual assessment(e.g. sharpness) is required. The range and rate ofconvergence are also common measures.Non-Rigid Registration: Evaluation of non-rigid reg-istration is a significant issue, since non-rigidity is dif-ficult to formulate and can most directly be assessedin terms of correspondences. Earlier techniques usevisual assessment. [6], [28], [32] used parametrizationto demonstrate low visual distortion. [39] uses colourcoding to show sparse correspondences.

Recent techniques use public datasets for evalua-tion. Several datasets are available: 1) benchmark for3D mesh segmentation [110] (e.g. Ballerina). Thesemeshes have different numbers of vertices, connectiv-ity and severe non-uniformity of sampling. 2) non-rigid world dataset [111] contains various animaland human mesh models with different poses, whereeach object has ≈3K vertices with arbitrary connec-tivity. 3) dataset created by existing techniques, withfixed mesh connectivity, and uniformly sampled athigh resolution (9-30K) [112] (e.g. Jumping Man), [72](e.g. Dancing Man), [30] (e.g. Horse Gallop)—createdby [113]. These ground truth correspondences are ob-tained by fitting a common template on the registereddata. 4) TOSCA high-resolution dataset is constructedwith ground truth correspondences, with fixed num-ber of vertices and connectivities.

When there is no ground truth correspondences,evaluation of techniques using dataset types 1 and2 are usually based on visual assessment by con-necting spare correspondences with lines [9]. Givenground truth correspondences in dataset types 3 and4, additional metrics can be defined. [8] uses sum ofisometric distortion—difference of geodesic distancesbetween the end-points of one correspondence withrespect to all the other correspondences. This can bevisualized per vertex, assuming there is a one-to-one

Page 16: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 16

mapping. [9] sums these isometric distortions all overthe surface to obtain one single number. Note that iso-metric distortion is meaningful only if the datasets areundergoing a (near-)isometric deformation. Note alsothat the evaluation of new techniques on dataset type3 may actually compare their performance against thetechniques that create the dataset.

A benchmark database [91] based on TOSCA fur-ther allows evaluations on isometry, topology, holes,scale, sampling, and noise of features and saliency. [8]also recruited a volunteer to label 10-35 semanticallymeaningful and consistent feature points for eachclass of objects in the watertight meshes dataset [114].

In the computer vision literature, [115] discusseshow to evaluate registration without ground truth.One way is to evaluate the results by generalizingthem to unseen data (a well known method in data-fitting). Given ground truth data, receiver operatingcharacteristic (ROC) curves, which are plots of truepositive rate against false positive rate, may be used.In general, obtaining ground truth data is difficult.One idea is to use image-based techniques [116] andcompare the results with non-rigid registrations.

7 CHALLENGES AND FUTURE DIRECTIONS

In this section, we discuss challenges and possiblefuture directions in registration.Deformation Models: Section 3 surveyed varioustransformation models for registration. Most existingmethods assume a single model for the two surfacesbeing aligned. This is practical, enabling optimizationto simultaneously find the best transformations for allpoints. However, a surface comprises different materi-als and may undergo different kinds of deformation.For example, arms and legs (mainly) undergo artic-ulations, whilst deformation of a skirt leads to foldsand wrinkles, and in such cases multiple deformationmodels might be more useful. The main difficultiesof doing so are the need for surface segmentation,constraints between adjacent models, and approachesto merge registration and deformation results.

In general, non-rigid registration is problem- anddata-dependent. Users and researchers should care-fully consider the nature of the deformation. As wellas blendshape models [117], physical models relatingtissues, bones and muscles may be useful in 3Dface registration. This area is yet to explore, possiblydue to the complexity of facial physiology. Relatedtechniques [118] are emerging.Over-Fitting & Model Selection: In data-fitting, onedoes not want the model to describe the noise (i.e.to over-fit the model). Similarly, non-rigid registrationshould align 3D surfaces rather than noise or holes.Over-fitting often is not a concern in rigid registrationas the rigid transformation is low-dimensional, andthe same at all points. Non-rigid registration, how-ever, requires higher DoF. (e.g. local affine transfor-

mations have 12n DoFs, where n is the number ofpoints or nodes.)

It is well-known theoretically that such high di-mensionality can lead to over-fitting. Ultimately, twogeneral surfaces can always be brought into alignmentwith arbitrarily small error. However, the resultingalignment need not represent physically correct cor-respondences between homologous structures on thesurface. Indeed, simply comparing the residual errorsor the rate of convergence for two non-rigid extrin-sic registration techniques tells us little about whichproduces the more meaningfully correct result.

To alleviate the over-fitting problem, existing ex-trinsic techniques encourage smoothness through (i)formulating a priori knowledge or assumptions asregularization, (ii) introducing additional equationsby some linear constraints on neighborhoods or (iii)uses a reduced representation (e.g. embedded graphnodes, meshless element models). Recent intrinsictechniques [36], [38], [39] use low dimensional em-bedding to by-pass the high-dimensionality problemby assuming a strong inelastic isometry constraint.

A closely related issue is how to automaticallydetermine the most appropriate deformation model:in nearly all existing work, deformation models arepredetermined and thus require a priori knowledge.However, non-rigid registration is similar to datafitting, where there are many well-established tech-niques for model selection: cross-validation, early stop-ping and model comparison (e.g. information-based cri-teria). They could potentially be used to develop afully automatic method. In information theory, theminimum description length criterion is well-known fordetermining the best model from data; it has beenused in image registration [119], face (texture anddepth) registration [120] and medical data registra-tion [121] in computer vision, but is relatively littleused in computer graphics and shape analysis.Constraints and Correspondences: Meaningful cor-respondences should relate homologous sections ofthe shapes, i.e., sections that are semantically related.Although many techniques and constraints have beenproposed to facilitate correspondence search, humanassistance and visual inspection are sometimes re-quired to provide semantic input. As we have noted,there are no precise definitions and methods for evalu-ating quality of correspondences. Such definitions andmeasures are needed that match human perception orare based on physical behavior.

More effective pruning constraints and techniquesare always beneficial. Recent research in intrinsic ge-ometry and isometric consistency has led to importantadvances for isometric deformation, especially forface and body modeling. Here, we try to bring theanalogies with machine learning and provide insightsto why these intrinsic techniques are important.Connection to Machine Learning (ML): There are twokinds of learning methods in ML [122]: inductive and

Page 17: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 17

analytical learning. Inductive learning (e.g. neural net-work) generalizes hypotheses, in terms of a function,from training examples to unseen data. A hypothesisis assumed to be found/fitted empirically from a largeamount of training data. The justification is based onstatistical inference. The advantage is that it requireslittle prior knowledge, but the hypothesis may beincorrect if there is not sufficient training data—over-fitting. Analytical learning uses domain theory (e.g.rules) and deductive inference to derive hypothesis. Itdoes not require lots of training data but is seriouslyaffected if the domain theory is incorrect.

A comparison of registration techniques reveals thatrigid registration and intrinsic techniques are basedon strong prior knowledge (or at least, assumptions).Having a rigid transformation requires all points totransform rigidly. Intrinsic transformations require thetwo surfaces (and all points on them) to satisfy iso-metric consistency. These transformations are similarto the domain theory in analytical learning—a globalrule, one reason why these techniques can be cast asoptimization problems and solved using the Houghtransform [17], [39] or RANSAC technique [4], [71]efficiently. This strong domain theory leads to two ex-amples which demonstrate the successful applicationof typical deductive inference: 1) [40] suggests thatfixing one correspondence, all other correspondencescan be inferred for non-rigid shapes. 2) [98] furtheranswers how many correspondences are required toconstrain deformations of non-rigid shapes.

For general extrinsic non-rigid registrations withhigh DoF, this strong domain theory is lost. Thisis similar to inductive learning with limited priorknowledge. This type of technique assumes the de-formation (hypotheses) can be learnt from sufficienttraining data. Yet, given only two surfaces and ahigh-dimensional deformation model (like [6], [32]),the solution can only be regularized by other priorknowledge—mostly in the form of regularizationsto ensure smoothness. One way of providing moretraining data is to look into the temporal domain. Thisexplains why recent extrinsic non-rigid techniquesfocus heavily on registering space-time sequences.

The pitfall of analytical learning is that it dependson a perfect domain theory. For non-rigid shapes,many deformations are only near/approximate-isometric, which may lead to matching ambigui-ties [8]. In cases with large topological changes, theisometry assumption may fail [38]; handling suchissues is a challenging task. Similarly, the rigid trans-formation is not perfect (i.e. cannot always explainthe optimal solution) under noise, outliers and limitedamount of overlap. Recent efforts of the communityis to model and study these factors (e.g. Section 4.1.4).These approaches may be considered a combinationof inductive and analytical learning techniques [122].Learning from rigid registration in this perspectivemay be a promising direction.

Optimization: Optimization is an important issuein registration as it affects accuracy, robustness andconvergence. Many optimization techniques used inrigid registration have yet to be applied to non-rigidcases. A major obstacle to adapting these techniquesis the high dimensionality, causing most extrinsicapproaches to use quasi-Newton techniques instead ofthe more stable Newton or L-M methods. The former,however, may not converge if the initial alignmentis too far from the optimal solution. Stochastic ap-proaches are promising methods that are not sensitiveto initialization. By transforming non-rigid shapesinto the intrinsic domain, many stochastic methodshave found applications in establishing correspon-dences, and often near globally optimal solutions areobtained. Adapting these techniques to large scaledata, deriving specific objective functions, handlingvarious levels of geometric and topological noise, andextending them to non-isometric and elastic deforma-tion are some of the challenges.Datasets: Recent research has focused on analyzingsequences of 3D scans: (i) hardware for capturingsuch sequences at a high frame rate is now readilyavailable, and (ii) sequences assist registration byproviding intermediate information, and therebyconstraints. However, most techniques can cope withonly limited sequences (15-200 frames), mainly due tomemory limitations. Faster techniques, and especiallyones that can handle more and larger data, are anever-present goal. Groupwise registration techniqueshave been developed [33], [123] which are typicallymore reliable, and reduce the accumulated errors thatarise in pairwise registration. Further challenges liein (i) avoiding the need for markers or templates, and(ii) capturing and processing fine scale details [32].As has been found in medical imaging, registrationutilizing heterogeneous information (e.g. geometryand texture [41], [120]) may give higher accuracy.

Foreseeable Trends:Rigid Registration is becoming more robust and re-liable over the past two decades of researches. Manychallenges posed by rigid registration have been ad-dressed [2]. We foresee that rigid registration is be-coming application-oriented (e.g., [124]). Developingrigid registration techniques that can handle largedatasets with different levels of details and regularitystructures is essential. With the advance of technol-ogy, handheld devices will soon possess 3D scanningfunctionality. Real-time techniques that target theseubiquitous devices are new directions.Non-Rigid Registration, in general, is still at its infantstage, largely because of the large varieties of trans-formations in the real world and the shortage of ourknowledge of these transformations. One directionis to investigate these deformations from a reverseengineering point of view. Many existing non-rigidtechniques are still in the stage of proof-of-concept—

Page 18: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 18

showing what can be done. Developing and capturingmore ground truth data for each type of these trans-formations would help mature these techniques.

Evaluation of correspondences and transformationsare two important areas. Defining the semantic mean-ing of correspondences is a hard problem in itself,and currently no work considers the evaluation oftransformations. Computation efficiency is anotherissue. Perhaps, one can employ machine learning tolearn the nature or pattern of unreliable or incorrectcorrespondences. This can speed up optimization toobtain reliable solution.

We also foresee two directions regarding intrinsictechniques: 1) Existing techniques focus mostly onmodelling near-isometric deformation and obtaining aglobal consistent set of correspondences. Relaxing theconcept to local isometric deformation but ensuringglobal consistency would be one direction becausemost real-life deformations are globally non-isometric(e.g, due to deformation at joints) but locally isomet-ric. 2) Most techniques focus on registering surfaces ofsimilar sources. If one wants to register surfaces fromheterogeneous sources (e.g., two totally different hu-man faces and their expressions), these techniques donot work. This relates to the morphing problem [30],and is useful for medical applications.

Non-isometric deformations need further consider-ation. [8] has started in this direction using blendedconformal maps. [41] combines intrinsic (conformalmap) and extrinsic measures (e.g. texture, Gaussianmaps, curvatures) to handle non-isometric deforma-tion. Consideration of elastic and plastic properties ofmaterials is more commonplace in medical imaging,and needs further work in this community, but oneforeseeable challenge is the lack of ground truth forevaluation. Designing reliable non-rigid registrationfor large dataset and for ubiquitous devices (e.g.Kinects) is important.

ACKNOWLEDGMENT

We thank the associate editor and all the anonymousreviewers for their invaluable comments and insights.

REFERENCES

[1] V. Lempitsky and Y. Boykov, “Global optimization for shapefitting,” in Proc. CVPR, 2007, pp. 1–8.

[2] Q.X. Huang et al., “Non-rigid registration under isometricdeformations,” CGF, vol. 27, pp. 1449–1457, 2008.

[3] R. Sandhu et al., “Particle filtering for registration of 2D and3D point sets with stochastic dynamics,” in Proc. CVPR, 2008,pp. 1–8.

[4] D. Aiger et al., “4-points congruent sets for robust pairwisesurface registration,” ACM TOG, vol. 27, pp. 1–10, 2008.

[5] Q. Zheng et al., “Consensus skeleton for non-rigid space-timeregistration,” CGF, vol. 29, pp. 635–644, 2010.

[6] M. Wand et al., “Efficient reconstruction of nonrigid shapeand motion from real-time 3D scanner data,” ACM TOG,vol. 28, pp. 1–15, 2009.

[7] T. Weise et al., “Realtime performance-based facial anima-tion,” ACM TOG, vol. 30, pp. 77:1–77:10, 2011.

[8] V. Kim et al., “Blended intrinsic maps,” ACM SIGGRAPH,2011.

[9] Y. Sahillioglu and Y. Yemez, “Minimum-distortion isometricshape correspondence using EM algorithm,” IEEE TPAMI,vol. PP, p. 1, 2012.

[10] M.A. Audette et al., “An algorithmic overview of surfaceregistration techniques for medical imaging,” Med. ImageAnaly., vol. 4, pp. 201–217, 2000.

[11] S. Rusinkiewicz and M. Levoy, “Efficient variants of the ICPalgorithm,” in Proc. 3DIM, 2001, pp. 145–152.

[12] M. Rodrigues et al., “Special issue on registration and fusionof range images,” CVIU, vol. 87, pp. 1–7, 2002.

[13] Y. Liu, “A mean field annealing approach to accurate freeform shape matching,” Pat. Recogn., vol. 40, pp. 2418–2436,2007.

[14] H. Pottmann et al., “Geometry and convergence analysis ofalgorithms for registration of 3D shapes,” IJCV, vol. 67, pp.277–296, 2006.

[15] J. Salvi et al., “A review of recent range image registrationmethods with accuracy evaluation,” IVC, vol. 25, pp. 578–596, 2007.

[16] O. van Kaick et al., “A survey on shape correspondence,” inProc. Euro STAR, 2010.

[17] X. Li and I. Guskov, “Multi-scale features for approximatealignment of point-based surfaces,” in Proc. SGP, 2005, p. 217.

[18] D.L.G. Hill, “Non-rigid image registration: theory and prac-tice,” British J. Radiology, vol. 77, pp. S140–S153(1), 2004.

[19] J.P.W. Pluim et al., “Mutual-information-based registration ofmedical images: a survey,” IEEE T-MI, vol. 22, pp. 986–1004,2003.

[20] M. Holden, “A review of geometric transformations fornonrigid body registration,” IEEE T-MI, vol. 27, pp. 111–128,2008.

[21] N.J. Mitra et al., “Dynamic geometry registration,” in Proc.SGP, 2007, pp. 173–182.

[22] B.J. Brown and S. Rusinkiewicz, “Global non-rigid alignmentof 3-D scans,” ACM TOG, vol. 26, p. 21, 2007.

[23] B. Allen et al., “Articulated body deformation from range scandata,” ACM TOG, vol. 21, pp. 612–619, 2002.

[24] Y. Pekelny and C. Gotsman, “Articulated object reconstruc-tion and markerless motion capture from depth video,” CGF,vol. 27, pp. 243–253, 2008.

[25] W. Chang and M. Zwicker, “Automatic registration for artic-ulated shapes,” Proc. SGP, vol. 27, pp. 1459–1468, 2008.

[26] ——, “Global registration of dynamic range scans for artic-ulated model reconstruction,” ACM TOG, vol. 30, pp. 26:1–26:15, 2011.

[27] A. Myronenko and X. Song, “Point-set registration: Coherentpoint drift,” IEEE TPAMI, vol. 32, pp. 2262 –2275, Dec. 2010.

[28] J. Sußmuth et al., “Reconstructing animated meshes fromtime-varying point clouds,” CGF, vol. 27, pp. 1469–1476, 2008.

[29] B. Allen et al., “The space of human body shapes: recon-struction and parameterization from range scans,” in ACMSIGGRAPH, 2003, pp. 587–594.

[30] R.W. Sumner and J. Popovic, “Deformation transfer for tri-angle meshes,” in ACM SIGGRAPH, 2004, pp. 399–405.

[31] H. Li et al., “Global correspondence optimization for non-rigid registration of depth scans,” Proc. SGP, vol. 27, pp.1421–1430, 2008.

[32] ——, “Robust single-view geometry and motion reconstruc-tion,” ACM TOG, vol. 28, pp. 1–10, 2009.

[33] M. Liao et al., “Modeling deformable objects from a singledepth camera,” in Proc. ICCV, 2009.

[34] B. Amberg et al., “Optimal step nonrigid ICP algorithms forsurface registration,” in Proc. CVPR, 2007, pp. 1–8.

[35] H. Chui and A. Rangarajan, “A new point matching algo-rithm for non-rigid registration,” CVIU, vol. 89, pp. 114–141,2003.

[36] V. Jain and H. Zhang, “Robust 3D shape correspondence inthe spectral domain,” in Proc. SMI, 2006, p. 19.

[37] A.M. Bronstein et al., “Generalized multidimensional scaling:A framework for isometry-invariant partial surface match-ing,” PNAS, pp. 1168–1172, 2006.

[38] ——, “A Gromov-Hausdorff framework with diffusion ge-ometry for topologically-robust non-rigid shape matching,”IJCV, vol. 89, pp. 266–286, 2010.

Page 19: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 19

[39] Y. Lipman and T. Funkhouser, “Mobius voting for surfacecorrespondence,” in ACM SIGGRAPH, 2009, pp. 1–12.

[40] M. Ovsjanikov et al., “One point isometric matching with theheat kernel,” CGF, vol. 29, pp. 1555–1564, 2010.

[41] Y. Zeng et al., “Intrinsic dense 3d surface tracking,” in Proc.CVPR, 2011.

[42] O. Sorkine and M. Alexa, “As-rigid-as-possible surface mod-eling,” in Proc. SGP, 2007, pp. 109–116.

[43] D. Anguelov et al., “Scape: shape completion and animationof people,” ACM TOG, vol. 24, pp. 408–416, 2005.

[44] P. Hammond et al., “3D analysis of facial morphology,” Amer.J. Med. Genetics A, pp. 339–348, 2004.

[45] R. Toldo et al., “Global registration of multiple point cloudsembedding the generalized procrustes analysis into an ICPframework,” in Proc. 3DPVT, 2010.

[46] Q.X. Huang et al., “Reassembling fractured objects by geo-metric matching,” ACM TOG, vol. 25, pp. 569–578, 2006.

[47] C.S. Chen et al., “RANSAC-based DARCES: a new approachto fast automatic registration of partially overlapping rangeimages,” IEEE TPAMI, vol. 21, pp. 1229–1234, nov 1999.

[48] N. Gelfand et al., “Robust global registration,” in Proc. SGP,2005, pp. 197–206.

[49] Y.S. Liu and K. Ramani, “Robust principal axes determinationfor point-based shapes using least median of squares,” CAD,vol. 41, pp. 293–305, 2009.

[50] P.J. Besl and N.D. McKay, “A method for registration of 3-Dshapes,” IEEE TPAMI, vol. 14, pp. 239–256, 1992.

[51] Y. Chen and G. Medioni, “Object modeling by registrationof multiple range images,” in Proc. ICRA, vol. 3, 1991, pp.2724–2729.

[52] A.E. Johnson and M. Hebert, “Using spin images for efficientobject recognition in cluttered 3D scenes,” IEEE TPAMI,vol. 21, pp. 433–449, 1999.

[53] R. Gal and D. Cohen-Or, “Salient geometric features forpartial shape matching and similarity,” ACM TOG, vol. 25,pp. 130–150, 2006.

[54] G.C. Sharp et al., “ICP registration using invariant features,”IEEE TPAMI, vol. 24, pp. 90–102, 2002.

[55] H. Pottmann et al., “Integral invariants for robust geometryprocessing,” CAGD, vol. 26, pp. 37–60, 2009.

[56] P. Krsek et al., “Range image registration driven by a hier-archy of surface differential features,” in Proc. OAGM, 1998,pp. 175–183.

[57] M. Bokeloh et al., “Slippage features,” University ofTubingen, Tech. Rep., 2008.

[58] J. Digne et al., “The level set tree on meshes,” in Proc. 3DPVT,2010.

[59] Y. Liu, “Automatic 3D free form shape matching using thegraduated assignment algorithm,” Pat. Recogn., vol. 38, pp.1615–1631, 2005.

[60] ——, “Automatic registration of overlapping 3D point cloudsusing closest points,” IVC, vol. 24, pp. 762–781, 2006.

[61] ——, “Replicator dynamics in the iterative process for accu-rate range image matching,” IJCV, vol. 83, pp. 30–56, 2009.

[62] T. Jost and H. Hugli, “A multi-resolution ICP with heuristicclosest point search for fast and robust 3D registration ofrange images,” in Proc. 3DIM, 2003, p. 427.

[63] Y. Liu, “Constraints for closest point finding,” Pat. Recogn.Letters, vol. 29, pp. 841–851, 2008.

[64] J.M. Phillips et al., “Outlier robust ICP for minimizing frac-tional RMSD,” in Proc. 3DIM, 2007, pp. 427–434.

[65] S. Gold and A. Rangarajan, “A graduated assignment algo-rithm for graph matching,” IEEE TPAMI, vol. 18, pp. 377–388,1996.

[66] H. Pottmann et al., “Registration without ICP,” CVIU, vol. 95,pp. 54–71, 2004.

[67] A. Gruen and D. Akca, “Least squares 3D surface and curvematching,” J. of Photogrammetry and Remote Sensing, vol. 59,pp. 151–174, 2005.

[68] F. Tombari et al., “Unique signatures of histograms for localsurface description,” in Proc. ECCV, 2010, vol. 6313, pp. 356–369.

[69] T. Varady et al., “Reverse engineering of geometric models—an introduction,” CAD, vol. 29, pp. 255–268, 1997.

[70] H. Zhang et al., “Deformation-driven shape correspondence,”CGF, vol. 27, pp. 1431–1439, 2008.

[71] A. Tevs et al., “Isometric registration of ambiguous and partialdata,” in Proc. CVPR, 2009.

[72] D. Anguelov et al., “The correlated correspondence algorithmfor unsupervised registration of nonrigid surfaces,” Proc.NIPS, vol. 17, pp. 33–40, 2004.

[73] J. Sun et al., “A concise and provably informative multi-scalesignature based on heat diffusion,” in Proc. SGP, 2009, pp.1383–1392.

[74] M.M. Bronstein and I. Kokkinos, “Scale-invariant heat kernelsignatures for non-rigid shape recognition,” in Proc. CVPR,2010, pp. 1704–1711.

[75] D. Raviv et al., “Volumetric heat kernel signatures,” in Proc.3DOR, 2010, pp. 39–44.

[76] ——, “Affine-invariant diffusion geometry for the analysis ofdeformable 3d shapes,” in Proc. CVPR, 2011, pp. 2361–2367.

[77] M. Aubry et al., “The wave kernel signature: A quantummechanical approach to shape analysis,” in Proc. 4DMOD,2011.

[78] A.M. Bronstein, “Spectral descriptors for deformable shapes,”Tel Aviv University, Tech. Rep., 2011.

[79] M. Wand et al., “Reconstruction of deforming geometry fromtime-varying point clouds,” in Proc. SGP, 2007, pp. 49–58.

[80] L. Zhang et al., “Spacetime faces: high resolution capturefor modeling and animation,” in ACM SIGGRAPH, 2004, pp.548–558.

[81] F. Fouss et al., “Random-walk computation of similaritiesbetween nodes of a graph with application to collaborativerecommendation,” IEEE TKDE, vol. 19, pp. 355–369, 2007.

[82] Y. Lipman et al., “Biharmonic distance,” ACM TOG, vol. 29,pp. 1–11, 2010.

[83] J. Sun et al., “Fuzzy geodesics and consistent sparse corre-spondences for deformable shapes,” Proc. SGP, vol. 29, pp.1535–1544, 2010.

[84] D.T.P. Quynh et al., “An intrinsic algorithm for computinggeodesic distance fields on triangle meshes with holes,”Graph. Models, vol. 74, pp. 209–220, 2012.

[85] V. Surazhsky et al., “Fast exact and approximate geodesics onmeshes,” ACM TOG, vol. 24, pp. 553–560, 2005.

[86] K. Crane et al., “Geodesics in heat,” CORR, vol.abs/1204.6216, 2012.

[87] F. Memoli, “A spectral notion of Gromov-Wasserstein dis-tance and related methods,” Appl. Comp. Harm. Anal., vol. 30,pp. 363–401, 2011.

[88] T.K. Dey et al., “Persistent heat signature for pose-obliviousmatching of incomplete models,” CGF, vol. 29, pp. 1545–1554,2010.

[89] R. Litman et al., “Diffusion-geometric maximally stable com-ponent detection in deformable shapes,” Comp. & Graph.,vol. 35, pp. 549–560, 2011.

[90] ——, “Stable volumetric features in deformable shapes,”Comp. & Graph., vol. 36, pp. 569–576, 2012.

[91] E. Boyer et al., “SHREC 2011: robust feature detection anddescription benchmark,” in Proc. 3DOR, 2011.

[92] P. Huang et al., “Global temporal registration of multiple non-rigid surface sequences,” in Proc. CVPR, 2011, pp. 3473–3480.

[93] N. Gelfand et al., “Geometrically stable sampling for the ICPalgorithm,” in Proc. 3DIM, 2003, pp. 260–267.

[94] N.J. Mitra et al., “Registration of point cloud data from ageometric optimization perspective,” in Proc. SGP, 2004, pp.22–31.

[95] F. Boughorbel et al., “A new method for the registrationof three-dimensional point-sets: The Gaussian fields frame-work,” IVC, vol. 28, pp. 124–137, 2010.

[96] A. Albarelli et al., “A non-cooperative game for 3D objectrecognition in cluttered scenes,” in Intl. Conf. on 3DIMPVT,2011, pp. 252–259.

[97] M. Leordeanu and M. Hebert, “A spectral technique for cor-respondence problems using pairwise constraints,” in Proc.ICCV, vol. 2, 2005, pp. 1482–1489.

[98] A. Tevs et al., “Intrinsic shape matching by planned landmarksampling,” CGF, vol. 30, pp. 543–552, 2011.

[99] D. Mateus et al., “Articulated shape matching using Laplacianeigenfunctions and unsupervised point registration,” in Proc.CVPR, 2008, pp. 1–8.

[100] A. Lorusso et al., “A comparison of four algorithms forestimating 3-D rigid transformations,” in Proc. BMVC, 1995,pp. 237–246.

Page 20: IEEE TRANSACTIONS ON VISUALIZATION AND ...orca.cf.ac.uk/47333/1/ROSIN registration of 3d point...IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013

IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, VOL. 19, NO. 7, JULY 2013 20

[101] A. Torsello et al., “Grouping with asymmetric affinities: Agame-theoretic perspective,” Proc. CVPR, vol. 1, pp. 292–299,2006.

[102] L. Silva et al., “Precision range image registration using a ro-bust surface interpenetration measure and enhanced geneticalgorithms,” IEEE TPAMI, vol. 27, pp. 762–776, 2005.

[103] A.M. Bronstein et al., Numerical geometry of non-rigid shapes.Springer, 2008.

[104] A. Elad and R. Kimmel, “On bending invariant signaturesfor surfaces,” IEEE TPAMI, vol. 25, pp. 1285–1295, 2003.

[105] O. Duchenne et al., “A tensor-based algorithm for high-ordergraph matching,” in Proc. CVPR, june 2009, pp. 1980 –1987.

[106] Y. Zeng et al., “Dense non-rigid surface registration usinghigh-order graph matching,” in Proc. CVPR, 2010.

[107] “The Stanford 3D Scanning Repository.” [Online] :http://graphics.stanford.edu/data/3Dscanrep/

[108] “Ohio State University/SAMPL Database.” [Online] :http://sampl.eng.ohio-state.edu/sampl/data/

[109] “The Digital Michelangelo Project.” [Online] :http://graphics.stanford.edu/data/mich/

[110] X. Chen et al., “A benchmark for 3D mesh segmentation,”ACM SIGGRAPH, vol. 28, 2009.

[111] A.M. Bronstein et al., “Efficient computation of isometry-invariant distances between surfaces,” SIAM J. Sci. Comput.,vol. 28, pp. 1812–1836, 2006.

[112] P. Sand et al., “Continuous capture of skin deformation,” inACM SIGGRAPH, 2003, pp. 578–586.

[113] D. Vlasic et al., “Multilinear models for face synthesis,” inSIGGRAPH Sketches, 2004, p. 56.

[114] D. Giorgi et al., “SHREC: shape retrieval contest: Watertightmodels track.” [Online] : http://watertight.ge.imati.cnr.it

[115] R.S. Schestowitz et al., “Non-rigid registration assessmentwithout ground truth,” in Med. Image Under. and Analy., vol. 2,2006, pp. 151–155.

[116] D. Bradley et al., “High resolution passive facial performancecapture,” ACM TOG, vol. 29, pp. 1–10, 2010.

[117] H. Li et al., “Example-based facial rigging,” ACM SIGGRAPH,vol. 29, pp. 32:1–32:6, 2010.

[118] P. Buddharaju et al., “Physiology-based face recognition in thethermal infrared spectrum,” IEEE TPAMI, vol. 29, pp. 613–626, 2007.

[119] T.F. Cootes et al., “Computing accurate correspondencesacross groups of images,” IEEE TPAMI, vol. 32, pp. 1994–2005, 2010.

[120] K.A. Sidorov et al., “Efficient groupwise non-rigid registrationof textured surfaces,” in Proc. CVPR, 2011, pp. 2401–2408.

[121] J.h. Chen et al., “3D point correspondence by minimumdescription length in feature space,” in Proc. ECCV, 2010.

[122] T.M. Mitchell, Machine Learning. McGraw-Hill, 1997.[123] F. Wang et al., “Groupwise point pattern registration using

a novel CDF-based Jensen-Shannon Divergence,” in Proc.CVPR, vol. 1, 2006, pp. 1283–1288.

[124] S. Izadi et al., “Kinectfusion: real-time 3d reconstruction andinteraction using a moving depth camera,” in Proc. UIST,2011, pp. 559–568.

Gary K.L. Tam received his BEng (first-classhonors), MPhil and PhD degrees from HongKong University of Science and Technology,City University of Hong Kong, and DurhamUniversity, respectively. He is currently a re-search associate at Cardiff University, UnitedKingdom. His research interests include mul-timedia retrieval & visualization, digital geom-etry processing and interactive techniques.

Zhi-Quan Cheng received a BSc, MSc, andPhD degree from Computer School at Na-tional University of Defense Technology in2000, 2002 and 2008, respectively. He isa lecturer at the PDL Laboratory, ComputerSchool, National University of Defense Tech-nology (NUDT), and the leader of visual com-puting team at NUDT. His research interestsinclude computer graphics, and digital geom-etry processing.

Yu-Kun Lai received his bachelor’s degreeand PhD degree in computer science fromTsinghua University in 2003 and 2008, re-spectively. He is currently a lecturer of visualcomputing in the School of Computer Sci-ence, Cardiff University, Wales, UK. His re-search interests include computer graphics,geometry processing, computer-aided geo-metric design and computer vision.

Frank C. Langbein received a PhD in 2003from Cardiff University on “Beautification ofReverse Engineered Geometric Models” anda Diploma in Mathematics from Stuttgart Uni-versity in 1998. He is currently a lecturer incomputer science at Cardiff University, work-ing on modelling, control and machine learn-ing applied to geometry, quantum technolo-gies, chemical synthesis and perception. Heis a member of the American MathematicalSociety and the IEEE.

Yonghuai Liu is a senior lecturer at Aberys-twyth University since 2011. He is currentlyan associate editor and an editorial boardmember for a number of international jour-nals, including Pattern Recognition Letters.He has published three books and more than140 papers in international conference pro-ceedings and journals. His primary researchinterests lie in 3D computer vision.

Dave Marshall is a Professor in computerscience at Cardiff University. He received hisBSc and PhD ”Three Dimensional Inspec-tion of Manufactured Objects” from Univer-sity College, Cardiff in 1986 and 1990. Heresearches in human facial analysis, highdimensional subspace analysis, audio/videoimage processing, and data/sensor fusion.He has published over 130 papers and onebook in these research areas. He is a mem-ber of the IEEE and BMVA.

Ralph R. Martin obtained his PhD fromCambridge University. He now leads the Vi-sual Computing Research Group at CardiffUniversity, and is Director of Scientific Pro-grammes of the One Wales Research Insti-tute of Visual Computing. His publications in-clude over 200 papers and 12 books. He is aFellow of the Learned Society of Wales, andis on the editorial boards of several leadingjournals.

Xianfang Sun received his PhD degree incontrol theory and its applications from theInstitute of Automation, Chinese Academy ofSciences. He is a lecturer at Cardiff Univer-sity. His research interests include computervision and graphics, pattern recognition andartificial intelligence, and system identifica-tion and control. He is on the Editorial Boardof Acta Aeronautica et Astronautica Sinica.

Paul L. Rosin is a Professor at the School ofComputer Science & Informatics, Cardiff Uni-versity. Previous posts include Brunel Univer-sity, Joint Research Centre, Italy and CurtinUniversity of Technology, Australia. His re-search interests include computer vision, re-mote sensing, mesh processing, and theanalysis of shape in art and architecture.