Fast and Accurate Point Cloud Registration using Trees of Gaussian Mixtures

Ben Eckart Kihwan Kim Jan Kautz

NVIDIA Research

arXiv:1807.02587v1 [cs.CV] 6 Jul 2018

Abstract. Point cloud registration sits at the core of many important and challenging 3D perception problems including autonomous navigation, SLAM, object/scene recognition, and augmented reality. In this paper, we present a new registration algorithm that is able to achieve state-of-the-art speed and accuracy through its use of a hierarchical Gaussian Mixture Model (GMM) representation. Our method constructs a top-down multi-scale representation of point cloud data by recursively running many small-scale data likelihood segmentations in parallel on a GPU. We leverage the resulting representation using a novel PCA-based optimization criterion that adaptively finds the best scale to perform data association between spatial subsets of point cloud data. Compared to previous Iterative Closest Point and GMM-based techniques, our tree-based point association algorithm performs data association in logarithmic time while dynamically adjusting the level of detail to best match the complexity and spatial distribution characteristics of local scene geometry. In addition, unlike other GMM methods that restrict covariances to be isotropic, our new PCA-based optimization criterion closely approximates the true MLE solution even when fully anisotropic Gaussian covariances are used. Efficient data association, multi-scale adaptability, and a robust MLE approximation produce an algorithm that is up to an order of magnitude both faster and more accurate than the current state of the art on a wide variety of 3D datasets captured from LiDAR to structured light.

1 Introduction

Point cloud registration is the task of aligning two or more point clouds by es-
timating the relative transformation between them, and it has been an essential
part of many computer vision algorithms such as 3D object matching [1], local-
ization and mapping [2], dense 3D reconstruction of a scene [3], and object pose
estimation [4].
Recently, point set registration methods [5] have been gaining importance due to growing commercial interest in virtual and mixed reality [6], commercial robotics, and autonomous driving applications [7,8]. In most of these applications, massive amounts of 3D point cloud data (PCD) are directly captured from various active sensors (e.g., LiDAR and depth cameras) but at different times under different poses or local coordinate systems. The task of point cloud registration is then to find a common coordinate system, which is done by estimating some type of geometric similarity in the point data that can be recovered through optimization over a set of spatial transformations.
One of the oldest and most widely used registration algorithms, Iterative Closest Point (ICP) [9,10], is based on an iterative matching process where point proximity establishes candidate point pair sets. Given a set of point pairs, the rigid transformation that minimizes the sum of squared point pair distances can be calculated efficiently in closed form. ICP and its dozens of variants [11] often fail to produce correct results in many common but challenging scenarios, where noise, uneven point density, occlusions, or large pose displacements can leave a large proportion of points without valid matches.
As an alternative to traditional ICP-based approaches, much research has been done on the use of statistical models for registration, which in principle can provide better estimates for outlier rejection, convergence, and geometric matching [12,13,14]. In particular, many statistical methods have been designed around the Expectation Maximization (EM) algorithm [15], as it has been shown that EM generalizes the ICP algorithm under a few basic assumptions [16,17]. Many statistical registration techniques have explicitly utilized this paradigm to deliver better robustness and accuracy [18,19,17,20], but these algorithms tend to be much slower than ICP and often offer only marginal improvement in all but a few specific circumstances. As a result, ICP-based methods are still heavily used in practice for many real-world applications.
Our proposed method falls into the category of GMM-based statistical reg-
istration algorithms. We tackle the typical shortcomings of these methods, slow
speeds and lack of generality, by adopting an efficient hierarchical construction
for the creation of an adaptive multi-scale point matching process. Efficiency:
The search over multiple scales as a recursive tree-based search produces a highly
performant logarithmic-time algorithm that quickly and adaptively finds the
most appropriate level of geometric detail with which to match points. Gener-
ality: By using a data-driven point matching procedure over multiple scales,
our proposed algorithm can automatically adapt to many different types of
scenes, particularly with real-world data where widely varying sampling sparsity
and scene complexity are common. Finally, we introduce a novel Mahalanobis
distance approximation resembling ICP’s point-to-plane distance minimization
metric, which more faithfully approximates the true MLE solution under general
anisotropic covariances than previous methods.

2 Related Work

Our method builds on previous work in GMM-based methods for registration such as GMM-Reg [31,32], JRMPC [19], and MLMD [28], while also leveraging recent results using hierarchical GMMs for point cloud modeling [33]. By adopting a GMM-based paradigm, we gain robustness in situations of large pose displacement, optimal solutions in the form of maximum likelihood estimates, and an ability to more easily leverage point-level parallelism on GPUs. By augmenting the GMM into a hierarchy, we can efficiently compress empty space, achieve logarithmic-time matching, and perform robust multi-scale data analysis.

Method           Mult. Link  Anisotropic  Multi-Scale  Data Trans.     Assoc. Complex.  Opt. Complex.
ICP [9]          -           -            -            -               N^2              N
SoftAssign [12]  X           -            X†           -               N^2              N^2
EM-ICP [17]      X           -            X            kd-tree         N log N          N^2
LM-ICP [21]      -           -            -            grid approx.    N                N
KC [22]          X           -            -            grid approx.    N                V
TrICP [23]       -           -            -            voxels          N^2/V            N
FICP [24]        -           -            -            kd-tree         N log N          N
G-ICP [16]       -           X            -            kd-tree         N log N          N
CPD [14]         X           -            -            FGT             N                N^2
ECMPR [20]       X           X            -            FGT             N                N
GMMReg [25]      X           -            X†           FGT             N                N^2
NDT-P2D [26]     X           X            X            voxels+kd-tree  N log V          N
NDT-D2D [26]     X           X            X            voxels+kd-tree  V log V          V
REM-Seg [27]     X           X            X†           GMM             NJ               N
MLMD [28]        X           X            -            GMM             NJ               J
SVR [29]         X           -            X†           GMM‡            N^2 ~ N^3        J
JRMPC [30]       X           -            -            GMM             NJ               J
Proposed         X           X            X            GMM-Tree        N log J          log J ~ J

† Implicitly multi-scale via annealing. ‡ Conversion to GMM via SVM.

Table 1. A Comparison of Registration Methods. Multiply Linked: many-to-one or many-to-many correspondences. Anisotropic: general shape alignment using unrestricted covariance structures. Multi-Scale: registration at multiple levels of granularity. Data Transform: underlying data structure or transform. Association Complexity: complexity of the data association problem over all N points (E Step in the case of EM-based methods). Optimization Complexity: size of the optimization problem (M Step in the case of EM-based methods). Assumes both point clouds have size N, with V voxels/grid points and J mixture components.

The earliest statistical methods placed an isotropic covariance around every
point in the first set of points and then registered the second set of points to it
under an MLE framework (MPM [18], EM-ICP [17], CPD [14,34]). More mod-
ern statistical approaches utilize a generative model framework, where a GMM
is usually constructed from the points explicitly and registration is solved in an
MLE sense using an EM or ECM [35] algorithm (REM-Seg [27], ECMPR [20],
JRMPC [19], MLMD [28]), though some utilize a max correlation or L2 distance
approach (Kernel Correlation [22], GMM-Reg [31,32], SVR[29], NDT-D2D[36]).
Since a statistical framework for point cloud registration tends to be more heavy-
weight than ICP, techniques such as decimation (EM-ICP [17]), voxelization
(NDT methods [36,26]), or Support Vector Machines (SVR [29]) have been used
to create smaller or more efficient models, while others have relied on compu-
tational tricks such as the Fast Gauss Transform (CPD [14], ECMPR [20]), or
have devised ways to exploit point-level parallelism and GPU-computation for
increased computational tractability and speed (MLMD [28], parallelized EM-
ICP [13]).

Fig. 1. Multi-Scale Representation using a Hierarchy of Gaussian Mixtures: The top row shows identical geometries (black lines) and associated points (blue circles), which are represented by different levels of Gaussian models (green contour for 1σ). (a) (Top) Ideal normals (red arrows) on the surfaces. (b) Too coarse (only two Gaussians in Level 2): poor segmentation leads to incorrect normals, which will degrade accuracy when registering points to model. (c) Too fine (using the finest level of Gaussian models): over-segmentation leads to erroneous normals as sample noise overtakes real facet geometry. (d) Adaptive multi-scale (mixture of level 3 and level 4 models): point-to-model association can be much more robust when fidelity adaptively changes according to data distribution so that facets can be well-modeled given differing spatial frequencies and sampling densities.

In contrast to these statistical model-based approaches, modern robust variants of point-to-plane ICP (e.g. Trimmed ICP [37], Fractional ICP [24]) are often much faster and sometimes perform nearly as well, especially under real-world conditions [38]. See Table 1 for a detailed comparison of key registration algorithms utilizing the ICP and GMM paradigms. Our proposed method offers favorable complexity over both classes of algorithms due to its novel use of a GMM-Tree structure, without needing to resort to discretization strategies like the NDT-based methods.

3 Registration as Expectation Maximization

The Expectation Maximization (EM) algorithm forms the theoretical foundation for most modern statistical approaches to registration and also generalizes ICP under certain basic assumptions. EM is commonly employed for MLE optimization in the case where directly maximizing the data likelihood for the sought-after variable is intractable, but maximizing the expected joint data likelihood conditioned on a set of latent variables is tractable. For the registration case, the sought-after variable is the transformation T between point clouds and the latent variables are the point-model associations.
The problem is set up as follows: Given point clouds Z1 and Z2, we would like to maximize the data probability of Z2 under a set of transformations T with respect to a probability model ΘZ1 derived from the first point cloud Z1.

\[ \hat{T} = \operatorname*{argmax}_{T} \; p\big(T(Z_2) \mid \hat{\Theta}_{Z_1}\big) \tag{1} \]

That is, the most likely estimate of the transformation T̂ is the estimate that maximizes the probability that the samples of the transformed point cloud T(Z2) came from some probabilistic representation of spatial likelihood (parameterized by Θ̂) derived from the spatial distribution of the first point cloud Z1. The most common form for parametrizing this probability distribution is through a Gaussian Mixture Model (GMM), whose data probability is defined as a convex combination of J Gaussians weighted by the J-component vector π,

\[ p(z \mid \Theta_{Z_1}) = \sum_{j=1}^{J} \pi_j \, \mathcal{N}(z \mid \Theta_j) \tag{2} \]

The derivation of the probability model ΘZ1 may be as simple as statically setting an isotropic covariance around each point in Z1 (e.g. EM-ICP [17]), or as complicated as framing the search for ΘZ1 as a completely separate optimization problem (e.g. SVR [29], MLMD [28]). Regardless of how the model is constructed, however, EM provides an iterative procedure to solve for T through the introduction of a set of latent correspondence variables C = {cij} that dictate how points zi ∈ Z2 probabilistically associate to the J subcomponents Θj of the model ΘZ1. Intuitively, we can view EM as a statistical generalization of ICP: The E Step estimates data associations, replacing ICP’s matching step, while the M Step maximizes the expected likelihood conditioned on these data associations, replacing ICP’s distance minimization step over matched pairs.
In the E Step, we use Bayes’ rule to calculate expectations over the correspondences. For a particular point zi, its expected correspondence to Θj (E[cij]) can be calculated as follows,

\[ E[c_{ij}] = \frac{\pi_j \, \mathcal{N}(z_i \mid \Theta_j)}{\sum_{k=1}^{J} \pi_k \, \mathcal{N}(z_i \mid \Theta_k)} \tag{3} \]

Generally speaking, larger model sizes (larger J) produce more accurate registration results since larger models have more representational fidelity. However, large models produce very slow registration algorithms: Given N points in Z2, Equation 3 must be calculated N × J times for each subsequent M Step. For methods that utilize models of size J ≈ O(N) (e.g. EM-ICP [17], CPD [14], GMMReg [31]), this causes a data association complexity of O(N^2) and thus these algorithms have problems scaling beyond small point cloud sizes.
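To make the N × J cost of Equation 3 concrete, the following is a minimal brute-force sketch in Python. It is not the paper's GPU implementation: the function names are hypothetical, and it assumes diagonal covariances purely to keep the density evaluation short, whereas the method described here uses full anisotropic covariances.

```python
import math

def gaussian_pdf_diag(z, mean, var):
    """Density of a Gaussian with a diagonal covariance (simplifying assumption)."""
    log_p = 0.0
    for zd, md, vd in zip(z, mean, var):
        log_p += -0.5 * math.log(2.0 * math.pi * vd) - 0.5 * (zd - md) ** 2 / vd
    return math.exp(log_p)

def e_step_naive(points, weights, means, variances):
    """Return gamma[i][j] = E[c_ij] from Eq. 3 using N x J density evaluations."""
    gammas = []
    for z in points:
        # Numerator of Eq. 3 for every component j.
        joint = [w * gaussian_pdf_diag(z, m, v)
                 for w, m, v in zip(weights, means, variances)]
        total = sum(joint)  # Denominator of Eq. 3.
        gammas.append([p / total for p in joint])
    return gammas

# Toy data: two points, two well-separated components.
points = [(0.1, 0.0, 0.0), (0.9, 1.0, 1.1)]
weights = [0.5, 0.5]
means = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0)]
variances = [(0.1, 0.1, 0.1), (0.1, 0.1, 0.1)]
gamma = e_step_naive(points, weights, means, variances)
```

Each row of `gamma` sums to one, and every point touches every component, which is exactly the linear-in-J association cost that the hierarchical search is meant to avoid.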
To combat this scaling problem, our approach builds from recent advances in fast statistical point cloud modeling via hierarchical generative models by Eckart et al. [33]. In this approach, point cloud data is modeled via a GMM-Tree, which is built in a top-down recursive fashion from small-sized Gaussian Mixtures. Their efficient GPU-based approach can produce high-fidelity GMM-Trees in real-time, but given that they were originally designed to optimize reconstructive fidelity and for dynamic occupancy map generation, it is not obvious how to efficiently adapt these models for use in a registration setting. That is, we must derive a way to associate new data to the model and then use the associations to drive an optimization over T. As such, we can use their model construction algorithm in order to construct ΘZ1 from Z1 (see [33] for details), but we must derive a separate and new EM algorithm to use these GMM-Tree models for registration.

4 Hierarchical Gaussian Mixture Mahalanobis Estimation

In this section, we review our proposed approach for hierarchical GMM-based registration under a new EM framework. In Section 4.1 we discuss our new E Step for probabilistic data association that utilizes the GMM-Tree representation for point clouds, and in Section 4.2 we introduce a new optimization criterion to approximate the MLE T for rigid transformations.

4.1 E Step: Adaptive Tree Search

Our proposed E Step uses a recursive search procedure to perform probabilistic data association in logarithmic time. We also introduce an early stopping heuristic in order to select the most appropriate scale at which to associate data to the hierarchical model.
The GMM-Tree representation from [33] forms a top-down hierarchy of 8-
component GMM nodes, with each individual Gaussian component in a node
having its own 8-component GMM child. Thus, a particular node in the GMM-
Tree functions in two ways: first, as a probabilistic partition of the data and
second, as a statistical description of the data within a partition. We exploit both
of these properties in our proposed E Step by using the partitioning information
to produce an efficient search algorithm and by using the local data distributions
as a scale selection heuristic.
Logarithmic Search. Each level in the GMM-Tree forms a statistical segmentation at finer levels of granularity and detail. Crucially, the expectation of a point zi to a particular Gaussian component Θj is exactly the sum of the expectations of that point to its child GMM. Thus, if we query a parent node’s point-model expectation and it falls under a threshold, we can effectively prune away all its children’s expectations, thus avoiding calculating all N × J probabilistic associations. Refer to Algorithm 1 for details. In our implementation, we only traverse down the maximum likelihood path at each step. By utilizing the hierarchy in this way, we can recursively search through the tree in logarithmic time (O(log J)) to calculate a point’s expectation. This is opposed to previous registration algorithms using traditional GMMs, where a linear search must be performed over all mixture components (O(J)) in order to match data to the model.
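As a rough illustration of why following only the maximum likelihood path is logarithmic in J, the sketch below builds a toy one-dimensional hierarchy and counts density evaluations during a greedy descent. Everything here is an assumption made for illustration (the `Node` class, the interval-splitting construction, the 1-D Gaussians); it is not the GMM-Tree construction of [33], only the search pattern described above.

```python
import math
from dataclasses import dataclass, field
from typing import List

@dataclass
class Node:
    weight: float
    mean: float
    var: float
    children: List["Node"] = field(default_factory=list)

def pdf(z, node):
    """1-D Gaussian density of a node evaluated at z."""
    return math.exp(-0.5 * (z - node.mean) ** 2 / node.var) / math.sqrt(2.0 * math.pi * node.var)

def build(lo, hi, depth, branching):
    """Toy hierarchy: every node splits its interval into equal sub-intervals."""
    node = Node(weight=1.0 / branching, mean=0.5 * (lo + hi), var=((hi - lo) / 4.0) ** 2)
    if depth > 0:
        step = (hi - lo) / branching
        node.children = [build(lo + k * step, lo + (k + 1) * step, depth - 1, branching)
                         for k in range(branching)]
    return node

def greedy_search(root, z):
    """Follow the most likely child at each level; return (leaf, evaluations)."""
    node, evaluations = root, 0
    while node.children:
        scores = [c.weight * pdf(z, c) for c in node.children]
        evaluations += len(scores)
        node = node.children[scores.index(max(scores))]
    return node, evaluations

def count_leaves(node):
    return 1 if not node.children else sum(count_leaves(c) for c in node.children)

# Three levels of 8-way splits give 512 leaf components.
root = build(0.0, 1.0, depth=3, branching=8)
leaf, evaluations = greedy_search(root, 0.3)
n_leaves = count_leaves(root)
```

With three levels of 8-component nodes, the descent evaluates 8 densities per level (24 in total) instead of all 512 leaf components.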

Algorithm 1 E Step for Registration
 1: procedure E_step_adaptive(Z2, ΘZ1)
 2:   for zi ∈ Z2 in parallel do
 3:     searchID ← −1, γ ← {0, 0, 0, 0, 0, 0, 0, 0}
 4:     for l = 0 to L − 1 do                  // L is max tree level
 5:       G ← Children(searchID)               // Children(−1) is defined as {0..7}
 6:       for j ∈ G do                         // for each child in subtree
 7:         γ[j] ∝ πj N(zi | Θj)               // calculate data-model expectation
 8:       end for
 9:       searchID ← argmax_{j∈G} γ[j]         // update with most likely association
10:       if Complexity(Θ[searchID]) ≤ λc then
11:         break                              // early stopping heuristic to prune clusters too simple
12:       end if
13:     end for
14:     // Accumulate 0th, 1st, 2nd moments {Mj0, Mj1, Mj2} for next M Step
15:     {Mj0, Mj1, Mj2} ← Accumulate(Mj0, Mj1, Mj2, γ[searchID], zi)
16:   end for
17:   return {Mj0, Mj1, Mj2}
18: end procedure

Multiscale Adaptivity. Real-world point clouds often exhibit large spatial discrepancies in sampling sparsity and geometric complexity, and so different parts of the scene may benefit from being represented at different scales when performing point-scene association. Refer to Figure 1 for an overview of this concept. Under a single scale, the point cloud modeling and matching process might succumb to noise or sampling inadequacies if the given modeling fidelity is not appropriate to the local data distribution.
To take advantage of the GMM-Tree multiscale representation and prevent overfitting, we check the geometric complexity of the current mixture component and stop the search early if it falls at or below a threshold. This complexity check acts as a heuristic for proper scale selection. We implement our complexity function (Complexity(·) in Algorithm 1, L10) as $\lambda_3 / (\lambda_1 + \lambda_2 + \lambda_3)$ for each covariance, where λ1 ≥ λ2 ≥ λ3 are its associated eigenvalues. We experimentally set our adaptive threshold to λc = 0.01 for all experiments. This means we terminate the search at a particular scale if the current cluster associated to the point becomes too planar: when 1% or less of its variance occurs along its normal direction. Experimentally, we have found that if we recurse further, we will likely start to chase noise.
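The stopping test can be sketched in a few lines of Python. The function names and the two example eigenvalue triples are hypothetical; the ratio and the 0.01 threshold are the ones stated above, and the eigenvalues are assumed to have been computed already from the component's 3x3 covariance.

```python
LAMBDA_C = 0.01  # adaptive threshold reported in the text

def complexity(eigenvalues):
    """Fraction of total variance along the smallest-variance (normal) direction."""
    l1, l2, l3 = sorted(eigenvalues, reverse=True)
    return l3 / (l1 + l2 + l3)

def should_stop(eigenvalues, threshold=LAMBDA_C):
    """True when the cluster is planar enough that recursing further is not worthwhile."""
    return complexity(eigenvalues) <= threshold

planar = (1.0, 0.8, 0.001)  # nearly flat cluster: stop descending
blob = (1.0, 0.9, 0.7)      # volumetric cluster: keep descending
```

Here `should_stop(planar)` holds because well under 1% of the variance lies along the normal, while `should_stop(blob)` does not.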
Figure 2 shows a graphical depiction of what our adaptive threshold looks like in practice. The Gaussian mixture components break down the point cloud data at a static tree level of 2 (J = 64) and 3 (J = 512), as compared to an adaptive model that is split into different recursion levels according to a complexity threshold λc = 0.01. The points are color coded according to their expected cluster ownership. Note that the adaptive model has components from both levels of the GMM hierarchy, according to how smooth or complex the facet geometry is. The ability to adapt to changing levels of complexity allows our M Step to always use a robustly modeled piece of geometry (cf. Figure 1).

(a) GMM-Tree L2 (64 components) (b) Adaptive L4 (λc = 0.01) (c) GMM-Tree L3 (512 components)

Fig. 2. Scale Selection using a GMM-Tree. To show qualitatively how scale selection works, we first build a model over a crop (couch, plant, and floor) of the Stanford Scene Lounge dataset [39]. We then associate random colors to each mixture component and color each point according to its data-model expectation. (a) shows this coloring given a static recursion level of 2 in the GMM-Tree, while (c) shows this coloring for a static recursion level of 3. We contrast this with (b), which shows our adaptively scale-selected model containing components at varying levels of recursion depending on the local properties of the mixture components. The scale selection process provides our Mahalanobis estimator (Sec. 4.2) robust component normals, preventing the use of over-fitted or under-fitted mixture components and resulting in a more accurate registration result.

4.2 M Step: Mahalanobis Estimation

In this section, we will derive a new M Step for finding the optimal transformation T between a point set Z2 and an arbitrary GMM Θ̂Z1 representing point set Z1.
First, given N points zi and J clusters Θj ∈ Θ̂Z1, we introduce an N × J set of point-cluster correspondences C = {cij}, so that the full joint probability becomes

\[ \ln p(T(Z), C \mid \Theta) = \sum_{i=1}^{N} \sum_{j=1}^{J} c_{ij} \left\{ \ln \pi_j + \ln \mathcal{N}(T(z_i) \mid \Theta_j) \right\} \tag{4} \]

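We iterate between E and M Steps. On the E Step, we calculate $\gamma_{ij} \overset{\text{def}}{=} E[c_{ij}]$ under the current posterior. On the M Step, we maximize the expected data log likelihood with respect to T while keeping all γij fixed,

\begin{align}
\hat{T} &= \operatorname*{argmax}_{T} \; E_{p(C \mid T(Z), \Theta)}\left[ \ln p(T(Z), C \mid \Theta) \right] \tag{5} \\
        &= \operatorname*{argmax}_{T} \; \sum_{ij} \gamma_{ij} \left\{ \ln \pi_j + \ln \mathcal{N}(T(z_i) \mid \Theta_j) \right\} \tag{6} \\
        &= \operatorname*{argmin}_{T} \; \sum_{ij} \gamma_{ij} \, (T(z_i) - \mu_j)^T \Sigma_j^{-1} (T(z_i) - \mu_j) \tag{7}
\end{align}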

Thus, the most likely transformation T between the point sets is the one that minimizes the weighted sum of squared Mahalanobis distances between points of Z2 and individual clusters of ΘZ1, with weights determined by calculating expected correspondences given the current best guess for T̂.
As shown mathematically in previous work [20,19,17,28], if we restrict T solely to the set of all rigid transformations ($T = \{R \in SO(3),\, t \in \mathbb{R}^{3 \times 1}\}$) we can further reduce the double sum over both points and clusters into a single sum over clusters. This leaves us with a simplified MLE optimization criterion,

\[ \hat{T} = \operatorname*{argmin}_{T} \; \sum_{j} \pi_j^* \, (T(\mu_j^*) - \mu_j)^T \Sigma_j^{-1} (T(\mu_j^*) - \mu_j) \tag{8} \]

where $\pi_j^* = \frac{\sum_i \gamma_{ij}}{N}$ and $\mu_j^* = \frac{\sum_i \gamma_{ij} z_i}{\sum_i \gamma_{ij}}$.
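We can further relate Equation 8 to the weighted moments calculated by the E Step (see Algorithm 1) as follows,

\[ \hat{T} = \operatorname*{argmin}_{T} \; \sum_{j} M_j^0 \left( T\!\left(\frac{M_j^1}{M_j^0}\right) - \mu_j \right)^{T} \Sigma_j^{-1} \left( T\!\left(\frac{M_j^1}{M_j^0}\right) - \mu_j \right) \tag{9} \]

where $M_j^0 = \sum_i \gamma_{ij}$ and $M_j^1 = \sum_i \gamma_{ij} z_i$.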
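The relationship between the accumulated moments and the quantities π*j and µ*j can be sketched as follows. This is a sequential toy version with hypothetical function names; the paper accumulates these moments in parallel on the GPU, and the small dense `gammas` table here is only for illustration.

```python
def accumulate_moments(points, gammas, num_components):
    """Accumulate M0[j] = sum_i gamma_ij and M1[j] = sum_i gamma_ij * z_i."""
    m0 = [0.0] * num_components
    m1 = [[0.0, 0.0, 0.0] for _ in range(num_components)]
    for z, gamma_i in zip(points, gammas):
        for j, g in enumerate(gamma_i):
            m0[j] += g
            for d in range(3):
                m1[j][d] += g * z[d]
    return m0, m1

def weighted_statistics(m0, m1, num_points):
    """Recover pi*_j = M0_j / N and mu*_j = M1_j / M0_j (None if a cluster got no support)."""
    pi_star = [m / num_points for m in m0]
    mu_star = [[c / m for c in row] if m > 0.0 else None for m, row in zip(m0, m1)]
    return pi_star, mu_star

points = [(0.0, 0.0, 0.0), (2.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
gammas = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]
m0, m1 = accumulate_moments(points, gammas, num_components=2)
pi_star, mu_star = weighted_statistics(m0, m1, num_points=len(points))
```

Because only the per-cluster sums are needed, the M Step never has to revisit individual points once the E Step has finished.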
One can interpret the Mahalanobis distance as a generalization of point-to-
point distance where the coordinate system has undergone some affine trans-
formation. In the case of GMM-based registration, each affine transformation is
determined by the covariance, or shape, of the cluster to which points are being
registered. For example, clusters that are mostly planar in shape (two similar
eigenvalues and one near zero) will tend to aggressively pull points toward them
along their normal directions while permitting free movement in the plane. This
observation should match one’s intuition: given that we have chosen a proba-
bilistic model that accurately estimates local geometry, an MLE framework will
utilize this information to pull like geometry together as a type of probabilistic
shape matching.
By using fully anisotropic covariances, arbitrarily oriented point-to-geometry relations can be modeled. Optimization of Eq. 8 should therefore produce a highly accurate transformation estimate. Previous algorithms in the literature, however, have yet to fully leverage this general MLE construction. Simplifications are made either by 1) placing a priori restrictions on the complexity of the Gaussian covariance structure (e.g. isotropic only [19] or a single global bandwidth term [17]), or by 2) using approximations to the MLE criterion that remove or degrade this information [28]. The reasons behind both model simplification and MLE approximation are the same: Eq. 8 has no closed form solution. However, we will show how simply reinterpreting the Mahalanobis distance calculation can lead to a highly accurate, novel method for registration.
We first rewrite the inner Mahalanobis distance inside the MLE criterion of Eq. 8 by decomposing each covariance Σj into its associated eigenvalues λ and eigenvectors n using PCA, thereby producing the following equivalence,

\[ \left\| T(\mu_j^*) - \mu_j \right\|^2_{\Sigma_j} = \sum_{l=1}^{3} \frac{1}{\lambda_l} \left( n_l^T \left( T(\mu_j^*) - \mu_j \right) \right)^2 \tag{10} \]

Thus, we can reinterpret each cluster’s Mahalanobis distance term inside the
MLE criterion as a weighted sum of three separate point-to-plane distances.
The weights are inversely determined by the eigenvalues, with their associated
eigenvectors constituting each plane’s normal vector. Going back to the example
of a nearly planar Gaussian, its covariance will have two large eigenvalues and
one near-zero eigenvalue, with the property that the eigenvectors associated with
the larger eigenvalues will lie in the plane and the eigenvector associated with
the smallest eigenvalue will point in the direction of its normal vector. Since the
weights are inversely related to the eigenvalues, we can easily see that the MLE
criterion will mostly disregard any point-to-µj distance inside its plane (that
is, along the two dominant PCA axes) and instead disproportionately focus on
minimizing out-of-plane distances by pulling nearby points along the normal to
the plane.
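The equivalence in Eq. 10 can be checked numerically. The sketch below is illustrative only: it builds a covariance from a hand-picked orthonormal basis and eigenvalues (both assumptions chosen for the example), inverts it directly, and compares the squared Mahalanobis distance with the weighted sum of three squared point-to-plane distances.

```python
import math

def covariance_from_eig(eigvals, eigvecs):
    """Sigma = sum_l lambda_l * n_l n_l^T as a 3x3 nested list."""
    return [[sum(lam * n[r] * n[c] for lam, n in zip(eigvals, eigvecs))
             for c in range(3)] for r in range(3)]

def inverse_3x3(m):
    """Inverse of a 3x3 matrix via the adjugate (cofactor) formula."""
    a, b, c = m[0]
    d, e, f = m[1]
    g, h, i = m[2]
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adj = [[e * i - f * h, c * h - b * i, b * f - c * e],
           [f * g - d * i, a * i - c * g, c * d - a * f],
           [d * h - e * g, b * g - a * h, a * e - b * d]]
    return [[v / det for v in row] for row in adj]

def mahalanobis_sq(diff, sigma):
    """Left-hand side of Eq. 10: diff^T Sigma^{-1} diff."""
    inv = inverse_3x3(sigma)
    return sum(diff[r] * inv[r][c] * diff[c] for r in range(3) for c in range(3))

def point_to_plane_sum(diff, eigvals, eigvecs):
    """Right-hand side of Eq. 10: sum_l (n_l . diff)^2 / lambda_l."""
    return sum(sum(nk * dk for nk, dk in zip(n, diff)) ** 2 / lam
               for lam, n in zip(eigvals, eigvecs))

# Orthonormal eigenvectors from two rotation angles; near-planar eigenvalues.
ca, sa, cb, sb = math.cos(0.4), math.sin(0.4), math.cos(0.7), math.sin(0.7)
eigvecs = [(ca, sa, 0.0), (-sa * cb, ca * cb, sb), (sa * sb, -ca * sb, cb)]
eigvals = (1.0, 0.5, 0.01)
diff = (0.3, -0.2, 0.1)  # stands in for T(mu*_j) - mu_j

lhs = mahalanobis_sq(diff, covariance_from_eig(eigvals, eigvecs))
rhs = point_to_plane_sum(diff, eigvals, eigvecs)
```

The two values agree to floating-point precision, and the term belonging to the smallest eigenvalue dominates the sum, which is the out-of-plane emphasis described above.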
By plugging this equivalence back into Eq. 8, we arrive at the following MLE criterion,

\[ \hat{T} = \operatorname*{argmin}_{T} \; \sum_{j=1}^{J} \sum_{l=1}^{3} \frac{\pi_j^*}{\lambda_{jl}} \left( n_{jl}^T \left( T(\mu_j^*) - \mu_j \right) \right)^2 \tag{11} \]

where the set of $n_{jl}$, l = 1..3, represents the three eigenvectors of the jth (anisotropic) Gaussian covariance, and $\lambda_{jl}$ the associated eigenvalues.
We have transformed the optimization from the minimization of a weighted sum of J squared Mahalanobis distances to an equivalent minimization of a weighted sum of 3J squared point-to-plane distances. In doing so, we arrive at a form that can be leveraged by any number of minimization techniques previously developed for point-to-plane ICP [10]. Note that unlike traditional point-to-plane methods, which usually involve the computationally difficult task of finding planar approximations over local neighborhoods at every point and sometimes also for multiple scales [40,41], the normals in Eq. 11 are found through a very small number of 3x3 eigendecompositions (typically J ≤ 1000 for even complex geometric models) over the model covariances, with appropriate scales chosen through our proposed recursive search over the covariances in the GMM-Tree (Sec 4.1).
We solve Equation 11 using the linear least squares technique described by Low for point-to-plane ICP optimization [42], which we adapt into a weighted form (a derivation is provided in our supplementary material). The only approximation required is a linearization of R using the small-angle assumption. In practice, this is a fair assumption to use since GMM-based registration methods are local and thus diverge for large pose displacements anyway.
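The supplementary derivation is not reproduced here. As a hedged sketch of what a weighted small-angle linearization of the point-to-plane criterion above can look like, the following uses the standard approximation R ≈ I + [r]× with r = (α, β, γ)ᵀ. The stacked symbols x, a_jl, b_jl, and w_jl are our own notation, introduced only for this illustration.

```latex
% Small-angle model of the rigid transformation
T(\mu_j^*) = R\,\mu_j^* + t \;\approx\; \mu_j^* + r \times \mu_j^* + t

% Each point-to-plane residual becomes linear in x = (r^T, t^T)^T,
% using the identity n \cdot (r \times \mu) = r \cdot (\mu \times n)
n_{jl}^T \big( T(\mu_j^*) - \mu_j \big) \;\approx\;
  (\mu_j^* \times n_{jl})^T r + n_{jl}^T t - n_{jl}^T (\mu_j - \mu_j^*)

% Weighted linear least squares with w_{jl} = \pi_j^* / \lambda_{jl}
\hat{x} = \operatorname*{argmin}_{x} \sum_{j=1}^{J} \sum_{l=1}^{3}
  w_{jl} \big( a_{jl}^T x - b_{jl} \big)^2,
\qquad
a_{jl} = \begin{pmatrix} \mu_j^* \times n_{jl} \\ n_{jl} \end{pmatrix},
\qquad
b_{jl} = n_{jl}^T (\mu_j - \mu_j^*)

% Normal equations: a 6 x 6 linear system in the six pose parameters
\Big( \sum_{j,l} w_{jl}\, a_{jl} a_{jl}^T \Big) \hat{x} = \sum_{j,l} w_{jl}\, a_{jl} b_{jl}
```

Under this sketch, each M Step reduces to assembling and solving one 6 × 6 system from at most 3J weighted rows, independent of the number of points N.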

5 Speed vs Accuracy
For every registration algorithm, there is an inherent trade-off between accuracy
and speed. To explore how different registration algorithms perform under var-
ious accuracy/speed trade-offs, we have designed a synthetic experiment using
the Stanford Bunny. We take 100 random 6DoF transformations of the bunny
and then run each algorithm over the same group of random point subsets of
increasing cardinality. Our method of obtaining a random transformation is to
sample each axis of rotation uniformly from [-15,15] degrees and each translation
uniformly from [-0.05, 0.05] (roughly half the extent of the bunny). We can then
plot speed vs accuracy as a scatter plot in order to see how changing the point
cloud size (a proxy for model complexity) affects the speed vs accuracy tradeoff.
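The sampling procedure described above can be sketched as follows. The function names are hypothetical, and the order in which the three per-axis rotations are composed (Rz·Ry·Rx here) is an assumption, since the text does not specify it.

```python
import math
import random

def random_transform(rng, max_angle_deg=15.0, max_translation=0.05):
    """Sample per-axis angles in [-15, 15] degrees and translations in [-0.05, 0.05]."""
    ax, ay, az = (math.radians(rng.uniform(-max_angle_deg, max_angle_deg))
                  for _ in range(3))
    t = [rng.uniform(-max_translation, max_translation) for _ in range(3)]
    cx, sx = math.cos(ax), math.sin(ax)
    cy, sy = math.cos(ay), math.sin(ay)
    cz, sz = math.cos(az), math.sin(az)
    # R = Rz * Ry * Rx (composition order is an assumption of this sketch).
    rot = [[cz * cy, cz * sy * sx - sz * cx, cz * sy * cx + sz * sx],
           [sz * cy, sz * sy * sx + cz * cx, sz * sy * cx - cz * sx],
           [-sy, cy * sx, cy * cx]]
    return rot, t

def apply_transform(rot, t, p):
    """Apply the rigid transformation p -> R p + t to a single 3-D point."""
    return [sum(rot[r][c] * p[c] for c in range(3)) + t[r] for r in range(3)]

rng = random.Random(0)  # fixed seed so the 100 test transformations are repeatable
transforms = [random_transform(rng) for _ in range(100)]
moved = apply_transform(*transforms[0], (0.01, 0.02, 0.03))
```

Fixing the seed lets every algorithm under comparison be run against the same 100 transformations.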
The algorithms and code used in the following experiments were either pro-
vided directly by the authors (JRMPC, ECMPR, NDT-D2D, NDT-P2D, SVR,
GMMReg), taken from popular open source libraries (libpointmatcher for TrICP-
pt2pt, TrICP-pt2pl, FICP), or are open source re-implementations of the original
algorithms with various performance optimizations (EM-ICP-GPU, SoftAssign-
GPU, ICP-OpenMP, CPD-C++). Links to the sources can be found in the sup-
plementary material. Parameters were set for all algorithms according to what
was recommended by the authors and/or by the software. All our experiments
were run on Intel Core i7-5920K and NVIDIA Titan X.
In order to test how each design decision affects the performance of the
proposed algorithm, we test against three variants:
1. Adaptive Ln: The full algorithm proposed in this paper. Adaptive multi-
scale data association using a GMM-Tree that was constructed up to a max
recursion level of n.
2. GMM-Tree Ln: Here we use the same GMM-Tree representation for loga-
rithmic time data association, but without multi-scale adaptivity (λc = 0).
The tree is constructed up to a max recursion level of n. By comparing
GMM-Tree to Adaptive, we can see the benefits of stopping our recursive
search according to data complexity.
3. GMM J=n: This variant forgoes a GMM-Tree representation and uses a simple, fixed-complexity, single-level GMM with n mixture components. Its data representation is therefore similar to that of other fixed-complexity GMM-based registration approaches (e.g. [17,31,28,19]). Thus, neither recursive (logarithmic) data association nor adaptive complexity can be used. However, this variant is still well-optimized for the GPU and can still use our new PCA-based MLE optimization. Comparing this approach (GMM) to the tree-based representations (GMM-Tree and Adaptive) shows how much the tree-based data representation affects registration performance over just using our new MLE optimization technique.
Figure 3(a) shows each algorithm’s speed vs accuracy trade-off by plotting registration error vs time elapsed. The lower left corner is best (both fast and accurate). One can quickly see how different classes of algorithms clearly dominate each other on the speed/accuracy continuum. For additional clarity, Figure 3(b) explicitly plots the time scaling of each registration method as a function of point cloud size. For both timing and accuracy, one can see that, roughly speaking, our adaptive tree formulation performs the best, followed by our non-adaptive tree formulation, followed by our non-adaptive non-tree formulation, then ICP-based variants, and then finally previous GMM-based variants (black > cyan > red > blue > green).

(a) Accuracy vs Speed (b) Speed vs Size

Fig. 3. Each data point represents a particular algorithm’s average speed and accuracy when registering together randomly transformed Stanford Bunnies. We produce multiple points for each algorithm at different speed/accuracy levels by applying the methods multiple times to different sized point clouds. The lower left corner shows the fastest and most accurate algorithms for a particular model size. Our proposed algorithms (black, cyan, and red) tend to dominate the bottom left corner, though robust point-to-plane ICP methods sometimes produce more accurate results, albeit at much slower speeds (e.g. Trimmed ICP).

It should be noted that even though our proposed algorithms (black, cyan,
and red) tend to dominate the lower left corner of Figure 3(a), certain robust
point-to-plane ICP methods sometimes produce more accurate results, albeit at
much slower speeds. For example, Figure 3 shows some point-to-plane ICP re-
sults with less than $10^{-2}$ degrees of angular error and near 1 second convergence time. We
estimate that this timing gap might be decreased given a good GPU-optimized
robust planar ICP implementation, though it is unclear if the neighborhood-
based planar approximation scheme used by these algorithms could benefit from
GPU parallelization as much as our proposed Expectation Maximization ap-
proach, which is designed to be almost completely data parallel at the point

(a) Urban scene with many (b) Snowy, hilly terrain with
rectilinear structures few features

Fig. 4. Speed vs accuracy tests for two types of real-world LiDAR frames with very
different sampling properties from the Stanford Bunny. In general, similar results are
obtained as in Figure 3.

level. However, if computation time is not a constraint for a given application
(e.g. offline approaches), we would recommend trying both types of algorithms
(our model-based approach vs a robust planar ICP-based approach) to see which
provides the best accuracy.
For completeness, we repeated the test with two frames of real-world LiDAR
data, randomly transformed and subsampled at varying rates as before in order to
obtain our set of speed/accuracy pairs. The results are shown in Figure 4. As in
Fig. 3(a), the bottom left corner is most desirable (both fast and accurate), and
our methods are shown in red, cyan, and black. Given that the bunny and LiDAR scans
have very different sampling properties, the similar outcome across all three tests
shows that the relative performance of the proposed approach does not depend
on evenly sampled point clouds.

6 Evaluation on Real-World Data

Lounge Dataset. In this test, we calculate the frame-to-frame accuracy on the
Stanford Lounge dataset, which consists of range data produced by moving a
handheld Kinect around an indoor environment [39]. We register together every
5th frame for the first 400 frames, each downsampled to 5000 points. To measure
the resulting error, we calculate the average Euler angle deviation from ground
truth. Refer to Table 2(a) for error and timing. All our experiments were run on
Intel Core i7-5920K and NVIDIA Titan X. We chose to focus on rotation error
since this was where the largest discrepancies were found among algorithms. The
best performing algorithm we tested against, Trimmed ICP with point-to-plane
distance error minimization, had an average Euler angle error of 0.54 degrees and

(a) Lounge Dataset

Method         Ang. Error (°)   Speed (fps)
CPD                 2.11           0.18
GMMReg              3.02           0.04
NDT-D2D            14.25          11.76
FICP                1.44           4.9
ICP                 7.29           2.58
IRLS-ICP            2.29           7.1
EMICP              10.47           1.44
SVR                 2.67           0.35
ECMPR               2.21           0.059
JRMPC               8.27           0.042
TrICP-pt2pl         0.54           8.4
TrICP-pt2pt         1.26           5.5
ICP-pt2pl           2.24           7.6
GMM-Tree L2         0.77          31.3
GMM-Tree L3         0.48          20.4
GMM-Tree L4         0.56          14.2
Adaptive L2         0.76          29.6
Adaptive L3         0.46          19.8
Adaptive L4         0.37          14.5

(b) LiDAR Dataset

Method         Ang. Error (°)   Trans. Error (cm)   Speed (fps)
CPD                 0.15              17.2             0.004
GMMReg              0.73             102.1             0.22
NDT-D2D             0.17              16.0             0.88
FICP                0.15              35.1             1.01
ICP                 0.26              15.0             1.35
IRLS-ICP            0.15              14.7             1.28
EMICP               0.99             103.1             2.05
SVR                 0.21              39.1             0.27
ECMPR               0.31              24.1             0.21
JRMPC               0.60              73.1             0.05
TrICP-pt2pl         0.15              43.2             1.74
TrICP-pt2pt         0.21              66.2             1.75
ICP-pt2pl           0.27               7.5             1.48
GMM-Tree L2         0.11              12.5            39.34
GMM-Tree L3         0.18              23.9            21.41
GMM-Tree L4         0.20              29.5            15.00
Adaptive L2         0.12              10.0            39.20
Adaptive L3         0.15               8.8            22.82
Adaptive L4         0.15               9.2            16.91

Table 2. Comparison of Registration Methods for the Lounge and LiDAR
Datasets. Timing results for both datasets include the time to build the GMM-Tree.
Errors are frame-to-frame averages. Speed given is in average frames per second that
the data could be processed (note that the sensor outputs data frames at 30 Hz for the
Lounge data and roughly 10 Hz for the LiDAR data).

took on average 119 ms to converge. Our best algorithm, the adaptive algorithm
to a max depth of 3, had an average Euler angle error of 0.46 degrees and took
on average less than half the time (50.5 ms) to converge. Also, note that our
times include the time to build the model (the GMM-Tree), which could have
other benefits for applications that need to utilize such a model for tasks like
loop closure detection, or map building. The accuracy of our proposed methods
on this dataset is comparable with the best ICP variants, but at roughly twice
the speed.
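
The error metric and the timing conversion used above can be sketched as follows. This is a minimal illustration, not the evaluation code behind Table 2: the exact definition of "average Euler angle deviation" is not spelled out in the text, so the sketch assumes it is the mean absolute Euler angle (Z-Y-X convention) of the residual rotation between the estimate and ground truth. The function names are hypothetical.

```python
import numpy as np

def rot_z(deg):
    """Rotation about the z-axis by `deg` degrees (helper for the usage example)."""
    a = np.radians(deg)
    c, s = np.cos(a), np.sin(a)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def euler_zyx_degrees(R):
    """Euler angles (roll, pitch, yaw) in degrees, assuming R = Rz(yaw) @ Ry(pitch) @ Rx(roll)."""
    pitch = -np.arcsin(np.clip(R[2, 0], -1.0, 1.0))
    roll = np.arctan2(R[2, 1], R[2, 2])
    yaw = np.arctan2(R[1, 0], R[0, 0])
    return np.degrees(np.array([roll, pitch, yaw]))

def average_euler_deviation(R_est, R_gt):
    """Mean absolute Euler angle (degrees) of the residual rotation R_gt^T @ R_est.

    Assumption: this is one plausible reading of the metric, not a confirmed definition.
    """
    return float(np.mean(np.abs(euler_zyx_degrees(R_gt.T @ R_est))))

def fps_to_ms(fps):
    """Convert a processing rate in frames per second to milliseconds per frame."""
    return 1000.0 / fps

# Usage: an estimate that is off by 3 degrees about one axis, and the two
# rates quoted for TrICP-pt2pl (8.4 fps) and Adaptive L3 (19.8 fps).
err = average_euler_deviation(rot_z(3.0), np.eye(3))
print(err, fps_to_ms(8.4), fps_to_ms(19.8))
```

The conversion reproduces the per-frame times quoted in the text: 8.4 fps corresponds to roughly 119 ms and 19.8 fps to roughly 50.5 ms.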
See Figure 7 for the error distribution over these frames as a set of histograms.
We show the error histogram as a few misaligned results can potentially skew
the average error, while the histogram gives a more complete picture of both
the accuracy and robustness of the method. Note that the previous GMM-based
methods (in green) are often prohibitively slow for most applications, while ICP
variants perform better. The accuracy of our proposed methods on this dataset
is close to that of the best ICP variants, at roughly twice the speed.

Velodyne LiDAR Dataset. We performed frame-to-frame registration on an
outdoor LiDAR dataset using a Velodyne (VLP-16) LiDAR and overlaid the
results in a common global frame. See Figure 5 for a qualitative depiction of the
result. Table 2(b) summarizes the quantitative results from Figure 5 in an easier
to read table format.

Fig. 5. Frame-to-Frame Registration with Outdoor LiDAR Dataset: Ground
truth path shown in red, calculated path shown in blue. Each frame of LiDAR data
represents a single sweep. We register successive frames together and concatenate the
transformation matrices in order to plot the results in a single coordinate system.
Note that drift is expected over such long distances as we perform no loop closures.
The top row shows related GMM-based methods, the middle row shows modern ICP
implementations, and the bottom row shows three of our proposed adaptive GMM-Tree
methods at three different max recursion levels. For our methods, the timing results
include the time to build the recursive GMM model. GMM-Based methods generally
perform slowly and GMMReg in particular diverged. ICP-based methods fared better
in our testing, though our proposed methods show an order of magnitude improvement
in speed while beating or competing with other state-of-the-art in accuracy.

In Figure 5, the ground truth path is shown in red, and the calculated path is
shown in blue. Since there are no loop closures, the error is expected to compound
and cause drift over time. However, despite the compounding error, the bottom
row of Figure 5 (and correspondingly, the bottom three line items of Table 2(b))

Fig. 6. Example of lounge depth stream from which we produce input point clouds for
our frame-to-frame registration comparison.

Fig. 7. Lounge Error Distributions. Histograms of frame-to-frame error as measured
by average Euler angular deviation from ground truth, for the Lounge dataset. Each row
shows a different type of method: the top row are previous GMM-based methods, the
middle row are robust ICP variants, and the bottom row are our proposed algorithms.
Note that while some ICP variants achieve good accuracy, our methods are roughly
twice as fast. This stands in contrast to previous GMM-based methods, which are
prohibitively slow.

shows that the proposed methods can be used for fairly long distances (city
blocks), without the need for any odometry (e.g. INS or GPS) or loop closures.
This particular LiDAR sensor (Velodyne VLP-16) outputs point cloud sweeps
at roughly 10 Hz with an average point cloud size of 13,878 points. Thus, our
methods achieve faster than real-time speeds (17-39 Hz), while the state-of-the-
art ICP methods are an order of magnitude slower (≈ 1 fps). Also, note that
our times include the time to build the model (the GMM-Tree), which could be
utilized for other concurrent applications besides registration (e.g. probabilistic
mesh visualization or loop detection).
To reiterate the description given in the main paper, the results from
the LiDAR test depict single frame-to-frame registration results, where each
new result's calculated transformation is concatenated with all previous ones so
as to put all data into a single global frame. Since there are no loop closures,
the error is expected to compound, eventually causing extreme drift over
time. However, despite the compounding error, the bottom row of Figure
5 shows that the proposed methods can be used for fairly long distances (city
blocks), without the need for any odometry (e.g. INS or GPS) or loop closures.
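
The concatenation of frame-to-frame results into a single global frame can be sketched as below. This is an illustrative sketch only: it assumes each registration result is a 4x4 homogeneous matrix that maps points of frame k+1 into the coordinates of frame k, and the function names are hypothetical.

```python
import numpy as np

def translation(tx, ty, tz):
    """Build a 4x4 homogeneous transform that only translates (helper for the example)."""
    T = np.eye(4)
    T[:3, 3] = [tx, ty, tz]
    return T

def chain_to_global(relative_transforms):
    """Compose frame-to-frame transforms into global poses.

    relative_transforms[k] is assumed to map frame k+1 coordinates into frame k.
    Returns one 4x4 pose per frame, with frame 0 as the global reference.
    """
    pose = np.eye(4)
    poses = [pose.copy()]
    for T in relative_transforms:
        pose = pose @ T          # errors in T accumulate here, which is the source of drift
        poses.append(pose.copy())
    return poses

# Usage: two successive 1-unit steps along x place the third frame at x = 2.
poses = chain_to_global([translation(1.0, 0.0, 0.0), translation(1.0, 0.0, 0.0)])
path = np.array([P[:3, 3] for P in poses])  # sensor positions to plot as a trajectory
print(path)
```

Because every pose is a product of all earlier estimates, a small error in any single registration propagates to all later frames, which is why drift is expected without loop closures.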

7 Conclusion
To conclude, we proposed a new model-based point cloud registration algorithm
that uses a tree of Gaussian mixtures as our data representation. Our proposed
technique uses this modeling hierarchy in order to adaptively and efficiently per-
form association between the model and point data. Framing point association
as a recursive tree search results in orders of magnitude speed-up relative to tra-
ditional GMM-based approaches that must perform these associations linearly.
To further take advantage of the chosen multi-scale anisotropic GMM represen-
tation, we introduce a new approximation scheme using PCA that reduces the
MLE optimization to a weighted point-to-plane measure. We tested our proposed
methods along with a large variety of state-of-the-art registration methods and
found that our approach is appropriate for many types of point cloud data and
often an order of magnitude faster than current state-of-the-art while producing
similar or greater accuracy. The implementation of our method can be found at
http://placeholder.

References
1. Drost, B., Ulrich, M., Navab, N., Ilic, S.: Model globally, match locally: Efficient
and robust 3d object recognition. In: CVPR, 2010 IEEE Conference on. (2010)
998–1005 1
2. Nüchter, A., Lingemann, K., Hertzberg, J., Surmann, H.: 6d slam—3d map-
ping outdoor environments: Research articles. J. Field Robot. 24(8-9) (2007) 699–
722 1
3. Newcombe, R.A., Davison, A.J., Izadi, S., Kohli, P., Hilliges, O., Shotton, J.,
Molyneaux, D., Hodges, S., Kim, D., Fitzgibbon, A.: Kinectfusion: Real-time dense
surface mapping and tracking. In: IEEE ISMAR, IEEE (2011) 127–136 1
4. Park, I.K., Germann, M., Breitenstein, M.D., Pfister, H.: Fast and automatic object
pose estimation for range images on the gpu. Machine Vision and Applications 21
(2010) 749–766 1
5. Tam, G.K., Cheng, Z.Q., Lai, Y.K., Langbein, F., Liu, Y., Marshall, A.D., Martin,
R., Sun, X., Rosin, P.: Registration of 3d point clouds and meshes: A survey from
rigid to nonrigid. IEEE Transactions on Visualization and Computer Graphics
19(7) (2013) 1199–1217 1
6. Mehta, S.U., Kim, K., Pajak, D., Pulli, K., Kautz, J., Ramamoorthi, R.: Filtering
Environment Illumination for Interactive Physically-Based Rendering in Mixed
Reality. In: Eurographics Symposium on Rendering. (2015) 1
7. Hahnel, D., Thrun, S., Burgard, W.: An extension of the icp algorithm for modeling
nonrigid objects with mobile robots. In: Proceedings of the 18th International Joint
Conference on Artificial Intelligence. IJCAI’03 (2003) 915–920 1
8. Levinson, J., Askeland, J., Becker, J., Dolson, J., Held, D., Kammel, S., Kolter,
J.Z., Langer, D., Pink, O., Pratt, V., Sokolsky, M., Stanek, G., Stavens, D.M.,
Teichman, A., Werling, M., Thrun, S.: Towards fully autonomous driving: Systems
and algorithms. In: Intelligent Vehicles Symposium, IEEE (2011) 163–168 1
9. Besl, P., McKay, H.: A method for registration of 3-D shapes. IEEE Transactions
on Pattern Analysis and Machine Intelligence 14(2) (1992) 239–256 2, 3
10. Chen, Y., Medioni, G.: Object modelling by registration of multiple range images.
Image and Vision Computing 10(3) (1992) 145 – 155 Range Image Understanding.
2, 10
11. Rusinkiewicz, S., Levoy, M.: Efficient variants of the ICP algorithm. In: Interna-
tional Conference on 3-D Digital Imaging and Modeling. (2001) 145–152 2
12. Gold, S., Rangarajan, A., Lu, C., Pappu, S., Mjolsness, E.: New algorithms for 2d
and 3d point matching: pose estimation and correspondence. Pattern Recognition
31(8) (1998) 1019–1031 2, 3
13. Tamaki, T., Abe, M., Raytchev, B., Kaneda, K.: Softassign and EM-ICP on GPU.
In: IEEE International Conference on Networking and Computing. (2010) 179–183
2, 3
14. Myronenko, A., Song, X.: Point set registration: Coherent point drift. IEEE
Transactions on Pattern Analysis and Machine Intelligence 32(12) (2010) 2262–
2275 2, 3, 5
15. Dempster, A., Laird, N., Rubin, D.: Maximum likelihood from incomplete data
via the em algorithm. Journal of the Royal Statistical Society. (1977) 1–38 2
16. Segal, A., Haehnel, D., Thrun, S.: Generalized ICP. Robotics: Science and Systems
2(4) (2009) 2, 3
17. Granger, S., Pennec, X.: Multi-scale EM-ICP: A fast and robust approach for
surface registration. ECCV 2002 (2002) 69–73 2, 3, 5, 9, 11
18. Chui, H., Rangarajan, A.: A feature registration framework using mixture mod-
els. In: IEEE Workshop on Mathematical Methods in Biomedical Image Analysis.
(2000) 190–197 2, 3
19. Evangelidis, G.D., Kounades-Bastian, D., Horaud, R., Psarakis, E.Z.: A generative
model for the joint registration of multiple point sets. In: ECCV 2014. (2014) 109–
122 2, 3, 9, 11
20. Horaud, R., Forbes, F., Yguel, M., Dewaele, G., Zhang, J.: Rigid and articulated
point registration with expectation conditional maximization. IEEE Trans. on
Pattern Analysis and Machine Intelligence 33(3) (2011) 587–602 2, 3, 9
21. Fitzgibbon, A.W.: Robust registration of 2d and 3d point sets. Image and Vision
Computing 21(13) (2003) 1145–1153 3
22. Tsin, Y., Kanade, T.: A correlation-based approach to robust point set registration.
ECCV 2004 (2004) 558–569 3
23. Chetverikov, D., Stepanov, D., Krsek, P.: Robust euclidean alignment of 3d point
sets: the trimmed iterative closest point algorithm. Image and Vision Computing
23(3) (2005) 299–309 3
24. Phillips, J.M., Liu, R., Tomasi, C.: Outlier robust icp for minimizing fractional
rmsd. In: 3-D Digital Imaging and Modeling, 2007. 3DIM’07. Sixth International
Conference on, IEEE (2007) 427–434 3, 4
25. Jian, B., Vemuri, B.C.: Robust point set registration using gaussian mixture mod-
els. Pattern Analysis and Machine Intelligence, IEEE Transactions on 33(8) (2011)
1633–1645 3
26. Stoyanov, T.D., Magnusson, M., Andreasson, H., Lilienthal, A.: Fast and accurate
scan registration through minimization of the distance between compact 3D NDT
representations. International Journal of Robotics Research (2012) 3
27. Eckart, B., Kelly, A.: REM-seg: A robust EM algorithm for parallel segmentation
and registration of point clouds. In: IEEE Conf. on Intelligent Robots and Systems.
(2013) 4355–4362 3
28. Eckart, B., Kim, K., Troccoli, A., Kelly, A., Kautz, J.: Mlmd: Maximum likeli-
hood mixture decoupling for fast and accurate point cloud registration. In: IEEE
International Conference on 3D Vision, IEEE (2015) 2, 3, 5, 9, 11
29. Campbell, D., Petersson, L.: An adaptive data representation for robust point-set
registration and merging. In: Proceedings of the IEEE International Conference
on Computer Vision. (2015) 4292–4300 3, 5
30. Evangelidis, G.D., Horaud, R.: Joint alignment of multiple point sets with batch
and incremental expectation-maximization. IEEE Transactions on Pattern Anal-
ysis and Machine Intelligence (2017) 3
31. Jian, B., Vemuri, B.C.: Robust point set registration using Gaussian mixture
models. IEEE Trans. Pattern Anal. Mach. Intell. 33(8) (2011) 1633–1645 2, 3, 5,
11
32. Jian, B., Vemuri, B.C.: A robust algorithm for point set registration using mixture
of Gaussians. In: IEEE Intern. Conf. on Computer Vision. (2005) 1246–1251 2, 3
33. Eckart, B., Kim, K., Troccoli, A., Kelly, A., Kautz, J.: Accelerated generative
models for 3d point cloud data. In: CVPR, IEEE (2016) 2, 5, 6
34. Myronenko, A., Song, X., Carreira-Perpinán, M.A.: Non-rigid point set registra-
tion: Coherent point drift. In: Advances in Neural Information Processing Systems.
(2006) 1009–1016 3
35. Meng, X.L., Rubin, D.B.: Maximum likelihood estimation via the ECM algorithm:
A general framework. Biometrika 80(2) (1993) 267–278 3
36. Stoyanov, T., Magnusson, M., Lilienthal, A.J.: Point set registration through min-
imization of the L2 distance between 3D-NDT models. In: IEEE International
Conference on Robotics and Automation. (2012) 5196–5201 3
37. Chetverikov, D., Svirko, D., Stepanov, D., Krsek, P.: The trimmed iterative closest
point algorithm. In: Pattern Recognition, 2002. Proceedings. 16th International
Conference on. Volume 3., IEEE (2002) 545–548 4
38. Pomerleau, F., Colas, F., Siegwart, R., Magnenat, S.: Comparing ICP Variants on
Real-World Data Sets. Autonomous Robots 34(3) (February 2013) 133–148 4
39. Zhou, Q.Y., Koltun, V.: Dense scene reconstruction with points of interest. ACM
Transactions on Graphics 32(4) (2013) 112 8, 13
40. Lalonde, J., Unnikrishnan, R., Vandapel, N., Hebert, M.: Scale selection for classi-
fication of Point-Sampled 3-D surfaces. In: Fifth International Conference on 3-D
Digital Imaging and Modeling (3DIM’05), Ottawa, ON, Canada (2005) 285–292
10
41. Unnikrishnan, R., Lalonde, J., Vandapel, N., Hebert, M.: Scale selection for the
analysis of Point-Sampled curves. In: 3D Data Processing Visualization and Trans-
mission, International Symposium on. Volume 0., Los Alamitos, CA, USA, IEEE
Computer Society (2006) 1026–1033 10
42. Low, K.L.: Linear least-squares optimization for point-to-plane icp surface regis-
tration. Chapel Hill, University of North Carolina 4 (2004) 10

Appendices

Related Work: Sources

In this section, we supply all the sources used for the related work we tested
against in the main paper. We used the authors' suggested parameters for their
software unless otherwise noted.

SVR https://sites.google.com/view/djcampbell/research-software
Note: Annealing disabled (resulted in better speeds without much loss in accu-
racy)

GPU-Accelerated SoftAssign, EM-ICP and ICP https://github.com/tttamaki/cuda_emicp_softassign

ECMPR https://team.inria.fr/perception/research/ecmpr/

CPD (Python): https://github.com/siavashk/pycpd
(C++): https://github.com/gadomski/cpd

JRMPC https://team.inria.fr/perception/research/jrmpc/
Note: JRMPC is designed for multi-frame batch registration, though we tested
it as a frame-to-frame registration method in the paper. We decided to include
it despite this due to its similarity to our method.

Trimmed Point-to-Plane ICP, Trimmed Point-to-Point ICP, Fractional
ICP, ICP with Iteratively Reweighted Least Squares, Point-to-Point
ICP https://github.com/ethz-asl/libpointmatcher
Note: These algorithms were each implemented using the libpointmatcher library.
In every case, IdentityDataPointsFilter was used as the sole data points filter.
That is, no subsampling was performed for any of these methods. We based
our point-to-point and point-to-plane configurations on the sample yaml files
found in the evaluations/official_solutions directory (Besl92_pt2point.yaml and
Chen91_pt2plane.yaml).

1. Trimmed Point-to-Plane ICP: Trimmed distance ratio = 0.7, 7-NN for nor-
mal calculation
2. Trimmed Point-to-Point ICP: Trimmed distance ratio = 0.75
3. Point-to-Plane ICP: 10-NN for normal calculation
4. Fractional ICP: Trimmed distance filter minRatio = 0.05
5. IRLS-ICP: Robust Welsch outlier filter scale = 5.0

GMMReg https://github.com/bing-jian/gmmreg

NDT-D2D, NDT-P2D http://wiki.ros.org/ndt_registration

Note: We utilized slightly larger voxel sizes on the LiDAR dataset than was
default so as to avoid out of memory errors.

1. NDT-P2D: 4-level multiscale (voxel sizes of 0.5, 1, 2, and 4) (to avoid out of
memory errors on LiDAR dataset)
2. NDT-D2D: 4-level multiscale (voxel sizes of 0.5, 1, 2, and 4) (to avoid out of
memory errors on LiDAR dataset)

Expanded Derivation: Mahalanobis Estimation (Sec 4.2)

In this final section, we derive the full weighted least squares solution for the
Mahalanobis approximator given in Section 4.2.

Recall that with a GMM of size J, the MLE optimization is as follows,

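$$
\hat{T} = \arg\min_{T} \sum_{j=1}^{J} \sum_{l=1}^{3} \frac{\pi_j^*}{\lambda_{jl}} \left( n_{jl}^T \left( T(\mu_j^*) - \mu_j \right) \right)^2 \tag{12}
$$

where the set of $n_{jl}$, $l = 1..3$ represent the 3 eigenvectors for the $j$th Gaussian
(anisotropic) covariance, and $\lambda_{jl}$ the associated eigenvalues.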
To set up the weighted linear least squares solution, we will first define three
vectors of weights,
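$$
w_1 = \begin{bmatrix} \sqrt{\pi_1^* / \lambda_{11}} \\ \sqrt{\pi_2^* / \lambda_{21}} \\ \sqrt{\pi_3^* / \lambda_{31}} \\ \vdots \\ \sqrt{\pi_J^* / \lambda_{J1}} \end{bmatrix}, \quad
w_2 = \begin{bmatrix} \sqrt{\pi_1^* / \lambda_{12}} \\ \sqrt{\pi_2^* / \lambda_{22}} \\ \sqrt{\pi_3^* / \lambda_{32}} \\ \vdots \\ \sqrt{\pi_J^* / \lambda_{J2}} \end{bmatrix}, \quad
w_3 = \begin{bmatrix} \sqrt{\pi_1^* / \lambda_{13}} \\ \sqrt{\pi_2^* / \lambda_{23}} \\ \sqrt{\pi_3^* / \lambda_{33}} \\ \vdots \\ \sqrt{\pi_J^* / \lambda_{J3}} \end{bmatrix} \tag{13}
$$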

Next we define the following three vectors,

$$
b_1 = \begin{bmatrix} n_{11}^T (\mu_1^* - \mu_1) \\ n_{21}^T (\mu_2^* - \mu_2) \\ n_{31}^T (\mu_3^* - \mu_3) \\ \vdots \\ n_{J1}^T (\mu_J^* - \mu_J) \end{bmatrix}, \quad
b_2 = \begin{bmatrix} n_{12}^T (\mu_1^* - \mu_1) \\ n_{22}^T (\mu_2^* - \mu_2) \\ n_{32}^T (\mu_3^* - \mu_3) \\ \vdots \\ n_{J2}^T (\mu_J^* - \mu_J) \end{bmatrix}, \quad
b_3 = \begin{bmatrix} n_{13}^T (\mu_1^* - \mu_1) \\ n_{23}^T (\mu_2^* - \mu_2) \\ n_{33}^T (\mu_3^* - \mu_3) \\ \vdots \\ n_{J3}^T (\mu_J^* - \mu_J) \end{bmatrix} \tag{14}
$$

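We denote the set of $J$ eigenvectors associated with each covariance's $l$th
eigenvalue as $n_{\lambda_l}$. Note that $\odot$ is the elementwise product. We weight the nor-
mals and vertically stack $\mu_j^*$ as follows,

$$
n = \begin{bmatrix} [\, w_1 \;\; w_1 \;\; w_1 \,] \odot n_{\lambda_1} \\ [\, w_2 \;\; w_2 \;\; w_2 \,] \odot n_{\lambda_2} \\ [\, w_3 \;\; w_3 \;\; w_3 \,] \odot n_{\lambda_3} \end{bmatrix}, \qquad
\hat{\mu} = \begin{bmatrix} \mu_j^* \\ \mu_j^* \\ \mu_j^* \end{bmatrix} \tag{15}
$$
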
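If we use the small angle assumption to linearize rotation, we are left with the
following overconstrained linear system, where $\{\alpha, \beta, \gamma\}$ are the rotation angles
and $\{t_x, t_y, t_z\}$ are the optimal translation parameters.

$$
\begin{bmatrix} \hat{\mu} \times n & n \end{bmatrix}
\begin{bmatrix} \alpha \\ \beta \\ \gamma \\ t_x \\ t_y \\ t_z \end{bmatrix}
=
\begin{bmatrix} w_1 \odot b_1 \\ w_2 \odot b_2 \\ w_3 \odot b_3 \end{bmatrix} \tag{16}
$$
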
Equation 16 can be solved with any standard linear least squares algorithm.
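
As a minimal sketch of that last step, the system of Equation 16 can be assembled and solved with NumPy's `np.linalg.lstsq`. The function name, argument layout, and synthetic data below are assumptions made for illustration, not the authors' implementation. The rows are ordered by component rather than by eigenvector index, which does not change the least-squares solution. The right-hand side is taken as $n_{jl}^T(\mu_j - \mu_j^*)$ so that the recovered transform maps $\mu_j^*$ onto $\mu_j$; this sign convention is an assumption of the sketch and depends on the direction in which $T$ is defined.

```python
import numpy as np

def solve_linearized_mle(mu_star, mu, normals, eigvals, pi_star):
    """Solve the small-angle weighted least-squares system for rotation and translation.

    mu_star : (J, 3) means to be transformed;  mu : (J, 3) reference means
    normals : (J, 3, 3), normals[j, l] is the l-th eigenvector of covariance j
    eigvals : (J, 3) matching eigenvalues;     pi_star : (J,) mixture weights
    Returns (angles, translation), each of shape (3,).
    """
    w = np.sqrt(pi_star[:, None] / eigvals)                # (J, 3): sqrt(pi_j / lambda_jl)
    mu_rep = np.repeat(mu_star[:, None, :], 3, axis=1)     # (J, 3, 3): mu*_j once per eigenvector
    cross = np.cross(mu_rep, normals)                      # mu*_j x n_jl
    A = np.concatenate([cross, normals], axis=2) * w[:, :, None]   # weighted rows [mu* x n, n]
    diff = (mu - mu_star)[:, None, :]                      # (J, 1, 3), assumed sign convention
    b = w * np.sum(normals * diff, axis=2)                 # weighted point-to-plane residuals
    x, *_ = np.linalg.lstsq(A.reshape(-1, 6), b.reshape(-1), rcond=None)
    return x[:3], x[3:]

# Synthetic check: data generated from the linearized motion model itself.
rng = np.random.default_rng(0)
J = 20
mu_star = rng.normal(size=(J, 3))
r_true = np.array([0.01, -0.02, 0.015])                    # small rotation angles (radians)
t_true = np.array([0.1, -0.05, 0.2])
mu = mu_star + np.cross(r_true, mu_star) + t_true
normals = np.stack([np.linalg.qr(rng.normal(size=(3, 3)))[0].T for _ in range(J)])
eigvals = rng.uniform(0.1, 1.0, size=(J, 3))
pi_star = np.full(J, 1.0 / J)

r_est, t_est = solve_linearized_mle(mu_star, mu, normals, eigvals, pi_star)
print(r_est, t_est)
```

Small eigenvalues receive large weights, so directions in which a Gaussian is thin (its surface normal, for planar patches) dominate the fit, which is what makes the criterion behave like a weighted point-to-plane measure.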

Tutorials
21 pages
A Perspective On Medical Robotics
No ratings yet
A Perspective On Medical Robotics
13 pages
Cruz Et Al. - 2023 - Low-Rank Motion Correction For Accelerated Free-Br
No ratings yet
Cruz Et Al. - 2023 - Low-Rank Motion Correction For Accelerated Free-Br
15 pages
3D Whole Brain Segmentation Using Spatially Localized Atlas Network Tiles
No ratings yet
3D Whole Brain Segmentation Using Spatially Localized Atlas Network Tiles
18 pages
Staging Act 2 Scene 2 of Othello
No ratings yet
Staging Act 2 Scene 2 of Othello
2 pages
GE 4D CT Perfusion
No ratings yet
GE 4D CT Perfusion
2 pages
Rtos Image Building For Different Target Platforms Lecture Five
No ratings yet
Rtos Image Building For Different Target Platforms Lecture Five
10 pages
IPA Course Content
No ratings yet
IPA Course Content
2 pages
Assignment - I
No ratings yet
Assignment - I
2 pages
Computer Vision Graph Cuts: Exploring Graph Cuts in Computer Vision
From Everand
Computer Vision Graph Cuts: Exploring Graph Cuts in Computer Vision
Fouad Sabry
No ratings yet
