Lec 19
These lecture summaries are designed to be a review of the lecture. Though I do my best to include all main topics from the lecture, the lectures contain more elaborate explanations than these notes.
Last time, we saw that one way we can find an optimal transformation between two coordinate systems in 3D is to decompose the optimal transformation into an optimal translation and an optimal rotation. We saw that we could solve for the optimal translation in terms of the rotation, and that we can mitigate the constraint issues of solving for an orthonormal rotation matrix by using quaternions to carry out rotation operations.
1.1.2 Quaternion Representations: Axis-Angle Representation and Orthonormal Rotation Matrices
From our previous discussion, we saw that another way we can represent quaternions is through the axis-angle notation (known as the Rodrigues formula). Combining the equations for the scalar and vector parts, we have the following axis-angle representation:

$$\mathring{q} \Longleftrightarrow (\hat{\omega}, \theta), \qquad q_0 = \cos\frac{\theta}{2}, \quad \mathbf{q} = \hat{\omega}\sin\frac{\theta}{2} \implies \mathring{q} = \left(\cos\frac{\theta}{2},\; \hat{\omega}\sin\frac{\theta}{2}\right)$$
We also saw that we can convert these quaternions to orthonormal rotation matrices. Recall that we can write our vector rotation operation as:

$$\mathring{q}\,\mathring{r}\,\mathring{q}^* = (\bar{Q}^T Q)\,\mathring{r}, \quad \text{where}$$

$$\bar{Q}^T Q = \begin{pmatrix} \mathring{q}\cdot\mathring{q} & 0 & 0 & 0 \\ 0 & q_0^2+q_x^2-q_y^2-q_z^2 & 2(q_x q_y - q_0 q_z) & 2(q_x q_z + q_0 q_y) \\ 0 & 2(q_y q_x + q_0 q_z) & q_0^2-q_x^2+q_y^2-q_z^2 & 2(q_y q_z - q_0 q_x) \\ 0 & 2(q_z q_x - q_0 q_y) & 2(q_z q_y + q_0 q_x) & q_0^2-q_x^2-q_y^2+q_z^2 \end{pmatrix}$$
The matrix $\bar{Q}^T Q$ has skew-symmetric components and symmetric components, which is useful for conversions. Going in the other direction, given an orthonormal rotation matrix we can recover the quaternion. If we want an axis-angle representation, we can look at the lower-right $3 \times 3$ submatrix $R = (r_{ij})$, specifically its diagonal and trace. For a unit quaternion, the diagonal entries give:

$$4q_0^2 = 1 + r_{11} + r_{22} + r_{33}, \quad 4q_x^2 = 1 + r_{11} - r_{22} - r_{33}, \quad 4q_y^2 = 1 - r_{11} + r_{22} - r_{33}, \quad 4q_z^2 = 1 - r_{11} - r_{22} + r_{33}$$

These equations can be solved by taking square roots, but due to the number of solutions (8 by Bezout's theorem, allowing for the flipped signs of quaternions), we should not use this set of equations alone to find the solution.
Instead, we can evaluate these four expressions, take the largest for numerical accuracy, arbitrarily select the positive square root (since $\mathring{q}$ and $-\mathring{q}$ represent the same rotation, there is a sign ambiguity), and solve for the remaining components. We will call this selected component $q_i$.
For the off-diagonals, which have symmetric and antisymmetric components, we derive the following equations:

$$r_{32} - r_{23} = 4q_0 q_x, \quad r_{13} - r_{31} = 4q_0 q_y, \quad r_{21} - r_{12} = 4q_0 q_z$$
$$r_{32} + r_{23} = 4q_y q_z, \quad r_{13} + r_{31} = 4q_x q_z, \quad r_{21} + r_{12} = 4q_x q_y$$
Adding/subtracting the off-diagonals gives us 6 relations, of which we only need 3 (since we already have 1 relation from the diagonals). For instance, if we have $q_i = q_y$, then we pick the off-diagonal relations involving $q_y$, and we solve the four equations given by:

$$4q_y^2 = 1 - r_{11} + r_{22} - r_{33}, \quad q_0 = \frac{r_{13} - r_{31}}{4q_y}, \quad q_x = \frac{r_{21} + r_{12}}{4q_y}, \quad q_z = \frac{r_{32} + r_{23}}{4q_y}$$
This system of four equations gives us a direct way of going from an orthonormal rotation matrix to a quaternion. Note that the matrix consists of 9 numbers that may be noisy, so we want to make sure we obtain the best fit.
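As a concrete illustration of this conversion, here is a minimal sketch in NumPy (the function name and structure are my own, not from the lecture): it evaluates the four diagonal expressions, keeps the largest for numerical accuracy, takes the positive root, and recovers the remaining components from the off-diagonal relations.

```python
import numpy as np

def quaternion_from_rotation_matrix(R):
    """Recover a unit quaternion (q0, qx, qy, qz) from a 3x3 rotation matrix.

    Evaluates 4*q0^2, 4*qx^2, 4*qy^2, 4*qz^2 from the diagonal, keeps the
    largest for numerical accuracy, takes the positive square root
    (q and -q represent the same rotation), then solves the off-diagonal
    relations for the other three components.
    """
    t = np.array([
        1.0 + R[0, 0] + R[1, 1] + R[2, 2],  # 4*q0^2
        1.0 + R[0, 0] - R[1, 1] - R[2, 2],  # 4*qx^2
        1.0 - R[0, 0] + R[1, 1] - R[2, 2],  # 4*qy^2
        1.0 - R[0, 0] - R[1, 1] + R[2, 2],  # 4*qz^2
    ])
    i = int(np.argmax(t))
    qi = 0.5 * np.sqrt(t[i])  # arbitrarily take the positive root
    d = 4.0 * qi
    if i == 0:    # q0 largest: use the antisymmetric off-diagonals
        q = [qi, (R[2, 1] - R[1, 2]) / d, (R[0, 2] - R[2, 0]) / d, (R[1, 0] - R[0, 1]) / d]
    elif i == 1:  # qx largest
        q = [(R[2, 1] - R[1, 2]) / d, qi, (R[1, 0] + R[0, 1]) / d, (R[0, 2] + R[2, 0]) / d]
    elif i == 2:  # qy largest
        q = [(R[0, 2] - R[2, 0]) / d, (R[1, 0] + R[0, 1]) / d, qi, (R[2, 1] + R[1, 2]) / d]
    else:         # qz largest
        q = [(R[1, 0] - R[0, 1]) / d, (R[0, 2] + R[2, 0]) / d, (R[2, 1] + R[1, 2]) / d, qi]
    return np.array(q)
```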
Taking scaling into account, we can write the relationship between two point clouds corresponding to two different coordinate systems as:

$$r'_r = sR(r'_l)$$

where rotation is again given by $R \in SO(3)$, and the scaling factor is given by $s \in \mathbb{R}^+$ (where $\mathbb{R}^+ \triangleq \{x \in \mathbb{R} : x > 0\}$). Recall that $r'_r$ and $r'_l$ are the centroid-subtracted variants of the point clouds in both frames of reference.
As we did for translation and rotation, we can solve for an optimal scaling parameter:
$$s^* = \arg\min_s \sum_{i=1}^n \left\|r'_{r,i} - sR(r'_{l,i})\right\|_2^2$$
$$= \arg\min_s \sum_{i=1}^n \|r'_{r,i}\|_2^2 - 2s\sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i}) + s^2\sum_{i=1}^n \|R(r'_{l,i})\|_2^2$$
$$= \arg\min_s \sum_{i=1}^n \|r'_{r,i}\|_2^2 - 2s\sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i}) + s^2\sum_{i=1}^n \|r'_{l,i}\|_2^2 \qquad \text{(rotation preserves vector lengths)}$$
Let us define the following terms:

1. $s_r \triangleq \sum_{i=1}^n \|r'_{r,i}\|_2^2$
2. $D \triangleq \sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i})$
3. $s_l \triangleq \sum_{i=1}^n \|r'_{l,i}\|_2^2$
Then we can write this objective for the optimal scaling factor $s^*$ as:

$$s^* = \arg\min_s \left\{J(s) \triangleq s_r - 2sD + s^2 s_l\right\}$$
Since this is an unconstrained optimization problem, we can solve this by taking the derivative w.r.t. $s$ and setting it equal to 0:

$$\frac{dJ(s)}{ds} = \frac{d}{ds}\left(s_r - 2sD + s^2 s_l\right) = -2D + 2s\,s_l = 0 \implies s = \frac{D}{s_l}$$
As we also saw with rotation, this does not give us an exact answer without finding the orthonormal matrix $R$, but now we are able to remove the scale factor and back-solve for it later using our optimal rotation.
Intuitively, this is the case because the version of OLS we used above "cheats" and tries to minimize error by shrinking the scale by more than it should be shrunk. This occurs because it brings the points closer together, thereby minimizing, on average, the error term. Let us look at an alternative formulation for our error term that accounts for this optimization phenomenon.
Then we can write our objective and optimization problem over scale as:
$$s^* = \arg\min_s \sum_{i=1}^n \left\|\frac{1}{\sqrt{s}}\,r'_{r,i} - \sqrt{s}\,R(r'_{l,i})\right\|_2^2$$
$$= \arg\min_s \frac{1}{s}\sum_{i=1}^n \|r'_{r,i}\|_2^2 - 2\sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i}) + s\sum_{i=1}^n \|R(r'_{l,i})\|_2^2$$
$$= \arg\min_s \frac{1}{s}\sum_{i=1}^n \|r'_{r,i}\|_2^2 - 2\sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i}) + s\sum_{i=1}^n \|r'_{l,i}\|_2^2 \qquad \text{(rotation preserves vector lengths)}$$
We then take the same definitions for these terms that we did above:
1. $s_r \triangleq \sum_{i=1}^n \|r'_{r,i}\|_2^2$
2. $D \triangleq \sum_{i=1}^n r'_{r,i} \cdot R(r'_{l,i})$
3. $s_l \triangleq \sum_{i=1}^n \|r'_{l,i}\|_2^2$
Then, as we did for the asymmetric OLS case, we can write this objective for the optimal scaling factor s∗ as:
$$s^* = \arg\min_s \left\{J(s) \triangleq \frac{1}{s}\,s_r - 2D + s\,s_l\right\}$$
Since this is an unconstrained optimization problem, we can solve this by taking the derivative w.r.t. $s$ and setting it equal to 0:

$$\frac{dJ(s)}{ds} = \frac{d}{ds}\left(\frac{1}{s}\,s_r - 2D + s\,s_l\right) = -\frac{1}{s^2}\,s_r + s_l = 0 \implies s^2 = \frac{s_r}{s_l}$$
Therefore, we can see that going in the reverse direction inverts the scale (you can verify this mathematically and intuitively by simply setting $r'_{r,i} \leftrightarrow r'_{l,i}\ \forall\, i \in \{1, ..., n\}$ and noting that you will get $s^2_{\text{inverse}} = \frac{s_l}{s_r} = \frac{1}{s^2}$). Since this method better preserves symmetry, it is preferred.
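As a quick sketch (the function name and array conventions are mine, not from the lecture), the symmetric scale estimate can be computed directly from the two point clouds, with no rotation or point-to-point correspondences needed:

```python
import numpy as np

def symmetric_scale(r_left, r_right):
    """Symmetric least-squares scale estimate between two point clouds.

    r_left, r_right: (n, 3) arrays of points in the left and right frames.
    Points are centroid-subtracted internally; the estimate
    s = sqrt(s_r / s_l) has no correspondence-dependent term and inverts
    cleanly when the roles of the two clouds are swapped.
    """
    rl = r_left - r_left.mean(axis=0)    # centroid-subtracted left cloud
    rr = r_right - r_right.mean(axis=0)  # centroid-subtracted right cloud
    s_l = np.sum(rl ** 2)                # summed squared lengths, left
    s_r = np.sum(rr ** 2)                # summed squared lengths, right
    return np.sqrt(s_r / s_l)
```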
Intuition: since $s$ no longer depends on correspondences (matches between points in the left and right point clouds), the scale simply becomes the ratio of the point cloud sizes in the two coordinate systems (note that $s_l$ and $s_r$ correspond to the summed squared vector lengths of the centroid-subtracted point clouds, which means they reflect the variance/spread/size of the point cloud in their respective coordinate systems).
We can thus deal with translation and scaling in a correspondence-free way, while also decoupling rotation. Let us now look at solving for rotation, which is covered in the next section.
If this were an unconstrained optimization problem, we could solve by taking the derivative of this objective w.r.t. our quaternion $\mathring{q}$ and setting it equal to zero. Note the following helpful identities from matrix and vector calculus:

1. $\frac{d}{da}(a \cdot b) = b$
2. $\frac{d}{da}(a^T M a) = 2Ma$ (for symmetric $M$)
However, since we are working with unit quaternions, we must take this constraint into account. We saw in lecture 18 that we did this using Lagrange multipliers; in this lecture we see that it is also possible to take this specific kind of vector-length constraint into account using Rayleigh quotients.
What are Rayleigh quotients? The intuitive idea behind them: how do I prevent my parameters from becoming too large (positive or negative) or too small (zero)? We can accomplish this by dividing our objective by our parameters, in this case the quantity appearing in our constraint. With the Rayleigh quotient taken into account, our objective becomes:

$$\frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}}, \qquad \text{recall that } N \triangleq \sum_{i=1}^n \bar{R}_{l,i}^T R_{r,i}$$
How do we solve this? Since this is now an unconstrained optimization problem, we can solve it simply using the rules of calculus:

$$J(\mathring{q}) \triangleq \frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}}$$

$$\frac{dJ(\mathring{q})}{d\mathring{q}} = \frac{\frac{d}{d\mathring{q}}(\mathring{q}^T N \mathring{q})\,\mathring{q}^T\mathring{q} - \mathring{q}^T N \mathring{q}\,\frac{d}{d\mathring{q}}(\mathring{q}^T\mathring{q})}{(\mathring{q}^T\mathring{q})^2} = 0$$

$$\implies \frac{2N\mathring{q}}{\mathring{q}^T\mathring{q}} - \frac{2\mathring{q}}{(\mathring{q}^T\mathring{q})^2}\,(\mathring{q}^T N \mathring{q}) = 0$$
From here, we can write this first-order condition as:

$$N\mathring{q} = \frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}}\,\mathring{q}$$

Note that $\frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}} \in \mathbb{R}$ (this is our objective). Therefore, we are searching for a vector of quaternion coefficients such that applying the matrix $N$ to this vector simply produces a scalar multiple of it, i.e. an eigenvector of the matrix $N$. Letting $\lambda \triangleq \frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}}$, this simply becomes $N\mathring{q} = \lambda\mathring{q}$. Since this optimization problem is a maximization problem, we pick the eigenvector of $N$ that corresponds to the largest eigenvalue, which in turn maximizes the objective: the Rayleigh quotient $\frac{\mathring{q}^T N \mathring{q}}{\mathring{q}^T \mathring{q}}$ equals the eigenvalue.
Even though this quaternion-based optimization approach requires taking this Rayleigh Quotient into account, it is much easier
to do this optimization than to solve for orthonormal matrices, which either require a complex Lagrangian (if we solve with
Lagrange multipliers) or an SVD decomposition from Euclidean space to the SO(3) group (which also happens to be a manifold).
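In practice, this eigenvector extraction is one line of linear algebra. A minimal sketch, assuming the symmetric $4 \times 4$ matrix $N$ has already been assembled from the data (the function name is illustrative):

```python
import numpy as np

def optimal_quaternion(N):
    """Pick the quaternion maximizing the Rayleigh quotient q^T N q / q^T q
    for a symmetric 4x4 matrix N built from the point-cloud data.

    np.linalg.eigh returns eigenvalues in ascending order, so the last
    column of the eigenvector matrix corresponds to the largest eigenvalue;
    it is already unit length, as a rotation quaternion should be.
    """
    eigenvalues, eigenvectors = np.linalg.eigh(N)
    return eigenvectors[:, -1]
```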
Let us start with two correspondences: if we have only two corresponding pairs of points in the 3D world, we can still rotate one object about the axis through the two points without violating the correspondences, so the transformation is not fully determined, i.e. we have an additional degree of freedom. Note that the distance between the correspondences is fixed.
Figure 1: Using two correspondences leads to only satisfying 5 of the 6 needed constraints to solve for translation and rotation
between two point clouds.
Because we have one more degree of freedom, this accounts for only 5 of the 6 needed constraints to solve for translation and
rotation, so we need to have at least 3 correspondences.
With 3 correspondences, we get 9 constraints, which leads to some redundancy. We can make use of more general transformations by incorporating scaling and generalizing the allowable transformations between the two coordinate systems to the general linear transformation; this corresponds to allowing non-orthonormal transformation matrices. This approach gives us 9 unknowns:

$$\begin{pmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{pmatrix} \begin{pmatrix} x \\ y \\ z \end{pmatrix} + \begin{pmatrix} a_{14} \\ a_{24} \\ a_{34} \end{pmatrix}$$
But we also have to account for translation, which gives us another 3 unknowns, for 12 in total, and therefore requires at least 4 non-redundant correspondences in order to compute the full general linear transformation. Note that this formulation has no orthonormality constraints either!
On a practical note, this is often not needed, especially for finding the absolute orientation between two cameras, because
oftentimes the only transformations that need to be considered due to the design constraints of the system (e.g. an autonomous
car with two lidar systems, one on each side) are translation and rotation.
Recall that our matrix $N$ composed of the data has some special properties. The coefficients of its characteristic polynomial $\lambda^4 + c_3\lambda^3 + c_2\lambda^2 + c_1\lambda + c_0 = 0$ satisfy:

1. $c_3 = \mathrm{tr}(N) = 0$ (this is actually a great feature, since usually the first step in solving 4th-order polynomial equations is eliminating the third-order term).
2. $c_2 = 2\,\mathrm{tr}(M^T M)$, where $M$ is defined as the sum of dyadic products between the points in the point clouds:
$$M \triangleq \sum_{i=1}^n r'_{l,i}\,r'^{\,T}_{r,i} \in \mathbb{R}^{3\times 3}$$
3. $c_1 = 8\det(M)$
4. $c_0 = \det(N)$
What happens if $\det(M) = 0$, i.e. the matrix $M$ is singular? Then by the formulas above, the coefficient $c_1 = 0$, and the problem reduces to:

$$\lambda^4 + c_2\lambda^2 + c_0 = 0$$
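As a small worked step (not spelled out in the notes), this biquadratic can be solved in closed form by substituting $\mu = \lambda^2$:

$$\mu^2 + c_2\mu + c_0 = 0 \implies \mu = \frac{-c_2 \pm \sqrt{c_2^2 - 4c_0}}{2}, \qquad \lambda = \pm\sqrt{\mu}$$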
This case corresponds to a special geometric configuration of the point clouds: specifically, when the points are coplanar. To describe this plane in space, we need only find a normal vector $\hat{n}$ that is orthogonal to all points in the (centroid-subtracted) point cloud, i.e. the component of each point in the $\hat{n}$ direction is 0. Therefore, we can describe the plane by the equation:

$$\hat{n} \cdot r'_i = 0 \quad \forall\, i$$
Figure 2: A coplanar point cloud can be described entirely by a surface normal of the plane n̂.
Note: in the absence of measurement noise, if one point cloud is coplanar, then the other point cloud must be as well (assuming that the transformation between the point clouds is a linear transformation). This does not necessarily hold when measurement noise is introduced.
Recall that our matrix $M$, which we used above to compute the coefficients of the characteristic polynomial describing this system, is given by:

$$M \triangleq \sum_{i=1}^n r'_{r,i}\,r'^{\,T}_{l,i}$$
Therefore, when a point cloud is coplanar, the null space of $M$ is non-trivial (it contains at least $\mathrm{Span}(\{\hat{n}\})$), and therefore $M$ is singular. Recall that a matrix $M \in \mathbb{R}^{n\times d}$ is singular if $\exists\, x \in \mathbb{R}^d,\ x \neq 0$ such that $Mx = 0$, i.e. the matrix has a non-trivial null space.
Figure 3: Two coplanar point clouds. This particular configuration allows us to estimate rotation in two simpler steps.
In this case, we can actually decompose finding the right rotation into two simpler steps!
1. Rotate one plane so it lies on top of the other plane. We can read off the axis and angle from the unit normal vectors of these two planes describing the coplanarity of these point clouds, given respectively by $\hat{n}_1$ and $\hat{n}_2$ (see the sketch after this list):
• Axis: We can find the axis by noting that the axis vector will be parallel to the cross product of $\hat{n}_1$ and $\hat{n}_2$, simply scaled to a unit vector:
$$\hat{\omega} = \frac{\hat{n}_1 \times \hat{n}_2}{\|\hat{n}_1 \times \hat{n}_2\|_2}$$
• Angle: We can also solve for the angle using the two unit vectors $\hat{n}_1$ and $\hat{n}_2$:
$$\cos\theta = \hat{n}_1 \cdot \hat{n}_2, \qquad \sin\theta = \|\hat{n}_1 \times \hat{n}_2\|_2$$
We now have an axis-angle representation for the rotation between these two planes, and since the normals describe the planes of the respective point clouds, we therefore have a rotation between the two point clouds! We can convert this axis-angle representation into a quaternion with the formula we have seen before:
$$\mathring{q} = \left(\cos\frac{\theta}{2},\; \hat{\omega}\sin\frac{\theta}{2}\right)$$
2. Perform an in-plane rotation. Now that we have the quaternion representing the rotation between these two planes, we can orient the two planes on top of each other, and then just solve a 2D least-squares problem for our in-plane rotation.
With these steps, we have a rotation between the two point clouds!
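A minimal sketch of step 1 (the function name and the atan2-based angle computation are my choices, and it assumes the two normals are not parallel):

```python
import numpy as np

def plane_alignment_quaternion(n1, n2):
    """Quaternion rotating unit normal n1 onto unit normal n2 (step 1 above).

    Axis: unit vector along n1 x n2; angle: atan2(||n1 x n2||, n1 . n2),
    which is numerically safer than acos of the dot product alone.
    Assumes n1 and n2 are unit vectors and not (anti-)parallel.
    """
    cross = np.cross(n1, n2)
    sin_theta = np.linalg.norm(cross)
    cos_theta = np.dot(n1, n2)
    theta = np.arctan2(sin_theta, cos_theta)  # rotation angle between planes
    omega = cross / sin_theta                 # unit rotation axis
    return np.concatenate(([np.cos(theta / 2)], np.sin(theta / 2) * omega))
```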
1.5 Robustness
In many methods in this course, we have looked at the use of Least Squares methods to solve for estimates in the presence
of noise and many data points. Least squares produces an unbiased, minimum-variance estimate if (along with a few other
assumptions) the dataset/measurement noise is Gaussian (Gauss-Markov Theorem) [1]. But what if the measurement noise is
non-Gaussian? How do we deal with outliers in this case?
It turns out that least-squares methods are not robust to outliers. One alternative approach is to use absolute error instead. Unfortunately, however, using absolute error does not have a closed-form solution. What are our other options for dealing with outliers? One particularly useful alternative is RANSAC.
RANSAC, or Random Sample Consensus, is an algorithm for robust estimation with least squares in the presence of outliers in the measurements. The goal is to find a least-squares estimate whose threshold band captures the inliers of the dataset, with all points outside this band treated as outliers. The high-level steps of RANSAC are as follows:
1. Random Sample: Sample the minimum number of points needed to fix the transformation (e.g. 3 for absolute orientation;
some recommend taking more).
2. Fit random sample of points: Usually this involves running least squares on the sample selected. This fits a line (or
hyperplane, in higher dimensions), to the randomly-sampled points.
3. Check Fit: Evaluate the line fitted on the randomly-selected subsample on the rest of the data, and determine if the fit produces an estimate that is consistent with the "inliers" of your dataset. If the fit is good enough, accept it; if it is not, draw another random sample. Note that this step has different variations: rather than immediately terminating once you have a good fit, you can run this many times and then take the best fit.
Furthermore, for step 3, we threshold the distance from the fitted line/hyperplane to determine which points of the dataset are inliers and which are outliers (see figure below). This band is usually a fixed-width band centered around the fitted line/hyperplane, and its width is typically determined by knowing some intrinsic structure of the dataset.
Figure 4: To evaluate the goodness of fit of our sampled points, as well as to determine inliers and outliers from our dataset, we use a fixed-width band centered around the fitted line.
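Here is a minimal sketch of the procedure for a 2D line fit (all parameter names and defaults are illustrative, not from the lecture); it uses the keep-the-best-fit variation mentioned in step 3:

```python
import numpy as np

def ransac_line(points, n_iters=100, threshold=0.1, min_inliers=20, rng=None):
    """Outlier-robust 2D line fit y = a*x + b via RANSAC.

    Repeats: (1) randomly sample the minimum number of points (2 for a
    line), (2) fit them, (3) count points inside the threshold band,
    keeping the candidate with the most inliers; then refits to them.
    points: (n, 2) array of (x, y) measurements.
    """
    rng = rng or np.random.default_rng()
    best_inliers = None
    for _ in range(n_iters):
        i, j = rng.choice(len(points), size=2, replace=False)
        (x1, y1), (x2, y2) = points[i], points[j]
        if np.isclose(x1, x2):                    # skip degenerate vertical pairs
            continue
        a = (y2 - y1) / (x2 - x1)                 # slope of candidate line
        b = y1 - a * x1                           # intercept of candidate line
        residuals = np.abs(points[:, 1] - (a * points[:, 0] + b))
        inliers = residuals < threshold           # points inside the band
        if best_inliers is None or inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    if best_inliers is None or best_inliers.sum() < min_inliers:
        raise RuntimeError("no acceptable consensus found")
    # final least-squares fit on the consensus set of inliers
    x, y = points[best_inliers, 0], points[best_inliers, 1]
    return np.polyfit(x, y, 1)                    # returns (a, b)
```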
Another interpretation of RANSAC: counting the "maximally-occupied" cell in Hough transform parameter space! Another way to find the best-fitting line that is robust to outliers:
1. Repeatedly sample subsets from the dataset/set of measurements, and fit these subsets of points using least squares
estimates.
2. For each fit, map the points to a discretized Hough transform parameter space, and have an accumulator array that keeps
track of how often a set of parameters falls into a discretized cell. Each time a set of parameters falls into a discretized
cell, increment it by one.
3. After N sets of random samples/least-squares fits, pick the parameters corresponding to the cell that is "maximally-occupied", aka has been incremented the most number of times! Take this as your outlier-robust estimate.
Figure 5: Another way to perform RANSAC using Hough Transforms: map each fit from the subsamples of measurements to a
discretized Hough Transform (parameter) space, and look for the most common discretized cell in parameter space to use for an
outlier-robust least-squares estimate.
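A minimal sketch of this accumulator idea for a 2D line fit (the function name, parameter ranges, and bin counts are illustrative choices, not from the lecture):

```python
import numpy as np

def hough_consensus_fit(points, n_fits=500, bins=50,
                        a_range=(-5.0, 5.0), b_range=(-5.0, 5.0), rng=None):
    """Repeatedly fit random point pairs and vote for the resulting
    (slope, intercept) in a discretized parameter grid; return the center
    of the maximally-occupied cell as an outlier-robust line estimate.
    """
    rng = rng or np.random.default_rng()
    acc = np.zeros((bins, bins), dtype=int)       # accumulator array
    for _ in range(n_fits):
        idx = rng.choice(len(points), size=2, replace=False)
        a, b = np.polyfit(points[idx, 0], points[idx, 1], 1)
        ia = int(np.floor((a - a_range[0]) / (a_range[1] - a_range[0]) * bins))
        ib = int(np.floor((b - b_range[0]) / (b_range[1] - b_range[0]) * bins))
        if 0 <= ia < bins and 0 <= ib < bins:
            acc[ia, ib] += 1                      # vote for this parameter cell
    ia, ib = np.unravel_index(np.argmax(acc), acc.shape)
    a = a_range[0] + (ia + 0.5) * (a_range[1] - a_range[0]) / bins
    b = b_range[0] + (ib + 0.5) * (b_range[1] - b_range[0]) / bins
    return a, b
```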
1.6 Sampling the Space of Rotations
Why are we interested in this space? Many orientation problems we have studied so far do not have a closed-form solution and may require sampling. How do we sample from the space of rotations?
One way to sample from a sphere is with latitude and longitude, given by $(\theta_i, \phi_i)$ respectively. The problem with this approach, however, is that we sample points that are too close together at the poles. Alternatively, we can generate random latitudes $\theta_i$ and longitudes $\phi_i$, where:
• $-\frac{\pi}{2} \le \theta_i \le \frac{\pi}{2} \;\forall\, i$
• $-\pi \le \phi_i \le \pi \;\forall\, i$
But this approach suffers from the same problem: it samples too densely at the poles. Can we do better?
Idea: Map all points (both inside the sphere and outside the sphere/inside the cube) onto the sphere by connecting a line
from the origin to the sampled point, and finding the point where this line intersects the sphere.
Figure 6: Sampling from a sphere by sampling from a cube and projecting it back to the sphere.
Problem with this approach: it disproportionately samples in the directions of the cube's corners and edges. We could use sampling weights to mitigate this effect, but better yet, we can simply discard any samples that fall outside the sphere and project the survivors onto it. To avoid numerical issues, it is also best to discard points very close to the origin.
Generalization to 4D: As we mentioned above, our goal is to generalize this from 3D to 4D. Cubes and spheres simply
become 4-dimensional - enabling us to sample quaternions.
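A minimal sketch of this rejection-sampling idea in 4D (the function name, the eps cutoff, and the defaults are illustrative):

```python
import numpy as np

def sample_random_quaternions(n, rng=None, eps=1e-3):
    """Uniformly sample unit quaternions by rejection sampling in the
    4D cube [-1, 1]^4, as in the projection idea above.

    Points outside the unit 4-sphere are discarded to avoid the bias
    toward the cube's corners; points too close to the origin are also
    discarded to avoid numerical trouble; survivors are then projected
    (normalized) onto the unit sphere of quaternions.
    """
    rng = rng or np.random.default_rng()
    samples = []
    while len(samples) < n:
        q = rng.uniform(-1.0, 1.0, size=4)
        r = np.linalg.norm(q)
        if eps < r <= 1.0:          # keep only points inside the sphere
            samples.append(q / r)   # project onto the unit 4-sphere
    return np.array(samples)
```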
As we did for the cube, we can do the same for polyhedra: to sample from the sphere, we can sample from the polyhedron, and then project onto the point on the sphere that intersects the line from the origin to the sampled point on the polyhedron. From this, the edges of these polyhedra project onto great circles on the sphere.
Fun fact: soccer balls have 32 faces! More related to geometry: the soccer ball pattern comes from a semi-regular solid, the truncated icosahedron.
The following rotations give convenient quaternion samples:

1. $0$ (identity): $\mathring{q} = (1, \mathbf{0})$
2. $\pi$ about $\hat{x}$: $\mathring{q} = (\cos\frac{\pi}{2}, \sin\frac{\pi}{2}\,\hat{x}) = (0, \hat{x})$
3. $\pi$ about $\hat{y}$: $\mathring{q} = (\cos\frac{\pi}{2}, \sin\frac{\pi}{2}\,\hat{y}) = (0, \hat{y})$
4. $\pi$ about $\hat{z}$: $\mathring{q} = (\cos\frac{\pi}{2}, \sin\frac{\pi}{2}\,\hat{z}) = (0, \hat{z})$
5. $\frac{\pi}{2}$ about $\hat{x}$: $\mathring{q} = (\cos\frac{\pi}{4}, \sin\frac{\pi}{4}\,\hat{x}) = \frac{1}{\sqrt{2}}(1, \hat{x})$
6. $\frac{\pi}{2}$ about $\hat{y}$: $\mathring{q} = (\cos\frac{\pi}{4}, \sin\frac{\pi}{4}\,\hat{y}) = \frac{1}{\sqrt{2}}(1, \hat{y})$
7. $\frac{\pi}{2}$ about $\hat{z}$: $\mathring{q} = (\cos\frac{\pi}{4}, \sin\frac{\pi}{4}\,\hat{z}) = \frac{1}{\sqrt{2}}(1, \hat{z})$
8. $-\frac{\pi}{2}$ about $\hat{x}$: $\mathring{q} = (\cos(-\frac{\pi}{4}), \sin(-\frac{\pi}{4})\,\hat{x}) = \frac{1}{\sqrt{2}}(1, -\hat{x})$
9. $-\frac{\pi}{2}$ about $\hat{y}$: $\mathring{q} = (\cos(-\frac{\pi}{4}), \sin(-\frac{\pi}{4})\,\hat{y}) = \frac{1}{\sqrt{2}}(1, -\hat{y})$
10. $-\frac{\pi}{2}$ about $\hat{z}$: $\mathring{q} = (\cos(-\frac{\pi}{4}), \sin(-\frac{\pi}{4})\,\hat{z}) = \frac{1}{\sqrt{2}}(1, -\hat{z})$
These 10 rotations by themselves give us 10 ways to sample the rotation space. How can we construct more samples? We can
do so by taking quaternion products, specifically, products of these 10 quaternions above. Let us look at just a couple of
these products:
1. $(0, \hat{x})(0, \hat{y})$:
$$(0, \hat{x})(0, \hat{y}) = (-\hat{x}\cdot\hat{y},\; \hat{x}\times\hat{y}) = (0, \hat{z})$$
We see that this simply produces the third axis, as we would expect. This does not give us a new rotation to sample from. Next, let us look at one that does.
2. $\frac{1}{\sqrt{2}}(1, \hat{x})\,\frac{1}{\sqrt{2}}(1, \hat{y})$:
$$\frac{1}{\sqrt{2}}(1, \hat{x})\,\frac{1}{\sqrt{2}}(1, \hat{y}) = \frac{1}{2}\left(1 - \hat{x}\cdot\hat{y},\; \hat{y} + \hat{x} + \hat{x}\times\hat{y}\right) = \frac{1}{2}\left(1,\; \hat{x} + \hat{y} + \hat{x}\times\hat{y}\right)$$
This yields the following axis-angle representation:
• Axis: $\frac{1}{\sqrt{3}}(1\;\;1\;\;1)^T$
• Angle: $\cos\frac{\theta}{2} = \frac{1}{2} \implies \frac{\theta}{2} = \frac{\pi}{3} \implies \theta = \frac{2\pi}{3}$
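These products are easy to check numerically. A minimal sketch of the quaternion product used above (the function name is illustrative):

```python
import numpy as np

def quaternion_product(p, q):
    """Product of quaternions p = (p0, p_vec), q = (q0, q_vec):
    (p0*q0 - p_vec . q_vec,  p0*q_vec + q0*p_vec + p_vec x q_vec).
    """
    p0, pv = p[0], np.asarray(p[1:])
    q0, qv = q[0], np.asarray(q[1:])
    scalar = p0 * q0 - np.dot(pv, qv)
    vector = p0 * qv + q0 * pv + np.cross(pv, qv)
    return np.concatenate(([scalar], vector))

# reproduce product 2 above: pi/2 rotations about x-hat and y-hat
p = np.array([1.0, 1.0, 0.0, 0.0]) / np.sqrt(2)
q = np.array([1.0, 0.0, 1.0, 0.0]) / np.sqrt(2)
print(quaternion_product(p, q))  # -> [0.5, 0.5, 0.5, 0.5]
```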
1.7 References
1. Gauss-Markov Theorem, Wikipedia: https://en.wikipedia.org/wiki/Gauss%E2%80%93Markov_theorem
MIT OpenCourseWare
https://ocw.mit.edu
For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms