Unit 5 Int345
Range data
❑ Range data is a 2-1/2 D or 3-D representation of the scene.
❑ A range image d(i, j) records the distance d to the corresponding scene point (X, Y, Z) at each image pixel (i, j).
❑ Range data may also be provided as a set of 3-D scene points (a point cloud).
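
A minimal sketch of the first representation in Python: back-projecting a range image d(i, j) into a point cloud under an assumed pinhole camera model. The intrinsics (fx, fy, cx, cy) and the synthetic image are illustrative values, not from any particular sensor.

```python
import numpy as np

def range_image_to_point_cloud(d, fx, fy, cx, cy):
    """Back-project a range image d(i, j) into an (N, 3) point cloud.

    Assumes a pinhole camera with focal lengths (fx, fy) and principal
    point (cx, cy); here d holds the depth Z at each pixel.
    """
    rows, cols = d.shape
    j, i = np.meshgrid(np.arange(cols), np.arange(rows))  # pixel coordinates
    Z = d
    X = (j - cx) * Z / fx  # lateral offset from the column index
    Y = (i - cy) * Z / fy  # vertical offset from the row index
    pts = np.stack([X, Y, Z], axis=-1).reshape(-1, 3)
    return pts[pts[:, 2] > 0]  # drop invalid (zero-depth) pixels

# Example: a flat synthetic 4x4 range image, 2 m from the camera
d = np.full((4, 4), 2.0)
print(range_image_to_point_cloud(d, fx=525.0, fy=525.0, cx=2.0, cy=2.0).shape)
```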
Imaging Techniques
■ Passive imaging
■ Stereo imaging
■ Active range sensing
o Time-of-flight sensors
o Triangulation-based sensors
o Structured light
Active range sensors
• Active range sensors are devices that use emitted energy, such
as light or sound, to measure the distance between the sensor
and an object.
• These sensors are commonly used in various applications,
including robotics, industrial automation, automotive systems,
and more.
• They work by emitting a signal, measuring the time it takes for
the signal to bounce off an object and return, and then using
this information to calculate the distance.
Time-of-Flight (ToF) Sensors
• ToF sensors use light, typically infrared, to measure the time it
takes for a light pulse to bounce off an object and return.
• They provide accurate distance measurements and are used in
applications like gesture recognition, augmented reality, and
indoor navigation.
Time-of-Flight Range Sensors
• The source and detector are collocated; a moving mirror scans the pulsed laser beam over the scene.
• t: time taken to travel the forward and return path; v: speed of light in the given medium.
• Distance: d = (v × t) / 2
• Laser-based time-of-flight range sensors are known as light detection and ranging (LIDAR) or laser radar (LADAR).
• Limitation: the shortest time interval the detector can resolve limits the minimum observable distance.
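
A minimal sketch of the range equation d = (v × t)/2 in Python; the speed-of-light constant and the example round-trip time are illustrative.

```python
V_LIGHT = 2.998e8  # approximate speed of light in air, m/s

def tof_distance(round_trip_time_s):
    """Range from a pulsed-laser round-trip time: d = (v * t) / 2."""
    return V_LIGHT * round_trip_time_s / 2.0

print(tof_distance(66.7e-9))  # a ~66.7 ns round trip -> roughly 10 m
# The limitation above, in numbers: a detector resolving no better than
# 1 ns cannot distinguish distances closer than about 15 cm.
```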
Triangulation-based Sensors
• Triangulation-based active range sensors are a category of
active sensors that determine the distance to an object by
measuring the angle and position of the reflected signal.
• Triangulation is a geometric principle used to calculate
distances by forming a triangle between the sensor, the object,
and the point of reflection.
• These sensors are widely used in various applications, including
industrial automation, 3D scanning, robotics, and more.
Triangulation-based Sensors
• The camera and light source are calibrated, and the beam follows a predetermined (known) scanning path.
• The projected ray strikes a scene point (X, Y, Z); the point is observed at pixel (i, j) in the camera, and (u, v) indexes the ray from the light source.
• Apply triangulation between the projected ray and the camera's line of sight to get the 3-D point.
Source: Yi-Chih Hsieh, Decoding structured light patterns for three-dimensional imaging systems, Pattern Recognition 34 (2001) 343-349.
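
A minimal worked example of the triangulation geometry, assuming the baseline between light source and camera and the two ray angles are known (the function and parameter names are illustrative, not from any sensor API).

```python
import math

def triangulate_depth(baseline_m, alpha_rad, beta_rad):
    """Perpendicular distance of the lit point from the sensor baseline.

    alpha is the beam's projection angle and beta the camera's viewing
    angle, both measured from the baseline joining source and camera.
    """
    return baseline_m / (1.0 / math.tan(alpha_rad) + 1.0 / math.tan(beta_rad))

# Example: 10 cm baseline, both rays at 60 degrees to the baseline
print(triangulate_depth(0.10, math.radians(60), math.radians(60)))  # ~0.087 m
```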
Imaging Principle
• Scan the ray along a predetermined, calibrated path, encoding the 3-D position of the projected ray.
• Each scene point (X, Y, Z) illuminated by ray (u, v) is observed at pixel (i, j) in the camera; triangulation then recovers its 3-D position.
Structured Light Sensors
• These sensors project a structured pattern (such as grids or
stripes) of light onto an object and use the deformation of the
pattern on the object's surface to calculate distance.
• They are commonly used in 3D scanning and industrial
metrology.
Structured Light
• Project a stripe or pattern, encoding the 3-D position of each projected ray.
• Get the 3-D positions of all the scene points lying on the projected stripe: each illuminated point (X, Y, Z) is observed at pixel (i, j), while (u, v) identifies its position in the projected pattern.
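
One common way to realize this decoding is ray-plane triangulation: each stripe corresponds to a calibrated plane of light, and intersecting a camera ray with that plane yields the 3-D point. The sketch below assumes a pinhole camera and a hypothetical calibrated stripe plane; all numeric values are illustrative.

```python
import numpy as np

def intersect_ray_with_light_plane(ray_dir, plane_n, plane_d):
    """Intersect a camera ray X = t * ray_dir with a stripe's light plane.

    The plane satisfies n . X + d = 0 in camera coordinates; projector
    calibration supplies (n, d) for every stripe in the pattern.
    """
    t = -plane_d / np.dot(plane_n, ray_dir)
    return t * ray_dir  # the 3-D scene point on the stripe

# Camera ray through pixel (i, j) under a pinhole model
f, cx, cy = 500.0, 320.0, 240.0
i, j = 200, 400
ray = np.array([(j - cx) / f, (i - cy) / f, 1.0])

# Hypothetical calibrated plane for one projected stripe
print(intersect_ray_with_light_plane(ray, np.array([0.8, 0.0, -0.6]), 1.2))
```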
Range Data Segmentation
• Range data segmentation is the task of segmenting (dividing) a range
image, an image containing depth information at each pixel, into
segments (regions) such that all points of the same surface belong to the
same region, different regions do not overlap, and the union of the
regions covers the entire image.
• The goal of range data segmentation is to identify and isolate
objects, obstacles, or distinct regions within the sensor's field of
view for further analysis and decision-making.
There have been two main approaches to the range segmentation problem:
region-based range segmentation and edge-based range segmentation.
1. Region-based range segmentation
• Region-based range segmentation algorithms can be further categorized into two major groups:
parametric model-based range segmentation algorithms and region-growing algorithms.
• Algorithms of the first group are based on assuming a parametric surface model and grouping data
points so that all of them can be considered as points of a surface from the assumed parametric
model (an instance of that model).
• Region-growing algorithms start by segmenting an image into initial regions. These regions are
then merged or extended using a region-growing strategy (see the sketch below). The initial
regions can be obtained by different methods, including iterative or random methods. A drawback
of algorithms in this group is that they generally produce distorted boundaries, because
segmentation is usually carried out at the region level rather than the pixel level.
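
A minimal region-growing sketch on a synthetic range image: regions grow over a simple depth-similarity test, whereas the algorithms described above typically also compare surface normals or fit parametric surfaces. The tolerance and test image are illustrative.

```python
import numpy as np
from collections import deque

def region_grow(depth, tol=0.05):
    """Label a range image by growing regions of similar depth.

    A neighbouring pixel joins the current region when its depth differs
    from the current pixel's by less than tol.
    """
    labels = np.zeros(depth.shape, dtype=int)
    next_label = 0
    for seed in np.ndindex(depth.shape):
        if labels[seed]:
            continue
        next_label += 1
        labels[seed] = next_label
        queue = deque([seed])
        while queue:
            r, c = queue.popleft()
            for nr, nc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)):
                if (0 <= nr < depth.shape[0] and 0 <= nc < depth.shape[1]
                        and not labels[nr, nc]
                        and abs(depth[nr, nc] - depth[r, c]) < tol):
                    labels[nr, nc] = next_label
                    queue.append((nr, nc))
    return labels

# Two fronto-parallel surfaces at different depths yield two regions
img = np.ones((6, 6)); img[:, 3:] = 2.0
print(region_grow(img))
```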
2. Edge-based range segmentation
• Edge-based range segmentation algorithms are based on edge
detection and labeling edges using the jump boundaries
(discontinuities).
• They apply an edge detector to extract edges from a range image.
Once boundaries are extracted, edges with common properties are
clustered together.
• The segmentation procedure starts by detecting discontinuities using
zero-crossing and curvature values.
• The image is segmented at discontinuities to obtain an initial
segmentation.
• At the next step, the initial segmentation is refined by fitting quadratic
surfaces whose coefficients are estimated using the least-squares method.
• In general, a drawback of edge-based range segmentation algorithms is
that although they produce clean and well defined boundaries between
different regions, they tend to produce gaps between boundaries.
• In addition, for curved surfaces, discontinuities are smooth and hard to
locate, so these algorithms tend to under-segment the range image.
Although the range image segmentation problem has been studied for many
years, segmenting range images of curved surfaces satisfactorily remains
an open problem.
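
The following is a minimal stand-in for the jump-boundary (discontinuity) detection step, thresholding depth differences between neighbouring pixels; real implementations use the zero-crossing and curvature tests described above, and the threshold here is illustrative.

```python
import numpy as np

def jump_edges(depth, thresh=0.1):
    """Mark jump (depth-discontinuity) boundaries in a range image.

    A pixel is an edge when the depth difference to a neighbour
    exceeds thresh.
    """
    gy = np.abs(np.diff(depth, axis=0, prepend=depth[:1, :]))
    gx = np.abs(np.diff(depth, axis=1, prepend=depth[:, :1]))
    return np.maximum(gx, gy) > thresh

depth = np.ones((5, 5)); depth[:, 2:] = 1.5  # a depth step at column 2
print(jump_edges(depth).astype(int))
```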
Range Image Registration
Range image registration is the process of aligning and combining multiple range images,
which may be captured from different viewpoints or at different times, to create a single,
coherent 3D model. The key steps involved in range image registration include:
1. Feature Extraction: Extract features or keypoints from the range images. These
features serve as distinctive points that can be matched across different images.
2. Feature Matching: Match corresponding features across pairs of range images to establish
correspondences that constrain their relative pose (position and orientation).
3. Pose Estimation: Calculate the transformation (usually translation and rotation) that
aligns the range images accurately. Common algorithms for this purpose include
Iterative Closest Point (ICP) and its variants.
4. Global Registration: Combine the relative transformations to register all range images
into a common global coordinate system.
5. Refinement: Fine-tune the alignment to minimize any residual errors or misalignments.
6. Data Fusion: Merge the registered range images into a single, unified 3D model.
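
A minimal point-to-point ICP sketch with NumPy and SciPy illustrates steps 2-3: nearest-neighbour matching followed by a closed-form SVD pose estimate per iteration. Production variants add outlier rejection and convergence tests.

```python
import numpy as np
from scipy.spatial import cKDTree

def icp(src, dst, iters=20):
    """Align source points onto destination points (both (N, 3) arrays)."""
    R, t = np.eye(3), np.zeros(3)
    tree = cKDTree(dst)
    cur = src.copy()
    for _ in range(iters):
        _, idx = tree.query(cur)                # nearest-neighbour matches
        matched = dst[idx]
        mu_s, mu_d = cur.mean(0), matched.mean(0)
        H = (cur - mu_s).T @ (matched - mu_d)   # cross-covariance
        U, _, Vt = np.linalg.svd(H)
        D = np.diag([1, 1, np.linalg.det(Vt.T @ U.T)])  # proper rotation
        R_step = Vt.T @ D @ U.T
        t_step = mu_d - R_step @ mu_s
        cur = cur @ R_step.T + t_step           # apply the incremental pose
        R, t = R_step @ R, R_step @ t + t_step  # accumulate the total pose
    return R, t

# Example: recover a small known rotation applied to a random cloud
rng = np.random.default_rng(1)
P = rng.normal(size=(200, 3))
c, s = np.cos(0.1), np.sin(0.1)
R_true = np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])
R_est, t_est = icp(P, P @ R_true.T + np.array([0.2, 0.0, 0.0]))
```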
Model Acquisition:
Model acquisition is the process of creating a 3D model from the registered range images. This representation
can take the form of a point cloud, a mesh, or other 3D data structures. The typical steps in model acquisition
include:
• Point Cloud Generation: Convert the registered range images into a 3D point cloud, where each point
corresponds to a 3D coordinate in space and includes depth information from the range data.
• Mesh Generation: In some applications, a 3D mesh is generated from the point cloud, often consisting of
interconnected triangles. This mesh can provide a more detailed and structured representation of the 3D
model.
• Texture Mapping: Apply color or texture information to the 3D model, typically using images or textures
captured in conjunction with the range data.
• Mesh Simplification: For real-time rendering and storage efficiency, reduce the complexity of the 3D
model by simplifying the mesh while preserving important geometric features.
• Texture Projection: Project textures from the original images onto the 3D model to enhance its appearance
and realism.
• Post-Processing: Perform various data cleaning, noise reduction, hole filling, and optimization tasks to
ensure that the 3D model is suitable for its intended application.
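
As a hedged sketch of the point-cloud-to-mesh steps, the following assumes the Open3D library is installed; the random points stand in for registered range data, and the parameter values (normal-estimation radius, Poisson depth, target triangle count) are illustrative, not tuned recommendations.

```python
import numpy as np
import open3d as o3d  # assumes Open3D is installed

points = np.random.rand(2000, 3)           # stand-in for registered range data
pcd = o3d.geometry.PointCloud()
pcd.points = o3d.utility.Vector3dVector(points)

# Normals are required by Poisson surface reconstruction
pcd.estimate_normals(
    search_param=o3d.geometry.KDTreeSearchParamHybrid(radius=0.1, max_nn=30))

# Mesh generation, then simplification for storage/rendering efficiency
mesh, _ = o3d.geometry.TriangleMesh.create_from_point_cloud_poisson(pcd, depth=8)
mesh = mesh.simplify_quadric_decimation(target_number_of_triangles=5000)
```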
Object Recognition
• Object recognition is a computer vision technique for identifying
objects in images or videos. Object recognition is a key output
of deep learning and machine learning algorithms.
• When humans look at a photograph or watch a video, we can
readily spot people, objects, scenes, and visual details.
• The goal is to teach a computer to do what comes naturally to
humans: to gain a level of understanding of what an image
contains.
Object recognition is a key technology behind driverless cars, enabling them to recognize a stop sign or
to distinguish a pedestrian from a lamppost. It is also useful in a variety of applications such as disease
identification in bioimaging, industrial inspection, and robotic vision.
Object recognition typically consists of several key steps:
1.Object Detection:
• Localization: Determine the location and extent of objects within an image. This is often
done by drawing bounding boxes around the objects.
• Class Labeling: Assign a label or category to each detected object (e.g., "car," "person,"
"dog").
2. Feature Extraction: Extract distinctive features from the detected objects. Common features
include color, texture, shape, and key points.
3. Feature Representation: Transform the extracted features into a suitable format for further
analysis. This often involves creating feature vectors or descriptors that capture the essential
characteristics of the object.
4. Classification: Use machine learning algorithms, such as deep neural networks (e.g.,
CNNs), support vector machines (SVMs), or decision trees, to classify objects based on their
feature representations.
5. Recognition and Decision-Making: Determine the identity of recognized objects based on
the classification results. This may involve associating objects with known object categories or
labels.
6. Post-Processing: Enhance the recognition results by applying additional techniques, such
as non-maximum suppression to remove redundant bounding boxes or smoothing techniques
to improve object tracking.
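
The non-maximum suppression mentioned in step 6 can be sketched directly in NumPy; the boxes, scores, and IoU threshold below are illustrative inputs.

```python
import numpy as np

def nms(boxes, scores, iou_thresh=0.5):
    """Non-maximum suppression over (N, 4) boxes given as (x1, y1, x2, y2).

    Keeps the highest-scoring box and discards overlapping detections
    whose IoU exceeds iou_thresh.
    """
    x1, y1, x2, y2 = boxes.T
    areas = (x2 - x1) * (y2 - y1)
    order = scores.argsort()[::-1]            # best score first
    keep = []
    while order.size:
        i = order[0]
        keep.append(i)
        xx1 = np.maximum(x1[i], x1[order[1:]])
        yy1 = np.maximum(y1[i], y1[order[1:]])
        xx2 = np.minimum(x2[i], x2[order[1:]])
        yy2 = np.minimum(y2[i], y2[order[1:]])
        inter = np.clip(xx2 - xx1, 0, None) * np.clip(yy2 - yy1, 0, None)
        iou = inter / (areas[i] + areas[order[1:]] - inter)
        order = order[1:][iou <= iou_thresh]  # drop redundant boxes
    return keep

boxes = np.array([[10, 10, 50, 50], [12, 12, 52, 52], [100, 100, 140, 140]], float)
scores = np.array([0.9, 0.8, 0.7])
print(nms(boxes, scores))  # keeps the best of the two overlapping boxes
```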
How Object Recognition Works
• You can use a variety of approaches for object recognition.
Recently, techniques in machine learning and deep
learning have become popular approaches to object recognition
problems. Both techniques learn to identify objects in images,
but they differ in their execution.
Object Recognition Using Deep Learning
• Deep learning techniques have become a popular method for
doing object recognition. Deep learning models such as
convolutional neural networks, or CNNs, are used to
automatically learn an object’s inherent features in order to
identify that object.
• For example, a CNN can learn to identify differences between
cats and dogs by analyzing thousands of training images and
learning the features that make cats and dogs different.
There are two approaches to performing object recognition using deep
learning:
• Training a model from scratch: To train a deep network from scratch, you gather a very large
labeled dataset and design a network architecture that will learn the features and build the model.
The results can be impressive, but this approach requires a large amount of training data, and you
need to set up the layers and weights in the CNN.
• Using a pretrained deep learning model: Most deep learning applications use the transfer
learning approach, a process that involves fine-tuning a pretrained model. You start with an existing
network, such as AlexNet or GoogLeNet, and feed in new data containing previously unknown
classes. This method is less time-consuming and can provide a faster outcome because the model
has already been trained on thousands or millions of images.
Deep learning offers a high level of accuracy but requires a large amount of data to make accurate
predictions.
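
A minimal sketch of the pretrained-model approach, assuming PyTorch and torchvision are available. ResNet-18 is used here as the example backbone; the AlexNet or GoogLeNet networks mentioned above are handled the same way.

```python
import torch.nn as nn
from torchvision import models

# Start from a pretrained CNN and fine-tune only a new classification head
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
for param in model.parameters():
    param.requires_grad = False          # freeze the pretrained features

num_new_classes = 5                      # previously unknown classes
model.fc = nn.Linear(model.fc.in_features, num_new_classes)
# Training then proceeds as usual, updating only model.fc's weights.
```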
Machine learning techniques for object
recognition
• Machine learning techniques are also popular for object
recognition and offer different approaches than deep learning.
Common examples of machine learning techniques are:
• HOG feature extraction with an SVM machine learning model
• Bag-of-words models with features such as SURF and MSER
(Maximally Stable Extremal Regions)
• The Viola-Jones algorithm, which can be used to recognize a
variety of objects, including faces and upper bodies
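
The first of these, HOG features with an SVM, can be sketched with scikit-image and scikit-learn; the random images and labels below are stand-ins for a real labelled dataset.

```python
import numpy as np
from skimage.feature import hog
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
images = rng.random((20, 64, 64))        # stand-in grayscale image windows
labels = rng.integers(0, 2, size=20)     # binary object / non-object labels

# Extract a HOG descriptor per image window
features = np.array([
    hog(img, orientations=9, pixels_per_cell=(8, 8), cells_per_block=(2, 2))
    for img in images
])

clf = LinearSVC().fit(features, labels)  # train the SVM on HOG features
print(clf.predict(features[:3]))         # classify new windows
```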
Machine Learning vs. Deep Learning for
Object Recognition
• The main consideration to keep in mind when choosing
between machine learning and deep learning is whether you
have a powerful GPU and lots of labeled training images.
• If the answer to either of these questions is No, a machine
learning approach might be the best choice.
• Deep learning techniques tend to work better with more images,
and a GPU helps to decrease the time needed to train the
model.
Thank you!