NUS MA4268 Ch1
NUS MA4268 Ch1
NUS MA4268 Ch1
by
Hui Ji
Chapter 1
Background
1.1
Overview
Figure 1.1: The human visual system can interpret the subtle variation in transparency, shading in this image, foreground flower and blurred background
CHAPTER 1. BACKGROUND
1. low-level vision are essentially digital image processing. Its input and
output are both digital images.
2. middle-level vision has the digital images as the input and low-level
symbolic representation of image content as the output, e.g., representation of object contour.
3. high-level vision take either digital images or low-level symbolic content
as the input and high-level symbolic representation as the output,
which tries to simulate the high level of human visual perception, e.g.,
3D structure of the world, object recognition.
1.2
CHAPTER 1. BACKGROUND
1.3
The first step in digital image analysis is often the detection of image features, e.g., edges, lines, feature points. These low-level image features should
be robust to image noise and also they should be robust to various environmental changes, dependent on the subsequent image analysis tasks. One
fundamental problem in image analysis is the segmentation. The goal of
image segmentation algorithms is to partition an image into multiple segmentations such that the image can be represented in a more meaningful
way. Image segmentation typically is used to locate objects and their boundaries. The result of image segmentation is a set of segments and pixels in
each segment are similar with respect to some characteristic property, e.g.,
color, texture.
Object recognition is another important topic in digital image analysis.
The goal of object recognition is to find a given object in an image or
video sequence. Human visual system can recognize objects in image with
ease, despite the fact that the look of the same object in different images
could vary somewhat in different views, in different sizes or even partially
obstructed from the view. The main challenges in object recognition is to
develop representation scheme for images to match the reliability of human
perception.
(a)
(b)
1.4
Computer vision
human visual system perceive the 3D structure of the world around us from
2D imagery. In parallel, computer vision focus on how to recover the 3D
shape and appearance of objects from 2D images. Computer vision has
been used in a wide range of applications, which includes: machine inspection, 3D model building, medical imaging, motion capture in animation,
surveillance, etc. While all these applications are extremely important, they
mostly pertains to fairly specified imagery and narrow domain. The current
vision techniques are far away from the dream of having a machinery which
interpret an image at the same level as human beings.
The main reason why why vision is so difficult is because it is an inverse
CHAPTER 1. BACKGROUND
(a)
(b)
Figure 1.7: Some optical illusions and what they might tell us about the
human visual system.
problem which is to recover some unknowns given in-sufficient information to
fully specify the solution. The solver to this inverse problem therefore need
accurate model to disambiguate between solutions. However, the model of
the visual world is overwhelmingly difficult considering the rich complexity
of the real world.
1.5
Image processing has traditionally been an area in applied mathematics research as image transforms play an essential role in most image processing
tasks. Most image transforms used in the theoretical and implementational
tools are liner ones, notably in image filtering, restoration, encoding and
analysis. The basic transforms are Fourier transform with a long history
and wavelet transform which have been popular in last decades. Also, Variational principle and PDE-based approaches see their widely applications
in-developed to solve many image processing tasks, e.g., image segmentation. Many algorithms in image processing and computer vision are tackling
inverse problems to revert image formation processes, which are often closely
related to the optimization techniques.
This note is to give a rigorous treatment of various important mathematical topics which have wide applications. The main math topics will be
covered in this note include:
1. Spectral analysis and digital filtering
2. Gabor transform and wavelets