L1 CVIP Introduction En-2020
L1 CVIP Introduction En-2020
Learning outcome:
outcome:
Knowledge:
• current state and development trends of computer vision, basic principles and
concepts of image processing and video streams, including color models and
hardware solutions;
• main stages, approaches, algorithms for processing and converting digital
images, applications, main methods and operations of digital image processing.
Skills:
• implement in software form the methods and algorithms for digital image
processing, including preprocessing, background removal, binarization,
smoothing;
• apply in software form the methods of filtering, segmentation and recognition of
objects in digital images.
Competences:
• perform a comparative analysis and classification of methods and algorithms
for computer vision, use algorithmic tools for solving the problems of
processing and analysis of digital images.
3
COMPUTER VISION and IMAGE PROCESSING
Main parameters of study course:
Course volume: 4 c.p. (6 ECTS)
160 academic hours (48 contact hours),
lectures – 24 hours,
laboratory and practical works – 24 hours,
independent student’s work – 112 hours.
Assessment: Laboratory works in the team – (20%),
Home work – (20%),
Final exam – (60%).
Requirements: Discipline, attendance, attentiveness,
fulfillment of tasks on time, independent creative work.
4
COURSE CONTENT
Part-Time,
No Themes of classroom lessons (volume, hours) Distant
Introduction. General characteristics of the current state, trends and the main
1 stages of digital image processing. 4
Methods and means of obtaining and displaying digital images. Basics of
2 digital representation of images. Discretization and quantization of images, 4
color models.
Pre-processing, background, noise, binarization. Luminance operations,
3 spatial filtering. 6
Filtering in the frequency domain. Compression, restoration and
4 reconstruction of digital images. Codecs. Watermarks. 6
Morphological image processing and cellular automata. The “deep learning”
5 approach and artificial neural networks (ANN). 4
Algorithms of segmentation. Representation and description of images.
6 Signatures of the objects. 6
Identification of objects in the image. Aerial photography and satellite
7 imagery. Fractal representation. Stereo data processing and 3D 4
reconstruction.
OCR problem. Recognition of printed and handwritten texts. Recognition of
8 car ID numbers (state plates) and bar codes. 4
9 Human biometry. Identification of the face, fingerprints, hands, iris, veins. 6
10 Recognition of moving objects in video surveillance systems. 4
Total: 48
5
Basic literature/sources for studies:
1. Gonzalez, R.C. and Woods, R.E. Digital Image
Processing, 3rd edition, Pearson Prentice Hall, 2008,
976 p.
2. Davies, E. Machine Vision: Theory, Algorithms,
Practicalities. 3nd Edition, Elsevier, 2004, 934 p.
3. Gonzalez, R. C., Woods, R. E. and Eddins, S. L. Digital
Image Processing Using MATLAB ®, 2nd edition,
Gatesmark Publishing, 2009, 827 p.
4. Szeliski, R. Computer Vision. Algorithms and
Applications. Springer-Verlag, London, 2011, 812 p.
5. Haralick, R., Shapiro, L. Computer and Robot Vision.
Addison-Wesley Publishing Company, 1992, 673 p.
6
Theme 1. INTRODUCTION TO IMAGE PROCESSING
An important role in the development of modern technology is played by the
methods of processing graphic information. They provide improved images
for their best perception, compression of video data for storage and
transmission through the communication channels, as well as analysis,
recognition and interpretation of visual images for decision making and
control the behavior of autonomous systems.
In contrast to the vision of people who are able to perceive electromagnetic radiation
only in the visible range, machine image processing covers almost the entire
electromagnetic spectrum from gamma radiation to radio waves. Image processing
and machine vision are related fields.
Image processing
is defined as a
discipline where
images are
present at the
input and output
of the process.
Machine vision is a computer simulation of human vision, including learning, the ability
to inference and act on the basis of observable information (imitation of human
intellectual activity).
9
Key Terms and Definitions (2)
Image recovery is also an increase in the visual quality of the image. In contrast to improvement
(criteria are subjective), image restoration is objective based on mathematical or probabilistic
models of image distortion (resampling, distortion correction, noise filtering and removal,
contrast, smoothing, sharpening).
11
Main stages of digital image processing (2)
Image compression is the reducing the volume of memory needed to store an image, or
narrowing the channel bandwidth required for its transmission. Compression is reflected in the
names of graphic files, for example, * .jpg (Joint Photographic Experts Group - JPEG), * .gif, * .avi,
etc.
Image segmentation splits an image into its component parts or objects. Automatic segmentation
is one of the most difficult tasks of digital image processing. Overly detailed segmentation
requires a long solution path, requiring the identification of objects separately. Insufficiently
detailed or incorrect segmentation will inevitably lead to errors at the recognition stage. The more
accurate the segmentation, the greater the chances of success at recognition.
Representation and description immediately follow the segmentation stage, the output of which is
usually only data on the pixels that either form the boundary of the region (i.e., a lot of pixels are
given that separate one area of the image from another) or represent all points of the regions
themselves.
Forming of description is the selection of features (signatures), it is associated with the selection
of attributes that would express the information of interest in a quantitative form or could serve as
the basis for distinguishing classes of objects (classification) and further for identification.
12