Basic Concepts
Computer vision: Computer vision, as a scientific domain, aims to acquire, process, analyze and extract relevant information from images or image sequences in order to produce numerical or symbolic information, with the help of algorithmic tools. Typical tasks include:
• Localization
• Segmentation
• Classification
• Detection
Image Processing: Image processing aims to produce an image that is more advantageous for our purposes. It is often used to prepare images for further analysis or to help human users recognize crucial details more easily. The main steps of a typical pipeline:
• Image Acquisition
• Image correction
• Feature detection
• Decision
2. Imaging, how photo-diodes work, CCD and its variations, CMOS, (dis)advantages. Image structure, properties, and errors. Camera types.
A photodiode is a semiconductor device that converts light into an electrical current. The current is
generated when photons are absorbed in the photodiode.
CCD: A CCD (charge-coupled device) is an analog device; each cell accumulates an electric charge when photons are absorbed, and the charges are then shifted across the chip and read out.
CMOS: an image sensor in which every pixel has its own photodetector and readout circuitry; CMOS sensors have smaller dimensions, lower energy consumption, and are cheaper.
Cameras: There are three important types:
• Stereo cameras
• Depth cameras
• LIDAR
3. Important color spaces, basic advantages of certain color spaces and their use.
The most common color space stores the red, green and blue channels (abbreviated as RGB). Other widely used color spaces (a conversion sketch follows the list):
• YCbCr: Y represents the lightness (luma) of the given color, while the Cr and Cb channels describe its hue (chrominance).
• HSV/HSI/HSL: common to these representations is that the chrominance is coded with a hue and a saturation value.
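As an illustration, a minimal OpenCV sketch of converting an image between these color spaces (the file name is a placeholder; note that OpenCV loads images in BGR channel order):

import cv2

img = cv2.imread("photo.png")                     # placeholder file; loaded in BGR order
ycrcb = cv2.cvtColor(img, cv2.COLOR_BGR2YCrCb)    # luma (Y) + chroma (Cr, Cb) channels
hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)        # hue, saturation, value channels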
Image Correction
4. Intensity transforms (what does each do), histogram, its use, histogram operations.
The histogram describes the frequency of the intensity values in an image. The histogram can help us detect and correct defects of the image acquisition (e.g., under- or overexposed images).
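A minimal sketch of computing and using the histogram with OpenCV (the file name is a placeholder); histogram equalization is one standard histogram operation for correcting poorly exposed images:

import cv2

img = cv2.imread("photo.png", cv2.IMREAD_GRAYSCALE)      # placeholder grayscale image
hist = cv2.calcHist([img], [0], None, [256], [0, 256])   # 256-bin intensity histogram
equalized = cv2.equalizeHist(img)                        # spread intensities over 0..255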
5. Image noise types, convolutional filters. Smoothing, sharpening, edge detection filters,
recognize them from the weight matrix. Linear vs rank filters, which is good for what,
advantages.
Types of noise:
• Gaussian noise, which is a consequence of the noisy nature of the imaging sensor and the surrounding electronics.
• Salt-and-pepper noise, which occurs sparsely but changes the value of the affected pixels in a significant manner.
Convolutional filters: a small filtering window (the kernel) is slid over the image, and the value of each pixel is set to the result of the convolution of the kernel with the pixel and its neighborhood.
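A naive pure-NumPy sketch of this sliding-window computation (a "valid" convolution kept deliberately simple; real libraries use much faster implementations):

import numpy as np

def convolve2d(image, kernel):
    # Convolution flips the kernel before sliding it over the image.
    k = np.flipud(np.fliplr(kernel))
    kh, kw = k.shape
    h, w = image.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # Weighted sum of the pixel and its neighborhood.
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * k)
    return out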
Smoothing: each element of the kernel is non-negative, and the elements sum up to one. If the sum differs from one, then a brightening/darkening step also occurs besides the smoothing.
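For example, a 3x3 box (averaging) kernel, whose non-negative elements sum to one, applied with OpenCV (the file name is a placeholder):

import cv2
import numpy as np

img = cv2.imread("photo.png", cv2.IMREAD_GRAYSCALE)   # placeholder input
box = np.ones((3, 3), np.float32) / 9.0               # non-negative weights summing to 1
smoothed = cv2.filter2D(img, -1, box)                 # -1 keeps the input bit depth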
Sharpening: there are filters whose structure is similar to that of edge-detection filters (negative and positive elements), but whose elements sum up to 1, so the overall brightness is preserved while the edges are emphasized.
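A common example is the following sharpening kernel; its elements sum to 1, so brightness is kept while differences from the neighbors are amplified (the file name is a placeholder):

import cv2
import numpy as np

img = cv2.imread("photo.png", cv2.IMREAD_GRAYSCALE)   # placeholder input
sharpen = np.array([[ 0, -1,  0],
                    [-1,  5, -1],
                    [ 0, -1,  0]], np.float32)        # negative ring, positive center, sum = 1
sharpened = cv2.filter2D(img, -1, sharpen)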
Edge detection filter: in each position two differences are calculated (in the x and y directions), one between the pixel and its right neighbor and one between the pixel and its lower neighbor. The squared sum of the two gives a metric that characterizes how edge-like the pixel is.
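A minimal NumPy/OpenCV sketch of this difference-based edge metric (the file name is a placeholder):

import cv2
import numpy as np

img = cv2.imread("photo.png", cv2.IMREAD_GRAYSCALE).astype(np.float32)
dx_kernel = np.array([[1.0, -1.0]], np.float32)        # difference with the right neighbor
dy_kernel = np.array([[1.0], [-1.0]], np.float32)      # difference with the neighbor below
dx = cv2.filter2D(img, -1, dx_kernel)
dy = cv2.filter2D(img, -1, dy_kernel)
edge_strength = dx ** 2 + dy ** 2                      # squared sum: how edge-like the pixel is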
6. Edge detection, using first and second order derivatives, determine derivative order from
the filter matrix. Sobel and Prewitt operators, directionality. Idea of the Canny algorithm
and its steps.
Sobel and Prewitt operators: direction-dependent edge detectors; they compute a derivative in one direction while smoothing in the perpendicular direction (Sobel with Gaussian-like 1-2-1 weights, Prewitt with uniform weights).
Canny algorithm: it consists of several steps. First, the vertical and horizontal derivatives are calculated with simple derivative filters, and from them the norm (magnitude) and the direction of the image gradient are obtained. This is followed by non-maximum suppression along the gradient direction and by edge tracking with hysteresis (double) thresholding.
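A minimal OpenCV sketch of the Sobel gradients and of the complete Canny pipeline (the file name and the two thresholds are placeholder choices):

import cv2
import numpy as np

img = cv2.imread("photo.png", cv2.IMREAD_GRAYSCALE)

gx = cv2.Sobel(img, cv2.CV_32F, 1, 0, ksize=3)    # horizontal derivative (vertical edges)
gy = cv2.Sobel(img, cv2.CV_32F, 0, 1, ksize=3)    # vertical derivative (horizontal edges)
magnitude = np.sqrt(gx ** 2 + gy ** 2)            # gradient norm
direction = np.arctan2(gy, gx)                    # gradient direction

edges = cv2.Canny(img, 100, 200)                  # gradient + non-max suppression + hysteresis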
7. Basic operations of image math and their goal. Principle and algorithms of interpolation,
properties, (dis)advantages.
DEEP LEARNING
What is the difference between good old-fashioned artificial intelligence (GOFAI) and machine learning?
What is the structure of machine learning algorithms, what types does it have, and what important
things do we always have to keep in mind when we use them?
The input is denoted by x, the output by y, and the parameters by θ:
ŷ = f(x; θ)
Types of machine learning:
• Supervised learning: we need to build labeled databases, which costs time and money; moreover, the quality of the labels determines the quality of the result.
• Unsupervised learning: the goal of the algorithm is to explain the input with the help of a compact model.
• Reinforcement learning: the algorithm makes a sequence of decisions, but there is generally no feedback after each individual decision; instead, the feedback describes the quality of the whole sequence of steps taken.
How does the kNN algorithm work? What problems does it have? How does the Perceptron model
work and how can we interpret its outputs? What does the decision function of the Perceptron
look like?
kNN: a non-parametric method used for classification and regression. In both cases the input consists of the k closest training examples in the feature space; the output depends on whether kNN is used for classification (the neighbors vote on the label) or regression (the neighbors' values are averaged).
Problem: the neighbors decide the label of the image, and the image distance is, in the case of kNN, mostly defined as the absolute (pixel-wise) difference of intensities; such a distance reflects semantic similarity poorly, so visually unrelated images can end up as close neighbors.
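A minimal scikit-learn sketch of kNN classification; the data here are random placeholders standing in for flattened image vectors:

import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X_train = np.random.rand(100, 64)                 # placeholder "images" as feature vectors
y_train = np.random.randint(0, 3, size=100)       # placeholder labels for 3 classes

knn = KNeighborsClassifier(n_neighbors=5)         # k = 5 nearest neighbors
knn.fit(X_train, y_train)                         # "training" only stores the samples
pred = knn.predict(np.random.rand(1, 64))         # label chosen by majority vote of neighbors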
The Perceptron model: the working principle is that the pixels are arranged in a vector, which is multiplied by a weight matrix; the result has as many elements as the number of classes. Each element can be interpreted as an indicator of the degree of belonging to that class:
s = Wx
where x is the input vector, s the vector of class scores (degrees of belonging), and W the matrix of parameters.
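A minimal NumPy sketch of this scoring step (the dimensions and the random initialization are placeholders, e.g. a 32x32 RGB image flattened into 3072 values):

import numpy as np

num_classes, num_pixels = 10, 3072                # placeholder sizes (e.g. 32x32x3 image)
W = 0.01 * np.random.randn(num_classes, num_pixels)   # placeholder parameter matrix
x = np.random.rand(num_pixels)                    # input image ordered into a vector

s = W @ x                                         # one score per class
predicted_class = int(np.argmax(s))               # the class with the highest score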
SVM fundamentals, operation. Explanation of the method of determining the output for a new input.
What is the kernel function and how does it affect the SVM decision function?
SVM: the SVM algorithm, quite popular in computer vision, determines the separating hyperplane with the largest margin. For a new input x the decision function has the standard form f(x) = sign( Σ_i α_i y_i K(x_i, x) + b ), where the sum runs over the training (support) samples x_i with labels y_i and learned weights α_i.
Kernel function: the kernel function K is a similarity measure between the input to be classified and the training samples; this means that each training sample influences the decision in proportion to how similar the two data points are.
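A minimal scikit-learn sketch of training an SVM with an RBF kernel on placeholder data; prediction for a new input is the sign of the kernel-weighted sum over the support vectors:

import numpy as np
from sklearn.svm import SVC

X_train = np.random.rand(100, 64)                 # placeholder feature vectors
y_train = np.random.randint(0, 2, size=100)       # placeholder binary labels

svm = SVC(kernel="rbf", C=1.0)                    # RBF kernel as the similarity measure
svm.fit(X_train, y_train)
pred = svm.predict(np.random.rand(1, 64))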
What types of loss functions can we use to train the Perceptron? What is the idea behind the hinge
loss and what does it look like? Fundamentals of the cross-entropy loss, how can we get probabilities
on the output of the Perceptron, and how can we define the “real” class probabilities? What are the
(dis)advantages of the two?
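As a reference point for these questions, a minimal NumPy sketch of the multiclass hinge loss and the softmax cross-entropy loss on one sample (the score vector and the true-class index are placeholders):

import numpy as np

scores = np.array([2.0, 5.0, -1.0])               # placeholder Perceptron scores s = Wx
correct = 1                                       # placeholder index of the true class

# Hinge loss: penalize every wrong class that comes within a margin of 1 of the true score.
margins = np.maximum(0.0, scores - scores[correct] + 1.0)
margins[correct] = 0.0
hinge_loss = margins.sum()

# Cross-entropy: softmax turns scores into probabilities, then take -log of the true class.
probs = np.exp(scores - scores.max())
probs /= probs.sum()
cross_entropy_loss = -np.log(probs[correct])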
What is regularization and what types does it have? Why do we need to do it, and how is it
connected to overfitting?
Regularization: without regularization, multiplying the weight matrix by a constant larger than one can keep decreasing the value of the loss, so the weights tend towards infinity during training, causing numeric problems and resulting in an overconfident model. Regularization penalizes large weights and is our main tool against overfitting; a small sketch of the penalties follows the list below.
Types of regularization:
• L2
• L1
• Elastic Net (L1+L2)
• Dropout, Batch Normalization
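A minimal NumPy sketch of how the L1/L2 penalties are added to the data loss (the weight matrix, the data-loss value and the strength lambda are placeholders):

import numpy as np

W = np.random.randn(10, 3072)                     # placeholder weight matrix
lam = 1e-4                                        # placeholder regularization strength
data_loss = 0.5                                   # placeholder data loss (e.g. hinge or CE)

l2_penalty = lam * np.sum(W * W)                  # L2: penalizes large squared weights
l1_penalty = lam * np.sum(np.abs(W))              # L1: encourages sparse weights
total_loss = data_loss + l2_penalty               # or + l1_penalty, or both (Elastic Net)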
How does the gradient method work? Why do we use mini-batches, and how does the size of the batch affect the learning? How can we improve the efficiency of the gradient method (momentum, scaling), and how do these help?
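As a reference for this question, a minimal NumPy sketch of a mini-batch gradient step with momentum on a placeholder quadratic loss (the learning rate and momentum coefficient are arbitrary choices):

import numpy as np

def sgd_momentum_step(w, grad, velocity, lr=0.01, momentum=0.9):
    # The velocity accumulates a decaying sum of past gradients.
    velocity = momentum * velocity - lr * grad
    w = w + velocity                              # move the parameters along the velocity
    return w, velocity

w = np.zeros(5)
v = np.zeros(5)
for step in range(100):                           # each step would use one mini-batch
    grad = 2.0 * (w - 1.0)                        # gradient of the placeholder loss ||w - 1||^2
    w, v = sgd_momentum_step(w, grad, v)          # w converges towards 1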