
1. Gabor filters
Gabor filters are linear filters used for texture analysis, edge detection, and feature extraction
in image processing, particularly in computer vision. They are named after Dennis Gabor, who
introduced them in 1946. Gabor filters are particularly effective for capturing spatial
frequency, orientation, and phase information from an image, which are essential for
detecting textures and patterns.

Key Concepts:

1. Spatial Frequency: Gabor filters are tuned to specific spatial frequencies, which
means they are good at detecting textures with specific repetitive patterns, like
stripes or grids.

2. Orientation: Gabor filters can be oriented at different angles, making them useful for
detecting edges and patterns in a specific direction, such as horizontal, vertical, or
diagonal.

3. Localization in Space and Frequency: The Gabor filter offers a good balance between
localization in both the spatial and frequency domains. It helps capture local image
features, making it highly useful in tasks like edge detection.

Mathematical Definition:

A Gabor filter is essentially a sinusoidal wave modulated by a Gaussian envelope. The 2D
Gabor filter is defined by the following equation:

G(x, y) = exp( −(x′² + γ² y′²) / (2σ²) ) · cos( 2π x′ / λ + ψ )

where x′ = x cos θ + y sin θ and y′ = −x sin θ + y cos θ. Here λ is the wavelength of the
sinusoidal component, θ is the orientation of the filter, ψ is the phase offset, σ is the
standard deviation of the Gaussian envelope, and γ is the spatial aspect ratio.

Applications in Computer Vision:

1. Texture Analysis: Gabor filters are often used to analyze the texture of an image by
decomposing it into different frequency and orientation components.
2. Edge Detection: They are highly effective in detecting edges and line features because
they are sensitive to both orientation and frequency.
3. Face Recognition: Gabor filters are widely used in face recognition to extract features
such as eyes, nose, and mouth by focusing on local patterns.
4. Object Detection: By convolving an image with Gabor filters at different scales and
orientations, key features of objects in the image can be detected.
5. Fingerprint Recognition: In biometric systems, Gabor filters are used to extract local
ridge features from fingerprints.

Example of Gabor Filter Response:

When an image is convolved with a Gabor filter, the output emphasizes regions of the image
that match the frequency and orientation of the filter, suppressing other features. For
example, if the filter is tuned to detect vertical lines, it will highlight vertical edges in the
image.
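
As a small, self-contained illustration of this behaviour, the sketch below builds a synthetic
image of vertical stripes and compares the responses of a vertically tuned (theta = 0 in
OpenCV's convention) and a horizontally tuned Gabor filter. The parameter values and image
size are chosen only for demonstration.

import cv2
import numpy as np

# Synthetic 128x128 test image made of vertical stripes with a period of 8 pixels
x = np.arange(128)
row = ((np.sin(2 * np.pi * x / 8) > 0) * 255).astype(np.float32)
image = np.tile(row, (128, 1))

# Two Gabor kernels with the same wavelength (8 px) but different orientations
vertical_kernel = cv2.getGaborKernel((21, 21), 4.0, 0, 8.0, 0.5, 0, ktype=cv2.CV_32F)
horizontal_kernel = cv2.getGaborKernel((21, 21), 4.0, np.pi / 2, 8.0, 0.5, 0, ktype=cv2.CV_32F)

resp_v = cv2.filter2D(image, cv2.CV_32F, vertical_kernel)
resp_h = cv2.filter2D(image, cv2.CV_32F, horizontal_kernel)

# The vertically tuned filter gives a much stronger average response on this pattern
print(np.abs(resp_v).mean(), np.abs(resp_h).mean())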

Why Gabor Filters Are Useful:

• Biological Relevance: Gabor filters mimic the receptive fields of simple cells in the
human visual cortex, making them biologically inspired and efficient for visual
processing tasks.
• Feature Extraction: They can capture both low-level features (e.g., edges) and mid-
level features (e.g., textures) in an image.
• Robustness: Gabor features are robust against illumination changes and slight
geometric distortions, which is important in tasks like face and texture recognition.
In summary, Gabor filters are a powerful tool in computer vision for extracting local features
from an image, particularly when orientation and frequency are important for analyzing
texture, edges, and patterns.

Q: Explain the concept of Gabor filters. How do they help in texture and edge detection in
computer vision?

Q: Describe the role of spatial frequency, orientation, and phase in Gabor filters. Why is Gabor
filtering considered biologically inspired?

Q: What is the significance of the Gaussian envelope in Gabor filters? How does it affect the
filter's response in the spatial domain?

Q: Derive the 2D Gabor filter equation and explain each term in detail. How does each
parameter (wavelength, orientation, phase, etc.) affect the filter’s behavior?

Q: Given a Gabor filter with specific parameters (e.g., wavelength, orientation), show
mathematically how it detects certain frequency components and suppresses others.

Q: Consider a sinusoidal input image pattern. Analyze how a Gabor filter will respond to this
pattern, providing a step-by-step mathematical explanation.

Q: Explain how Gabor filters are used in face recognition systems. What features do they help
extract, and why are these features important for recognition?

Q: Design a set of Gabor filters for an image processing task, such as fingerprint recognition
or texture classification. Justify your choice of parameters (e.g., scale, orientation).

Q: How can Gabor filters be utilized in real-time object detection tasks? Propose a method for
extracting relevant features using Gabor filtering.

Q: Compare and contrast Gabor filters with wavelet transforms for image feature extraction.
Which method is better suited for multi-resolution analysis, and why?

Q: Discuss the limitations of Gabor filters in computer vision. How do modern deep learning
methods address these limitations?

Q: How can Gabor filters be integrated into convolutional neural networks (CNNs)? What
benefits and challenges arise from such integration?

Q: Explain how Gabor filter banks are constructed and how they can be used to extract
features from an image at different orientations and scales.
Q: Implement a Gabor filter in MATLAB (or Python) for edge detection in a given image. Explain
the steps and provide the code.

Q: Given a set of images, explain how Gabor filters can be applied to perform texture
classification. How would you evaluate the performance of this approach?

Q: What are the different methods to visualize the response of a Gabor filter on an image?
Explain with examples of images.

Q: Design an experiment to analyze the performance of Gabor filters for detecting specific
edges (e.g., horizontal, vertical) in a noisy image. What metrics would you use to measure
performance?

Q: Given a noisy image dataset, how would you modify the Gabor filter parameters (e.g.,
bandwidth, orientation) to achieve robust texture recognition?

Gabor Implementation Code:

import cv2
import numpy as np

# Load the input image and convert it to grayscale
image = cv2.imread('input_image.jpg')
grayscale_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

# Parameters for the Gabor filter
wavelength = 4.0      # wavelength (lambda) of the sinusoidal component
sigma = 1.0           # standard deviation of the Gaussian envelope
aspect_ratio = 0.5    # spatial aspect ratio (gamma)
phase = 0             # phase offset (psi)

# Apply Gabor filters at multiple orientations
orientations = [0, np.pi/4, np.pi/2, 3*np.pi/4]
gabor_images = []

for theta in orientations:
    gabor_kernel = cv2.getGaborKernel((21, 21), sigma, theta, wavelength,
                                      aspect_ratio, phase, ktype=cv2.CV_32F)
    # ddepth = -1 keeps the output at the same (8-bit) depth as the input
    filtered_image = cv2.filter2D(grayscale_image, -1, gabor_kernel)
    gabor_images.append(filtered_image)

# Combine the filtered images (pixel-wise maximum) to get the final edge map
final_edge_map = np.max(gabor_images, axis=0)

# Apply thresholding to highlight edges
_, binary_edges = cv2.threshold(final_edge_map, 128, 255, cv2.THRESH_BINARY)

# Display the edge-detected image
cv2.imshow('Edges', binary_edges)
cv2.waitKey(0)
cv2.destroyAllWindows()

# Optionally, save the result
cv2.imwrite('edge_detected_image.jpg', binary_edges)
2. Support Vector Machine (SVM)
The Support Vector Machine (SVM) is a supervised machine learning algorithm commonly
used for classification tasks in computer vision, such as object detection, face recognition, and
image classification. The SVM algorithm finds a hyperplane that best separates data points
into different classes. For computer vision tasks, SVMs often operate on features extracted
from images, such as those generated by methods like HOG (Histogram of Oriented
Gradients), SIFT, or Bag of Features (BoF).

Algorithm Overview:

1. Input:
A set of labelled training data (x1, y1), (x2, y2), …, (xn, yn), where
xi is the feature vector corresponding to an image (or a patch of an image), and
yi ∈ {+1, −1} is the label of that feature vector for a binary classification problem.

2. Objective:
Find the optimal hyperplane that maximally separates the data points of different
classes in the feature space.

✓ Steps of the SVM Algorithm in Computer Vision:

Step 1: Feature Extraction


Before applying SVM, image data is transformed into feature vectors using one of the feature
extraction methods, such as:

✓ HOG: Computes gradient orientations over local regions of the image.


✓ SIFT: Extracts scale-invariant features.
✓ BoF: Represents an image as a histogram of visual words.

Let each image I be represented by a feature vector xi ∈ Rd, where d is the dimension
of the feature space (e.g., the number of HOG bins or visual words).

Step 2: SVM Training

The core of SVM is to find a linear decision boundary (hyperplane) that maximizes the margin
between two classes.

The decision function is:

f(x) = sign(w · x + b)

where w ∈ Rd is the weight vector normal to the hyperplane and b ∈ R is the bias
(intercept) term.

The optimal hyperplane maximizes the margin between the two classes. The margin is the
distance between the hyperplane and the closest data points from either class, known as
support vectors.
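
As a quick illustration (with hypothetical values for w and b chosen only for this example),
the snippet below shows how the decision function assigns a class label to a feature vector:

import numpy as np

# Hypothetical learned parameters for a 2-D feature space
w = np.array([2.0, -1.0])   # weight vector (normal to the hyperplane)
b = -0.5                    # bias term

def classify(x):
    # The class label is the sign of the decision function f(x) = w . x + b
    return 1 if np.dot(w, x) + b >= 0 else -1

print(classify(np.array([1.0, 0.5])))   # f(x) = 2.0 - 0.5 - 0.5 = 1.0  -> +1
print(classify(np.array([0.0, 1.0])))   # f(x) = -1.0 - 0.5 = -1.5      -> -1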

Step 3: Define the Optimization Problem

The goal of SVM is to find w and b that maximize the margin, which is equivalent to minimizing
the following objective function:

minimize (1/2) ||w||²

subject to the following constraints for all training data points i:

yi (w · xi + b) ≥ 1,

where yi = +1 for positive examples and yi = −1 for negative examples.

The constraint ensures that data points are correctly classified and fall on the correct side of
the hyperplane.

Step 4: Introducing Soft Margin for Non-Separable Data

In many real-world applications, the data may not be perfectly linearly separable. To handle
this, we introduce slack variables ξi ≥0 to allow some misclassifications.

The optimization problem becomes:

minimize (1/2) ||w||² + C Σi ξi

subject to:

yi (w · xi + b) ≥ 1 − ξi  and  ξi ≥ 0 for all i,

where:

ξi are slack variables that measure the degree of misclassification.


C is a regularization parameter that controls the trade-off between maximizing the margin and
minimizing the classification error.

Step 5: Solving the Optimization Problem

SVM is typically solved in its dual form using Lagrange multipliers. The dual form is given by:

maximize Σi αi − (1/2) Σi Σj αi αj yi yj (xi · xj)

subject to:

0 ≤ αi ≤ C for all i, and Σi αi yi = 0,

where the αi are the Lagrange multipliers.

After solving for the αi, the weight vector w is computed as:

w = Σi αi yi xi

Only the support vectors have non-zero αi, so the decision boundary depends only on them.

Step 6: Kernel Trick for Non-Linear Data

For non-linearly separable data, SVM can be extended using the kernel trick, which replaces
the inner product xi · xj in the dual form with a kernel function K(xi, xj). This implicitly maps
the input data to a higher-dimensional feature space where a linear separation is possible. A
common choice in computer vision is the RBF (Gaussian) kernel, K(xi, xj) = exp(−γ ||xi − xj||²).
Step 7: Classification of New Images

For a new image (or image patch), feature extraction is first performed to generate a feature
vector xtest. The SVM classifier then computes

ypred = sign(w · xtest + b)

(or, in the kernel formulation, sign(Σi αi yi K(xi, xtest) + b)) and assigns the image to the
positive or negative class accordingly.
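
The following sketch puts these steps together for binary image classification using HOG
features and scikit-learn's SVC. The file names, labels, and parameter values (C, kernel) are
placeholders chosen only for illustration.

import cv2
import numpy as np
from sklearn.svm import SVC

# Hypothetical training images and their binary labels (+1 / -1)
train_paths = ['pos_01.jpg', 'neg_01.jpg']
train_labels = [1, -1]

hog = cv2.HOGDescriptor()                      # default 64x128 detection window

def hog_features(path):
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    img = cv2.resize(img, (64, 128))           # match the HOG window size
    return hog.compute(img).flatten()

X_train = np.array([hog_features(p) for p in train_paths])
y_train = np.array(train_labels)

# Soft-margin SVM with an RBF kernel; C controls the margin/error trade-off
clf = SVC(kernel='rbf', C=1.0, gamma='scale')
clf.fit(X_train, y_train)

# Step 7: classify a new image from its feature vector
x_test = hog_features('test_image.jpg').reshape(1, -1)
print(clf.predict(x_test))                     # prints +1 or -1
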
1. Explain the concept of SVM in machine learning.
2. What is the objective of SVM in classification tasks?
3. How does the margin affect the performance of SVM?
4. What is a support vector in the context of SVM?
5. How does SVM handle linearly separable and non-linearly separable data?
6. What is the kernel trick, and how is it used in SVM?
7. Explain different types of kernel functions used in SVM.
8. Which kernel is commonly used in computer vision tasks, and why?
9. How do you choose the kernel function for SVM in a computer vision application?
10. What is the optimization objective of the SVM algorithm?
11. Explain the dual form of SVM and its significance.
12. How do slack variables work in soft-margin SVM?
13. Write down the mathematical formulation for SVM with soft margin.

14. How does regularization parameter C affect the decision boundary in SVM?
15. How is the dual form of SVM solved?

16. Describe the role of SVM in image classification tasks.

17. What are the advantages and disadvantages of using SVM for object detection in
images?
18. How can SVM be used for face recognition?

19. How does SVM perform compared to other classifiers in image processing tasks?

20. What are some challenges when using SVM for real-time object detection?

21. What is the role of SVM in the Bag of Features (BoF) model?

22. Explain the process of using SVM for multi-class image classification.

23. What are the advantages of using SVM over deep learning methods in some
computer vision tasks?
24. How is SVM used in conjunction with feature extraction methods like HOG and
SIFT for image classification?
25. How can SVM be modified to handle large-scale image datasets in computer vision?


3. Bag of Features:

Bag of Features (BoF) is a widely used technique in computer vision for representing and
classifying images. It's an adaptation of the Bag of Words (BoW) model from text processing,
where an image is described as a collection of local features without considering their spatial
arrangement. This method is popular for tasks like object recognition, image classification, and
scene understanding.

1. Feature Extraction
In the context of BoF, the first step is to extract key features from an image. These
features represent significant patterns or points in the image, such as corners, edges, or
textures. Popular algorithms for extracting these features include:

• SIFT (Scale-Invariant Feature Transform): Extracts key points that are invariant to
scaling, rotation, and small distortions.
• SURF (Speeded Up Robust Features): A faster alternative to SIFT, useful for
detecting blob-like features.
• ORB (Oriented FAST and Rotated BRIEF): A more efficient feature extractor
for real-time processing.

Each key point or region is represented by a feature descriptor: a numerical vector
that encodes information about that region of the image.

2. Building the Vocabulary (Codebook)

• After extracting features from multiple images, the next step is to create a
vocabulary or codebook. This is done using clustering algorithms like k-means
to group similar feature descriptors together.
• The resulting cluster centroids represent the "visual words" or "codewords."
These are analogous to words in the Bag of Words model for text.
• For example, if we want to create a codebook with 100 visual words, we would
cluster the feature descriptors into 100 groups. Each centroid represents one
visual word.

3. Histogram Representation (Bag of Features)

• For each image, after constructing the vocabulary, the feature descriptors of the
image are mapped to the nearest visual words from the codebook.
• A histogram is created where each bin represents a visual word, and the value
in the bin represents how many times a feature corresponding to that visual word
appears in the image. This histogram is the Bag of Features representation of
the image.
• This representation is orderless, meaning it does not consider the spatial
arrangement of features within the image, just their frequency.

4. Classification

• Once the Bag of Features representation is created, machine learning classifiers (like
Support Vector Machines (SVMs), Random Forests, or k-Nearest Neighbors (k-
NN)) can be used to train a model for image classification.
• For training, labeled data (images with known categories) is used, and the classifier
learns to associate patterns in the histogram representations with specific image
categories.

5. Limitations

• Loss of Spatial Information: BoF ignores the spatial relationships between features,
which can be critical in recognizing objects in structured scenes.
• Computationally Intensive: Extracting features and building a vocabulary can be
time-consuming, especially with large datasets.
• Sensitivity to Feature Choice: The performance of the BoF model depends heavily on
the choice of feature extraction algorithm and the size of the codebook.

6. Extensions

• Spatial Pyramid Matching (SPM): A common extension that adds a degree of spatial
information to the BoF model. It divides an image into regions and computes
histograms at multiple levels of resolution, providing some spatial context.
• Deep Learning Alternatives: In modern computer vision tasks, convolutional neural
networks (CNNs) have largely replaced traditional methods like BoF, as CNNs can
learn hierarchical features and capture spatial relationships more effectively.

Workflow of Bag of Features in Image Classification:


✓ Feature Extraction: Extract key points and descriptors using SIFT or ORB.
✓ Vocabulary Construction: Use k-means clustering to create a set of visual
words.
✓ Histogram Representation: For each image, compute the histogram based
on the frequency of visual words.
✓ Classification: Train a classifier using the histograms of training images
and apply it to classify new images.

In summary, the Bag of Features approach allows images to be represented as distributions
of local visual elements, abstracting them into a format that can be easily processed by
standard machine learning algorithms. However, with the advent of deep learning, the BoF
model has seen decreased usage for tasks where more advanced methods can achieve better
performance.

1. Algorithm for Bag of Features

Step 1: Feature Extraction

The first step is to extract local features (keypoints and descriptors) from the image.
Popular feature extraction methods include SIFT (Scale-Invariant Feature Transform),
SURF (Speeded Up Robust Features), and ORB (Oriented FAST and Rotated BRIEF).

Step 2: Build the Vocabulary (Codebook)


Next, a visual vocabulary is constructed by clustering the feature descriptors from all
the training images into a predefined number of clusters. Each cluster is treated as a
visual word, and the set of cluster centers forms the codebook.

Step 3: Represent Images as Histograms

For each image, we now map its descriptors to the closest visual words in the
vocabulary. Each image is then represented as a histogram of visual word occurrences.
Step 4: Classification

Once each image is represented as a histogram of visual words, it can be fed into a
classifier for the final recognition or classification task. Common classifiers include:

Support Vector Machines (SVMs): Used to classify the histograms into different
object categories.

Step 5: Testing and Image Classification

For an unseen test image, we:


1. Extract the feature descriptors.
2. Map the descriptors to the nearest visual words from the precomputed
vocabulary.
3. Represent the image as a histogram of visual words.
4. Use the trained classifier to predict the class of the image based on its histogram.
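
A minimal end-to-end sketch of this pipeline is given below, using ORB descriptors, k-means
from scikit-learn for the vocabulary, and an SVM classifier. The image paths, labels, and
vocabulary size (50 visual words) are placeholders for illustration; in practice the vocabulary
is built from many training images.

import cv2
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

orb = cv2.ORB_create()
n_words = 50                                  # size of the visual vocabulary (assumption)

def descriptors(path):
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    _, des = orb.detectAndCompute(img, None)
    return des if des is not None else np.empty((0, 32), dtype=np.uint8)

# Hypothetical training set: image paths and integer class labels
train_paths = ['img_01.jpg', 'img_02.jpg']
train_labels = [0, 1]

# Steps 1-2: extract descriptors from all training images and cluster them into visual words
all_des = np.vstack([descriptors(p) for p in train_paths]).astype(np.float32)
kmeans = KMeans(n_clusters=n_words, random_state=0).fit(all_des)

# Step 3: represent each image as a normalized histogram of visual-word occurrences
def bof_histogram(path):
    words = kmeans.predict(descriptors(path).astype(np.float32))
    hist, _ = np.histogram(words, bins=np.arange(n_words + 1))
    return hist / max(hist.sum(), 1)

X_train = np.array([bof_histogram(p) for p in train_paths])

# Step 4: train a classifier on the histograms
clf = SVC(kernel='rbf', C=1.0).fit(X_train, train_labels)

# Step 5: classify an unseen test image
print(clf.predict([bof_histogram('test_image.jpg')]))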

4. Matching and Recognition in Computer Vision

In computer vision, matching and recognition are two fundamental tasks that are often
intertwined, particularly in object detection, image retrieval, and facial recognition.

1. Matching in Computer Vision

Matching refers to the process of finding correspondences between two sets of visual
elements, such as key points, features, or objects, in different images. It is typically used
in scenarios where we want to align, compare, or find similarities between visual data.

Key Concepts in Matching:

Feature Matching:

➢ This is the most common type of matching in computer vision. The goal is to
match feature descriptors from one image to another. For example, in SIFT or
SURF algorithms, keypoints are detected and described in terms of their visual
features, and matching involves finding pairs of keypoints from two different
images that have similar descriptors.
➢ The process usually involves computing the distance (e.g., Euclidean distance)
between the feature descriptors of two points, and the points with the smallest
distance are considered matches.
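
A minimal feature-matching sketch along these lines is shown below. It uses ORB keypoints
with a brute-force matcher and the Hamming distance (the appropriate metric for ORB's
binary descriptors; for SIFT/SURF descriptors the Euclidean distance would be used instead).
The file names are placeholders.

import cv2

# Hypothetical pair of images of the same scene from different viewpoints
img1 = cv2.imread('scene_view1.jpg', cv2.IMREAD_GRAYSCALE)
img2 = cv2.imread('scene_view2.jpg', cv2.IMREAD_GRAYSCALE)

# Detect keypoints and compute binary descriptors in both images
orb = cv2.ORB_create()
kp1, des1 = orb.detectAndCompute(img1, None)
kp2, des2 = orb.detectAndCompute(img2, None)

# Brute-force matching; crossCheck keeps only mutually best matches
bf = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
matches = sorted(bf.match(des1, des2), key=lambda m: m.distance)

# Visualize the 30 closest correspondences
result = cv2.drawMatches(img1, kp1, img2, kp2, matches[:30], None)
cv2.imwrite('matches.jpg', result)
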
Image Matching:

Image matching goes beyond individual features and tries to find correspondences
between entire images or significant parts of images. This is often used in template
matching or image stitching. For example, in panorama stitching, images of the same
scene from different viewpoints are matched by detecting and aligning common
features.

Keypoint Matching:

➢ Keypoint matching is the process of finding corresponding keypoints in two images. For
example, in stereo vision or structure from motion (SfM), keypoint matching is used to
estimate the depth of objects or to build a 3D model of the scene.
➢ Algorithms like RANSAC (Random Sample Consensus) are often used to
eliminate incorrect or outlier matches.

Applications of Matching:

➢ Image stitching (e.g., panorama creation).


➢ Object tracking across frames in video.
➢ 3D reconstruction using images from multiple angles.
➢ Image registration, where two images of the same scene taken under different
conditions are aligned.
➢ Stereo vision, for estimating depth from two images.

2. Recognition in Computer Vision

Recognition refers to the task of identifying objects, patterns, or features in images or videos.
It involves determining the identity or category of the detected objects based on prior
knowledge (e.g., through trained models or templates). Recognition is a more general and
often more complex task compared to matching, as it involves understanding the semantic
content of the image.

Key Concepts in Recognition

• Object Recognition:

➢ The task of identifying and classifying objects in an image or video. This can include
recognizing a car, person, animal, or any other object from a known set of categories.
➢ Modern object recognition algorithms rely heavily on deep learning methods, particularly
Convolutional Neural Networks (CNNs). These networks learn from large amounts of
labeled data to recognize specific objects in unseen images.

• Facial Recognition:

➢ A specific type of recognition where the goal is to identify or verify a person’s identity
by analyzing their face. This is widely used in security systems, social media, and
mobile authentication.
➢ Traditional methods for face recognition involve techniques like Eigenfaces or
Fisherfaces, while modern systems use deep learning models like FaceNet or
DeepFace to represent faces as vectors in high-dimensional spaces.

• Scene Recognition:

➢ Recognizing entire scenes (e.g., beach, forest, street) rather than individual objects. This
requires understanding both the context and the relationships between objects in the image.

• Pattern Recognition:

➢ Recognizing regularities or patterns in data, such as textures, shapes, or repeated elements
in images. Pattern recognition is often used in tasks like handwriting recognition or medical
image analysis.

Recognition Techniques

Template Matching:
➢ A simple form of recognition where a pre-defined template (a small part of an image)
is compared to regions of a new image to find a match. This method works well for
controlled environments but is not robust to changes in scale, rotation, or occlusion.
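
A small template-matching sketch using OpenCV's matchTemplate is shown below. The image
and template file names and the score threshold are placeholders for illustration.

import cv2

# Hypothetical search image and a small template patch cropped from a reference image
image = cv2.imread('scene.jpg', cv2.IMREAD_GRAYSCALE)
template = cv2.imread('template.jpg', cv2.IMREAD_GRAYSCALE)
h, w = template.shape

# Slide the template over the image; normalized cross-correlation scores each location
response = cv2.matchTemplate(image, template, cv2.TM_CCOEFF_NORMED)
_, max_val, _, max_loc = cv2.minMaxLoc(response)

# Mark the best-matching region if its score is high enough
if max_val > 0.8:                              # threshold is an assumption
    top_left = max_loc
    bottom_right = (top_left[0] + w, top_left[1] + h)
    cv2.rectangle(image, top_left, bottom_right, 255, 2)
cv2.imwrite('template_match.jpg', image)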

Machine Learning and Deep Learning Models:

➢ Support Vector Machines (SVMs): Classical machine learning models for recognition
tasks, particularly effective when combined with hand-crafted features (e.g., SIFT, HOG).
➢ Neural Networks (CNNs): Deep learning models that automatically learn features
from data and are highly effective for recognition tasks, especially when trained on
large datasets like ImageNet.
➢ Transfer Learning: Using pre-trained models on large datasets and fine-tuning them
for specific recognition tasks, which has become a popular approach for tasks with
limited data.
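
The sketch below illustrates the transfer-learning idea using PyTorch/torchvision. The number
of classes, learning rate, and dummy batch are placeholders, and older torchvision versions
use pretrained=True instead of the weights argument; this is an illustrative sketch, not a
complete training script.

import torch
import torch.nn as nn
from torchvision import models

# Load a network pre-trained on ImageNet
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze the pre-trained feature extractor
for param in model.parameters():
    param.requires_grad = False

# Replace the final fully-connected layer for a hypothetical 5-class recognition task
num_classes = 5
model.fc = nn.Linear(model.fc.in_features, num_classes)

# Fine-tune only the new layer
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

# One illustrative training step on a dummy batch (replace with a real DataLoader)
images = torch.randn(8, 3, 224, 224)
labels = torch.randint(0, num_classes, (8,))
optimizer.zero_grad()
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()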
