End Sem
End Sem
At each location, the sub-region is passed to a classifier (e.g., SVM, CNN) to determine
whether it contains the object.
Drawbacks: High computational cost, slow, and doesn’t handle scale changes well.
Instead of exhaustive sliding, these methods propose regions likely to contain objects.
These proposals are then passed to CNNs for classification and bounding box regression.
MTCNN, RetinaFace, and others use CNNs to detect faces at different scales.
Provide higher accuracy and robustness to pose, illumination, and occlusion variations.
2. Comparison of YOLO, SSD, and Faster R-CNN; Harris and Shi-Tomasi Corner Detection
Shi-Tomasi:
Hough Transform:
Transforms each point to parameter space (e.g., lines in polar form: ρ=xcosθ+ysinθ\rho = x\
cos\theta + y\sin\thetaρ=xcosθ+ysinθ).
Morphological Operations:
Erosion:
Dilation:
Thresholding:
Global Thresholding:
Adaptive Thresholding:
Region-Based Segmentation:
Opening:
Closing:
Viola-Jones:
YOLO:
Predicts bounding boxes and class probabilities directly from image in one pass.
Faster R-CNN:
Region Proposal Network (RPN) suggests regions.
Computer Vision:
Pixels:
Resolution:
Image Representation:
Stored as 2D or 3D arrays.
Image Formation:
Pinhole model, perspective projection, and lens distortions affect the image.
➤ Definition:
Filtering is the process of modifying or enhancing an image by emphasizing or removing certain
features like noise, edges, or textures.
➤ Types of Filters:
A. Linear Filters:
Mean Filter (Averaging): Reduces noise by replacing each pixel with the average of its
neighbors.
Gaussian Filter: Applies a weighted average using a Gaussian kernel. Smoothens image while
preserving edges better than mean filtering.
B. Non-linear Filters:
Median Filter: Replaces pixel value with the median of its neighborhood. Very effective in
removing salt-and-pepper noise.
➤ Example:
An image with salt-and-pepper noise can be cleaned using a median filter, which removes outliers
(black or white dots) while preserving edges.
➤ Definition:
➤ How It Works:
At each location, the sum of the element-wise product of the kernel and the overlapping
image region is computed.
➤ Mathematical Expression:
Where:
➤ Example:
CopyEdit
[ 0 -1 0
-1 5 -1
0 -1 0 ]
✅ 3. Edge Detection
➤ How it works:
Applies two 3×3 kernels: one for horizontal (GxG_xGx), one for vertical (GyG_yGy) gradients.
➤ Kernels:
text
CopyEdit
Gx = [ -1 0 1 Gy = [ -1 -2 -1
-2 0 2 0 0 0
-1 0 1 ] 1 2 1]
➤ Example:
Apply Sobel to detect roads in satellite images by emphasizing edges in horizontal and vertical
directions.
Used in medical imaging (e.g., MRI, X-rays) to detect boundaries of tissues or bones accurately.
✅ 4. Image Transformations
➤ Definition:
Transforms an image from spatial domain to frequency domain. Useful to analyze frequency
content.
➤ How it works:
➤ Use Cases:
Pattern recognition
➤ Example:
A fingerprint image with periodic noise can be denoised by applying Fourier Transform, masking
high-frequency components, and applying Inverse Fourier Transform.
➤ B. Hough Transform
➤ Definition:
➤ For Lines:
Each edge point votes in the accumulator space for possible lines passing through it.
➤ For Circles:
➤ Example:
Used in license plate detection or lane detection in autonomous driving by detecting straight lines
on the road.