CVR Unit 2
CVR Unit 2
Linear Filters: Linear filters, commonly used in digital signal processing and
image processing, operate by applying a linear transformation to an input signal
or image to produce an output.
They are designed to emphasize or reduce certain aspects of the signal or image,
such as smoothing, sharpening, edge detection, or noise reduction.
A linear filter processes each point in a signal (or pixel in an image) as a weighted
sum of its neighbours.
The output of a linear filter is directly proportional to the input: for any inputs
x1 and x2 with scalars a and b H(a.x1 + b.x2) = a.H(x1) + b.H(x2), where H is the
filter operation.
Types of Linear Filters:
1. Low-Pass Filters: These filters allow low-frequency components to pass
while attenuating high-frequency components. They are used for
smoothing or blurring an image, reducing noise or minor detail.
2. High-Pass Filters: These filters pass high-frequency components while
attenuating low-frequency components, emphasizing edges and fine
details.
3. Band-Pass Filters: These filters allow a specific range of frequencies to
pass while blocking frequencies outside that range, which is useful for
applications that need to isolate certain frequency bands.
Convolution Operation
𝑦[n] = ∑ 𝑥 [𝑛 − 𝑘 ]. ℎ[𝑘 ]
𝑘=−𝑀
where x is the input signal, h is the filter kernel, and y is the output.
In a 2D image, convolution for a pixel located at (i, j) is:
𝑀 𝑁
Limitations:
Advantages:
Noise Reduction: Smoothing filters, like the Gaussian filter, reduce noise
by averaging pixel values.
Edge Detection: High-pass and edge-detecting filters, such as Sobel and
Laplacian, highlight transitions, helping in object detection and boundary
extraction.
Image Sharpening: High-frequency components are emphasized,
enhancing fine details.
Data Smoothing in Signal Processing: Low-pass filters are applied to
reduce random noise in time-series data, useful in speech and audio
processing.
Applications of Convolution:
These systems are defined by two key properties: linearity and time (or
shift) invariance.
They are mathematically predictable, stable under certain conditions, and
have straightforward responses to inputs, making them essential in
system analysis and design.
Advantages of Shift-Invariant Linear Systems:
Signal Processing: LTI systems are used to design filters that remove
noise, enhance certain frequencies, or separate signal components.
Control Systems: LTI models allow engineers to predict system behaviour,
design stable control systems, and ensure desired performance.
Communications: In communication systems, LTI filters manage
frequency bands, extract signals from noisy environments, and equalize
channels.
Image Processing: LTI properties enable convolution-based filters to
smooth, sharpen, or detect edges in images consistently.
Sampling Rate (Frequency): This is the rate at which samples are taken
from the continuous signal. Measured in samples per second, or Hertz
(Hz).
Sampling Interval (Period): The reciprocal of the sampling rate, it’s the
time between consecutive samples, denoted as Ts = 1/fs
Quantization: After sampling, the analog values are often quantized,
meaning they’re rounded to the nearest level in a discrete set of values
(usually due to limited resolution in digital systems).
Types of Sampling:
Applications of Sampling:
Audio Processing: Analog audio signals are sampled to create digital audio
files, which are then stored and processed. Common sampling rates for
audio are 44.1 kHz (CD quality) and 48 kHz (professional audio).
Image Processing: An image is sampled both horizontally and vertically.
Higher sampling rates (i.e., more pixels) result in better image quality.
Telecommunications: Analog signals are sampled to create digital signals,
allowing them to be transmitted and processed more effectively in digital
communication systems.
Medical Imaging: In devices like MRI or CT scanners, signals are sampled
to create digital images of the body.
This typically occurs when the sampling rate is too low to accurately
capture the high-frequency content of a continuous signal, causing higher
frequencies to "fold" into the lower frequencies and create misleading
representations.
When a continuous signal is sampled, the discrete samples capture
information about the signal’s frequency content.
However, if the sampling rate is insufficient (below twice the highest
frequency of the signal), certain high-frequency components will overlap
or "fold back" into lower frequencies, distorting the signal.
This effect makes it challenging to distinguish the original frequencies
from the alias frequencies, causing errors in signal analysis and
reconstruction.
Types of Aliasing:
Types of Template Filters: Filters used as templates can be broadly divided into
categories based on the feature they’re designed to match:
Edge Detection:
Sources of Noise: Noise can enter a system through various stages in signal
processing, with primary sources being:
Mean and Variance: Mean measures the average noise level, while
variance measures the spread (or power) of noise. In Gaussian noise,
mean is zero, and variance determines the noise intensity.
Probability Distribution: Defines the likelihood of various noise
amplitudes occurring. Common distributions include Gaussian, Poisson,
and uniform.
Power Spectral Density (PSD): Describes how noise power is distributed
across different frequencies. White noise has a constant PSD across all
frequencies, while coloured noise has frequency-dependent
characteristics.
Temporal and Spatial Properties: Noise can vary over time (temporal
noise) or across space (spatial noise), such as in a video or image sequence
where noise fluctuates frame by frame.
Edge detection simplifies the analysis of images by reducing data and preserving
structural properties, helping in tasks like:
Accuracy: The ability of the detector to identify true edges and avoid false
detections.
Localization: The accuracy of edge placement, ensuring edges are
detected close to their true positions.
Sensitivity to Noise: How well the method performs in the presence of
noise, which can produce false edges.
Computational Efficiency: How quickly the method processes an image,
essential in real-time applications.
Analysis and Synthesis Using Oriented Pyramids: Analysis and Synthesis Using
Oriented Pyramids is a technique in image processing used to analyse and
reconstruct images by decomposing them into multiple levels, each capturing
different orientations and spatial scales.
1. Multi-Resolution Analysis:
2. Multi-Orientation Analysis:
1. Decomposition:
The image is filtered and down sampled to create multiple levels, similar
to a standard pyramid.
At each level, the image is also decomposed into different orientations
using directional filters (e.g., Gabor filters or steerable filters), resulting in
orientation subbands.
These subbands capture information at different directions, such as edges
and textures aligned with each specific orientation.
This structure results in a pyramid where each level has multiple
subbands, each corresponding to a particular orientation.
Common filters for oriented pyramids include steerable filters and Gabor
filters, both of which are effective in isolating specific orientations.
Steerable Filters: These can be "steered" or rotated to match different
orientations, making them versatile in capturing specific directional
information.
Gabor Filters: These filters are also tuned to particular frequencies and
orientations and are commonly used in texture analysis due to their
resemblance to human visual receptive fields.
1. Feature Extraction:
2. Pattern Recognition:
3. Multi-Resolution Analysis:
1. Reconstruction:
2. Texture Synthesis:
3. Image Compression:
1. Local Models:
Start with an initial patch or sample, which can be a single pixel or a small
patch taken from the source image.
This patch serves as the foundation, from which the texture is
progressively synthesized.
For each new patch, find a region in the source texture with the highest
similarity to the neighbourhood of the region being synthesized.
The selection process involves comparing neighbourhoods using a
similarity metric (like mean squared error) to find the best match.
Repeat patch selection and placement until the entire target texture or
image is filled.
If required, hierarchical structures can be synthesized by applying this
process from larger to smaller patches, or by progressively filling in
smaller details.
Types of Textures and Models for Sampling
1. Stochastic Textures:
2. Structured Textures:
3. Procedural Models:
Shape from Texture: Shape from Texture is a technique in computer vision and
image processing used to infer the 3D shape or surface geometry of an object
from variations in texture observed in a 2D image.
1. Texture Gradients:
2. Perspective Distortion: