Digital Image Processing -2
Digital Image Processing -2
Intensity transformations are techniques used to enhance images by adjusting pixel intensity values.
These transformations are commonly applied to improve contrast, highlight specific details, or
correct brightness variations. The three fundamental intensity transformation techniques are Image
Negatives, Log Transform, and Gamma Transformation.
1. Image Negatives
Definition:
The negative transformation inverts the intensity values of an image, making bright regions dark and
dark regions bright. This is particularly useful for enhancing details in dark regions of an image.
Mathematical Representation:
s=L−1−rs = L - 1 - rs=L−1−r
Where:
• LLL = maximum intensity value (e.g., 255 for an 8-bit grayscale image)
Effect on Image:
2. Log Transformation
Definition:
Log transformation is used to enhance low-intensity pixels while compressing high-intensity pixel
values. This helps in displaying images with a wide range of intensity variations.
Mathematical Representation:
Where:
Effect on Image:
Example:
• If an image has many dark details, applying a log transformation brightens them, making
them visible.
Definition:
Gamma correction is used to adjust the brightness of an image based on a non-linear power
function. It is particularly useful for display calibration and image enhancement.
Mathematical Representation:
s=crγs = c r^\gammas=crγ
Where:
Effect on Image:
• If γ<1\gamma < 1γ<1 → Brightens the image (useful for dark images).
• If γ>1\gamma > 1γ>1 → Darkens the image (useful for overexposed images).
Example:
• Medical imaging applications use gamma transformation for enhancing X-ray and MRI
scans.
Conclusion
These intensity transformation techniques modify pixel values to improve image visibility and extract
details. Each method has specific applications based on whether the image needs contrast
enhancement, brightness correction, or detail enhancement in dark areas.
You said:
Make it descriptive for 5 marker question what is homomorphic filtering? Explain with mathematical
analysis. How does it enhance images?
ChatGPT said:
Introduction:
Where:
The illumination component mainly contributes to low frequencies, while the reflectance
component contains high-frequency details. Homomorphic filtering separates and enhances these
components by suppressing low frequencies (illumination) and amplifying high frequencies
(reflectance).
1. Log Transformation
To separate the multiplicative components into additive components, we take the logarithm:
Now, the illumination and reflectance components are additive, allowing frequency-domain filtering.
2. Fourier Transform
The log-transformed image is then converted into the frequency domain using the Fourier Transform
(FT):
After filtering, the inverse Fourier transform is applied to get the modified log-transformed image:
5. Exponential Transformation
f′(x,y)=exp[g′(x,y)]f'(x,y) = \exp[g'(x,y)]f′(x,y)=exp[g′(x,y)]
This results in an enhanced image where the effects of uneven illumination are minimized, and fine
details are more visible.
Conclusion:
Homomorphic filtering is an advanced image enhancement technique that operates in the frequency
domain. By reducing illumination variations and highlighting finer details, it significantly improves
image clarity and contrast, making it useful in medical imaging, forensics, and industrial quality
control.
You said:
Make it descriptive for 5 marker question Compare Ideal, Butterworth and gaussian filters in the
frequency domain. Also their mathematical analysis.
ChatGPT said:
1. Ideal Filter
Definition:
The Ideal Filter allows all frequency components within a cutoff radius and completely removes all
others. It is a sharp-cutoff filter but suffers from artifacts due to sudden transitions in the frequency
domain.
Mathematical Expression:
H(u,v)={1,D(u,v)≤D00,D(u,v)>D0H(u,v) = \begin{cases} 1, & D(u,v) \leq D_0 \\ 0, & D(u,v) > D_0
\end{cases}H(u,v)={1,0,D(u,v)≤D0D(u,v)>D0
H(u,v)={0,D(u,v)≤D01,D(u,v)>D0H(u,v) = \begin{cases} 0, & D(u,v) \leq D_0 \\ 1, & D(u,v) > D_0
\end{cases}H(u,v)={0,1,D(u,v)≤D0D(u,v)>D0
Characteristics:
2. Butterworth Filter
Definition:
The Butterworth Filter provides a smooth transition between low and high frequencies, avoiding the
sharp cutoffs of the Ideal Filter. The transition is controlled by the order nnn of the filter, where a
higher order makes the cutoff sharper.
Mathematical Expression:
For a low-pass Butterworth filter (BLPF) of order nnn and cutoff D0D_0D0:
3. Gaussian Filter
Definition:
The Gaussian Filter provides the smoothest transition between frequencies, eliminating artifacts
caused by sudden changes. It is widely used due to its natural smoothing behavior.
Mathematical Expression:
H(u,v)=e−D2(u,v)2D02H(u,v) = e^{-\frac{D^2(u,v)}{2D_0^2}}H(u,v)=e−2D02D2(u,v)
H(u,v)=1−e−D2(u,v)2D02H(u,v) = 1 - e^{-\frac{D^2(u,v)}{2D_0^2}}H(u,v)=1−e−2D02D2(u,v)
Characteristics:
Comparison Table
Mathematical
Simple Moderate Complex (exponential)
Complexity
High (Gibbs
Artifacts (Ringing) Less None
phenomenon)
Conclusion:
• Ideal filters are simple but introduce ringing artifacts due to sharp transitions.
• Butterworth filters offer a balance between sharpness and smoothness, reducing artifacts.
• Gaussian filters provide the smoothest transition, making them the best choice for blurring,
noise reduction, and gradual enhancement.
Thus, Gaussian filters are preferred for practical image processing applications where a natural,
smooth transition is required.
You said:
ChatGPT said:
Introduction:
Gray Level Slicing is a point processing technique used in digital image processing to enhance
specific ranges of gray levels within an image while suppressing others. This technique is particularly
useful for highlighting important features such as tumors in medical images, roads in satellite
images, or specific textures in industrial applications.
An image is composed of various gray levels (intensities), ranging from 0 (black) to 255 (white) in an
8-bit grayscale image. In some cases, only certain intensity values are of interest, while others are not
relevant. Gray level slicing helps to emphasize or suppress specific intensity values in an image.
• In this approach, pixels within the specified gray-level range are enhanced (set to a high
intensity), while all other pixels are completely suppressed (set to black).
• This method is useful when the background is not important and only the targeted region
needs to be highlighted.
Mathematical Representation:
where:
• This ensures that the image retains some of its original details, while the important features
are still highlighted.
Mathematical Representation:
where the background pixels remain unchanged, and only the required intensities are enhanced.
✔ Medical Imaging: Highlighting tumors or specific tissue regions in X-ray or MRI scans.
✔ Remote Sensing: Extracting roads, water bodies, or vegetation from satellite images.
✔ Industrial Inspection: Identifying defects or cracks in manufactured products.
✔ Biometric Recognition: Enhancing specific facial features for face detection and recognition.
Conclusion:
Gray level slicing is a powerful enhancement technique that helps in emphasizing important details
while suppressing irrelevant information. It is widely used in medical, satellite, and industrial
imaging to extract meaningful information from grayscale images efficiently.
You said:
Make it descriptive for 5 marker question What is significance of Fourier Transform in image
processing? Discuss Bit Level slicing?
2/2
ChatGPT said:
Introduction
The Fourier Transform (FT) is a fundamental mathematical tool used in image processing to analyze
and manipulate images in the frequency domain. It helps in breaking down an image into its
sinusoidal components, making it useful for filtering, compression, and enhancement.
• High-pass filters enhance edges and fine details by removing low-frequency components.
3. Image Compression
• Used in applications like medical imaging, astronomy, and satellite image enhancement.
• The Fast Fourier Transform (FFT) is an efficient algorithm that speeds up the Fourier
Transform computation.
For a continuous image function f(x,y)f(x,y)f(x,y), the 2D Fourier Transform is given by:
The Inverse Fourier Transform is used to convert back to the spatial domain:
Where:
Definition
Bit Plane Slicing is a technique used to analyze and enhance images by separating them into
different binary planes. Each pixel in an 8-bit grayscale image is represented by 8 bits (0 to 255), and
these bits can be extracted as separate images.
Concept
• The Least Significant Bit (LSB) contains finer, often noisy details.
Consider a grayscale image with pixel intensity 135 (binary: 10000111). The bit planes are:
7 (MSB) 1
6 0
5 0
4 0
3 0
2 1
1 1
0 (LSB) 1
By selecting only MSB or a combination of significant bits, we can enhance the main features of an
image while ignoring less important details.
Conclusion
• Fourier Transform plays a key role in image filtering, enhancement, and compression by
converting images into the frequency domain.
• Bit Plane Slicing helps in feature extraction and noise reduction by decomposing an image
into individual binary planes.
• Both techniques are widely used in medical imaging, satellite image processing, and
security applications.