8 Image Processing Fundamentals Full
An image of dimensions 32×21 (i.e., image width = 32 pixels, image height = 21 pixels)
{desmond,kccecia}@ust.hk COMP 2211 (Fall 2022) 7 / 100
Image Coordinate System
A specific pixel is specified by its coordinates (x,y) where x is increasing from left to right,
and y is increasing from top to bottom.
The origin (0,0) is in the top-left corner.
The following shows the coordinate system of digital images:
(0,0)         (1,0)         (2,0)         (3,0)         ···  (width-1,0)
(0,1)         (1,1)         (2,1)         (3,1)         ···  (width-1,1)
  ⋮             ⋮             ⋮             ⋮                   ⋮
(0,height-1)  (1,height-1)  (2,height-1)  (3,height-1)  ···  (width-1,height-1)
where width and height are the image width and image height, respectively.
Black: gray-level = 0
Dark gray: gray-level = 64
Medium gray: gray-level = 127
Light gray: gray-level = 190
White: gray-level = 255
imread() parameters:
fname: The image file to read: a filename or a file-like object opened in read-binary mode.
format: The image file format assumed for reading the data. If format is not given, the format is deduced from the
filename. If nothing can be deduced, PNG is tried.
Return value: numpy.array.
(M,N) for grayscale images
(M,N,3) for RGB images
(M,N,4) for RGBA images
PNG images are returned as float arrays (0-1). All other formats are returned as int arrays, with a bit depth
determined by the file’s contents.
URL: https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imread.html
Show Images in Colab
imshow() displays data as an image.
Return value: AxesImage, an image attached to an Axes.
URL: https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imshow.html
imsave() saves an array as an image file.
Parameters:
fname: a path or a file-like object to store the image in.
arr: The image data. The shape can be one of M×N (luminance), M×N×3 (RGB) or M×N×4 (RGBA).
The first two dimensions (M,N) define the rows and columns of the image.
URL: https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imsave.html
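A minimal round-trip sketch of the three functions above, assuming only matplotlib and numpy are available. To stay self-contained it builds a small synthetic image in memory and uses a file-like object instead of a filename:

```python
import io
import numpy as np
import matplotlib
matplotlib.use('Agg')  # headless backend, e.g. for scripts outside Colab
import matplotlib.pyplot as plt

img = np.zeros((21, 32, 3))       # height=21, width=32, RGB floats in [0,1]
img[:, :16] = [1.0, 0.0, 0.0]     # left half red
img[:, 16:] = [0.0, 0.0, 1.0]     # right half blue

ax_img = plt.imshow(img)          # returns an AxesImage attached to the Axes

buf = io.BytesIO()                # file-like object in place of a filename
plt.imsave(buf, img, format='png')

buf.seek(0)
img2 = plt.imread(buf)            # PNG comes back as a float array in [0,1]
print(img2.shape)
```

In a Colab notebook the `Agg` backend line is unnecessary; `plt.imshow` renders inline.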
Image Processing
OpenCV (Open Source Computer Vision Library) is an open source computer vision and
machine learning software library.
OpenCV was built to provide a common infrastructure for computer vision applications
and to accelerate the use of machine perception in commercial products.
The library has more than 2500 optimized algorithms, which includes a comprehensive set
of both classic and state-of-the-art computer vision and machine learning algorithms.
OpenCV supports a wide variety of programming languages such as Python, C++, Java,
etc.
To perform the above using OpenCV, you need to first import cv2
import cv2
Then use cvtColor() method of the cv2 module.
Syntax
cv2.cvtColor(image, code)
Parameters:
image: Image to be processed in n-dimensional array
code: Conversion code for the colorspace. For converting RGB to grayscale, we use cv2.COLOR_RGB2GRAY
Return value: Converted image.
URL: https://docs.opencv.org/3.4/d8/d01/group__imgproc__color__conversions.html#ga397ae87e1288a81d2363b61574eb8cab
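Internally, cv2.cvtColor performs a weighted sum of the channels. As a rough NumPy sketch of what the RGB-to-grayscale conversion computes (the Rec.601 weights below are the ones the OpenCV documentation gives for COLOR_RGB2GRAY):

```python
import numpy as np

def rgb_to_gray(img):
    """Weighted sum of the R, G, B channels (Rec.601 weights, as used by
    cv2.COLOR_RGB2GRAY): Y = 0.299*R + 0.587*G + 0.114*B."""
    return img[..., 0]*0.299 + img[..., 1]*0.587 + img[..., 2]*0.114

rgb = np.array([[[1.0, 1.0, 1.0],    # white pixel
                 [1.0, 0.0, 0.0]]])  # red pixel
gray = rgb_to_gray(rgb)
print(gray)   # white -> 1.0, red -> 0.299
```

Note how green contributes most to perceived brightness, which is why its weight is largest.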
cv2.warpAffine() parameters:
src: input image
M: 2 × 3 transformation matrix
dsize: size of the output image
flags: combination of interpolation methods
borderMode: pixel extrapolation method
borderValue: value used in case of a constant border; by default, it is 0
Return value: output image that has the size dsize and the same type as src
URL:
https://docs.opencv.org/3.4/da/d54/group__imgproc__transform.html#ga0203d9ee5fcd28d40dbc4a1ea4451983
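To see how the 2×3 matrix M acts, here is a small sketch that applies an affine matrix to a single coordinate by hand (warpAffine does this mapping for every pixel; the helper name is our own):

```python
import numpy as np

def apply_affine(M, x, y):
    """Map the point (x, y) through a 2x3 affine matrix, as
    cv2.warpAffine does per pixel coordinate."""
    return M @ np.array([x, y, 1.0])

tx, ty = 5, 3                      # shift right by 5, down by 3
M = np.float64([[1, 0, tx],
                [0, 1, ty]])
print(apply_affine(M, 2, 1))       # -> [7. 4.]
```

The appended 1 in [x, y, 1] is what lets a pure matrix product express the additive translation.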
x′ = x + tx
y′ = y + ty
In matrix form:
[x′]   [1 0 tx] [x]
[y′] = [0 1 ty] [y]
                [1]
x′ = (x − x0)cosθ + (y − y0)sinθ + x0
y′ = −(x − x0)sinθ + (y − y0)cosθ + y0
In matrix form:
[x′]   [ cosθ   sinθ   −x0·cosθ − y0·sinθ + x0] [x]
[y′] = [−sinθ   cosθ    x0·sinθ − y0·cosθ + y0] [y]
                                                [1]
If θ > 0, the rotation is anti-clockwise; if θ < 0, it is clockwise.
(x0, y0) is the point to rotate the image about, normally the centre of the image.
URL:
https://docs.opencv.org/3.4/da/d54/group__imgproc__transform.html#gafbbc470ce83812914a70abfb604f4326
URL:
https://docs.opencv.org/3.4/da/d54/group__imgproc__transform.html#ga47a974309e9102f5f08231edc7e7529d
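As a sanity check, the 2×3 matrix form can be verified against the direct rotation formulas. The sketch below builds the matrix from the equations above (same structure as cv2.getRotationMatrix2D with scale 1, except that we work in radians) and confirms both give the same point:

```python
import numpy as np

def rotation_matrix(theta, x0, y0):
    """2x3 matrix for rotation by theta about (x0, y0), taken from the
    equations above (theta in radians; helper name is our own)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.float64([[ c, s, -x0*c - y0*s + x0],
                       [-s, c,  x0*s - y0*c + y0]])

theta, x0, y0 = np.pi/2, 1.0, 1.0   # 90 degrees about (1, 1)
M = rotation_matrix(theta, x0, y0)
x, y = 2.0, 1.0
xp, yp = M @ np.array([x, y, 1.0])  # matrix form

# Direct formulas for comparison
xd = (x - x0)*np.cos(theta) + (y - y0)*np.sin(theta) + x0
yd = -(x - x0)*np.sin(theta) + (y - y0)*np.cos(theta) + y0
print((xp, yp))   # matches (xd, yd): (1.0, 0.0)
```

A matrix of this shape can be passed straight to cv2.warpAffine; in practice cv2.getRotationMatrix2D builds it for you from an angle in degrees.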
Answer:
Point operation! Since the output value at a specific coordinate of the grayscale image is
dependent only on the input value at the same coordinate of the color image.
# Import all the required libraries
import cv2
import numpy as np
import matplotlib.image as mpimg
import matplotlib.pyplot as plt
# Perform thresholding
processedImg = grayImgUint > 128
We need a way to automatically determine the threshold value T so that the result of
thresholding is reproducible.
A well-known approach is Otsu's method:
1. Select an initial estimate of the threshold T. A good initial value is the average intensity of the image.
2. Partition the image into two groups, R1 and R2, using the threshold T.
3. Calculate the mean gray values µ1 and µ2 of the partitions R1 and R2.
4. Compute a new threshold:
   T = (µ1 + µ2)/2
5. Repeat steps 2-4 until the mean values µ1 and µ2 in successive iterations do not change.
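The iterative procedure above fits in a few lines of NumPy. This is a sketch of the listed steps only (function and variable names are our own, and eps is an assumed convergence tolerance):

```python
import numpy as np

def iterative_threshold(img, eps=0.5):
    """Steps 1-5 above: start from the mean intensity, then repeatedly
    split the pixels at T and move T to the midpoint of the two means."""
    T = img.mean()                                 # step 1: initial estimate
    while True:
        r1, r2 = img[img <= T], img[img > T]       # step 2: partition at T
        if len(r1) == 0 or len(r2) == 0:
            return T                               # degenerate: one-sided split
        mu1, mu2 = r1.mean(), r2.mean()            # step 3: partition means
        T_new = (mu1 + mu2) / 2                    # step 4: new threshold
        if abs(T_new - T) < eps:                   # step 5: stop when stable
            return T_new
        T = T_new

img = np.array([10, 12, 10, 200, 205, 198], dtype=np.float64)
print(iterative_threshold(img))   # settles between the two intensity clusters
```

On this bimodal toy image the threshold lands roughly halfway between the dark cluster (~11) and the bright cluster (~201).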
Parameters:
source: input image array (must be grayscale)
thresholdValue: value of threshold below and above which pixel values will change accordingly
maxVal: maximum value that can be assigned to a pixel
thresholdingTechnique: the type of thresholding to be applied
(For Otsu's, we pass cv2.THRESH_BINARY + cv2.THRESH_OTSU)
Return values: the threshold value used, and the thresholded image.
URL: https://docs.opencv.org/4.x/d7/d4d/tutorial_py_thresholding.html
Recall, local operations refer to those where the output value at a specific coordinate
depends on the input values in the neighborhood of that same coordinate.
Some of the most common neighborhoods are 4-connected neighborhood and the
8-connected neighborhood.
Image smoothing: It removes noise and softens edges and corners of the image. It is also
called blurring.
Image edge detection: It detects the boundaries (edges) of objects, or regions within an
image.
Image sharpening: It removes blur, enhances details, and dehazes.
=K (−1, −1)I (2, 2) + K (−1, 0)I (2, 1) + K (−1, 1)I (2, 0)+
K (0, −1)I (1, 2) + K (0, 0)I (1, 1) + K (0, 1)I (1, 0)+
K (1, −1)I (0, 2) + K (1, 0)I (0, 1) + K (1, 1)I (0, 0)
=(−1)(9) + (−1)(5) + (−1)(3) + (0)(7) + (0)(3) + (0)(1) + (1)(8) + (1)(4) + (1)(10)
= − 9 − 5 − 3 + 8 + 4 + 10 = 5
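The arithmetic above can be checked in NumPy by flipping the image patch in both directions (equivalently, flipping the kernel) and summing the products; the patch and kernel values are the ones from the worked example:

```python
import numpy as np

# Image patch I, with I[a][b] = I(a, b) from the worked example
I = np.array([[10, 4, 8],
              [ 1, 3, 7],
              [ 3, 5, 9]])
# Kernel K, with rows m = -1, 0, 1 and columns n = -1, 0, 1
K = np.array([[-1, -1, -1],
              [ 0,  0,  0],
              [ 1,  1,  1]])

# Convolution at the centre: sum of K(m, n) * I(1-m, 1-n), i.e.
# multiply K element-wise with the patch flipped in both directions.
out = np.sum(K * np.flip(I))
print(out)   # 5, matching the worked example
```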
Steps
1. Invert the kernel, i.e., flip the kernel in both
the horizontal and vertical directions about its
center.
Original kernel:   Flipped horizontally:   Flipped vertically:
-1 0 1             1 0 -1                  1 0 -1
-1 0 1             1 0 -1                  1 0 -1
-1 0 1             1 0 -1                  1 0 -1
2. Slide the inverted kernel over the image,
centered at the point of interest.
3. Multiply the inverted kernel values with the
overlapped area.
4. Sum and accumulate the output.
Image Convolution Again
URL: https://docs.opencv.org/4.x/d4/d86/group__imgproc__filter.html#ga27c049795ce870216ddfb366086b5a04
Image Convolution
Note
filter2D does not mirror the kernel for you. You will need to flip the kernel before applying
cv2.filter2D.
Applying a small kernel several times is often better than applying one big kernel once.
[Figure: original image and the effects of kernels of different sizes. (What do you observe?)]
Original image + detail (edge) = sharpened image. [Color flipped for clarity]
Edge image (vertical edges): |Gx|
Edge image (horizontal edges): |Gy|
Edge image (magnitude): √(Gx² + Gy²)
Pixels of the processed images are inverted (i.e., black to white, white to black) to make them more visible.
3 7 6
2 4 6
4 7 2
(3 + 7 + 6 + 2 + 4 + 6 + 4 + 7 + 2)/9 = 41/9
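The averaging above is exactly what a mean (box) filter computes at the centre of the neighbourhood, as this one-liner confirms:

```python
import numpy as np

patch = np.array([[3, 7, 6],
                  [2, 4, 6],
                  [4, 7, 2]])
# Mean (box) filter output at the centre: the average of the 3x3 neighbourhood
print(patch.sum(), patch.mean())   # 41, 41/9 ~= 4.556
```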
Sometimes, you may want to crop the region of interest (ROI) for further processing.
For instance, in a face detection application, you may want to crop the face from an
image.
To crop an image, you can use the same method as numpy array slicing.
To slice an array, you need to specify the start and end index of the first as well as the
second dimension.
Syntax
croppedImg = sourceImg[start_row:end_row, start_col:end_col]
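A quick sketch of the slicing syntax on a stand-in array (the row/column numbers here are arbitrary illustration values):

```python
import numpy as np

# A stand-in 'image': 21 rows (height) x 32 columns (width)
sourceImg = np.arange(21*32).reshape(21, 32)

# Crop rows 5..14 and columns 10..19 (end indices are exclusive)
croppedImg = sourceImg[5:15, 10:20]
print(croppedImg.shape)   # (10, 10)

# Slicing returns a view; use .copy() if the crop must not alias the source
roi = sourceImg[5:15, 10:20].copy()
```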
Zero padding (cv2.BORDER_CONSTANT):
0 0 0 0 0 0 0
0 0 0 0 0 0 0
0 0 1 2 3 0 0
0 0 4 5 6 0 0
0 0 7 8 9 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0

Reflect (cv2.BORDER_REFLECT):
5 4 4 5 6 6 5
2 1 1 2 3 3 2
2 1 1 2 3 3 2
5 4 4 5 6 6 5
8 7 7 8 9 9 8
8 7 7 8 9 9 8
5 4 4 5 6 6 5

Replicate (cv2.BORDER_REPLICATE):
1 1 1 2 3 3 3
1 1 1 2 3 3 3
1 1 1 2 3 3 3
4 4 4 5 6 6 6
7 7 7 8 9 9 9
7 7 7 8 9 9 9
7 7 7 8 9 9 9
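The three border styles can be reproduced with np.pad. One mapping caveat worth noting as an assumption: OpenCV's edge-inclusive BORDER_REFLECT corresponds to np.pad's 'symmetric' mode, not its 'reflect' mode (which excludes the edge row, like BORDER_REFLECT_101):

```python
import numpy as np

a = np.array([[1, 2, 3],
              [4, 5, 6],
              [7, 8, 9]])

const = np.pad(a, 2, mode='constant')    # zero padding (BORDER_CONSTANT)
refl  = np.pad(a, 2, mode='symmetric')   # edge-inclusive reflect (BORDER_REFLECT)
repl  = np.pad(a, 2, mode='edge')        # replicate border pixels (BORDER_REPLICATE)

print(refl[0])   # [5 4 4 5 6 6 5]
print(repl[0])   # [1 1 1 2 3 3 3]
```

The printed first rows match the reflect and replicate grids shown above.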
Parameters:
src: Source image
top: The border width in number of pixels in top direction
bottom: The border width in the number of pixels in bottom direction
left: The border width in the number of pixels in left direction
right: The border width in the number of pixels in the right direction
borderType: The kind of border to be added
cv2.BORDER_CONSTANT
cv2.BORDER_REFLECT
cv2.BORDER_REPLICATE
value (optional): The color of border if border type is cv2.BORDER CONSTANT
Returns the resulting image
Parameters:
image: Image of type uint8 or float32 represented as “[img]”
channels: the index of the channel for which we calculate the histogram. For a grayscale image, pass [0]; for a color
image, pass [0], [1], or [2] to calculate the histogram of each channel respectively.
mask: mask image. To find the histogram of the full image, it is given as None.
histSize: This represents the number of bins. For full scale, we pass [256]
ranges: This is the range of intensities. Normally, it is [0,256].
Return value: Histogram of the image.
# Calculate histogram
hist = cv2.calcHist([grayImgUint], [0], None, [256], [0,256])
plt.figure()
plt.plot(hist) # Plot and show the histogram
plt.show()
Brightness Adjustment
To adjust the brightness of an image using OpenCV, you need to first import cv2
import cv2
Then use convertScaleAbs() method of the cv2 module.
Syntax
cv2.convertScaleAbs(image, alpha = 1, beta = 0)
Parameters:
image: Image to be processed in n-dimensional array
alpha: The scale factor. It is 1 by default
beta: The delta added to the scaled values. It is 0 by default.
Return value: Converted image.
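Per the OpenCV documentation, convertScaleAbs computes roughly |alpha·src + beta| per pixel and saturates the result to 8 bits. A NumPy sketch of that behaviour (rounding details may differ slightly from OpenCV's saturate_cast; the function name is our own):

```python
import numpy as np

def adjust_brightness(img, alpha=1.0, beta=0.0):
    """Per-pixel |alpha*img + beta|, saturated to [0, 255], in the
    spirit of cv2.convertScaleAbs."""
    out = np.abs(alpha * img.astype(np.float64) + beta)
    return np.clip(np.round(out), 0, 255).astype(np.uint8)

img = np.array([[0, 100, 250]], dtype=np.uint8)
print(adjust_brightness(img, alpha=1.2, beta=30))   # [[ 30 150 255]]
```

Note the saturation: 250·1.2 + 30 = 330 clips to 255 rather than wrapping around.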
Without gamma, shades captured by digital cameras would not appear as they did to our
eyes (on a standard monitor).
Gamma is also referred to as gamma correction, gamma encoding or gamma compression,
but these all refer to a similar concept.
A gamma encoded image has to have “gamma correction” applied when it is viewed –
which effectively converts it back into light from the original scene.
Gamma correction can be performed by adjusting gamma value (γ).
γ < 1 will make the image appear darker
γ > 1 will make the image appear lighter
γ = 1 will have no effect on the input image
# Assume Google Drive has been mounted & the path has been added for interpreter to search
# Import all the required libraries
import cv2; import numpy as np
import matplotlib.image as mpimg; import matplotlib.pyplot as plt
img = mpimg.imread('snorlax.png') # Read the image
# Convert the color image to gray and show it
grayImg = cv2.cvtColor(img, cv2.COLOR_RGB2GRAY)
plt.figure(); plt.imshow(grayImg, cmap='gray', vmin=0, vmax=1)
# Convert pixel values from [0,1] to [0,255]
grayImgUint = grayImg*255; grayImgUint = grayImgUint.astype(np.uint8)
# Prepare look-up-table and perform gamma correction
gamma = 0.5; invGamma = 1/gamma
table = [((i / 255) ** invGamma) * 255 for i in range(256)]
table = np.array(table, np.uint8)
processedImg1 = cv2.LUT(grayImgUint, table)
plt.figure(); plt.imshow(processedImg1, cmap='gray', vmin=0, vmax=255)
# Prepare look-up-table and perform gamma correction
gamma = 2.2; invGamma = 1/gamma
table = [((i / 255) ** invGamma) * 255 for i in range(256)]
table = np.array(table, np.uint8)
processedImg2 = cv2.LUT(grayImgUint, table)
plt.figure(); plt.imshow(processedImg2, cmap='gray', vmin=0, vmax=255)
Histogram Equalization
Histogram equalization is another technique used to improve contrast of images.
The idea is to spread out the most frequent intensity values.
Algorithm
1. Compute the histogram, H, of the image
2. Compute the cumulative histogram, C, of
the image:
   C(i) = H(0) + H(1) + ··· + H(i)
3. Map each pixel value through the cumulative histogram: Inew = C(I)
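The algorithm above can be sketched in NumPy. One detail the slide leaves implicit, which we add here as an assumption: the cumulative histogram is rescaled to [0, 255] before being used as the mapping, so outputs remain valid 8-bit intensities.

```python
import numpy as np

def equalize(img):
    """Histogram equalization per the steps above (uint8 input).
    Assumption: C is rescaled to [0, 255] before mapping."""
    hist = np.bincount(img.ravel(), minlength=256)   # step 1: histogram H
    C = hist.cumsum()                                # step 2: cumulative C
    lut = np.round(255.0 * C / C[-1]).astype(np.uint8)
    return lut[img]                                  # step 3: Inew = C(I)

img = np.array([[50, 50, 51], [51, 52, 200]], dtype=np.uint8)
out = equalize(img)
print(out.min(), out.max())   # intensities spread towards the full 0..255 range
```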
In fact, histogram equalization can be performed using an OpenCV function. To do so, you
need to first import cv2
import cv2
Then use equalizeHist() method of the cv2 module.
Syntax
equalizeHist(source)
Parameters:
source: input image array
Return value: Equalized image
Shifted identity kernel:
0 0 0
1 0 0
0 0 0
Convolving the original image with this kernel 5 times gives back the image shifted by 5 pixels.
Non-linear Filtering
cv2.medianBlur() parameters:
src: input image that you want to process
kernelSize: The size of the kernel
Return value: filtered image.
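What the median filter does at each pixel is easy to show on a single neighbourhood: sort the values under the kernel and keep the middle one. Unlike the mean, the median simply discards an extreme impulse ("salt") value:

```python
import numpy as np

# One 3x3 neighbourhood containing an impulse ('salt') noise value 255
window = np.array([[12, 11, 13],
                   [10, 255, 12],
                   [11, 13, 10]])

# Median filter output at the centre: sort the 9 values, take the middle one
print(np.median(window))   # 12.0 -- the outlier 255 is discarded entirely
```

A mean filter over the same window would instead smear the outlier into the result (the average here is about 38.6).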
To demonstrate how morphological filters work, let us create two adjacent circles with
random noise on the background.
from skimage.draw import disk
import numpy as np
Erosion
Dilation
Opening
Closing
Erosion is used for shrinking elements in the input image using the structuring element.
A pixel value is retained only when the structuring element is completely contained
inside the image object. Otherwise, it gets deleted, or eroded.
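For binary images with a 3×3 all-ones structuring element, erosion reduces to a minimum filter: a pixel survives only if its entire 3×3 neighbourhood is 1. A sketch (not cv2.erode itself; the helper name and zero-padded border are our own choices):

```python
import numpy as np

def erode3x3(img):
    """Binary erosion with a 3x3 all-ones structuring element: a pixel
    survives only if its whole 3x3 neighbourhood is 1 (a min filter)."""
    p = np.pad(img, 1)              # zero border, so edge pixels always erode
    out = np.ones_like(img)
    for dr in range(3):
        for dc in range(3):
            out &= p[dr:dr+img.shape[0], dc:dc+img.shape[1]]
    return out

img = np.zeros((5, 5), dtype=int)
img[1:4, 1:4] = 1                   # a 3x3 square of ones
img[0, 0] = 1                       # plus one isolated noise pixel
out = erode3x3(img)
print(out)                          # only the centre of the square survives
```

Note the two effects at once: the isolated noise pixel vanishes, and the 3×3 square shrinks to its single centre pixel.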
Parameters:
src: input image that you want to erode
kernel: A structuring element used for erosion
dst: Output image
anchor: the anchor point; its default value (-1,-1) means that the anchor is at
the kernel center.
borderType: cv2.BORDER_CONSTANT, cv2.BORDER_REFLECT, etc.
iterations: Number of times erosion is applied.
borderValue: It is border value in case of a constant border.
Return value: filtered image.
Dilation is used for expanding elements in the input image using the structuring element.
A pixel value is turned "on" when the structuring element overlaps the input image
object. Otherwise, the pixel value is "off".
Parameters:
src: input image that you want to dilate
kernel: A structuring element used for dilation
dst: Output image
anchor: the anchor point; its default value (-1,-1) means that the anchor is at
the kernel center.
borderType: cv2.BORDER_CONSTANT, cv2.BORDER_REFLECT, etc.
iterations: Number of times dilation is applied.
borderValue: It is border value in case of a constant border.
Return value: filtered image.
Parameters:
src: input image that you want to process
op: the operation (cv2.MORPH_ERODE, cv2.MORPH_DILATE, cv2.MORPH_OPEN, cv2.MORPH_CLOSE)
kernel: a structuring element used for the operation
dst: Output image
anchor: the anchor point; its default value (-1,-1) means that the anchor is at
the kernel center.
iterations: number of times the operation is applied (e.g., with iterations = 2: erode×2, dilate×2).
borderType: cv2.BORDER_CONSTANT, cv2.BORDER_REFLECT, etc.
borderValue: It is border value in case of a constant border.
Return value: filtered image.
img = plt.imread('input-open.png')
Closing filter removes small holes while also maintaining the original shape of the object.
Closing is done by applying the dilation first, and then applying erosion.
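Closing can be sketched for binary images by composing a max filter (dilation) and a min filter (erosion), each with a 3×3 all-ones structuring element. This is an illustration of the definition, not cv2.morphologyEx; the helper names and zero-padded borders are our own:

```python
import numpy as np

def shift_stack(img):
    """Stack the nine 3x3-shifted copies of img (zero-padded border)."""
    p = np.pad(img, 1)
    return np.stack([p[r:r+img.shape[0], c:c+img.shape[1]]
                     for r in range(3) for c in range(3)])

def dilate3x3(img):   # max over the 3x3 neighbourhood
    return shift_stack(img).max(axis=0)

def erode3x3(img):    # min over the 3x3 neighbourhood
    return shift_stack(img).min(axis=0)

def close3x3(img):    # closing = dilation followed by erosion
    return erode3x3(dilate3x3(img))

img = np.zeros((7, 7), dtype=int)
img[2:5, 2:5] = 1     # a 3x3 square...
img[3, 3] = 0         # ...with a one-pixel hole in the middle
out = close3x3(img)
print(out[3, 3])      # 1 -- the hole has been filled
```

The dilation fills the hole; the subsequent erosion shrinks the expanded shape back, leaving the original square intact but without the hole.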
img = plt.imread('input-close.png')
Reference Materials
imread(): https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imread.html
imshow(): https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imshow.html
imsave(): https://matplotlib.org/stable/api/_as_gen/matplotlib.pyplot.imsave.html
cvtColor(): https://docs.opencv.org/3.4/df/d9d/tutorial_py_colorspaces.html
warpAffine(), getRotationMatrix2D(), resize():
https://docs.opencv.org/3.4/da/d6e/tutorial_py_geometric_transformations.html
copyMakeBorder(): https://docs.opencv.org/3.4/dc/da3/tutorial_copyMakeBorder.html
calcHist(): https://docs.opencv.org/3.4/dd/d0d/tutorial_py_2d_histogram.html
convertScaleAbs():
https://docs.opencv.org/3.4/d2/de8/group__core__array.html#ga3460e9c9f37b563ab9dd550c4d8c4e7d
threshold(): https://docs.opencv.org/3.4/d7/d4d/tutorial_py_thresholding.html
equalizeHist(): https://docs.opencv.org/3.4/d5/daf/tutorial_py_histogram_equalization.html
filter2D(), medianBlur(): https://docs.opencv.org/3.4/d4/d13/tutorial_py_filtering.html
erode(), dilate(), morphologyEx():
https://docs.opencv.org/3.4/d9/d61/tutorial_py_morphological_ops.html