Basic Computer Vision
Basic Computer Vision
0:00 / 6:24
2. Defect Detection - The manufacturing industry often struggles to get 100% accuracy in detect defects in their
manufactured products, as it demands systems to monitor defects on a micro-scale (like monitoring the wrong
threading). Detecting these defects at the end of the production process or after the delivery to the client can result
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 1/12
1/14/25, 10:54 AM Basic Computer Vision
in increased production costs and leads to customer dissatisfaction. These losses are comparatively far higher
than the cost of adopting an AI-powered computer vision defect detection system.
The manufacturing industry often struggles to get 100% accuracy in detect defects in their manufactured products,
as it demands systems to monitor defects on a micro-scale (like monitoring the wrong threading). Detecting these
defects at the end of the production process or after the delivery to the client can result in increased production
costs and leads to customer dissatisfaction. These losses are comparatively far higher than the cost of adopting an
AI-powered computer vision defect detection system.
0:00 / 1:40
1. 3D Vision System - A computer vision inspection system is used in a production line to carry out responsibilities
that humans struggle with. In this use case, the system uses high-resolution images to build a full 3D model of
components and their connector pins.
As components run through the manufacturing plant, the computer vision system captures various images from
different angles to generate a 3D model. These images, when combined and fed to AI algorithms, identify any
faulty threading or minor deviation from the design. This technology is very credible in manufacturing industries like
automotive, electronic circuits, oil and gas, energy, and so on.
2. Computer Vision-Guided Die Cutting - Rotary and Laser Die Cutting are the most adopted technologies in
performing die-cutting in the manufacturing process. Rotary uses hard tooling and steel blades while laser uses
high-speed laser light. Even though laser die cutting is more precise, cutting tough materials is challenging and
rotary cutting can be used to cut any material.
The manufacturing industry can deploy computer vision systems to perform rotary die cutting to be as precise as
laser cutting to cut any sort of design. Once the design pattern is fed to the computer vision system, the system will
guide the die cutting machine, be it laser or rotary cutting, to perform cutting accurately.
3. Predictive Maintenance - Some manufacturing processes happen at critical temperatures and environmental
conditions, so material degradation or corrosion is common. This results in equipment deformation. If not handled
early, this can result in severe losses and halt the manufacturing process. For this reason, manufacturers employ
corrosion engineers to ensure the health of machines and avoid corrosion as a part of predictive maintenance.
Manufacturers conduct constant monitoring of equipment manually. Computer vision systems, however, can
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 2/12
1/14/25, 10:54 AM Basic Computer Vision
constantly monitor the equipment based on various metrics. If any deviation from metrics suggests corrosion, the
computer vision systems can alert the respective managers to carry out maintenance activities proactively.
4. Safety and Security Standards - Employees in the manufacturing industry work in exceedingly dangerous
conditions, so the risk of injury is much higher. Not adhering to safety and security standards can cause serious
injury or even death. The manufacturing plants are required to abide by safety standards enforced by governing
authorities and companies that fail to adhere to these standards must face penalties.
Even though manufacturing companies have cameras installed to monitor employee movement in the plant to
ensure safety standards, it is largely a manual monitoring process where an employee must sit and constantly
monitor the video stream. The manual processes are error-prone and this error could result in serious
consequences. An AI-powered computer vision can be an appropriate solution. This application constantly monitors
the manufacturing site right from the entry point, into the site, and exit point. Even if there is a minor violation in
compliance, the system reports to a respective manager and alerts the employees too. This way, manufacturing
companies can ensure their employees adhere to safety and security standards.
In case of an active accident, the computer vision system can alert managers and staff about where the accident
happened and the intensity of the accident so the production process can be halted in that specific area and
proactively ensure the safety of employees.
1. Packaging Standards - In some manufacturing companies, it is important to count the number of manufactured
pieces before packaging them in a container. Performing this task manually can incur a lot of errors. This problem
is more prevalent in pharmaceutical and retail products. Deploying a computer vision system in the packaging
process to count the number of pieces, verifies if packaging standards are followed.
Once the items are properly packed, another use case for computer vision is inspecting any damage on the
packaging itself. It's important that products get to customers safely and in one piece. Damaged packaging risks
damage to the product inside. Computer vision systems can proactively divert any damaged packaging before
leaving the plant.
2. Barcode Analysis - Another important aspect is barcode verification. Most products have barcodes on them. The
packaging department should cross-check if the printed barcodes are accurate and readable. Cross verifying
barcodes of thousands of products manually demands a lot of human hours and is error-prone and costly.
Computer vision systems can verify barcodes easily and divert any products with faulty barcodes.
3. Inventory Management - Computer vision systems can help count stock, maintain inventory status in
warehouses, and automate and alert managers if any material required for manufacturing is below demand. The
computer vision systems can avoid human errors in counting stock.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 3/12
1/14/25, 10:54 AM Basic Computer Vision
In massive warehouses, locating stock is difficult. With a computer vision system, based on barcode data, these
systems can help inventory managers locate products in the warehouse.
Greater Productivity
Today’s customers expect higher-quality products without defects, all at a low cost. In order to meet this demand, most
companies are looking for ways to boost employee productivity.
That often means monitoring employees to cut down on unscheduled downtime. This is where CV can make a huge
contribution. Half a century ago, the foreman was responsible for walking the factory floor to make sure that employees
were hard at work instead of idling. Today, a network of computing systems can be used to monitor workers in real-time
and check to see that they are all making the best use of their work time.
Some businesses have reported increases in productivity of between 10 and 12%. It’s worth noting that employees may
also experience greater job satisfaction when they are challenged at work, which may lead to a virtuous cycle of increased
productivity.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 4/12
1/14/25, 10:54 AM Basic Computer Vision
The term Computer Vision (CV) is used and heard very often in artificial intelligence (AI) and deep learning (DL)
applications. The term essentially means giving a computer the ability to see the world as we humans do.
Computer Vision is a field of study which enables computers to replicate the human visual system. As already mentioned
above, It’s a subset of artificial intelligence which collects information from digital images or videos and processes them to
define the attributes. The entire process involves image acquiring, screening, analysing, identifying and extracting
information. This extensive processing helps computers to understand any visual content and act on it accordingly.
Computer vision projects translate digital visual content into explicit descriptions to gather multi-dimensional data. This
data is then turned into a computer-readable language to aid the decision-making process. The main objective of this
branch of artificial intelligence is to teach machines to collect information from pixels.
Here are a few common tasks that computer vision systems can be used for:
Object classification - The system parses visual content and classifies the object on a photo/video to the
defined category. For example, the system can find a dog among all objects in the image.
Object identification - The system parses visual content and identifies a particular object on a photo/video. For
example, the system can find a specific dog among the dogs in the image.
Object tracking - The system processes video finds the object (or objects) that match search criteria and track
its movement.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 5/12
1/14/25, 10:54 AM Basic Computer Vision
A digital image is an image composed of picture elements, also known as pixels, each with finite, discrete quantities of
numeric representation for its intensity or grey level. So the computer sees an image as numerical values of these pixels
and in order to recognise a certain image, it has to recognise the patterns and regularities in this numerical data.
But usually, you will find that for any colour image, there are 3 primary channels – Red, green and blue and the value of
each channel varies from 0-255. In more simpler terms we can say that a digital image is actually formed by the
combination of three basic colour channels Red, green, and blue whereas for a grayscale image we have only one
channel whose values also vary from 0-255.
In short, machines interpret images as a series of pixels, each with their own set of color values. For example, below is a
picture of Abraham Lincoln. Each pixel’s brightness in this image is represented by a single 8-bit number, ranging from 0
(black) to 255 (white). Here is a hypothetical example of how pixels form an image. The darker pixels are represented by
a number closer to the zero and lighter pixels are represented by numbers approaching one. All other colours are
represented by the numbers between 0 and 1. These numbers are what software sees when you input an image. This
data is provided as an input to the computer vision algorithm that will be responsible for further analysis and decision
making.
Color values of individual pixels are converted into a simple array of numbers used as input for a computer vision
algorithm
Pixels are accessed by their (x, y)-coordinates. The origin, (0, 0), is located at the top-left of the image. The x-
values go left-to-right (column number) and y-values go top-to-bottom (row number)
When we take a digital image, it is stored as a combination of pixels. Each pixel contains a different number of channels.
If it a grayscale image, it has only one pixel, whereas if it is a coloured image, it contains three channels: red, green and
blue.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 6/12
1/14/25, 10:54 AM Basic Computer Vision
As shown in the above representation of a digital coloured image, each channel of each pixel has a value between 0 and
255. Each of these values represented in binary before a computer can understand the image.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 7/12
1/14/25, 10:54 AM Basic Computer Vision
Deep learning represents a more effective way to do computer vision—it uses a specific algorithm called a neural
network. The neural networks are used to extract patterns from provided data samples. The algorithms are inspired by the
human understanding of how brains function, in particular, the interconnections between the neurons in the cerebral
cortex.
At the core level of a neural network is the perceptron, the mathematical representation of a biological neuron. Similar to
biologic neurons in the cerebral cortex, it’s possible to have several layers of interconnected perceptrons. Input values
(raw data) get passed through the network created by perceptrons and end up in the output layer, which is a prediction, or
a highly educated guess about a certain object. For example, at the end of the analysis, the machine can classify an
object with X% confidence.
Machine learning uses algorithms to parse data while deep learning relies on layers of artificial neural networks
(ANN)
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 8/12
1/14/25, 10:54 AM Basic Computer Vision
Facial recognition
Facial recognition technology is used to match photos of people’s faces to their identities. This technology is integrated
into major products that we use every day. For example, Facebook is using computer vision to identify people in photos.
Facial recognition is a crucial technology for biometric authentication. Many mobile devices available on the market today
allow users to unlock devices by showing their faces. A front face camera is used for facial recognition; mobile devices
process this image and, based on analysis, can tell whether the person who is holding a device is authorized on this
device. The beauty of this technology is that it works really fast.
Self-driving cars
Computer vision enables cars to make sense of their surroundings. A smart vehicle has a few cameras that capture
videos from different angles and send videos as an input signal to the computer vision software. The system processes
the video in real-time and detects objects like road marking, objects near the car (such as pedestrians or other cars),
traffic lights, etc. One of the most notable examples of applications of this technology is autopilot in Tesla cars.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequenc… 9/12
1/14/25, 10:54 AM Basic Computer Vision
0:00 / 1:56
Agriculture
Many agricultural organizations employ computer vision to monitor the harvest and solve the common agricultural
problems such as weeds emergence or nutrient deficiency. Computer vision systems process images from satellites,
drones, or planes, and attempt to detect the problems in the early phase, which helps to avoid unnecessary financial
losses.
Healthcare
Computer Vision has tremendous uses in healthcare.
A patient’s data can have up to 250GB of data in images and records. Being able to identify, evaluate, and interpret
visuals and information like a human eye, computer vision helps doctors process an overwhelming amount of patient’s
data in a short amount of time.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequen… 10/12
1/14/25, 10:54 AM Basic Computer Vision
Retail
In December 2016, Amazon opened its first partially automated convenience store, AmazonGO, for its employees. In two
years, AmazonGO opened its doors to the public.
Today, AmazonGO equips other retailers to use its technology, Just Walk Out (a combination of technologies that includes
computer vision, sensor fusion, and deep learning), in their stores.
Just Walk Out, true to its name, allows the shopper to just walk out of a store, taking whatever, they want with them. The
shoppers neither have to stand in line nor have to pay at the store. It may sound like theft, except it is not. AmazonGO’s
Just Walk Out, is better at preventing theft than a conventional store.
While Just Walk Out is built for the customer’s ultimate shipping experience and convenience, it is also built to ensure the
shop owner/retailer’s ultimate peace of mind.
0:00 / 1:49
However, the no-checkout convenience store powered by super surveillance systems does not in any way offer a 100%
guaranteed effectiveness against shoplifting, at least not the unintentional shoplifting.
Despite the weight sensors installed in the shelves, AmazonGO stores might, from time to time, accidentally miss adding
the item to the shopper’s cart if the shopper happens to take multiple yogurts at the same time. That said, every
convenience story is designed to expect and accept a certain amount of lossage.
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequen… 11/12
1/14/25, 10:54 AM Basic Computer Vision
https://splms.polite.edu.sg/d2l/le/enhancedSequenceViewer/545684?url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fb988e89b-eb2c-401a-999a-94b943da0008.sequen… 12/12