
Fundamentals of Computer Vision
Introduction

Computer vision is one of the core areas of artificial intelligence (AI), and
focuses on creating solutions that enable AI applications to "see" the
world and make sense of it.

Of course, computers don't have biological eyes that work the way ours do, but they're capable of processing images, whether from a live camera feed, digital photographs, or video. This ability to process images is the key to creating software that can emulate human visual perception.

In this module, we'll examine some of the fundamental principles and techniques that underlie computer vision. We'll also introduce Microsoft Azure AI Vision, a cloud service that developers can use to create a wide range of computer vision solutions.

Images and image processing



Before we can explore image processing and other computer vision capabilities, it's useful to consider what an image actually is in the context of data for a computer program.

Images as pixel arrays

To a computer, an image is an array of numeric pixel values. For example, consider the following array:

0 0 0 0 0 0 0
0 0 0 0 0 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0

The array consists of seven rows and seven columns, representing the pixel values for a 7x7 pixel image (which is known as the image's resolution). Each pixel has a value between 0 (black) and 255 (white), with values between these bounds representing shades of gray. The image represented by this array looks similar to the following (magnified) image:

The array of pixel values for this image is two-dimensional (representing rows and columns, or x and y coordinates) and defines a single rectangle of pixel values. A single layer of pixel values like this represents a grayscale image. In reality, most digital images are multidimensional and consist of three layers (known as channels) that represent red, green, and blue (RGB) color hues. For example, we could represent a color image by defining three channels of pixel values that create the same square shape as the previous grayscale example:

Red:
150 150 150 150 150 150 150
150 150 150 150 150 150 150
150 150 255 255 255 150 150
150 150 255 255 255 150 150
150 150 255 255 255 150 150
150 150 150 150 150 150 150
150 150 150 150 150 150 150

Green:
0 0 0 0 0 0 0
0 0 0 0 0 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0

Blue:
255 255 255 255 255 255 255
255 255 255 255 255 255 255
255 255 0 0 0 255 255
255 255 0 0 0 255 255
255 255 0 0 0 255 255
255 255 255 255 255 255 255
255 255 255 255 255 255 255
Here's the resulting image:

The purple squares are represented by the combination:

Red: 150
Green: 0
Blue: 255

The yellow squares in the center are represented by the combination:

Red: 255
Green: 255
Blue: 0
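
If you're comfortable with a little code, the same idea is easy to see in practice. The following sketch (assuming Python with NumPy, which isn't covered in this module) builds the 7x7 grayscale image and the three-channel color image shown above as arrays, which is essentially how image libraries store pixel data:

import numpy as np

# The 7x7 grayscale image: a single 2D array of pixel intensities (0-255).
gray = np.zeros((7, 7), dtype=np.uint8)
gray[2:5, 2:5] = 255                      # the white 3x3 square in the center

# The color image: three 7x7 channels (red, green, blue).
red = np.full((7, 7), 150, dtype=np.uint8)
green = np.zeros((7, 7), dtype=np.uint8)
blue = np.full((7, 7), 255, dtype=np.uint8)
red[2:5, 2:5] = 255                       # center square: R=255, G=255, B=0 (yellow)
green[2:5, 2:5] = 255
blue[2:5, 2:5] = 0

color = np.stack([red, green, blue], axis=-1)   # shape (7, 7, 3)
print(color[0, 0], color[3, 3])                 # [150 0 255] (purple), [255 255 0] (yellow)

Stacking the three channels gives an array of shape (7, 7, 3), so every pixel is a trio of red, green, and blue values.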

Using filters to process images

A common way to perform image processing tasks is to apply filters that modify the pixel values of the image to create a visual effect. A filter is defined by one or more arrays of pixel values, called filter kernels. For example, you could define a filter with a 3x3 kernel as shown in this example:

-1 -1 -1
-1 8 -1
-1 -1 -1

The kernel is then convolved across the image, calculating a weighted sum for each 3x3 patch of pixels and assigning the result to a new image. It's easier to understand how the filtering works by exploring a step-by-step example.

Let's start with the grayscale image we explored previously:

0 0 0 0 0 0 0
0 0 0 0 0 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 255 255 255 0 0
0 0 0 0 0 0 0
0 0 0 0 0 0 0

First, we apply the filter kernel to the top left patch of the image,
multiplying each pixel value by the corresponding weight value in the
kernel and adding the results:

(0 x -1) + (0 x -1) + (0 x -1) +
(0 x -1) + (0 x 8) + (0 x -1) +
(0 x -1) + (0 x -1) + (255 x -1) = -255

The result (-255) becomes the first value in a new array. Then we move
the filter kernel along one pixel to the right and repeat the operation:

(0 x -1) + (0 x -1) + (0 x -1) +
(0 x -1) + (0 x 8) + (0 x -1) +
(0 x -1) + (255 x -1) + (255 x -1) = -510

Again, the result is added to the new array, which now contains two
values:

-255 -510

The process is repeated until the filter has been convolved across the entire image, as shown in this animation:

The filter is convolved across the image, calculating a new array of values. Some of the values might be outside of the 0 to 255 pixel value range, so the values are adjusted to fit into that range. Because of the shape of the filter, the outside edge of pixels isn't calculated, so a padding value (usually 0) is applied. The resulting array represents a new image in which the filter has transformed the original image. In this case, the filter has had the effect of highlighting the edges of shapes in the image.
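
To make the process concrete, here's a minimal sketch in Python (assuming NumPy) that convolves the 3x3 kernel across the 7x7 grayscale image, using zero padding for the edge pixels and then adjusting the results back into the 0 to 255 range, as described above:

import numpy as np

image = np.zeros((7, 7), dtype=np.int32)
image[2:5, 2:5] = 255

kernel = np.array([[-1, -1, -1],
                   [-1,  8, -1],
                   [-1, -1, -1]])

# Pad the image with a border of zeros so that edge pixels can be calculated.
padded = np.pad(image, 1, mode="constant", constant_values=0)

filtered = np.zeros_like(image)
for y in range(image.shape[0]):
    for x in range(image.shape[1]):
        patch = padded[y:y + 3, x:x + 3]          # the 3x3 patch under the kernel
        filtered[y, x] = np.sum(patch * kernel)   # weighted sum

# Adjust values that fall outside the valid 0-255 pixel range.
filtered = np.clip(filtered, 0, 255).astype(np.uint8)
print(filtered)

The printed array is the edge-highlighted version of the original image. Libraries such as SciPy provide optimized convolution functions, but the loop above shows the weighted-sum calculation explicitly.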

To see the effect of the filter more clearly, here's an example of the same
filter applied to a real image:


Original Image Filtered Image

Because the filter is convolved across the image, this kind of image manipulation is often referred to as convolutional filtering. The filter used in this example is a particular type of filter (called a Laplace filter) that highlights the edges of objects in an image. There are many other kinds of filters that you can use to create blurring, sharpening, color inversion, and other effects.

Machine learning for computer vision

The ability to use filters to apply effects to images is useful in image processing tasks, such as you might perform with image editing software. However, the goal of computer vision is often to extract meaning, or at least actionable insights, from images, which requires the creation of machine learning models that are trained to recognize features based on large volumes of existing images.

Tip

This unit assumes you are familiar with the fundamental principles of
machine learning, and that you have conceptual knowledge of deep
learning with neural networks. If you are new to machine learning,
consider completing the Fundamentals of machine learning module
on Microsoft Learn.

Convolutional neural networks (CNNs)

One of the most common machine learning model architectures for computer vision is a convolutional neural network (CNN). CNNs use filters to extract numeric feature maps from images, and then feed the feature values into a deep learning model to generate a label prediction. For example, in an image classification scenario, the label represents the main subject of the image (in other words, what is this an image of?). You might train a CNN model with images of different kinds of fruit (such as apple, banana, and orange) so that the label that is predicted is the type of fruit in a given image.

During the training process for a CNN, filter kernels are initially defined
using randomly generated weight values. Then, as the training process
progresses, the model's predictions are evaluated against known label
values, and the filter weights are adjusted to improve accuracy.
Eventually, the trained fruit image classification model uses the filter
weights that best extract features that help identify different kinds of fruit.

The following diagram illustrates how a CNN for an image classification model works:
1. Images with known labels (for example, 0: apple, 1: banana, or 2:
orange) are fed into the network to train the model.
2. One or more layers of filters are used to extract features from each
image as it is fed through the network. The filter kernels start with
randomly assigned weights and generate arrays of numeric values
called feature maps.
3. The feature maps are flattened into a single dimensional array of
feature values.
4. The feature values are fed into a fully connected neural network.
5. The output layer of the neural network uses a softmax or similar
function to produce a result that contains a probability value for each
possible class, for example [0.2, 0.5, 0.3].

During training, the output probabilities are compared to the actual class label. For example, an image of a banana (class 1) should have the value [0.0, 1.0, 0.0]. The difference between the predicted and actual class scores is used to calculate the loss in the model, and the weights in the fully connected neural network and the filter kernels in the feature extraction layers are modified to reduce the loss.

The training process repeats over multiple epochs until an optimal set of
weights has been learned. Then, the weights are saved and the model can
be used to predict labels for new images for which the label is unknown.
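
As a minimal illustration of this kind of architecture (a simplified sketch, not the exact model described above), the following Python code assumes PyTorch is installed and defines a small CNN with one convolutional filter layer, a flattening step, and a fully connected layer that scores three fruit classes:

import torch
import torch.nn as nn

class FruitCNN(nn.Module):
    def __init__(self, num_classes=3):
        super().__init__()
        # Convolutional layer: 12 learnable 3x3 filter kernels (weights start out random).
        self.conv = nn.Conv2d(in_channels=3, out_channels=12, kernel_size=3, padding=1)
        self.pool = nn.MaxPool2d(2)        # reduces the size of the feature maps
        self.flatten = nn.Flatten()        # flattens feature maps into a 1D array of feature values
        # Fully connected layer producing one score per class (0: apple, 1: banana, 2: orange).
        self.fc = nn.Linear(12 * 32 * 32, num_classes)

    def forward(self, x):
        x = torch.relu(self.conv(x))
        x = self.pool(x)
        x = self.flatten(x)
        return self.fc(x)                  # raw class scores; softmax turns them into probabilities

model = FruitCNN()
images = torch.rand(4, 3, 64, 64)          # a batch of 4 RGB 64x64 images (random stand-ins)
probabilities = torch.softmax(model(images), dim=1)
print(probabilities.shape)                 # torch.Size([4, 3]), e.g. [0.2, 0.5, 0.3] per image

# During training, the difference between predictions and known labels drives weight updates.
loss_fn = nn.CrossEntropyLoss()
labels = torch.tensor([0, 1, 2, 1])        # known labels for the batch
loss = loss_fn(model(images), labels)
loss.backward()                            # gradients used to adjust filter and fully connected weights

In a real training loop, the loss calculation and backward pass shown at the end would be repeated for every batch of labeled images, over multiple epochs, with an optimizer applying the weight updates.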

Note

CNN architectures usually include multiple convolutional filter layers and additional layers to reduce the size of feature maps, constrain the extracted values, and otherwise manipulate the feature values. These layers have been omitted in this simplified example to focus on the key concept, which is that filters are used to extract numeric features from images, which are then used in a neural network to predict image labels.

Transformers and multi-modal models

CNNs have been at the core of computer vision solutions for many years.
While they're commonly used to solve image classification problems as
described previously, they're also the basis for more complex computer
vision models. For example, object detection models combine CNN feature
extraction layers with the identification of regions of interest in images to
locate multiple classes of object in the same image.

Transformers

Most advances in computer vision over the decades have been driven by improvements in CNN-based models. However, in another AI discipline, natural language processing (NLP), a different type of neural network architecture, called a transformer, has enabled the development of sophisticated models for language. Transformers work by processing huge volumes of data and encoding language tokens (representing individual words or phrases) as vector-based embeddings (arrays of numeric values). You can think of an embedding as representing a set of dimensions that each represent some semantic attribute of the token. The embeddings are created such that tokens that are commonly used in the same context are closer together dimensionally than unrelated words.

As a simple example, the following diagram shows some words encoded as three-dimensional vectors, and plotted in a 3D space:

Tokens that are semantically similar are encoded in similar positions, creating a semantic language model that makes it possible to build sophisticated NLP solutions for text analysis, translation, language generation, and other tasks.
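
As a toy illustration (using made-up three-dimensional vectors rather than embeddings from a real trained model), the following Python sketch uses cosine similarity, a common way to compare embedding vectors, to show how related tokens end up close together:

import numpy as np

# Hypothetical 3D embeddings; real transformer embeddings have hundreds or
# thousands of dimensions and are learned from data.
embeddings = {
    "dog":     np.array([10.3, 4.5, 1.2]),
    "puppy":   np.array([ 9.7, 5.1, 0.9]),
    "bicycle": np.array([ 1.2, 9.8, 7.4]),
}

def cosine_similarity(a, b):
    # Values near 1.0 mean the vectors point in similar directions (semantically similar).
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

print(cosine_similarity(embeddings["dog"], embeddings["puppy"]))    # close to 1
print(cosine_similarity(embeddings["dog"], embeddings["bicycle"]))  # noticeably lower
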
Note

We've used only three dimensions, because that's easy to visualize. In reality, encoders in transformer networks create vectors with many more dimensions, defining complex semantic relationships between tokens based on linear algebraic calculations. The math involved is complex, as is the architecture of a transformer model. Our goal here is just to provide a conceptual understanding of how encoding creates a model that encapsulates relationships between entities.

Multi-modal models

The success of transformers as a way to build language models has led AI researchers to consider whether the same approach would be effective for image data. The result is the development of multi-modal models, in which the model is trained using a large volume of captioned images, with no fixed labels. An image encoder extracts features from images based on pixel values and combines them with text embeddings created by a language encoder. The overall model encapsulates relationships between natural language token embeddings and image features, as shown here:

The Microsoft Florence model is just such a model. Trained with huge volumes of captioned images from the Internet, it includes both a language encoder and an image encoder. Florence is an example of a foundation model: a pre-trained general model on which you can build multiple adaptive models for specialist tasks. For example, you can use Florence as a foundation model for adaptive models that perform:

 Image classification: Identifying to which category an image belongs.
 Object detection: Locating individual objects within an image.
 Captioning: Generating appropriate descriptions of images.
 Tagging: Compiling a list of relevant text tags for an image.

Multi-modal models like Florence are at the cutting edge of computer
vision and AI in general, and are expected to drive advances in the kinds
of solution that AI makes possible.

Azure AI Vision

While you can train your own machine learning models for computer vision, the architecture for computer vision models can be complex, and you require significant volumes of training images and compute power to perform the training process.

Microsoft's Azure AI Vision service provides prebuilt and customizable computer vision models that are based on the Florence foundation model and provide various powerful capabilities. With Azure AI Vision, you can create sophisticated computer vision solutions quickly and easily, taking advantage of "off-the-shelf" functionality for many common computer vision scenarios, while retaining the ability to create custom models using your own images.

Azure resources for Azure AI Vision service

To use Azure AI Vision, you need to create a resource for it in your Azure
subscription. You can use either of the following resource types:

 Azure AI Vision: A specific resource for the Azure AI Vision service. Use this resource type if you don't intend to use any other Azure AI services, or if you want to track utilization and costs for your Azure AI Vision resource separately.
 Azure AI services: A general resource that includes Azure AI Vision along with many other Azure AI services, such as Azure AI Language, Azure AI Custom Vision, Azure AI Translator, and others. Use this resource type if you plan to use multiple AI services and want to simplify administration and development.

Analyzing images with the Azure AI Vision service

After you've created a suitable resource in your subscription, you can submit images to the Azure AI Vision service to perform a wide range of analytical tasks.

Azure AI Vision supports multiple image analysis capabilities, including:

 Optical character recognition (OCR) - extracting text from images.
 Generating captions and descriptions of images.
 Detection of thousands of common objects in images.
 Tagging visual features in images.

These tasks, and more, can be performed in Azure AI Vision Studio.
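
These capabilities can also be called programmatically. The following sketch assumes the Azure Image Analysis client library for Python (the azure-ai-vision-imageanalysis package) and an endpoint and key from your resource stored in environment variables; treat it as an outline of the general pattern rather than a complete solution:

import os
from azure.ai.vision.imageanalysis import ImageAnalysisClient
from azure.ai.vision.imageanalysis.models import VisualFeatures
from azure.core.credentials import AzureKeyCredential

# Endpoint and key from your Azure AI Vision (or Azure AI services) resource.
client = ImageAnalysisClient(
    endpoint=os.environ["VISION_ENDPOINT"],
    credential=AzureKeyCredential(os.environ["VISION_KEY"]),
)

# Request several kinds of analysis for a publicly accessible image URL (placeholder URL).
result = client.analyze_from_url(
    image_url="https://example.com/store-camera-1.jpg",
    visual_features=[
        VisualFeatures.CAPTION,
        VisualFeatures.TAGS,
        VisualFeatures.OBJECTS,
        VisualFeatures.READ,     # OCR
    ],
)

if result.caption is not None:
    print(f"Caption: {result.caption.text} ({result.caption.confidence:.2%})")

if result.tags is not None:
    for tag in result.tags.list:
        print(f"Tag: {tag.name} ({tag.confidence:.2%})")

if result.objects is not None:
    for detected in result.objects.list:
        print(f"Object: {detected.tags[0].name} at {detected.bounding_box}")

if result.read is not None:
    for block in result.read.blocks:
        for line in block.lines:
            print(f"Text: {line.text}")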

Optical character recognition

Azure AI Vision service can use optical character recognition (OCR) capabilities to detect text in images. For example, consider the following image of a nutrition label on a product in a grocery store:
The Azure AI Vision service can analyze this image and extract the
following text:

Nutrition Facts Amount Per Serving
Serving size:1 bar (40g)
Serving Per Package: 4
Total Fat 13g
Saturated Fat 1.5g
Amount Per Serving
Trans Fat 0g
calories 190
Cholesterol 0mg
ories from Fat 110
Sodium 20mg
ntDaily Values are based on
Vitamin A 50
calorie diet
Tip

You can explore Azure AI Vision's OCR capabilities further in the Read
text with Azure AI Vision module on Microsoft Learn.

Describing an image with captions

Azure AI Vision has the ability to analyze an image, evaluate the objects
that are detected, and generate a human-readable phrase or sentence
that can describe what was detected in the image. For example, consider
the following image:
Azure AI Vision returns the following caption for this image:

A man jumping on a skateboard

Detecting common objects in an image

Azure AI Vision can identify thousands of common objects in images. For


example, when used to detect objects in the skateboarder image
discussed previously, Azure AI Vision returns the following predictions:

 Skateboard (90.40%)
 Person (95.5%)

The predictions include a confidence score that indicates the probability the model has calculated for the predicted objects.

In addition to the detected object labels and their probabilities, Azure AI Vision returns bounding box coordinates that indicate the top, left, width, and height of the object detected. You can use these coordinates to determine where in the image each object was detected, like this:
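
For example, a short sketch (assuming Python with the Pillow imaging library, and using made-up labels and coordinates in the left, top, width, height form described above) might draw the bounding boxes onto the image like this:

from PIL import Image, ImageDraw

# Hypothetical detection results: label, confidence, and bounding box (left, top, width, height).
detections = [
    {"label": "person",     "confidence": 0.955, "box": (60, 20, 230, 420)},
    {"label": "skateboard", "confidence": 0.904, "box": (95, 380, 180, 90)},
]

image = Image.open("skateboarder.jpg")      # placeholder file name
draw = ImageDraw.Draw(image)

for det in detections:
    x, y, w, h = det["box"]
    # Convert left/top/width/height into the (left, top, right, bottom) rectangle Pillow expects.
    draw.rectangle([x, y, x + w, y + h], outline="magenta", width=3)
    draw.text((x, y - 12), f'{det["label"]} ({det["confidence"]:.1%})', fill="magenta")

image.save("skateboarder-annotated.jpg")
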
Tagging visual features

Azure AI Vision can suggest tags for an image based on its contents.
These tags can be associated with the image as metadata that
summarizes attributes of the image and can be useful if you want to index
an image along with a set of key terms that might be used to search for
images with specific attributes or contents.

For example, the tags returned for the skateboarder image (with
associated confidence scores) include:

 sport (99.60%)
 person (99.56%)
 footwear (98.05%)
 skating (96.27%)
 boardsport (95.58%)
 skateboarding equipment (94.43%)
 clothing (94.02%)
 wall (93.81%)
 skateboarding (93.78%)
 skateboarder (93.25%)
 individual sports (92.80%)
 street stunts (90.81%)
 balance (90.81%)
 jumping (89.87%)
 sports equipment (88.61%)
 extreme sport (88.35%)
 kickflip (88.18%)
 stunt (87.27%)
 skateboard (86.87%)
 stunt performer (85.83%)
 knee (85.30%)
 sports (85.24%)
 longboard (84.61%)
 longboarding (84.45%)
 riding (73.37%)
 skate (67.27%)
 air (64.83%)
 young (63.29%)
 outdoor (61.39%)

Training custom models

If the built-in models provided by Azure AI Vision don't meet your needs,
you can use the service to train a custom model for image
classification or object detection. Azure AI Vision builds custom models on
the pre-trained foundation model, meaning that you can train
sophisticated models by using relatively few training images.
Image classification

An image classification model is used to predict the category, or class, of an image. For example, you could train a model to determine which type of fruit is shown in an image, like this:


Apple Banana Orange

Object detection

Object detection models detect and classify objects in an image, returning bounding box coordinates to locate each object. In addition to the built-in object detection capabilities in Azure AI Vision, you can train a custom object detection model with your own images. For example, you could use photographs of fruit to train a model that detects multiple fruits in an image, like this:
Note

Details of how to use Azure AI Vision to train a custom model are beyond
the scope of this module. You can find information about custom model
training in the Azure AI Vision documentation.

Analyze images in Vision Studio


Azure AI Vision includes numerous capabilities for understanding image
content and context and extracting information from images. Azure AI
Vision Studio allows you to try out many of the capabilities of image
analysis.

In this exercise, you will use Vision Studio to analyze images using the
built-in try-it-out experiences. Suppose the fictitious retailer Northwind
Traders has decided to implement a “smart store”, in which AI services
monitor the store to identify customers requiring assistance, and direct
employees to help them. By using Azure AI Vision, images taken by
cameras throughout the store can be analyzed to provide meaningful
descriptions of what they depict.
Create an Azure AI services resource

You can use Azure AI Vision’s image analysis capabilities with an Azure AI
services multi-service resource. If you haven’t already done so, create
an Azure AI services resource in your Azure subscription.

1. In another browser tab, open the Azure portal at https://portal.azure.com, signing in with the Microsoft account associated with your Azure subscription.
2. Click the +Create a resource button and search for Azure AI services.
Select create an Azure AI services plan. You will be taken to a page to
create an Azure AI services resource. Configure it with the following
settings:
o Subscription: Your Azure subscription.
o Resource group: Select or create a resource group with a unique
name.
o Region: East US.
o Name: Enter a unique name.
o Pricing tier: Standard S0.
o By checking this box I acknowledge that I have read and
understood all the terms below: Selected.
3. Select Review + create then Create and wait for deployment to
complete.

Connect your Azure AI service resource to Vision Studio

Next, connect the Azure AI service resource you provisioned above to Vision Studio.

1. In another browser tab, navigate to Vision Studio.

2. Sign in with your account and make sure you are using the same directory as the one where you have created your Azure AI services resource.

3. On the Vision Studio home page, select View all resources under
the Getting started with Vision heading.
4. On the Select a resource to work with page, hover your mouse
cursor over the resource you created above in the list and then
check the box to the left of the resource name, then select Select
as default resource.
Note: If your resource is not listed, you may need to refresh the page.

5. Close the settings page by selecting the “x” at the top right of the
screen.

Generate captions for an image

Now you are ready to use Vision Studio to analyze images taken by a
camera in the Northwind Traders store.

Let’s look at the image captioning functionality of Azure AI Vision. Image captions are available through the Caption and Dense Captions features.

1. In a web browser, navigate to Vision Studio.

2. On the Getting started with Vision landing page, select the Image analysis tab and then select the Add captions to images tile.
3. Under the Try It Out subheading, acknowledge the resource usage
policy by reading and checking the box.

4. Select https://aka.ms/mslearn-images-for-analysis to download image-analysis.zip. Open the folder on your computer and locate the file named store-camera-1.jpg, which contains the following image:
5. Upload the store-camera-1.jpg image by dragging it to the Drag
and drop files here box, or by browsing to it on your file system.

6. Observe the generated caption text, visible in the Detected attributes panel to the right of the image.

The Caption functionality provides a single, human-readable English sentence describing the image’s content.

7. Next, use the same image to perform Dense captioning. Return to the Vision Studio home page, and as you did before, select the Image analysis tab, then select the Dense captioning tile.

The Dense Captions feature differs from the Caption capability in that it provides multiple human-readable captions for an image: one describing the image’s content, and others each covering the essential objects detected in the picture. Each detected object includes a bounding box, which defines the pixel coordinates within the image associated with the object.

8. Hover over one of the captions in the Detected attributes list and
observe what happens within the image.
Move your mouse cursor over the other captions in the list, and
notice how the bounding box shifts in the image to highlight the
portion of the image used to generate the caption.

Tagging images

The next feature you will try is the Extract Tags functionality. Extract
tags is based on thousands of recognizable objects, including living
beings, scenery, and actions.

1. Return to the home page of Vision Studio, then select the Extract
common tags from images tile under the Image analysis tab.

2. In the Choose the model you want to try out, leave Prebuilt
product vs. gap model selected. In the Choose your language,
select English or a language of your preference.

3. Open the folder containing the images you downloaded and locate
the file named store-camera-2.jpg, which looks like this:
4. Upload the store-camera-2.jpg file.

5. Review the list of tags extracted from the image and the confidence score for each in the detected attributes panel. Here the confidence score is the likelihood that the text for the detected attribute describes what is actually in the image. Notice that the list of tags includes not only objects but also actions, such as shopping, selling, and standing.
Object detection

In this task, you use the Object detection feature of Image Analysis.
Object detection detects and extracts bounding boxes based on
thousands of recognizable objects and living beings.

1. Return to the home page of Vision Studio, then select the Detect
common objects in images tile under the Image analysis tab.

2. In the Choose the model you want to try out, leave Prebuilt
product vs. gap model selected.

3. Open the folder containing the images you downloaded and locate
the file named store-camera-3.jpg, which looks like this:
4. Upload the store-camera-3.jpg file.

5. In the Detected attributes box, observe the list of detected objects and their confidence scores.

6. Hover your mouse cursor over the objects in the Detected attributes list to highlight the object’s bounding box in the image.

7. Move the Threshold value slider until a value of 70 is displayed to the right of the slider. Observe what happens to the objects in the list. The threshold slider specifies that only objects identified with a confidence score or probability greater than the threshold should be displayed, as the short code sketch after these steps illustrates.
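
The threshold slider mirrors a filter you would typically apply in your own code. Here's a minimal sketch (with made-up detection results) that keeps only predictions above a 70% confidence threshold:

# Hypothetical object detection results with confidence scores.
detections = [
    {"label": "person", "confidence": 0.96},
    {"label": "shopping cart", "confidence": 0.81},
    {"label": "bottle", "confidence": 0.52},
]

threshold = 0.70
confident = [d for d in detections if d["confidence"] > threshold]

for d in confident:
    print(f'{d["label"]}: {d["confidence"]:.0%}')   # only objects above the threshold remain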

Clean up

If you don’t intend to do more exercises, delete any resources that you no
longer need. This avoids accruing any unnecessary costs.

1. Open the Azure portal and select the resource group that contains the
resource you created.
2. Select the resource and select Delete and then Yes to confirm. The
resource is then deleted.
Learn more

To learn more about what you can do with this service, see the Azure AI
Vision page.

Check your knowledge


1. Computer vision is based on the manipulation and analysis of what kinds of values in an image?

Timestamps in photograph metadata
Pixels
Correct. Pixels are numeric values that represent shade intensity for points in the image.
Image file names

2. You want to use the Azure AI Vision service to analyze images. You also want to use the Azure AI Language service to analyze text. You want developers to require only one key and endpoint to access all of your services. What kind of resource should you create in your Azure subscription?

Azure AI Vision
Azure AI services
Correct. An Azure AI Services resource supports both Azure AI Vision and Azure AI Language.
Azure OpenAI service

3. You want to use the Azure AI Vision service to identify the location of individual items in an image. Which of the following features should you retrieve?

Objects
Correct. Azure AI Vision returns objects with a bounding box to indicate their location in the image.
Visual Tags
Dense Captions
