0% found this document useful (0 votes)

22 views19 pages

4. Structured outputs- Data types

The document discusses structured outputs in convolutional neural networks (CNNs), highlighting their ability to produce high-dimensional tensors for tasks like pixel-level classification and image segmentation. It explains how CNNs can handle varying spatial extents and different data types, including 1-D, 2-D, and 3-D representations. The document emphasizes the advantages of using CNNs for complex data relationships and processing capabilities over traditional neural networks.

Uploaded by

devanand272003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views19 pages

4. Structured outputs- Data types

Uploaded by

devanand272003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Structured outputs,

Data types

Mr. Sivadasan E T
Associate Professor
Vidya Academy of Science and Technology, Thrissur
Structured outputs
• A "structured object" in the context of
convolutional neural networks (CNNs) refers to
outputs that go beyond simple classification or
regression values.

• These outputs have complex, meaningful

relationships between their components and
typically represent high-dimensional data with
intricate patterns or structures.
Structured outputs

Convolutional networks can be used to output a high-

dimensional, structured object, rather than just
predicting a class label for a classification task or a
real value for a regression task.
High-Dimensional Tensor Output:

CNNs often emit a tensor as output.

A tensor can be seen as a multi-dimensional grid of

numbers representing probabilities, pixel intensities, or
other information.
Structured outputs
Example - Pixel-Level Classification:
Suppose a CNN produces a tensor S where:

Si,j,k represents the probability that pixel (j, k) belongs

to class i (like "car" or "person").

This enables pixel-wise classification rather than

predicting just a single class for the entire image.
Structured outputs
Image Segmentation:

By assigning a class to each pixel, CNNs can create

precise masks that outline individual objects in an
image.

Use Case: Identifying and isolating cars, roads, and

pedestrians in autonomous driving images.
Structured outputs

• Once a prediction for each pixel is made,

various methods can be used to further process
these predictions in order to obtain a
segmentation of the image into regions.
Structured outputs

• The general idea is to assume that large groups

of contiguous pixels tend to be associated with
the same label.

• Graphical models can describe the probabilistic

relationships between neighboring pixels.
Data Types

The data used with a convolutional network usually

consists of several channels.

Each channel being the observation of a different

quantity at some point in space or time.
Data Types
• One advantage to convolutional networks is that
they can also process inputs with varying spatial
extents.
• These kinds of input simply cannot be represented
by traditional, matrix multiplication-based neural
networks.
• This provides a compelling reason to use
convolutional networks even when computational
cost and overfitting are not significant issues.
Data Types

• For example, consider a collection of images,

where each image has a different width and
height.

• It is unclear how to model such inputs with a

weight matrix of fixed size.
Data Types

• Convolution is straightforward to apply; the

kernel is simply applied a different number of
times depending on the size of the input, and the
output of the convolution operation scales
accordingly.
Data Types

1-D Single Channel

• Audio waveform: The axis we convolve over

corresponds to time.

• We discretize time and measure the amplitude

of the waveform once per time step.
Data Types
1-D Multi-Channel

• This involves animating 3D characters by

changing their joint angles over time.
• Each frame records the angles of different
joints, describing the character's pose.
• In convolutional models, each data channel
represents the angle of one joint around a
specific axis.
Data Types
2-D Single Channel:

• Audio data that has been preprocessed with a

Fourier transform:

• We can transform the audio waveform into a 2D

tensor with different rows corresponding to different
frequencies and different columns corresponding to
different points in time.
Data Types
2-D Multi-Channel:

Color image data:

• One channel contains the red pixels, one the green
pixels, and one the blue pixels.

• The convolution kernel moves over both the

horizontal and vertical axes of the image, conferring
translation equivariance in both directions.
Data Types

3-D Single Channel:

Volumetric data: A common source of this kind of data

is medical imaging technology, such as CT scans.
Data Types

3-D Multi-Channel:

Color video data: One axis corresponds

to time, one to the height of the video frame, and one
to the width of the video frame.
Thank You!

Embedded System_UG_Eng_3rd Yr (6)
No ratings yet
Embedded System_UG_Eng_3rd Yr (6)
259 pages
Srinivas Institute of Technology(Deep Learning)
No ratings yet
Srinivas Institute of Technology(Deep Learning)
12 pages
Artificial Intelligence Convolution Neural Networks
No ratings yet
Artificial Intelligence Convolution Neural Networks
77 pages
Powerpoint Presentation
No ratings yet
Powerpoint Presentation
53 pages
Intro to CNN
No ratings yet
Intro to CNN
93 pages
Cnns Convolution Neural Networks
No ratings yet
Cnns Convolution Neural Networks
50 pages
Convolutional Neural Networks: Jianxin Wu
No ratings yet
Convolutional Neural Networks: Jianxin Wu
35 pages
Unit 3 - Machine Learning - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Machine Learning - WWW - Rgpvnotes.in
29 pages
Unit 3
No ratings yet
Unit 3
105 pages
Computer Vision 2
No ratings yet
Computer Vision 2
62 pages
Week 11 - Convolutional
No ratings yet
Week 11 - Convolutional
78 pages
Lecture 8 Introduction To Color Image Processing
No ratings yet
Lecture 8 Introduction To Color Image Processing
60 pages
Unit 3 - Machine Learning
No ratings yet
Unit 3 - Machine Learning
27 pages
08. Chap 9-2_Convolutional Neural Network_Heechul Lim
No ratings yet
08. Chap 9-2_Convolutional Neural Network_Heechul Lim
58 pages
Deep Learning Module-04 Search Creators
No ratings yet
Deep Learning Module-04 Search Creators
17 pages
Fundamentals-of-ML-Study-Guide - M3
No ratings yet
Fundamentals-of-ML-Study-Guide - M3
20 pages
Casio Protrek 5470 Operation Manual PDF
No ratings yet
Casio Protrek 5470 Operation Manual PDF
26 pages
Ex Summary
No ratings yet
Ex Summary
51 pages
MODULE 5
No ratings yet
MODULE 5
20 pages
Valemount Directory Prospect List
No ratings yet
Valemount Directory Prospect List
45 pages
L11 Learning III Neural Network Architectures
No ratings yet
L11 Learning III Neural Network Architectures
35 pages
Deep Learning 4/7: Convolutional Neural Networks: C. de Castro, IEIIT-CNR, Cristina - Decastro@ieiit - Cnr.it
0% (1)
Deep Learning 4/7: Convolutional Neural Networks: C. de Castro, IEIIT-CNR, Cristina - Decastro@ieiit - Cnr.it
49 pages
Lecture2 CNN Network Design
No ratings yet
Lecture2 CNN Network Design
34 pages
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
No ratings yet
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
65 pages
CONVOLUTIONAL NEURAL NETWORK
No ratings yet
CONVOLUTIONAL NEURAL NETWORK
36 pages
Cnn
No ratings yet
Cnn
123 pages
Deep Learning Module-04
No ratings yet
Deep Learning Module-04
17 pages
Lecture2.2 UnimodalRepresentations Part1 PDF
No ratings yet
Lecture2.2 UnimodalRepresentations Part1 PDF
92 pages
Autonomic Receptors Atf
No ratings yet
Autonomic Receptors Atf
26 pages
Cnn
No ratings yet
Cnn
73 pages
Unit 3
No ratings yet
Unit 3
80 pages
Cnn
No ratings yet
Cnn
32 pages
AIML_ECE_UNIT-5
No ratings yet
AIML_ECE_UNIT-5
48 pages
CNN 1
No ratings yet
CNN 1
9 pages
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
No ratings yet
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
7 pages
Unit Iii Convolutional Networks and Sequence Modelling
No ratings yet
Unit Iii Convolutional Networks and Sequence Modelling
38 pages
Prelim Exam - Calculus Based Physics 1
No ratings yet
Prelim Exam - Calculus Based Physics 1
17 pages
Military AI-Week 05-AI in Computer Vision
No ratings yet
Military AI-Week 05-AI in Computer Vision
65 pages
1. Introduction to neural networks -Single layer perceptrons - Modified
No ratings yet
1. Introduction to neural networks -Single layer perceptrons - Modified
26 pages
PNAL9_CNNs
No ratings yet
PNAL9_CNNs
61 pages
Vitamin Deficiency Identification Using Image Processing
No ratings yet
Vitamin Deficiency Identification Using Image Processing
8 pages
Bird Migration: A New Understanding John H. Rappole - The ebook is ready for download with just one simple click
100% (3)
Bird Migration: A New Understanding John H. Rappole - The ebook is ready for download with just one simple click
31 pages
FODL Unit-4
No ratings yet
FODL Unit-4
46 pages
4
No ratings yet
4
5 pages
Kursus Jurulatih Utama Kurikulum Standard Sekolah Rendah (KSSR) 2011 Bahasa Inggeris-Tahun 2
No ratings yet
Kursus Jurulatih Utama Kurikulum Standard Sekolah Rendah (KSSR) 2011 Bahasa Inggeris-Tahun 2
26 pages
Unit III
No ratings yet
Unit III
89 pages
1. Introduction to deep learning- Deep feed forward network
No ratings yet
1. Introduction to deep learning- Deep feed forward network
24 pages
1. Computer Vision
No ratings yet
1. Computer Vision
20 pages
Convolutional Neural Networks - Part 1
No ratings yet
Convolutional Neural Networks - Part 1
44 pages
Unit - 2
No ratings yet
Unit - 2
51 pages
KPP 44 _ Modern Physics __ Varun JEE Advanced 2025
No ratings yet
KPP 44 _ Modern Physics __ Varun JEE Advanced 2025
6 pages
1. Recurrent Neural Networks RNN
No ratings yet
1. Recurrent Neural Networks RNN
19 pages
NN 06
No ratings yet
NN 06
18 pages
Unit 3 - Machine Learning
No ratings yet
Unit 3 - Machine Learning
29 pages
Cnnbasics 171028092801
No ratings yet
Cnnbasics 171028092801
43 pages
Computer Vision Part 2
No ratings yet
Computer Vision Part 2
5 pages
DL_UNIT_IV
No ratings yet
DL_UNIT_IV
18 pages
UNIT - 2
No ratings yet
UNIT - 2
31 pages
Variants of Cnn(page no 17-23), structured output(29-31),datatypes
No ratings yet
Variants of Cnn(page no 17-23), structured output(29-31),datatypes
31 pages
2. Activation Functions - Sigmoid- Tanh- ReLU- Softmax- Risk Minimization- Loss Function
No ratings yet
2. Activation Functions - Sigmoid- Tanh- ReLU- Softmax- Risk Minimization- Loss Function
17 pages
APA Guide
100% (1)
APA Guide
47 pages
2. Encoder-Decoder Sequence to Sequence Architechure
No ratings yet
2. Encoder-Decoder Sequence to Sequence Architechure
16 pages
Cloze Test
100% (1)
Cloze Test
18 pages
4a Convolutional Neural Networks
No ratings yet
4a Convolutional Neural Networks
56 pages
CNN and Autoencoder
No ratings yet
CNN and Autoencoder
56 pages
Free Trade and Autarky
No ratings yet
Free Trade and Autarky
21 pages
Unit 2
No ratings yet
Unit 2
20 pages
21CS743_Module4_notes
No ratings yet
21CS743_Module4_notes
15 pages
AD3501-DL-Unit 2
No ratings yet
AD3501-DL-Unit 2
33 pages
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
No ratings yet
JNTUK R20 UNIT-IV DEEP LEARNING TECHNIQUES-www - Jntumaterials.co - in
26 pages
M4_IA2
No ratings yet
M4_IA2
6 pages
Neural Networks and Deep Learning (PE - V) (18CSE23) Unit - 4
No ratings yet
Neural Networks and Deep Learning (PE - V) (18CSE23) Unit - 4
11 pages
A 24-Ghz Full-360 ° Cmos Reflection-Type Phase Shifter Mmic With Low Loss-Variation
No ratings yet
A 24-Ghz Full-360 ° Cmos Reflection-Type Phase Shifter Mmic With Low Loss-Variation
4 pages
UNIT-III DeepLearning Notes
No ratings yet
UNIT-III DeepLearning Notes
30 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
Supply Chain Management of Covid Vaccines in India
No ratings yet
Supply Chain Management of Covid Vaccines in India
22 pages
3. AdaGrad- RMSProp- Adam
No ratings yet
3. AdaGrad- RMSProp- Adam
9 pages
Mks Mini Datasheet
100% (1)
Mks Mini Datasheet
6 pages
Rizal CHAPTER 2
No ratings yet
Rizal CHAPTER 2
21 pages
Design and Fiber Installation For University Campus System
No ratings yet
Design and Fiber Installation For University Campus System
7 pages
2. Speech Recognition
No ratings yet
2. Speech Recognition
7 pages
JHA For Pipe Scrap Loading and Unloading
No ratings yet
JHA For Pipe Scrap Loading and Unloading
5 pages
4th Unit Aktu Machine Learning
No ratings yet
4th Unit Aktu Machine Learning
9 pages
Software Design
No ratings yet
Software Design
5 pages
21CS743_DL_Module4_notes
No ratings yet
21CS743_DL_Module4_notes
7 pages
Renaissance Movement
No ratings yet
Renaissance Movement
4 pages
Deep LearningUNIT-IV
No ratings yet
Deep LearningUNIT-IV
16 pages
1.Introduction to Animal Classification
No ratings yet
1.Introduction to Animal Classification
4 pages
Policy On Response To Access To Patient Records
No ratings yet
Policy On Response To Access To Patient Records
3 pages
Bees, Pollination
No ratings yet
Bees, Pollination
4 pages
SJPO2011 Results
No ratings yet
SJPO2011 Results
8 pages
tec group ass 2 .1
No ratings yet
tec group ass 2 .1
2 pages
mergeddv
No ratings yet
mergeddv
2 pages
Introduction To Convolution Neural Network
No ratings yet
Introduction To Convolution Neural Network
6 pages
Business Management
No ratings yet
Business Management
4 pages
B.Tech CSE I Year B
No ratings yet
B.Tech CSE I Year B
1 page
Error-Correction on Non-Standard Communication Channels
From Everand
Error-Correction on Non-Standard Communication Channels
Edward A. Ratzer
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet

4. Structured outputs- Data types

Uploaded by

4. Structured outputs- Data types

Uploaded by

Structured outputs,

• These outputs have complex, meaningful

Convolutional networks can be used to output a high-

CNNs often emit a tensor as output.

A tensor can be seen as a multi-dimensional grid of

Si,j,k represents the probability that pixel (j, k) belongs

This enables pixel-wise classification rather than

By assigning a class to each pixel, CNNs can create

Use Case: Identifying and isolating cars, roads, and

• Once a prediction for each pixel is made,

• The general idea is to assume that large groups

• Graphical models can describe the probabilistic

The data used with a convolutional network usually

Each channel being the observation of a different

• For example, consider a collection of images,

• It is unclear how to model such inputs with a

• Convolution is straightforward to apply; the

1-D Single Channel

• Audio waveform: The axis we convolve over

• We discretize time and measure the amplitude

• This involves animating 3D characters by

• Audio data that has been preprocessed with a

• We can transform the audio waveform into a 2D

Color image data:

• The convolution kernel moves over both the

3-D Single Channel:

Volumetric data: A common source of this kind of data

Color video data: One axis corresponds

You might also like