0% found this document useful (0 votes)

7 views25 pages

Visual Processing

Uploaded by

inci.ahmet3814

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views25 pages

Visual Processing

Uploaded by

inci.ahmet3814

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

Visual Processing

December 2024
• Section 1: Visual Processing: Biology and Technology

• Section 2: EfficientNet

• Section 3: Fine-Tuning EfficientNet-B0 in PyTorch

1
Section 1

Visual Processing: Biology and Technology

Biology and Technology

3
Biology and Technology

Human Visual System Convolutional Neural Network (CNN)

Resolution (Retina): High spatial resolution in the fovea; low Input Resolution: Size of the input image (e.g., 224×224 pixels),
resolution in the periphery for context and movement. determining the granularity of details processed.
Width (Feature Diversity): Parallel neurons in V1 respond to Width (Filters): Filters in convolutional layers capture diverse
various features, such as edges, orientation, motion, and color. spatial patterns, e.g., edges, textures, and shapes. Wider
networks detect more diverse features.
Depth (Hierarchy): Hierarchical layers process increasingly Depth (Layer Stacking): Layers process increasingly abstract
complex informa on. V1 (basic pa erns like lines/edges) → features, e.g., detec ng edges → combining edges into textures
V2/V4 (textures and shapes) → IT (complex objects like faces). → forming shapes and objects.
Combining Features: Later areas (e.g., IT) combine patterns like 1x1 Convolution: Combines features across channels, e.g.,
"red" + "round" = "red ball." combining "red," "round," and "smooth" features into "red ball."
Receptive Fields: Small receptive fields in early layers (e.g., V1 Receptive Fields: Early layers focus on local patterns (small
detect small edges). Later layers have larger receptive fields to receptive fields). Later layers expand receptive fields for global
understand global context (e.g., object recognition). understanding (e.g., object detection).
Convolution: The biological visual system does not use "filters" Convolution: Filters (small weight matrices) are applied to input
explicitly but responds to specific patterns in the visual field. data to detect patterns through: - Mathematics: Each filter slides
Neurons in the retina and visual cortex (e.g., V1) act like over the input, performing element-wise multiplications and
localized processors, responding to specific spatial patterns, summing them up (convolution operation). - Feature Maps: The
such as edges or orientations, within their receptive fields. output is a feature map that highlights locations where the filter
These neurons act as pattern detectors, similar to how detects patterns (e.g., edges, curves). Just like biological vision,
convolution processes small areas of an image. early layers detect basic patterns, while deeper layers combine
them to identify complex shapes and objects.
Non-Linearity: Visual neurons have thresholds; firing happens Non-Linearity: Activation functions (e.g., ReLU) introduce
only after sufficient stimulus activation. thresholds in CNNs to capture complex patterns.

4
Resolution: Detail vs. Efficiency

Human Visual System Convolutional Neural Networks

 Fovea vs. Periphery:  Input Resolution:

High-resolution in the fovea for detailed central vision (e.g., reading or The size of input images (e.g., 224×224 pixels) determines the
recognizing a face).Low-resolution in peripheral vision, optimized for network's capacity to capture fine details. Higher resolution inputs
detecting movement and broader spatial awareness. retain more information but demand higher computational power.
 Dynamic Focus:  Resolution Trade-offs:
Eyes focus dynamically, adjusting resolution for context (e.g., tracking a Lower resolutions reduce computational load but may sacrifice finer
fast-moving object vs. identifying fine details). details. Efficient architectures balance resolution with feature
extraction (e.g., EfficientNet scales input resolution proportionally to
 Neural Efficiency: Neural circuits optimize processing by discarding width and depth).
unnecessary details and focusing on relevant patterns.

Biology-Inspired Design:
• CNN architectures mimic the human strategy: focus
computational resources where detail matters most.
• Use layered hierarchies to process critical information
progressively.

5
Width: Detecting Diverse Features

6
Depth: Building Complexity

7
Depth: Building Complexity

These operations work in tandem: Convolution extracts patterns,

and subsampling refines the data to ensure computational
efficiency and robustness in recognizing features. This hierarchical
process is similar to the human visual pathway, where early stages
detect local patterns (like edges) and later stages combine these
patterns to form a global understanding.

8
Scaling in Biological and Computational Vision Systems

• Fovea Centralis: Located at the center of the

• Baseline Network: A simple model with initial depth,
retina, the fovea contains a high density of
width, and input resolution.
cone photoreceptors, enabling sharp central
vision and fine detail discrimination. • Width Scaling: Increases the number of channels in
each layer to capture more features.
• Retina: As one moves away from the fovea,
the density of cone cells decreases, and rod • Depth Scaling: Adds more layers to capture more
photoreceptors become more prevalent. This complex patterns.
arrangement supports peripheral vision, • Resolution Scaling: Uses higher resolution images to
which is more sensitive to motion and retain more fine-grained information.
functions better in low-light conditions but
offers lower resolution. • Compound Scaling: Combines all three scaling methods
in a balanced way to optimize performance and
efficiency.
9
Bottlenecks and Efficiency

10
Section 2

EfficientNet
Introduction to EfficientNet

12
The Challenge of Scaling in CNNs

13
Compound Scaling

14
EfficientNet-B0 Architecture

15
Design Assessment

16
Section 3

Fine-Tuning EfficientNet-B0 in PyTorch

What is Fine-Tuning?

18
Installing and Loading EfficientNet-B0

# Install the necessary library

!pip install torchvision efficientnet_pytorch

# Import EfficientNet from PyTorch

from efficientnet_pytorch import EfficientNet

# Load pre-trained EfficientNet-B0

model = EfficientNet.from_pretrained('efficientnet-b0')

19
Preparing the Dataset

from torchvision import datasets, transforms

# Define transformations
data_transforms = transforms.Compose([
transforms.Resize((224, 224)),
transforms.ToTensor(),
transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

# Load dataset
train_dataset = datasets.ImageFolder('path/to/train', transform=data_transforms)
train_loader = torch.utils.data.DataLoader(train_dataset, batch_size=32, shuffle=True)

20
Modifying the Classifier

import torch.nn as nn

# Modify the classifier

num_classes = 2 # For binary classification
model._fc = nn.Linear(model._fc.in_features, num_classes)

21
Setting Up Fine-Tuning

22
Training the Model

23
Evaluating the Model

Generative Ai With Python Harnessing The Power of Machine Learning and Deep Learning To Build Creative and Intelligent Systems
100% (1)
Generative Ai With Python Harnessing The Power of Machine Learning and Deep Learning To Build Creative and Intelligent Systems
239 pages
Deep Learning 2024
100% (1)
Deep Learning 2024
16 pages
Computer Vision 2011
100% (1)
Computer Vision 2011
103 pages
Convolutional Neural PDF
No ratings yet
Convolutional Neural PDF
187 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Unit 3
No ratings yet
Unit 3
21 pages
Application of Fake News Detection in Stock Market Analyzer and Predictor Using Sentiment Analysis
No ratings yet
Application of Fake News Detection in Stock Market Analyzer and Predictor Using Sentiment Analysis
7 pages
Adding Conditional Control To Text-to-Image Diffusion Models
No ratings yet
Adding Conditional Control To Text-to-Image Diffusion Models
12 pages
Cnns Convolution Neural Networks
No ratings yet
Cnns Convolution Neural Networks
50 pages
CV Unit V
No ratings yet
CV Unit V
18 pages
17 Redundancy Avoidance For Big Data in Data Centers
100% (1)
17 Redundancy Avoidance For Big Data in Data Centers
3 pages
A Beginner's Guide To Understanding Convolutional Neural Networks Part 1 - Adit Deshpande - CS Under
100% (1)
A Beginner's Guide To Understanding Convolutional Neural Networks Part 1 - Adit Deshpande - CS Under
14 pages
Cse (Convolutional Neural Network) PPT+Questions
No ratings yet
Cse (Convolutional Neural Network) PPT+Questions
18 pages
Logical It Networking N
No ratings yet
Logical It Networking N
21 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
CompVisNotes PDF
No ratings yet
CompVisNotes PDF
115 pages
Learning Opencv 3 Computer Vision With Python Up
No ratings yet
Learning Opencv 3 Computer Vision With Python Up
49 pages
Week5 Computer Vision
No ratings yet
Week5 Computer Vision
58 pages
4a Convolutional Neural Networks
No ratings yet
4a Convolutional Neural Networks
56 pages
Chapitre 8 2024
No ratings yet
Chapitre 8 2024
231 pages
20CT1153
No ratings yet
20CT1153
2 pages
DL Unit3 1
No ratings yet
DL Unit3 1
67 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
Major Report
No ratings yet
Major Report
27 pages
Convolutional Neural Networks Notes
No ratings yet
Convolutional Neural Networks Notes
29 pages
Master's Thesis Deep Learning For Visual Recognition: Remi Cadene Supervised by Nicolas Thome and Matthieu Cord
No ratings yet
Master's Thesis Deep Learning For Visual Recognition: Remi Cadene Supervised by Nicolas Thome and Matthieu Cord
58 pages
Some Important Question
No ratings yet
Some Important Question
59 pages
Unit 5a - Machine Vision
No ratings yet
Unit 5a - Machine Vision
55 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Convolutional Neural Networks As A Model of The Visual System: Past, Present, and Future
No ratings yet
Convolutional Neural Networks As A Model of The Visual System: Past, Present, and Future
15 pages
Introduction To Convolutional Neural Networks (CNNS)
No ratings yet
Introduction To Convolutional Neural Networks (CNNS)
28 pages
Introduction To AI and ML
No ratings yet
Introduction To AI and ML
22 pages
IA 3 Must Study Merged
No ratings yet
IA 3 Must Study Merged
69 pages
PEC CS 802C Deep Learning
No ratings yet
PEC CS 802C Deep Learning
13 pages
Computer Vision
No ratings yet
Computer Vision
33 pages
CV PPT Mt101
No ratings yet
CV PPT Mt101
16 pages
Computer Vision
No ratings yet
Computer Vision
28 pages
Headlight Detection
No ratings yet
Headlight Detection
16 pages
Crop Classification and Mapping For Agricultural Land From Satellite Images
No ratings yet
Crop Classification and Mapping For Agricultural Land From Satellite Images
21 pages
Syllabus
No ratings yet
Syllabus
15 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
Image Classification Using CNN: Page - 1
No ratings yet
Image Classification Using CNN: Page - 1
13 pages
Image Inpainting For Irregular Holes Using Partial Convolutions
No ratings yet
Image Inpainting For Irregular Holes Using Partial Convolutions
23 pages
Assignment-6 STC-DL
No ratings yet
Assignment-6 STC-DL
17 pages
Convnets From Thesis
No ratings yet
Convnets From Thesis
9 pages
A Survey On Deep Learning Based Crop Yield Predict
No ratings yet
A Survey On Deep Learning Based Crop Yield Predict
14 pages
Module 5
No ratings yet
Module 5
20 pages
Advanced DL Computer Vision
No ratings yet
Advanced DL Computer Vision
10 pages
Tài Liệu Không Có Tiêu Đề-2
No ratings yet
Tài Liệu Không Có Tiêu Đề-2
19 pages
AYearof Computer Vision PDF
No ratings yet
AYearof Computer Vision PDF
56 pages
Talking Avatar Application
No ratings yet
Talking Avatar Application
9 pages
Overview
No ratings yet
Overview
5 pages
Shengyi Zhao Et Al - 2021 - Tomato Leaf Disease Diagnosis Based On Improved Convolution Neural Network by
No ratings yet
Shengyi Zhao Et Al - 2021 - Tomato Leaf Disease Diagnosis Based On Improved Convolution Neural Network by
15 pages
FT04 Haghighat Independent 2023
No ratings yet
FT04 Haghighat Independent 2023
40 pages
A Comprehensive Guide To Computer Vision
No ratings yet
A Comprehensive Guide To Computer Vision
6 pages
Visual Image Understanding
No ratings yet
Visual Image Understanding
7 pages
1 s2.0 S0738081X23002687 Main
No ratings yet
1 s2.0 S0738081X23002687 Main
9 pages
Explicitly Modeling Pre-Cortical Vision With A Neuro-Inspired Front-End Improves CNN Robustness
No ratings yet
Explicitly Modeling Pre-Cortical Vision With A Neuro-Inspired Front-End Improves CNN Robustness
13 pages
DL CNN
No ratings yet
DL CNN
7 pages
Szegedy Rethinking The Inception CVPR 2016 Paper PDF
No ratings yet
Szegedy Rethinking The Inception CVPR 2016 Paper PDF
9 pages
1 s2.0 S2352914819302047 Main PDF
No ratings yet
1 s2.0 S2352914819302047 Main PDF
6 pages
Midterm: Subject: Introduction To Computer Vision
No ratings yet
Midterm: Subject: Introduction To Computer Vision
35 pages
(IJETA-V11I3P44) :santosh Kumar, Harshvardhan Tailor, Hemant Singh Jadoun, Mandeep Kumar Biloniya, Aryan Jangid
No ratings yet
(IJETA-V11I3P44) :santosh Kumar, Harshvardhan Tailor, Hemant Singh Jadoun, Mandeep Kumar Biloniya, Aryan Jangid
4 pages
SoS'25 Midterm - Report
No ratings yet
SoS'25 Midterm - Report
14 pages
Computer Vision Part 2
No ratings yet
Computer Vision Part 2
5 pages
Ch-3 Convolutional Neural Networks (CNNS)
No ratings yet
Ch-3 Convolutional Neural Networks (CNNS)
11 pages
Dl-Unit 5
No ratings yet
Dl-Unit 5
62 pages
Computer Vision Revision Notes - 250322 - 101703
No ratings yet
Computer Vision Revision Notes - 250322 - 101703
4 pages
CNN Test Answers
No ratings yet
CNN Test Answers
8 pages
Unit4 - 3 CNN
No ratings yet
Unit4 - 3 CNN
19 pages
Real TimeObjectDetectionusingYOLOAreview
No ratings yet
Real TimeObjectDetectionusingYOLOAreview
7 pages
DR Synopsis
No ratings yet
DR Synopsis
5 pages
Music Source Separation Presentation - PPSX
No ratings yet
Music Source Separation Presentation - PPSX
6 pages
Reviewer - Convolutional Neural Networks (CNNS) - Muqaddas Bin Tahir
No ratings yet
Reviewer - Convolutional Neural Networks (CNNS) - Muqaddas Bin Tahir
8 pages
Convolutional Neural Networks (CNNS) : Foundations and Applications in Visual Representation Learning
No ratings yet
Convolutional Neural Networks (CNNS) : Foundations and Applications in Visual Representation Learning
9 pages
Technical Answers For Realworld Problems (ECE3999) : Project Title: Covid-19 Analysis Through Chest X-Rays
No ratings yet
Technical Answers For Realworld Problems (ECE3999) : Project Title: Covid-19 Analysis Through Chest X-Rays
7 pages
Airbus Ship Detection - Traditional V.S. Convolutional Neural Network Approach
No ratings yet
Airbus Ship Detection - Traditional V.S. Convolutional Neural Network Approach
6 pages
Chapter 5 - CNNs - Part1
No ratings yet
Chapter 5 - CNNs - Part1
30 pages
Spatiotemporal Vision Transformer For Short Time Weather Forecasting
No ratings yet
Spatiotemporal Vision Transformer For Short Time Weather Forecasting
6 pages
Computer Vision With CNNs
No ratings yet
Computer Vision With CNNs
3 pages
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
No ratings yet
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
26 pages
Sania Technical Seminar
No ratings yet
Sania Technical Seminar
14 pages
7 Applications of Convolutional Neural Networks - FWS
No ratings yet
7 Applications of Convolutional Neural Networks - FWS
3 pages
Convolutional Layer: Web-Based Demo
No ratings yet
Convolutional Layer: Web-Based Demo
3 pages
Deep Learning U3
No ratings yet
Deep Learning U3
3 pages
CS231n Convolutional Neural Networks For Visual Recognition
No ratings yet
CS231n Convolutional Neural Networks For Visual Recognition
2 pages
Convolutional Neural Networks (CNNS)
No ratings yet
Convolutional Neural Networks (CNNS)
2 pages
Course Outline: DLCP Curriculum Walkthrough
No ratings yet
Course Outline: DLCP Curriculum Walkthrough
3 pages

Visual Processing

Uploaded by

Visual Processing

Uploaded by

Visual Processing

• Section 3: Fine-Tuning EfficientNet-B0 in PyTorch

Visual Processing: Biology and Technology

Human Visual System Convolutional Neural Network (CNN)

Human Visual System Convolutional Neural Networks

 Fovea vs. Periphery:  Input Resolution:

These operations work in tandem: Convolution extracts patterns,

• Fovea Centralis: Located at the center of the

Fine-Tuning EfficientNet-B0 in PyTorch

# Install the necessary library

# Import EfficientNet from PyTorch

# Load pre-trained EfficientNet-B0

from torchvision import datasets, transforms

# Modify the classifier

You might also like