
CS60010: Deep Learning

Spring 2023

Sudeshna Sarkar

Self-Supervised Learning
Sudeshna Sarkar
17 Mar 2023
Self-supervised Learning

• Self-supervised learning methods solve “pretext” tasks that produce good features for downstream tasks.
• Learn with supervised learning objectives, e.g., classification, regression.
• Labels of these pretext tasks are generated automatically.
Representation learning

• Learn what?
• How to learn?
• Learn from what?

[Figure: an image (coral, fish) passed through a network; layer 1 and layer 3 activations shown as successively more abstract representations, mapping the image to a compact mental image representation (“im2vec”).]

Represent an image as a neural embedding — a vector/tensor of neural activations (perhaps representing a vector of detected texture patterns or object parts).
Slide credit: Phillip Isola
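To make “im2vec” concrete, here is a minimal sketch (assuming PyTorch/torchvision; the choice of ResNet-18 and of its layer3 as the embedding layer is illustrative, not from the slides) of reading out an intermediate layer’s activations as the image’s neural embedding:

```python
# Sketch: extract a "layer 3" neural embedding from an image.
# Assumptions: PyTorch + torchvision; ResNet-18 / layer3 are illustrative.
import torch
import torchvision.models as models

model = models.resnet18(weights=None).eval()
feats = {}
# A forward hook grabs layer3's activation map and pools it into a vector.
model.layer3.register_forward_hook(
    lambda mod, inp, out: feats.update(emb=out.mean(dim=(2, 3))))

with torch.no_grad():
    model(torch.randn(1, 3, 224, 224))  # stand-in for a real image tensor
embedding = feats["emb"]                # (1, 256) vector of neural activations
```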
Investigating a representation via similarity analysis

How similar are these two images?

How about these two?

[Kriegeskorte et al. 2008]


Slide credit: Phillip Isola
Problem: Supervised Learning is Expensive!



[slide credit: Justin Johnson]
Vision in nature vs. supervised computer vision

Vision in nature (raw unlabeled training data):
+ Cheap
− Noisy
− Harder to interpret

Supervised computer vision (hand-curated training data):
+ Informative
− Expensive
− Limited to teacher’s knowledge

Slide credit: Phillip Isola


Representation Learning

Representations??

Slide credit: Phillip Isola


Unsupervised + Deep Learning

[Figure: unlabeled input data → unsupervised learning machine (trained by SGD on an objective) → pretrained “deep” representation, which must be good for transfer learning. Example objective: data dropout → prediction.]

• Unsupervised / self-supervised learning: predict one part of the data from another part.
Self-supervised pretext tasks

Learn to predict image transformations / complete corrupted images.

1. Solving the pretext tasks allows the model to learn good features.
2. We can automatically generate labels for the pretext tasks.
How to evaluate a self-supervised learning method?

1. Learn good feature extractors from self-supervised pretext tasks, e.g., predicting image rotations.
2. Evaluate the learned feature encoders on downstream target tasks (see the sketch below):
   • Attach a shallow network on top of the feature extractor;
   • train the shallow network on the target task with a small amount of labeled data.
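A minimal sketch of step 2 (assuming PyTorch; `encoder` stands for any pretrained feature extractor that outputs flat feature vectors — names and shapes are illustrative):

```python
# Sketch of the evaluation protocol: freeze the self-supervised encoder,
# attach a shallow (here: linear) head, and train only the head on labels.
# Assumption: `encoder` maps images to flat feat_dim-dimensional vectors.
import torch.nn as nn

def linear_probe(encoder, feat_dim, num_classes):
    for p in encoder.parameters():
        p.requires_grad = False          # keep pretrained features fixed
    encoder.eval()
    head = nn.Linear(feat_dim, num_classes)
    return nn.Sequential(encoder, head)  # optimize only head.parameters()
```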
Pretext task: predict rotations

Hypothesis: a model could recognize the correct rotation of an object only if it has the “visual commonsense” of what the object should look like unperturbed.
Pretext task: predict rotations

Self-supervised learning by rotating the entire input image. The model learns to predict which rotation was applied (4-way classification over 0°, 90°, 180°, 270°).
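A minimal sketch of this pretext task (assuming PyTorch; function names and the model interface are illustrative): every image in the batch is rotated four ways and the model classifies which rotation it sees.

```python
# Sketch of the rotation pretext task (4-way classification).
# Assumption: `model` outputs 4 logits per image.
import torch
import torch.nn.functional as F

def rotation_batch(images):
    """images: (N, C, H, W) -> (4N, C, H, W) rotated views, (4N,) labels."""
    views, labels = [], []
    for k in range(4):  # k * 90 degrees
        views.append(torch.rot90(images, k, dims=(2, 3)))
        labels.append(torch.full((images.size(0),), k, dtype=torch.long))
    return torch.cat(views), torch.cat(labels)

def rotation_loss(model, images):
    views, labels = rotation_batch(images)   # labels come "for free"
    logits = model(views)                    # (4N, 4)
    return F.cross_entropy(logits, labels)
```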
Pretext task: predict rotations — evaluation on semi-supervised learning

• Self-supervised learning on CIFAR10 (entire training set).
• Freeze conv1 + conv2; learn conv3 + linear layers with a subset of labeled CIFAR10 data (classification).
Transfer learned features to supervised learning

• Self-supervised learning with rotation prediction on ImageNet (entire training set) with AlexNet.
• Finetune on labeled data from Pascal VOC 2007.

[Figure: performance on Pascal VOC 2007, comparing self-supervised rotation pretraining against pretraining with full ImageNet supervision and no pretraining.]
Pretext task: predict relative patch locations

The model predicts the relative location of two patches sampled from the same image — a discriminative pretraining task.

Intuition: requires understanding objects and their parts.

Doersch et al, “Unsupervised Visual Representation Learning by Context Prediction”, ICCV 2015
[slide credit: Justin Johnson]
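A minimal sketch of how such training pairs can be generated (assuming PyTorch tensors; the patch size, gap, and sampling scheme are illustrative — the paper adds jitter and other tricks not shown here):

```python
# Sketch of relative-patch-location sampling: a center patch plus one of its
# 8 neighbors; the label is the neighbor's position (8-way classification).
# Assumption: img is a (C, H, W) tensor at least 3*(patch+gap) on each side.
import random

def sample_patch_pair(img, patch=64, gap=16):
    """Returns (center, neighbor, label in 0..7)."""
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
               (0, 1), (1, -1), (1, 0), (1, 1)]
    label = random.randrange(8)
    dy, dx = offsets[label]
    step = patch + gap                   # spacing between patch corners
    cy = random.randrange(step, img.size(1) - 2 * step)
    cx = random.randrange(step, img.size(2) - 2 * step)
    center = img[:, cy:cy + patch, cx:cx + patch]
    ny, nx = cy + dy * step, cx + dx * step
    neighbor = img[:, ny:ny + patch, nx:nx + patch]
    return center, neighbor, label
```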
Pretext task: solving “jigsaw puzzles”

(Noroozi & Favaro, 2016)
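A minimal sketch of the idea (assuming PyTorch; the random permutation set here is illustrative — Noroozi & Favaro select a fixed set of maximally different permutations):

```python
# Sketch of the jigsaw pretext task: cut an image into a 3x3 grid of tiles,
# shuffle them by one of K fixed permutations, classify which permutation.
# Assumptions: img is (C, 3*tile, 3*tile); PERMS is an illustrative stand-in.
import random
import torch

K = 100
PERMS = [torch.randperm(9) for _ in range(K)]  # fixed permutation set

def jigsaw_example(img, tile=64):
    """img -> (9, C, tile, tile) shuffled tiles, permutation label."""
    t = img.unfold(1, tile, tile).unfold(2, tile, tile)  # (C, 3, 3, tile, tile)
    t = t.reshape(img.size(0), 9, tile, tile).permute(1, 0, 2, 3)
    label = random.randrange(K)
    return t[PERMS[label]], label                        # reorder the 9 tiles
```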


Pretext task: predict missing pixels (inpainting)

Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei Efros. CVPR 2016
Feature Learning by Inpainting
Learning to inpaint by reconstruction

Learning to reconstruct the missing pixels


Context Encoders: Learning by Inpainting

Input image → Encoder φ → Decoder ψ → predict missing pixels

• L2 loss: best for feature learning.
• L2 + adversarial loss: best for nice images.

Pathak et al, “Context Encoders: Feature Learning by Inpainting”, CVPR 2016
[slide credit: Justin Johnson]
Learning to inpaint by reconstruction

• Loss = reconstruction + adversarial learning
• Adversarial loss between “real” images and inpainted images
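A minimal sketch of this combined objective (assuming PyTorch; the mask convention, the discriminator `disc`, and the weighting λ are illustrative — the paper weights the adversarial term much lower than reconstruction):

```python
# Sketch of the context-encoder objective: L2 reconstruction on the missing
# region plus an adversarial term that makes inpaintings look "real".
# Assumptions: mask is 1 on missing pixels; disc outputs real/fake logits.
import torch
import torch.nn.functional as F

def context_encoder_loss(pred, target, mask, disc, lambda_adv=0.001):
    # Reconstruction: L2 loss computed only on the masked (missing) pixels.
    rec = F.mse_loss(pred * mask, target * mask)
    # Adversarial: generator tries to make disc label inpaintings as real.
    logits = disc(pred)
    adv = F.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))
    return rec + lambda_adv * adv
```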


Inpainting evaluation

[Figure: inpainting results for the input (context) with reconstruction loss only, adversarial loss only, and reconstruction + adversarial losses.]
Pretext task: image colorization
Summary: pretext tasks from image transformations

• Pretext tasks focus on “visual common sense”, e.g., predicting rotations, inpainting, rearrangement, and colorization.
• The models are forced to learn good features about natural images, e.g., a semantic representation of an object category, in order to solve the pretext tasks.
• We don’t care about the performance of these pretext tasks, but rather how useful the learned features are for downstream tasks (classification, detection, segmentation).
• Problems: (1) coming up with individual pretext tasks is tedious, and (2) the learned representations may not be general.
Pretext tasks from image transformations

• Learned representations may be tied to a specific pretext task! Can we come up with a more general pretext task?
Contrastive representation learning

• Intuition and formulation
• Instance contrastive learning: SimCLR and MoCo
• Sequence contrastive learning: CPC
A more general pretext task?
Contrastive Representation Learning
Contrastive Learning

Assume we don’t have labels for images, but we know whether some pairs of images are similar or dissimilar.

• Similar images should have similar features; dissimilar images should have dissimilar features.
• Let d be the Euclidean distance between the features of two images x₁ and x₂. Then:

  L_S(x₁, x₂) = d²                 (pull features together)
  L_D(x₁, x₂) = max(0, m − d)²     (push features apart, up to margin m)

Hadsell et al, “Dimensionality Reduction by Learning an Invariant Mapping”, CVPR 2006
[slide credit: Justin Johnson]
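A minimal sketch of this pairwise loss (assuming PyTorch; y = 1 marks a similar pair, y = 0 a dissimilar one — names and shapes are illustrative):

```python
# Sketch of the margin-based contrastive loss (Hadsell et al., 2006):
# pull similar pairs together (d^2), push dissimilar pairs apart up to m.
# Assumptions: f1, f2 are (N, D) feature batches; y is a (N,) 0/1 tensor.
import torch
import torch.nn.functional as F

def contrastive_loss(f1, f2, y, margin=1.0):
    d = F.pairwise_distance(f1, f2)                   # Euclidean distance
    loss_sim = d.pow(2)                               # L_S = d^2
    loss_dis = torch.clamp(margin - d, min=0).pow(2)  # L_D = max(0, m-d)^2
    return (y * loss_sim + (1 - y) * loss_dis).mean()
```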
Contrastive Learning
Problem: Where to get positive and negative pairs?

Contrastive Learning with Data Augmentation

• Take a batch of N images.
• Create two augmentations of each image, giving 2N views x₁, …, x₍₂N₎.
• Extract features φ(x) for each view.
• Each image tries to predict which of the other 2N − 1 images came from the same original image.

Similarity between x_i and x_j (cosine similarity):

  s_(i,j) = φ(x_i)ᵀ φ(x_j) / (‖φ(x_i)‖ ‖φ(x_j)‖)

If (x_i, x_j) is a positive pair, then the loss for x_i is:

  L_i = −log [ exp(s_(i,j)/τ) / Σ_(k=1, k≠i)^(2N) exp(s_(i,k)/τ) ]     (τ is a temperature)

Interpretation: a cross-entropy loss over the other 2N − 1 elements in the batch!

Hadsell et al, “Dimensionality Reduction by Learning an Invariant Mapping”, CVPR 2006
Wu et al, “Unsupervised Feature Learning by Non-Parametric Instance-Level Discrimination”, CVPR 2018
Van den Oord et al, “Representation Learning with Contrastive Predictive Coding”, NeurIPS 2018
Hjelm et al, “Learning Deep Representations by Mutual Information Estimation and Maximization”, ICLR 2019
Bachman et al, “Learning Representations by Maximizing Mutual Information Across Views”, NeurIPS 2019
Tian et al, “Contrastive Multiview Coding”, ECCV 2020
Henaff et al, “Data-Efficient Image Recognition with Contrastive Predictive Coding”, ICML 2020
He et al, “Momentum Contrast for Unsupervised Visual Representation Learning”, CVPR 2020
Chen et al, “A Simple Framework for Contrastive Learning of Visual Representations”, ICML 2020

[slide credit: Justin Johnson]
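A minimal sketch of this loss (assuming PyTorch; the convention that the two augmented views of image k sit at rows 2k and 2k+1 of the feature matrix is illustrative):

```python
# Sketch of the batch contrastive loss above: cosine similarities between all
# 2N features; for each view, cross-entropy against its positive partner,
# i.e. a softmax over the other 2N-1 elements in the batch.
# Assumption: feats is (2N, D); rows 2k and 2k+1 form a positive pair.
import torch
import torch.nn.functional as F

def batch_contrastive_loss(feats, tau=0.5):
    n = feats.size(0)                               # n = 2N views
    z = F.normalize(feats, dim=1)                   # unit norm: dot = cosine
    sim = (z @ z.t()) / tau                         # s_(i,j)/tau for all pairs
    mask = torch.eye(n, dtype=torch.bool, device=feats.device)
    sim = sim.masked_fill(mask, float("-inf"))      # drop s_(i,i) terms
    pos = torch.arange(n, device=feats.device) ^ 1  # partner: 2k <-> 2k+1
    return F.cross_entropy(sim, pos)                # mean of L_i over batch
```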
