Lecture 1 Part 2
Lecture 1 Part 2
Lecture 1 - Overview
cat
cat
cat
cat
Tasks Models
No spatial extent No objects, just pixels Multiple Object This image is CC0 public domain
Running?
Jumping?
DALL-E 2
Style Transfer
Zhou et al., 3D Shape Generation and Completion through Point-Voxel Diffusion (2021) Gkioxari et al., “Mesh R-CNN”, ICCV 2019
Li et al., BEHAVIOR-1K: A Benchmark for Embodied AI with 1,000 Everyday Activities Mandlekar and Xu et al., Learning to Generalize Across
and Realistic Simulation (2022) Long-Horizon Tasks from Human Demonstrations (2020)
That’s Fei-Fei
- Lectures will not be streamed on Zoom but will be broadcasted live via Panopto
- Slides will be posted on the course website shortly before each lecture
- All lectures will be recorded and uploaded to Canvas after the lecture under the
“Panopto Course Videos” Tab.
Hands-on tutorials, with more practical details than the main lecture
For questions about assignments, final project, midterm, logistics, etc, use Ed!
Access: Canvas -> Deep Learning for Computer Vision -> Ed Discussion
SCPD students: Use your @stanford.edu address to register for Ed; contact
[email protected] for help.
- K-Nearest Neighbor
- Linear classifiers: SVM, Softmax
- Two-layer neural network
- Image features
An auto-grading system:
We will be distributing credits to all enrolled students using your AWS account
IDs