Foundations of Computer Vision Indice
Foundations of Computer Vision Indice
Antonio Torralba
Phillip Isola
William T. Freeman
This license applies only to the work in full and not to any components included with permission.
Subject to such license, all rights are reserved.
The MIT Press would like to thank the anonymous peer reviewers who provided comments on
drafts of this book. The generous work of academic experts is essential for establishing the
authority and quality of our publications. We acknowledge with gratitude the contributions of these
otherwise uncredited readers.
ISBN: 978-0-262-04897-2
10 9 8 7 6 5 4 3 2 1
d_r0
Dedicated to all the pixels.
Contents
Preface
Notation
1 The Challenge of Vision
I FOUNDATIONS
2 A Simple Vision System
3 Looking at Images
4 Computer Vision and Society
II IMAGE FORMATION
5 Imaging
6 Lenses
7 Cameras as Linear Systems
8 Color
III FOUNDATIONS OF LEARNING
9 Introduction to Learning
10 Gradient-Based Learning Algorithms
11 The Problem of Generalization
12 Neural Networks
13 Neural Networks as Distribution Transformers
14 Backpropagation
IV FOUNDATIONS OF IMAGE PROCESSING
15 Linear Image Filtering
16 Fourier Analysis
V LINEAR FILTERS
17 Blur Filters
18 Image Derivatives
19 Temporal Filters
VI SAMPLING AND MULTISCALE IMAGE REPRESENTATIONS
20 Image Sampling and Aliasing
21 Downsampling and Upsampling Images
22 Filter Banks
23 Image Pyramids
VII NEURAL ARCHITECTURES FOR VISION
24 Convolutional Neural Nets
25 Recurrent Neural Nets
26 Transformers
VIII PROBABILISTIC MODELS OF IMAGES
27 Statistical Image Models
28 Textures
29 Probabilistic Graphical Models
IX GENERATIVE IMAGE MODELS AND REPRESENTATION
LEARNING
30 Representation Learning
31 Perceptual Grouping
32 Generative Models
33 Generative Modeling Meets Representation Learning
34 Conditional Generative Models
X CHALLENGES IN LEARNING-BASED VISION
35 Data Bias and Shift
36 Training for Robustness and Generality
37 Transfer Learning and Adaptation
XI UNDERSTANDING GEOMETRY
38 Representing Images and Geometry
39 Camera Modeling and Calibration
40 Stereo Vision
41 Homographies
42 Single View Metrology
43 Learning to Estimate Depth from a Single Image
44 Multiview Geometry and Structure from Motion
45 Radiance Fields
XII UNDERSTANDING MOTION
46 Motion Estimation
47 3D Motion and Its 2D Projection
48 Optical Flow Estimation
49 Learning to Estimate Motion
XIII UNDERSTANDING VISION WITH LANGUAGE
50 Object Recognition
51 Vision and Language
XIV ON RESEARCH, WRITING AND SPEAKING
52 How to Do Research
53 How to Write Papers
54 How to Give Talks
XV CLOSING REMARKS
55 A Simple Vision System—Revisited
Bibliography
Index