Foundations of Computer Vision: Contents

This document reproduces the front matter and table of contents of the book 'Foundations of Computer Vision' by Antonio Torralba, Phillip Isola, and William T. Freeman, published by The MIT Press in the Adaptive Computation and Machine Learning series (Francis Bach, series editor). The book covers a broad range of topics in computer vision, including image formation, foundations of learning, neural architectures for vision, and challenges in learning-based vision.


Foundations of Computer Vision

Adaptive Computation and Machine Learning series


Francis Bach, editor

A complete list of books published in the Adaptive Computation and Machine Learning series appears at the back of this book.

Foundations of Computer Vision

Antonio Torralba
Phillip Isola
William T. Freeman

The MIT Press


Cambridge, Massachusetts
London, England
© 2024 Antonio Torralba, Phillip Isola, and William T. Freeman

This work is subject to a Creative Commons CC-BY-NC-ND license.

This license applies only to the work in full and not to any components included with permission.
Subject to such license, all rights are reserved.

The MIT Press would like to thank the anonymous peer reviewers who provided comments on
drafts of this book. The generous work of academic experts is essential for establishing the
authority and quality of our publications. We acknowledge with gratitude the contributions of these
otherwise uncredited readers.

This book was set in Times New Roman by the authors.

Library of Congress Cataloging-in-Publication Data is available.

ISBN: 978-0-262-04897-2

10 9 8 7 6 5 4 3 2 1

Dedicated to all the pixels.
Contents

Preface
Notation
1 The Challenge of Vision
I FOUNDATIONS
2 A Simple Vision System
3 Looking at Images
4 Computer Vision and Society
II IMAGE FORMATION
5 Imaging
6 Lenses
7 Cameras as Linear Systems
8 Color
III FOUNDATIONS OF LEARNING
9 Introduction to Learning
10 Gradient-Based Learning Algorithms
11 The Problem of Generalization
12 Neural Networks
13 Neural Networks as Distribution Transformers
14 Backpropagation
IV FOUNDATIONS OF IMAGE PROCESSING
15 Linear Image Filtering
16 Fourier Analysis
V LINEAR FILTERS
17 Blur Filters
18 Image Derivatives
19 Temporal Filters
VI SAMPLING AND MULTISCALE IMAGE REPRESENTATIONS
20 Image Sampling and Aliasing
21 Downsampling and Upsampling Images
22 Filter Banks
23 Image Pyramids
VII NEURAL ARCHITECTURES FOR VISION
24 Convolutional Neural Nets
25 Recurrent Neural Nets
26 Transformers
VIII PROBABILISTIC MODELS OF IMAGES
27 Statistical Image Models
28 Textures
29 Probabilistic Graphical Models
IX GENERATIVE IMAGE MODELS AND REPRESENTATION LEARNING
30 Representation Learning
31 Perceptual Grouping
32 Generative Models
33 Generative Modeling Meets Representation Learning
34 Conditional Generative Models
X CHALLENGES IN LEARNING-BASED VISION
35 Data Bias and Shift
36 Training for Robustness and Generality
37 Transfer Learning and Adaptation
XI UNDERSTANDING GEOMETRY
38 Representing Images and Geometry
39 Camera Modeling and Calibration
40 Stereo Vision
41 Homographies
42 Single View Metrology
43 Learning to Estimate Depth from a Single Image
44 Multiview Geometry and Structure from Motion
45 Radiance Fields
XII UNDERSTANDING MOTION
46 Motion Estimation
47 3D Motion and Its 2D Projection
48 Optical Flow Estimation
49 Learning to Estimate Motion
XIII UNDERSTANDING VISION WITH LANGUAGE
50 Object Recognition
51 Vision and Language
XIV ON RESEARCH, WRITING AND SPEAKING
52 How to Do Research
53 How to Write Papers
54 How to Give Talks
XV CLOSING REMARKS
55 A Simple Vision System—Revisited
Bibliography
Index
