An accessible, authoritative, and up-to-date computer vision textbook offering a comprehensive introduction to the foundations of the field that incorporates the latest deep learning advances.
Machine learning has revolutionized computer vision, but the methods of today have deep roots in the history of the field. Providing a much-needed modern treatment, this accessible and up-to-date textbook comprehensively introduces the foundations of computer vision while incorporating the latest deep learning advances. Taking a holistic approach that goes beyond machine learning, it addresses fundamental issues in the task of vision and the relationship of machine vision to human perception. Foundations of Computer Vision covers topics not standard in other texts, including transformers, diffusion models, statistical image models, issues of fairness and ethics, and the research process. To emphasize intuitive learning, concepts are presented in short, lucid chapters alongside extensive illustrations, questions, and examples. Written by leaders in the field and honed by a decade of classroom experience, this engaging and highly teachable book offers an essential next-generation view of computer vision.
Antonio Torralba is Professor and Head of the AI+D faculty at the Department of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL).
Phillip Isola is Associate Professor of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL).
William T. Freeman is Thomas and Gerd Perkins Professor of Electrical Engineering and Computer Science at MIT, where he is a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL). He is also a research manager at Google Research in Cambridge, Massachusetts.