An in-depth study of how to enable computers to perceive the world using cameras, microphones, depth sensors, and other sensing modalities. Topics include the recognition of speech, objects, faces, actions, and other selected categories using a variety of methods, with a focus on machine learning and deep neural networks. Each week, students will prepare written summaries and critiques of technical papers in perception and computer vision, which will be discussed in class. Students will also work in groups to complete a set of projects that implement different types of computer perception systems, including a final project of their choice.


