Following on from the previous lecture, DeepMind Research Scientist Viorica Patraucean introduces classic computer vision tasks beyond image classification (object detection, semantic segmentation, optical flow estimation) and describes state of the art models for each, together with standard benchmarks. She discusses similar models for video processing for tasks like action recognition, tracking, and the associated challenges. In particular, she refers to recent work to make video processing more efficient, including using elements of reinforcement learning. Next, she describes various settings for self-supervised learning in uni-modal and multi-modal (vision+audio, visio+language) settings, where large scale is beneficial. Viorica ends with a discussion on open questions in vision and the role of computer vision research within the broader goal of building intelligent agents.
Download the slides here:
Find out more about how DeepMind increases access to science here:
Viorica is a Research Scientist at DeepMind, working mainly on Computer Vision related problems, with focus on video processing. She did her PhD in Toulouse, France, on statistical models for image processing, and then focused on 3D shape- and video-analysis during her postdoctoral work in Paris and Cambridge. Her dream is to contribute to creating a computational model for the human visual system.
About the lecture series:
The Deep Learning Lecture Series is a collaboration between DeepMind and the UCL Centre for Artificial Intelligence. Over the past decade, Deep Learning has evolved as the leading artificial intelligence paradigm providing us with the ability to learn complex functions from raw data at unprecedented accuracy and scale. Deep Learning has been applied to problems in object recognition, speech recognition, speech synthesis, forecasting, scientific computing, control and many more. The resulting applications are touching all of our lives in areas such as healthcare and medical research, human-computer interaction, communication, transport, conservation, manufacturing and many other fields of human endeavour. In recognition of this huge impact, the 2019 Turing Award, the highest honour in computing, was awarded to pioneers of Deep Learning.
In this lecture series, research scientists from leading AI research lab, DeepMind, deliver 12 lectures on an exciting selection of topics in Deep Learning, ranging from the fundamentals of training neural networks via advanced ideas around memory, attention, and generative modelling to the important topic of responsible innovation.