Weakly Supervised Learning from Images and Video

Presented on September 30, 2016 at 12:00 p.m. in the Marcus Nanotechnology Building, Room 1116 Ivan Laptev is a research director at INRIA Paris, France. Laptev’s main research interests include visual recognition of human actions, objects, and interactions. Runtime: 56:30 minutes Recent progress in...

Full description

Bibliographic Details
Main Author: Laptev, Ivan
Other Authors: Georgia Institute of Technology. Institute for Robotics and Intelligent Machine, INRIA
Format: Lecture
Language:unknown
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/1853/55927
Description
Summary:Presented on September 30, 2016 at 12:00 p.m. in the Marcus Nanotechnology Building, Room 1116 Ivan Laptev is a research director at INRIA Paris, France. Laptev’s main research interests include visual recognition of human actions, objects, and interactions. Runtime: 56:30 minutes Recent progress in visual recognition goes hand-in-hand with the supervised learning and large-scale training data. While the amount of existing images and videos is huge, their detailed annotation is expensive and often ambiguous. To address these problems, in this talk we will focus on weakly-supervised learning methods using incomplete and noisy supervision for training. In the first part I will discuss recognition from still images and will describe our work on weakly-supervised convolutional networks for recognizing and localizing objects and human actions. The second part of the talk will focus on the learning of human actions from videos. In particular we will consider understanding specific tasks from YouTube instruction videos and corresponding narrations. We will conclude with future challenges and opportunities of visual recognition.