ECCV 2020 Tutorial on

Visual Recognition for Images, Video, and 3D

Location: [ECCV Mini-site] [Youtube Playlist]

August 23rd (full day), 2020


Organizers





Overview

The purpose of this tutorial is to discuss popular approaches and recent advancements in the family of visual recognition tasks for different input modalities. We will cover in detail the most recent work on object recognition and scene understanding. Going beyond single images we will show current progress in video (detection and classification in video), 3D visual recognition (multi-object mesh prediction) and vision+language tasks. Our goal is to show existing connections between the techniques specialized for different input modalities and provide some insights about diverse challenges that each modality presents.

In conjunction with the tutorial we are open-sourcing three new visual recognition systems for images, videos, and 3D respectively. These PyTorch-based systems contain multiple state-of-the-art methods in the corresponding domains. In our tutorial we will pair each research talk with a talk that discusses these codebases sharing best engineering practices and showing details of implementation for each domain. We hope that such pairing will help researchers who are interested primarily in visual recognition to build and benchmark their systems easier. For researchers from different areas we hope to make SOTA recognition systems easy to incorporate in their frameworks.


Online Q&A Session Schedule

We will arrange our online Q&A session in groups. Please follow the schedule below to join the zoom meeting.

Online Q&A session 1: Saturday Evening, Aug. 22nd

---- 7:00 PM - 7:20 PM (PDT) ----

Language and Vision: Justin Johnson and Xinlei Chen

---- 7:25 PM - 7:45 PM (PDT) ----

Detection and Segmentation: Ross Girshick, Alexander Kirillov, Yuxin Wu

---- 7:50 PM - 8:10 PM (PDT) ----

Video Recognition: Christoph Feichtenhofer, Haoqi Fan, Yanghao Li

---- 8:15 PM - 8:35 PM (PDT) ----

3D Recognition: Hanbyul Joo Nikhila Ravi Georgia Gkioxari

---- 8:40 PM - 9:00 PM (PDT) ----

Representation Learning: Saining Xie, Piotr Dollár


Online Q&A session 2: Sunday Morning, Aug. 23rd

---- 8:30 AM - 8:50 AM (PDT) ----

Language and Vision: Justin Johnson and Xinlei Chen

---- 8:55 AM - 9:15 AM (PDT) ----

Detection and Segmentation: Ross Girshick, Alexander Kirillov, Yuxin Wu

---- 9:20 AM - 9:40 AM (PDT) ----

Video Recognition: Christoph Feichtenhofer, Haoqi Fan, Yanghao Li

---- 9:45 AM - 10:05 AM (PDT) ----

3D Recognition: Hanbyul Joo Nikhila Ravi Georgia Gkioxari

---- 10:10 AM - 10:30 AM (PDT) ----

Representation Learning: Saining Xie, Piotr Dollár


Contact: Saining Xie