Tutorial

ECCV 2020 Tutorial on Visual Recognition for Images, Video, and 3D
Location: [ECCV Mini-site] [Youtube Playlist] August 23rd (full day), 2020

Organizers

Ross Girshick
FAIR

Alexander Kirillov
FAIR

Yuxin Wu
FAIR

Christoph Feichtenhofer
FAIR

Haoqi Fan
FAIR

Yanghao Li
FAIR

Hanbyul Joo
FAIR

Justin Johnson
University of Michigan

Xinlei Chen
FAIR

Georgia Gkioxari
FAIR

Overview

The purpose of this tutorial is to discuss popular approaches and recent advancements in the family of visual recognition tasks for different input modalities. We will cover in detail the most recent work on object recognition and scene understanding. Going beyond single images we will show current progress in video (detection and classification in video), 3D visual recognition (multi-object mesh prediction) and vision+language tasks. Our goal is to show existing connections between the techniques specialized for different input modalities and provide some insights about diverse challenges that each modality presents.

In conjunction with the tutorial we are open-sourcing three new visual recognition systems for images, videos, and 3D respectively. These PyTorch-based systems contain multiple state-of-the-art methods in the corresponding domains. In our tutorial we will pair each research talk with a talk that discusses these codebases sharing best engineering practices and showing details of implementation for each domain. We hope that such pairing will help researchers who are interested primarily in visual recognition to build and benchmark their systems easier. For researchers from different areas we hope to make SOTA recognition systems easy to incorporate in their frameworks.

Online Q&A Session Schedule

We will arrange our online Q&A session in groups. Please follow the schedule below to join the zoom meeting.

Online Q&A session 1: Saturday Evening, Aug. 22nd

---- 7:00 PM - 7:20 PM (PDT) ----

Language and Vision: Justin Johnson and Xinlei Chen

---- 7:25 PM - 7:45 PM (PDT) ----

Detection and Segmentation: Ross Girshick, Alexander Kirillov, Yuxin Wu

---- 7:50 PM - 8:10 PM (PDT) ----

Video Recognition: Christoph Feichtenhofer, Haoqi Fan, Yanghao Li

---- 8:15 PM - 8:35 PM (PDT) ----

3D Recognition: Hanbyul Joo Nikhila Ravi Georgia Gkioxari

---- 8:40 PM - 9:00 PM (PDT) ----

Representation Learning: Saining Xie, Piotr Dollár

Online Q&A session 2: Sunday Morning, Aug. 23rd

---- 8:30 AM - 8:50 AM (PDT) ----

Language and Vision: Justin Johnson and Xinlei Chen

---- 8:55 AM - 9:15 AM (PDT) ----

Detection and Segmentation: Ross Girshick, Alexander Kirillov, Yuxin Wu

---- 9:20 AM - 9:40 AM (PDT) ----

Video Recognition: Christoph Feichtenhofer, Haoqi Fan, Yanghao Li

---- 9:45 AM - 10:05 AM (PDT) ----

3D Recognition: Hanbyul Joo Nikhila Ravi Georgia Gkioxari

---- 10:10 AM - 10:30 AM (PDT) ----

Representation Learning: Saining Xie, Piotr Dollár

Contact: Saining Xie

ECCV 2020 Tutorial on

Location: [ECCV Mini-site] [Youtube Playlist] August 23rd (full day), 2020

Organizers

Overview

Online Q&A Session Schedule

Online Q&A session 1: Saturday Evening, Aug. 22nd

Online Q&A session 2: Sunday Morning, Aug. 23rd

Location: [ECCV Mini-site] [Youtube Playlist]

August 23rd (full day), 2020