The workshop will be held on Sunday (June 16 2019, Los Angeles Time) at Hyatt Regency B Long Beach. Poster: June 16 2019,10:00-11:00 AM, Pacific Arena Ballroom (main convention center)

Introduction

Weakly supervised learning refers to a variety of studies that attempt to address the challenging pattern recognition tasks by learning from weak or imperfect supervision. Supervised learning methods including Deep Convolutional Neural Networks (DCNNs) have significantly improved the performance in many problems in the field of computer vision, thanks to the rise of large-scale annotated data set and the advance in computing hardware. However, these supervised learning approaches are notorious "data hungry", which makes them are sometimes not practical in many real-world industrial applications. We are often facing the problem that we are not able to acquire enough amount of perfect annotations (e.g., object bounding boxes and pixel-wise masks) for reliable training models. To address this problem, many efforts in so-called weakly supervised learning approaches have been made to improve the DCNNs training to deviate from traditional paths of supervised learning using imperfect data. For instance, various approaches have proposed new loss functions or novel training schemes. Weakly supervised learning is a popular research direction in Computer Vision and Machine Learning communities, many research works have been devoted to related topics, leading to rapid growth of related publications in the top-tier conferences and journals such as CVPR, ICCV, ECCV, T-IP, and T-PAMI. We organize this workshop to investigate current ways of building industry level AI system relying on learning from imperfect data. We hope this workshop will attract attention and discussions from both industry and academic people.

Call for Papers


The workshop are expected to deal with data in a weakly supervised manner, which will cover but not limit to the following topics:

  • Weakly supervised learning algorithms
  • One/few shot learning for computer vision
  • Learning with noisy webly data
  • Weakly supervised learning for medical images
  • Real world image applications, e.g. object semantic segmentation/detection/localization, scene parsing, etc.
  • Real world video applications, e.g. action recognition, event detection, object tracking, video object detection/segmentation, etc.
  • New datasets and metrics to evaluate the benefit of the weakly supervised approaches for the specific vision problems.

Format : Papers are limited to 8 pages, including figures and tables, in the CVPR style. Additional pages containing only cited references are allowed.
Location: Long Beach, United States
Submission Site: https://cmt3.research.microsoft.com/LID2019/Submission/Index
Latex/Word Templates: http://cvpr2019.thecvf.com/files/cvpr2019AuthorKit.zip
*Note: Paper should be prepared in blind-submission review-formatted template.

*Note: The challenge deadline is postponed to Jun 9, 2019 7:59:59 AM CST. (2019-05-30)

Challenge

We will organize the first Learning from Imperfect Data (LID) challenge on object semantic segmentation and scene parsing, which includes two competition tracks(challenge deadline: June 8, 2019):

Track1: Object semantic segmentation with image-level supervision

In this track, image-level annotations are provided for supervision and the target is performing pixel-level classification. The dataset is built upon the image detection track of ImageNet Large Scale Visual Recognition Competition (ILSVRC). We provide pixel-level annotations of 15K images (validation/testing: 5K/10K) from 200 basic-level categories for evaluation. To the best of our knowledge, this is the most diverse dataset to evaluate the semantic segmentation in the weakly supervised manner.

Track2: Scene parsing with point-based supervision

Beyond object segmentation, background categories such as wall, road, sky need to be further specified for the scene parsing, which is a challenging task compared with object semantic segmentation. Thus, it will be more difficult and expensive to manually annotate pixel-level mask for this task. In this track, we propose to leverage several labeled points that are much easier to obtain to guide the training process. The dataset is built upon the well-known ADE20K, which includes 20,210 training images from 150 categories. We provide the point-based annotations on the training set. Please download the data from LID Challenge Track2 data .
*Note:

  • Only point-based annotations on the training set (20K) can be used for training, dense annotaions are NOT permitted;
  • The prediction on 150 classes should have index range [0, 149] inclusive;
  • Participants can use dense annotations on the validation set (2K) for model tuning;
  • Challenge evaluation is performed on the testing (>3K) sets.
  • Evaluate: https://evalai.cloudcv.org/web/challenges/challenge-page/300/overview

Important Dates

Description Date
Paper Submission Deadline May 1, 2019
Notification to Authors May 7, 2019
Camera-Ready Deadline June 1, 2019
Challenge Deadline June 8, 2019
Poster 10:00-11:00, June 16, 2019

Workshop Schedule

Time Description
08:20-08:30 Opening remarks and welcome and the LID challenge summary
08:30-09:10 James Thewlis, PhD student, advised by Andrea Vedaldi, University of Oxford
"Learning from motion with little or no supervision"
09:20-10:00 Ming-Hsuan Yang , "Unseen Object Segmentation in Videos via Transferable Representations"
10:00-11:00 Poster, Pacific Arena Ballroom (main convention center)
10:00-10:30 coffee break
10:30-11:00 Bolei Zhou , Assistant Professor, The Chinese University of Hong Kong
"Objects Disentangled from Classifying and Synthesizing Scenes"
11:00-11:30 Yanping Huang, Software Engineer, Google Brain
"GPipe: Easy Scaling with Micro-Batch Pipeline Parallelism"
11:30-12:00 Anurag Arnab , PhD student, University of Oxford
"Learning from Weak Supervision: Panoptic Segmentation and 3D Human Pose Estimation"
12:00-14:00 Lunch
14:00-14:30 Rogerio Feris, Research Manager, IBM
"Learning More from Less: Weak Supervision and Beyond"
14:30-15:00 Bernardino Romera-Paredes, Research scientist, Google Deepmind
"Enhancing U-Nets to Segment Ambiguous Images"
15:00-15:20 coffee break
15:20-15:40 Oral talk 1: Winner of Track1: Object semantic segmentation with image-level supervision
15:40-16:00 Oral talk 2: Winner of Track2: Scene parsing with point-level supervision
16:00-16:15 Awards & Future Plans

2019 Accepted papers

#74 Missing Labels in Object Detection
#75 Utilizing the Instability in Weakly Supervised Object Detection
#76 Dense Crowd Counting Convolutional Neural Networks with Minimal Data using Semi-Supervised Dual-Goal Generative Adversarial Networks
#77 A Dual Attention Network with Semantic Embedding for Few-shot Learning
#78 A pairwise learning strategy for video-based face recognition
#79 Semantic Part RCNN for Real-World Pedestrian Detection
#80 Semi-supervised learning based on generative adversarial network: a comparison between good GAN and bad GAN approach
#81 Class Subset Selection for Partial Domain Adaptation
#82 Self-supervised Difference Detection for Refinement CRF and Seed Interpolation

Invited Speakers

Andrea Vedaldi
Associate Professor, University of Oxford
Ming-Hsuan Yang
Research Scientist, Google AI
Rogerio Feris
Research Manager, IBM T.J. Watson Research Center
Jiashi Feng
Assistant Professor, National University of Singapore
Bolei Zhou
Assistant Professor, The Chinese University of Hong Kong
Yanping Huang
Software Engineer, Google Brain
Bernardino Romera-Paredes
Research Scientist, Google Deepmind
Anurag Arnab
PhD Student, University of Oxford

Organizers

Webmaster

Zheng Lin
Ting Liu

Contact

yunchao@illinois.edu, szhengcvpr@gmail.com