Realistic Evaluation of Deep Semi-Supervised Learning Algorithms
Citations Over Time
Abstract
Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark tasks. However, we argue that these benchmarks fail to address many issues that these algorithms would face in real-world applications. After creating a unified reimplementation of various widely-used SSL techniques, we test them in a suite of experiments designed to address these issues. We find that the performance of simple baselines which do not use unlabeled data is often underreported, that SSL methods differ in sensitivity to the amount of labeled and unlabeled data, and that performance can degrade substantially when the unlabeled dataset contains out-of-class examples. To help guide SSL research towards real-world applicability, we make our unified reimplemention and evaluation platform publicly available.
Related Papers
- → Deep Residual Learning for Image Recognition(2016)216,943 cited
- → Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning(2018)2,784 cited
- → Mean teachers are better role models: Weight-averaged consistency\n targets improve semi-supervised deep learning results(2017)2,517 cited
- → MixMatch: A Holistic Approach to Semi-Supervised Learning(2019)604 cited