Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data

PyTorch Code for the following paper at ICCV2021:
Title: Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data
Authors: Yingbin Bai, Tongliang Liu
Institute: University of Sydney

Abstract

Examples that are close to the decision boundary—that we term hard examples, are essential to shape accurate classifiers. Extracting confident examples has been widely studied in the community of learning with noisy labels. However, it remains elusive how to extract hard confident examples from the noisy training data. In this paper, we propose a deep learning paradigm to solve this problem, which is built on the memorization effect of deep neural networks that they would first learn simple patterns, i.e., which are defined by these shared by multiple training examples. To extract hard confident examples that contain non-simple patterns and are entangled with the inaccurately labeled examples, we borrow the idea of momentum from physics. Specifically, we alternately update the confident examples and refine the classifier. Note that the extracted confident examples in the previous round can be exploited to learn a better classifier and that the better classifier will help identify better (and hard) confident examples. We call the approach the “Momentum of Memorization” (Me-Momentum). Empirical results on benchmark-simulated and real-world label-noise data illustrate the effectiveness of Me-Momentum for extracting hard confident examples, leading to better classification performance.

Experiments

To install requirements:

pip install -r requirements.txt

📋 Please download and place all datasets into the data directory. For Clohting1M, please run "python Clothing1m-data.npy" to generate a data file.

To run program on MNIST and CIFAR-10/100

python main.py --dataset mnist --noise_type instance --noise_rate 0.2

python main.py --dataset cifar10 --noise_type symmetric --noise_rate 0.2

python main.py --dataset cifar100 --noise_type symmetric --noise_rate 0.4

To run program on Clothing1M

python3 Clothing.py

Cite Me-Momentum

If you find the code useful in your research, please consider citing our paper:

@inproceedings{
    bai2021memomentum,
    title={Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data},
    author={Yingbin Bai and Tongliang Liu},
    booktitle={ICCV},
    year={2021},
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
common		common
networks		networks
Clothing.py		Clothing.py
ClothingData.py		ClothingData.py
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data

Abstract

Experiments

Cite Me-Momentum

About

Releases

Packages

Languages

bybeye/Me-Momentum

Folders and files

Latest commit

History

Repository files navigation

Me-Momentum: Extracting Hard Confident Examples from Noisily Labeled Data

Abstract

Experiments

Cite Me-Momentum

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages