Learning from Noisy Labels with Deep Neural Networks: A Survey

My Motivation

: 잘못 어노테이션된 레이블들

: 일부 노이즈한 레이블이 들어와도 노이즈에 강한 학습을 수행하자

Supervised Learning with Noisy Labels
- 기존의 risk minimization process는 noise-tolerant하지 않음
- DNN이 오염된 레이블을 쉽게 기억할 수 있음
- 학습된적 없는 데이터에대한 일반화가 잘 안될 수 있음
Taxonomy of Label Noise
- typical noise (independent noise)> 데이터 피쳐가 조건부독립이라 가정
  - symmetric noise (or uniform noise)
  - asymmetric noise: 특정 한 레이블로 더 많이 mis-label
  - pair noise: 완전히 특정 한 레이블로 mis-label
- instance noise (or label-dependent noise)
  - 데이터 피쳐가 의존적이라 가정
  - 아직 연구가 잘 없음
Non-deep Learning Approaches
- Data cleaning
- Probabilistic method
- Model-based method