Review of "Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms"
Understanding the concept of offline evaluation
Understanding the replay estimator
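The replay estimator can be sketched in a few lines. This is a minimal illustration, not the paper's exact algorithm or the OBP implementation: it assumes the logged data was collected by a uniformly random logging policy and that the policy being evaluated is deterministic and does not learn during replay. The names replay_estimate, toy_logs, and toy_policy are hypothetical and exist only for this example.

```python
def replay_estimate(logged_events, policy):
    """Estimate a policy's average reward by replaying logged events.

    logged_events: iterable of (context, logged_action, reward) tuples,
        assumed to come from a uniformly random logging policy.
    policy: function mapping a context to the action it would choose.

    Only events where the policy picks the same action as the log are
    kept; all other events are discarded.
    """
    matched = 0
    total_reward = 0.0
    for context, logged_action, reward in logged_events:
        if policy(context) == logged_action:
            matched += 1
            total_reward += reward
    if matched == 0:
        raise ValueError("no logged event matched the policy's choices")
    return total_reward / matched, matched


# Hypothetical toy log: (user context, shown article id, click)
toy_logs = [
    ("u1", 0, 1),
    ("u1", 1, 0),
    ("u2", 1, 1),
    ("u2", 0, 0),
    ("u1", 0, 0),
]


def toy_policy(context):
    # Hypothetical fixed policy: article 0 for u1, article 1 for everyone else.
    return 0 if context == "u1" else 1


value, n_matched = replay_estimate(toy_logs, toy_policy)
print(f"estimated reward: {value:.3f} from {n_matched} matched events")
```

Three of the five toy events match the policy's choice, so the estimate is the mean reward over those three events. Discarding unmatched events is what makes the estimate depend on the logging policy being uniformly random; with a non-uniform logging policy this simple average would no longer be unbiased.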
Hands-on practice
Replay method exercise using the OBP library
evaluate_offpolicy.ipynb