TY - GEN
T1 - Debiased offline evaluation of active learning in recommender systems
AU - Carraro, Diego
AU - Bridge, Derek
N1 - Publisher Copyright:
© FLAIRS 2020.All right reserved.
PY - 2020
Y1 - 2020
N2 - Active Learning (AL) when applied to Recommender Systems (RSs) aims at proactively acquiring additional ratings data from the RS users in order to improve subsequent recommendation quality. AL strategies are typically evaluated offline first, but the classic AL offline evaluation methodology does not take into account the bias problem in RS offline evaluation. This problem affects the evaluation of an RS, as brought to light by recent literature. But, we argue, it also affects the evaluation of AL strategies as well. For this reason, in paper, we propose a new AL offline evaluation methodology for RSs which mitigates the bias and thus facilitates a truer picture of the performances of the AL strategies under evaluation. We illustrate our proposed methodology on two datasets and with three simple and well-known AL strategies from the literature. Our experimental results differ from those reported previously in the literature, which shows the importance of our approach to AL evaluation.
AB - Active Learning (AL) when applied to Recommender Systems (RSs) aims at proactively acquiring additional ratings data from the RS users in order to improve subsequent recommendation quality. AL strategies are typically evaluated offline first, but the classic AL offline evaluation methodology does not take into account the bias problem in RS offline evaluation. This problem affects the evaluation of an RS, as brought to light by recent literature. But, we argue, it also affects the evaluation of AL strategies as well. For this reason, in paper, we propose a new AL offline evaluation methodology for RSs which mitigates the bias and thus facilitates a truer picture of the performances of the AL strategies under evaluation. We illustrate our proposed methodology on two datasets and with three simple and well-known AL strategies from the literature. Our experimental results differ from those reported previously in the literature, which shows the importance of our approach to AL evaluation.
UR - https://www.scopus.com/pages/publications/85102406116
M3 - Conference proceeding
AN - SCOPUS:85102406116
T3 - Proceedings of the 33rd International Florida Artificial Intelligence Research Society Conference, FLAIRS 2020
SP - 489
EP - 494
BT - Proceedings of the 33rd International Florida Artificial Intelligence Research Society Conference, FLAIRS 2020
A2 - Bell, Eric
A2 - Bartak, Roman
PB - The AAAI Press
T2 - 33rd International Florida Artificial Intelligence Research Society Conference, FLAIRS 2020
Y2 - 17 May 2020 through 20 May 2020
ER -