Time-dependent evaluation of recommender systems

Teresa Scheidt, Joeran Beel

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

Abstract

Evaluation of recommender systems is an actively discussed topic in the recommender system community. However, some aspects of evaluation have received little to no attention, one of them being whether evaluating recommender system algorithms with single-number metrics is sufficient. When presenting results as a single number, the only possible assumption is a stable performance over time regardless of changes in the datasets, while it intuitively seems more likely that the performance changes over time. We suggest presenting results over time, making it possible to identify trends and changes in performance as the dataset grows and changes. In this paper, we conduct an analysis of 6 algorithms on 10 datasets over time to identify the need for a time-dependent evaluation. To enable this evaluation over time, we split the datasets based on the provided timesteps into smaller subsets. At every tested timepoint we use all available data up to this timepoint, simulating a growing dataset as encountered in the realworld. Our results show that for 90% of the datasets the performance changes over time and in 60% even the ranking of algorithms changes over time.

Original languageEnglish
Title of host publicationPerspectives 2021
Subtitle of host publicationProceedings of the Perspectives on the Evaluation of Recommender Systems Workshop 2021
Volume2955
Publication statusPublished - 2021
Externally publishedYes
Event2021 Perspectives on the Evaluation of Recommender Systems Workshop, Perspectives 2021 - Amsterdam, Netherlands
Duration: 2021 Sept 25 → …

Publication series

NameCEUR Workshop Proceedings
PublisherCEUR Workshop Proceedings
ISSN (Print)1613-0073

Conference

Conference2021 Perspectives on the Evaluation of Recommender Systems Workshop, Perspectives 2021
Country/TerritoryNetherlands
CityAmsterdam
Period2021/09/25 → …

Subject classification (UKÄ)

  • Information Systems

Free keywords

  • Evaluation
  • Recommender systems
  • Time-dependent evaluation

Fingerprint

Dive into the research topics of 'Time-dependent evaluation of recommender systems'. Together they form a unique fingerprint.

Cite this