A Learn-to-Rank Approach for Predicting Road Cycling Race Outcomes

Professional road cycling is a very competitive sport, and many factors influence the outcome of the race. These factors can be internal (e.g., psychological preparedness, physiological profile of the rider, and the preparedness or fitness of the rider) or external (e.g., the weather or strategy of...

Full description

Bibliographic Details
Main Authors: de Leeuw, A.-W (Author), De Schepper, T. (Author), Hellinckx, P. (Author), Kholkine, L. (Author), Latré, S. (Author), Servotte, T. (Author), Verdonck, T. (Author)
Format: Article
Language:English
Published: Frontiers Media S.A. 2021
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02618nam a2200265Ia 4500
001 10.3389-fspor.2021.714107
008 220427s2021 CNT 000 0 und d
020 |a 26249367 (ISSN) 
245 1 0 |a A Learn-to-Rank Approach for Predicting Road Cycling Race Outcomes 
260 0 |b Frontiers Media S.A.  |c 2021 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3389/fspor.2021.714107 
520 3 |a Professional road cycling is a very competitive sport, and many factors influence the outcome of the race. These factors can be internal (e.g., psychological preparedness, physiological profile of the rider, and the preparedness or fitness of the rider) or external (e.g., the weather or strategy of the team) to the rider, or even completely unpredictable (e.g., crashes or mechanical failure). This variety makes perfectly predicting the outcome of a certain race an impossible task and the sport even more interesting. Nonetheless, before each race, journalists, ex-pro cyclists, websites and cycling fans try to predict the possible top 3, 5, or 10 riders. In this article, we use easily accessible data on road cycling from the past 20 years and the Machine Learning technique Learn-to-Rank (LtR) to predict the top 10 contenders for 1-day road cycling races. We accomplish this by mapping a relevancy weight to the finishing place in the first 10 positions. We assess the performance of this approach on 2018, 2019, and 2021 editions of six spring classic 1-day races. In the end, we compare the output of the framework with a mass fan prediction on the Normalized Discounted Cumulative Gain (NDCG) metric and the number of correct top 10 guesses. We found that our model, on average, has slightly higher performance on both metrics than the mass fan prediction. We also analyze which variables of our model have the most influence on the prediction of each race. This approach can give interesting insights to fans before a race but can also be helpful to sports coaches to predict how a rider might perform compared to other riders outside of the team. Copyright © 2021 Kholkine, Servotte, de Leeuw, De Schepper, Hellinckx, Verdonck and Latré. 
650 0 4 |a cycling race performance 
650 0 4 |a learn-to-rank 
650 0 4 |a machine learning 
650 0 4 |a road cycling 
650 0 4 |a winner prediction 
700 1 |a de Leeuw, A.-W.  |e author 
700 1 |a De Schepper, T.  |e author 
700 1 |a Hellinckx, P.  |e author 
700 1 |a Kholkine, L.  |e author 
700 1 |a Latré, S.  |e author 
700 1 |a Servotte, T.  |e author 
700 1 |a Verdonck, T.  |e author 
773 |t Frontiers in Sports and Active Living