A Goal Scoring Probability Model for Shots Based on Synchronized Positional and Event Data in Football (Soccer)

Bibliographic Details
Main Authors: Anzer, G. (Author), Bauer, P. (Author)
Format: Article
Language: English
Published: Frontiers Media S.A. 2021
Subjects:
XG
Description
Summary: Due to the low scoring nature of football (soccer), shots are often used as a proxy to evaluate team and player performances. However, not all shots are created equally and their quality differs significantly depending on the situation. The aim of this study is to objectively quantify the quality of any given shot by introducing a so-called expected goals (xG) model. This model is validated statistically and with professional match analysts. The best performing model uses an extreme gradient boosting algorithm and is based on hand-crafted features from synchronized positional and event data of 105,627 shots in the German Bundesliga. With a ranked probability score (RPS) of 0.197, it is more accurate than any previously published expected goals model. This approach allows us to assess team and player performances far more accurately than is possible with traditional metrics by focusing on process rather than results. Copyright © 2021 Anzer and Bauer.
ISSN: 2624-9367
DOI: 10.3389/fspor.2021.624475
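
Note on the RPS figure quoted in the summary: the record does not say how the score is computed, so the following is only a minimal sketch that assumes the standard ranked probability score over ordered outcome categories, averaged over shots. The function names and the sample xG values are hypothetical and are not taken from the article. Under this definition, with just two categories (no goal, goal) the score for a single shot reduces to the squared difference between the predicted probability and the observed outcome. Lower values indicate better-calibrated predictions.

```python
from typing import Sequence


def ranked_probability_score(probs: Sequence[float], outcome: int) -> float:
    """Standard RPS for one event with ordered categories.

    probs   -- predicted probability for each category, in order
    outcome -- index of the category that actually occurred
    """
    r = len(probs)
    if r < 2:
        raise ValueError("need at least two outcome categories")
    cum_pred = 0.0   # cumulative predicted probability
    cum_obs = 0.0    # cumulative observed indicator (0 or 1)
    total = 0.0
    for i in range(r - 1):
        cum_pred += probs[i]
        cum_obs += 1.0 if i == outcome else 0.0
        total += (cum_pred - cum_obs) ** 2
    return total / (r - 1)


def mean_shot_rps(xg_values: Sequence[float], goals: Sequence[int]) -> float:
    """Average RPS over shots; categories are [no goal, goal] (an assumption)."""
    scores = [
        ranked_probability_score([1.0 - xg, xg], int(goal))
        for xg, goal in zip(xg_values, goals)
    ]
    return sum(scores) / len(scores)


# Hypothetical example: two shots with made-up xG values.
# Shot 1: xG 0.1, missed -> (0.9 - 1)^2 = 0.01
# Shot 2: xG 0.5, scored -> (0.5 - 0)^2 = 0.25
example = mean_shot_rps([0.1, 0.5], [0, 1])
```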