Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic

Abstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in...

Full description

Bibliographic Details
Main Authors: Andrius Grybauskas, Vaida Pilinkienė, Alina Stundžienė
Format: Article
Language:English
Published: SpringerOpen 2021-08-01
Series:Journal of Big Data
Subjects:
TOM
Online Access:https://doi.org/10.1186/s40537-021-00476-0
id doaj-c41523bed027405d9b4e70fd1c02ac39
record_format Article
spelling doaj-c41523bed027405d9b4e70fd1c02ac392021-08-08T11:03:25ZengSpringerOpenJournal of Big Data2196-11152021-08-018112010.1186/s40537-021-00476-0Predictive analytics using Big Data for the real estate market during the COVID-19 pandemicAndrius Grybauskas0Vaida Pilinkienė1Alina Stundžienė2School of Economics and Business, Kaunas University of TechnologySchool of Economics and Business, Kaunas University of TechnologySchool of Economics and Business, Kaunas University of TechnologyAbstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in prior studies have lacked consensus, especially regarding the time-on-the-market variable, which exhibits an omnidirectional effect. However, with the rise of Big Data, this study used a web-scraping algorithm and collected a total of 18,992 property listings in the city of Vilnius during the first wave of the COVID-19 pandemic. Afterwards, 15 different machine learning models were applied to forecast apartment revisions, and the SHAP values for interpretability were used. The findings in this study coincide with the previous literature results, affirming that real estate is quite resilient to pandemics, as the price drops were not as dramatic as first believed. Out of the 15 different models tested, extreme gradient boosting was the most accurate, although the difference was negligible. The retrieved SHAP values conclude that the time-on-the-market variable was by far the most dominant and consistent variable for price revision forecasting. Additionally, the time-on-the-market variable exhibited an inverse U-shaped behaviour.https://doi.org/10.1186/s40537-021-00476-0Machine learningTOMReal estateApartmentsBig dataPandemics
collection DOAJ
language English
format Article
sources DOAJ
author Andrius Grybauskas
Vaida Pilinkienė
Alina Stundžienė
spellingShingle Andrius Grybauskas
Vaida Pilinkienė
Alina Stundžienė
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
Journal of Big Data
Machine learning
TOM
Real estate
Apartments
Big data
Pandemics
author_facet Andrius Grybauskas
Vaida Pilinkienė
Alina Stundžienė
author_sort Andrius Grybauskas
title Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
title_short Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
title_full Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
title_fullStr Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
title_full_unstemmed Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
title_sort predictive analytics using big data for the real estate market during the covid-19 pandemic
publisher SpringerOpen
series Journal of Big Data
issn 2196-1115
publishDate 2021-08-01
description Abstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in prior studies have lacked consensus, especially regarding the time-on-the-market variable, which exhibits an omnidirectional effect. However, with the rise of Big Data, this study used a web-scraping algorithm and collected a total of 18,992 property listings in the city of Vilnius during the first wave of the COVID-19 pandemic. Afterwards, 15 different machine learning models were applied to forecast apartment revisions, and the SHAP values for interpretability were used. The findings in this study coincide with the previous literature results, affirming that real estate is quite resilient to pandemics, as the price drops were not as dramatic as first believed. Out of the 15 different models tested, extreme gradient boosting was the most accurate, although the difference was negligible. The retrieved SHAP values conclude that the time-on-the-market variable was by far the most dominant and consistent variable for price revision forecasting. Additionally, the time-on-the-market variable exhibited an inverse U-shaped behaviour.
topic Machine learning
TOM
Real estate
Apartments
Big data
Pandemics
url https://doi.org/10.1186/s40537-021-00476-0
work_keys_str_mv AT andriusgrybauskas predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic
AT vaidapilinkiene predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic
AT alinastundziene predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic
_version_ 1721216317709615104