Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic
Abstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2021-08-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40537-021-00476-0 |
id |
doaj-c41523bed027405d9b4e70fd1c02ac39 |
---|---|
record_format |
Article |
spelling |
doaj-c41523bed027405d9b4e70fd1c02ac392021-08-08T11:03:25ZengSpringerOpenJournal of Big Data2196-11152021-08-018112010.1186/s40537-021-00476-0Predictive analytics using Big Data for the real estate market during the COVID-19 pandemicAndrius Grybauskas0Vaida Pilinkienė1Alina Stundžienė2School of Economics and Business, Kaunas University of TechnologySchool of Economics and Business, Kaunas University of TechnologySchool of Economics and Business, Kaunas University of TechnologyAbstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in prior studies have lacked consensus, especially regarding the time-on-the-market variable, which exhibits an omnidirectional effect. However, with the rise of Big Data, this study used a web-scraping algorithm and collected a total of 18,992 property listings in the city of Vilnius during the first wave of the COVID-19 pandemic. Afterwards, 15 different machine learning models were applied to forecast apartment revisions, and the SHAP values for interpretability were used. The findings in this study coincide with the previous literature results, affirming that real estate is quite resilient to pandemics, as the price drops were not as dramatic as first believed. Out of the 15 different models tested, extreme gradient boosting was the most accurate, although the difference was negligible. The retrieved SHAP values conclude that the time-on-the-market variable was by far the most dominant and consistent variable for price revision forecasting. Additionally, the time-on-the-market variable exhibited an inverse U-shaped behaviour.https://doi.org/10.1186/s40537-021-00476-0Machine learningTOMReal estateApartmentsBig dataPandemics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Andrius Grybauskas Vaida Pilinkienė Alina Stundžienė |
spellingShingle |
Andrius Grybauskas Vaida Pilinkienė Alina Stundžienė Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic Journal of Big Data Machine learning TOM Real estate Apartments Big data Pandemics |
author_facet |
Andrius Grybauskas Vaida Pilinkienė Alina Stundžienė |
author_sort |
Andrius Grybauskas |
title |
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic |
title_short |
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic |
title_full |
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic |
title_fullStr |
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic |
title_full_unstemmed |
Predictive analytics using Big Data for the real estate market during the COVID-19 pandemic |
title_sort |
predictive analytics using big data for the real estate market during the covid-19 pandemic |
publisher |
SpringerOpen |
series |
Journal of Big Data |
issn |
2196-1115 |
publishDate |
2021-08-01 |
description |
Abstract As the COVID-19 pandemic came unexpectedly, many real estate experts claimed that the property values would fall like the 2007 crash. However, this study raises the question of what attributes of an apartment are most likely to influence a price revision during the pandemic. The findings in prior studies have lacked consensus, especially regarding the time-on-the-market variable, which exhibits an omnidirectional effect. However, with the rise of Big Data, this study used a web-scraping algorithm and collected a total of 18,992 property listings in the city of Vilnius during the first wave of the COVID-19 pandemic. Afterwards, 15 different machine learning models were applied to forecast apartment revisions, and the SHAP values for interpretability were used. The findings in this study coincide with the previous literature results, affirming that real estate is quite resilient to pandemics, as the price drops were not as dramatic as first believed. Out of the 15 different models tested, extreme gradient boosting was the most accurate, although the difference was negligible. The retrieved SHAP values conclude that the time-on-the-market variable was by far the most dominant and consistent variable for price revision forecasting. Additionally, the time-on-the-market variable exhibited an inverse U-shaped behaviour. |
topic |
Machine learning TOM Real estate Apartments Big data Pandemics |
url |
https://doi.org/10.1186/s40537-021-00476-0 |
work_keys_str_mv |
AT andriusgrybauskas predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic AT vaidapilinkiene predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic AT alinastundziene predictiveanalyticsusingbigdatafortherealestatemarketduringthecovid19pandemic |
_version_ |
1721216317709615104 |