Home is where the ad is: online interest proxies housing demand

Abstract Online activity leaves digital traces of human behavior. In this paper we investigate if online interest can be used as a proxy of housing demand, a key yet so far mostly unobserved feature of housing markets. We analyze data from an Italian website of housing sales advertisements (ads). Fo...

Full description

Bibliographic Details
Main Authors: Marco Pangallo, Michele Loberto
Format: Article
Language:English
Published: SpringerOpen 2018-11-01
Series:EPJ Data Science
Subjects:
Online Access:http://link.springer.com/article/10.1140/epjds/s13688-018-0176-2
id doaj-951c47ea5b6e4daca7be8d451a4e35d3
record_format Article
spelling doaj-951c47ea5b6e4daca7be8d451a4e35d32020-11-24T21:33:38ZengSpringerOpenEPJ Data Science2193-11272018-11-017112410.1140/epjds/s13688-018-0176-2Home is where the ad is: online interest proxies housing demandMarco Pangallo0Michele Loberto1Institute for New Economic Thinking at the Oxford Martin School, University of OxfordDirectorate General for Economics, Statistics and Research, Banca d’ItaliaAbstract Online activity leaves digital traces of human behavior. In this paper we investigate if online interest can be used as a proxy of housing demand, a key yet so far mostly unobserved feature of housing markets. We analyze data from an Italian website of housing sales advertisements (ads). For each ad, we know the timings at which website users clicked on the ad or used the corresponding contact form. We show that low online interest—a small number of clicks/contacts on the ad relative to other ads in the same neighborhood—predicts longer time on market and higher chance of downward price revisions, and that aggregate online interest is a leading indicator of housing market liquidity and prices. As online interest affects time on market, liquidity and prices in the same way as actual demand, we deduce that it is a good proxy. We then turn to a standard econometric problem: what difference in demand is caused by a difference in price? We use machine learning to identify pairs of duplicate ads, i.e. ads that refer to the same housing unit. Under some caveats, differences in demand between the two ads can only be caused by differences in price. We find that a 1% higher price causes a 0.66% lower number of clicks.http://link.springer.com/article/10.1140/epjds/s13688-018-0176-2Online dataHousing marketEconometricsMachine learningCausality
collection DOAJ
language English
format Article
sources DOAJ
author Marco Pangallo
Michele Loberto
spellingShingle Marco Pangallo
Michele Loberto
Home is where the ad is: online interest proxies housing demand
EPJ Data Science
Online data
Housing market
Econometrics
Machine learning
Causality
author_facet Marco Pangallo
Michele Loberto
author_sort Marco Pangallo
title Home is where the ad is: online interest proxies housing demand
title_short Home is where the ad is: online interest proxies housing demand
title_full Home is where the ad is: online interest proxies housing demand
title_fullStr Home is where the ad is: online interest proxies housing demand
title_full_unstemmed Home is where the ad is: online interest proxies housing demand
title_sort home is where the ad is: online interest proxies housing demand
publisher SpringerOpen
series EPJ Data Science
issn 2193-1127
publishDate 2018-11-01
description Abstract Online activity leaves digital traces of human behavior. In this paper we investigate if online interest can be used as a proxy of housing demand, a key yet so far mostly unobserved feature of housing markets. We analyze data from an Italian website of housing sales advertisements (ads). For each ad, we know the timings at which website users clicked on the ad or used the corresponding contact form. We show that low online interest—a small number of clicks/contacts on the ad relative to other ads in the same neighborhood—predicts longer time on market and higher chance of downward price revisions, and that aggregate online interest is a leading indicator of housing market liquidity and prices. As online interest affects time on market, liquidity and prices in the same way as actual demand, we deduce that it is a good proxy. We then turn to a standard econometric problem: what difference in demand is caused by a difference in price? We use machine learning to identify pairs of duplicate ads, i.e. ads that refer to the same housing unit. Under some caveats, differences in demand between the two ads can only be caused by differences in price. We find that a 1% higher price causes a 0.66% lower number of clicks.
topic Online data
Housing market
Econometrics
Machine learning
Causality
url http://link.springer.com/article/10.1140/epjds/s13688-018-0176-2
work_keys_str_mv AT marcopangallo homeiswheretheadisonlineinterestproxieshousingdemand
AT micheleloberto homeiswheretheadisonlineinterestproxieshousingdemand
_version_ 1725952946248613888