Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation

Singular value decomposition (SVD) is ubiquitously used in recommendation systems to estimate and predict values based on latent features obtained through matrix factorization. But, oblivious of location information, SVD has limitations in predicting variables that have strong spatial autocorrelatio...

Full description

Bibliographic Details
Main Authors: Aisha Sikder, Andreas Züfle
Format: Article
Language:English
Published: MDPI AG 2020-04-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/9/5/288
id doaj-fce9bc3c6d254c10b40d6fad08feeced
record_format Article
spelling doaj-fce9bc3c6d254c10b40d6fad08feeced2020-11-25T03:04:27ZengMDPI AGISPRS International Journal of Geo-Information2220-99642020-04-01928828810.3390/ijgi9050288Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price EstimationAisha Sikder0Andreas Züfle1Department of Geography and Geoinformation Science, George Mason University, Fairfax, VA 22030, USADepartment of Geography and Geoinformation Science, George Mason University, Fairfax, VA 22030, USASingular value decomposition (SVD) is ubiquitously used in recommendation systems to estimate and predict values based on latent features obtained through matrix factorization. But, oblivious of location information, SVD has limitations in predicting variables that have strong spatial autocorrelation, such as housing prices which strongly depend on spatial properties such as the neighborhood and school districts. In this work, we build an algorithm that integrates the latent feature learning capabilities of truncated SVD with kriging, which is called SVD-Regression Kriging (SVD-RK). In doing so, we address the problem of modeling and predicting spatially autocorrelated data for recommender engines using real estate housing prices by integrating spatial statistics. We also show that SVD-RK outperforms purely latent features based solutions as well as purely spatial approaches like Geographically Weighted Regression (GWR). Our proposed algorithm, SVD-RK, integrates the results of truncated SVD as an independent variable into a regression kriging approach. We show experimentally, that latent house price patterns learned using SVD are able to improve house price predictions of ordinary kriging in areas where house prices fluctuate locally. For areas where house prices are strongly spatially autocorrelated, evident by a house pricing variogram showing that the data can be mostly explained by spatial information only, we propose to feed the results of SVD into a geographically weighted regression model to outperform the orginary kriging approach.https://www.mdpi.com/2220-9964/9/5/288spatial statisticsrecommender systemssingular value decompositionuniversal krigingregression kriging
collection DOAJ
language English
format Article
sources DOAJ
author Aisha Sikder
Andreas Züfle
spellingShingle Aisha Sikder
Andreas Züfle
Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
ISPRS International Journal of Geo-Information
spatial statistics
recommender systems
singular value decomposition
universal kriging
regression kriging
author_facet Aisha Sikder
Andreas Züfle
author_sort Aisha Sikder
title Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
title_short Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
title_full Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
title_fullStr Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
title_full_unstemmed Augmenting Geostatistics with Matrix Factorization: A Case Study for House Price Estimation
title_sort augmenting geostatistics with matrix factorization: a case study for house price estimation
publisher MDPI AG
series ISPRS International Journal of Geo-Information
issn 2220-9964
publishDate 2020-04-01
description Singular value decomposition (SVD) is ubiquitously used in recommendation systems to estimate and predict values based on latent features obtained through matrix factorization. But, oblivious of location information, SVD has limitations in predicting variables that have strong spatial autocorrelation, such as housing prices which strongly depend on spatial properties such as the neighborhood and school districts. In this work, we build an algorithm that integrates the latent feature learning capabilities of truncated SVD with kriging, which is called SVD-Regression Kriging (SVD-RK). In doing so, we address the problem of modeling and predicting spatially autocorrelated data for recommender engines using real estate housing prices by integrating spatial statistics. We also show that SVD-RK outperforms purely latent features based solutions as well as purely spatial approaches like Geographically Weighted Regression (GWR). Our proposed algorithm, SVD-RK, integrates the results of truncated SVD as an independent variable into a regression kriging approach. We show experimentally, that latent house price patterns learned using SVD are able to improve house price predictions of ordinary kriging in areas where house prices fluctuate locally. For areas where house prices are strongly spatially autocorrelated, evident by a house pricing variogram showing that the data can be mostly explained by spatial information only, we propose to feed the results of SVD into a geographically weighted regression model to outperform the orginary kriging approach.
topic spatial statistics
recommender systems
singular value decomposition
universal kriging
regression kriging
url https://www.mdpi.com/2220-9964/9/5/288
work_keys_str_mv AT aishasikder augmentinggeostatisticswithmatrixfactorizationacasestudyforhousepriceestimation
AT andreaszufle augmentinggeostatisticswithmatrixfactorizationacasestudyforhousepriceestimation
_version_ 1724681740345147392