Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data

碩士 === 國立臺灣大學 === 電機工程學研究所 === 103 === With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user...

Full description

Bibliographic Details
Main Authors: You-Yue Huang, 黃友岳
Other Authors: 鄭士康
Format: Others
Language:en_US
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/64098078022810181232
id ndltd-TW-103NTU05442019
record_format oai_dc
spelling ndltd-TW-103NTU054420192016-11-19T04:09:21Z http://ndltd.ncl.edu.tw/handle/64098078022810181232 Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data 適地性社群資料分析在犯罪預測之應用 You-Yue Huang 黃友岳 碩士 國立臺灣大學 電機工程學研究所 103 With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user information. Therefore, human mobility has recently attracted much attention to be studied through LBSNs in spatial, temporal and social aspects. Prior work in urban studies suggests that there is a strong correlation between people dynamics and crime activities. Most works used kernel density estimation to calculate crime density distribution and predicted crime occurrence with it. Due to the proliferation of social media data, some studies implement crime prediction system through Twitter records. However, there is no research on to quantify human activities. In our model, the human activities and buildings are simulated by Gowalla and Foursquare data respectively for San Francisco and Chicago and each city is characterized by a set of grids. Five temporal periods, two geographic, five social and nine categorical factors are encoded in every grid. To retrieve relevance score of all factors to crime rate, Normalized Discounted Cumulative Gain metric is introduced for ranking relevant instances. Three machine learning models (Support vector machine, linear regression and random forest) are employed to build models to evaluate the predict crimes. The result shows data sparsity can affect the precision and collaborative factors have better predictive power than individual one. And the precision might slightly dropped attributed to less relevant factors. 鄭士康 2015 學位論文 ; thesis 68 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 電機工程學研究所 === 103 === With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user information. Therefore, human mobility has recently attracted much attention to be studied through LBSNs in spatial, temporal and social aspects. Prior work in urban studies suggests that there is a strong correlation between people dynamics and crime activities. Most works used kernel density estimation to calculate crime density distribution and predicted crime occurrence with it. Due to the proliferation of social media data, some studies implement crime prediction system through Twitter records. However, there is no research on to quantify human activities. In our model, the human activities and buildings are simulated by Gowalla and Foursquare data respectively for San Francisco and Chicago and each city is characterized by a set of grids. Five temporal periods, two geographic, five social and nine categorical factors are encoded in every grid. To retrieve relevance score of all factors to crime rate, Normalized Discounted Cumulative Gain metric is introduced for ranking relevant instances. Three machine learning models (Support vector machine, linear regression and random forest) are employed to build models to evaluate the predict crimes. The result shows data sparsity can affect the precision and collaborative factors have better predictive power than individual one. And the precision might slightly dropped attributed to less relevant factors.
author2 鄭士康
author_facet 鄭士康
You-Yue Huang
黃友岳
author You-Yue Huang
黃友岳
spellingShingle You-Yue Huang
黃友岳
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
author_sort You-Yue Huang
title Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
title_short Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
title_full Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
title_fullStr Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
title_full_unstemmed Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
title_sort crime analysis and prediction with machine learning approaches from location-based social network data
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/64098078022810181232
work_keys_str_mv AT youyuehuang crimeanalysisandpredictionwithmachinelearningapproachesfromlocationbasedsocialnetworkdata
AT huángyǒuyuè crimeanalysisandpredictionwithmachinelearningapproachesfromlocationbasedsocialnetworkdata
AT youyuehuang shìdexìngshèqúnzīliàofēnxīzàifànzuìyùcèzhīyīngyòng
AT huángyǒuyuè shìdexìngshèqúnzīliàofēnxīzàifànzuìyùcèzhīyīngyòng
_version_ 1718394417584275456