Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data
碩士 === 國立臺灣大學 === 電機工程學研究所 === 103 === With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/64098078022810181232 |
id |
ndltd-TW-103NTU05442019 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTU054420192016-11-19T04:09:21Z http://ndltd.ncl.edu.tw/handle/64098078022810181232 Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data 適地性社群資料分析在犯罪預測之應用 You-Yue Huang 黃友岳 碩士 國立臺灣大學 電機工程學研究所 103 With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user information. Therefore, human mobility has recently attracted much attention to be studied through LBSNs in spatial, temporal and social aspects. Prior work in urban studies suggests that there is a strong correlation between people dynamics and crime activities. Most works used kernel density estimation to calculate crime density distribution and predicted crime occurrence with it. Due to the proliferation of social media data, some studies implement crime prediction system through Twitter records. However, there is no research on to quantify human activities. In our model, the human activities and buildings are simulated by Gowalla and Foursquare data respectively for San Francisco and Chicago and each city is characterized by a set of grids. Five temporal periods, two geographic, five social and nine categorical factors are encoded in every grid. To retrieve relevance score of all factors to crime rate, Normalized Discounted Cumulative Gain metric is introduced for ranking relevant instances. Three machine learning models (Support vector machine, linear regression and random forest) are employed to build models to evaluate the predict crimes. The result shows data sparsity can affect the precision and collaborative factors have better predictive power than individual one. And the precision might slightly dropped attributed to less relevant factors. 鄭士康 2015 學位論文 ; thesis 68 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 電機工程學研究所 === 103 === With the advancement of location-based social networks (LBSNs), users are allowed to “check in” at points of interest (POI) with mobile devices. Compared with conventional demographics, social network data increases in unprecedented pace which resulting in user information. Therefore, human mobility has recently attracted much attention to be studied through LBSNs in spatial, temporal and social aspects. Prior work in urban studies suggests that there is a strong correlation between people dynamics and crime activities. Most works used kernel density estimation to calculate crime density distribution and
predicted crime occurrence with it. Due to the proliferation of social media data, some studies implement crime prediction system through Twitter records. However, there is no research on to quantify human activities. In our model, the human activities and buildings are simulated by Gowalla and Foursquare data respectively for San Francisco and Chicago and each city is characterized by a set of grids. Five temporal periods, two geographic, five social and nine categorical factors are encoded in every grid. To retrieve relevance score of all factors to crime rate, Normalized Discounted Cumulative Gain metric is introduced for ranking relevant instances. Three machine learning models (Support vector machine, linear regression and random forest) are employed to build models to evaluate the predict crimes. The result shows data sparsity can affect the precision and collaborative factors have better predictive power than individual one. And the precision might slightly dropped attributed to less relevant factors.
|
author2 |
鄭士康 |
author_facet |
鄭士康 You-Yue Huang 黃友岳 |
author |
You-Yue Huang 黃友岳 |
spellingShingle |
You-Yue Huang 黃友岳 Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
author_sort |
You-Yue Huang |
title |
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
title_short |
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
title_full |
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
title_fullStr |
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
title_full_unstemmed |
Crime Analysis and Prediction with Machine Learning Approaches from Location-Based Social Network Data |
title_sort |
crime analysis and prediction with machine learning approaches from location-based social network data |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/64098078022810181232 |
work_keys_str_mv |
AT youyuehuang crimeanalysisandpredictionwithmachinelearningapproachesfromlocationbasedsocialnetworkdata AT huángyǒuyuè crimeanalysisandpredictionwithmachinelearningapproachesfromlocationbasedsocialnetworkdata AT youyuehuang shìdexìngshèqúnzīliàofēnxīzàifànzuìyùcèzhīyīngyòng AT huángyǒuyuè shìdexìngshèqúnzīliàofēnxīzàifànzuìyùcèzhīyīngyòng |
_version_ |
1718394417584275456 |