On Large-Scale Multi-Label Classification for POI Tagging
碩士 === 國立中央大學 === 資訊工程學系 === 105 === In recent years, mobile device become more popular. And due to convenient transportation, people have higher probability to visit strange places. It is not easy to find a point of interest in a strange places, so we need to provide an electronic map system for us...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/h4c5dp |
id |
ndltd-TW-105NCU05392105 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NCU053921052019-10-24T05:19:29Z http://ndltd.ncl.edu.tw/handle/h4c5dp On Large-Scale Multi-Label Classification for POI Tagging Kai-Qian Yang 楊鎧謙 碩士 國立中央大學 資訊工程學系 105 In recent years, mobile device become more popular. And due to convenient transportation, people have higher probability to visit strange places. It is not easy to find a point of interest in a strange places, so we need to provide an electronic map system for users. It is not enough to provide name search for users only, because the users may not know the exact name of points. They may just want to find a specific category of point, so a good electronic map system needs to provide category search service. In order to provide category search services, we need to classify all the points in the system. Because the system has many points, each item has one or more categories, so this is a large-scale multi-label classification problem. There are many kind of categories, we follow the categories defined by Chinese yellow pages. The category consists two levels. There are 29 categories in level 1and 1,287 in level 2. Because the number of points and categories are large, we need to spend much time for training classifiers and testing data. We reduce the dimension of categories to speed up training and testing. After the experiment, our method’s training time and testing time are superior to the general SVM classification, the performance in level 1 Micro-F1 is 0.813, in level 2 Micro-F1 is 0.718 all slightly lower than SVM in level 1 Micro-F1 is 0.842. In level 2 Micro-F1 is 0.783. We want to try Reweighting, Downsampling to improve performance, but the performance is not wall in large-scale data. Chia-Hui Chang 張嘉惠 2017 學位論文 ; thesis 30 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊工程學系 === 105 === In recent years, mobile device become more popular. And due to convenient transportation, people have higher probability to visit strange places. It is not easy to find a point of interest in a strange places, so we need to provide an electronic map system for users. It is not enough to provide name search for users only, because the users may not know the exact name of points. They may just want to find a specific category of point, so a good electronic map system needs to provide category search service.
In order to provide category search services, we need to classify all the points in the system. Because the system has many points, each item has one or more categories, so this is a large-scale multi-label classification problem. There are many kind of categories, we follow the categories defined by Chinese yellow pages. The category consists two levels. There are 29 categories in level 1and 1,287 in level 2. Because the number of points and categories are large, we need to spend much time for training classifiers and testing data. We reduce the dimension of categories to speed up training and testing.
After the experiment, our method’s training time and testing time are superior to the general SVM classification, the performance in level 1 Micro-F1 is 0.813, in level 2 Micro-F1 is 0.718 all slightly lower than SVM in level 1 Micro-F1 is 0.842. In level 2 Micro-F1 is 0.783. We want to try Reweighting, Downsampling to improve performance, but the performance is not wall in large-scale data.
|
author2 |
Chia-Hui Chang |
author_facet |
Chia-Hui Chang Kai-Qian Yang 楊鎧謙 |
author |
Kai-Qian Yang 楊鎧謙 |
spellingShingle |
Kai-Qian Yang 楊鎧謙 On Large-Scale Multi-Label Classification for POI Tagging |
author_sort |
Kai-Qian Yang |
title |
On Large-Scale Multi-Label Classification for POI Tagging |
title_short |
On Large-Scale Multi-Label Classification for POI Tagging |
title_full |
On Large-Scale Multi-Label Classification for POI Tagging |
title_fullStr |
On Large-Scale Multi-Label Classification for POI Tagging |
title_full_unstemmed |
On Large-Scale Multi-Label Classification for POI Tagging |
title_sort |
on large-scale multi-label classification for poi tagging |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/h4c5dp |
work_keys_str_mv |
AT kaiqianyang onlargescalemultilabelclassificationforpoitagging AT yángkǎiqiān onlargescalemultilabelclassificationforpoitagging |
_version_ |
1719276795729018880 |