Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data
Timely and efficient analysis of big data collected from various gateways installed in a smart city is an intractable problem and requires immediate priority. Given the stochastic and massive nature of big data, the existing literature often relies on artificial intelligence techniques based on info...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9057464/ |
id |
doaj-ed0761715e2f4ca4945592063a4fdbfe |
---|---|
record_format |
Article |
spelling |
doaj-ed0761715e2f4ca4945592063a4fdbfe2021-03-30T01:46:55ZengIEEEIEEE Access2169-35362020-01-018692966931010.1109/ACCESS.2020.29857349057464Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big DataHyunkyung Shin0https://orcid.org/0000-0001-6279-2726Department of Financial Mathematics, Gachon University, Seongnam, South KoreaTimely and efficient analysis of big data collected from various gateways installed in a smart city is an intractable problem and requires immediate priority. Given the stochastic and massive nature of big data, the existing literature often relies on artificial intelligence techniques based on information theory. As a new approach, this paper presents a knowledge extraction method based on an analysis of Seoul Metro's 'untraceable' ridership big data. Without identification information, the untraceable ridership data only shows the hourly accumulation of station entry and exit information. To reconstruct the missing information in the data set, this study proposes a fluid dynamics model and adopts a heuristic genetic algorithm based on optimization theory as the problem solver. The result of our model presents the distribution of the elapsed time defined on an hourly basis taken until a passenger returns to the station they departed from. To validate our model, we acquired subway ridership data with passengers' identification with permission from Seoul Metro. This paper presents two novel aspects of subway ridership, namely the dependency on departure time and the discrepancy between weekend and weekday traffic. Our analytical approach contributes to solving the problem of extracting hidden knowledge from big collection of data missing critical information, e.g., constantly and autonomously gathered data fragments from numerous gateways in smart cities.https://ieeexplore.ieee.org/document/9057464/Inverse problemgenetic algorithm (GA)optimizationwave decompositionharmony search algorithmmass conservation law |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hyunkyung Shin |
spellingShingle |
Hyunkyung Shin Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data IEEE Access Inverse problem genetic algorithm (GA) optimization wave decomposition harmony search algorithm mass conservation law |
author_facet |
Hyunkyung Shin |
author_sort |
Hyunkyung Shin |
title |
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data |
title_short |
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data |
title_full |
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data |
title_fullStr |
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data |
title_full_unstemmed |
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data |
title_sort |
analysis of subway passenger flow for a smarter city: knowledge extraction from seoul metro’s ‘untraceable’ big data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Timely and efficient analysis of big data collected from various gateways installed in a smart city is an intractable problem and requires immediate priority. Given the stochastic and massive nature of big data, the existing literature often relies on artificial intelligence techniques based on information theory. As a new approach, this paper presents a knowledge extraction method based on an analysis of Seoul Metro's 'untraceable' ridership big data. Without identification information, the untraceable ridership data only shows the hourly accumulation of station entry and exit information. To reconstruct the missing information in the data set, this study proposes a fluid dynamics model and adopts a heuristic genetic algorithm based on optimization theory as the problem solver. The result of our model presents the distribution of the elapsed time defined on an hourly basis taken until a passenger returns to the station they departed from. To validate our model, we acquired subway ridership data with passengers' identification with permission from Seoul Metro. This paper presents two novel aspects of subway ridership, namely the dependency on departure time and the discrepancy between weekend and weekday traffic. Our analytical approach contributes to solving the problem of extracting hidden knowledge from big collection of data missing critical information, e.g., constantly and autonomously gathered data fragments from numerous gateways in smart cities. |
topic |
Inverse problem genetic algorithm (GA) optimization wave decomposition harmony search algorithm mass conservation law |
url |
https://ieeexplore.ieee.org/document/9057464/ |
work_keys_str_mv |
AT hyunkyungshin analysisofsubwaypassengerflowforasmartercityknowledgeextractionfromseoulmetrox2019sx2018untraceablex2019bigdata |
_version_ |
1724186491990573056 |