Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data

Timely and efficient analysis of big data collected from various gateways installed in a smart city is an intractable problem and requires immediate priority. Given the stochastic and massive nature of big data, the existing literature often relies on artificial intelligence techniques based on info...

Bibliographic Details
Main Author: Hyunkyung Shin
Format: Article
Language: English
Published: IEEE 2020-01-01
Series: IEEE Access
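Subjects: Inverse problem; genetic algorithm (GA); optimization; wave decomposition; harmony search algorithm; mass conservation law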
Online Access: https://ieeexplore.ieee.org/document/9057464/
id doaj-ed0761715e2f4ca4945592063a4fdbfe
record_format Article
spelling doaj-ed0761715e2f4ca4945592063a4fdbfe
2021-03-30T01:46:55Z
eng
IEEE
IEEE Access
2169-3536
2020-01-01
8
69296
69310
10.1109/ACCESS.2020.2985734
9057464
Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data
Hyunkyung Shin
0
https://orcid.org/0000-0001-6279-2726
Department of Financial Mathematics, Gachon University, Seongnam, South Korea
https://ieeexplore.ieee.org/document/9057464/
Inverse problem
genetic algorithm (GA)
optimization
wave decomposition
harmony search algorithm
mass conservation law
collection DOAJ
language English
format Article
sources DOAJ
author Hyunkyung Shin
title Analysis of Subway Passenger Flow for a Smarter City: Knowledge Extraction From Seoul Metro’s ‘Untraceable’ Big Data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Timely and efficient analysis of big data collected from various gateways installed in a smart city is an intractable problem and requires immediate priority. Given the stochastic and massive nature of big data, the existing literature often relies on artificial intelligence techniques based on information theory. As a new approach, this paper presents a knowledge extraction method based on an analysis of Seoul Metro's 'untraceable' ridership big data. Without identification information, the untraceable ridership data only shows the hourly accumulation of station entry and exit information. To reconstruct the missing information in the data set, this study proposes a fluid dynamics model and adopts a heuristic genetic algorithm based on optimization theory as the problem solver. The result of our model presents the distribution of the elapsed time defined on an hourly basis taken until a passenger returns to the station they departed from. To validate our model, we acquired subway ridership data with passengers' identification with permission from Seoul Metro. This paper presents two novel aspects of subway ridership, namely the dependency on departure time and the discrepancy between weekend and weekday traffic. Our analytical approach contributes to solving the problem of extracting hidden knowledge from big collection of data missing critical information, e.g., constantly and autonomously gathered data fragments from numerous gateways in smart cities.
topic Inverse problem
genetic algorithm (GA)
optimization
wave decomposition
harmony search algorithm
mass conservation law
url https://ieeexplore.ieee.org/document/9057464/