CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions.
How can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important a...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2018-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC6059458?pdf=render |
id |
doaj-c0b36267f4744a7fa09866b475db495d |
---|---|
record_format |
Article |
spelling |
doaj-c0b36267f4744a7fa09866b475db495d2020-11-25T00:02:10ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01137e020057910.1371/journal.pone.0200579CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions.Jungwoo LeeDongjin ChoiLee SaelHow can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important applications, including building safety monitoring, health monitoring, cyber security, terrorist detection, and fake user detection in social networks. Standard tensor decomposition results are not directly interpretable and few methods that propose to increase interpretability need to be made faster, more memory efficient, and more accurate for large and quickly generated data in the online environment. We propose two versions of a fast, accurate, and directly interpretable tensor decomposition method we call CTD that is based on efficient sampling method. First is the static version of CTD, i.e., CTD-S, that provably guarantees up to 11× higher accuracy than that of the state-of-the-art method. Also, CTD-S is made up to 2.3× faster and up to 24× more memory-efficient than the state-of-the-art method by removing redundancy. Second is the dynamic version of CTD, i.e. CTD-D, which is the first interpretable dynamic tensor decomposition method ever proposed. It is also made up to 82× faster than the already fast CTD-S by exploiting factors at previous time step and by reordering operations. With CTD, we demonstrate how the results can be effectively interpreted in online distributed denial of service (DDoS) attack detection and online troll detection.http://europepmc.org/articles/PMC6059458?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jungwoo Lee Dongjin Choi Lee Sael |
spellingShingle |
Jungwoo Lee Dongjin Choi Lee Sael CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. PLoS ONE |
author_facet |
Jungwoo Lee Dongjin Choi Lee Sael |
author_sort |
Jungwoo Lee |
title |
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
title_short |
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
title_full |
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
title_fullStr |
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
title_full_unstemmed |
CTD: Fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
title_sort |
ctd: fast, accurate, and interpretable method for static and dynamic tensor decompositions. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2018-01-01 |
description |
How can we find patterns and anomalies in a tensor, i.e., multi-dimensional array, in an efficient and directly interpretable way? How can we do this in an online environment, where a new tensor arrives at each time step? Finding patterns and anomalies in multi-dimensional data have many important applications, including building safety monitoring, health monitoring, cyber security, terrorist detection, and fake user detection in social networks. Standard tensor decomposition results are not directly interpretable and few methods that propose to increase interpretability need to be made faster, more memory efficient, and more accurate for large and quickly generated data in the online environment. We propose two versions of a fast, accurate, and directly interpretable tensor decomposition method we call CTD that is based on efficient sampling method. First is the static version of CTD, i.e., CTD-S, that provably guarantees up to 11× higher accuracy than that of the state-of-the-art method. Also, CTD-S is made up to 2.3× faster and up to 24× more memory-efficient than the state-of-the-art method by removing redundancy. Second is the dynamic version of CTD, i.e. CTD-D, which is the first interpretable dynamic tensor decomposition method ever proposed. It is also made up to 82× faster than the already fast CTD-S by exploiting factors at previous time step and by reordering operations. With CTD, we demonstrate how the results can be effectively interpreted in online distributed denial of service (DDoS) attack detection and online troll detection. |
url |
http://europepmc.org/articles/PMC6059458?pdf=render |
work_keys_str_mv |
AT jungwoolee ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions AT dongjinchoi ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions AT leesael ctdfastaccurateandinterpretablemethodforstaticanddynamictensordecompositions |
_version_ |
1725439029497823232 |