Unsupervised acquisition of idiomatic units of symbolic natural language: An n-gram frequency-based approach for the chunking of news articles and tweets.
Symbolic sequential data are produced in huge quantities in numerous contexts, such as text and speech data, biometrics, genomics, financial market indexes, music sheets, and online social media posts. In this paper, an unsupervised approach for the chunking of idiomatic units of sequential text dat...
Main Authors: | Dario Borrelli, Gabriela Gongora Svartzman, Carlo Lipizzi |
---|---|
Format: | Article |
Language: | English |
Published: | Public Library of Science (PLoS), 2020-01-01 |
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0234214 |
Similar Items
- Correction: Unsupervised acquisition of idiomatic units of symbolic natural language: An n-gram frequency-based approach for the chunking of news articles and tweets.
  by: Dario Borrelli, et al.
  Published: (2021-01-01)
- Tweeting News Articles
  by: Marco Toledo Bastos, et al.
  Published: (2013-09-01)
- Temporal chunking as a mechanism for unsupervised learning of task-sets
  by: Flora Bouchacourt, et al.
  Published: (2020-03-01)
- Unsupervised Chunking Based on Graph Propagation from Bilingual Corpus
  by: Ling Zhu, et al.
  Published: (2014-01-01)
- Word2vec convolutional neural networks for classification of news articles and tweets.
  by: Beakcheol Jang, et al.
  Published: (2019-01-01)