Dissimilarities Detections in Texts Using Symbol n-grams and Word Histograms
Texts (books, novels, papers, short messages) are sequences of sentences, words, or symbols. Each author has a unique writing style, which can be characterized by a collection of attributes obtained from the author's texts. Text verification is a case of authorship verification where we have some text a...
Main Authors: | Andrejková Gabriela, Almarimi Abdulwahed |
---|---|
Format: | Article |
Language: | English |
Published: | De Gruyter, 2016-11-01 |
Series: | Open Computer Science |
Subjects: | |
Online Access: | https://doi.org/10.1515/comp-2016-0014 |
Similar Items
- Selection of Korean Proper Translation Words Using Bi-Gram-Based Histograms
  by: Hanmin Jung, et al.
  Published: (2007-03-01)
- Network-Based Bag-of-Words Model for Text Classification
  by: Dongyang Yan, et al.
  Published: (2020-01-01)
- Authorship Attribution with Function Word N-Grams
  by: Johnson, Russell Clark
  Published: (2013)
- Techniques for text classification: Literature review and current trends
  by: Rajni Jindal, et al.
  Published: (2015-12-01)
- Learning Chinese Word Embeddings With Words and Subcharacter N-Grams
  by: Ruizhi Kang, et al.
  Published: (2019-01-01)