Epistemological Considerations of Text Mining: Implications for Systematic Literature Review

In the era of big data, the capacity to produce textual documents is increasing day by day. Our ability to generate large amounts of information has impacted our lives at both the individual and societal levels. Science has not escaped this evolution either, and it is often difficult to quickly and...

Full description

Bibliographic Details
Main Authors: Daniel Caballero-Julia, Philippe Campillo
Format: Article
Language:English
Published: MDPI AG 2021-08-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/9/16/1865
id doaj-ce88198250fe496abc23454af36aa5f9
record_format Article
spelling doaj-ce88198250fe496abc23454af36aa5f92021-08-26T14:01:59ZengMDPI AGMathematics2227-73902021-08-0191865186510.3390/math9161865Epistemological Considerations of Text Mining: Implications for Systematic Literature ReviewDaniel Caballero-Julia0Philippe Campillo1ULR 7369—URePSSS—Unité de Recherche Pluridisciplinaire Sport Santé Société, Faculté des Sciences du Sport et de l’Éducation Physique, Univ. Lille, Univ. Littoral Côte d’Opale, Univ. Artois, F-59000 Lille, FranceULR 7369—URePSSS—Unité de Recherche Pluridisciplinaire Sport Santé Société, Faculté des Sciences du Sport et de l’Éducation Physique, Univ. Lille, Univ. Littoral Côte d’Opale, Univ. Artois, F-59000 Lille, FranceIn the era of big data, the capacity to produce textual documents is increasing day by day. Our ability to generate large amounts of information has impacted our lives at both the individual and societal levels. Science has not escaped this evolution either, and it is often difficult to quickly and reliably “stand on the shoulders of giants”. Text mining is presented as a promising mathematical solution. However, it has not yet convinced qualitative analysts who are usually wary of mathematical calculation. For this reason, this article proposes to rethink the epistemological principles of text mining, by returning to the qualitative analysis of its meaning and structure. It presents alternatives, applicable to the process of constructing lexical matrices for the analysis of a complex textual corpus. At the same time, the need for new multivariate algorithms capable of integrating these principles is discussed. We take a practical example in the use of text mining, by means of Multivariate Analysis of Variance Biplot (MANOVA-Biplot) when carrying out a systematic review of the literature. The article will show the advantages and disadvantages of exploring and analyzing a large set of publications quickly and methodically.https://www.mdpi.com/2227-7390/9/16/1865text miningbig datasystematic literature reviewscopusweb of science
collection DOAJ
language English
format Article
sources DOAJ
author Daniel Caballero-Julia
Philippe Campillo
spellingShingle Daniel Caballero-Julia
Philippe Campillo
Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
Mathematics
text mining
big data
systematic literature review
scopus
web of science
author_facet Daniel Caballero-Julia
Philippe Campillo
author_sort Daniel Caballero-Julia
title Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
title_short Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
title_full Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
title_fullStr Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
title_full_unstemmed Epistemological Considerations of Text Mining: Implications for Systematic Literature Review
title_sort epistemological considerations of text mining: implications for systematic literature review
publisher MDPI AG
series Mathematics
issn 2227-7390
publishDate 2021-08-01
description In the era of big data, the capacity to produce textual documents is increasing day by day. Our ability to generate large amounts of information has impacted our lives at both the individual and societal levels. Science has not escaped this evolution either, and it is often difficult to quickly and reliably “stand on the shoulders of giants”. Text mining is presented as a promising mathematical solution. However, it has not yet convinced qualitative analysts who are usually wary of mathematical calculation. For this reason, this article proposes to rethink the epistemological principles of text mining, by returning to the qualitative analysis of its meaning and structure. It presents alternatives, applicable to the process of constructing lexical matrices for the analysis of a complex textual corpus. At the same time, the need for new multivariate algorithms capable of integrating these principles is discussed. We take a practical example in the use of text mining, by means of Multivariate Analysis of Variance Biplot (MANOVA-Biplot) when carrying out a systematic review of the literature. The article will show the advantages and disadvantages of exploring and analyzing a large set of publications quickly and methodically.
topic text mining
big data
systematic literature review
scopus
web of science
url https://www.mdpi.com/2227-7390/9/16/1865
work_keys_str_mv AT danielcaballerojulia epistemologicalconsiderationsoftextminingimplicationsforsystematicliteraturereview
AT philippecampillo epistemologicalconsiderationsoftextminingimplicationsforsystematicliteraturereview
_version_ 1721191868305244160