Corpus Linguistics, Network Analysis and Co-occurrence Matrices

Bibliographic Details
Main Authors: Keith Stuart, Ana Botella
Format: Article
Language: English
Published: Universidad de Murcia 2009-12-01
Series: International Journal of English Studies (IJES)
Online Access: http://revistas.um.es/ijes/article/view/99481
Description
Summary: This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politécnica de Valencia, UPV) was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis) that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results obtained, we suggest that it may be viable to extract and to represent the intellectual capital of an academic institution using corpus linguistics methods in combination with the formulations of network theory.
Summary (translated from Spanish): This article describes the research carried out to design a methodology for the reticular representation of the knowledge generated within an institution, based on a representative corpus of the scientific production of the members of that discourse community, the Universidad Politécnica de Valencia. To that end, we present the actions carried out in the initial phases of the study, which were aimed at establishing the theoretical and practical framework for our analysis. The methodology section describes the computer tools used, as well as the processes that allowed us to obtain those elements of the corpus that would lead to the development of co-occurrence matrices, from which semantic networks of disciplinary knowledge were generated. Finally, on the basis of the results obtained, we confirm the viability of extracting and representing intellectual capital using the principles of corpus linguistics in combination with the formulations of network theory.
ISSN: 1578-7044
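
The step the summary describes, going from corpus terms to co-occurrence matrices and then to a network, can be illustrated with a minimal Python sketch. It is not the authors' procedure or tooling: the record does not state the unit of co-occurrence, so the sketch assumes two terms co-occur when they appear in the same document. The function names (`cooccurrence_counts`, `to_matrix`, `to_edges`), the `min_weight` threshold, and the toy data are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents, vocabulary):
    """Count, for each unordered term pair, the documents containing both terms.

    Assumption: co-occurrence is measured at document level.
    """
    vocab = set(vocabulary)
    counts = Counter()
    for tokens in documents:
        # Keep each vocabulary term once per document, in a stable order.
        present = sorted(vocab.intersection(tokens))
        for a, b in combinations(present, 2):
            counts[(a, b)] += 1
    return counts


def to_matrix(counts, vocabulary):
    """Arrange the pair counts as a symmetric term-by-term matrix."""
    terms = sorted(set(vocabulary))
    index = {term: i for i, term in enumerate(terms)}
    matrix = [[0] * len(terms) for _ in terms]
    for (a, b), n in counts.items():
        matrix[index[a]][index[b]] = n
        matrix[index[b]][index[a]] = n
    return terms, matrix


def to_edges(counts, min_weight=1):
    """Turn pair counts into weighted network edges (term, term, weight)."""
    return [(a, b, n) for (a, b), n in sorted(counts.items()) if n >= min_weight]


# Toy data, invented for illustration only.
documents = [
    ["corpus", "network", "matrix"],
    ["corpus", "matrix"],
    ["network", "graph"],
]
vocabulary = ["corpus", "network", "matrix", "graph"]

counts = cooccurrence_counts(documents, vocabulary)
terms, matrix = to_matrix(counts, vocabulary)
edges = to_edges(counts, min_weight=2)
```

The edge list is the form most graph tools accept as input, so a threshold such as `min_weight` is one simple way to keep only the stronger associations before drawing the network.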