Deriving Genetic Networks Using Text Mining

On the Internet an enormous amount of information is available that is represented in an unstructured form. The purpose with a text mining tool is to collect this information and present it in a more structured form. In this report text mining is used to create an algorithm that searches abstracts a...

Full description

Bibliographic Details
Main Author: Olsson, Elin
Format: Others
Language:English
Published: Högskolan i Skövde, Institutionen för datavetenskap 2002
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-708
Description
Summary:On the Internet an enormous amount of information is available that is represented in an unstructured form. The purpose with a text mining tool is to collect this information and present it in a more structured form. In this report text mining is used to create an algorithm that searches abstracts available from PubMed and finds specific relationships between genes that can be used to create a network. The algorithm can also be used to find information about a specific gene. The network created by Mendoza et al. (1999) was verified in all the connections but one using the algorithm. This connection contained implicit information. The results suggest that the algorithm is better at extracting information about specific genes than finding connections between genes. One advantage with the algorithm is that it can also find connections between genes and proteins and genes and other chemical substances.