Automatic Thesaurus Generation for an Electronic Community System

Artificial Intelligence Lab, Department of MIS, University of Arizona
This research reports an algorithmic approach to the automatic generation of thesauri for electronic community systems. The techniques used included term filtering, automatic indexing, and cluster analysis. The testbed for our...

Bibliographic Details
Main Authors: Chen, Hsinchun; Schatz, Bruce R.; Yim, Tak; Fye, David
Language: en
Published: Wiley Periodicals, Inc., 1995
Subjects: Artificial Intelligence; Indexing
Online Access: http://hdl.handle.net/10150/105321
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-105321
record_format oai_dc
citation Journal of the American Society for Information Science, 46(3):175-193, 1995-04
type Journal Article (Paginated)
collection NDLTD
topic Artificial Intelligence
Indexing
description Artificial Intelligence Lab, Department of MIS, University of Arizona
This research reports an algorithmic approach to the automatic generation of thesauri for electronic community systems. The techniques used included term filtering, automatic indexing, and cluster analysis. The testbed for our research was the Worm Community System, which contains a comprehensive library of specialized community data and literature, currently in use by molecular biologists who study the nematode worm C. elegans. The resulting worm thesaurus included 2709 researchers’ names, 798 gene names, 20 experimental methods, and 4302 subject descriptors. On average, each term had about 90 weighted neighboring terms indicating relevant concepts. The thesaurus was developed as an online search aid. We tested the worm thesaurus in an experiment with six worm researchers of varying degrees of expertise and background. The experiment showed that the thesaurus was an excellent “memory-jogging” device and that it supported learning and serendipitous browsing. Despite some occurrences of obvious noise, the system was useful in suggesting relevant concepts for the researchers’ queries, and it helped improve concept recall. With a simple browsing interface, an automatic thesaurus can become a useful tool for online search and can assist researchers in exploring and traversing a dynamic and complex electronic community system.