Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management

Artificial Intelligence Lab, Department of MIS, University of Arizona === There has been renewed research interest in using the statistical approach to extraction of key phrases from Chinese documents because existing approaches do not allow online frequency updates after phrases have been extract...

Full description

Bibliographic Details
Main Authors: Ong, Thian-Huat, Chen, Hsinchun
Language:en
Published: 1999
Subjects:
Online Access:http://hdl.handle.net/10150/105216
id ndltd-arizona.edu-oai-arizona.openrepository.com-10150-105216
record_format oai_dc
spelling ndltd-arizona.edu-oai-arizona.openrepository.com-10150-1052162015-10-23T04:22:57Z Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management Ong, Thian-Huat Chen, Hsinchun Knowledge Management Information Extraction Artificial Intelligence Lab, Department of MIS, University of Arizona There has been renewed research interest in using the statistical approach to extraction of key phrases from Chinese documents because existing approaches do not allow online frequency updates after phrases have been extracted. This consequently results in inaccurate, partial extraction. In this paper, we present an updateable PAT-tree approach. In our experiment, we compared our approach with that of Lee-Feng Chien with that showed an improvement in recall from 0.19 to 0.43 and in precision from 0.52 to 0.70. This paper also reviews the requirements for a data structure that facilitates implementation of any statistical approaches to key-phrase extraction, including PATtree, PAT-array and suffix array with semi-infinite strings. 1999 Conference Paper Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management 1999, :63-84 http://hdl.handle.net/10150/105216 en
collection NDLTD
language en
sources NDLTD
topic Knowledge Management
Information Extraction
spellingShingle Knowledge Management
Information Extraction
Ong, Thian-Huat
Chen, Hsinchun
Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
description Artificial Intelligence Lab, Department of MIS, University of Arizona === There has been renewed research interest in using the statistical approach to extraction of key phrases from Chinese documents because existing approaches do not allow online frequency updates after phrases have been extracted. This consequently results in inaccurate, partial extraction. In this paper, we present an updateable PAT-tree approach. In our experiment, we compared our approach with that of Lee-Feng Chien with that showed an improvement in recall from 0.19 to 0.43 and in precision from 0.52 to 0.70. This paper also reviews the requirements for a data structure that facilitates implementation of any statistical approaches to key-phrase extraction, including PATtree, PAT-array and suffix array with semi-infinite strings.
author Ong, Thian-Huat
Chen, Hsinchun
author_facet Ong, Thian-Huat
Chen, Hsinchun
author_sort Ong, Thian-Huat
title Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
title_short Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
title_full Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
title_fullStr Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
title_full_unstemmed Updateable PAT-Tree Approach to Chinese Key Phrase Extraction using Mutual Information: A Linguistic Foundation for Knowledge Management
title_sort updateable pat-tree approach to chinese key phrase extraction using mutual information: a linguistic foundation for knowledge management
publishDate 1999
url http://hdl.handle.net/10150/105216
work_keys_str_mv AT ongthianhuat updateablepattreeapproachtochinesekeyphraseextractionusingmutualinformationalinguisticfoundationforknowledgemanagement
AT chenhsinchun updateablepattreeapproachtochinesekeyphraseextractionusingmutualinformationalinguisticfoundationforknowledgemanagement
_version_ 1718096084217102336