A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval
Although timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired in...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Atlantis Press
2012-08-01
|
Series: | International Journal of Computational Intelligence Systems |
Subjects: | |
Online Access: | https://www.atlantis-press.com/article/25868005.pdf |
id |
doaj-029d7f223ec7469a98007201b11029d8 |
---|---|
record_format |
Article |
spelling |
doaj-029d7f223ec7469a98007201b11029d82020-11-25T02:20:21ZengAtlantis PressInternational Journal of Computational Intelligence Systems 1875-68832012-08-015410.1080/18756891.2012.718156A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices RetrievalSteve Kansheng ShiLemin LiAlthough timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired information. However, ironically, the time complexity of existing TD algorithms themselves is usually up to the -th power of . Linear performance requirement of real world topic detection has not been significantly addressed. This paper reveals a new patented topic detection algorithm called that combines elevance odel with nformation etrieval technique to improve on time efficiency. Relevance Model(RM) is a theoretical extension of statistical language modeling that was developed for the task of document retrieval. To reduce the costs of fetching RM, we reduce the number of comparisons for stories by a query-based approach that makes similar stories exist in the top-k query results. We also build our query based on inverted indices, which have the complexity close to linear. The time cost of rest of operations in the topic detection process is a constant. Hence, the total complexity of topic detection algorithm should be close to linear as shown in experimental results. In addition, also gains better detection rates and robustness by relative entropy based topic model design https://www.atlantis-press.com/article/25868005.pdfTopic DetectionLink Topic DetectionRetrospective Event DetectionInformation RetrievalRelevance ModelsInverted Indices |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Steve Kansheng Shi Lemin Li |
spellingShingle |
Steve Kansheng Shi Lemin Li A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval International Journal of Computational Intelligence Systems Topic Detection Link Topic Detection Retrospective Event Detection Information Retrieval Relevance Models Inverted Indices |
author_facet |
Steve Kansheng Shi Lemin Li |
author_sort |
Steve Kansheng Shi |
title |
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval |
title_short |
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval |
title_full |
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval |
title_fullStr |
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval |
title_full_unstemmed |
A Close-to-linear Topic Detection Algorithm using Relative Entropy based Relevance Model and Inverted Indices Retrieval |
title_sort |
close-to-linear topic detection algorithm using relative entropy based relevance model and inverted indices retrieval |
publisher |
Atlantis Press |
series |
International Journal of Computational Intelligence Systems |
issn |
1875-6883 |
publishDate |
2012-08-01 |
description |
Although timely access to information is becoming increasingly important and gaining such access is no longer a problem, the capacity for humans to assimilate such huge amounts of information is limited. Topic Detection(TD) is then a promising research area that addresses speedy access of desired information. However, ironically, the time complexity of existing TD algorithms themselves is usually up to the -th power of . Linear performance requirement of real world topic detection has not been significantly addressed. This paper reveals a new patented topic detection algorithm called that combines elevance odel with nformation etrieval technique to improve on time efficiency. Relevance Model(RM) is a theoretical extension of statistical language modeling that was developed for the task of document retrieval. To reduce the costs of fetching RM, we reduce the number of comparisons for stories by a query-based approach that makes similar stories exist in the top-k query results. We also build our query based on inverted indices, which have the complexity close to linear. The time cost of rest of operations in the topic detection process is a constant. Hence, the total complexity of topic detection algorithm should be close to linear as shown in experimental results. In addition, also gains better detection rates and robustness by relative entropy based topic model design |
topic |
Topic Detection Link Topic Detection Retrospective Event Detection Information Retrieval Relevance Models Inverted Indices |
url |
https://www.atlantis-press.com/article/25868005.pdf |
work_keys_str_mv |
AT stevekanshengshi aclosetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval AT leminli aclosetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval AT stevekanshengshi closetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval AT leminli closetolineartopicdetectionalgorithmusingrelativeentropybasedrelevancemodelandinvertedindicesretrieval |
_version_ |
1724871969851047936 |