Comparison of generality based algorithm variants for automatic taxonomy generation

Supplementary Material can be found on http:// ssm-vm011.mit.edu/henschel/IIT09/.

Bibliographic Details
Main Authors: Madnick, Stuart E. (Contributor), Henschel, Andreas (Author), Wachter, Thomas (Author), Woon, Wei Lee (Author)
Other Authors: Sloan School of Management (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers, 2010-10-15T15:19:21Z.
Subjects:
Online Access:Get fulltext
LEADER 01680 am a22002293u 4500
001 59371
042 |a dc 
100 1 0 |a Madnick, Stuart E.  |e author 
100 1 0 |a Sloan School of Management  |e contributor 
100 1 0 |a Madnick, Stuart E.  |e contributor 
100 1 0 |a Madnick, Stuart E.  |e contributor 
700 1 0 |a Henschel, Andreas  |e author 
700 1 0 |a Wachter, Thomas  |e author 
700 1 0 |a Woon, Wei Lee  |e author 
245 0 0 |a Comparison of generality based algorithm variants for automatic taxonomy generation 
260 |b Institute of Electrical and Electronics Engineers,   |c 2010-10-15T15:19:21Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/59371 
520 |a Supplementary Material can be found on http:// ssm-vm011.mit.edu/henschel/IIT09/. 
520 |a We compare a family of algorithms for the automatic generation of taxonomies by adapting the Heymann-algorithm in various ways. The core algorithm determines the generality of terms and iteratively inserts them in a growing taxonomy. Variants of the algorithm are created by altering the way and the frequency, generality of terms is calculated. We analyse the performance and the complexity of the variants combined with a systematic threshold evaluation on a set of seven manually created benchmark sets. As a result, betweenness centrality calculated on unweighted similarity graphs often performs best but requires threshold fine-tuning and is computationally more expensive than closeness centrality. Finally, we show how an entropy-based filter can lead to more precise taxonomies. 
546 |a en_US 
655 7 |a Article 
773 |t International Conference on Innovations in Information Technology, 2009. IIT '09.