Comparison of generality based algorithm variants for automatic taxonomy generation

Comparison of generality based algorithm variants for automatic taxonomy generation

Supplementary Material can be found on http:// ssm-vm011.mit.edu/henschel/IIT09/.

Bibliographic Details
Main Authors:	Madnick, Stuart E. (Contributor), Henschel, Andreas (Author), Wachter, Thomas (Author), Woon, Wei Lee (Author)
Other Authors:	Sloan School of Management (Contributor)
Format:	Article
Language:	English
Published:	Institute of Electrical and Electronics Engineers, 2010-10-15T15:19:21Z.
Subjects:	Article
Online Access:	Get fulltext


LEADER	01680 am a22002293u 4500
001	59371
042			\|a dc
100	1	0	\|a Madnick, Stuart E. \|e author
100	1	0	\|a Sloan School of Management \|e contributor
100	1	0	\|a Madnick, Stuart E. \|e contributor
100	1	0	\|a Madnick, Stuart E. \|e contributor
700	1	0	\|a Henschel, Andreas \|e author
700	1	0	\|a Wachter, Thomas \|e author
700	1	0	\|a Woon, Wei Lee \|e author
245	0	0	\|a Comparison of generality based algorithm variants for automatic taxonomy generation
260			\|b Institute of Electrical and Electronics Engineers, \|c 2010-10-15T15:19:21Z.
856			\|z Get fulltext \|u http://hdl.handle.net/1721.1/59371
520			\|a Supplementary Material can be found on http:// ssm-vm011.mit.edu/henschel/IIT09/.
520			\|a We compare a family of algorithms for the automatic generation of taxonomies by adapting the Heymann-algorithm in various ways. The core algorithm determines the generality of terms and iteratively inserts them in a growing taxonomy. Variants of the algorithm are created by altering the way and the frequency, generality of terms is calculated. We analyse the performance and the complexity of the variants combined with a systematic threshold evaluation on a set of seven manually created benchmark sets. As a result, betweenness centrality calculated on unweighted similarity graphs often performs best but requires threshold fine-tuning and is computationally more expensive than closeness centrality. Finally, we show how an entropy-based filter can lead to more precise taxonomies.
546			\|a en_US
655	7		\|a Article
773			\|t International Conference on Innovations in Information Technology, 2009. IIT '09.

Cannot write session to /tmp/vufind_sessions/sess_miljs5mbu2qvqo4bp8ht48uj81