nips nips2001 nips2001-90 nips2001-90-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Jorg Ontrup, Helge Ritter
Abstract: We introduce a new type of Self-Organizing Map (SOM) to navigate in the Semantic Space of large text collections. We propose a “hyperbolic SOM” (HSOM) based on a regular tesselation of the hyperbolic plane, which is a non-euclidean space characterized by constant negative gaussian curvature. The exponentially increasing size of a neighborhood around a point in hyperbolic space provides more freedom to map the complex information space arising from language into spatial relations. We describe experiments, showing that the HSOM can successfully be applied to text categorization tasks and yields results comparable to other state-of-the-art methods.
[1] T. Kohonen. Self-Organizing Maps. Springer Series in Information Sciences. 3rd edition, 2001.
[2] Teuvo Kohonen, Samuel Kaski, Krista Lagus, Jarkko Saloj¨ rvi, Vesa Paatero, and Antti Saarela. a Organization of a massive document collection. IEEE Transactions on Neural Networks, Special Issue on Neural Networks for Data Mining and Knowledge Discovery, 11(3):574–585, May 2000.
[3] John Lamping and Ramana Rao. Laying out and visualizing large trees using a hyperbolic space. In Proceedings of UIST’94, pages 13–14, 1994.
[4] T. Munzer. Exploring large graphs in 3D hyperbolic space. IEEE Computer Graphics and Applications, 18(4):18–23, July/August 1998.
[5] H. S. M. Coxeter. Non Euclidean Geometry. Univ. of Toronto Press, Toronto, 1957.
[6] J.A. Thorpe. Elementary Topics in Differential Geometry. Springer-Verlag, New York, 1979.
[7] Frank Morgan. Riemannian Geometry: A Beginner’s Guide. Jones and Bartlett Publishers, Boston, London, 1993.
[8] Charles W. Misner, J. A. Wheeler, and Kip S. Thorne. Gravitation. Freeman, 1973.
[9] R. Fricke and F. Klein. Vorlesungen uber die Theorie der automorphen Funktionen, volume 1. ¨ Teubner, Leipzig, 1897. Reprinted by Johnson Reprint, New York, 1965.
[10] W. Magnus. Noneuclidean Tesselations and Their Groups. Academic Press, 1974.
[11] Helge Ritter. Self-organizing maps in non-euclidian spaces. In E. Oja and S. Kaski, editors, Kohonen Maps, pages 97–108. Amer Elsevier, 1999.
[12] J. Ontrup and H. Ritter. Text categorization and semantic browsing with self-organizing maps on non-euclidean spaces. In Proc. of the PKDD-01, 2001.
[13] G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513–523, 1988.
[14] T. Joachims. Text categorization with support vector machines: learning with many relevant features. In Proc. of ECML-98, number 1398, pages 137–142, Chemnitz, DE, 1998.
[15] Huma Lodhi, John Shawe-Taylor, Nello Cristianini, and Chris Watkins. Text classification using string kernels. In Todd K. Leen, Thomas G. Dietterich, and Volker Tresp, editors, Advances in Neural Information Processing Systems 13, pages 563–569. MIT Press, 2001.
[16] F. Sebastiani, A. Sperduti, and N. Valdambrini. An improved boosting algorithm and its application to automated text categorization. In Proc. of CIKM-00, pages 78–85, 2000.
[17] Y. Yang. An evaluation of statistical approaches to text categorization. Information Retrieval, 1-2(1):69–90, 1999.