nips nips2005 nips2005-89 nips2005-89-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Xuerui Wang, Natasha Mohanty, Andrew McCallum
Abstract: We present a probabilistic generative model of entity relationships and their attributes that simultaneously discovers groups among the entities and topics among the corresponding textual attributes. Block-models of relationship data have been studied in social network analysis for some time. Here we simultaneously cluster in several modalities at once, incorporating the attributes (here, words) associated with certain relationships. Significantly, joint inference allows the discovery of topics to be guided by the emerging groups, and vice-versa. We present experimental results on two large data sets: sixteen years of bills put before the U.S. Senate, comprising their corresponding text and voting records, and thirteen years of similar data from the United Nations. We show that in comparison with traditional, separate latent-variable models for words, or Blockstructures for votes, the Group-Topic model’s joint inference discovers more cohesive groups and improved topics. 1
[1] Doug Beeferman and Adam Berger. Agglomerative clustering of a search engine query log. In The 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 2000.
[2] Indrajit Bhattacharya and Lise Getoor. Deduplication and group detection using links. In The 10th SIGKDD Conference Workshop on Link Analysis and Group Detection (LinkKDD), 2004.
[3] Aleks Jakulin and Wray Buntine. Analyzing the US Senate in 2003: Similarities, networks, clusters and blocs, 2004. http://kt.ijs.si/aleks/Politics/us senate.pdf.
[4] Charles Kemp, Thomas L. Griffiths, and Joshua Tenenbaum. Discovering latent classes in relational data. Technical report, AI Memo 2004-019, MIT CSAIL, 2004.
[5] Jeremy Kubica, Andrew Moore, Jeff Schneider, and Yiming Yang. Stochastic link and group detection. In The 17th National Conference on Artificial Intelligence (AAAI), 2002.
[6] Andrew McCallum, Andres Corrada-Emanuel, and Xuerui Wang. Topic and role discovery in social networks. In The 19th International Joint Conference on Artificial Intelligence, 2005.
[7] Krzysztof Nowicki and Tom A.B. Snijders. Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association, 96(455):1077–1087, 2001. 2 http://en.wikipedia.org/wiki/List of countries by GDP %28PPP%29. In Table 6, we omit some countries (represented by ...) in order to show other interesting but relatively low-ranked countries (for example, Russia) in the GDP list.