
287 acl-2011-Structural Topic Model for Latent Topical Structure Analysis


Source: pdf

Author: Hongning Wang ; Duo Zhang ; ChengXiang Zhai

Abstract: Topic models have been successfully applied to many document analysis tasks to discover topics embedded in text. However, existing topic models generally cannot capture the latent topical structures in documents. Since languages are intrinsically cohesive and coherent, modeling and discovering latent topical transition structures within documents would be beneficial for many text analysis tasks. In this work, we propose a new topic model, Structural Topic Model, which simultaneously discovers topics and reveals the latent topical structures in text by explicitly modeling topical transitions with a latent first-order Markov chain. Experimental results show that the proposed Structural Topic Model can effectively discover topical structures in text, and the identified structures significantly improve the performance of tasks such as sentence annotation and sentence ordering.
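The abstract's central idea, topical transitions governed by a latent first-order Markov chain, can be illustrated with a minimal sketch. This is not the paper's model: the topic names, the start and transition probabilities, and the helper functions `sequence_log_prob` and `sample_topics` are all hypothetical, and the sketch covers only the transition chain over sentence-level topics, not word emission or inference.

```python
import math
import random

# Hypothetical toy parameters for illustration; none come from the paper.
TOPICS = ["intro", "method", "results"]
START = {"intro": 0.8, "method": 0.1, "results": 0.1}
TRANS = {
    "intro": {"intro": 0.3, "method": 0.6, "results": 0.1},
    "method": {"intro": 0.05, "method": 0.45, "results": 0.5},
    "results": {"intro": 0.05, "method": 0.15, "results": 0.8},
}


def sequence_log_prob(topics):
    """Log-probability of a sentence-level topic sequence under the chain."""
    if not topics:
        return 0.0
    logp = math.log(START[topics[0]])
    # First-order assumption: each topic depends only on its predecessor.
    for prev, cur in zip(topics, topics[1:]):
        logp += math.log(TRANS[prev][cur])
    return logp


def sample_topics(n_sentences, rng):
    """Draw one topic per sentence, conditioning on the previous topic."""
    seq = []
    dist = START
    for _ in range(n_sentences):
        topic = rng.choices(TOPICS, weights=[dist[t] for t in TOPICS])[0]
        seq.append(topic)
        dist = TRANS[topic]
    return seq


if __name__ == "__main__":
    coherent = ["intro", "method", "results"]
    shuffled = ["results", "intro", "method"]
    print(sequence_log_prob(coherent), sequence_log_prob(shuffled))
    print(sample_topics(5, random.Random(0)))
```

Under these made-up parameters, the ordering intro, method, results scores higher than a shuffled ordering of the same topics, which hints at why a learned transition structure could help with a task such as sentence ordering.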


reference text

R. Barzilay and M. Lapata. 2005. Collective content selection for concept-to-text generation. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 331–338.
R. Barzilay and L. Lee. 2004. Catching the drift: Probabilistic content models, with applications to generation and summarization. In Proceedings of HLT-NAACL, pages 113–120.
D.M. Blei and M.I. Jordan. 2003. Modeling annotated data. In Proceedings of the 26th annual international ACM SIGIR conference, pages 127–134.
D.M. Blei and J.D. Lafferty. 2007. A correlated topic model of science. The Annals of Applied Statistics, 1(1):17–35.
D.M. Blei and P.J. Moreno. 2001. Topic segmentation with an aspect hidden Markov model. In Proceedings of the 24th annual international ACM SIGIR conference, page 348. ACM.
D.M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent Dirichlet allocation. The Journal of Machine Learning Research, 3(2-3):993–1022.
H. Chen, SRK Branavan, R. Barzilay, and D.R. Karger. 2009. Global models of document structure using latent permutations. In Proceedings of HLT-NAACL, pages 371–379.
P. Diaconis and D. Ylvisaker. 1979. Conjugate priors for exponential families. The Annals of Statistics, 7(2):269–281.
M. Galley, K. McKeown, E. Fosler-Lussier, and H. Jing. 2003. Discourse segmentation of multi-party conversation. In Proceedings of the 41st Annual Meeting on Association for Computational Linguistics-Volume 1, pages 562–569.
J. Goldstein, V. Mittal, J. Carbonell, and M. Kantrowitz. 2000. Multi-document summarization by sentence extraction. In NAACL-ANLP 2000 Workshop on Automatic Summarization, pages 40–48.
T. Grenager, D. Klein, and C.D. Manning. 2005. Unsupervised learning of field segmentation models for information extraction. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pages 371–378.
T.L. Griffiths, M. Steyvers, D.M. Blei, and J.B. Tenenbaum. 2005. Integrating topics and syntax. Advances in Neural Information Processing Systems, 17:537–544.
Amit Gruber, Yair Weiss, and Michal Rosen-Zvi. 2007. Hidden topic Markov models. volume 2, pages 163–170.
T. Hofmann. 1999. Probabilistic latent semantic indexing. In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval, pages 50–57.
E.H. Hovy. 1993. Automated discourse generation using discourse structure relations. Artificial Intelligence, 63(1-2):341–385.
M. Johnson. 2007. Why doesn't EM find good HMM POS-taggers? In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 296–305.
M.I. Jordan, Z. Ghahramani, T.S. Jaakkola, and L.K. Saul. 1999. An introduction to variational methods for graphical models. Machine Learning, 37(2):183–233.
H. Kamp. 1981. A theory of truth and semantic representation. Formal methods in the study of language, 1:277–322.
M. Lapata. 2006. Automatic evaluation of information ordering: Kendall's tau. Computational Linguistics, 32(4):471–484.
L. Lovász and M.D. Plummer. 1986. Matching theory. Elsevier Science Ltd.
Y. Lu and C. Zhai. 2008. Opinion integration through semi-supervised topic modeling. In Proceeding of the 17th international conference on World Wide Web, pages 121–130.
Daniel Marcu. 1998. The rhetorical parsing of natural language texts. In ACL '98, pages 96–103.
Q. Mei, X. Ling, M. Wondra, H. Su, and C.X. Zhai. 2007. Topic sentiment mixture: modeling facets and opinions in weblogs. In Proceedings of the 16th international conference on World Wide Web, pages 171–180.
L.R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257–286.
R. Soricut and D. Marcu. 2003. Sentence level discourse parsing using syntactic and lexical information. In Proceedings of the 2003 Conference of the NAACL-HLT, pages 149–156.
B. Sun, P. Mitra, C.L. Giles, J. Yen, and H. Zha. 2007. Topic segmentation with shared topic detection and alignment of multiple documents. In Proceedings of the 30th ACM SIGIR, pages 199–206.
ChengXiang Zhai, Atulya Velivelli, and Bei Yu. 2004. A cross-collection mixture model for comparative text mining. In Proceeding of the 10th ACM SIGKDD international conference on Knowledge discovery in data mining, pages 743–748.
L. Zhuang, F. Jing, and X.Y. Zhu. 2006. Movie review mining and summarization. In Proceedings of the 15th ACM international conference on Information and knowledge management, pages 43–50.