nips nips2004 nips2004-87 nips2004-87-reference knowledge-graph by maker-knowledge-mining

87 nips-2004-Integrating Topics and Syntax


Source: pdf

Author: Thomas L. Griffiths, Mark Steyvers, David M. Blei, Joshua B. Tenenbaum

Abstract: Statistical approaches to language learning typically focus on either short-range syntactic dependencies or long-range semantic dependencies between words. We present a generative model that uses both kinds of dependencies, and can be used to simultaneously find syntactic classes and semantic topics despite having no representation of syntax or semantics beyond statistical dependency. This model is competitive on tasks like part-of-speech tagging and document classification with models that exclusively use short- and long-range dependencies respectively. 1


reference text

[1] H. J. Neville, D. L. Mills, and D. S. Lawson. Fractionating language: Different neural subsytems with different sensitive periods. Cerebral Cortex, 2:244–258, 1992.

[2] R. Brown. A first language. Harvard University Press, Cambridge, MA, 1973.

[3] M. Redington, N. Chater, and S. Finch. Distributional information: A powerful cue for acquiring syntactic categories. Cognitive Science, 22:425–469, 1998.

[4] T. K. Landauer and S. T. Dumais. A solution to Plato’s problem: the Latent Semantic Analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104:211–240, 1997.

[5] C. Manning and H. Sch¨ tze. Foundations of statistical natural language processing. MIT Press, u Cambridge, MA, 1999.

[6] D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent Dirichlet Allocation. Journal of Machine Learning Research, 3:993–1022, 2003.

[7] N. Coccaro and D. Jurafsky. Towards better integration of semantic predictors in statistical language modeling. In Proceedings of ICSLP-98, volume 6, pages 2403–2406, 1998.

[8] T. L. Griffiths and M. Steyvers. Finding scientific topics. Proceedings of the National Academy of Science, 101:5228–5235, 2004.

[9] W.R. Gilks, S. Richardson, and D. J. Spiegelhalter, editors. Markov Chain Monte Carlo in Practice. Chapman and Hall, Suffolk, 1996.

[10] H. Kucera and W. N. Francis. Computational analysis of present-day American English. Brown University Press, Providence, RI, 1967.

[11] R. E. Kass and A. E. Rafferty. Bayes factors. Journal of the American Statistical Association, 90:773–795, 1995.

[12] L. Hubert and P. Arabie. Comparing partitions. Journal of Classification, 2:193–218, 1985.