55 acl-2010-Bootstrapping Semantic Analyzers from Non-Contradictory Texts

Author: Ivan Titov ; Mikhail Kozhevnikov

Abstract: We argue that groups of unannotated texts with overlapping and non-contradictory semantics represent a valuable source of information for learning semantic representations. A simple and efficient inference method recursively induces joint semantic representations for each group and discovers correspondence between lexical entries and latent semantic concepts. We consider the generative semantics-text correspondence model (Liang et al., 2009) and demonstrate that exploiting the noncontradiction relation between texts leads to substantial improvements over natural baselines on a problem of analyzing human-written weather forecasts.

References

Regina Barzilay and Lillian Lee. 2002. Bootstrapping lexical choice via multiple-sequence alignment. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 164–171.
Regina Barzilay and Lillian Lee. 2003. Learning to paraphrase: An unsupervised approach using multiple-sequence alignment. In Proceedings of the Conference on Human Language Technology and North American chapter of the Association for Computational Linguistics (HLT-NAACL).
Sugato Basu, Arindam Banerjee, and Raymond Mooney. 2004. Active semi-supervision for pairwise constrained clustering. In Proc. of the SIAM International Conference on Data Mining (SDM), pages 333–344.
A. Blum and T. Mitchell. 1998. Combining labeled and unlabeled data with co-training. In COLT: Proceedings of the Workshop on Computational Learning Theory, Morgan Kaufmann Publishers, pages 209–214.
Xavier Carreras and Lluis Marquez. 2005. Introduction to the CoNLL-2005 shared task: Semantic role labeling. In Proceedings of CoNLL-2005, Ann Arbor, MI, USA.
David L. Chen and Raymond J. Mooney. 2008. Learning to sportscast: A test of grounded language acquisition. In Proc. of International Conference on Machine Learning, pages 128–135.
A. P. Dempster, N. M. Laird, and D. B. Rubin. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1–38.
P. Diaconis and B. Efron. 1983. Computer-intensive methods in statistics. Scientific American, pages 116–130.
Bill Dolan, Chris Quirk, and Chris Brockett. 2004. Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. In Proceedings of the Conference on Computational Linguistics (COLING), pages 350–356.
Ruifang Ge and Raymond J. Mooney. 2005. A statistical semantic parser that integrates syntax and semantics. In Proceedings of the Ninth Conference on Computational Natural Language Learning (CONLL-05), Ann Arbor, Michigan.
Joao Graca, Kuzman Ganchev, and Ben Taskar. 2008. Expectation maximization and posterior constraints. Advances in Neural Information Processing Systems 20 (NIPS).
Zellig Harris. 1968. Mathematical structures of language. Wiley.
Rohit J. Kate and Raymond J. Mooney. 2007. Learning language semantics from ambiguous supervision. In Association for the Advancement of Artificial Intelligence (AAAI), pages 895–900.
Percy Liang, Michael I. Jordan, and Dan Klein. 2009. Learning semantic correspondences with less supervision. In Proc. of the Annual Meeting of the Association for Computational Linguistics and International Joint Conference on Natural Language Processing (ACL-IJCNLP).
Andrew McCallum, Gideon Mann, and Gregory Druck. 2007. Generalized expectation criteria. Technical Report TR 2007-60, University of Massachusetts, Amherst, MA.
Raymond J. Mooney. 2007. Learning for semantic parsing. In Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing, pages 982–991.
Kevin P. Murphy, Yair Weiss, and Michael I. Jordan. 1999. Loopy belief propagation for approximate inference: An empirical study. In Proc. of Uncertainty in Artificial Intelligence (UAI), pages 467–475.
Judea Pearl. 1982. Reverend Bayes on inference engines: A distributed hierarchical approach. In Proc. of the National Conference on Artificial Intelligence (AAAI), pages 133–136.
Hoifung Poon and Pedro Domingos. 2009. Unsupervised semantic parsing. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (EMNLP-09).
Dragomir Radev. 2000. A common theory of information fusion from multiple text sources step one: Cross-document structure. In 1st SIGdial Workshop on Discourse and Dialogue, pages 74–83.
Yusuke Shinyama and Satoshi Sekine. 2003. Paraphrase acquisition for information extraction. In Proceedings of Second International Workshop on Paraphrasing (IWP2003), pages 65–71.
Benjamin Snyder and Regina Barzilay. 2007. Database-text alignment via structured multilabel classification. In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI-07), pages 1713–1718.
J. Weeds and D. Weir. 2005. Co-occurrence retrieval: A flexible framework for lexical distributional similarity. Computational Linguistics, 31(4):439–475.
Luke Zettlemoyer and Michael Collins. 2005. Learning to map sentences to logical form: Structured classification with probabilistic categorial grammars. In Proceedings of the Twenty-first Conference on Uncertainty in Artificial Intelligence, Edinburgh, UK, August.