
291 acl-2013-Question Answering Using Enhanced Lexical Semantic Models

Source: pdf

Author: Wen-tau Yih ; Ming-Wei Chang ; Christopher Meek ; Andrzej Pastusiak

Abstract: In this paper, we study the answer sentence selection problem for question answering. Unlike previous work, which primarily leverages syntactic analysis through dependency tree matching, we focus on improving performance using models of lexical semantic resources. Experiments show that our systems can be consistently and significantly improved with rich lexical semantic information, regardless of the choice of learning algorithm. When evaluated on a benchmark dataset, the MAP and MRR scores are increased by 8 to 10 points, compared to one of our baseline systems using only surface-form matching. Moreover, our best system also outperforms previous work that makes use of the dependency tree structure by a wide margin.
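
The abstract reports results as MAP (mean average precision) and MRR (mean reciprocal rank). The following is a minimal sketch of how these two metrics are commonly computed for answer sentence selection. It is not taken from the paper: the function names, the 0/1 label encoding, and the choice to score a question with no correct candidate as 0 are all assumptions made for illustration.

```python
from typing import List, Sequence


def average_precision(labels: Sequence[int]) -> float:
    """Average precision for one question.

    `labels` holds 1 (correct) or 0 (incorrect) for each candidate
    sentence, listed in the order the system ranked them.
    """
    hits = 0
    precision_sum = 0.0
    for rank, label in enumerate(labels, start=1):
        if label:
            hits += 1
            precision_sum += hits / rank  # precision at this correct hit
    # Assumption: a question with no correct candidate scores 0.
    return precision_sum / hits if hits else 0.0


def reciprocal_rank(labels: Sequence[int]) -> float:
    """1 / rank of the first correct candidate, or 0 if there is none."""
    for rank, label in enumerate(labels, start=1):
        if label:
            return 1.0 / rank
    return 0.0


def mean_average_precision(rankings: List[Sequence[int]]) -> float:
    """Mean of per-question average precision."""
    return sum(average_precision(r) for r in rankings) / len(rankings)


def mean_reciprocal_rank(rankings: List[Sequence[int]]) -> float:
    """Mean of per-question reciprocal rank."""
    return sum(reciprocal_rank(r) for r in rankings) / len(rankings)


# Two hypothetical questions with ranked candidate labels.
example_rankings = [[0, 1, 0, 1], [1, 0, 0]]
example_map = mean_average_precision(example_rankings)
example_mrr = mean_reciprocal_rank(example_rankings)
```

In this toy example the first question has correct sentences at ranks 2 and 4 (average precision 0.5, reciprocal rank 0.5) and the second has one at rank 1 (both 1.0), so MAP and MRR both come to 0.75. An increase of "8 to 10 points" corresponds to a change of 0.08 to 0.10 on this 0-to-1 scale.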

reference text

E. Agirre, E. Alfonseca, K. Hall, J. Kravalova, M. Paşca, and A. Soroa. 2009. A study on similarity and relatedness using distributional and WordNet-based approaches. In Proceedings of NAACL, pages 19–27.
M. Bilotti, P. Ogilvie, J. Callan, and E. Nyberg. 2007. Structured retrieval for question answering. In Proceedings of SIGIR, pages 351–358.
E. Blanco and D. Moldovan. 2011. Semantic representation of negation using focus detection. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HLT 2011).
K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor. 2008. Freebase: a collaboratively created graph database for structuring human knowledge. In ACM Conference on Management of Data (SIGMOD), pages 1247–1250.
A. Budanitsky and G. Hirst. 2006. Evaluating WordNet-based measures of lexical semantic relatedness. Computational Linguistics, 32:13–47, March.
M. Chang, D. Goldwasser, D. Roth, and V. Srikumar. 2010. Discriminative learning over constrained latent representations. In Proceedings of NAACL.
I. Dagan, O. Glickman, and B. Magnini, editors. 2006. The PASCAL Recognising Textual Entailment Challenge, volume 3944. Springer-Verlag, Berlin.
W. Dolan, C. Quirk, and C. Brockett. 2004. Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. In Proceedings of COLING.
A. Echihabi and D. Marcu. 2003. A noisy-channel approach to question answering. In Annual Meeting of the Association for Computational Linguistics (ACL), pages 16–23.
O. Etzioni. 2011. Search needs a shake-up. Nature, 476(7358):25–26.
P. Felzenszwalb, R. Girshick, D. McAllester, and D. Ramanan. 2009. Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 99(1).
D. Ferrucci. 2012. Introduction to “This is Watson”. IBM Journal of Research and Development, 56(3.4):1–1.
J. Friedman. 2001. Greedy function approximation: a gradient boosting machine. Annals of Statistics, 29(5):1189–1232.
E. Gabrilovich and S. Markovitch. 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. In AAAI Conference on Artificial Intelligence (AAAI).
J. Gao, K. Toutanova, and W. Yih. 2011. Clickthrough-based latent semantic models for web search. In Proceedings of SIGIR, pages 675–684.
S. Harabagiu and D. Moldovan. 2001. Open-domain textual question answering. Tutorial of NAACL-2001.
M. Hearst. 1992. Automatic acquisition of hyponyms from large text corpora. In Proceedings of COLING, pages 539–545.
M. Heilman and N. Smith. 2010. Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 1011–1019.
D. Jurgens, S. Mohammad, P. Turney, and K. Holyoak. 2012. SemEval-2012 Task 2: Measuring degrees of relational similarity. In Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012), pages 356–364.
T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur. 2010. Recurrent neural network based language model. In Annual Conference of the International Speech Communication Association (INTERSPEECH), pages 1045–1048.
D. Moldovan, M. Paşca, S. Harabagiu, and M. Surdeanu. 2003. Performance issues and error analysis in an open-domain question answering system. ACM Transactions on Information Systems (TOIS), 21(2):133–154.
D. Moldovan, C. Clark, S. Harabagiu, and D. Hodges. 2007. COGEX: A semantically and contextually enriched logic prover for question answering. Journal of Applied Logic, 5(1):49–69.
R. Morante and E. Blanco. 2012. *SEM 2012 shared task: Resolving the scope and focus of negation. In Proceedings of the First Joint Conference on Lexical and Computational Semantics, pages 265–274.
S. Ponzetto and M. Strube. 2007. Deriving a large scale taxonomy from Wikipedia. In AAAI Conference on Artificial Intelligence (AAAI).
V. Punyakanok, D. Roth, and W. Yih. 2004. Mapping dependencies trees: An application to question answering. In International Symposium on Artificial Intelligence and Mathematics (AI & Math).
K. Radinsky, E. Agichtein, E. Gabrilovich, and S. Markovitch. 2011. A word at a time: computing word relatedness using temporal semantic analysis. In WWW ’11, pages 337–346.
J. Reisinger and R. Mooney. 2010. Multi-prototype vector-space models of word meaning. In Proceedings of NAACL.
P. Resnik. 1995. Using information content to evaluate semantic similarity in a taxonomy. In International Joint Conference on Artificial Intelligence (IJCAI).
B. Rink and S. Harabagiu. 2012. UTD: Determining relational similarity using lexical patterns. In Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012), pages 413–418.
S. Robertson, S. Walker, S. Jones, M. Hancock-Beaulieu, and M. Gatford. 1995. Okapi at TREC-3. In Text REtrieval Conference (TREC), pages 109–109.
D. Roth and W. Yih. 2007. Global inference for entity and relation identification via a linear programming formulation. In Lise Getoor and Ben Taskar, editors, Introduction to Statistical Relational Learning. MIT Press.
D. Shen and M. Lapata. 2007. Using semantic roles to improve question answering. In Proceedings of EMNLP-CoNLL, pages 12–21.
D. Smith and J. Eisner. 2006. Quasi-synchronous grammars: Alignment by soft projection of syntactic dependencies. In Proceedings of the HLT-NAACL Workshop on Statistical Machine Translation, pages 23–30.
Y. Song, H. Wang, Z. Wang, H. Li, and W. Chen. 2011. Short text conceptualization using a probabilistic knowledgebase. In International Joint Conference on Artificial Intelligence (IJCAI), pages 2330–2336.
K. Tai. 1979. The tree-to-tree correction problem. J. ACM, 26(3):422–433, July.
P. Turney and P. Pantel. 2010. From frequency to meaning: Vector space models of semantics. Journal of Artificial Intelligence Research, 37(1):141–188.
E. Voorhees and D. Tice. 2000. Building a question answering test collection. In Proceedings of SIGIR, pages 200–207.
M. Wang and C. Manning. 2010. Probabilistic tree-edit models with structured latent variables for textual entailment and question answering. In Proceedings of COLING.
M. Wang, N. Smith, and T. Mitamura. 2007. What is the Jeopardy model? A quasi-synchronous grammar for QA. In Proceedings of EMNLP-CoNLL.
T. Winograd. 1977. Five lectures on artificial intelligence. In A. Zampolli, editor, Linguistic Structures Processing, pages 399–520. North Holland.
W. Woods. 1973. Progress in natural language understanding: An application to lunar geology. In Proceedings of the National Computer Conference and Exposition (AFIPS), pages 441–450.
W. Wu, H. Li, H. Wang, and K. Zhu. 2012. Probase: a probabilistic taxonomy for text understanding. In ACM Conference on Management of Data (SIGMOD), pages 481–492.
W. Yih and V. Qazvinian. 2012. Measuring word relatedness using heterogeneous vector space models. In Proceedings of NAACL-HLT 2012, pages 616–620.
W. Yih, K. Toutanova, J. Platt, and C. Meek. 2011. Learning discriminative projections for text similarity measures. In ACL Conference on Natural Language Learning (CoNLL), pages 247–256.
W. Yih, G. Zweig, and J. Platt. 2012. Polarity inducing latent semantic analysis. In Proceedings of EMNLP-CoNLL, pages 1212–1222.
A. Zhila, W. Yih, C. Meek, G. Zweig, and T. Mikolov. 2013. Combining heterogeneous models for measuring relational similarity. In Proceedings of HLT-NAACL.