acl acl2012 acl2012-217 acl2012-217-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Zhi Zhong ; Hwee Tou Ng
Abstract: Previous research has conflicting conclusions on whether word sense disambiguation (WSD) systems can improve information retrieval (IR) performance. In this paper, we propose a method to estimate sense distributions for short queries. Together with the senses predicted for words in documents, we propose a novel approach to incorporate word senses into the language modeling approach to IR and also exploit the integration of synonym relations. Our experimental results on standard TREC collections show that using the word senses tagged by a supervised WSD system, we obtain significant improvements over a state-of-the-art IR system.
E. Agirre, X. Arregi, and A. Otegi. 2010. Document expansion based on WordNet for robust IR. In Proceedings of the 23rd International Conference on Computational Linguistics, pages 9–17. 5The top two systems on RB04 are the results of the same participant with different configurations. They used lots of web resources, such as search engines, to improve the performance. J. Allan, M. E. Connell, W.B. Croft, F.F. Feng, D. Fisher, and X. Li. 2000. INQUERY and TREC-9. In Proceedings of the 9th Text REtrieval Conference, pages 551–562. M. Bendersky, W. B. Croft, and D. A. Smith. 2010. Structural annotation of search queries using pseudorelevance feedback. In Proceedings of the 19th ACM Conference on Information and Knowledge Management, pages 1537–1540. A. Broder, M. Fontoura, E. Gabrilovich, A. Joshi, V. Josifovski, and T. Zhang. 2007. Robust classification of rare queries using web knowledge. In Proceedings of the 30th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 23 1–238. G. Cao, J. Y. Nie, and J. Bai. 2005. Integrating word relationships into language models. In Proceedings of the 28th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 298–305. M. Carpuat and D. Wu. 2007. Improving statistical machine translation using word sense disambiguation. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 61–72. Y. S. Chan and H. T. Ng. 2005. Scaling up word sense disambiguation via parallel texts. In Proceedings of the 20th National Conference on Artificial Intelligence, pages 1037–1042. Y. S. Chan, H. T. Ng, and D. Chiang. 2007. Word sense disambiguation improves statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, pages 33–40. H. Fang. 2008. A re-examination of query expansion using lexical resources. In Proceedings of the 46th Annual Meeting of the Association of Computational Linguistics: Human Language Technologies, pages 139– 147. J. Gonzalo, F. Verdejo, I. Chugur, and J. Cigarrin. 1998. Indexing with WordNet synsets can improve text retrieval. In Proceedings ofthe COLING-ACL Workshop on Usage of WordNet in Natural Language Processing Systems, pages 38–44. J. Gonzalo, A. Penas, and F. Verdejo. 1999. Lexical ambiguity and information retrieval revisited. In Proceedings ofthe 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, pages 195–202. D. He and D. Wu. 2011. Enhancing query translation with relevance feedback in translingual information retrieval. Information Processing & Management, 47(1): 1–17. 281 S. B. Kim, H. C. Seo, and H. C. Rim. 2004. Information retrieval using word senses: root sense tagging approach. In Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 258–265. R. Krovetz and W. B. Croft. 1992. Lexical ambiguity and information retrieval. ACM Transactions on Information Systems, 10(2): 115–141 . K. L. Kwok and M. Chan. 1998. Improving two-stage ad-hoc retrieval for short queries. In Proceedings of the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 250–256. J. Lafferty and C. Zhai. 2001. Document language mod- els, query models, and risk minimization for information retrieval. In Proceedings of the 24th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 111–1 19. V. Lavrenko and W. B. Croft. 2001. Relevance based language models. In Proceedings of the 24th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 120–127. S. Liu, F. Liu, C. Yu, and W. Meng. 2004. An effective approach to document retrieval via utilizing WordNet and recognizing phrases. In Proceedings of the 27th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 266–272. S. Liu, C. Yu, and W. Meng. 2005. Word sense disambiguation in queries. In Proceedings of the 14th ACM Conference on Information and Knowledge Management, pages 525–532. G. A. Miller. 1990. WordNet: An on-line lexical database. International Journal of Lexicography, 3(4):235–312. F. J. Och and H. Ney. 2003. A systematic comparison of various statistical alignment models. Computational Linguistics, 29(1): 19–5 1. P. Ogilvie and J. Callan. 2001 . Experiments using the Lemur toolkit. In Proceedings of the 10th Text REtrieval Conference, pages 103–108. J. M. Ponte and W. B. Croft. 1998. A language modeling approach to information retrieval. In Proceedings of the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 275–281. J. M. Ponte. 1998. A Language Modeling Approach to Information Retreival. Ph.D. thesis, Department of Computer Science, University of Massachusetts. M. Sanderson. 1994. Word sense disambiguation and information retrieval. In Proceedings of the 17th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 142–151 . M. Sanderson. 2000. Retrieving with good sense. Information Retrieval, 2(1):49–69. H. Sch u¨tze and J. O. Pedersen. 1995. Information retrieval based on word senses. In Proceedings of the 4th Annual Symposium on Document Analysis and Information Retrieval, pages 161–175. C. Stokoe, M. P. Oakes, and J. Tait. 2003. Word sense disambiguation in information retrieval revisited. In Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 159–166. E. M. Voorhees. 1993. Using WordNet to disambiguate word senses for text retrieval. In Proceedings of the 16th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 171–180. E. M. Voorhees. 1994. Query expansion using lexicalsemantic relations. In Proceedings of the 1 Inter7th national ACM SIGIR Conference on Research and Development in Information Retrieval, pages 61–69. C. Zhai and J. Lafferty. 2001a. Model-based feedback in the language modeling approach to information retrieval. In Proceedings of the 10th ACM Conference on Information and Knowledge Management, pages 403–410. C. Zhai and J. Lafferty. 2001b. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of the 24th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 334–342. Z. Zhong and H. T. Ng. 2009. Word sense disambiguation for all words without hard labor. In Proceedings of the 21st International Joint Conference on Artificial Intelligence, pages 1616–1621 . Z. Zhong and H. T. Ng. 2010. It Makes Sense: A widecoverage word sense disambiguation system for free text. In Proceedings of the 48th Annual Meeting of the Association of Computational Linguistics: System Demonstrations, pages 78–83. 282