acl acl2010 acl2010-124 acl2010-124-reference knowledge-graph by maker-knowledge-mining

124 acl-2010-Generating Image Descriptions Using Dependency Relational Patterns


Source: pdf

Author: Ahmet Aker ; Robert Gaizauskas

Abstract: This paper presents a novel approach to automatic captioning of geo-tagged images by summarizing multiple webdocuments that contain information related to an image’s location. The summarizer is biased by dependency pattern models towards sentences which contain features typically provided for different scene types such as those of churches, bridges, etc. Our results show that summaries biased by dependency pattern models lead to significantly higher ROUGE scores than both n-gram language models reported in previous work and also Wikipedia baseline summaries. Summaries generated using dependency patterns also lead to more readable summaries than those generated without dependency patterns.


reference text

A. Aker and R. Gaizauskas. 2009. Summary Generation for Toponym-Referenced Images using Object 1257 Type Language Models. International Conference on Recent Advances in Natural Language Processing (RANLP),2009. A. Aker and R. Gaizauskas. 2010. Model Summaries for Location-related Images. In Proc. of the LREC2010 Conference. K. Barnard and D. Forsyth. 2001. Learning the semantics of words and pictures. In International Conference on Computer Vision, volume 2, pages 408–415. Vancouver: IEEE. K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D.M. Blei, and M.I. Jordan. 2003. Matching words and pictures. The Journal of Machine Learning Research, 3: 1107–1 135. T.L. Berg, A.C. Berg, J. Edwards, and DA Forsyth. 2005. Whos in the Picture? In Advances in Neural Information Processing Systems 17: Proc. Of The 2004 Conference. MIT Press. R.C. Bunescu and R.J. Mooney. 2005. A shortest path dependency kernel for relation extraction. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pages 724–73 1. Association for Computational Linguistics Morristown, NJ, USA. A. Culotta and J. Sorensen. 2004. Dependency Tree Kernels for Relation Extraction. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL’04), Main Volume, pages 423–429, Barcelona, Spain, July. H.T. Dang. 2005. Overview of DUC 2005. DUC 05 Workshop at HLT/EMNLP. H.T. Dang. 2006. Overview of DUC 2006. National Institute of Standards and Technology. K. Deschacht and M.F. Moens. 2007. Text Analysis for Automatic Image Annotation. Proc. of the 45th Annual Meeting of the Association for Computational Linguistics. East Stroudsburg: ACL. P. Duygulu, K. Barnard, JFG de Freitas, and D.A. Forsyth. 2002. Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary. In Seventh European Conference on Computer Vision (ECCV), 4:97–1 12. X. Fan, A. Aker, M. Tomko, P. Smart, M Sanderson, and R. Gaizauskas. 2010. Automatic Image Captioning From the Web For GPS Photographs. In Proc. of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, National Constitution Center, Philadelphia, Pennsylvania. Y. Feng and M. Lapata. 2008. Automatic Image Annotation Using Auxiliary Text Information. Proc. of Association for Computational Linguistics (ACL) 2008, Columbus, Ohio, USA. C.Y. Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. Proc. of the Workshop on Text Summarization Branches Out (WAS 2004), pages 25–26. E.E. Marsh and M.D. White. 2003. A taxonomy of relationships between images and text. Journal of Documentation, 59:647–672. Y. Mori, H. Takahashi, and R. Oka. 2000. Automatic word assignment to images based on image division and vector quantization. In Proc. of RIAO 2000: Content-Based Multimedia Information Access. C. Nobata, S. Sekine, H. Isahara, and R. Grishman. 2002. Summarization system integrated with named entity tagging and ie pattern discovery. In Proc. of the LREC-2002 Conference, pages 1742–1745. J.Y. Pan, H.J. Yang, P. Duygulu, and C. Faloutsos. 2004. Automatic image captioning. In Multimedia and Expo, 2004. ICME’04. IEEE International Conference on, volume 3. RS Purves, A. Edwardes, and M. Sanderson. 2008. Describing the where–improving image annotation and search through geography. 1st Intl. Workshop on Metadata Mining for Image Understanding, Funchal, Madeira-Portugal. S. Satoh, Y. Nakamura, and T. Kanade. 1999. Name-It: naming and detecting faces in news videos. Multimedia, IEEE, 6(1):22–35. F. Song and W.B. Croft. 1999. A general language model for information retrieval. In Proc. of the eighth international conference on Information and knowledge management, pages 3 16–321. ACM New York, NY, USA. M. Stevenson and M.A. Greenwood. 2005. A semantic approach to IE pattern induction. In Proc. of the 43rd Annual Meeting on Association for Computational Linguistics, pages 379–386. Association for Computational Linguistics Morristown, NJ, USA. M. Stevenson and M. Greenwood. 2009. Dependency Pattern Models for Information Extraction. Research on Language and Computation, 7(1): 13– 39. K. Sudo, S. Sekine, and R. Grishman. 2001 . Automatic pattern acquisition for Japanese information extraction. In Proc. of the first international conference on Human language technology research, page 7. Association for Computational Linguistics. R. Yangarber, R. Grishman, P. Tapanainen, and S. Huttunen. 2000. Automatic acquisition of domain knowledge for information extraction. In Proc. of the 18th International Conference on Computational Linguistics (COLING 2000), pages 940–946. Saarbriicken, Germany, August. 1258