acl acl2010 acl2010-216 acl2010-216-reference knowledge-graph by maker-knowledge-mining

216 acl-2010-Starting from Scratch in Semantic Role Labeling

Source: pdf

Author: Michael Connor ; Yael Gertner ; Cynthia Fisher ; Dan Roth

Abstract: A fundamental step in sentence comprehension involves assigning semantic roles to sentence constituents. To accomplish this, the listener must parse the sentence, find constituents that are candidate arguments, and assign semantic roles to those constituents. Each step depends on prior lexical and syntactic knowledge. Where do children learning their first languages begin in solving this problem? In this paper we focus on the parsing and argumentidentification steps that precede Semantic Role Labeling (SRL) training. We combine a simplified SRL with an unsupervised HMM part of speech tagger, and experiment with psycholinguisticallymotivated ways to label clusters resulting from the HMM so that they can be used to parse input for the SRL system. The results show that proposed shallow representations of sentence structure are robust to reductions in parsing accuracy, and that the contribution of alternative representations of sentence structure to successful semantic role labeling varies with the integrity of the parsing and argumentidentification stages.

reference text

M.J. Beal. 2003. Variational Algorithms for Approximate Bayesian Inference. Ph.D. thesis, Gatsby Computational Neuroscience Unit, University College London. L. Bloom. 1970. Language development: Form and function in emerging grammars. MIT Press, Cambridge, MA. L. Bloom. 1973. One word at a time: The use of single-word utterances before syntax. Mouton, The Hague. M.R. Brent and J.M. Siskind. 2001 . The role of exposure to isolated words in early vocabulary development. Cognition, 81:3 1–44. E. Brill. 1997. Unsupervised learning of disambiguation rules for part of speech tagging. In Natural Language Processing Using Very Large Corpora. Kluwer Academic Press. R. Brown. 1973. A First Language. Harvard University Press, Cambridge, MA. X. Carreras and L. M `arquez. 2004. Introduction to the CoNLL-2004 shared tasks: Semantic role labeling. In Proceedings of CoNLL-2004, pages 89–97. Boston, MA, USA. E. Charniak. 1997. Statistical parsing with a contextfree grammar and word statistics. In Proc. National Conference on Artificial Intelligence. E.V. Clark. 1978. Awwareness of language: Some evidence from what children say and do. In R. J. A. Sinclair and W. Levelt, editors, The child’s conception of language. Springer Verlag, Berlin. M. Connor, Y. Gertner, C. Fisher, and D. Roth. 2008. Baby srl: Modeling early language acquisition. In Proc. of the Annual Conference on Computational Natural Language Learning (CoNLL), pages xx–yy, Aug. M. Connor, Y. Gertner, C. Fisher, and D. Roth. 2009. Minimally supervised model of early language acquisition. In Proc. of the Annual Conference on Computational Natural Language Learning (CoNLL), Jun. M. Demetras, K. Post, and C. Snow. 1986. Feedback to first-language learners. Journal of Child Language, 13:275–292. K. Demuth, J. Culbertson, and J. Alter. 2006. Wordminimality, epenthesis, and coda licensing in the acquisition of english. Language & Speech, 49: 137– 174. C. Fisher. 1996. Structural limits on verb mapping: The role of analogy in children’s interpretation of sentences. Cognitive Psychology, 3 1:41–81 . Jianfeng Gao and Mark Johnson. 2008. A comparison of bayesian estimators for unsupervised hidden markov model pos taggers. In Proceedings of EMNLP-2008, pages 344–352. D. Gentner. 2006. Why verbs are hard to learn. In K. Hirsh-Pasek and R. Golinkoff, editors, Action meets word: How children learn verbs, pages 544– 564. Oxford University Press. Y. Gertner and C. Fisher. 2006. Predicted errors in early verb learning. In 31st Annual Boston University Conference on Language Development. 997 Y. Gertner, C. Fisher, and J. Eisengart. 2006. Learning words and rules: Abstract knowledge of word order in early sentence comprehension. Psychological Science, 17:684–691. J. Gillette, H. Gleitman, L. R. Gleitman, and A. Lederer. 1999. Human simulations of vocabulary learning. Cognition, 73: 135–176. Sharon Goldwater and Tom Griffiths. 2007. A fully bayesian approach to unsupervised part-of-speech tagging. In Proceedings of 45th Annual Meeting of the Association of Computational Linguists, pages 744–751. R. Gomez and L. Gerken. 1999. Artificial grammar learning by 1-year-olds leads to specific and abstract knowledge. Cognition, 70: 109–135. A. Haghighi and D. Klein. 2006. Prototype-drive learning for sequence models. In Proceedings of NAACL-2006, pages 320–327. Mark Johnson. 2007. Why doesnt em find good hmm pos-taggers? In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pages 296–305. M.H. Kelly. 1992. Using sound to solve syntactic problems: The role of phonology in grammatical category assignments. Psychological Review, 99:349–364. J. Lidz, H. Gleitman, and L. R. Gleitman. 2003. Understanding how input matters: verb learning and the footprint of universal grammar. Cognition, 87: 15 1 178. B. MacWhinney. 2000. The CHILDES project: Tools for analyzing talk. Third Edition. Lawrence Elrbaum Associates, Mahwah, NJ. M. P. Marcus, B. Santorini, and M. Marcinkiewicz. 1993. Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2):313–330, June. Marina Meil a˘. 2002. Comparing clusterings. Technical Report 418, University of Washington Statistics Department. T. Mintz. 2003. Frequent frames as a cue for grammatical categories in child directed speech. Cognition, 90:91–1 17. P. Monaghan, N. Chater, and M.H. Christiansen. 2005. The differential role of phonological and distributional cues in grammatical categorisation. Cognition, 96: 143–182. S. Pinker. 1984. Language learnability and language development. Harvard University Press, Cambridge, MA. V. Punyakanok, D. Roth, and W. Yih. 2008. The importance of syntactic parsing and inference in semantic role labeling. Computational Linguistics, 34(2). L. R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2):257–285. Sujith Ravi and Kevin Knight. 2009. Minimized models for unsupervised part-of-speech tagging. In Proceedings of the Joint Conferenceof the 47th An- nual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACLIJCNLP). J.R. Saffran, R.N. Aslin, and E.L. Newport. 1996. Statistical learning by 8-month-old infants. Science, 274: 1926–1928. Rushen Shi, James L. Morgan, and Paul Allopenna. 1998. Phonological and acoustic bases for earliest grammatical category assignment: a crosslinguistic perspective. Journal of Child Language, 25(01): 169–201. Rushen Shi, Janet F. Werker, and James L. Morgan. 1999. Newborn infants’ sensitivity to perceptual cues to lexical and grammatical words. Cognition, 72(2):B1 1 B21. – L.B. Smith and C. Yu. 2008. Infants rapidly learn word-referent mappings via cross-situational statistics. Cognition, 106: 1558–1568. Kiristina Toutanova and Mark Johnson. 2007. A bayesian lda-based model for semi-supervised partof-speech tagging. In Proceedings of NIPS. S. Yuan and C. Fisher. 2009. “really? she blicked the baby?”: Two-year-olds learn combinatorial facts about verbs by listening. Psychological Science, 20:619–626. S. Yuan, C. Fisher, Y. Gertner, and J. Snedeker. 2007. Participants are more than physical bodies: 21month-olds assign relational meaning to novel transitive verbs. In Biennial Meeting of the Society for Research in Child Development, Boston, MA. 998