emnlp emnlp2010 emnlp2010-25 emnlp2010-25-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Wei Lu ; Hwee Tou Ng
Abstract: This paper focuses on the task of inserting punctuation symbols into transcribed conversational speech texts, without relying on prosodic cues. We investigate limitations associated with previous methods, and propose a novel approach based on dynamic conditional random fields. Different from previous work, our proposed approach is designed to jointly perform both sentence boundary and sentence type prediction, and punctuation prediction on speech utterances. We performed evaluations on a transcribed conversational speech domain consisting of both English and Chinese texts. Empirical results show that our method outperforms an approach based on linear-chain conditional random fields and other previous approaches.
D. Beeferman, A. Berger, and J. Lafferty. 1998. CYBERPUNC: A lightweight punctuation annotation system for speech. In Proc. of ICASSP’98. S.F. Chen and J. Goodman. 1996. An empirical study of smoothing techniques for language modeling. In Proc. of ACL’06. H. Christensen, Y. Gotoh, and S. Renals. 2001. Punctuation annotation using statistical prosody models. In Proc. of ISCA Workshop on Prosody in Speech Recognition and Understanding. B. Efron, R. Tibshirani, and R.J. Tibshirani. 1993. An introduction to the bootstrap. Chapman & Hall/CRC. B. Favre, R. Grishman, D. Hillard, H. Ji, D. HakkaniTur, and M. Ostendorf. 2008. Punctuating speech for information extraction. In Proc. of ICASSP’08. A. Gravano, M. Jansche, and M. Bacchiani. 2009. Restoring punctuation and capitalization in transcribed speech. In Proc. of ICASSP’09. D. Hillard, Z. Huang, H. Ji, R. Grishman, D. HakkaniTur, M. Harper, M. Ostendorf, and W. Wang. 2006. Impact of automatic comma prediction on POS/name tagging of speech. In Proc. of SLT’06. J. Huang and G. Zweig. 2002. Maximum entropy model for punctuation annotation from speech. In Proc. of ICSLP’02. J.H. Kim and P.C. Woodland. 2001. The use of prosody in a combined system for punctuation generation and speech recognition. In Proc. of EuroSpeech’01. K. Kirchhoff and M. Yang. 2007. The University of Washington machine translation system for the IWSLT 2007 competition. In Proc. of IWSLT’07. P. Koehn, A. Axelrod, A.B. Mayne, C. Callison-Burch, M. Osborne, and D. Talbot. 2005. Edinburgh system description for the 2005 IWSLT speech translation evaluation. In Proc. of IWSLT’05. P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan, W. Shen, C. Moran, R. Zens, et al. 2007. Moses: Open source toolkit for statistical machine translation. In Proc. of ACL’07 (Demo Session). J. Lafferty, A. McCallum, and F. Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proc. of ICML’01. P. Liang, B. Taskar, and D. Klein. 2006. Alignment by agreement. In Proc. of HLT/NAACL’06. Y. Liu, A. Stolcke, E. Shriberg, and M. Harper. 2005. Using conditional random fields for sentence boundary detection in speech. In Proc. of ACL’05. E. Matusov, A. Mauser, and H. Ney. 2006. Automatic sentence segmentation and punctuation prediction for spoken language translation. In Proc. of IWSLT’06. 186 A. McCallum, K. Rohanimanesh, and C. Sutton. 2003. Dynamic conditional random fields for jointly labeling multiple sequences. In Proc. of NIPS’03 Workshop on Syntax, Semantics and Statistics. F.J. Och. 2003. Minimum error rate training in statistical machine translation. In Proc. of ACL’03. K. Papineni, S. Roukos, T. Ward, and W.J. Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proc. of ACL’02. M. Paul. 2008. Overview of the IWSLT 2008 evaluation campaign. In Proc. of IWSLT’08. M. Paul. 2009. Overview of the IWSLT 2009 evaluation campaign. In Proc. of IWSLT’09. S. Sarawagi and W.W. Cohen. 2005. Semi-Markov conditional random fields for information extraction. In Proc. of NIPS’05. F. Sha and F. Pereira. 2003. Shallow parsing with conditional random fields. In Proc. of HLT-NAACL’03. A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. Tur, and Y. Lu. 1998. Automatic detection of sentence boundaries and disfluencies based on recognized words. In Proc. of ICSLP’98. A. Stolcke. 2002. SRILM–an extensible language modeling toolkit. In Proc. of ICSLP’02. C. Sutton and A. McCallum. 2004. Collective segmentation and labeling of distant entities in information extraction. In Proc. of ICML’04 workshop on Statistical Relational Learning. C. Sutton, A. McCallum, and K. Rohanimanesh. 2007. Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data. Journal of Machine Learning Research, 8. C. Sutton. 2006. GRMM: GRaphical Models in Mallet. http : / /mal let . cs .umas s .edu / grmm/ . H. Tseng, P. Chang, G. Andrew, D. Jurafsky, and C. Manning. 2005. A conditional random field word segmenter for sighan bakeoff 2005. In Proc. of the Fourth SIGHAN Workshop on Chinese Language Processing. M. Wainwright, T. Jaakkola, and A. Willsky. 2001 . Treebased reparameterization for approximate inference on loopy graphs. In Proc. of NIPS’01. H. Wang, H. Wu, X. Hu, Z. Liu, J. Li, D. Ren, and Z. Niu. 2008. The TCH machine translation system for IWSLT 2008. In Proc. of IWSLT’08.