emnlp emnlp2012 emnlp2012-45 emnlp2012-45-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Junsheng Zhou; Weiguang Qu; Fen Zhang
Abstract: Most existing systems solve the phrase chunking task with sequence labeling approaches, in which chunk candidates cannot be treated as a whole during parsing, so chunk-level features cannot be exploited in a natural way. In this paper, we formulate phrase chunking as a joint segmentation and labeling task. We propose an efficient dynamic programming algorithm with pruning for decoding, which allows the direct use of features describing the internal characteristics of a chunk and features capturing the correlations between adjacent chunks. A relaxed, online maximum margin training algorithm is used for learning. Within this framework, we explore a variety of effective feature representations for Chinese phrase chunking. The experimental results show that the use of chunk-level features leads to significant performance improvement, and that our approach achieves state-of-the-art performance. In particular, our approach is much better at recognizing long and complicated phrases.
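The joint segmentation and labeling formulation can be illustrated with a small semi-Markov style decoder. This is only a minimal sketch and not the authors' algorithm: the names `decode`, `toy_score`, and `TAGS` are hypothetical, the scoring function is a hand-written toy rather than a learned model, and the `max_len` cap is a crude stand-in for the pruning mentioned in the abstract, whose details are not given here. What the sketch shows is that each candidate chunk is scored as a whole, together with the chunk that precedes it, so chunk-level features and adjacent-chunk features can be used directly.

```python
from typing import Callable, Dict, List, Optional, Tuple

Chunk = Tuple[int, int, str]  # (start, end_exclusive, label)


def decode(
    n: int,
    labels: List[str],
    score: Callable[[Optional[Chunk], Chunk], float],
    max_len: int = 4,
) -> Tuple[float, List[Chunk]]:
    """Return the best-scoring chunking of tokens 0..n-1.

    best[chunk] stores the best total score of any chunking that ends
    with `chunk`, plus a backpointer to the chunk before it.
    """
    best: Dict[Chunk, Tuple[float, Optional[Chunk]]] = {}
    ending_at: Dict[int, List[Chunk]] = {}
    for end in range(1, n + 1):
        ending_at[end] = []
        # Assumed simplification: chunks longer than max_len are never tried.
        for start in range(max(0, end - max_len), end):
            for label in labels:
                chunk = (start, end, label)
                if start == 0:
                    best[chunk] = (score(None, chunk), None)
                else:
                    # Extend every chunking that ends where this chunk starts.
                    best[chunk] = max(
                        (
                            (best[prev][0] + score(prev, chunk), prev)
                            for prev in ending_at[start]
                        ),
                        key=lambda cand: cand[0],
                    )
                ending_at[end].append(chunk)
    if n == 0:
        return 0.0, []
    last: Optional[Chunk] = max(ending_at[n], key=lambda c: best[c][0])
    total = best[last][0]
    chunks: List[Chunk] = []
    while last is not None:  # follow backpointers to recover the chunking
        chunks.append(last)
        last = best[last][1]
    return total, chunks[::-1]


# Toy input: part-of-speech tags for a five-token sentence (hypothetical).
TAGS = ["DT", "NN", "VB", "DT", "NN"]


def toy_score(prev: Optional[Chunk], chunk: Chunk) -> float:
    """Hand-written stand-in for a learned feature-weight dot product."""
    start, end, label = chunk
    span = tuple(TAGS[start:end])
    value = 0.0
    if label == "NP" and span == ("DT", "NN"):  # chunk-level feature
        value += 2.0
    if label == "VP" and span == ("VB",):  # chunk-level feature
        value += 1.0
    if prev is not None and prev[2] == label:  # adjacent-chunk feature
        value -= 0.5
    return value


total, chunks = decode(len(TAGS), ["NP", "VP"], toy_score)
print(total, chunks)
```

Because the decoder state is the whole last chunk rather than a per-token tag, `toy_score` can inspect the full span and the neighbouring chunk at once, which a token-level sequence labeler cannot do without extra machinery.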
References:
Steven P. Abney. 1991. Parsing by chunks. In Robert C. Berwick, Steven P. Abney, and Carol Tenny, editors, Principle-Based Parsing, pages 257-278. Kluwer Academic Publishers.
Daniel M. Bikel. 2004. On the Parameter Space of Generative Lexicalized Statistical Parsing Models. Ph.D. thesis, University of Pennsylvania.
Wenliang Chen, Yujie Zhang, and Hitoshi Isahara. 2006. An empirical study of Chinese chunking. In Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, pages 97-104.
Michael Collins. 2002. Discriminative training methods for hidden Markov models: Theory and experiments with perceptron algorithms. In Proc. EMNLP-02.
Michael Collins. 1999. Head-Driven Statistical Models for Natural Language Parsing. Ph.D. thesis, University of Pennsylvania.
Koby Crammer. 2004. Online Learning of Complex Categorial Problems. Ph.D. thesis, Hebrew University of Jerusalem.
Taku Kudo and Yuji Matsumoto. 2001. Chunking with support vector machines. In Proceedings of NAACL01.
Koby Crammer, Ryan McDonald, and Fernando Pereira. 2005. Scalable large-margin online learning for structured classification. In NIPS Workshop on Learning With Structured Outputs.
Heng Li, Jonathan J. Webster, Chunyu Kit, and Tianshun Yao. 2003. Transductive HMM based Chinese text chunking. In Proceedings of IEEE NLPKE2003, pages 257-262, Beijing, China.
Ryan McDonald, Fernando Pereira, Kiril Ribarov, and Jan Hajic. 2005. Non-projective dependency parsing using spanning tree algorithms. In Proceedings of HLT/EMNLP, pages 523-530.
Ryan McDonald, K. Crammer, and F. Pereira. 2005. Flexible Text Segmentation with Structured Multilabel Classification. In Proceedings of HLT/EMNLP, pages 987-994.
Ryan McDonald. 2006. Discriminative Training and Spanning Tree Algorithms for Dependency Parsing. Ph.D. thesis, University of Pennsylvania.
Beata Megyesi. 2002. Shallow parsing with POS taggers and linguistic features. Journal of Machine Learning Research, 2:639-668.
Antonio Molina and Ferran Pla. 2002. Shallow parsing using specialized HMMs. Journal of Machine Learning Research, 2:595-613.
E.F.T.K. Sang and S. Buchholz. 2000. Introduction to the CoNLL-2000 shared task: Chunking. In Proceedings of CoNLL-00, pages 127-132.
Sunita Sarawagi and W. Cohen. 2004. Semi-Markov conditional random fields for information extraction. In Proceedings of NIPS 17, pages 1185-1192.
Fei Sha and Fernando Pereira. 2003. Shallow parsing with conditional random fields. In Proceedings of HLT-NAACL03.
Xu Sun, Louis-Philippe Morency, Daisuke Okanohara, and Jun'ichi Tsujii. 2008. Modeling Latent-Dynamic in Shallow Parsing: A Latent Conditional Model with Improved Inference. In Proceedings of the 22nd International Conference on Computational Linguistics, pages 841-848.
Yongmei Tan, Tianshun Yao, Qing Chen, and Jingbo Zhu. 2004. Chinese chunk identification using SVMs plus sigmoid. In IJCNLP, pages 527-536.
Yongmei Tan, Tianshun Yao, Qing Chen, and Jingbo Zhu. 2005. Applying conditional random fields to Chinese shallow parsing. In Proceedings of CICLing2005, pages 167-176.
Erik F. Tjong Kim Sang. 2002. Memory-based shallow parsing. JMLR, 2(3):559-594.
Yu-Chieh Wu, Chia-Hui Chang, and Yue-Shi Lee. 2006. A general and multi-lingual phrase chunking model based on masking method. In Proceedings of 7th International Conference on Intelligent Text Processing and Computational Linguistics, pages 144-155.
Nianwen Xue, Fei Xia, Shizhe Huang, and Anthony Kroch. 2000. The bracketing guidelines for the Penn Chinese Treebank. Technical report, University of Pennsylvania.
Yair Censor and Stavros A. Zenios. 1997. Parallel Optimization: Theory, Algorithms, and Applications. Oxford University Press.
Tong Zhang, F. Damerau, and D. Johnson. 2002. Text chunking based on a generalization of winnow. Journal of Machine Learning Research, 2:615-637.
Yue Zhang and Stephen Clark. 2008. Joint word segmentation and POS tagging using a single perceptron. In Proceedings of ACL/HLT, pages 888-896.
Yue Zhang and Stephen Clark. 2010. A fast decoder for joint word segmentation and POS-tagging using a single discriminative model. In Proceedings of EMNLP, pages 843-852.