acl acl2013 acl2013-310 acl2013-310-reference knowledge-graph by maker-knowledge-mining

310 acl-2013-Semantic Frames to Predict Stock Price Movement


Source: pdf

Author: Boyi Xie ; Rebecca J. Passonneau ; Leon Wu ; German G. Creamer

Abstract: Semantic frames are a rich linguistic resource. There has been much work on semantic frame parsers, but less that applies them to general NLP problems. We address a task to predict change in stock price from financial news. Semantic frames help to generalize from specific sentences to scenarios, and to detect the (positive or negative) roles of specific companies. We introduce a novel tree representation, and use it to train predictive models with tree kernels using support vector machines. Our experiments test multiple text representations on two binary classification tasks, change of price and polarity. Experiments show that features derived from semantic frame parsing have significantly better performance across years on the polarity task.


reference text

Apoorv Agarwal, Fadi Biadsy, and Kathleen Mckeown. 2009. Contextual phrase-level polarity analysis using lexical affect scoring and syntactic N-grams. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), pages 24– 32, Athens, Greece, March. Association for Computational Linguistics. Apoorv Agarwal, Boyi Xie, Ilia Vovsha, Owen Rambow, and Rebecca Passonneau. 2011. Sentiment analysis of twitter data. In Proceedings of the Workshop on Languages in SocialMedia, LSM ’ 11, pages 30–38. Association for Computational Linguistics. Pierre Baldi, Søren Brunak, Yves Chauvin, Claus A. F. Andersen, and Henrik Nielsen. 2000. Assessing the accuracy of prediction algorithms for classification: an overview. Bioinformatics, 16:412 – 424. Roy Bar-Haim, Elad Dinur, Ronen Feldman, Moshe Fresko, and Guy Goldstein. 2011. Identifying and following expert investors in stock microblogs. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 13 10–1319, Edinburgh, Scotland, UK., July. Association for Computational Linguistics. David M. Blei and Jon D. McAuliffe. 2007. Supervised topic models. In Advances in Neural Information Processing Systems, Proceedings of the TwentyFirst Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6. Christopher Chua, Maria Milosavljevic, and James R. Curran. 2009. A sentiment detection engine for internet stock message boards. In Proceedings of the Australasian Language Technology Association Workshop 2009, pages 89–93, Sydney, Australia, December. Michael Collins and Nigel Duffy. 2002. New rank- ing algorithms for parsing and tagging: kernels over discrete structures, and the voted perceptron. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL ’02, pages 263– 270, Stroudsburg, PA, USA. Association for Computational Linguistics. 881 Germ a´n G. Creamer, Yong Ren, and Jeffrey V. Nickerson. 2012. A Longitudinal Analysis of Asset Return, Volatility and Corporate News Network. In Business Intelligence Congress 3 Proceedings. Dipanjan Das and Noah A. Smith. 2011. Semisupervised frame-semantic parsing for unknown predicates. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1, HLT ’ 11, pages 1435–1444, Stroudsburg, PA, USA. Association for Computational Linguistics. Dipanjan Das and Noah A. Smith. 2012. Graph-based lexicon expansion with sparsity-inducing penalties. In HLT-NAACL, pages 677–687. The Association for Computational Linguistics. Scott Deerwester, Susan T. Dumais, George W. Furnas, Thomas K. Landauer, and Richard Harshman. 1990. Indexing by latent semantic analysis. Journal of the American Society for Information Science. Ann Devitt and Khurshid Ahmad. 2007. Sentiment polarity identification in financial news: A cohesionbased approach. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 984–991, Prague, Czech Republic, June. Association for Computational Linguistics. William Dolan, Chris Quirk, and Chris Brockett. 2004. Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. Proceedings of the 20th International Conference on Computational Linguistics. Joseph Engelberg and Christopher A. Parsons. 2011. The causal impact of media in financial markets. Journal of Finance, 66(1):67–97. Charles J. Fillmore, Christopher R. Johnson, and Miriam R. L. Petruck. 2003. Background to Framenet. International Journal of Lexicography, 16(3):235–250, September. Charles J. Fillmore. 1976. Frame semantics and the nature of language. Annals of the New York Academy of Sciences, 280(1):20–32. Syed Aqueel Haider and Rishabh Mehrotra. 2011. Corporate news classification and valence predic- tion: A supervised approach. In Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA 2.011), pages 175–181, Portland, Oregon, June. Association for Computational Linguistics. Thorsten Joachims. 2006. Training linear svms in linear time. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD ’06, pages 217–226, New York, NY, USA. ACM. Giuseppe Jurman and Cesare Furlanello. 2010. A unifying view for performance measures in multi-class prediction. ArXiv e-prints. Soo-Min Kim and Eduard Hovy. 2006. Extracting opinions, opinion holders, and topics expressed in online news media text. In Proceedings of the Workshop on Sentiment and Subjectivity in Text, SST ’06, pages 1–8, Stroudsburg, PA, USA. Association for Computational Linguistics. Shimon Kogan, Dimitry Levin, Bryan R. Routledge, Jacob S. Sagi, and Noah A. Smith. 2009. Predicting risk from financial reports with regression. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL ’09, pages 272–280, Stroudsburg, PA, USA. Association for Computational Linguistics. Victor Lavrenko, Matt Schmill, Dawn Lawrie, Paul Ogilvie, David Jensen, and James Allan. 2000. Mining of concurrent text and time series. In In proceedings of the 6th ACM SIGKDD Int’l Conference on Knowledge Discovery and Data Mining Workshop on Text Mining, pages 37–44. Ronny Luss and Alexandre d’Aspremont. 2008. Predicting abnormal returns from news using text classification. CoRR, abs/0809.2792. Brian W. Matthews. 1975. Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochimica et Biophysica Acta (BBA) Protein Structure, 405(2):442 451. – Alessandro Moschitti. 2006. Making tree kernels practical for natural language learning. In In Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics. Daniele Pighin and Alessandro Moschitti. 2009. Reverse engineering of tree kernel feature spaces. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, 6-7 August 2009, Singapore, pages 111–120. Eduardo J. Ruiz, Vagelis Hristidis, Carlos Castillo, Aristides Gionis, and Alejandro Jaimes. 2012. Correlating financial time series with micro-blogging activity. In Proceedings of the fifth ACM international conference on Web search and data mining, WSDM ’ 12, pages 513–522, New York, NY, USA. ACM. Josef Ruppenhofer and Ines Rehbein. 2012. Semantic frames as an anchor representation for sentiment analysis. In Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, WASSA ’ 12, pages 104– 109, Stroudsburg, PA, USA. Association for Computational Linguistics. Tina H. Rydberg and Neil Shephard. 2003. Dynamics of Trade-by-Trade Price Movements: Decomposition and Models. Journal of Financial Econometrics, 1(1):2–25. 882 Paul C. Tetlock, Maytal Saar-Tsechansky, Macskassy. Language 2008. More than Words: to Measure Firms’ and Sofus Quantifying Fundamentals. The Journal of Finance. Paul C. Tetlock. 2007. Giving Content to Investor Sen- timent: The Role of Media in the Stock Market. The Journal of Finance. Cynthia M. Whissel. language. 1989. The dictionary of affect in Emotion: Theory, Research, and Experi- ence, 39(4): 113–13 1. 883