acl acl2013 acl2013-33 acl2013-33-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Vasileios Lampos ; Daniel Preoţiuc-Pietro ; Trevor Cohn
Abstract: Social Media contain a multitude of user opinions which can be used to predict realworld phenomena in many domains including politics, finance and health. Most existing methods treat these problems as linear regression, learning to relate word frequencies and other simple features to a known response variable (e.g., voting intention polls or financial indicators). These techniques require very careful filtering of the input texts, as most Social Media posts are irrelevant to the task. In this paper, we present a novel approach which performs high quality filtering automatically, through modelling not just words but also users, framed as a bilinear model with a sparse regulariser. We also consider the problem of modelling groups of related output variables, using a structured multi-task regularisation method. Our experiments on voting intention prediction demonstrate strong performance over large-scale input from Twitter on two distinct case studies, outperforming competitive baselines.
Faiz A Al-Khayyal and James E Falk. 1983. Jointly Constrained Biconvex Programming. Mathematics of Operations Research, 8(2):273–286. Andreas Argyriou, Theodoros Evgeniou, and Massimiliano Pontil. 2008. Convex multi-task feature learning. Machine Learning, 73(3):243–272, January. Adam Bermingham and Alan F Smeaton. 2011. On using Twitter to monitor political sentiment and predict election results. In Proceedings ofthe Workshop on Sentiment Analysis where AI meets Psychology (SAAIP 2011), pages 2–10, November. Johan Bollen, Huina Mao, and Xiaojun Zeng. 2011. Twitter mood predicts the stock market. Journal of Computational Science, 2(1): 1–8, March. Andrea Esuli and Fabrizio Sebastiani. 2006. SentiWordNet: A publicly available lexical resource for opinion mining. In Proceeding of the 5th Conference on Language Resources and Evaluation (LREC), pages 417–422. Daniel Gayo-Avello, Panagiotis T Metaxas, and Eni Mustafaraj. 2011. Limits of Electoral Predictions using Twitter. In Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media (ICWSM), pages 490–493. Daniel Gayo-Avello. 2012. No, You Cannot Predict Elections with Twitter. IEEE Internet Computing, 16(6):91–94, November. Trevor Hastie, Robert Tibshirani, and Jerome Friedman. 2009. The Elements of Statistical Learning. Springer Series in Statistics. Springer. Bernard J Jansen, Mimi Zhang, Kate Sobel, and Abdur Chowdury. 2009. Twitter power: Tweets as electronic word of mouth. Journal of the American Societyfor Information Science and Technology, 60(1 1):2169–2188. Vasileios Lampos and Nello Cristianini. 2010. Tracking the flu pandemic by monitoring the Social Web. In 2nd IAPR Workshop on Cognitive Information Processing, pages 411–416. IEEE Press. Vasileios Lampos and Nello Cristianini. 2012. Nowcasting Events from the Social Web with Statistical Learning. ACM Transactions on Intelligent Systems and Technology, 3(4): 1–22, September. Vasileios Lampos, Tijl De Bie, and Nello Cristianini. 2010. Flu Detector - Tracking Epidemics on Twitter. In Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), pages 599– 602. Springer. Vasileios Lampos. 2012. On voting intentions inference from Twitter content: a case study on UK 2010 General Election. CoRR, April. Thomas Lansdall-Welfare, Vasileios Lampos, and Nello Cristianini. 2012. Effects of the recession on public mood in the UK. In Proceedings of the 21st international conference companion on World Wide Web, WWW ’ 12 Companion, pages 1221– 1226. ACM. Jun Liu, Shuiwang Ji, and Jieping Ye. 2009. Multitask feature learning via efficient l2,1-norm minimization. pages 339–348, June. Panagiotis T Metaxas, Eni Mustafaraj, and Daniel Gayo-Avello. 2011. How (Not) To Predict Elections. In IEEE 3rd International Conference on Social Computing (SocialCom), pages 165 171. IEEE Press. – John A Nelder and Robert W M Wedderburn. 1972. Generalized Linear Models. Journal of the Royal Statistical Society - Series A (General), 135(3):370. Brendan O’Connor, Ramnath Balasubramanyan, Bryan R Routledge, and Noah A Smith. 2010. From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series. In Proceedings of the International AAAI Conference on Weblogs and Social Media, pages 122–129. AAAI Press. Michael J Paul and Mark Dredze. 2011. You Are What You Tweet: Analyzing Twitter for Public Health. Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, pages 265–272. James W Pennebaker, Cindy K Chung, Molly Ireland, Amy Gonzales, and Roger J Booth. 2007. The Development and Psychometric Properties of LIWC2007. Technical report, Universities of Texas at Austin & University of Auckland, New Zealand. Hamed Pirsiavash, Deva Ramanan, and Charless Fowlkes. 2009. Bilinear classifiers for visual recog- nition. In Advances in Neural Information Processing Systems, volume 22, pages 1482–1490. Daniel Preo ¸tiuc-Pietro, Sina Samangooei, Trevor Cohn, Nicholas Gibbins, and Mahesan Niranjan. 2012. Trendminer: An Architecture for Real Time Analysis of Social Media Text. In Sixth International AAAI Conference on Weblogs and Social Media, pages 38–42. AAAI Press, July. Ignacio Quesada and Ignacio E Grossmann. 1995. A global optimization algorithm for linear fractional and bilinear programs. Journal of Global Optimization, 6(1):39–76, January. Takeshi Sakaki, Makoto Okazaki, and Yutaka Matsuo. 2010. Earthquake shakes Twitter users: real-time event detection by social sensors. In Proceedings of the 19th international conference on World Wide Web (WWW), pages 851–860. ACM. Robert Tibshirani. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society - Series B (Methodological), 58(1):267–288. 1002 Andranik Tumasjan, Timm O Sprenger, Sandner, and Isabell M Welpe. elections with Twitter: 2010. Philipp What 140 characters about political sentiment. In Proceedings G Predicting reveal of the 4th International AAAI Conference on Weblogs and Social Media, pages 178–185. AAAI. Ming Yuan and Yi Lin. 2006. Model selection and es- timation in regression with grouped variables. Journal of the Royal Statistical Society - Series B: Statis- tical Methodology, 68(1):49–67. Peng Zhao and Bin Yu. 2006. On model selection consistency of Lasso. Journal of Machine Learning Research, 7(11):2541–2563. Hui Zou and Trevor Hastie. 2005. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67(2):301–320, April. 1003