Forecasting with Twitter data: An application to USA TV series audience
Price
Free (open access)
Volume
Volume 11 (2016), Issue 3
Pages
9
Page Range
220 - 229
Paper DOI
10.2495/DNE-V11-N3-220-229
Copyright
WIT Press
Author(s)
L. MOLTENI & J. PONCE DE LEON
Abstract
Various researchers and analysts highlighted the potential of Big Data, and social networks in particular, to optimize demand forecasts in managerial decision processes in different sectors. Other authors focused the attention on the potential of Twitter data in particular to predict TV ratings. In this paper, the interactions between television audience and social networks have been analysed, especially considering Twitter data. In this experiment, about 2.5 million tweets were collected, for 14 USA TV series in a nine-week period through the use of an ad hoc crawler created for this purpose. Subsequently, tweets were classified according to their sentiment (positive, negative, neutral) using an original method based on the use of decision trees. A linear regression model was then used to analyse the data. To apply linear regression, TV series have been grouped in clusters; clustering is based on the average audience for the individual series and their coefficient of variability. The conclusions show and explain the existence of a significant relationship between audience and tweets.
Keywords
Audience, Forecasting, Regression, Sentiment, TV, Twitter.