> The question then is to what extent the content of Twitter is representative of voting behaviour and how accurately we can predict the results of an election with Twitter data.
Some demographics (young? city?) are overrepresented than others (old? farms?). In some cases, it's cool to vote publically for a party, so the people is more eager to say it publically or directly lie in the tweets. In other cases, people is afraid to say publically that they will vote for a party ...
I think you will need many magical constants that are not available until you have 3 or 4 previous elections to fit the data.
Can you make a prediction for each province / state?
Good look, and post the progress and predictions before the election, and a post-mortem analysis after.
I had made an prediction for each province after initially collecting all the tweets, but the results were not accurate. At the moment I have also determined the location of about 33% of the twitter accounts and I hope the result will be better if I exclude all Twitterers which are not from the same province as the one I am doing the calculation on.
Thanks for the feedback :)
(1) http://webrazzi.com/2014/07/17/genart-ve-nielsenin-turkiyede...
I have tried to predict the outcome of Turkish General elections of 1 November 2015 using Twitter data and I believe I will be able to predict quiete accurately. Within a few percent.