I have been trying to find out about the AFINN sentiment analysis model, if I understand it correctly it is a list of words containing each its score.
https://github.com/fnielsen/afinn/tree/master/afinn/dataIt makes me think about Microsoft
Flow (edit: the name isn't Flow but I'm unable to remember it). Does this tool use the number of tweets or more about the feelings expressed?
Or truly a mix of both?
The first question I asked myself concerns all the Twitter bots that auto-post a tweet every XX minutes (using IFTTT or Zapier for example).
The charts could therefore be easily manipulated, by creating hundreds of accounts, collecting subscribers, and then automating the accounts.
It is very easy to create a hundred accounts per day.
I believe 1000 accounts could be enough to manipulate the charts. Currently, there are 15.000 tweets/hours, an account can post every 10 minutes (to avoid the Twitter algorithm) 1.000 accounts x 6 tweets/hours= 6.000 tweets. It's about 40% of the 15.000
How the tool filters this?
Good Points. You clearly had an in-depth look at the tool.
Yes, that is the correct understanding of AFINN.
The tool does take into account volume of tweets along with sentiment plus a few other factors (Twitter account age and activity is used to filter out fresh accounts).
Also, Twitter is very good at detecting spam. How do I know? Because I actually tried to create a bot that would tweet out current sentiment and the API was very restrictive on how often it would let me post.
There are no explicit rate-limits to posting statuses on twitter, but the algorithm doesn't want you to automate tweets that are too similar to your recent statuses etc.