I'd kinda like to see which swear words appear most often in commit messages. I'm guessing that "shit" and "fuck" are much more common than "cocksucker" and "motherfucker", and if that's not true, I want to know which language has the most cocksuckers and motherfuckers.
Yeah, the pie chart doesn't quite cover it - I'd like to see both swear words per commit per language (if, say, Java has 10% of the swear words but 3% of the commits) and complexity of the swear words - a simple "Fuck" implies far less frustration than a "Motherfucking Cocksucker!"
Could develop quite a nice Programming Language Pain Index…
I run a slang dictionary website which lets users assign an offensiveness score to each term. That would be an interesting bit of data to add: not only the raw word count, but how offensive the swearing is for each language.
i'd say alter the list of swear words in general to a list possibly more tuned to programming. agreeing with above, word breakdown would be nice as well.