Author Archives: jsilter

Protected: Estimating active reddit users

There is no excerpt because this is a protected post.

Posted in reddit, Statistics | Enter your password to view comments.

The comments of the few outweigh the comments of the many

The Pareto Principle for businesses states that 80% of sales come from 20% of customers. Social media has the same skew; the majority of content comes from a minority of users. I’ve always been curious just how skewed this activity can be. … Continue reading

Posted in Uncategorized | Leave a comment

Some musings on statistics

A) Beware of The Wrong Summary Statistics SlateStarCodex had a pretty interesting post entitle “Beware of Summary Statistics“, showing how they can be misleading. This isn’t exactly new, there are famous examples of how just looking at the mean and standard deviation greatly … Continue reading

Posted in Statistics | Leave a comment

Subreddit Map

Reddit describes itself as the “front page of the internet”, and given how many users it has, that’s not too far off. It’s divided into subreddits, which can have either broad or narrow topics. These subreddits are (mostly) user-created, with … Continue reading

Posted in reddit, Social Media, Text Mining | 4 Comments

Exaggeration of Science

Communicating scientific results to the public is difficult, even with the best intentions. There are all kinds of subtleties in any study which don’t make it into media coverage. Furthermore, caveats about interpreting results get lost along the way. A recent … Continue reading

Posted in Science Publishing | Tagged , | 2 Comments