This is a personal attempt to understand the digg front page. I am not a mathematician, nor a coder nor an Excel wiz (all of which will become obvious). Nonetheless, I wanted to understand digg better than I did and decided a tiny bit of analysis was in order.
This was the state of play on the front page at 16:25 BST today.

(Time means the number of hours since a post was submitted, to the nearest hour.)
Things to note? Your ranking on the digg site does not correspond any to the three metrics available to users. Nor any combination of them that I was able to concoct. I tried a lot of formulas, but nothing that worked. If the number of diggs was key, then the number one story would be on page 3 or something. All the front page stories have attracted a fair amount of comments, but there’s no correlation between that and their rank. Comments are good and diggs are good, but there’s no way of knowing how much. I could post some graphs at this point, but since there’s no visible relationship, there’s no point. It’s clear that each story has a time-to-live on the front page, but impossible to detect what that is.
However, we’ve learned that digg front page stories do appear to be submitted within the last day, with around 15 hours being the average between submission and the front page.
It makes sense for digg to use a secret algorithm for posts, one that isn’t easily available to users through any of the information they’re given. Otherwise, the solution for gaming digg would be publicised, get spammed and the service would lose its users.
What we don’t have is either the rate of diggs or the rate of comments. I think it would be fair to assume that the ‘G-Meter in 1 minute’ story, the number one story in the list at the time I recorded, gained either a large number of diggs or comments over the recent past. Otherwise, the number two story, and all the rest, would have a higher place. Rate of diggs or comments seems to be more important than their number, although both are important. Since this is about just one moment in time, I can’t comment on how quickly or slowly a post rises and falls dependent on that. However, this is a key indicator, I am sure.
So I failed to reverse-engineer the digg algorithm. I’m frankly not up to the job and I don’t believe we have the information available. So then I looked to a more folksy way to understand the page. Folksy is my forte. I tried to come up with tags that would categorise the stories I was seeing. I know this is not academically rigorous in any way. But this was how it panned out according to my own categorisation of what came up:

So all of these things are good. Again, it’s just one moment from the contantly circulating digg nexus. I wouldn’t want to draw too many rules from this that you couldn’t work out for yourself. Web, Major Vendors (especially MS & Apple) and Conspiracies seem especially good.
I look forward to future posts about ‘How To Beat Microsoft’s Planned Web Conspiracy About Melons’.
P.S. working out how ‘Friends’ or voting blocs might contribute to any of this is well beyond my reach. Alex has some thoughts on this. Thanks to David for the link.
P.P.S. Maths geniuses are welcome to the data here. (.xls file)










David Cruickshank
3 years ago
Rate intuitively feels right as an important ingredient of popularity. If my website gets 1000 hits, it’s more popular if it happens in an hour than if it happens over the course of a year. I’m fairly sure that the homepage is also designed to be inclusive of members of each of the categories where possible. I see a lot of sport articles on there that have far fewer diggs for instance. Rate as a proportion of total diggs in that category might be interesting to look at.
Owen Byrne
3 years ago
The order of items on the front page is chronological based on the time they are promoted, which is not actually shown (just the date they are posted). The promotion algorithm itself has nothing to do with the order on the front page.
Ian Delaney
3 years ago
That’s good information, Owen & Dave. And obviously, the next comparison that needs to be made is between page one and page two. Were I a statistician, I’m sure I’d relish that…
Understanding digg again, natural order at twopointouch: web 2.0, blogs and social media
3 years ago
[...] My first attempts to understand digg, the news-voting site, were a bit of a shambles, to be honest. I tried to work out the order and content of the front page and ended up in a tangle of half-remembered Maths lessons. Owen Byrne, senior software engineer at the service, put me out of my misery by commenting that the order was actually chronological according the time they were promoted to the top. I also commented on the importance of rate and topic. [...]
twopointouch » Another post
3 years ago
[...] This is a post with more writing and also a picture. My first attempts to understand digg, the news-voting site, were a bit of a shambles, to be honest. I tried to work out the order and content of the front page and ended up in a tangle of half-remembered Maths lessons. Owen Byrne, senior software engineer at the service, put me out of my misery by commenting that the order was actually chronological according the time stories were promoted to the top. I also commented on the importance of rate and topic, which may have been less useless.Yesterday, Fred Stutzman posted something to revive my interest. He was talking about the moaning and groaning about the power of top users and the voting blocs around them. Essentially, he says the reason for this is because we need some way to sort through the thousands of stories submitted to digg. Users can’t read them all, a lot of them are spam anyway, and so we develop coping mechanisms.My first attempts to understand digg, the news-voting site, were a bit of a shambles, to be honest. I tried to work out the order and content of the front page and ended up in a tangle of half-remembered Maths lessons. Owen Byrne, senior software engineer at the service, put me out of my misery by commenting that the order was actually chronological according the time stories were promoted to the top. I also commented on the importance of rate and topic, which may have been less useless. [...]