Encyclopaedic Knowledge

Aaron Swartz contributes some fascinating analysis to the study of who writes Wikipedia. Founder Jimmy Wales has often stated that a small number of people make the largest number of contributions. He told Stanford University that “the most active 2%, which is 1400 people, have done 73.4% of all the edits,” for example.

Swartz decided to count the evidence in a different way. He counted the words added to articles, as opposed to changes for the sake of grammar, spelling and structure. From his analysis, it appears that, in fact, a large number of very different people contribute articles or major chunks of articles. Then, a much smaller number tinker with the story from then on.

When you put it all together, the story become clear: an outsider makes one edit to add a chunk of information, then insiders make several edits tweaking and reformatting it. In addition, insiders rack up thousands of edits doing things like changing the name of a category across the entire site - the kind of thing only insiders deeply care about. As a result, insiders account for the vast majority of the edits. But it’s the outsiders who provide nearly all of the content.

This is the way it should be, of course. Experts in a subject are a lot more rare than people with general editorial skills. And this is, in some respects, how traditional encyclopaedias are written. The editorial board recruits experts to write articles which are then edited by the board.

Wikipedia stands this normal process on its head, of course. The original contributions come from anywhere, hopefully an expert, but there are no guarantees. Then the editors continually refine these contributions to meet acceptable standards. If Wikipedia were a more traditional business, and paid its editors, this would be deemed very inefficient. However, when you’ve got hundreds of thousands of people working for free, it works well enough.

Where the contributors vs editors ratio affects things more controversially is in decisions over the running of the site. To qualify to vote on decisions, you need to have racked up a substantial number of edits. But the people who have done this, dedicated and necessary as they are, may not necessarily have contributed many articles at all. They could feasibly have racked up thousands of edits eliminating uses of the oxford comma because they simply don’t like it. A voting caucus based around contributions as opposed to changes and deletions might look very different.

Swartz is running for election to the Wikipedia Foundation Board.


7 Comments

A Wikipedia ‘admin’ deleted the Web 3.0 entry which was put up by someone I am absolutely not related to in any way shape or form. They also deleted the other Web 3 entry that had nothing to do with the Semantic Web.

If the admins are dumb and malnutritioned then what we get is a dumb and mulnutritioned Wikipedia.

:)

Not sure about the chances of a Web 3.0 entry in any other encyclopaedia, Marc. I think they’d die of embarassment first.

Well, there were two entries. One written by A List Apart author (Zeldman) and one written by a Peter Campbell.

I wonder why when there is already a Web 2.0 entry.

Web 3.0 is the next phase, numerically speaking, so why not describe the emerging ideas?

Also, I got an invite to publish the Web 3.0 article for someone affiliated with IEEE and Stanford Univeristy.

So ….

I think they’re behind the times… resisting the inevitable. :)

Sorry, Marc. I guess that was a bit flippant - though I do think web 2.0’s worst enemy is its name. Semantic Web is still the other name for web 3.0, though, isn’t it?

I wish I could vote for Aaron Swartz in the Wikimedia election, but I am told that “you are not qualified to vote” as “you need to have made 400 edits … you have made 115″. 115 edits doesn’t qualify me to vote in a supposedly egalitarian place??? And never mind that one “edit” was the contribution of a comprehensive new article, something I’m sure many eligible voters have never done.

As a fan of Wikipedia, I’m immensely grateful to Swartz for shedding some much-needed light on how Wikipedia really evolves. Now I just hope that the Wikipedia powers-that-be will act according to this reality, rather than be ruled by their egos.

Humanity was never meant to produce consistent logic. And consistent logic was never meant to produce humanity.

As I’m sure we both know, Rohan, egos are a big part of almost any organisation. I’m a massive fan of wikipedia myself and agree that original contributors have a higher value than what they call ‘janitors’ in the normal scheme of things. Trouble is, since there’s no quality control on contributors, janitors come to rule the roost. Not sure of the way out there.


Leave a Comment