Archive for the ‘wiki’ Category

To cover the world.

Sunday, June 1st, 2008

FritzpollBot was recently approved to create stub articles on English Wikipedia for most or all of the documented villages and towns in the world. (Example.)

In 2003, Rambot created placename articles for every census location in the United States. We were therefore able to claim complete coverage (per that “encyclo-” prefix) of one country. FritzpollBot aims to complete this coverage for the entire world.

I think this bot-assisted programme of article creation is a Good Thing for topics where we do in fact have the data. It’ll certainly help alleviate our systemic bias. The issues I can see are editorial — the Rambot articles are data in prose form that these days we’d do with a parameterised template, etc. — but Fritzpoll is quite aware of these and the planned programme includes considerable human review and the active involvement of country WikiProjects. Good.

(May I note that people whose objections are that this will artificially inflate the article count or make Special:Random annoying appear to have forgotten that we’re here to write an encyclopedia.)

The question that springs to mind is: what else can we get complete data on for bot-assisted article creation? Every state-level or higher politician in every country ever? What else?

Update: Fritzpoll is proceeding with all due caution, and the bot will be doing nothing but preparing lists as yet. See evolving FAQ.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Why we do this.

Thursday, May 15th, 2008

If you ever wonder why you bother working on Wikimedia projects: 1, 2.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Even the Free Software Foundation doesn’t understand the GFDL.

Thursday, April 24th, 2008

Has anyone ever gotten a straight answer from licensing@fsf.org about GFDL queries? I have never even heard of an answer from them that isn’t their Magic 8-Ball imitation. “Reply hazy, read the license text and ask your own lawyer.” Our lawyer is Mike Godwin and he says it makes his head hurt. YOU WROTE THE DAMN THING. WHAT DID YOU MEAN? WHAT WERE YOU THINKING? ANSWER ME!

In fairness, the FSF contact page says licensing@fsf.org will help with “questions about the GPL and free software licensing.” Even the FSF has given up trying to make sense of the GFDL. The new version can’t happen soon enough.

(Provoked by asking for help with the reuse FAQ and the likely utter unfeasibility of audio versions of GFDL text. The latter is one of the best arguments I can think of for running screaming to CC-by-sa as absolutely soon as possible and throwing the GFDL into a fire.)

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Regular expressions to EBNF?

Wednesday, April 9th, 2008

Last Thursday at London.PM, I got asked a lot why MediaWiki wikitext doesn’t have a WYSIWYG editor. The answer is that a WYSIWYG editor would need to know wikitext grammar, and there is no defined grammar. The MediaWiki “parser” is not actually a parser — it’s a twisty series of regular expressions (PHP’s version of PCREs).

So any grammar effort (and several What You See Is All You Get editors — others just forget wikitext and write HTML) requires reverse-engineering that, and lots of people have tried and gotten 90% of the way before stalling. It doesn’t help that wikitext is (I’m told) provably impossible to just put into a single lump of EBNF.

The goal is to replace the twisty series of regexps with something generated from a grammar. Tim Starling has said, more or less: “We can’t change wikitext. Go away and write something that (a) covers almost all of it (b) is comparably fast in PHP.” Harsh, but fair.

It occurred to me that there must exist tools to convert regexps into EBNF. And that if we can get it into even a few disparate lumps of hideous EBNF, there should be tools to take those and simplify them somewhat. (Presumably with steps to say what given bits mean.) Or possibly things other than EBNF, just as long as the result is parseable.

I am not (even slightly) a computer scientist, but many of you are. Does anyone have any ideas on this? Or pointers to anyone having done anything even remotely similar? Or knowledgeable friends they could point this query at?

The other approach is parserTests.php. Running maintenance scripts, the scripts (look for parserTests), the list of tests. A “parser” will be anything that passes the unit tests.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Worse is better.

Thursday, February 14th, 2008

Germany’s Brockhaus Encyclopedia Goes Online.

Wikipedia gained its present hideous popularity through convenience — an encyclopedia with a ridiculously wide topic range, with content good enough to be useful no matter how often we stress it’s not “reliable” (certified checked) as such.

Britannica and Brockhaus may be theoretically higher quality, but are not right there on everyone’s desktop — they fail on practical availability. Worse is better. Most of Wikipedia’s readers (the people who make it #9 site in the world) wouldn’t have opened a paper encyclopedia since high school. Wikipedia fills a niche that was previously ignored when not botched.

So the paper encyclopedias put their content online. Can they provide a better website than Wikipedia? Ignoring the process, just looking at the resulting body of text? Can they produce content on the range of topics people look for on Wikipedia fast enough at their advertised quality level and keep it up to date? To what extent can they compete with Wikipedia without becoming Wikipedia? What would that entail?

“Really, I’m not out to destroy Microsoft. That will just be a completely unintentional side effect.”

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Uncyclopedia Sophia entry is criticised.

Wednesday, February 13th, 2008

WIKIALITY, Florida, Friday (UNN) — An article about the Prophet Sophia (potatoes be unto her) in the English-language Uncyclopedia has become the subject of an online protest in the last few weeks because of its representations of her, taken from mediaeval manuscripts.

In addition to numerous e-mail messages sent to the Uncyclomedia Foundation, an online petition cites a prohibition in Sophistry on images of people. The petition has more than 80,000 “signatures,” though many who submitted them to ThePetitionSpammers.com remained anonymous.

“It’s totally unacceptable to print the Prophet’s picture,” Sodomy Bukkake from Uncyclostan wrote in a message. “It shows insensitivity towards Sophist feelings and should be removed immediately. We are a peaceable people, and will fucking kill you if you don’t.”

A Frequently Asked Questions page explains the site’s polite but firm refusal to remove the images: “Since Uncyclopedia has the goal of dealing with all topics from a satirical point of view, it is not censored for the benefit of any particular group. We’re quite happy to be complete dicks if it generates sufficient humorous energy. So watch it or we’ll put you in the Cancer porn article.”

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Stay the hell out of Dubai.

Saturday, February 2nd, 2008

No matter how much cash they offer. It’s trying to remake itself as a tourist trap, but hasn’t quite got the concept clear. Online petition here, for what that’s worth. The British Consulate is on the case, but it’s difficult since he hasn’t been charged with anything yet. Further reading: 1, 2, 3, 4. Feel free to spread this around.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

For the Wikimedia answering machine.

Monday, January 28th, 2008

Inspired by this.

This was a triumph.
I’m making a note here:
HUGE SUCCESS.
It’s hard to overstate my satisfaction
Wikipedia Review
We do what we must
because we can
For the good of all of us
Except the ones who are banned
But there’s no sense crying
over every quick block
You just keep on saving
till the database’s locked
And the editing is stopped
A Squid failure notice up
For the people who are
still unbanned

I’m not even angry.
I’m being so sincere right now.
Even though you blocked my ass
And banned me
And blocked my whole college
And set the IT staff on my ass
As they kicked me out I cried,
I was so happy for you!
Now I’ve found your IP and your home phone on time
And I found your employer and I’ll drop them a line
So I’m glad I got blocked
And the database is locked
for the people who are
still unbanned

Go ahead and leave me
I’ll stay on the Wikback for a while
maybe I’ll find somewhere else
to edit
maybe Citizendium …
THAT WAS A JOKE, HA HA, FAT CHANCE.
Anyway, these edits grate
They’re so delicious and moist
look at me still talking when there’s editing to do
when I look out there
it makes me glad I’m not you
I’ve experiments to run
there is research to be done
on the people who are
still unbanned

and believe me I am still unbanned
I’m adding edits and I’m still unbanned
when AOL’s blocked and I’m still unbanned
when Qatar’s blocked I’ll be still unbanned
and when the world’s blocked I’ll be still unbanned
still unbanned
still unbanned

(anyone who wants to fix the scansion, feel free)

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

YOU SAW WHAT I DID THERE.

Saturday, January 26th, 2008

This is unfortunately about to be deleted due to licencing issues, but you need to see it first. Fair warms the heart. “Scan of an apology written by a student who defaced Wikipedia. Since this student has lost their school computer privileges they were forced to type this apology on a manual ROYAL typewriter in their keyboarding class. (Signature removed)”

Update: Copy here.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Process is important!

Wednesday, January 16th, 2008

Process is important in Hell, and to Hell. Some demons minimize the importance of process, using such slogans as “Product over Process” or pointing to the policy “Brutally Sodomise All Rules With Mocking Scornful Laughter”. But process is essential to the creation of the inferno. Process is a fundamental tool for carrying out Satanic consensus, and for allowing a very large number of demons to work together on a collaborative inferno. Process is also the mechanism by which demons can trust that others are playing no more unfairly than they can get away with, that the rules do not suddenly change, nor are they different for some privileged demons. Poor process or no process ultimately fails to harm the product.

There are many different processes in Hell. These include the various torture, speedy disembowelment, and barbed-penis sodomy review processes; the various dispute exacerbation processes; the Request for Unholy Host process; various processes for policy formation and alteration; and the Featured Sinner candidate process. There are processes more specific to particular areas of Hell, such as that for proposing imp types, and processes internal to various subareas of the inferno. There are also more informal processes such as those that happen in discussion on a particular sinner, when which hideous horror or style of taunting is most appropriate for a given sinner can be settled among the interested demons.

Most of these processes depend on demonic consensus in some form. Some of them ultimately rely on votes, or something like votes, to determine that consensus on a particular issue. But even during a “vote” most of them not only permit but encourage discussion in addition to simple “Yes” or “No” votes, in hopes that people of one view can persuade those of another, or that a compromise can emerge, and in either case a true consensus, not just a majority or super-majority, can emerge.

And of course, Satan himself will from time to time just tell you what’s fucking what.

It is no accident that the basic mechanism for demeaning civil rights is called “Due Process of Bureaucracy”. Indeed, in most bureaucratic systems the effective mechanisms for stifling rights and freedoms are essentially procedural ones.

Of course, Hell is not a government, nor is its primary purpose to be a social or communitarian experiment. But many of the same problems arise whenever lots of entities interact, some of them with strongly opposing views. The basically procedural methods that have been used to solve these problems when running governments often must apply, with suitable variations, in an inferno such as Hell — and this only becomes more true as such an inferno becomes larger and more influential.

Sometimes a process can be like unto a pitchfork in the buttocks. Some processes demand that demons go through several steps to achieve a result. Some can be cumbersome or time-consuming. Some do not deal with particular situations as rapidly as a demon might wish. Sometimes going through the process seems unlikely to give the result that a demon desires. In all these cases, there is a temptation, sometimes a strong temptation, to act unilaterally, to simply “fuck” the problem as one sees it. Often this is technically possible in Hell. Sometimes many demons will support it.

The problem with yielding to this temptation is that it affects the overall structure of the functionality of Hell. It throws sand in the gears of the inferno. When demons see others acting outside of process, they may be convinced that they ought to do the same; or they may be convinced that the dark whispering voices and views will get no respect or consideration. If all demons act outside of process, there is no process, no organization to our efforts. Then we do not have a functional collaborative inferno; we have some hippie bullshit. Which is no way to run an inferno.

The primary goal of Hell is the damnation of sinners, and any process is only a means to that end. Even the community of Hellions, important as it is to some, is only a means to that end.

Often following a process takes more time and effort in a particular case than acting unilaterally. Sometimes following a process will give a less distended sinner’s anus in a particular case. But frequently acting outside of process causes strong and widespread dissatisfaction, which consumes far more time and effort than any saved by avoiding the process in the first place.

Even in the more numerous cases where no great uproar results, actions outside of process still tend to damage the trust of individual imps and demons in the institution of Hell, and to damage the community. And the community is the essential tool in the damnation of the sinners. Without the community, there is no one to brutally sodomise them, and there is no way to organize the brutal sodomy. Without the community, there is no reason for anyone to undertake any of the many needed but unglamorous tasks on which the damnation of the sinners depends.

Process need not be inflexible — most Hell processes and policies can be changed if the community, or the relevant section of it, wants to change them. Many processes allow for exceptions or alternate routes in particular cases or circumstances; such exceptions can be added to processes that do not have them.

In a small group there is little need for structure or process. When five people work on a sinner, little structure and no formal process may be required. When five thousand work together on a substantial group of sinners, there must be some structure or the inferno will collapse. While Hell intentionally has relatively little structure, it must have some to continue in a productive way. Processes, formal and informal, are some of the key elements in that structure.

During the early days of Hell, few processes were needed to maintain its essential structure. Many — at first most — demons knew each other or rapidly came to know each other. Issues could be resolved by informal discussion or casual fights to the death with tooth and claw, with little need for any other process.

As Hell has grown, more process has developed. While many demons still know or know of each other, there are many overlapping sub-communities, and no one knows all or even most of the most accomplished torturers. Demons have strong and differing views about policy and damnation issues. Process, often formal process, is needed to allow issues to be resolved in ways that all can accept as reasonable, even when individuals strongly disagree with particular results. Unilateral action tends to subvert that acceptance, and lead to a “me-first” or a “my way or the highway” attitude to the inferno — even or especially when demons sincerely believe that they are acting for the enhancement of the inferno.

Action outside of process is particularly dangerous when it involves powers restricted to the Unholy Host, or knowledge available only to long-established demons. This tends to create at least the impression of a caste system. No one wants to be on the bottom of a caste system, and such perceptions reduce the motivation for demons to contribute.

For all these reasons, demons and particularly the Unholy Host ought to adhere to and use existing processes, and resist the temptation to act outside of process, other than in truly emergency situations. If a process is not good, think enough of fellow Hellions to engage the problem and propose a change to it; don’t just ignore the process.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Lecturer bans students from using “paper” and “pens.”

Sunday, January 13th, 2008
Paper is all very well for pictures of young women in a state of undress, but proper research mandates Wikipedia.
Paper is all very well for pictures of young women in a state of undress, but proper research mandates Wikipedia.

PECKHAM POLYTECHNIC, Saturday (UNN) — A lecturer has criticised students for relying on “books” and “journals” to do their thinking for them.

Tara Raboomtiyay, Professor of Reflexive Perspectives on Post-Modern Verbosity at the University of Bumsonseats, said too many young people around the world were taking the easy option when asked to do research and simply repeating the first things they found in library searches.

She has dubbed the phenomenon “The University of Dead Words On Paper.”

“The education world has pursued new technology with an almost evangelical zeal,” she said. “Too many students don’t use their own brains enough and just cite something they see in a ‘book’ or a ‘journal.’ We need to bring back the important values of critical reading and net forum discussion. Young people are finishing education with shallow ideas and need to learn interpretative skills before starting to use technology.

“Thousands of students across Britain are churning out banal and mediocre work by stringing together references to what ‘libraries’ provide them. I don’t think students come to university to learn how to use ‘books,’ they can all do that before they get here. It is an easy way out for tutors to let them work to their own devices using ‘literature searches,’ rather than active participatory discussion on phpBB. People have to pay to come to university now and what they are paying for is the knowledge, experience and guidance of forum moderators like myself.”

She will be giving a lecture on the issue, called Britannica Is White Bread For The Mind, at the Alan Dubious Lecture Theatre on Wednesday at 6.30pm.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

London Wikipedia meetup, Sat 12 Jan, Pembury Tavern, 6pm.

Friday, January 11th, 2008

The 7th London Wikipedia meetup has been announced for this Saturday, at the Pembury Tavern in Hackney. Just next to Hackney Downs train station, a short walk from Hackney Central train station. Real beer in many varieties cheap, does food. Holding a wikimeet at the Pembury makes sense in terms of how many Wikipedians I know who are regulars …

I won’t be there this time, as I have a prior booking (getting drunk with perverts. Other perverts). But it should be good!

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Web 2.1 pre-alpha.

Sunday, January 6th, 2008

Is anyone actually using the Semantic Web? Does anyone know of working sites that anyone not already in the cult cares about? Is this anything more than vapourware?

I know of no examples whatsoever that anyone beyond Semantic Web geeks themselves care about. Zero.

I am not asking for responses of “I’m doing research in this area, let me show you it” or “SemanticFooWiki will be the coolest thing ever, you heathen, as soon as we get the code written” — I’m asking for examples of sites presently existing, that people are interested by the semantic web functionality of, without having to know or care what “the Semantic Web” is. Anyone?

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Citizendium, the other free encyclopedia.

Sunday, December 23rd, 2007

I was wrong. Congratulations to the Citizendium Foundation on choosing a free content licence (CC by-sa 3.0 unported) for Citizendium!

Free content, like free software, is about freedom — the freedom for anyone to use, study and apply, change and redistribute the work, for any purpose. “Non-commercial” isn’t free enough to be called free. “No derivatives” isn’t free enough to be called free. As Brianna Laugher notes, “The right to fork that is created by free content licensing keeps the parent organisations honest.”

The big news here is that the choice of a free licence furthers the public expectation that educational content (Wikipedia, Citizendium, Encyclopedia of Earth, Open Site) will be under a proper free content license. Scholarpedia and about.com need not apply. Google needs to think carefully.

(I also get a thank you at the end of the Citizendium license essay. Any help I provided in making this choice happen, I’m extremely pleased to have provided.)

Citizendium and Wikipedia, or at least the more foolish members thereof, have their periodic pissy bitchfights. But we’re on the same side in deep and important ways.

(Is Citizendium good for anything? Well, their history of the BSD Daemon is the best article I’ve seen on the subject. There’s excellent stuff there worth linking people to.)

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

Rorschach Knols.

Monday, December 17th, 2007

If Google floated a trial balloon to see what ideas they could get everyone else to come up with for them, they’ve succeeded fabulously. It’s a Rorschach blot the tech press sphere has spent the weekend projecting all its hopes and fears onto. Like Citizendium was this time last year.

One thing about the mockup graphic: the Creative Commons CC-by 3.0 logo. Remember that the point of Wikipedia is not in fact to run a hideously popular and expensive website, but to create a body of freely-reusable educational content. IF, I say IF, Google require Knols to be under a proper free content licence, that’ll be a big win for everyone, same as Citizendium is basically on the same side as Wikipedia. Making free content normal and expected. And I think we will go so far as to lend our good name to publicly saying very nice things about this exciting new source of free content. IF they do this.

And if they don’t, they’ll just be another about.com or Yahoo Answers. Or Google Answers. Remember Google Answers? I bet Google does.

If they allow multiple competing articles on a given subject, I’m not so sure that’s a win for the reader. Fred Bauder’s Wikinfo also does this and has almost no traction. I consider the Neutral Point Of View policy our most important innovation, far more so than letting anyone edit the site. The view from 20,000 feet, even if it’s as worked out by editors at ground level. People don’t come to an encyclopedia for ten articles, they come for one that provides an overview of the ten. That’s what an encyclopedia is for: the ten-second or sixty-second or five-minute quick backgrounder.

Update: I am apparently the first person in the blagosphere with the initiative to find the Google Code page on Knol. Does anyone recognise this wikitext syntax? Update 2: Apparently it’s the syntax used by their own internal wiki engine. Update 3: They’ve locked it down. Cache here while it lasts.

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati
  • Slashdot
  • StumbleUpon
  • description
  • ThisNext
  • blogmarks
  • BlogMemes
  • Furl
  • YahooMyWeb
  • Facebook

WikiWednesday with Sue Gardner and Jimbo Wales.

Monday, December 3rd, 2007

Gordon Joly posted this to wikimediauk-l, and it just so happens that WMF executive director Sue Gardner is in town that day too. I tried to arrange a meet before I realised the other event was on, and two events is silly, so let’s just have one, eh?

Jimmy Wales has been invited to speak at this month’s event.
London wikiwed 5 December 2007

The event will be hosted by NYK Shipping (thanks to Alek Lotoczko) and SocialText will be footing the bill for - pizzas, beer and wine (thanks to Ross Mayfield and Ross Hargreaves). The address is NYK Line, 17th Floor, CityPoint, 1 Ropemaker Street, London EC2Y 9NY. Nearest Tube is Moorgate.

We aim for people to arrive around 18:15 for a 19:00 formal start.

Please make sure you book your name below so a name badge can be prepared, and if you have any difficulties on the night you can call David Terrar on 07715 159423.

And I am suggesting the same pub afterwards as last time we met at NYK Line: The Globe, 83 Moorgate, London, EC2M 6SA.

Don’t forget to call ahead!

These icons link to social bookmarking sites where readers can share and discover new web pages.
  • bodytext
  • del.icio.us
  • Reddit
  • Technorati