Wikimedia blog

News from the Wikimedia Foundation and about the Wikimedia movement

Friends

Remembering Aaron Swartz (1986-2013)

Aaron Swartz at a Boston Wikipedia meetup in 2009

Aaron Swartz was found dead in his New York apartment Friday, an apparent suicide. Aaron was a prolific hacker and a free culture activist. He was also a Wikipedian. Today, the Internet community at large is reeling from Aaron’s early death, and Wikimedia is joining in remembering an extraordinary individual.

In 2000, as a 13-year-old, he was the youngest finalist in a teen website competition with his project “The Info Network”, an online encyclopedia inviting anyone to contribute their knowledge. Aaron would later recall that while he was not able to find enough contributors for his first web site, “luckily, several years later, my mother pointed me to this new site called ‘Wikipedia’ that was doing the same thing.”

At age 14, Aaron co-authored RSS 1.0, an important web standard. Later he founded Infogami, a startup which would merge with Reddit, which today is one of the most influential social news sites. He led the development of the Open Library, a project launched by the non-profit Internet Archive in 2007 with the aim of offering “one web page for every book”, integrating user contributions through a wiki interface.

In 2003 he started editing Wikipedia. His userpage lists more than 200 articles he started or contributed a large amount of content to. His most recent edit was on Thursday, January 10.

In 2006, he was a candidate for the Wikimedia Foundation Board of Trustees, which in part is elected by the Wikimedia community. It was during that time that he wrote a series of essays about Wikipedia, sharing his concerns, hopes and dreams for the project’s future.

This included “Who writes Wikipedia”, which proposed that the role of casual contributors to the encyclopedia is often severely underestimated, and that protecting the encyclopedia’s fundamentally open nature was critical to its future. “If Wikipedia continues down this path of focusing on the encyclopedia at the expense of the wiki, it might end up not being much of either,” Aaron wrote. His essay triggered a debate and research that continues to this day.

In recent years, Aaron’s focus was on online activism. He believed strongly that the freedoms that we take for granted online are constantly under threat and need to be defended. To this end, he co-founded Demand Progress, and was one of the leaders in the grass-roots campaign against legislation known as SOPA and PIPA, a campaign which Wikipedia participated in through the 2012 Wikipedia blackout. Aaron’s keynote at the Freedom to Connect conference in 2012 re-tells the important story of how SOPA and PIPA were ultimately defeated.

Aaron also strongly believed that the public should have free access to the laws that govern it, and to publicly funded scholarship and scientific research. In 2011, he was indicted for allegedly breaking into MIT’s network to download large amounts of scholarly materials.

Family, friends and those close to the case have raised questions about the fervor and zeal with which Aaron was pursued — Lawrence Lessig’s post “Prosecutor as bully” provides some important background, as does expert witness Alex Stamos’ summary.

Whatever caused Aaron to take his own life, it is a shocking and painful loss of an extraordinary individual who has touched so many through his ideas and actions. His friends and family have started an online memorial to share remembrance stories, and Wikipedians are also leaving comments on his talk page. We join them in remembering Aaron Swartz, a beautiful human being.

Further reading:

Wiki inventor supports Wikimedia Foundation with donation of award winnings

Ward Cunningham

“Come on in and I’ll trust you to contribute in good faith and to make your words a gift to this community.”

That’s the spirit of the original wiki, invented by programmer Ward Cunningham, that persists in Wikipedia today. It’s also one of the great quotes from an interview Ward did recently in honor of winning the Excellence in Programming Award from Dr. Dobb’s Journal.

This award has been given out intermittently since 1995, and exists to recognize outstanding contributions to software development. It also comes with a donation of $1,000 to a non-profit of the recipient’s choice. Ward has kindly chosen the Wikimedia Foundation, in order to support Wikipedia and all our projects.

The Wikimedia movement has long owed much of our success to the work set down in the original wiki and its community. From inventing the software we depend on, to serving on our Advisory Board, Ward’s contributions since 2001 have been essential for us to create thriving community-based projects.

While the award from Dr. Dobb’s is for Ward’s contributions to the craft of programming, we think that some of his most important contributions to the world are less technical. As he says about the original wiki: “everyday that I spent hours on that wiki, I would say to myself, ‘I’ve got to stop doing this. I’ve got to focus on my business. I have work to do.’ Thank goodness I didn’t bother to do that work and fiddled with wiki instead!”

That description of donating time to a cherished project is a sentiment mirrored today in the volunteer contributions of Wikimedians all around the world. Thank you to Ward for his continued generosity over the last 11 years.

Steven Walling
Community Organizer

Wikimedia supports American Censorship Day

Today (Wednesday, November 16, 2011) is an important day in Washington, DC.

This morning, hearings take place regarding the “Internet Blacklist Bill” – a bill that, if approved, would overturn laws relating to Digital Millennium Copyright Act (DMCA) safe harbor, and would allow any government or corporation to block a website, remove it from a search engine, and/or cut it off from payment processors or advertisers. In response to these hearings, organizations like Wikimedia, Creative Commons, the Electronic Frontier Foundation, Mozilla, and many more are joining together to declare American Censorship Day.

If approved, this bill would have disastrous effects for Wikipedia and its sister projects.

Why is this bill an issue for a project like Wikipedia?

In a nutshell, Wikipedia relies on Creative Commons licenses and a series of established, community-led open collaboration processes to ensure that its information and media are a part of free culture, and that copyrighted materials (which may inadvertently end up on Wikipedia or its sister projects) can be quickly and effectively removed so we remain in compliance with US copyright law.  Our global, volunteer community understands these laws well – maybe better than any other online community on the net – and they work hard to ensure that everything on Wikipedia and its sister sites complies with the law.

The Internet Blacklist Bill would change all of that.  The bill would allow corporations, organizations, or the government to order an internet service provider to block an entire website simply due to an allegation that the site posted infringing content.  In addition, sites like Wikipedia could be required to monitor for any “banned” links, resulting in delegated proactive censorship of the Web, not to mention significant additional costs to Wikipedia, a site of a non-profit charity.  Useful international sources of knowledge and information – which often serve as a basis for our articles and projects – could be blacklisted if rights owners simply felt that there was some infringing content. Individual contributors could face criminal liability for posting or sharing a copyright work for what we consider to be common fair-use situations.  The DMCA system, which allows Wikimedia and its volunteer community to quickly remove copyright-violating material at the request of the copyright owner, would be overturned.  In short, our users and all of our projects, would be forced to operate in an untenable legislative environment, putting Wikipedia at the beck and call of the rights owners as opposed to the distribution of free knowledge. Simply put, this bill is a reckless and burdensome model in Internet censorship.

The future of Wikipedia, the free knowledge movement, and tens of thousands of open and free projects is at stake, and we must stand up to oppose this bill.  Join us in these efforts by spreading the word.  If you are in the United States, contact your local government representative, and take a stand on American Censorship Day.

Jay Walsh, Communications

 

Open source hackfest benefits WMF, community

On May 24th and 25th, the Wikimedia Foundation hosted a CiviCRM coding sprint in our San Francisco office. CiviCRM is the premier open source constituent relationship manager; WMF uses it to store donor and contribution information. Our CiviCRM database contains more than a million contact records and a million contribution records.

CiviCRM, The Free and Open Source Solution for the Civic Sector

The sprint was a terrific success. The eight participants squashed many CiviCRM bugs — and the Foundation directly benefited, as they improved CiviCRM contact/contribution search performance by 15 to 25 times! Formerly, it could take more than two minutes for someone to search among the contribution records. The developers’ tweaks, hacks and patches whittled that down to about 4-6 seconds per search. This will save innumerable hours for WMF administrators and fundraisers.

The Foundation’s Arthur Richards, a fundraising engineer, enthused: “Any software tool, open source or not, comes with headaches; the beauty of tools like CiviCRM is that we can solve our own problems. Thanks to having some great hackers in one place, we managed to mitigate one of our biggest CiviCRM pain points in a matter of hours.”

You can read more details about the sprint on Donald Lobo’s CiviCRM blog.

Richards was especially excited to “highlight how awesome it is working with other open source projects and using other open source tools. We get to scratch each other’s backs, which helps support a sustainable, healthy ecosystem of software/communities. Also, using open source tools like CiviCRM – while not without their (often big) pain points – is great because we can fix the software ourselves. While the tools are free to use, with a little bit of elbow grease and some resources, they can be molded and fixed to meet our needs much easier (and likely much cheaper) than relying on proprietary tools. Plus, the CiviCRM community has been instrumental in helping us troubleshoot, solve problems and add new features to meet our usage requirements.”

The CiviCRM community is planning to run another code sprint in the fall in Northern California; please contact them if you’d like to participate or even host it. In the meantime, Wikimedia and thousands of other nonprofits will enjoy the CiviCRM improvements developed in May.

-Sumana Harihareswara
Volunteer Development Coordinator, Wikimedia Foundation

“How Wikipedia Works” hits our shelves

Thanks to our friends at No Starch Press, San Francisco-based publishers of one of the largest and most detailed guides to Wikipedia ever printed (and available under the GFDL!) the bookshelves at Wikimedia Foundation’s offices in San Francisco are over-flowing with how-to knowledge.

In January the publishers offered to provide us with a few hundred copies of “How Wikipedia Works” to give to Foundation visitors, particularly editors and contributors.  Authored by Wikimedia Foundation board member Phoebe Ayers, and long-time Wikipedians Charles Matthews and Ben Yates, the 500+ page book discusses the culture, history, technology, and impact of Wikipedia, while also providing a detailed primer for getting involved and participating among the community of editors.

Our thanks to No Starch Press for  their generous donation and their continuing support of the Wikimedia mission.

Jay Walsh, Communications

Sue Gardner joins Ada Initiative advisory board

Today the Ada Initiative announced the appointment of Sue Gardner, ED of the Wikimedia Foundation, to its first advisory board. The Ada Initiative launched just a few weeks ago, and has the aim of promoting the visibility and participation of women in open-source culture. The group, founded by Valerie Aurora and Mary Gardiner, will undertake unique research in the field of women in open-source culture, provide consultative services to organizations and businesses, and develop training and education services.

The Initiative‘s namesake, Countess Ada Lovelace (10 December 1815 – 27 November 1852), was considered one of the world’s first computer programmers, and was almost certainly the first woman in computer programming. She collaborated with Charles Babbage, the creator of one of the first mechanical computers, the analytical engine, writing what is generally considered the first code instructions for a computer.

From Wikipedia,

She was the only legitimate child of the poet Lord Byron (with Anne Isabella Milbanke), but had no relationship with her father, who died when she was nine. As a young adult she took an interest in mathematics, and in particular Babbage’s work on the analytical engine. Between 1842 and 1843 she translated an article by Italian mathematician Luigi Menabrea on the engine, which she supplemented with a set of notes of her own. These notes contain what is considered the first computer program—that is, an algorithm encoded for processing by a machine. Though Babbage’s engine was never built, Lovelace’s notes are important in the early history of computers. She also foresaw the capability of computers to go beyond mere calculating or number-crunching while others, including Babbage himself, focused only on these capabilities. [1]

Wikipedia has been in the news recently following a New York Times story highlighting the lack of women participating in the project, based on researched gathered by the United Nations University Study.  Interest in the topic has brought new thinkers to the Wikimedia community, which also recently resulted in the creation of a Wikimedia gender gap mailing list, which is open to the public.

Congratulations, Sue, and good luck to everyone involved in the Ada Initiative!

Jay Walsh, Communications

[1] Ada Lovelace. (2011, February 24). In Wikipedia, The Free Encyclopedia. Retrieved 18:03, February 24, 2011, from http://en.wikipedia.org/w/index.php?title=Ada_Lovelace&oldid=415671634

Wikimedia selects Watchmouse for global monitoring services

Earlier today we announced our selection of Watchmouse website monitoring to assist both the Foundation and anyone around the world in keeping an eye on our server uptime and status.  With Watchmouse’s help, the Foundation now has a public status page, which is maintained offsite on servers independent from Wikimedia, that reports our uptime and accessibility levels from over 50 locations around the world. The service breaks out each of the primary server systems of the Foundation, because it definitely takes more than one computer to keep us up and running.

This is the first time Wikimedia has offered a publicly visible, externally hosted website monitoring service. Uptime is of course critical for reaching all of Wikimedia’s users, but also for ensuring that our wikis are open and editable to everyone, all the time.

With a rapidly growing, and global, audience of hundreds of millions of readers and contributors, Wikimedia’s properties have become an integral part of how the world accesses and shares knowledge.  This new service is particularly important as the Foundation establishes its permanent data center infrastructure, and looks beyond the US and Europe to establish more data centers (more regular updates from our engineering team can be found on the Wikimedia tech blog). Publicly sharing where downtime (and uptime, of course) is being experienced also helps us maintain our mission focus on transparency and accessibility.

Thanks for joining us as mission supporters, Watchmouse!

Jay Walsh, Communications

A gift of visualization on Wikipedia’s birthday

Earlier today the Washington D.C. based creative agency JESS3 posted the video above, and an informative web case study, www.thestateofwikipedia.com – a follow-up to another recent case study they did on another big idea, the Internet. JESS3 (also donors to the Wikimedia Foundation) folks Leslie Bradshaw and Becca Colbaugh on the inspiration for the work:

In a collaborative effort to capture a historic moment in time for Wikipedia, we announced this morning “The State of Wikipedia,” a digital short aimed at teaching the layperson Wikipedia’s initial concept and consequent evolution into becoming one of the most visited web sites across the globe.

We look forward to see what the next 10 years hold for Wikipedia and how it will continue to help add contours, diversity and permanency to information the world over.

They were supported by long-time Wikipedian William Beutler, and the voice you might recognize is none other than the founder of Wikipedia, Jimmy Wales. The video is CC-BY-SA (it can be downloaded from Vimeo – Commons link as soon as we have it), which means anyone around the world can use, re-use, and share this great work that tells the story of our project and our movement.

A big thanks to JESS3 for taking the considerable time to put this story together. We think it will make a big difference in helping people talk about our big projects and the complex world of the Wikimedia movement. A great Wikipedia 10 birthday gift!

Jay Walsh, Communications

Encyclopedia of Life curates Wikipedia’s species articles

There are more than 1.9 million animals, plants, and other forms of life on Earth. In May 2007, some of the world’s leading scientists announced the development of the Encyclopedia of Life (EOL) to document them all. Inspired by biologist E. O. Wilson’s TED Wish and supported by more than $25 million in funding, the project aggregates and makes accessible information about species ranging from 19th century journals to modern online databases.

See the page about Solanum lycopersicum, the garden tomato, as an example. Much of the information comes from Solanaceae Source, a specialized source of  names lists, species descriptions, specimen collections and publication lists for the genus Solanum. The Biodiversity Heritage Library provides historical public domain texts about the species from various published journals. Many other specialized and general resources contribute to the overall species page.

A Wikipedia article included in an Encyclopedia of Life species page. The yellow background indicates that no curator has reviewed the content yet. Click the image to enlarge.

You’ll also find a “Wikipedia” entry in the table of contents. It reveals a copy of the Wikipedia article about tomatoes. As of this writing, the article text has a yellow background.

This means that an Encyclopedia of Life curator has not yet reviewed the content for inclusion in EOL. An EOL species page can have one or more curators who select and validate information added to EOL pages. Wikipedia articles, where they exist, are included by default.

Once the article has been validated by a curator, the yellow background is removed. The information for curators and curation standards pages on EOL give some additional background on the curation process, which applies to all content objects in EOL. Specific guidelines have been written for curation of content from Wikipedia and Wikimedia Commons. We’re particularly pleased that EOL encourages its curators to improve Wikipedia directly if errors or omissions are found.

So far, more than 200 Wikipedia articles have been reviewed through this process. Reviewers classify the information as follows:

  • ‘trusted’ – reviewed by curator and not deemed to contain substantially incorrect information
  • ‘untrusted’ – reviewed by curator and deemed to include incorrect or unverifiable information
  • ‘inappropriate’ – reviewed by curator and deemed to not be eligible for inclusion in EOL for other reasons (e.g. too short to add value)

EOL makes the entirety of all review information (who reviewed what when, with what outcome) available through an Atom feed. This means that Wikipedians, and others, can use this information easily in the development of new applications.

The book creator tool makes it possible to order a printed and bound book from any Wikipedia article selection. A custom cover can be chosen. Nautilus photograph by Lee Berger, Creative Commons Attribution/Share-Alike License. (Click to enlarge.)

A proof-of-concept for expert reviews

Magnus Manske is a biochemist and programmer at the Sanger Institute in the United Kingdom. He is also a long-time Wikimedia volunteer, and wrote the first version of the PHP software used by Wikipedia, which later became MediaWiki. As a scientist, Magnus has advocated for the scientific community to use and improve Wikipedia, most recently as co-author of the paper Ten Simple Rules for Editing Wikipedia.

I informed Magnus about the new EOL review information, and suggested that we might want to explore using this information to generated printed books or PDF collections of reviewed articles. The software for exporting Wikipedia articles into books already exists, so it was just a matter of putting two and two together.

So, Magnus used the available data feed to create an automated tool that creates a list of all EOL-reviewed article versions in a form that can be used by Wikipedia’s book tool.

This makes it possible to download a PDF file or order a printed book that only contains EOL-reviewed versions of Wikipedia species articles.

To try it out, visit the page for Magnus’ example book. Click “Download PDF” to generate the (very large) PDF file that contains all the species articles, or “order printed book” to preview or order a printed book from PediaPress (which, as of this month, also offers books in color and hardcover format). If you want to remix or play with the book further, you can click “Open book creator”.

We’re very pleased with this first proof-of-concept, and are grateful to the Encyclopedia of Life team for engaging its community in the curation of Wikipedia articles. Both parties benefit: The Encyclopedia of Life enriches its species pages using the often well-developed Wikipedia content. Wikipedia benefits because EOL’s trusted reviewers add their stamp of approval to Wikipedia articles, which helps Wikipedia readers and editors alike. Where EOL reviewers do not approve, they are encouraged to edit the Wikipedia article.
I asked Bob Corrigan, EOL Product Manager and Acting Deputy Director, to give his take on this project. He writes: “This is definitely a win-win partnership. EOL is focused on providing very deep, structured access to trusted biodiversity information from our network of content partners and curators, and vetted Wikipedia articles can be a terrific gateway to this information. We see a closer relationship with Wikimedia as an important way to expand access to global knowledge about life on Earth.”

Hardcover book made from curated Wikipedia articles. Photo credit: Guillaume Paumier; Nautilus photograph by Lee Berger. Creative Commons Attribution/Share-Alike License 3.0

Example page from the book. Photo credit: Guillaume Paumier; Nautilus photograph by Lee Berger. Creative Commons Attribution/Share-Alike License 3.0

A replicable model

Magnus’ implementation was already created with an eye to future extensibility. If you’re inclined to take a closer technical look, check out Magnus’ “Sifter-Books” script which generates the book data, and can potentially support multiple partner institutions/organizations providing article reviews. As of the time of this writing, Magnus has already added two additional groups who review Wikipedia articles, Rfam and Pfam, databases of RNA and protein families.

Moreover, Magnus has written a small proof-of–concept script which makes the existence of reviews visible on Wikipedia itself. You need to create a user account on the English Wikipedia and follow the installation instructions to use the script. Once installed, a “Reviews” tab will indicate available article reviews.

We look forward to exploring similar partnerships with subject-matter experts in institutions (like universities and libraries), scientific associations, and specialized knowledge communities. If you’re interested in this model, drop me a note (erik at wikimedia dot org).

Erik Moeller
Deputy Director, Wikimedia Foundation
Representative of Wikimedia in the Encyclopedia of Life Institutional Council

Wikipedia hard-cover editions now available

This week our friends over at Pediapress announced that custom-printable books containing Wikipedia articles are now also being offered in attractive hard cover, bound editions – and in color. Previously customers could order softcover editions of books containing a customizable list of Wikipedia articles in any configuration. The new hardcover editions even contain a silk bookmark and stitched bindings.

The Pediapress MediaWiki extension on Wikipedia allows users to collect any number of articles or categories into a single PDF file or OpenOffice text file, which can then be downloaded for off-line viewing or local printing, or through Pediapress’ on-demand printing technologies the document can be turned into a bound book and shipped right to you. To start creating a book, look for the Create a book link under Print/Export on the lefthand Wikipedia menu. Some incredibly unique and inspired Wikipedia books have been created since Pediapress kicked off.

Now is your chance to get your very favorite lists of Wikipedia articles bound in a bookshelf-friendly format. Offline versions of Wikipedia are an important part of the Wikimedia Foundation’s mission to spread free knowledge to everyone on the planet, so we’re happy to see the options and quality of this format expand.

Jay Walsh, Head of Communications