data

  1. You can now add automatically generated citations to millions of books on Wikipedia

    Wikipedia editors can now draw on WorldCat, the world’s largest database of books, to generate citations on Wikipedia thanks to a collaboration between OCLC (Online Computer Library Center) and the Wikimedia Foundation’s Wikipedia Library program.... Read more

  2. Importing JSON into Hadoop via Kafka

    Our three key players are Hadoop, the de facto distributed batch data processing platform; JSON, a ubiquitous data format; and Kafka, which is becoming the system of choice for transporting streams of data. However, much of the data that flows into Kafka is in JSON format, and there isn’t good community support around importing JSON data from Kafka into Hadoop. This article summarizes some comm... Read more

  3. The most popular browser

    Looking at the internet through the lens of Wikimedia.... Read more

  4. Introducing the unique devices dataset: a new way to estimate reach on Wikimedia projects

    With the unique devices dataset, we’ve been able to quantify the shift to mobile across all projects. In almost all Wikimedia projects, more than half of our unique devices are accessing content using the mobile sites.... Read more

  5. 15 years of Wikipedia in data visualization

    Leave it to designers to show us how dynamic, global, and human Wikipedia really is. Here, we look at fifteen of the best data visualizations making use of Wikipedia data.... Read more

  6. How much time do editors spend editing? and more survey results

    How much time do Wikipedians spend editing each week? What is their most important motivation for contributing? What are their views of the Wikimedia Foundation? Results from the 2012 editor survey, with a public dataset enabling everyone to drill deeper into such questions.... Read more

  7. Wikidata, coming soon to a menu near you

    Wikidata hosted a menu challenge for restaurants around the world to expand their menus with key information. Participants used Wikidata to provide labels, translations, images and more.... Read more

  8. Wikimedia Foundation releases third transparency report

    The Wikimedia Foundation has released its third transparency report covering five key data points: content alteration and takedown requests; copyright takedown requests; right to be forgotten; requests for user data; and voluntary disclosure... Read more

  9. Get the latest Wikipedia updates easily with IFTTT

    Wikipedia’s new Channel on IFTTT makes it easier than ever to share free knowledge.... Read more

  10. It’s time for some #tastydata

    You’re invited to join the Menu Challenge, starting May 8! We could use your help to add translations, photos and/or audio recordings to Wikidata — about ingredients and dishes to be served at the ‘Taste Stockholm’ food fair. This can show the endless possibilities of open data and crowdsourcing to thousands of visitors. Let’s get some #tastydata!... Read more