data

  1. Importing JSON into Hadoop via Kafka

    Our three key players are Hadoop, the defacto distributed batch data processing platform; JSON, a ubiquitous data format; and Kafka, which is becoming the system of choice for transporting streams of data. However, much of the data that flows into Kafka is in JSON format, and there isn’t good community support around importing JSON data from Kafka into Hadoop. This article summarizes some comm... Read more

  2. The most popular browser

    Looking at the internet through the lens of Wikimedia.... Read more

  3. Introducing the unique devices dataset: a new way to estimate reach on Wikimedia projects

    With the unique devices dataset, we’ve been able to quantify the shift to mobile across all projects. In almost all Wikimedia projects, more than half of our unique devices are accessing content using the mobile sites.... Read more

  4. 15 years of Wikipedia in data visualization

    Leave it to designers to show us how dynamic, global, and human Wikipedia really is. Here, we look at fifteen of the best data visualizations making use of Wikipedia data.... Read more

  5. How much time do editors spend editing? and more survey results

    How much time do Wikipedians spend editing each week? What is their most important motivation for contributing? What are their views of the Wikimedia Foundation? Results from the 2012 editor survey, with a public dataset enabling everyone to drill deeper into such questions.... Read more

  6. Wikidata, coming soon to a menu near you

    Wikidata hosted a menu challenge for restaurants, around the world, to expand their menus with key information. Participants used wikidata to provided wikidata labels, translations, images and more.... Read more

  7. Wikimedia Foundation releases third transparency report

    The Wikimedia Foundation has released its third transparency report covering five key data points: Content alteration and takedown requests; Copyright takedown requests; Right to be forgotten; Requests for user data; and voluntary disclosure... Read more

  8. Get the latest Wikipedia updates easily with IFTTT

    Wikipedia’s new Channel on IFTTT makes it easier than ever to share free knowledge.... Read more

  9. It’s time for some #tastydata

    You’re invited to join the Menu Challenge, starting May 8! We could use your help to add translations, photos and/or audio recordings to Wikidata — about ingredients and dishes to be served at the ‘Taste Stockholm’ food fair. This can show the endless possibilities of open data and crowdsourcing to thousands of visitors. Let’s get some #tastydata!... Read more

  10. Growing free knowledge through open data

    Open data can help us understand how people find and share knowledge online. The Wikimedia Foundation’s Research and Data Team has published 5 open data sets about Wikimedia projects. (…)... Read more