Technology

  1. Importing JSON into Hadoop via Kafka

    Photo by Eric Kilby, CC BY-SA 2.0.

    Our three key players are Hadoop, the defacto distributed batch data processing platform; JSON, a ubiquitous data format; and Kafka, which is becoming the system of choice for transporting streams of data. However, much of the data that flows into Kafka is in JSON format, and there isn’t good community support around importing JSON data from Kafka into Hadoop. This article summarizes some comm... Read more

  2. Wikimedia Foundation receives $3 million grant from Alfred P. Sloan Foundation to make freely licensed images accessible and reusable across the web

    Photo by Ajepbah, CC BY-SA 3.0 DE.

    The funding will support Wikimedia community efforts to enable structured data on Wikimedia Commons, the free media repository with more than 35 million media files.... Read more

  3. Wikipedia Zero joins Mossab Banat on his trip to freely share human knowledge

    Photo by Joseph Zakarian/Wikimedia Foundation, CC BY-SA 3.0.

    Banat has made over a hundred thousand edits and helped expand the Arabic Wikipedia while using zero-rate mobile data.... Read more

  4. Introducing Montage, the web platform used to help judge the world’s largest photo competition

    Image courtesy of the Montage team.

    Imagine, for a moment, what it takes to organize a competition with a quarter of a million contestants participating in over 40 different national campaigns around the world. How do you build tools for this unconventional process while still keeping it flexible and open for organizers to decide how to run their competitions? This year, the international team of Wiki Loves Monuments (WLM) organizer... Read more

  5. Wikipedia will talk to you: Wikispeech

    Photo by Manfred Werner/Tsui, CC BY-SA 3.0.

    The development of Wikispeech, a new MediaWiki extension, has started, and by 2017 Wikipedia should be able to talk to you. The project will bring together existing language resources and develop a functioning text-to-speech solution for Swedish, English and Arabic.... Read more

  6. Brand-new template support for the content translation tool

    Photo by Dương Trần Quốc, public domain/CC0.

    Content Translation is getting a major new feature: completely re-written support for templates, making it easier to translate templates across languages.... Read more

  7. How we partnered with volunteers to clean up copy-paste plagiarism on Wikipedia

    Photo by Arturo de Frias Marques, CC BY-SA 4.0.

    Each year, the Wikimedia Foundation’s Community Tech team opens a community wishlist survey to determine what features and tools are needed most on Wikimedia sites. Last year’s survey brought a new plagiarism detection bot, among other things. What will this year’s bring? Cast your votes now.... Read more

  8. Wikimedia Foundation welcomes Victoria Coleman as Chief Technology Officer

    Photo by Myleen Hollero/Wikimedia Foundation, CC BY-SA 4.0.

    Victoria brings more than 20 years of experience leading development and strategy for complex technologies at scale. ... Read more

  9. New dataset shows fifteen years of Wikipedia’s quality trends

    Drawing by Edward Dana and James Dana via the British Library, public domain/CC0.

    Looking to study how Wikipedia articles have improved over time? We’ve generated a dataset that tracks the quality of articles at monthly intervals over the entire 15-year history of Wikipedia across multiple languages—that’s 670 million assessments!... Read more

  10. Supporting the future of Wikidata

    Photo by Open Data Institute Knowledge for Everyone, CC BY-SA 2.0.

    The Wikimedia Foundation will now directly fund basic expenses for Wikidata software development.... Read more