How can we make multimedia data easier to use on Wikimedia Commons, Wikipedia and sister sites?

Today, information about media files on Wikimedia sites is stored in unstructured formats that cause a range of issues: for example, file information is hard to search, some of it is only available in English, and it is difficult to edit or re-use files to comply with their license terms.

To address these issues, members of the Wikidata and Multimedia teams met with community volunteers for a week-long bootcamp in Berlin from October 5 to 10, 2014.


The Multimedia and Wikidata teams met with community volunteers in Berlin to discuss structured data on Commons.
(Photo: Structured Data Bootcamp Group Photo – Closeup by Christopher Schwarzkopf, under CC-by-sa 2.0)

The focus of this event was to investigate how to structure data on Wikimedia Commons, reusing the same technology as the one developed for Wikidata. Participants collaborated in small workgroups to explore a range of problems and solutions, in parallel sessions focused on community, design, engineering, licensing and product management challenges.

Each workgroup produced concrete examples of how these ideas could be implemented, including:

  • first data models for structuring file information, to make it machine-readable and license-compliant
  • first user interface designs for viewing and editing structured data seamlessly
  • a working prototype of a high-level API, for reading and updating metadata about media files
  • improvements to a prototype dashboard identifying files missing machine-readable metadata.

These preliminary ideas are now being documented on Commons so they can be discussed and improved with community members. For a project overview, check out this development page and these project slides.

The bootcamp was very productive, but many questions remain unanswered. Next steps include community discussions, design, prototyping, testing and a series of experiments — before starting actual development and data migration next year.

Everyone is invited to contribute to this important project. Your ideas and comments are much welcome, and developers would love your active participation to define and guide this project.

We look forward to working with our community to modernize our multimedia infrastructure and better support the needs of our users.

For the Structured Data project team at the Wikimedia Foundation:

Fabrice Florin – Product Manager, Multimedia (WMF)
Keegan Peterzell – Community Liaison (Product) (WMF)
Gilles Dubuc – Tech Lead, Multimedia (WMF)