Wikimedia blog

News from the Wikimedia Foundation and about the Wikimedia movement

Posts Tagged ‘wikiviz11’

“A Thousand Fibers Connect Us” – WikiViz 2011 winner visualizes Wikipedia’s global reach

WikiViz 2011: Screenshot of the winning entry

In July, the International Symposium on Wikis and Open Collaboration (WikiSym) and the Wikimedia Foundation launched WikiViz 2011, a data challenge calling for submissions to visualize the impact of Wikipedia beyond the scope of its own community, using open data. At this week’s annual WikiSym conference in Mountain View, California, the author of the winning entry, Jen Lowe (Datatelling.com), presented her work, titled “A Thousand Fibers Connect Us – Wikipedia’s Global Reach”. Drawing on open data published by the Wikimedia Foundation and the World Bank, she designed an interactive visualization that allows users to explore the readership of different Wikipedia language versions by country, and to compare countries with high or low levels of internet access. The following is an excerpt from Jen’s talk at the WikiViz awarding ceremony.

Visualizing Emptiness: Reflections on a Preoccupation with Missing Values

The first question to be answered for any visualization is always: what data to use? I spent a lot of time looking for outward-facing data about Wikipedia. When I finally found data about Wikipedia traffic by country, I knew I had the connections I needed between the world and the world of Wikipedia.

I cleaned data with R and visualized it with Processing, both open source tools. The top represents countries, colored by region and more broadly by global north (blue) and south (red). The bottom represents languages. Connections represent over 100,000 page requests in the year from April 2010 to March 2011. It’s interactive, countries and regions can be highlighted, and sorted by population, pageviews, pageviews per person, and internet access. All data is transparently available on rollover.

Jen Lowe presenting her visualization at WikiSym

Missing Values

I think that visualization is amazing for its ability to force us to see what’s missing; to see the missing values in a collection of data. Anyone who has experience with data analysis, especially with analyzing other people’s data, knows the feeling of being totally preoccupied with missing values: how are they represented in the dataset? How should we deal with them – bootstrap to fill them in, or throw out the associated data completely? I find that visualization trains my mind to notice what’s missing.

Missed Connections in the Global South

When I sort by region, I can force you to see the emptiness, the missed connections in the global south. The more I do visualization work, the more I notice who’s missing, not just globally, but personally.

Conclusions

There are people in the empty spaces of the visualization who want to be Wikipedia editors, who want to contribute, but don’t know it exists, or don’t see a way in. Openness is easy – you can just attach a license and say something is open. Accessibility is hard – it requires someone to take responsibility, to commit sustained effort. So – the goal I propose is: we meet back in 10 years and see the circle FILLED. No more missing values, no more missed connections, no more empty spaces. With the quantity of Wikipedia data being collected, we will be able to see, rather than speculate on, exactly how a diversity of voices has changed patterns of edits, the content, and the connections of Wikipedia. We will all have a Wikipedia for everyone, that reflects the collaborative contributions of everyone.

Quotes from the jury

Erik Zachte, data analyst for WMF, says:

I find this visualization extremely elegant, even mesmerizing. It is a joy to play with the different options, and to watch how the screen responds. Part of its appeal is its complexity: It resonates with how many people see Wikipedia – colossal and manifold, it is not so easy to grasp its inner workings. Coupled with the orderly presentation, this complexity invites the user to dive in, and perhaps be the first to find some new treasure, some hidden pattern

Moritz Stefaner, information visualizer, commented:

The visualization is very rich in data and navigation modes. I much applaud the audacity to include this much data, navigation modes, and detail information, this has certainly been a great effort. The amount and density of the data is staggering.

“A Thousand Fibers Connect Us” is released under a Creative Commons BY-SA license and the underlying code will be published under an open license shortly.

Tilman Bayer, WMF Movement Communications

Dario Taraborelli, WMF Senior Research Analyst and WikiViz co-chair.

WikiViz 2011: Visualizing the impact of Wikipedia

To celebrate the 10-year anniversary of Wikipedia, and its impressive growth in content, quality, diversity, and readership, the International Symposium on Wikis and Open Collaboration (WikiSym) and the Wikimedia Foundation (WMF) are jointly launching WikiViz 2011 – a call for data/information visualization experts, computational journalists, data artists and data scientists to create the most insightful visualization of Wikipedia’s impact.

WikiViz 2011 is about visualizing the impact of Wikipedia using open data. We want to see the most effective, compelling and creative data-driven visualizations of how Wikipedia impacted the world with its content, culture and open collaboration model. Potential topics include: the imprint of Wikipedia on knowledge sharing and access to information; its impact on literacy and education, journalism and research; on the functioning of scientific and cultural organizations and businesses, as well as the daily life of individuals around the world. In addition, we want to see visualizations of areas of knowledge, geographical regions, organizations and people Wikipedia has not been able to reach or has impacted less than one would have expected. In summary, the main goal of this competition is to improve our understanding of how Wikipedia is affecting the world beyond the scope of its own community.

Awards

The WikiViz 2011 Awarding Ceremony will take place on October 4, at WikiSym 2011 main venue, Microsoft Research Silicon Valley campus (Mountain View, California). The ceremony will open with keynote speaker Jeff Heer (Stanford University), on the impact of emerging visualization techniques to understand open collaboration today.

Three finalist teams (1 winner, 2 runners-up) will be invited to present their work at WikiSym 2011, in Mountain View (California). Travel expenses and registration fees will be covered for one delegate per finalist team. The submissions from these three teams will be showcased at the WikiSym 2011 exhibit, presented during the WikiViz awards ceremony and featured by our Knowledge and Media Partners (Unidad Editorial, Periscopic, Information Aesthetics, Visualizing.org and Flowing Data).

Furthermore, Spanish media group Unidad Editorial will run a voting process in September, among the visitors of El Mundo.es, (the largest digital newspaper in Spanish by readership worldwide), to select the “Public’s choice” visualization among the top 10 submissions received. The winner will be featured in the digital edition of El Mundo.

Jury

The finalists will be selected by a jury composed of world-class experts in data visualization and social computing:

How to participate

Please, refer to the WikiViz call for participation to learn more details about terms and conditions to participate, submission instructions, selection rules and evaluation criteria. Only entries based on open data and licensed under a Wikimedia Commons-compatible open license will be considered.

Important dates

  • June 29, 2011: Challenge call for submissions.
  • August 28, 2011: Submission deadline (extended).
  • September 12, 2011: Winner and finalist submissions announced.
  • October 4, 2011: WikiViz awards session, WikiSym 2011 (Mountain View, CA).

Contact

For any questions, comments or interest in supporting or collaborating with this challenge, please contact the co-organizers at: wikiviz2011@easychair.org

You can also follow us on Twitter: @WikiViz (tag your tweets with #wikiviz11).

More

WikiViz 2011 is the second of two data challenges the Wikimedia Foundation is organizing this summer. If you are interesting in building predictive models of Wikipedia editor activity, check out the Wikipedia participation challenge

Organizers

WikiSym Wikimedia Foundation

Media Sponsors

El Mundo.es

Knowledge Partners

infosthetics FlowingData.com
visualizing.org Periscopic