<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Wikimedia blog &#187; Wikimedia Research Newsletter</title>
	<atom:link href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.wikimedia.org</link>
	<description>News from inside the Wikimedia Foundation.org</description>
	<lastBuildDate>Wed, 22 Feb 2012 18:58:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3</generator>
		<item>
		<title>Wikimedia Research Newsletter, January 2012</title>
		<link>http://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/</link>
		<comments>http://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 17:08:43 +0000</pubDate>
		<dc:creator>Tilman</dc:creator>
				<category><![CDATA[Highlights]]></category>
		<category><![CDATA[Wikimedia Research Newsletter]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://blog.wikimedia.org/?p=9473</guid>
		<description><![CDATA[Vol: 2 • Issue: 1 • January 2012 [archives] Language analyses examine power structure and political slant; Wikipedia compared to commercial databases With contributions by: Tbayer and Piotrus Contents 1 Admins influence the language of non-admins 2 Can Wikipedia replace commercial biography databases? 3 Students predict connections between Wikipedians 4 Language analysis finds Wikipedia&#8217;s political [...]]]></description>
			<content:encoded><![CDATA[<div style="float: left; width: 99%; border: 1px solid #CCC; margin-bottom: .3em;">
<div style="padding: .6em 2em 0 2em; border-bottom: 1px solid #CCC;">
<p><a title="Research:Newsletter" href="https://meta.wikimedia.org/wiki/Research:Newsletter"><img src="http://upload.wikimedia.org/wikipedia/commons/7/7d/WRN_header.png" alt="WRN header.png" width="554" height="86" /></a></p>
</div>
<p style="clear: left; background-color: #eee; margin: 0; padding: .3em; font-size: 175%; font-weight: normal; font-family: Georgia, Times, serif; color: #666;">Vol: 2 • Issue: 1 • January 2012 <span style="font-size: 75%; float: right; margin-right: 1em; text-transform: uppercase;"><a title="Research:Newsletter/Archives" href="https://meta.wikimedia.org/wiki/Research:Newsletter/Archives">[<!--  -->archives]</a> <a title="Syndicate the Wikimedia Research Newsletter feed" href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" alt="Syndicate the Wikimedia Research Newsletter feed" width="16" height="16" /></a></span></p>
</div>
<p style="clear: left; margin-top: .3em; padding: .3em; line-height: 130%; font-size: 150%; font-weight: normal; font-style: italic; font-family: Georgia, Times, serif; color: #666;">Language analyses examine power structure and political slant; Wikipedia compared to commercial databases</p>
<p><strong>With contributions by:</strong> <a title="w:User:Tbayer (WMF)" href="http://en.wikipedia.org/wiki/User:Tbayer_(WMF)">Tbayer</a> and <a title="w:User:Piotrus" href="http://en.wikipedia.org/wiki/User:Piotrus">Piotrus</a></p>
<table id="toc" class="toc">
<tbody>
<tr>
<td>
<div id="toctitle">
<h2>Contents</h2>
</div>
<ul>
<li><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#Admins_influence_the_language_of_non-admins"><span class="tocnumber">1</span> <span class="toctext">Admins influence the language of non-admins</span></a></li>
<li class="toclevel-1 tocsection-2"><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#Can_Wikipedia_replace_commercial_biography_databases.3F"><span class="tocnumber">2</span> <span class="toctext">Can Wikipedia replace commercial biography databases?</span></a></li>
<li class="toclevel-1 tocsection-3"><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#Students_predict_connections_between_Wikipedians"><span class="tocnumber">3</span> <span class="toctext">Students predict connections between Wikipedians</span></a></li>
<li class="toclevel-1 tocsection-4"><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#Language_analysis_finds_Wikipedia.27s_political_bias_moving_from_left_to_right"><span class="tocnumber">4</span> <span class="toctext">Language analysis finds Wikipedia&#8217;s political bias moving from left to right</span></a></li>
<li class="toclevel-1 tocsection-5"><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#Briefly"><span class="tocnumber">5</span> <span class="toctext">Briefly</span></a></li>
<li class="toclevel-1 tocsection-6"><a href="https://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/#References"><span class="tocnumber">6</span> <span class="toctext">References</span></a></li>
</ul>
</td>
</tr>
</tbody>
</table>
<h3 id="Admins_influence_the_language_of_non-admins">Admins influence the language of non-admins</h3>
<p>An <a title="w:Arxiv" href="http://en.wikipedia.org/wiki/Arxiv">Arxiv</a> preprint titled &#8220;Echoes of power: Language effects and power differences in social interaction&#8221;<sup id="cite_ref-0" class="reference"><a href="#cite_note-0">[1]</a></sup> looks at the language used by Wikipedia editors. The authors look at how conversational language can be used to understand power relationships. The research analyzes how much one adapts their language to the language of others involved in a discussion (the process of <a title="w:language coordination" href="http://en.wikipedia.org/wiki/language_coordination">language coordination</a>). The findings indicate that the more such adoption occurs, the more deferential one is. The authors find that editors on Wikipedia tend to coordinate (language-wise) more with the administrators than with non-administrators. Furthermore, the study suggests that one&#8217;s ability to coordinate language has an impact on one&#8217;s chances to become an administrator: the admin-candidates who do more language coordination have a higher chance of becoming an administrator than those who don&#8217;t change their language. Once a person is elected an administrator, they tend to coordinate less.</p>
<p>A blog post on the website of <em><a title="w:Technology Review" href="http://en.wikipedia.org/wiki/Technology_Review">Technology Review</a></em> summarized the results using the headline &#8220;<a href="http://www.technologyreview.com/blog/arxiv/27437/">Algorithm Measures Human Pecking Order</a>&#8221; and highlighted the fact that one of the authors is <a title="w:Jon Kleinberg" href="http://en.wikipedia.org/wiki/Jon_Kleinberg">Jon Kleinberg</a>, known as inventor of the <a title="w:HITS algorithm" href="http://en.wikipedia.org/wiki/HITS_algorithm">HITS algorithm</a> (also known as &#8220;hubs and authorities&#8221;).</p>
<h3 id="Can_Wikipedia_replace_commercial_biography_databases.3F">Can Wikipedia replace commercial biography databases?</h3>
<div style="text-align: center; border: 1px solid #ccc; margin: 0.5em 0 0.8em 1.4em; float: right; clear: right;">
<div style="padding: 3px !important; border: 1px solid #cccccc; text-align: center; overflow: hidden; font-size: 94%; background-color: white; width: 222px;"><a class="image" href="https://meta.wikimedia.org/wiki/File:Csueb_view.jpg"><img style="border: 1px solid #ccc;" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/41/Csueb_view.jpg/220px-Csueb_view.jpg" alt="" width="220" height="144" /></a></p>
<div style="border: none; text-align: left; line-height: 1.4em; padding: 3px !important; font-size: 94%;">
<div style="float: right; border: none !important; background: none !important;"><a class="internal" title="Enlarge" href="https://meta.wikimedia.org/wiki/File:Csueb_view.jpg"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" alt="" width="15" height="11" /></a></div>
<p>California State University, East Bay: Could it rely on biographical information from Wikipedia and the web alone?</p>
</div>
</div>
</div>
<p>An article<sup id="cite_ref-1" class="reference"><a href="#cite_note-1">[2]</a></sup> by a librarian and professor at <a title="w:California State University, East Bay" href="http://en.wikipedia.org/wiki/California_State_University,_East_Bay">California State University</a> offers a comparison of &#8220;biographical content for literary authors writing in English&#8221; between Wikipedia, &#8220;the web&#8221; (i.e. top Google search results) and two commercial databases: the Biography Reference Bank (BRB, now part of <a title="w:EBSCO Industries" href="http://en.wikipedia.org/wiki/EBSCO_Industries">EBSCO Industries</a>) and <a title="w:Contemporary Authors Online" href="http://en.wikipedia.org/wiki/Contemporary_Authors_Online">Contemporary Authors Online</a>, motivated by the decision of the author&#8217;s institution to cancel its subscription to the latter database (CAO) during a budget crisis in 2008-2009, which among other reasons had been accompanied by &#8220;a comment that this information is &#8216;on the web&#8217;&#8221;.</p>
<p>The paper starts out with a literature review on the reliability of Wikipedia and then describes how the author compiled a <a href="https://sites.google.com/site/biographyanalysis/home">list of 500 authors</a> (mostly from the US and UK) by &#8220;examining curricula and textbooks from English literature courses across the USA&#8221; and <a href="http://lists.ala.org/wws/arc/les-l/2010-09/msg00002.html">soliciting</a> additional suggestions from peers. These names were then searched on BRB, CAO (as part of the <a title="w:Literature Resource Center" href="http://en.wikipedia.org/wiki/Literature_Resource_Center">Literature Resource Center</a>), Wikipedia and Google.</p>
<p><span id="more-9473"></span></p>
<p>Regarding breadth of coverage, only six of the 500 names were &#8220;absent&#8221; on Wikipedia (meaning that they had &#8220;no entry of their own or reference in any other entry&#8221;), compared to 14 for LRC, and 50 for the Biography Reference Bank.</p>
<p>While the study does not seem to have attempted a systematic comparison of factual accuracy, it observes that Wikipedia &#8220;entries are less uniform than those in commercial databases. The biographical information ranges from extensive to perfunctory&#8221;.</p>
<p>The author remarks favorably on Wikipedia&#8217;s searchability:</p>
<dl>
<dd><em>&#8220;The databases and Wikipedia deal better than the Web with variant names, pseudonyms, and names that apply to multiple people. Cross-referencing is very good. [...] Wikipedia searching is very easy.There were even cases where it was easier to search Wikipedia than the databases. [...] Wikipedia also &#8216;disambiguates&#8217; names and offers quick descriptions to enable the searcher to find the correct individual.&#8221;</em></dd>
</dl>
<p>A large part of the comparison consists of examining each resource&#8217;s production process. Wikipedians may find parallels to their policies on biographies of living people, self-published sources and notability in the description for the Biography Reference Bank:</p>
<dl>
<dd><em>&#8220;Current Biography [the main content source of BRB] articles rely on secondary sources, but Wilson [the then publisher] has occasionally spoken directly with subjects or their proxies. Upon publication, many articles have been sent to subjects for review before being updated for the print annual and the databases. If subjects raise objections, misinformation is corrected, but not matters of public record. Adjustments may be made for privacy, for example omitting the specific names of children.</em></dd>
<dd><em>&#8220;To be included in World Authors [another source of BRB], authors must have published more than one critically acclaimed book. [...]&#8220;</em></dd>
<dd><em>&#8220;For autobiographies, Wilson attempted to contact subjects in Junior Authors and World Authors for a statement, but not subjects in Current Biography. [... An example offered by a Wilson employee:] For some reason, Jennie Tourel, a Russian-American opera singer, often provided false information, but, according to the Wilson biography, “passports and other documents that surfaced soon after her death helped to correct some of these inaccuracies&#8217;&#8221;.</em></dd>
</dl>
<p>In the conclusion, the author answers the initial question by recommending that her employer &#8220;re-subscribe to a commercial biographical database&#8221; if the budget would permit it again, because &#8220;Commercial databases provide a foundation with authoritative core content authenticated prior to publication and integrated with the fabric of information in the library’s holdings. They are easy to search and reliable, although they cannot be as current as Wikipedia or the Web because of their authentication processes. Wikipedia become[sic] more impressive as searching proceeded. The focus may be on verifiability rather than authority and there may be challenges in securing contributors, but the current contributors provide citations and often include unique information.&#8221; All in all she seems to favor Wikipedia and the two databases over &#8220;The web&#8221; (Google results) which &#8220;may have plenty of dross and be less reliable, harder to search, and focused on commercialism, but there are gold nuggets.&#8221; She worries: &#8220;What will happen if contributors to Wikipedia and the web have no authoritative databases to use as sources?&#8221;</p>
<h3 id="Students_predict_connections_between_Wikipedians">Students predict connections between Wikipedians</h3>
<p>Among the student projects in a class on &#8220;<a href="http://www.cs.rpi.edu/~magdon/courses/CASPfall2011.html">Computational Analysis of Social Processes</a>&#8221; at <a title="w:Rensselaer Polytechnic Institute" href="http://en.wikipedia.org/wiki/Rensselaer_Polytechnic_Institute">Rensselaer Polytechnic Institute</a>, three analyzed social networks of Wikipedia editors:</p>
<ul>
<li>The write-up for a project titled &#8220;Interaction vs. Homophily in Wikipedia Administrator Selection&#8221;<sup id="cite_ref-2" class="reference"><a href="#cite_note-2">[3]</a></sup> provides an analysis of factors related to one&#8217;s participation (or lack thereof) in the <a title="w:WP:RFA" href="http://en.wikipedia.org/wiki/WP:RFA">Request for Adminship</a> discussions. It confirms previous findings that many participants are drawn to the discussions by their personal contacts and experiences with others. The paper tries to analyze the impact of direct past interaction versus <a title="w:homophily" href="http://en.wikipedia.org/wiki/homophily">homophily</a> (roughly defined as shared interests). The findings suggest that homophily plays a much smaller role compared to past interactions. Overall, it appears that administrators are often elected (or opposed) not by the community at large, but by a group of their closest peers. To quote from the conclusion of the paper: &#8220;This raises questions about the robustness of Wikipedia&#8217;s administrator selection process which is then comprised of a very small interaction-selected group of editors.&#8221;</li>
<li>Another project write-up titled &#8220;Link Prediction Analysis in the Wikipedia Collaboration Graph&#8221;<sup id="cite_ref-3" class="reference"><a href="#cite_note-3">[4]</a></sup> tested various models to predict the strength of the connection between two Wikipedia editors in a &#8220;dynamic collaboration graph&#8221; that measures, at a given point in time, how often they recently edited the same page, with more recent edits weighing stronger.</li>
<li>A third student paper titled &#8220;Link prediction on a Wikipedia dataset based on <a title="w:triadic closure" href="http://en.wikipedia.org/wiki/triadic_closure">triadic closure</a>&#8220;<sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[5]</a></sup> likewise tested various models on a similar graph consisting of Wikipedia users as vertices, regarding the closure of triangles (i.e. if user A is connected with B, and B with C, is A connected with C as well?). Among the conclusions is that such &#8220;triadic closure, while still occurring in Wikipedia, is happening at a slower pace now than before–likely due to the influx of less active editors&#8221;.</li>
</ul>
<h3 id="Language_analysis_finds_Wikipedia.27s_political_bias_moving_from_left_to_right">Language analysis finds Wikipedia&#8217;s political bias moving from left to right</h3>
<p>A study <a href="http://www.aeaweb.org/aea/2012conference/program/meetingpapers.php">presented</a> earlier this month at the annual meeting of the <a title="w:American Economic Association" href="http://en.wikipedia.org/wiki/American_Economic_Association">American Economic Association</a> which is to appear in <em><a title="w:The American Economic Review" href="http://en.wikipedia.org/wiki/The_American_Economic_Review">The American Economic Review</a></em><sup id="cite_ref-5" class="reference"><a href="#cite_note-5">[6]</a></sup> sets out to test whether the English Wikipedia is truly neutral, by measuring bias within a sample of 28,000 entries about US political topics, examined over a decade. The bias is identified through detecting the use of language specific to one side of the American political scene (Democrats or Republicans). To quote from the article: &#8220;In brief, we ask whether a given Wikipedia article uses phrases favored more by Republican members or by Democratic members of Congress&#8221; (in the text of the 2005 <a title="w:Congressional Record" href="http://en.wikipedia.org/wiki/Congressional_Record">Congressional Record</a>, using a method developed in an earlier paper by Gentzkow and Shapiro who applied it to newspapers). The authors identified, as of January 2011, 70,668 articles related to US politics, about 40% of which had a statistically significant bias. They find that Wikipedia articles are often biased upon creation, and that this bias rarely changes. Early on in Wikipedia&#8217;s history, most had a pro-Democratic bias, and while &#8220;by the last date, Wikipedia&#8217;s articles appear to be centered close to a middle point on average&#8221;, this is simply an effect of a larger amount of new pro-Republican articles than due to the existing ones having been rewritten neutrally.</p>
<p>While the authors made efforts to exclude articles not pertinent to US politics (requiring the terms &#8220;United States&#8221; or &#8220;America&#8221; to appear at least three times in the article text), the sample also includes the clearly international article <a title="w:Iraq War" href="http://en.wikipedia.org/wiki/Iraq_War">Iraq War</a>. And in what Wikipedians may call out as <a title="w:Wikipedia:systemic bias" href="http://en.wikipedia.org/wiki/Wikipedia:systemic_bias">systemic bias</a>, the authors never question their assumption that for an international encyclopedia, a lack of bias would be indicated by the replication of the spectrum of opinions present in the US Congress. <a href="http://www.pbs.org/mediashift/2006/04/email-debatewales-discusses-political-bias-on-wikipedia111.html">As early as 2006</a>, Jimmy Wales objected to such notions with respect to the community of contributors: &#8220;If averages mattered, and due to the nature of the wiki software (no voting) they almost certainly don&#8217;t, I would say that the Wikipedia community is slightly more liberal than the U.S. population on average, because we are global and the international community of English speakers is slightly more liberal than the U.S. population. &#8230; The idea that neutrality can only be achieved if we have some exact demographic matchup to [the] United States of America is preposterous.&#8221; Nevertheless, even if one turns the study on its head and reads it as a statement on average American political opinion compared to the rest of the world as reflected in the English Wikipedia, its results remain remarkable.</p>
<h3 id="Briefly">Briefly</h3>
<ul>
<li><strong>Calls for papers</strong>have appeared this month for
<ul>
<li><a href="http://www.wikisym.org/2012/01/17/wikisym-2012-call-for-participation/">WikiSym 2012</a>, the eighth instance of this <a title="w:WikiSym" href="http://en.wikipedia.org/wiki/WikiSym">annual research conference</a> on wikis and open collaboration</li>
<li><a href="http://wikimania2012.wikimedia.org/wiki/Submissions">Wikimania 2012</a>, the eighth <a title="w:Wikimania" href="http://en.wikipedia.org/wiki/Wikimania">annual global conference of Wikimedians</a></li>
<li><a href="http://blog.wikimedia.de/2012/01/17/call-for-papers-wikipedia-academy-2012-research-and-free-knowledge/">Wikipedia Academy: Research and Free Knowledge</a>, a conference organized by Wikimedia Germany</li>
<li>&#8220;<a href="http://www.onlinecreation.info/?p=474">Academic research into Wikipedia: Beyond English Wikipedia and towards comparative perspectives</a>&#8220;, an upcoming issue of the e-journal <em>Digithum</em></li>
</ul>
</li>
<li><strong>New effort at comprehensive wiki research literature database</strong>: Wikipedian <a title="w:User:emijrp" href="http://en.wikipedia.org/wiki/User:emijrp">emijrp</a> has <a href="http://lists.wikimedia.org/pipermail/wiki-research-l/2012-January/001799.html">announced</a> the launch of <a href="http://wikipapers.referata.com/">WikiPapers</a>, a <a title="w:Semantic MediaWiki" href="http://en.wikipedia.org/wiki/Semantic_MediaWiki">Semantic MediaWiki</a>-based wiki dedicated to the &#8220;compilation of resources (conference papers, journal articles, theses, books, datasets and tools) focused on the research of wikis&#8221;. The task of creating such a database has been <a title="w:Wikipedia:Wikipedia Signpost/2011-04-11/Recent research" href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-04-11/Recent_research">attempted several times</a> before and its difficulties were explored in a well-attended workshop at last year&#8217;s WikiSym conference (see the <a title="Research:Newsletter/2011-10-31" href="https://meta.wikimedia.org/wiki/Research:Newsletter/2011-10-31#Wiki_research_beyond_the_English_Wikipedia_at_WikiSym">October issue</a> of this newsletter). Researcher Finn Årup Nielsen (who last year published an overview of such literature,<sup id="cite_ref-6" class="reference"><a href="#cite_note-6">[7]</a></sup> mentioning well over 1000 publications) <a href="http://lists.wikimedia.org/pipermail/wiki-research-l/2012-January/001805.html">pointed out</a> the possibility of exchanging content between the new wiki, the existing (likewise Semantic MediaWiki-based) Acawiki and his own Brede Wiki.</li>
<li><strong>Review of <em>Good Faith Collaboration</em></strong>: Sociological journal <em><a title="w:The Information Society" href="http://en.wikipedia.org/wiki/The_Information_Society">The Information Society</a></em> reviewed<sup id="cite_ref-7" class="reference"><a href="#cite_note-7">[8]</a></sup> Joseph Reagle&#8217;s 2010 book <em><a title="w:Good Faith Collaboration" href="http://en.wikipedia.org/wiki/Good_Faith_Collaboration">Good Faith Collaboration: The Culture of Wikipedia</a></em> (which was recently <a href="http://reagle.org/joseph/blog/social/wikipedia/gfc-web-cc-announce">released</a> online under a Creative Commons license), praising it as &#8220;an accurate account of this sociocultural and sociotechnological phenomenon that Wikipedia is&#8221;. The reviewer calls Wikipedia a &#8220;virtual tool and reference <a title="w:Jim Dandy (song)" href="http://en.wikipedia.org/wiki/Jim_Dandy_(song)">jim-dandy</a> [which] is another flashpoint in our path of social anxieties&#8221; and holds that &#8220;nobody can think of a true rival [of Wikipedia] in this knowledge contest. The sins are there: lightness, temporary reliability, questionable scholarly approaches, sometimes oversimplification, sometimes data excess; however, these are venial sins and easily absolved.&#8221; Somewhat cryptically, he observes that &#8220;the European Union is trying to adapt this part of Western academia to the global university system (though inevitably Anglo-American inspired)&#8221;. He commends the book as an &#8220;accessible analysis [which] makes it clear that Wikipedia is not wasted knowledge; it is human thirst for knowledge and we are simply gathering scattered pieces&#8221;. Gentle criticism includes that &#8220;though [...] inaccuracies are stated, two other important worries — that it is not financially sustainable, and that Wikipedia has lost touch with its founding ideal—are not as openly dealt with&#8221;.</li>
<li><strong>Predicting categories from links</strong>: In a paper titled &#8220;Using Network Structure to Learn Category Classification in Wikipedia&#8221;<sup id="cite_ref-8" class="reference"><a href="#cite_note-8">[9]</a></sup> (the write-up of a class project for an Autumn 2011 Stanford course titled &#8220;<a href="http://www.stanford.edu/class/cs224w">Social and Information Network Analysis&#8221;</a>), three students describe the construction of a <a title="w:classifier" href="http://en.wikipedia.org/wiki/classifier">classifier</a> algorithm that tries to predict from an article&#8217;s ingoing and outgoing wikilinks whether it is a member of the <a title="w:Category:American actors" href="http://en.wikipedia.org/wiki/Category:American_actors">Category:American actors</a> – &#8220;We chose this particular category because it is one of the largest on Wikipedia (almost 25,000 pages)&#8221;.</li>
<li><strong>Wikipedia vs. library catalogue</strong>: An article in <em>Library and Information Research</em> titled &#8220;Searching where for what: A comparison of use of the library catalogue, Google and Wikipedia&#8221;<sup id="cite_ref-9" class="reference"><a href="#cite_note-9">[10]</a></sup> analyzed search queries from users of Google (using <a title="w:Hitwise" href="http://en.wikipedia.org/wiki/Hitwise">Hitwise</a> data) and Wikipedia, and a state library in Australia, unsurprisingly finding that the library catalogue is used much less frequently than the former two, but positing that the &#8220;fact that popular culture queries accounted for [a very] substantial proportion of Google and Wikipedia queries and almost no [library] catalogue queries indicates that, indeed, people do turn to different information resources for different subjects.&#8221;</li>
<li><strong>&#8220;Lexical clues&#8221; predict article quality</strong>: A paper was presented at the 3rd Symposium on Web Society (SWS) last October<sup id="cite_ref-10" class="reference"><a href="#cite_note-10">[11]</a></sup> which sought to predict article quality based on eight different ratios derived from counting the number of sentences, words, diverse words, nouns, verbs, diverse nouns, diverse verbs and copulas in the article text. They trained a <a title="w:decision tree" href="http://en.wikipedia.org/wiki/decision_tree">decision tree</a> on a sample of 200 start-class and 200 featured articles (truncating each of the latter to 800 to 1000 words to arrive at a typical start-class article length) and then tested it on a different sample of 100 start-class and 100 featured articles, achieving <a title="w:precision and recall" href="http://en.wikipedia.org/wiki/precision_and_recall">precision and recall</a> of more than 83% each.</li>
</ul>
<h2 id="References">References</h2>
<div class="references-small">
<ol class="references">
<li id="cite_note-0"><a href="#cite_ref-0">↑</a> Danescu-Niculescu-Mizil, C., Lee, L., Pang, B., &amp; Kleinberg, J. (2011). Echoes of power: Language effects and power differences in social interaction. <a href="http://arxiv.org/abs/1112.3670">http://arxiv.org/abs/1112.3670</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-1"><a href="#cite_ref-1">↑</a> Soules, A. (2012). Where’s the bio? Databases, Wikipedia, and the web. New Library World, 113(1/2), 77–89. Emerald Group Publishing Limited. <a title="doi:10.1108/03074801211199068" href="http://dx.doi.org/10.1108/03074801211199068">DOI:10.1108/03074801211199068</a> <a title="Closed access" href="http://en.wikipedia.org/wiki/Closed_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" alt="Closed access" width="12" height="19" /></a></li>
<li id="cite_note-2"><a href="#cite_ref-2">↑</a> Lavoie, A. (2011). Interaction vs. Homophily in Wikipedia Administrator Selection. <a href="http://assassin.cs.rpi.edu/~magdon/courses/casp/projects/Lavoie.pdf">http://assassin.cs.rpi.edu/~magdon/courses/casp/projects/Lavoie.pdf</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-3"><a href="#cite_ref-3">↑</a> Molnar, F. (2011). Link Prediction Analysis in the Wikipedia Collaboration Graph. <a href="http://assassin.cs.rpi.edu/~magdon/courses/casp/projects/Molnar.pdf">http://assassin.cs.rpi.edu/~magdon/courses/casp/projects/Molnar.pdf</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-4"><a href="#cite_ref-4">↑</a> George, R. (2011). Link prediction on a Wikipedia dataset based on triadic closure. <a href="http://www.cs.rpi.edu/~magdon/courses/casp/projects/George.pdf">http://www.cs.rpi.edu/~magdon/courses/casp/projects/George.pdf</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-5"><a href="#cite_ref-5">↑</a> Zhu, Feng; Greenstein, S. (forthcoming). Is Wikipedia Biased? American Economic Review (Papers and Proceedings). <a href="http://www-bcf.usc.edu/~fzhu/wikipediabias.pdf">http://www-bcf.usc.edu/~fzhu/wikipediabias.pdf</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-6"><a href="#cite_ref-6">↑</a> Nielsen, F. A. (2011). Wikipedia research and tools: Review and comments. <a href="http://www2.imm.dtu.dk/pubdb/views/publication_details.php?id=6012">http://www2.imm.dtu.dk/pubdb/views/publication_details.php?id=6012</a> (working paper) <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-7"><a href="#cite_ref-7">↑</a> Redondo-Olmedilla, J.-C. (2012). A Review of “Good Faith Collaboration: The Culture of Wikipedia.” The Information Society, 28(1), 53–54. Routledge. <a href="http://dx.doi.org/10.1080/01972243.2011.632286">http://dx.doi.org/10.1080/01972243.2011.632286</a> <a title="Closed access" href="http://en.wikipedia.org/wiki/Closed_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" alt="Closed access" width="12" height="19" /></a></li>
<li id="cite_note-8"><a href="#cite_ref-8">↑</a> Colgrove, Caitlin; Neidert, Julia; Chakoumakos, R. (2011). Using Network Structure to Learn Category Classification in Wikipedia. <a href="http://www.stanford.edu/class/cs224w/proj/colgrove_Finalwriteup_v1.pdf">http://www.stanford.edu/class/cs224w/proj/colgrove_Finalwriteup_v1.pdf</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-9"><a href="#cite_ref-9">↑</a> Waller, V. (2011, November 8). Searching where for what: A comparison of use of the library catalogue, Google and Wikipedia. Library and Information Research. <a href="http://www.lirgjournal.org.uk/lir/ojs/index.php/lir/article/view/466">http://www.lirgjournal.org.uk/lir/ojs/index.php/lir/article/view/466</a> <a title="Open access" href="http://en.wikipedia.org/wiki/Open_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" alt="Open access" width="12" height="19" /></a></li>
<li id="cite_note-10"><a href="#cite_ref-10">↑</a> Yanxiang, X., &amp; Tiejian, L. (2011). Measuring article quality in Wikipedia: Lexical clue model. 2011 3rd Symposium on Web Society (pp. 141–146). IEEE. <a href="http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6101286">http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6101286</a> <a title="Closed access" href="http://en.wikipedia.org/wiki/Closed_access"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" alt="Closed access" width="12" height="19" /></a></li>
</ol>
</div>
<p>&nbsp;</p>
<div style="float: left; width: 99%; border-top: 1px solid #CCC; border-bottom: 1px solid #CCC;">
<p style="text-align: center; clear: left; background-color: #eee; padding: .6em; font-size: 140%; font-weight: normal; font-family: Georgia, Times, serif; color: #666;"><em>Wikimedia Research Newsletter</em><br />
Vol: 2 • Issue: 1 • January 2012<br />
<span style="font-size: 80%;">This newletter is brought to you by the <a title="Research:Committee" href="https://meta.wikimedia.org/wiki/Research:Committee">Wikimedia Research Committee</a> and <a title="w:Wikipedia:Signpost" href="http://en.wikipedia.org/wiki/Wikipedia:Signpost">The Signpost</a><br />
<a title="Syndicate the Wikimedia Research Newsletter feed" href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" alt="Syndicate the Wikimedia Research Newsletter feed" width="16" height="16" /></a> <a title="Research:Newsletter/Archives" href="https://meta.wikimedia.org/wiki/Research:Newsletter/Archives">[<!--  -->archives]</a> <a title="w:Wikipedia:Wikipedia Signpost/2012-01-30/Recent research" href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2012-01-30/Recent_research">[signpost edition]</a> <a title="Research:Newsletter" href="https://meta.wikimedia.org/wiki/Research:Newsletter">[contribute]</a> <a title="Research:Index" href="https://meta.wikimedia.org/wiki/Research:Index">[research index]</a></span></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.wikimedia.org/2012/01/31/wikimedia-research-newsletter-january-2012/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wikimedia Research Newsletter, December 2011</title>
		<link>http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/</link>
		<comments>http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#comments</comments>
		<pubDate>Wed, 28 Dec 2011 17:39:33 +0000</pubDate>
		<dc:creator>Tilman</dc:creator>
				<category><![CDATA[Highlights]]></category>
		<category><![CDATA[Wikimedia Research Newsletter]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://blog.wikimedia.org/?p=8829</guid>
		<description><![CDATA[Vol: 1 • Issue: 6 • December 2011 [archives] Psychiatrists: Wikipedia better than Britannica; spell-checking Wikipedia; Wikipedians smart but fun; structured biological data With contributions by: Tbayer, DarTar and Jodi.a.schneider Contents 1 Mental health information on Wikipedia more accurate than Britannica and Kaplan &#38; Sadock psychiatry textbook 2 Psychologists gauge impact of Wikipedia&#8217;s Rorschach test [...]]]></description>
			<content:encoded><![CDATA[<div style="float:left; width: 99%;">
<p><a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter"><img alt="WRN header.png" src="http://upload.wikimedia.org/wikipedia/commons/7/7d/WRN_header.png" width="554" height="86" /></a></p>
<p style="clear:left; background-color: #EEE; padding: .3em; font-size:200%; font-weight:normal; font-family: Georgia, Times, serif; color:#666">Vol: 1 • Issue: 6 • December 2011 <span style="font-size:75%; float:right; margin-right:1em; text-transform:uppercase"><a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/" title="Syndicate the Wikimedia Research Newsletter feed"><img alt="Syndicate the Wikimedia Research Newsletter feed" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" width="16" height="16" /></a></span></p>
<p style="clear:left; padding: .3em; line-height: 130%; font-size:150%; font-weight:normal; font-style: italic; font-family: Georgia, Times, serif; color:#666">Psychiatrists: Wikipedia better than Britannica; spell-checking Wikipedia; Wikipedians smart but fun; structured biological data</p>
</div>
<p>
<b>With contributions by:</b> <a href="http://meta.wikimedia.org/wiki/User:Tbayer_(WMF)" title="User:Tbayer (WMF)">Tbayer</a>, <a href="http://meta.wikimedia.org/wiki/User:DarTar" title="User:DarTar">DarTar</a> and <a href="http://meta.wikimedia.org/wiki/User:Jodi.a.schneider" title="User:Jodi.a.schneider">Jodi.a.schneider</a></p>
<table id="toc" class="toc">
<tr>
<td>
<div id="toctitle">
<h2>Contents</h2>
</div>
<ul>
<li class="toclevel-1 tocsection-1"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Mental_health_information_on_Wikipedia_more_accurate_than_Britannica_and_Kaplan_.26_Sadock_psychiatry_textbook"><span class="tocnumber">1</span> <span class="toctext">Mental health information on Wikipedia more accurate than Britannica and Kaplan &amp; Sadock psychiatry textbook</span></a></li>
<li class="toclevel-1 tocsection-2"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Psychologists_gauge_impact_of_Wikipedia.27s_Rorschach_test_coverage"><span class="tocnumber">2</span> <span class="toctext">Psychologists gauge impact of Wikipedia&#8217;s Rorschach test coverage</span></a></li>
<li class="toclevel-1 tocsection-3"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Spell-checking_the_English_Wikipedia"><span class="tocnumber">3</span> <span class="toctext">Spell-checking the English Wikipedia</span></a></li>
<li class="toclevel-1 tocsection-4"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Wikipedians_are_.22smart_but_fun.22.2C_and_have_expertise_in_topics_they_edit"><span class="tocnumber">4</span> <span class="toctext">Wikipedians are &#8220;smart but fun&#8221;, and have expertise in topics they edit</span></a></li>
<li class="toclevel-1 tocsection-5"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Wikipedia_as_a_database_for_structured_biological_data"><span class="tocnumber">5</span> <span class="toctext">Wikipedia as a database for structured biological data</span></a></li>
<li class="toclevel-1 tocsection-6"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Individual_and_social_drivers_of_participation_in_Wikipedia"><span class="tocnumber">6</span> <span class="toctext">Individual and social drivers of participation in Wikipedia</span></a></li>
<li class="toclevel-1 tocsection-7"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Mining_article_revision_histories_for_insights_into_open_collaboration"><span class="tocnumber">7</span> <span class="toctext">Mining article revision histories for insights into open collaboration</span></a></li>
<li class="toclevel-1 tocsection-8"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#Briefly"><span class="tocnumber">8</span> <span class="toctext">Briefly</span></a></li>
<li class="toclevel-1 tocsection-9"><a href="http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/#References"><span class="tocnumber">9</span> <span class="toctext">References</span></a></li>
</ul>
</td>
</tr>
</table>
<h3 id="Mental_health_information_on_Wikipedia_more_accurate_than_Britannica_and_Kaplan_.26_Sadock_psychiatry_textbook">Mental health information on Wikipedia more accurate than Britannica and Kaplan &amp; Sadock psychiatry textbook</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="https://commons.wikimedia.org/wiki/File:Cloth_embroidered_by_a_schizophrenia_sufferer.jpg"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/b/b2/Cloth_embroidered_by_a_schizophrenia_sufferer.jpg/220px-Cloth_embroidered_by_a_schizophrenia_sufferer.jpg" width="220" height="165" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="https://commons.wikimedia.org/wiki/File:Cloth_embroidered_by_a_schizophrenia_sufferer.jpg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Wikipedia articles on <a href="http://en.wikipedia.org/wiki/schizophrenia" title="w:schizophrenia">schizophrenia</a> and other mental health topics were assessed for accuracy, richness of references and readability.</div>
</div>
</div>
<p>In an article for <i><a href="http://en.wikipedia.org/wiki/Psychological_Medicine" title="w:Psychological Medicine">Psychological Medicine</a></i>,<sup id="cite_ref-0" class="reference"><a href="#cite_note-0">[1]</a></sup> ten researchers from the <a href="http://en.wikipedia.org/wiki/University_of_Melbourne" title="w:University of Melbourne">University of Melbourne</a> conclude that &#8220;the quality of information on depression and schizophrenia on Wikipedia is generally as good as, or better than, that provided by centrally controlled websites, Encyclopaedia Britannica and a psychiatry textbook.&#8221;</p>
<p>The study focused on ten mental health topics (e.g. &#8220;antidepressants and suicide in young people&#8221; or &#8220;side-effects of antipsychotics&#8221;), five each in the areas of <a href="http://en.wikipedia.org/wiki/depression" title="w:depression">depression</a> and <a href="http://en.wikipedia.org/wiki/schizophrenia" title="w:schizophrenia">schizophrenia</a>. &#8220;Using the topic terms (or synonyms) as key words for the searches or through manual browsing, content relating to these topics was extracted from [Wikipedia and 13 other websites selected for prominent Google results for <i>depression</i> and <i>schizophrenia</i>] and from the most recent edition of <i>Kaplan &amp; Sadock’s Comprehensive Textbook of Psychiatry</i> &#8230; and the online version of <i><a href="http://en.wikipedia.org/wiki/Encyclopaedia_Britannica" title="w:Encyclopaedia Britannica">Encyclopaedia Britannica</a></i>&#8221; by two reviewers. For both depression and schizophrenia, three psychologists with clinical and research expertise in that area evaluated these extracts on accuracy, up-to-dateness, breadth of coverage, referencing and readability, on a scale from 1 to 5 (&#8220;e.g. Accuracy: 1 = many errors of fact or unsubstantiated opinions, 3=some errors of fact or unsubstantiated opinions, 5 = all information factually accurate&#8221;). As in an earlier study of the quality of health information on Wikipedia (<i>Signpost</i> coverage: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2010-06-07/In_the_news#Wikipedia.27s_cancer_coverage_is_reliable_and_thorough.2C_but_not_very_readable" title="w:Wikipedia:Wikipedia Signpost/2010-06-07/In the news">Wikipedia&#8217;s cancer coverage is reliable and thorough, but not very readable</a>&#8220;), readability was also measured using a <a href="http://en.wikipedia.org/wiki/Flesch%E2%80%93Kincaid_readability_test" title="w:Flesch–Kincaid readability test">Flesch–Kincaid readability test</a>, which is calculated from word and sentence lengths.</p>
<p>For both depression and schizophrenia, Wikipedia scored highest in the accuracy, up-to-dateness, and references categories – surpassing all other resources, including <a href="http://en.wikipedia.org/wiki/WebMD" title="w:WebMD">WebMD</a>, <a href="http://en.wikipedia.org/wiki/NIMH" title="w:NIMH">NIMH</a>, the <a href="http://en.wikipedia.org/wiki/Mayo_Clinic" title="w:Mayo Clinic">Mayo Clinic</a> and Britannica online. In breadth of coverage, it was behind Kaplan &amp; Saddock and others for both areas. And &#8220;of the online resources, Wikipedia was rated the least readable [by the human reviewers], although some of its topics received an average rating.&#8221; Likewise, the Wikipedia content had relatively high Flesch–Kincaid Grade Level indices (around 16 for schizophrenia and 15 for depression – indicating that a tertiary level of education is necessary to understand the content), similar to that of Britannica but higher than most other resources examined.</p>
<p>The authors note that their &#8220;findings largely parallel those of other recent studies of the quality of health information on Wikipedia&#8221; (citing eight such studies published between 2007 and 2010):</p>
<dl>
<dd><i>&#8220;Despite variability in the methodologies and conclusions of these studies, the overall implication is that Wikipedia articles on health topics typically contain relatively few factual errors, although they may lack breadth of coverage. &#8230; Given the number of patients, would-be patients and concerned others using the internet to search for information on health issues, it seems that Wikipedia is an appropriate recommendation as an information source.</i>&#8220;</dd>
</dl>
<h3 id="Psychologists_gauge_impact_of_Wikipedia.27s_Rorschach_test_coverage">Psychologists gauge impact of Wikipedia&#8217;s Rorschach test coverage</h3>
<p><span id="more-8829"></span></p>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:202px;"><a href="https://commons.wikimedia.org/wiki/File:Inkblot.svg" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/1/13/Inkblot.svg/200px-Inkblot.svg.png" width="200" height="135" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="https://commons.wikimedia.org/wiki/File:Inkblot.svg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>A black outline of the first of the ten cards in the Rorschach inkblot test (used in a pre-2009 version of the article <a href="http://en.wikipedia.org/wiki/Rorschach_test" title="w:Rorschach test">Rorschach test</a>)</div>
</div>
</div>
<p>A paper in the <i><a href="http://en.wikipedia.org/wiki/Journal_of_Personality_Assessment" title="w:Journal of Personality Assessment">Journal of Personality Assessment</a></i><sup id="cite_ref-1" class="reference"><a href="#cite_note-1">[2]</a></sup> tried to assess the impact of the Wikipedia article <a href="http://en.wikipedia.org/wiki/Rorschach_test" title="w:Rorschach test">Rorschach test</a> on psychologists&#8217; use of that test. As summarized by the authors, &#8220;In the summer of 2009, an emergency room physician [<a href="http://en.wikipedia.org/wiki/User:Jmh649" title="w:User:Jmh649">User:Jmh649</a> - James Heilman, MD] posted images of all 10 Rorschach inkblots on [...] Wikipedia. The images were accompanied by descriptions of “common responses” to each blot. &#8230; a fierce debate ensued between some psychologists who claimed that posting the inkblots is a threat to test security and other individuals, including some psychologists and other mental health professionals, who argued that all information should be freely available, including full details of the Rorschach&#8221;. (In fact, <a href="http://en.wikipedia.org/wiki/Talk:Rorschach_test/images" title="w:Talk:Rorschach test/images">the debates</a> on whether to display versions of the inkblots in the article go back to <a href="http://en.wikipedia.org/wiki/Talk:Rorschach_test/Archive_1#No_header" title="w:Talk:Rorschach test/Archive 1">at least 2005</a>, at first accompanied by rather spurious copyright claims &#8211; Rorschach died in 1922.) The authors note that the inkblots had already been revealed to the general public in a 1980s book and cite an earlier study<sup id="cite_ref-2" class="reference"><a href="#cite_note-2">[3]</a></sup> that had found &#8220;particularly damaging information&#8221; about personality assessment tests on the Internet as early as 2000, &#8220;including examples of test stimuli from&#8230; the Rorschach&#8221; (presumably including <a href="http://web.archive.org/web/20001001065937/http://deltabravo.net/custody/rorschach.htm">this site</a>). Still, &#8220;Internet coverage of the Rorschach appeared to grow exponentially during&#8221; the 2009 debate about the Wikipedia article, which made it to the front page of the New York Times (<i>Signpost</i> coverage: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2009-08-03/In_the_news#Rorschach_test_dispute_reported" title="w:Wikipedia:Wikipedia Signpost/2009-08-03/In the news">Rorschach test dispute reported</a>&#8220;).</p>
<p>The first part of the study examined the top 50 Google search results for &#8220;<a href="http://www.google.com/search?q=Rorschach+-watchmen">Rorschach</a>&#8221; (excluding &#8220;watchmen&#8221; in order to filter out results about a comic book and film) and <a href="http://www.google.com/search?&amp;q=%22inkblot%20test%22">&#8220;inkblot test&#8221;</a>, coding them into four levels representing the &#8220;threat each site presents to test security and the extent to which the content of the site might aid an individual in dissimulating on the Rorschach&#8221;. 44% of the sites were classified as Level 0 (&#8220;no threat&#8221;), e.g. home page of bands with &#8220;Rorschach&#8221; in their name, and 15% as Level 1 (&#8220;minimal threat&#8221;). The 22% Level 2 (&#8220;indirect threat&#8221;) sites which &#8220;tended to discuss test procedures more explicitly&#8221; apparently included &#8220;several &#8216;official&#8217; Rorschach Web sites, where one is able to register for Continuing Education Rorschach workshops, [and which] also allow visitors to purchase materials that contain sensitive test information. For example, certain training Web sites allow individuals to purchase training texts and instructional media without requiring a license or other professional credentials&#8221;. The authors find it &#8220;disturbing&#8221; that many sites in this threat category &#8220;were authored by psychologists&#8221;. 19% of the sites were classified as the highest level, &#8220;direct threat&#8221;, e.g. many that contained depictions of one or more Rorschach inkblots, or specific information about how responses are interpreted. Together with results about the high percentage of Internet users consulting Wikipedia for health information (36% in the US in 2007 according to Pew research), the authors conclude that &#8220;we can no longer presume that examinees have not been exposed to this information prior to an assessment&#8221;.</p>
<p>The second part of the study likewise starts out with a <a href="http://en.wikipedia.org/wiki/Google_News" title="w:Google News">Google News</a> search for &#8220;Rorschach&#8221; and &#8220;Wikipedia&#8221;, noting that &#8220;of the 25 news stories reviewed, 13 included one or more of the Rorschach inkblots, with Card I as the most frequently displayed&#8221;, and eventually arriving at five media stories about the controversy which allowed readers&#8217; comments (<a href="http://www.digitaljournal.com/article/276688">[1]</a>, <a href="http://www.findingdulcinea.com/news/science/2009/july/Rorschach-Fail–The-Test-s-Validity-Is-Again-Scrutinized-as-Plates-Appear-on-Wikipedia.html">[2]</a>, <a href="http://io9.com/5344390/all-of-rorschachs-secrets–revealed">[3]</a>, <a href="http://www.cbc.ca/technology/story/2009/07/31/rorschach-test.html">[4]</a>, <a href="http://www.nytimes.com/2009/07/29/technology/internet/29inkblot.html?r=1">[5]</a>). The altogether 520 comments on these stories were &#8220;coded according to the opinion expressed by the writer regarding each of the following categories: (a) the field of psychology, (b) psychologists, and (c) the Rorschach.&#8221; While the vast majority did not state a clear opinion on the first two categories, the authors note that &#8220;Of those comments that did express an opinion toward psychologists [ca. 16%] most were overwhelmingly negative.&#8221; Many more of the commenters on the Wikipedia/Rorschach news stories expressed an opinion about the test itself: &#8220;In total, 182 (35%) of comments were classified as unfavorable toward the Rorschach, whereas only 55 (11%) were coded as favorable toward the Rorschach. The remaining 283 (54%) comments were categorized as neutral or not mentioned.&#8221; Among those who identified as mental health professionals, 61% expressed a favorable opinion about the test and 15% a negative one.</p>
<p>Asked for his comment on the paper, Heilman said: &#8220;My main criticism of their paper is that they seem to take as axiomatic that exposure to these images hurts test reliability without any real evidence to back it up. Otherwise it is an interesting piece.&#8221; (The paper includes a section reviewing literature on &#8220;the impact of &#8216;coaching&#8217; on psychological tests&#8221;, however it does not mention results pertaining specifically to the Rorschach test, and mostly concerns subjects who deliberately try to &#8220;cheat&#8221; on such tests, rather than those who have accidentally been exposed to a test&#8217;s material before.)</p>
<h3 id="Spell-checking_the_English_Wikipedia">Spell-checking the English Wikipedia</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="https://commons.wikimedia.org/wiki/File:WikiCheck.jpg" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/0/07/WikiCheck.jpg/220px-WikiCheck.jpg" width="220" height="193" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="https://commons.wikimedia.org/wiki/File:WikiCheck.jpg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Proofreading Wikipedia articles: A screenshot from <a href="http://community.languagetool.org/wikiCheck">wikiCheck</a></div>
</div>
</div>
<p><a href="http://en.wikipedia.org/wiki/University_of_Nebraska-Lincoln" title="w:University of Nebraska-Lincoln">University of Nebraska-Lincoln</a> MBA candidate Jon Stacey reports on the results of a proof-of-concept tool to measure the rate of misspelled words in the English Wikipedia over time.<sup id="cite_ref-3" class="reference"><a href="#cite_note-3">[4]</a></sup><sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[5]</a></sup> A text parser (code <a href="http://jonsview.com/text-mining-wikipedia-for-misspelled-words#appendices_and_references">available for download</a>) was applied to a random sample of 2,400 articles. Instead of considering the latest revision, a random revision from the history of each article was used. The final corpus was obtained by stripping markup and non-ASCII characters as well as article sections such as the references and table of contents. Words were matched against a dictionary obtained by manually combining <i>12dicts</i> and <i>SCOWL</i> (<a href="http://wordlist.sourceforge.net/">source</a>) with <a href="http://en.wikipedia.org/wiki/Wiktionary" title="w:Wiktionary">Wiktionary</a>.</p>
<p>The results show that the percentage of misspellings has been growing steadily, reaching 6.23% for revisions created in 2011. Several weaknesses with the method are discussed, including the lack of Unicode support, the high rate of false positives, and the possibility that the rising rate might be associated with a rise in the complexity of content. The concluding remarks speculate on how semi-automated spell-checking may support editorial work at a large scale. (Wikipedians have used <a href="http://en.wikipedia.org/wiki/Wikipedia:lists_of_common_misspellings" title="w:Wikipedia:lists of common misspellings">lists of common misspellings</a> for many years, also integrated in semi-automatic editing tools such as <a href="http://en.wikipedia.org/wiki/Wikipedia:AutoWikiBrowser/Typos" title="w:Wikipedia:AutoWikiBrowser/Typos">AutoWikiBrowser</a>.)</p>
<p>In related news, the developers of an open-source multilingual proofreading application called <a href="http://www.languagetool.org/"><i>LanguageTool</i></a> released a beta application for proofreading Wikipedia articles. <a href="http://community.languagetool.org/wikiCheck">wikiCheck</a> proofreads articles from the English and German Wikipedias based on a set of customizable syntax and grammar rules. A bookmarklet is available to access the application from a browser.</p>
<h3 id="Wikipedians_are_.22smart_but_fun.22.2C_and_have_expertise_in_topics_they_edit">Wikipedians are &#8220;smart but fun&#8221;, and have expertise in topics they edit</h3>
<p>Three researchers from <a href="http://en.wikipedia.org/wiki/Stanford_University" title="w:Stanford University">Stanford University</a> and <a href="http://en.wikipedia.org/wiki/Yahoo!_Research" title="w:Yahoo! Research">Yahoo! Research</a> used a novel method to construct &#8220;a data-driven portrait of Wikipedia editors&#8221;, as described in a preprint currently undergoing review for publication.<sup id="cite_ref-5" class="reference"><a href="#cite_note-5">[6]</a></sup> While earlier studies relied on Wikipedians participating in surveys (and identifying themselves as such), the authors mined data from users of the <a href="http://en.wikipedia.org/wiki/Yahoo!_Toolbar" title="w:Yahoo! Toolbar">Yahoo! Toolbar</a> for Wikipedia URLs containing an <tt>&amp;action=submit</tt> parameter, thereby arriving at a sample of 1900 editors of the English Wikipedia.</p>
<p>Their first main finding is that &#8220;on broad average, Wikipedia editors seem, on the one hand, more sophisticated than usual Web users, reading more news, doing more Web search, and looking up more things in dictionaries and other reference works; on the other hand, they are also deeply immersed in pop culture, spending much online time on music- and movie-related websites.&#8221; However, these &#8220;entertainment lovers &#8230; form only a highly specialized subgroup that contributes many edits&#8221;.</p>
<p>Based on the toolbar data, the paper also tries to answer the question &#8220;Do Wikipedia editors know their domain?&#8221; and related questions, positively: &#8220;across all topical domains Wikipedia editors show significant expertise. &#8230; We also show that more substantial edits tend to come from experts&#8221;, and that logged-in editors show more expertise than IP editors. A final result is that &#8220;About half of the click chains culminating in an edit start with a Web search, with the other half originating on Wikipedia’s main page.&#8221;</p>
<h3 id="Wikipedia_as_a_database_for_structured_biological_data">Wikipedia as a database for structured biological data</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="http://en.wikipedia.org/wiki/Portal:Gene_Wiki" title="w:Portal:Gene Wiki"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/9/95/OxitocinaCPK3D.png/220px-OxitocinaCPK3D.png" width="220" height="182" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="https://commons.wikimedia.org/wiki/File:OxitocinaCPK3D.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>The <a href="http://en.wikipedia.org/wiki/Oxytocin" title="w:Oxytocin">Oxytocin</a> article is part of <a href="http://en.wikipedia.org/wiki/Portal:Gene_Wiki" title="w:Portal:Gene Wiki">Gene Wiki</a>, an open-access collection of Wikipedia articles about human genes.</div>
</div>
</div>
<p>A special issue of <i><a href="http://en.wikipedia.org/wiki/Nucleic_Acids_Research" title="w:Nucleic Acids Research">Nucleic Acids Research</a></i> features 11 articles describing how wikis and collaborative technology can be used to enhance biological databases. A commentary by Robert Finn, Paul Gardner and Alex Bateman<sup id="cite_ref-6" class="reference"><a href="#cite_note-6">[7]</a></sup> discusses in particular how to leverage Wikipedia, its collaborative infrastructure and large editor community to better integrate articles and biological data entries: the authors argue that the project offers an opportunity for crowdsourcing the curation and annotation of biological data, but faces major challenges for expert engagement, i.e. &#8220;how to get scientists <i>en masse</i> to edit articles&#8221; and &#8220;how to allow editors to receive credit for their work on an article&#8221;.</p>
<p>Another article in the same issue<sup id="cite_ref-7" class="reference"><a href="#cite_note-7">[8]</a></sup> presents <a href="http://en.wikipedia.org/wiki/Portal:Gene_Wiki" title="w:Portal:Gene Wiki">the Gene Wiki</a>, an open-access and openly editable collection of Wikipedia articles about human genes. The article describes how structured data available on Gene Wiki articles is kept in sync with the data from primary databases via an automated system and how to automatically compute the quality of articles in the project at word or sentence-level using <a href="http://en.wikipedia.org/wiki/WikiTrust" title="w:WikiTrust">WikiTrust</a>.</p>
<h3 id="Individual_and_social_drivers_of_participation_in_Wikipedia">Individual and social drivers of participation in Wikipedia</h3>
<p>A thesis entitled <i>Individual and social motivations to contribute to Commons-based peer production</i> was submitted by <a href="http://en.wikipedia.org/wiki/University_of_Minnesota" title="w:University of Minnesota">University of Minnesota</a> student Yoshikazu Suzuki for an MA in mass communication. The thesis presents and discusses the results from a small series of interviews as well as a <a href="http://meta.wikimedia.org/wiki/Research:Participatory_Motivation_to_Commons-based_Peer_Production" title="Research:Participatory Motivation to Commons-based Peer Production">survey exploring individual and social motivations of Wikipedia contributors</a>, drawing on <a href="http://en.wikipedia.org/wiki/social_identity_theory" title="w:social identity theory">social identity theory</a>, <a href="http://en.wikipedia.org/wiki/volunteerism" title="w:volunteerism">volunteerism</a> and <a href="http://en.wikipedia.org/wiki/uses_and_gratifications_theory" title="w:uses and gratifications theory">uses and gratifications theory</a>. The survey, run in July 2011 with support from the <a href="http://meta.wikimedia.org/wiki/RCom" title="RCom" class="mw-redirect">Wikimedia Research Committee</a>, collected 208 responses from a random sample of 950 among the top English Wikipedia editors. The results, obtained by applying <a href="http://en.wikipedia.org/wiki/principal_components_analysis" title="w:principal components analysis">principal components analysis</a> to the responses, reveal eight distinct motivational factors: providing information, the seeking of creative stimulation, concern for others’ well-being, the need to be entertained, the avoidance of negative self-affect, cognitive group membership, career benefits, and social desirability. An analysis of the relative strength of each factor indicates that providing information, the seeking of creative stimulation, and concerns for others’ well-being were the three strongest motivational dimensions. Grouping the eight factors into two macro-categories according to self- and other-focused motivations, the other-focused motivations were found to be significantly stronger than the self-focused motivations. The thesis reviews the implications of these results for the design of incentives for participation and editor retention. The full text of the thesis<sup id="cite_ref-8" class="reference"><a href="#cite_note-8">[9]</a></sup> and an <a href="http://www.scribd.com/doc/74939326/">executive summary</a> are available under open access.</p>
<h3 id="Mining_article_revision_histories_for_insights_into_open_collaboration">Mining article revision histories for insights into open collaboration</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="https://commons.wikimedia.org/wiki/File:History_comparison_example.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/3/3d/History_comparison_example.png/220px-History_comparison_example.png" width="220" height="193" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="https://commons.wikimedia.org/wiki/File:History_comparison_example.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Article revision histories are a rich data source to study patterns of collaboration on Wikipedia.</p></div>
</div>
</div>
<p>A paper in this month&#8217;s edition of <i><a href="http://en.wikipedia.org/wiki/First_Monday_(journal)" title="w:First Monday (journal)">First Monday</a></i>, ambitiously titled &#8220;Understanding collaboration in Wikipedia&#8221;<sup id="cite_ref-9" class="reference"><a href="#cite_note-9">[10]</a></sup>, reports on a statistical analysis of a complete dump of the English Wikipedia (225 million article edits) with regard to several quantities, starting with two that were introduced in a 2004 paper by Andrew Lih &#8220;as a simple measure for the reputation of [an] article within the Wikipedia&#8221;: the total number of edits an article has received (&#8220;rigor&#8221;) and the number of (logged-in and anonymous) users who have edited the article (&#8220;diversity&#8221;). The First Monday paper cites a 2007 study from the same journal, which found that featured articles tend to have more edits and contributors (while controlling for a few other variables)<sup id="cite_ref-10" class="reference"><a href="#cite_note-10">[11]</a></sup> as a justification for using &#8220;rigor&#8221; and &#8220;diversity&#8221; as proxies for article quality, but includes other quantities such as the article size change for an edit. The paper cites earlier work on evaluating Wikipedia article quality (e.g. dismissing the well-known 2005 <i>Nature</i> study based on the mistaken assumption that it had &#8220;only focused on featured articles&#8221;), but does not discuss existing attempts at more sophisticated quantitative quality heuristics.</p>
<p>The <i>First Monday</i> paper highlighted that if consecutive edits by the same user are counted as one, the overall number of article revisions drops by more than 33%, &#8220;revealing that one in three revisions in Wikipedia consist of users responding to their own edits or continuing an ongoing edit begun by themselves&#8221;. &#8220;Article diversity&#8221; ranged up to 12,437 contributors per article, with a median of 12 and an average of 32. One of the main conclusions is that &#8220;rather than reflecting the contributions and expertise of a large group of people, the typical article in Wikipedia reflects the efforts of a relatively small group of users (median of 12) who make a relatively small number of edits (median of 21).&#8221;</p>
<p>Supporting the assumption that most edits do not result in significant changes in content, the study finds that 31% of all revisions cause a size change of fewer than 10 characters, and 51% a change of fewer than 30 characters, with an apparently significant peak at a 4-character difference, presumably related to the insertion or removal of the four brackets (&#8220;[[ ... ]]&#8221;) that generate a wikilink.</p>
<p>The author notes the slight decrease in the overall number of edits since 2008, but tentatively explains it by the increasingly complete coverage of encyclopedic topics, and doesn&#8217;t share the widespread concerns about declining or stagnating editor activity: &#8220;participation in Wikipedia seems to remain as healthy as ever as revisions made per article created each year has annually increased since 2001 without exception&#8221;.</p>
<p>A different paper<sup id="cite_ref-11" class="reference"><a href="#cite_note-11">[12]</a></sup> from last year&#8217;s &#8220;Collaborative Innovation Networks Conference&#8221; similarly promises far-reaching insights from &#8220;Deconstructing Wikipedia&#8221; solely based on revision history statistics without analyzing the actual content changes, using a much smaller sample – 30 featured articles from the English Wikipedia, but also including timestamps. The data did not confirm the hypothesis that &#8220;the editor who initiated an article would have a high level of involvement in the article’s creation&#8221;: for only five of the 30 articles, the initial author was the most frequent contributor.</p>
<p>A second conclusion is that for all of the articles in the sample, &#8220;there is a single Wikipedian whose contributions far exceed all others&#8221;, ranging from 8% to 82% of the articles with an average of 39% (but the analysis does not seem to have sought to quantify the extent to which this exceeds the contributions of the second most frequent contributor). The author indicates that this supports <a href="http://en.wikipedia.org/wiki/Jaron_Lanier" title="w:Jaron Lanier">Jaron Lanier</a>&#8216;s &#8220;oracle illusion&#8221; criticism of a supposed presentation of Wikipedia as a product of &#8220;the crowd&#8221;. Somewhat tautologically, the author observes &#8220;that the control of an individual editor seemed to be reduced as more editors joined the process&#8221;, and points to the need to analyze &#8220;a significantly larger number of articles&#8221; to answer the question whether &#8220;too many cooks spoil the stew&#8221; (apparently unaware of the significant body of earlier literature on this subject, starting with a 2005 paper that presented an answer in its title: &#8220;<a href="http://www.mediachange.ch/publications/27/">Too many cooks don&#8217;t spoil the broth</a>&#8220;, and including the 2007 study which the above reviewed First Monday paper relied on).</p>
<p>A third result of the paper, which likewise might not surprise those already familiar with Wikipedia&#8217;s editing processes, is that &#8220;the creation process is continuous and can go on for a very long time&#8221;, with even articles about historic events from the distant past continuing to receive edits.</p>
<p>The author, an assistant professor in management and marketing at <a href="http://en.wikipedia.org/wiki/Virginia_State_University" title="w:Virginia State University">Virginia State University</a>, concludes the paper by urging his readers to start &#8220;thinking about how the wiki platform, itself, is influencing the creation process&#8221;.</p>
<h3 id="Briefly">Briefly</h3>
<ul>
<li>The Wikimedia Research Committee launched a <a href="http://docs.google.com/spreadsheet/viewform?hl=en_US&amp;formkey=dGNBSGFUcTdJLUxLcGpoWUNoQXM0SGc6MQ">public consultation</a> on the future <b>data/research infrastructure for Wikimedia</b>, in an effort to understand how to best serve the research and developer community with open data from our projects. The consultation will remain open through January 2012 and the full set of responses will be shared under a <a href="http://en.wikipedia.org/wiki/Creative_Commons_license#Public_domain_tools" title="w:Creative Commons license">CC0 license</a>.</li>
<li><b>Semantic enhancements</b>: In &#8220;Enhancing Wikipedia with semantic technologies&#8221;<sup id="cite_ref-12" class="reference"><a href="#cite_note-12">[13]</a></sup>, Lee et al. review existing interfaces for semantic search and present their own platform for enhancements. Based on small-scale user tests, they find that one of their three enhancements – range-based queries – are strongly preferred by users, who would find them desirable not only in Wikipedia but on the wider web. A longer summary is available on <a href="http://acawiki.org/index.php?title=Enhancing_Wikipedia_with_semantic_technologies">AcaWiki</a>.</li>
<li><b>English and Finnish Wikipedias egalitarian, Japanese hierarchical</b>: A paper titled &#8220;Analyzing cultural differences in collaborative innovation networks by analyzing editing behavior in different-language Wikipedias&#8221;<sup id="cite_ref-13" class="reference"><a href="#cite_note-13">[14]</a></sup> (from the <i>2010 Collaborative Innovation Networks Conference</i>, as was the revision statistics paper reviewed above) applied <a href="http://en.wikipedia.org/wiki/social_network_analysis" title="w:social network analysis">social network analysis</a> to collaboration on featured articles on the English, German, Japanese, Korean, and Finnish Wikipedias. It &#8220;found notable differences in the communication behavior among egalitarian cultures such as the Finnish, and quite hierarchical ones such as the Japanese. While the English language Wikipedia shows a distinctive pattern, most likely because it is by far the largest and frequently exploring new concepts copied by others, it seems to follow more the Finnish egalitarian, than the Japanese hierarchical style&#8221;.</li>
<li><a href="http://en.wikipedia.org/wiki/User:Emijrp" title="w:User:Emijrp">User:Emijrp</a> shared a link on <a href="http://lists.wikimedia.org/pipermail/wiki-research-l/2011-December/001753.html">wiki-research-l</a> listing <b><a href="http://pastebin.com/6MzvR6Vi">2,596 scholarly references on wikis</a></b>, obtained by scraping Google Scholar results (on December 22, 2011), as part of a project to build a comprehensive bibliography about wikis – a challenging task that has seen various earlier attempts and was the subject of a workshop at this year&#8217;s <a href="http://en.wikipedia.org/wiki/WikiSym" title="w:WikiSym">WikiSym</a> (see the <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-10-31/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-10-31/Recent research">October</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-04-11/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-04-11/Recent research">April</a> editions of this research report).</li>
<li><b>Should doctors use and edit Wikipedia?</b>: An editorial in the <i><a href="http://en.wikipedia.org/wiki/Journal_of_the_Royal_Society_of_Medicine" title="w:Journal of the Royal Society of Medicine">Journal of the Royal Society of Medicine</a></i><sup id="cite_ref-14" class="reference"><a href="#cite_note-14">[15]</a></sup> asks whether doctors should reject the use of Wikipedia. The two page article (one-day access: US$30.00) cites some results about the popularity of Wikipedia among medical students, young physicians and the general public, and for some reason highlights the malicious edits of British journalist <a href="http://en.wikipedia.org/wiki/Johann_Hari" title="w:Johann Hari">Johann Hari</a> as example for the downsides of Wikipedia&#8217;s free editability. It contains a review of the literature on the reliability of Wikipedia&#8217;s medical information which is less thorough than that of the <i>Psychological Medicine</i> article reviewed above, and comes to a less approving but still somewhat positive conclusion: &#8220;Although Wikipedia entries are often poorly structured and difficult to understand, they are comparable in accuracy to some online resources, such as health insurance websites.&#8221; In the end, the authors seems to lean towards recommending against ignoring Wikipedia: &#8220;One risk of clinicians disengaging from Wikipedia is that only contributors motivated by personal experience (e.g. patient anecdote) or vested interests (e.g. individual clinicians, institutions or companies promoting their own ideas and products) will remain.&#8221;</li>
</ul>
<h2 id="References">References</h2>
<div class="references-small">
<ol class="references">
<li id="cite_note-0"><a href="#cite_ref-0">↑</a> Reavley, N. J., Mackinnon, A. J., Morgan, A. J., Alvarez-Jimenez, M., Hetrick, S. E., Killackey, E., Nelson, B., et al. (2011). Quality of information sources about mental disorders: a comparison of Wikipedia with centrally controlled web and printed sources. <i>Psychological Medicine</i>, pp. 1-10. <a href="http://dx.doi.org/10.1017/S003329171100287X"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-1"><a href="#cite_ref-1">↑</a> Schultz, D. S., &amp; Loving, J. L. (2012). Challenges Since Wikipedia: The Availability of Rorschach Information Online and Internet Users’ Reactions to Online Media Coverage of the Rorschach–Wikipedia Debate. <i>Journal of Personality Assessment</i>, 94(1), 73-81. Routledge. <a href="http://dx.doi.org/10.1080/00223891.2011.627963"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-2"><a href="#cite_ref-2">↑</a> Ruiz, M., Drake, E., Glass, A., Marcotte, D., &amp; van Gorp, W. (2002). Trying to beat the system: Misuse of the Internet to assist in avoiding the detection of psychological symptom dissimulation. <i>Professional Psychology: Research and Practice</i>, 33, 294–299 <a href="http://www.sciencedirect.com/science/article/pii/S0735702802000400"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-3"><a href="#cite_ref-3">↑</a> Stacey, Jon (2011). <i>Text mining Wikipedia for misspelled words</i>. <a href="http://jonsview.com/text-mining-wikipedia-for-misspelled-words"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-4"><a href="#cite_ref-4">↑</a> Fogarty, Kevin (December 23, 2011). <a href="http://www.itworld.com/internet/235523/wikipedia-test-showes-americans-ubility-too-spel-detereeorating">Wikipedia test showes Americans&#8217; ubility too spel is detereeorating</a>. ITworld.</li>
<li id="cite_note-5"><a href="#cite_ref-5">↑</a> West, Robert, Ingmar Weber, and Carlos Castillo (2011). <i>Smart but Fun: A Data-Driven Portrait of Wikipedia Editors</i>. <a href="http://ai.stanford.edu/~west1/pubs/wikiedits.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-6"><a href="#cite_ref-6">↑</a> Finn, Robert D, Paul P Gardner, and Alex Bateman (2011). Making your database available through Wikipedia: the pros and cons. <i>Nucleic acids research</i>, 40(1) <a href="http://dx.doi.org/10.1093/nar/gkr1195"><b>DOI</b></a> • <a href="http://nar.oxfordjournals.org/content/40/D1/D9.full"><b>HTML</b></a><a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-7"><a href="#cite_ref-7">↑</a> Good, Benjamin M, Erik L Clarke, Luca de Alfaro, and Andrew I Su (2011). The Gene Wiki in 2011: Community intelligence applied to human gene annotation. <i>Nucleic acids research</i> 40 (1): D1255-1261. <a href="http://dx.doi.org/10.1093/nar/gkr925"><b>DOI</b></a> • <a href="http://nar.oxfordjournals.org/content/40/D1/D1255.full"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-8"><a href="#cite_ref-8">↑</a> Suzuki, Yoshikazu (2011) <i>Individual and social motivations to contribute to Commons-based peer production</i>, MA thesis, University of Minnesota. <a href="http://conservancy.umn.edu/bitstream/119040/1/Suzuki_Yoshikazu_November2011.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-9"><a href="#cite_ref-9">↑</a> Kimmons, Royce (2011). &#8220;Understanding collaboration in Wikipedia&#8221;. <i>First Monday</i> 16 (12). <a href="http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/viewArticle/3613/3117"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-10"><a href="#cite_ref-10">↑</a> Wilkinson, D.M., and B.A. Huberman (2007). &#8220;Assessing the value of cooperation in Wikipedia&#8221;. <i>First Monday</i> 12 (4). <a href="http://firstmonday.org/htbin/cgiwrap/bin/ojs/index.php/fm/article/view/1763/1643"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-11"><a href="#cite_ref-11">↑</a> Feldstein, A. (2011). &#8220;Deconstructing Wikipedia: Collaborative Content Creation in an Open Process Platform&#8221;. <i>Procedia – Social and Behavioral Sciences</i>, 26, 76–84. <a href="http://dx.doi.org/10.1016/j.sbspro.2011.10.564"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-12"><a href="#cite_ref-12">↑</a> Lian Hoy Lee, Christof Lutteroth, and Gerald Weber (2011). Enhancing Wikipedia with Semantic Technologies. In <i>iUBICOM&#8217;11: The 6th International Workshop on Ubiquitous and Collaborative Computing</i>, 2011. <a href="http://www.bcs.org/upload/pdf/ewic_ubi11_paper3.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-13"><a href="#cite_ref-13">↑</a> Nemoto, Keiichi, and Peter A. Gloor (2011). Analyzing Cultural Differences in Collaborative Innovation Networks by Analyzing Editing Behavior in Different-Language Wikipedias. <i>Procedia &#8211; Social and Behavioral Sciences</i> 26 (January 2011): 180-190. <a href="http://dx.doi.org/10.1016/j.sbspro.2011.10.574"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-14"><a href="#cite_ref-14">↑</a> Metcalfe, D., &amp; Powell, J. (2011). Should doctors spurn Wikipedia? <i>JRSM</i>, 104 (12), 488-489. <a href="http://dx.doi.org/10.1258/jrsm.2011.110227"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
</ol>
</div>
<p></p>
<div style="float:left; width: 99%; border-top: 1px solid #CCC; border-bottom: 1px solid #CCC;">
<p style="text-align:center; clear:left; background-color: #EEE; padding: .6em; font-size:140%; font-weight:normal;font-family: Georgia, Times, serif; color:#666"><i>Wikimedia Research Newsletter</i><br />
Vol: 1 • Issue: 6 • December 2011<br />
<span style="font-size:80%;">This newletter is brought to you by the <a href="http://meta.wikimedia.org/wiki/Research:Committee" title="Research:Committee">Wikimedia Research Committee</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Signpost" title="w:Wikipedia:Signpost">The Signpost</a><br />
<a href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/" title="Syndicate the Wikimedia Research Newsletter feed"><img alt="Syndicate the Wikimedia Research Newsletter feed" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" width="16" height="16" /></a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-12-26/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-12-26/Recent research">[signpost edition]</a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter">[contribute]</a> <a href="http://meta.wikimedia.org/wiki/Research:Index" title="Research:Index">[research index]</a></span></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.wikimedia.org/2011/12/28/wikimedia-research-newsletter-december-2011/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Wikimedia Research Newsletter, November 2011</title>
		<link>http://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/</link>
		<comments>http://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#comments</comments>
		<pubDate>Wed, 30 Nov 2011 02:57:34 +0000</pubDate>
		<dc:creator>Tilman</dc:creator>
				<category><![CDATA[Highlights]]></category>
		<category><![CDATA[Wikimedia Research Newsletter]]></category>
		<category><![CDATA[research]]></category>

		<guid isPermaLink="false">http://blog.wikimedia.org/?p=8017</guid>
		<description><![CDATA[Vol: 1 • Issue: 5 • November 2011 [archives] Quantifying quality collaboration patterns, systemic bias, POV pushing, the impact of news events, and editors&#8217; reputation With contributions by: Tbayer, Hfordsa, DarTar and Romanesco Contents 1 Collaboration pattern analysis: Editor experience more important than &#8220;many eyes&#8221; 2 Systemic bias quantified for twenty language Wikipedias 3 Does [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter"><img alt="WRN header.png" src="http://upload.wikimedia.org/wikipedia/commons/7/7d/WRN_header.png" width="554" height="86" /></a></p>
<p style="clear:left; background-color: #EEE; padding: .3em; font-size:200%; font-weight:normal; font-family: Georgia, Times, serif; color:#666">Vol: 1 • Issue: 5 • November 2011 <span style="font-size:75%; float:right; margin-right:1em; text-transform:uppercase"><a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/" title="Syndicate the Wikimedia Research Newsletter feed"><img alt="Syndicate the Wikimedia Research Newsletter feed" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" width="16" height="16" /></a></span></p>
<p style="clear:left; padding: .3em; line-height: 130%; font-size:150%; font-weight:normal; font-style: italic; font-family: Georgia, Times, serif; color:#666">Quantifying quality collaboration patterns, systemic bias, POV pushing, the impact of news events, and editors&#8217; reputation</p>
</div>
<p>
<b>With contributions by:</b> <a href="http://en.wikipedia.org/wiki/User:Tbayer_(WMF)" title="w:User:Tbayer (WMF)">Tbayer</a>, <a href="http://en.wikipedia.org/wiki/User:Hfordsa" title="w:User:Hfordsa">Hfordsa</a>, <a href="http://en.wikipedia.org/wiki/User:DarTar" title="w:User:DarTar">DarTar</a> and <a href="http://en.wikipedia.org/wiki/User:Romanesco" title="w:User:Romanesco">Romanesco</a></p>
<table id="toc" class="toc">
<tr>
<td>
<div id="toctitle">
<h2>Contents</h2>
</div>
<ul>
<li class="toclevel-1 tocsection-1"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Collaboration_pattern_analysis:_Editor_experience_more_important_than_.22many_eyes.22"><span class="tocnumber">1</span> <span class="toctext">Collaboration pattern analysis: Editor experience more important than &#8220;many eyes&#8221;</span></a></li>
<li class="toclevel-1 tocsection-2"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Systemic_bias_quantified_for_twenty_language_Wikipedias"><span class="tocnumber">2</span> <span class="toctext">Systemic bias quantified for twenty language Wikipedias</span></a></li>
<li class="toclevel-1 tocsection-3"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Does_.22In_the_news.22-like_attention_have_a_positive_effect_on_article_quality.3F"><span class="tocnumber">3</span> <span class="toctext">Does &#8220;In the news&#8221;-like attention have a positive effect on article quality?</span></a></li>
<li class="toclevel-1 tocsection-4"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Detecting_POV_pushing_editors"><span class="tocnumber">4</span> <span class="toctext">Detecting POV pushing editors</span></a></li>
<li class="toclevel-1 tocsection-5"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Historian_of_encyclopedias_reviews_Good_Faith_Collaboration"><span class="tocnumber">5</span> <span class="toctext">Historian of encyclopedias reviews Good Faith Collaboration</span></a></li>
<li class="toclevel-1 tocsection-6"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#Briefly"><span class="tocnumber">6</span> <span class="toctext">Briefly</span></a></li>
<li class="toclevel-1 tocsection-7"><a href="https://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/#References"><span class="tocnumber">7</span> <span class="toctext">References</span></a></li>
</ul>
</td>
</tr>
</table>
<h3 id="Collaboration_pattern_analysis:_Editor_experience_more_important_than_.22many_eyes.22">Collaboration pattern analysis: Editor experience more important than &#8220;many eyes&#8221;</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="http://meta.wikimedia.org/wiki/File:Claw.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/47/Claw.png/220px-Claw.png" width="220" height="269" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Claw.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>One of the motifs indicating article quality: One editor (top) having worked on several related articles (bottom)</p></div>
</div>
</div>
<p>A paper titled &#8220;Characterizing Wikipedia Pages Using Edit Network Motif Profiles&#8221;<sup id="cite_ref-0" class="reference"><a href="#cite_note-0">[1]</a></sup> by three researchers from University College Dublin indicates that the quality of a Wikipedia article can be predicted from characteristics of its &#8220;edit network&#8221; – a graph derived from the collaboration of Wikipedians in that area. <a href="http://en.wikipedia.org/wiki/Network_motif" title="w:Network motif">Network motifs</a> are small graphs which occur particularly frequently as sub-graphs of networks of a certain kind, and can be regarded as its building blocks in some sense. (The concept is popular in bioinformatics, where it is applied to <a href="http://en.wikipedia.org/wiki/gene_regulatory_network" title="w:gene regulatory network">gene regulatory networks</a>.) In this paper, the authors use graphs with at most five nodes consisting of users and articles, which are connected by an edge if the user has edited the article – giving 17 possible &#8220;Wikipedia network motifs&#8221;. (Anonymous users are disregarded.) For a Wikipedia article, the researchers form an &#8220;ego network&#8221; consisting of that article, articles which link to it (and have been edited by at least one of the users who edited the core article), and the users who edited them. For a sample of around 2000 articles from the History and United States categories, the frequencies of the 17 &#8220;Wikipedia network motifs&#8221; in those article&#8217;s &#8220;ego networks&#8221; were calculated.</p>
<p>Using machine learning techniques, the researchers are able to discern with some certainty articles of basic quality (defined as having been <a href="http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Council/Assessment_FAQ" title="w:Wikipedia:WikiProject Council/Assessment FAQ">assessed</a> as Start class by Wikipedians) from those of good quality (defined as Featured or B class), solely based on this set of motif frequencies in the article&#8217;s edit network. Looking at the impact of each of the 17 types separately, they found that &#8220;all network motifs have some potential to discriminate between good and basic Wikipedia articles&#8221; in the sample, but that among the four best predicting motifs, three are &#8220;stars with editors at their centre&#8221;:</p>
<dl>
<dd><i>&#8220;This is interesting because it shows that</i> many eyes <i>is not really the defining characteristic of quality; instead experience is important – the editors should have worked on many other articles.&#8221;</i></dd>
</dl>
<p><span id="more-8017"></span></p>
<p>Another section of the paper constructs <a href="http://en.wikipedia.org/wiki/spatialization" title="w:spatialization">spatializations</a> of the sample (i.e. a two-dimensional mapping where articles with similar motif frequency are close to each other). For the history articles sample, this visualization clearly separated B class and Start class articles, but Featured articles are &#8220;more spread out&#8221;, with two clusters on opposite sides of the diagram. The researchers made the interesting discovery that this seems related to the assessed importance of the articles:</p>
<dl>
<dd><i>&#8220;It transpires that the Featured Articles on the left are inclined to be low or mid importance compared to high or top importance articles on the right. This niche characteristic is emphasized by the fact that these articles are inclined not to have been featured on the Wikipedia main page. We conclude from this that, at least in edit network terms, some low importance Featured Articles look like more ordinary articles. &#8230; It seems that articles on niche topics can reach Featured Article status without a huge amount of collaboration.&#8221;</i></dd>
</dl>
<h3 id="Systemic_bias_quantified_for_twenty_language_Wikipedias">Systemic bias quantified for twenty language Wikipedias</h3>
<p>A paper titled &#8220;Cultural Configuration of Wikipedia: Measuring Autoreferentiality in Different Languages&#8221;<sup id="cite_ref-1" class="reference"><a href="#cite_note-1">[2]</a></sup> by two researchers from the <a href="http://en.wikipedia.org/wiki/Universitat_Polit%C3%A8cnica_de_Catalunya" title="w:Universitat Politècnica de Catalunya">Universitat Politècnica de Catalunya</a>, published in the proceedings of the &#8220;Recent Advances in Natural Language Processing&#8221; conference and apparently based on the first author&#8217;s masters thesis<sup id="cite_ref-2" class="reference"><a href="#cite_note-2">[3]</a></sup> attempts to test the hypothesis that &#8220;contributing for the visibility of the own national or language related content&#8221; is among the motivations to participate in Wikipedia. According to the authors, &#8220;some informal surveys in Catalan WP association ‘Amical Viquipèdia’ showed how the national topics were a focus of interest for writing and conflict&#8221;. They propose the concept of &#8220;autoreferentiality&#8221; &#8220;to describe the interest of a culture on itself, which in WP translates to the interest of editors for their own local content in a WP language edition&#8221;, and set out to measure it by various quantitative features, which are first defined on the article level and then tested on a selection of articles that are assumed to be &#8220;local content&#8221;, using the Java-based <a href="http://collablab.northwestern.edu/wikapidia_api/Wikapidia/Home.html">WikAPIdia</a> tool. (This set is formed by starting with a few keywords clearly pertaining to the local language, and then including articles which share categories – as examples from their own language, the authors list &#8220;“catalunya”, “català”, and also “valencia” or “mallorquí”&#8221; as start words, which &#8220;would retrieve titles in articles and categories like “escriptors de catalunya” or “dret català”, referring to writers and law&#8221;.) Among the tested quantitative features are:</p>
<ul>
<li>&#8220;Isolation&#8221;, based on the number of interwiki links</li>
<li>&#8220;Effort&#8221;, based on the size of the article and the number of internal links it contains</li>
<li>&#8220;Prominence&#8221;, based on the number of incoming wikilinks, the number of categories where the article is a member, and its <a href="http://en.wikipedia.org/wiki/PageRank" title="w:PageRank">PageRank</a></li>
<li>&#8220;Edition&#8221;, a measure of how diverse the authorship of an article is, specified as the smallest number of editors who together contribute 80% of the page&#8217;s edits (assuming to be lower for local content because it is edited by &#8220;highly motivated users&#8221;)</li>
</ul>
<p>The paper applies the eventual formula to Wikipedias in twenty languages – the English language edition is excluded &#8220;due to its size and difficulties in processing in all dimensions&#8221;, and the second and third largest Wikipedias (German and French) are missing as well. In the final &#8220;autoreferentiality index&#8221;, the Icelandic, Japanese and Swahili Wikipedias come out as the most local-focused among these twenty, while, curiously, the Catalan edition which prompted the research question has the lowest autoreferentiality value.</p>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:141px;"><a href="http://meta.wikimedia.org/wiki/File:Elizabeth_Taylor_trailer.jpg" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/8/86/Elizabeth_Taylor_trailer.jpg/139px-Elizabeth_Taylor_trailer.jpg" width="139" height="166" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Elizabeth_Taylor_trailer.jpg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Caused a &#8220;heavy editorial event&#8221; earlier this year: Elizabeth Taylor</p></div>
</div>
</div>
<h3 id="Does_.22In_the_news.22-like_attention_have_a_positive_effect_on_article_quality.3F">Does &#8220;In the news&#8221;-like attention have a positive effect on article quality?</h3>
<p>A five page paper<sup id="cite_ref-3" class="reference"><a href="#cite_note-3">[4]</a></sup> by a Ph.D. student in Computer Science at the <a href="http://en.wikipedia.org/wiki/University_of_Iowa" title="w:University of Iowa">University of Iowa</a> examines &#8220;The Impact of Heavy Editorial Events on Wikipedia Page Quality&#8221; – for example the flurry of edits to the article <a href="http://en.wikipedia.org/wiki/Elizabeth_Taylor" title="w:Elizabeth Taylor">Elizabeth Taylor</a> after the actor&#8217;s death in March 2011. To measure quality, the approach of an earlier paper<sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[5]</a></sup> is used, which assigns article contributors a reputation value depending on how many of their earlier contributions have been deleted, and by whom, and also takes into account whether the article revision in question was reverted later. The resulting formula was applied to &#8220;high editorial events&#8221; in 100 articles of the English Wikipedia, from the start of Wikipedia in 2001 until the beginning of 2010. As expected, the data supported the hypothesis that &#8220;high editorial events would contribute positively to a page&#8217;s quality&#8221;. The five articles impacted most positively among the studied sample (biased toward the beginning of the alphabet) were <a href="http://en.wikipedia.org/wiki/art" title="w:art">art</a>, <a href="http://en.wikipedia.org/wiki/Allen_Ginsberg" title="w:Allen Ginsberg">Allen Ginsberg</a>, <a href="http://en.wikipedia.org/wiki/anarcho-capitalism" title="w:anarcho-capitalism">anarcho-capitalism</a>, <a href="http://en.wikipedia.org/wiki/Chiropractic" title="w:Chiropractic">chiropractic</a> and <a href="http://en.wikipedia.org/wiki/death" title="w:death">death</a>. The paper also found that a higher increase in the edit rate was associated with a higher quality increase, but does not address the question of whether the relation could be explained by the mere number of edits (i.e. whether the same number of edits over a longer time might have had the same effect).</p>
<h3 id="Detecting_POV_pushing_editors">Detecting POV pushing editors</h3>
<p>A working paper posted this month to <a href="http://en.wikipedia.org/wiki/ArXiv" title="w:ArXiv">ArXiv</a> with the title &#8220;Pushing Your Point of View: Behavioral Measures of Manipulation in Wikipedia&#8221; presents a method to score the neutrality of Wikipedia contributors and to &#8220;detect potential POV pushing behavior&#8221;.<sup id="cite_ref-5" class="reference"><a href="#cite_note-5">[6]</a></sup> The authors propose two metrics to quantify an editor&#8217;s involvement in controversial topics. The first metric (<i>Controversy score</i> or <i>C-score</i>) measures the amount of attention spent by an individual editor on controversial articles, where controversiality is defined on the basis of several quantitative factors previously established in the literature. The second metric (<i>Clustered Controversy score</i> or <i>CC-score</i>) quantifies the focus of an editor&#8217;s attention on controversial articles on the same topic or very similar topics: the purpose of this metric is to tease apart editors involved in genuine controversy resolution (such as administrators who are likely to participate in a broad range of discussions on controversial topics) from &#8220;potentially manipulative users&#8221; who focus their attention on a narrow set of controversial topics. To assess the validity of the above metrics the authors test their discriminatory power at identifying which editors are blocked and which are regular users who were never blocked. The remainder of the paper examines the breakdown of edits by administrators immediately after a successful Request for Adminship. The results, based on qualitative coding by a single reviewer, suggests that some topical areas in the English Wikipedia (such as politics and media) are more likely to be frequently edited by administrators with a high C-score and CC-score than any other topical categories.</p>
<h3 id="Historian_of_encyclopedias_reviews_Good_Faith_Collaboration">Historian of encyclopedias reviews <i>Good Faith Collaboration</i></h3>
<p>The most recent issue of <i><a href="http://en.wikipedia.org/wiki/Annals_of_Science" title="w:Annals of Science">Annals of Science</a></i> (a scholarly journal about the history of science and technology, founded in 1936) contains a four-page review<sup id="cite_ref-6" class="reference"><a href="#cite_note-6">[7]</a></sup> of Joseph Reagle&#8217;s book <i><a href="http://en.wikipedia.org/wiki/Good_Faith_Collaboration" title="w:Good Faith Collaboration">Good Faith Collaboration: The Culture of Wikipedia</a></i> (published in 2010 and recently <a href="http://reagle.org/joseph/2010/gfc/">released</a> on the Web under a CC-BY-NC-SA license). The reviewer <a href="http://www.artsci.uc.edu/collegemain/faculty_staff/profile_details.aspx?ePID=MjcyOTk%3D">Jeff Loveland</a>, who has written extensively about the early history of encyclopedias, criticizes the book for having &#8220;one major weakness, namely in historical contextualization&#8221; (he mentions two 18th-century precedents which should have been given more attention, as they, like Wikipedia, intended to include contributions from the public: <a href="http://en.wikipedia.org/wiki/Vincenzo_Coronelli" title="w:Vincenzo Coronelli">Vincenzo Coronelli</a>&#8216;s <i><a href="http://en.wikipedia.org/wiki/Biblioteca_Universale_Sacro-Profana" title="w:Biblioteca Universale Sacro-Profana">Biblioteca Universale</a></i> and <a href="http://en.wikipedia.org/wiki/Johann_Heinrich_Zedler" title="w:Johann Heinrich Zedler">Zedler</a>&#8216;s <i><a href="http://en.wikipedia.org/wiki/Grosses_vollst%C3%A4ndiges_Universal-Lexicon" title="w:Grosses vollständiges Universal-Lexicon">Universal-Lexicon</a></i>) – and rejects Reagle&#8217;s claim that &#8220;historically, reference works have made few claims about neutrality as a stance of collaboration, or as an end result&#8221;: &#8220;References to such values as impartiality, unbiasedness and objectivity are frequent in the prefaces of encyclopaedias over the last three hundred years&#8221;. On the other hand, the reviewer praises the book for &#8220;com[ing] close to offering&#8221; a comprehensive introduction to Wikipedia, &#8220;touching as it does on nearly all aspects of the encyclopaedia&#8221; and he commends the author&#8217;s writing style as &#8220;informal, energetic and appropriately paced&#8221;. The &#8220;insightful and worthwhile&#8221; ethnography of Wikipedia is highlighted as the second success of the book.</p>
<p>Regarding <a href="http://reagle.org/joseph/2010/gfc/chapter-3.html">chapter 3</a> of the book, which postulates <a href="http://en.wikipedia.org/wiki/WP:NPOV" title="w:WP:NPOV">Neutral Point of View</a> and <a href="http://en.wikipedia.org/wiki/WP:AGF" title="w:WP:AGF">Assume Good Faith</a> as the two principles at &#8220;the heart of Wikipedia collaboration&#8221;, the review recommends &#8220;Anne Goldgar’s study of conduct as a force binding together the early modern <a href="http://en.wikipedia.org/wiki/Republic_of_Letters" title="w:Republic of Letters">Republic of Letters</a> in <i>Impolite Learning</i> (1995) [as] an interesting point of comparison&#8221; regarding &#8220;the historical connection between knowledge and civility&#8221;. Commenting on <a href="http://reagle.org/joseph/2010/gfc/chapter-7.html">chapter 7</a>, which examines criticism of Wikipedia, Loveland observes that &#8220;the portrayal by critics of a possible Wikipedian collective intelligence as anti-individualistic, or anti-rationalistic seems opportunistic and off-the-mark. Meanwhile, Wikipedia now bears the brunt of a refurbished but centuries-old accusation against encyclopaedias, namely that they trivialize and fragment knowledge.&#8221;</p>
<h3 id="Briefly">Briefly</h3>
<ul>
<li><b>Automatically assessing editors&#8217; reputations</b>: Wöhner, Thomas and Köhler present new metrics for automatic reputation assessment of Wikipedia editors. They evaluate seven potential metrics for reputation assessment including editing frequency and contribution to high-quality articles, plus new metrics that they conceived including &#8216;efficiency&#8217; which they define as &#8216;the portion of an author&#8217;s contribution that is persistent and quantifies the acceptance of the author within the Wikipedia community.&#8217; They evaluate the metrics using a database of the Germany Wikipedia from January, 2008 and tested their metrics against Wikipedia&#8217;s internal user classification of blocked users, administrators and anonymous users. They conclude that editing efficiency is most significant for reputation assessment since it was able to distinguish between blocked and regular authors with an accuracy rate of 86%.<sup id="cite_ref-7" class="reference"><a href="#cite_note-7">[8]</a></sup></li>
<li><b>Students reflect on Wikipedia assignments</b>: Chen and Reber present the results of a pilot study where students from a Norwegian and a German university were asked to reflect on their experience writing a Wikipedia article as a course assignment. They provide the results of two independent judges who analyzed the written reflections of students on ten dimensions including: relevance for society, learning outcome and difficulty, among others. The authors conclude that students were highly motivated by the task and &#8216;have learned much about the topic that they wrote about&#8217;.<sup id="cite_ref-8" class="reference"><a href="#cite_note-8">[9]</a></sup></li>
<li><b>Too few newbies?</b>: A paper titled &#8220;Too Few New Wikipedians? Modelling Effort and Participation in Wikipedia&#8221;<sup id="cite_ref-9" class="reference"><a href="#cite_note-9">[10]</a></sup> evaluates &#8220;the efficiency of the Wikipedia projects in different languages in transforming inputs (people using the Internet) into outputs (articles). We find a decreasing return to scale in the biggest projects, but the size or the age of the projects are not the main explanation for the variations in efficiency we see.&#8221;</li>
<li><b>How student editors use sources: synthesis vs plagiarism</b>: Sormunen and Lehtio report the findings of a pilot study on how Finnish secondary-school students use sources when they are required to contribute to Wikipedia as part of their coursework. They interviewed, observed, and analyzed the work of 11 groups of students, and found that: (1) the students relied almost exclusively on Web sources, (2) a sizable fraction (33%) of their work was copied verbatim (or very lightly edited) from their sources, (3) 30% of sources used were not cited at all. The dataset in this study is extremely small and the sample was not designed to be at all representative. Still, the conclusions are disconcerting, especially considering the recent controversy over student plagiarism in a related Wikipedia writing program (<i>Signpost</i> coverage: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-11-07/Special_report" title="w:Wikipedia:Wikipedia Signpost/2011-11-07/Special report">A post-mortem on the Indian Education Program pilot</a>&#8220;). The interviews with the students could potentially provide insight into why student plagiarism occurs.<sup id="cite_ref-10" class="reference"><a href="#cite_note-10">[11]</a></sup></li>
<li><b>Tracking changes in Wikipedia</b>: A student thesis in Computer Science at the <a href="http://en.wikipedia.org/wiki/University_of_Dresden" title="w:University of Dresden">University of Dresden</a><sup id="cite_ref-11" class="reference"><a href="#cite_note-11">[12]</a></sup> describes the prototype of a software that tracks and categorizes edits on Wikipedia – trying, among other things, to detect articles that are being affected by external events. In a test sample containing articles that had been subject to major news events in recent years, such as <a href="http://en.wikipedia.org/wiki/Fukushima_Daiichi_Nuclear_Power_Plant" title="w:Fukushima Daiichi Nuclear Power Plant">Fukushima Daiichi Nuclear Power Plant</a> or <a href="http://en.wikipedia.org/wiki/Dominique_Strauss-Kahn" title="w:Dominique Strauss-Kahn">Dominique Strauss-Kahn</a>, &#8220;about 74% of the events &#8230; have been detected and about 68% of these detected events (74%) are recognized correctly.&#8221;</li>
<li><b>Gendered language on Wikipedia</b>: Several Wikimedians have <a href="http://www.gossamer-threads.com/lists/wiki/foundation/257912">announced</a> a study titled &#8220;Mind the Gap(s)! Writing Styles of Female Editors on Wikipedia&#8221;,<sup id="cite_ref-12" class="reference"><a href="#cite_note-12">[13]</a></sup> applying algorithms that try to classify a text as &#8220;male&#8221; or &#8220;female&#8221; (based on the frequency of &#8220;male keywords&#8221; and &#8220;female keywords&#8221;) to text contributions by editors who state their gender on their user page (1,119 females and 722 males). Among the conclusions: &#8220;While the data is insufficient to reach the conclusion that Wikipedia attracts females who code their language usage as male in all circumstances on-wiki and off-wiki, we have shown that females use a more male style of writing when writing for Wikipedia.&#8221;</li>
</ul>
<h2 id="References">References</h2>
<div class="references-small">
<ol class="references">
<li id="cite_note-0"><a href="#cite_ref-0">↑</a> Wu, Guangyu, Martin Harrigan, and Pádraig Cunningham (2011). Characterizing Wikipedia pages using edit network motif profiles. In <i>Proceedings of the 3rd international workshop on Search and mining user-generated contents &#8211; SMUC 2011</i>, New York, NY, USA: ACM Press, October 28, 2011. <a href="http://dx.doi.org/10.1145/2065023.2065036"><b>DOI</b></a> • <a href="http://ir.ii.uam.es/smuc2011/res/papers/smuc2011_paper07.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-1"><a href="#cite_ref-1">↑</a> Ribé, Marc Miquel, and Horacio Rodrìguez (2011) Cultural Configuration of Wikipedia: Measuring Autoreferentiality in Different Languages. In <i>Proceedings of Recent Advances in Natural Language Processing</i>, 316-322. Hissar, Bulgaria. <a href="http://aclweb.org/anthology-new/R/R11/R11-1044.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-2"><a href="#cite_ref-2">↑</a> Ribé, Marc Miquel (2011) <i>Cultural conﬁguration of Wikipedia: Measuring autoreferentiality in different languages</i>. Universitat Politècnica de Catalunya. <a href="http://upcommons.upc.edu/pfc/bitstream/2099.1/11700/1/memoria.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-3"><a href="#cite_ref-3">↑</a> Oliver, Corey (2011) <i>The Impact of Heavy Editorial Events on Wikipedia Page Quality</i>. <a href="http://www.coreyoliver.org/research/Oliver2011.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-4"><a href="#cite_ref-4">↑</a> Javanmardi and C. Lopes. Statistical Measure of Quality in Wikipedia. In: 1st Workshop on Social Media Analytics (SOMA ’10), July 2010. <a href="http://snap.stanford.edu/soma2010/papers/soma2010_18.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-5"><a href="#cite_ref-5">↑</a> Das, Sanmay, Allen Lavoie, and Malik Magdon-Ismail (2011). Pushing Your Point of View: Behavioral Measures of Manipulation in Wikipedia. <i>arXiV</i>, November 8, 2011. <a href="http://arxiv.org/abs/1111.2092"><b>PDF</b></a><a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-6"><a href="#cite_ref-6">↑</a> Loveland, Jeff (2011). Review of: Good Faith Collaboration: The Culture of Wikipedia. <i>Annals of Science</i> 68 (4) (October) 555-558. <a href="http://dx.doi.org/10.1080/00033790.2011.564297"><b>DOI</b></a><a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-7"><a href="#cite_ref-7">↑</a> Wöhner, Thomas, Sebastian Köhler, and Ralf Peters (2011). Automatic Reputation Assessment in Wikipedia. In <i>ICIS 2011 Proceedings</i>. <a href="http://aisel.aisnet.org/icis2011/proceedings/onlinecommunity/5"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-8"><a href="#cite_ref-8">↑</a> Chen, Weiqin, and Rolf Reber (2011). Writing Wikipedia Articles as Course Assignment. In <i>Proceedings of the 19th International Conference on Computers in Education</i>, T. Hirashima et al. (Eds). Chiang Mai, Thailand. <a href="http://122.155.1.128/icce2011/program/proceedings/pdf/C6_S14_141S.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-9"><a href="#cite_ref-9">↑</a> Crowston, Kevin, Nicolas Jullien, and Felipe Ortega (2011) Too Few New Wikipedians? Modelling Effort and Participation in Wikipedia. SSRN eLibrary. <a href="http://ssrn.com/paper=1960696"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-10"><a href="#cite_ref-10">↑</a> Sormunen, Eero, and Leeni Lehtio (2011) <i>Authoring Wikipedia articles as an information literacy assignment – copy-pasting or expressing new understanding in one&#8217;s own words?</i> <a href="https://www12.uta.fi/blogs/know-id/files/2011/10/SormunenLehtio_IR2011.pdf"><b>PDF</b></a><a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-11"><a href="#cite_ref-11">↑</a> Deng, Yihan (2011) <i>Change Tracking in Wikipedia</i>. Master Thesis, <a href="http://www.rn.inf.tu-dresden.de/uploads/Studentische_Arbeiten/Belegarbeit_Deng_Yihan.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-12"><a href="#cite_ref-12">↑</a> <a href="http://en.wikipedia.org/wiki/User:LauraHale" title="w:User:LauraHale">LauraHale</a>, <a href="http://en.wikipedia.org/wiki/User:Hawkeye7" title="w:User:Hawkeye7">Hawkeye7</a>, <a href="http://en.wikipedia.org/wiki/User:Pine" title="w:User:Pine">Pine</a> and others, <a href="http://meta.wikimedia.org/wiki/Mind_the_Gap" title="Mind the Gap"><i>Mind the Gap(s)! Writing Styles of Female Editors on Wikipedia</i></a>.</li>
</ol>
</div>
<p></p>
<div style="float:left; width: 99%; border-top: 1px solid #CCC; border-bottom: 1px solid #CCC;">
<p style="text-align:center; clear:left; background-color: #EEE; padding: .6em; font-size:140%; font-weight:normal;font-family: Georgia, Times, serif; color:#666"><i>Wikimedia Research Newsletter</i><br />
Vol: 1 • Issue: 5 • November 2011<br />
<span style="font-size:80%;">This newletter is brought to you by the <a href="http://meta.wikimedia.org/wiki/Research:Committee" title="Research:Committee">Wikimedia Research Committee</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Signpost" title="w:Wikipedia:Signpost">The Signpost</a><br />
<a href="http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed/" title="Syndicate the Wikimedia Research Newsletter feed"><img alt="Syndicate the Wikimedia Research Newsletter feed" src="http://upload.wikimedia.org/wikipedia/commons/thumb/4/43/Feed-icon.svg/16px-Feed-icon.svg.png" width="16" height="16" /></a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-11-28/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-11-28/Recent research">[signpost edition]</a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter">[contribute]</a> <a href="http://meta.wikimedia.org/wiki/Research:Index" title="Research:Index">[research index]</a></span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.wikimedia.org/2011/11/29/wikimedia-research-newsletter-november-2011/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Wikimedia Research Newsletter, October 2011</title>
		<link>http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/</link>
		<comments>http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#comments</comments>
		<pubDate>Wed, 02 Nov 2011 15:43:15 +0000</pubDate>
		<dc:creator>Tilman</dc:creator>
				<category><![CDATA[Highlights]]></category>
		<category><![CDATA[Wikimedia Research Newsletter]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[WikiSym]]></category>

		<guid isPermaLink="false">http://blog.wikimedia.org/?p=7279</guid>
		<description><![CDATA[Vol: 1 • Issue: 4 • October 2011 [archives] WikiSym; predicting editor survival; drug information found lacking; RfAs and trust; Wikipedia&#8217;s search engine ranking justified With contributions by: Boghog, Jodi.a.schneider, Drdee, DarTar, Phoebe and Tbayer Contents 1 Wiki research beyond the English Wikipedia at WikiSym 2 Quality of drug information in Wikipedia 3 Predicting editor [...]]]></description>
			<content:encoded><![CDATA[<div style="float:left; width: 99%;">
<p><a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter"><img alt="WRN header.png" src="http://upload.wikimedia.org/wikipedia/commons/7/7d/WRN_header.png" width="554" height="86" /></a></p>
<p style="clear:left; background-color: #EEE; padding: .3em; font-size:200%; font-weight:normal; font-family: Georgia, Times, serif; color:#666">Vol: 1 • Issue: 4 • October 2011 <span style="font-size:75%; float:right; margin-right:1em; text-transform:uppercase"><a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a></span></p>
<p style="clear:left; padding: .3em; line-height: 130%; font-size:150%; font-weight:normal; font-style: italic; font-family: Georgia, Times, serif; color:#666">WikiSym; predicting editor survival; drug information found lacking; RfAs and trust; Wikipedia&#8217;s search engine ranking justified</p>
</div>
<p>
<b>With contributions by:</b> <a href="http://en.wikipedia.org/wiki/User:Boghog" title="en:User:Boghog">Boghog</a>, <a href="http://en.wikipedia.org/wiki/User:Jodi.a.schneider" title="en:User:Jodi.a.schneider">Jodi.a.schneider</a>, <a href="http://en.wikipedia.org/wiki/User:Drdee" title="en:User:Drdee">Drdee</a>, <a href="http://en.wikipedia.org/wiki/User:DarTar" title="en:User:DarTar">DarTar</a>, <a href="http://en.wikipedia.org/wiki/User:Phoebe" title="en:User:Phoebe">Phoebe</a> and <a href="http://en.wikipedia.org/wiki/User:Tbayer_(WMF)" title="en:User:Tbayer (WMF)">Tbayer</a></p>
<table id="toc" class="toc">
<tr>
<td>
<div id="toctitle">
<h2>Contents</h2>
</div>
<ul>
<li class="toclevel-1 tocsection-1"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#Wiki_research_beyond_the_English_Wikipedia_at_WikiSym"><span class="tocnumber">1</span> <span class="toctext">Wiki research beyond the English Wikipedia at WikiSym</span></a></li>
<li class="toclevel-1 tocsection-2"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#Quality_of_drug_information_in_Wikipedia"><span class="tocnumber">2</span> <span class="toctext">Quality of drug information in Wikipedia</span></a></li>
<li class="toclevel-1 tocsection-3"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#Predicting_editor_survival:_The_winners_of_the_Wikipedia_Participation_Challenge"><span class="tocnumber">3</span> <span class="toctext">Predicting editor survival: The winners of the Wikipedia Participation Challenge</span></a></li>
<li class="toclevel-1 tocsection-4"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#What_it_takes_to_become_an_admin:_Insights_from_the_Polish_Wikipedia"><span class="tocnumber">4</span> <span class="toctext">What it takes to become an admin: Insights from the Polish Wikipedia</span></a></li>
<li class="toclevel-1 tocsection-5"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#High_search_engine_rankings_of_Wikipedia_articles_found_to_be_justified_by_quality"><span class="tocnumber">5</span> <span class="toctext">High search engine rankings of Wikipedia articles found to be justified by quality</span></a></li>
<li class="toclevel-1 tocsection-6"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#Attempts_to_predict_the_outcome_of_AfD_discussions_from_an_article.27s_edit_history"><span class="tocnumber">6</span> <span class="toctext">Attempts to predict the outcome of AfD discussions from an article&#8217;s edit history</span></a></li>
<li class="toclevel-1 tocsection-7"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#In_brief"><span class="tocnumber">7</span> <span class="toctext">In brief</span></a></li>
<li class="toclevel-1 tocsection-8"><a href="http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/#References"><span class="tocnumber">8</span> <span class="toctext">References</span></a></li>
</ul>
</td>
</tr>
</table>
<h3 id="Wiki_research_beyond_the_English_Wikipedia_at_WikiSym">Wiki research beyond the English Wikipedia at WikiSym</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="http://meta.wikimedia.org/wiki/File:WikiSym_2011_panel.jpg" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/a/ac/WikiSym_2011_panel.jpg/220px-WikiSym_2011_panel.jpg" width="220" height="102" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:WikiSym_2011_panel.jpg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Panel discussion at WikiSym 2011</p></div>
</div>
</div>
<p><a href="http://en.wikipedia.org/wiki/WikiSym" title="en:WikiSym">WikiSym</a> 2011, the &#8220;7th international symposium on wikis and open collaboration&#8221;, took place from October 3-5 at the Microsoft Research Campus in Silicon Valley (<a href="http://en.wikipedia.org/wiki/Mountain_View" title="en:Mountain View">Mountain View</a>, California). Although the conference&#8217;s scope has broadened to include the study of open online collaborations that are not wiki-based, Wikipedia-related research still took up a large part of the <a href="http://www.wikisym.org/ws2011/program:schedule">schedule</a>. Several of the conference papers have already been reviewed in the <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/2011-09-26" title="Research:Newsletter/2011-09-26">September</a> and <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/2011-08-29" title="Research:Newsletter/2011-08-29">August</a> issues of this research overview, and the rest of the <a href="http://www.wikisym.org/ws2011/program:proceedings">proceedings</a> have since become available online.</p>
<p><span id="more-7279"></span></p>
<p>The workshop &#8220;<b><a href="http://www.wikisym.org/ws2011/workshop:wikilit">WikiLit: Collecting the Wiki and Wikipedia Literature</a></b>&#8220;<sup id="cite_ref-0" class="reference"><a href="#cite_note-0">[1]</a></sup>, led by <a href="http://en.wikipedia.org/wiki/user:phoebe" title="en:user:phoebe">Phoebe Ayers</a> and Reid Priedhorsky, explored the daunting task of collecting the scholarly literature pertaining to Wikipedia and wikis generally. Research about wikis can be difficult to find, since there are papers published in many fields (from sociology to computer science) and in many formats, from published articles to on-wiki community documents. There have been several attempts over the years to collect the wiki and Wikipedia literature, including <a href="http://en.wikipedia.org/wiki/Wikipedia:Academic_studies_of_Wikipedia" title="en:Wikipedia:Academic studies of Wikipedia">on Wikipedia itself</a>, but all such projects have suffered from not keeping up to date with the sheer volume of research that is published every year. While the workshop did not reach consensus on what platform to proceed with to build a sustainable system, there was agreement that this is an important topic for the research and practitioner community, and the group developed a <a href="http://www.wikisym.org/ws2011/workshop:wikilit">list of requirements</a> that such a system should have. The workshop followed and extended discussions on the <a href="https://lists.wikimedia.org/mailman/listinfo/wiki-research-l">wiki-research-l</a> mailing list earlier this year on the topic.</p>
<p>In a panel titled &#8220;<b><a href="http://wikisym.org/ws2011/proceedings:p227-yew">Apples to Oranges?: Comparing across studies of open collaboration/peer production</a></b>&#8220;,<sup id="cite_ref-1" class="reference"><a href="#cite_note-1">[2]</a></sup> six US-based scholars reviewed the state of this field of research. Among the takeaways were a call to study failed collaboration projects more often instead of focusing research on successful &#8220;anomalies&#8221; like Wikipedia, and – especially in the case of Wikipedia – to broaden research to non-English projects.</p>
<p>Another workshop, titled &#8220;<a href="http://wikisym.org/ws2011/proceedings:p231-schulenburg"><b>Lessons from the classroom: successful techniques for teaching wikis using Wikipedia</b></a>&#8220;<sup id="cite_ref-2" class="reference"><a href="#cite_note-2">[3]</a></sup> was a retrospective on the Wikimedia Foundation&#8217;s <a href="http://outreach.wikimedia.org/wiki/Public_Policy_Initiative" title="outreach:Public Policy Initiative">Public Policy Initiative</a>.</p>
<p>Among the conference papers not mentioned before in this newsletter are:</p>
<ul>
<li>&#8220;<b>Mentoring in Wikipedia: a clash of cultures</b>&#8221; <sup id="cite_ref-3" class="reference"><a href="#cite_note-3">[4]</a></sup>, a paper which &#8220;draw[s] insights from the offline mentoring literature to analyze mentoring practices in Wikipedia and how they influence editor behaviors. Our quantitative analysis of <a href="http://en.wikipedia.org/wiki/Wikipedia:Adopt-a-user" title="en:Wikipedia:Adopt-a-user">the Adopt-a-user program</a> shows mixed success of the program&#8221;.</li>
<li>&#8220;<b>Vandalism Detection in Wikipedia: A High-Performing, Feature–Rich Model and its Reduction</b>&#8220;<sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[5]</a></sup> – arguing that on Wikipedia &#8220;human vigilance is not enough to combat vandalism, and tools that detect possible vandalism and poor-quality contributions become a necessity&#8221;, the authors present a vandalism classifier constructed using <a href="http://en.wikipedia.org/wiki/machine_learning" title="en:machine learning">machine learning</a> techniques.</li>
</ul>
<p>Wikipedia-related posters included</p>
<ul>
<li>&#8220;<b>A scourge to the pillar of neutrality: a WikiProject fighting systemic bias</b>&#8220;<sup id="cite_ref-5" class="reference"><a href="#cite_note-5">[6]</a></sup> presenting preliminary findings from an ongoing survey and interviews among members of the <a href="http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Countering_systemic_bias" title="en:Wikipedia:WikiProject Countering systemic bias">WikiProject Countering systemic bias</a>.</li>
<li>Another poster presentation planned to analyze the contributions of the members of this WikiProject to see what kind of systemic bias they might exhibit themselves (&#8220;<b>Places on the map and in the cloud: representations of locality and geography in Wikipedia</b>&#8220;<sup id="cite_ref-6" class="reference"><a href="#cite_note-6">[7]</a></sup>).</li>
<li>&#8220;<b>Participation in Wikipedia&#8217;s article deletion processes</b>&#8220;<sup id="cite_ref-7" class="reference"><a href="#cite_note-7">[8]</a></sup> found that &#8220;the deletion process is heavily frequented by a relatively small number of longstanding users&#8221; and that &#8220;the vast majority of [speedily] deleted articles are not spam, vandalism, or &#8216;patent nonsense&#8217;, but rather articles which could be considered encyclopedic, but do not fit the project&#8217;s standards&#8221;.</li>
<li>&#8220;<b>Exploring underproduction in Wikipedia</b>&#8220;<sup id="cite_ref-8" class="reference"><a href="#cite_note-8">[9]</a></sup> examined &#8220;two key circumstances in which collective production can fail to respond to social need: when goods fail to attain high quality despite (1) high demand or (2) explicit designation by producers as highly important&#8221;.</li>
</ul>
<h3 id="Quality_of_drug_information_in_Wikipedia">Quality of drug information in Wikipedia</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:222px;"><a href="http://meta.wikimedia.org/wiki/File:Lovastatin.svg" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/5/50/Lovastatin.svg/220px-Lovastatin.svg.png" width="220" height="188" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Lovastatin.svg" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p><a href="http://en.wikipedia.org/wiki/Lovastatin" title="en:Lovastatin">Lovastatin</a>, the first <a href="http://en.wikipedia.org/wiki/statin" title="en:statin">statin</a> to be marketed.</div>
</div>
</div>
<p>A study entitled &#8220;Accuracy and completeness of drug information in Wikipedia: an assessment&#8221;<sup id="cite_ref-9" class="reference"><a href="#cite_note-9">[10]</a></sup> in this month&#8217;s issue of the <i>Journal of the Medical Library Association</i> of five widely prescribed <a href="http://en.wikipedia.org/wiki/statin" title="en:statin">statins</a> found that while these Wikipedia drug articles are generally accurate, they are incomplete and inconsistent. The study&#8217;s authors conclude:</p>
<table align="center" style="border-collapse:collapse; border-style:none; background-color:transparent;" class="cquote">
<tr>
<td width="20" valign="top" style="color:#B2B7F2;font-size:35px;font-family:'Times New Roman',serif;font-weight:bold;text-align:left;padding:10px 10px;">“</td>
<td valign="top" style="padding:4px 10px;">Because the entries on the five most commonly prescribed statins lacked important information, the authors recommend that consumers should seek other sources and not rely solely on Wikipedia.</td>
<td width="20" valign="bottom" style="color:#B2B7F2;font-size:36px;font-family:'Times New Roman',serif;font-weight:bold;text-align:right;padding:10px 10px;">”</td>
</tr>
</table>
<p>The main criticism by the study is that most of the articles lacked sufficient information on adverse effects, contraindications, and drug interactions and this lack of information might harm the consumer. These criticisms echo earlier ones (two similar studies reported in the <i>Signpost</i>: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-04-11/Recent_research#Pharmacological_study_criticizes_reliability_of_Wikipedia_articles_about_the_top_20_drugs" title="en:Wikipedia:Wikipedia Signpost/2011-04-11/Recent research">Pharmacological study criticizes reliability of Wikipedia articles about the top 20 drugs</a>&#8220;, &#8221; <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2009-01-03/Drug_comparison" title="en:Wikipedia:Wikipedia Signpost/2009-01-03/Drug comparison">Wikipedia drug coverage compared to Medscape, found wanting</a>&#8220;). However the authors did note the benefit of Wikipedia hypertext links to additional information that most other web sources on drug information lack and in addition noted that all the Wikipedia articles contained references to peer reviewed journals and other reliable sources. Hence overall, the latest study is somewhat more positive than the earlier two.</p>
<h3 id="Predicting_editor_survival:_The_winners_of_the_Wikipedia_Participation_Challenge">Predicting editor survival: The winners of the Wikipedia Participation Challenge</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:152px;"><a href="http://meta.wikimedia.org/wiki/File:Wiki_Participation_Challenge.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/1/1b/Wiki_Participation_Challenge.png" width="150" height="120" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Wiki_Participation_Challenge.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Banner of the <a href="http://meta.wikimedia.org/wiki/Research:Wiki_Participation_Challenge" title="Research:Wiki Participation Challenge">Wikipedia Participation Challenge</a></div>
</div>
</div>
<p>The Wikimedia Foundation announced the winner of the <a href="http://meta.wikimedia.org/wiki/Research:Wiki_Participation_Challenge" title="Research:Wiki Participation Challenge">Wikipedia Participation Challenge</a>. The data competition, organized in partnership with <a href="http://en.wikipedia.org/wiki/Kaggle" title="en:Kaggle">Kaggle</a> and the <a href="http://www.eecs.wsu.edu/~holder/icdm2011contest/">2011 IEEE International Conference on Data Mining</a>, asked data scientists to use Wikipedia editor data and develop an algorithm to predict the number of future edits, and in particular one that correctly predicts who will stop editing and who will continue to edit (see the <a href="http://blog.wikimedia.org/2011/06/28/data-competition-announcing-the-wikipedia-participation-challenge/">call for submissions</a>). The response was overwhelming, with 96 participating teams, comprising in total 193 people who jointly submitted 1029 entries (listed in the competition&#8217;s <a href="http://www.kaggle.com/c/wikichallenge/Leaderboard">leaderboard</a>).</p>
<p>The brothers Ben and Fridolin Roth (from team prognoZit) developed the winning algorithm. They developed a <a href="http://en.wikipedia.org/wiki/linear_regression" title="en:linear regression">linear regression</a> model using <a href="http://en.wikipedia.org/wiki/Python_(programming_language)" title="en:Python (programming language)">Python</a> and <a href="http://en.wikipedia.org/wiki/GNU_Octave" title="en:GNU Octave">GNU Octave</a>. The algorithm used 13 features (2 based on reverts and 11 based on past editing behavior) to predict future editing activity. Both the <a href="http://dumps.wikimedia.org/other/wikichallenge/">source code</a> and a <a href="http://meta.wikimedia.org/wiki/Research:Wiki_Participation_Challenge_prognoZit" title="Research:Wiki Participation Challenge prognoZit">description of the algorithm</a> are available. Unfortunately, because it relied on patterns in the training dataset that would not be present in the actual one, the model&#8217;s ongoing use is severely restricted.</p>
<p>Second place went to <a href="http://blog.kaggle.com/2011/10/06/like-popping-bubble-wrap/">Keith Herring</a>. Submitting only 3 entries, he developed a highly accurate model, using <a href="http://en.wikipedia.org/wiki/random_forests" title="en:random forests">random forests</a>, and utilizing a total of 206 features. His model shows that a randomly selected Wikipedia editor who has been active in the past year has approximately an 85 percent probability of becoming inactive (no new edits) in the following 5 months. The most informative features captured both the edit timing and volume of an editor&#8217;s activity.</p>
<p>The challenge also announced two Honourable Mentions for participants who only used open source software. The first Honourable Mention went to <a href="http://blog.kaggle.com/2011/10/26/long-live-wikipedia-dell-zhang/">Dell Zang</a> (team zeditor) who used a machine learning technique called <a href="http://en.wikipedia.org/wiki/gradient_boosting" title="en:gradient boosting">gradient boosting</a>. His model mainly uses recent past editor activity. The second Honourable Mention went to Roopesh Ranjan and Kalpit Desai (team Aardvarks). Using <a href="http://en.wikipedia.org/wiki/Python_(programming_language)" title="en:Python (programming language)">Python</a> and <a href="http://en.wikipedia.org/wiki/R_(programming_language)" title="en:R (programming language)">R</a>, they too developed a random forest model. Their model used 113 features, mainly based on the number of reverts and past editor activity (see its <a href="http://meta.wikimedia.org/wiki/Research:Wiki_Participation_Challenge_Aardvarks" title="Research:Wiki Participation Challenge Aardvarks">full description</a>).</p>
<p>All the documentation and source code has been made available on the main entry page for the <a href="http://meta.wikimedia.org/wiki/Research:Wiki_Participation_Challenge" title="Research:Wiki Participation Challenge">WikiChallenge</a>.</p>
<h3 id="What_it_takes_to_become_an_admin:_Insights_from_the_Polish_Wikipedia">What it takes to become an admin: Insights from the Polish Wikipedia</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:137px;"><a href="http://meta.wikimedia.org/wiki/File:Wikipedia-logo-v2-pl.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/a/ad/Wikipedia-logo-v2-pl.png" width="135" height="155" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Wikipedia-logo-v2-pl.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Logo of the Polish Wikipedia</p></div>
</div>
</div>
<p>A team of researchers based at the <a href="http://www.pjwstk.edu.pl/en/">Polish Japanese Institute of Information Technology</a> (PJIIT) published a study presented at <i>SocInfo 2011</i> looking at <a href="http://en.wikipedia.org/wiki/Wikipedia:Requests_for_adminship" title="en:Wikipedia:Requests for adminship">Requests for Adminship (RfA)</a> discussions in the Polish Wikipedia.<sup id="cite_ref-10" class="reference"><a href="#cite_note-10">[11]</a></sup> The paper presents a number of statistics about adminship in the Polish Wikipedia since the RfA procedure was formalized (2005), including the rejection rate of candidates across different rounds, the number of candidates and votes over the years and the distribution of tenure and experience of candidates for adminship. The results indicate that it was far more complicated to obtain admin status in 2010 than it was in previous years, and that tenure required to be a successful RfA candidate has soared dramatically: &#8220;the mean number of days since registration to receiving adminship is nearly five times larger than it was five years before&#8221;.</p>
<p>The remainder of the paper studies RfA discussions by comparing the social network of participants based on their endorsement (vote-for) or rejection (vote-against) of a given candidate with an implicit social network derived from three different types of relations between contributors (trust, criticism and acquaintance). The goal is to measure to what extent these different kinds of relations can predict voting behavior in the context of RfA discussions. The findings suggest that &#8220;trust&#8221; and &#8220;acquaintance&#8221; (measured respectively as the amount of edits by an editor in the vicinity of those by the other editor and as the amount of discussions between two contributors) are significantly higher in votes-for than in votes-against. Conversely, &#8220;criticism&#8221; (measured as the number of edits made by one author and reverted by another editor) is significantly higher in votes-against than in votes-for.</p>
<p>This study complements research on the influence of social ties on adminship discussions <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/2011-09-26#How_social_ties_influence_admin_votes" title="Research:Newsletter/2011-09-26">reviewed in the past edition</a> of the research newsletter.</p>
<h3 id="High_search_engine_rankings_of_Wikipedia_articles_found_to_be_justified_by_quality">High search engine rankings of Wikipedia articles found to be justified by quality</h3>
<p>An article titled &#8220;Ranking of Wikipedia articles in search engines revisited: Fair ranking for reasonable quality?&#8221;, by two professors for information research from the <a href="http://en.wikipedia.org/wiki/Hamburg_University_of_Applied_Sciences" title="en:Hamburg University of Applied Sciences">Hamburg University of Applied Sciences</a> (which appeared earlier this year in the <a href="http://en.wikipedia.org/wiki/Journal_of_the_American_Society_for_Information_Science_and_Technology" title="en:Journal of the American Society for Information Science and Technology">Journal of the American Society for Information Science and Technology</a> and is now available as open access, also in form of a recent <a href="http://en.wikipedia.org/wiki/arxiv" title="en:arxiv">arxiv</a> preprint<sup id="cite_ref-11" class="reference"><a href="#cite_note-11">[12]</a></sup>) addresses &#8220;the fiercely discussed question of whether the ranking of Wikipedia articles in search engines is justified by the quality of the articles&#8221;. The authors recall an earlier paper coauthored by one of them<sup id="cite_ref-12" class="reference"><a href="#cite_note-12">[13]</a></sup> that had found Wikipedia to be &#8220;by far the most popular&#8221; host in search engine results pages (in the US): In &#8220;1000 queries, Yahoo showed the most Wikipedia results within the top 10 lists (446), followed by MSN/Live (387), Google (328), and Ask.com (255)&#8221;. They then set out to investigate &#8220;whether this heavy placement is justified from the user’s perspective&#8221;. First, they re-purposed the results of a 2008 paper of the first author,<sup id="cite_ref-13" class="reference"><a href="#cite_note-13">[14]</a></sup> where students had been asked to judge the relevance of search engine results for 40 queries collected in 2007, restricting them to the search results that consisted of Wikipedia articles – all of them from the German version. They found that &#8220;Wikipedia results are judged much better than the average results at the same ranking position&#8221; by the jurors, and that</p>
<table align="center" style="border-collapse:collapse; border-style:none; background-color:transparent;" class="cquote">
<tr>
<td width="20" valign="top" style="color:#B2B7F2;font-size:35px;font-family:'Times New Roman',serif;font-weight:bold;text-align:left;padding:10px 10px;">“</td>
<td valign="top" style="padding:4px 10px;">The data indicates that contrary to the assumption that Wikipedia articles show up too often in the search engines’ results, the search engines could even think of improving their results through providing more Wikipedia results in the top positions.</td>
<td width="20" valign="bottom" style="color:#B2B7F2;font-size:36px;font-family:'Times New Roman',serif;font-weight:bold;text-align:right;padding:10px 10px;">”</td>
</tr>
</table>
<p>In order to conduct a more thorough investigation (the 2008 assessments having only focused on the criterion of <a href="http://en.wikipedia.org/wiki/Relevance_(information_retrieval)" title="en:Relevance (information retrieval)">relevance</a>), the present paper sets out to develop a set of quality criteria for the evalulation of Wikipedia articles by human jurors. It first gives an overview of existing literature about the <a href="http://en.wikipedia.org/wiki/Reliability_of_Wikipedia" title="en:Reliability of Wikipedia">information quality of Wikipedia</a>, and of encyclopedias in general, identifying four main criteria that several pre-2002 works about the quality of reference works agreed on. Interestingly, &#8220;accuracy&#8221; was not among them, an omission explained by the authors by the difficulty of fact-checking an entire encyclopedia. From this, the authors derive a set of 14 evaluation criteria, incorporating both the general criteria from the literature about reference works and internal Wikipedia criteria such as the status of being a featured/good article, the verifiability of the content and the absence of original research. These were then applied by the jurors (two last year undergraduate students with experience in similar <a href="http://en.wikipedia.org/wiki/Coding_(social_sciences)" title="en:Coding (social sciences)">coding</a> tasks) to 43 German Wikipedia articles that had appeared in the 2007 queries, in their state at that time. While &#8220;the evaluated Wikipedia articles achieve a good score overall&#8221;, there were &#8220;noticeable differences in quality among the examples in the sample&#8221; (the paper contains interesting discussions of several strengths and weaknesses according to the criteria set, e.g. the conjecture that the low score on &#8220;descriptive, inspiring/interesting&#8221; writing could be attributed to &#8220;the German academic style. A random comparison with the English version of individual articles seems to support this interpretation&#8221;).</p>
<p>The authors conclude:</p>
<table align="center" style="border-collapse:collapse; border-style:none; background-color:transparent;" class="cquote">
<tr>
<td width="20" valign="top" style="color:#B2B7F2;font-size:35px;font-family:'Times New Roman',serif;font-weight:bold;text-align:left;padding:10px 10px;">“</td>
<td valign="top" style="padding:4px 10px;">In general, our study could confirm that the ranking of Wikipedia articles in search engines is justified by a satisfactory overall quality of the articles. &#8230; In answer to research question 4b, 4c (&#8216;Is the ranking appropriate? Are good entries ranked high enough?&#8217;), we can say that the rankings in search engines are at least appropriate.</td>
<td width="20" valign="bottom" style="color:#B2B7F2;font-size:36px;font-family:'Times New Roman',serif;font-weight:bold;text-align:right;padding:10px 10px;">”</td>
</tr>
</table>
<p>Both the search engine ranking data and the evaluated Wikipedia article revisions are somewhat dated, referring to January 2007 (the authors themselves note that it &#8220;could well be that in the meantime search engines reacted to that fact [the potential of improving results by ranking Wikipedia higher] and further boosted Wikipedia results&#8221;, and also that regarding the German Wikipedia, the search engine results did not take into account possible effects of the introduction of <a href="http://en.wikipedia.org/wiki/Wikipedia:Flagged_revisions" title="en:Wikipedia:Flagged revisions">stable versions</a> in 2008).</p>
<h3 id="Attempts_to_predict_the_outcome_of_AfD_discussions_from_an_article.27s_edit_history">Attempts to predict the outcome of AfD discussions from an article&#8217;s edit history</h3>
<p>A master&#8217;s thesis defended by Ashish Kumar Ashok, a student in computing at <a href="http://en.wikipedia.org/wiki/Kansas_State_University" title="en:Kansas State University">Kansas State University</a>, describes machine learning methods to determine how the final outcome of an <a href="http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion" title="en:Wikipedia:Articles for deletion">Article for Deletion</a> (AfD) discussion is affected by the editing history of the article.<sup id="cite_ref-14" class="reference"><a href="#cite_note-14">[15]</a></sup> The thesis considers features such as the structure of the graph of revisions of an article (based on text changed, added or removed), the number of edits of the article, the number of disjoint edits (according to some contiguity definition), as well as properties of the corresponding AfD, such as the number of <a href="http://en.wikipedia.org/wiki/Wikipedia:Polling_is_not_a_substitute_for_discussion" title="en:Wikipedia:Polling is not a substitute for discussion"><i>!votes</i></a> and the total length of words used by participants in AfD who expressed their preference to <i>keep</i>, <i>merge</i> or <i>delete</i> the article. Different types of classifiers based on the above features are applied to a small sample of 64 AfD discussions from the <a href="http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion/Log/2011_August_1" title="en:Wikipedia:Articles for deletion/Log/2011 August 1">1 August 2011 deletion log</a>. The results of the analysis indicate that the performance of the classifiers does not significantly improve by considering any of the above features in addition to the sheer number of <i>!votes</i>, which limits the scope and applicability of the methods explored in this work to predict the outcome of AfD discussions. The author suggests that datasets larger than the sample considered in this study should be obtained in order to assess the validity of these methods.</p>
<h3 id="In_brief">In brief</h3>
<ul>
<li><b>Why did Wikipedia succeed while others failed?</b>: In a presentation on October 11 at the <a href="http://en.wikipedia.org/wiki/Berkman_Center_for_Internet_and_Society" title="en:Berkman Center for Internet and Society">Berkman Center for Internet and Society</a> (&#8220;<a href="http://cyber.law.harvard.edu/events/luncheon/2011/10/makohill">Almost Wikipedia: What Eight Collaborative Encyclopedia Projects Reveal About Mechanisms of Collective Action</a>&#8220;, with video), MIT researcher and Wikimedia Foundation advisory board member <a href="http://en.wikipedia.org/wiki/Benjamin_Mako_Hill" title="en:Benjamin Mako Hill">Benjamin Mako Hill</a> presented preliminary results of his research comparing Wikipedia and seven other <a href="http://en.wikipedia.org/wiki/Internet_encyclopedia_project" title="en:Internet encyclopedia project">Internet encyclopedia projects</a> or proposals that did not take off, based on interviews with the projects&#8217; founders as well as examinations of their archives. The event was <a href="http://www.niemanlab.org/2011/10/the-contribution-conundrum-why-did-wikipedia-succeed-while-other-encyclopedias-failed/">summarised</a> for the <a href="http://en.wikipedia.org/wiki/Nieman_Journalism_Lab" title="en:Nieman Journalism Lab">Nieman Journalism Lab</a> (<a href="http://articles.businessinsider.com/2011-10-13/tech/30274391_1_wikipedia-encyclopedias-yochai-benkler">reprinted</a> in <i><a href="http://en.wikipedia.org/wiki/Business_Insider" title="en:Business Insider">Business Insider</a></i>), and in the <i>Signpost</i>: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-10-17/In_the_news#The_little_online_encyclopaedia_that_could" title="en:Wikipedia:Wikipedia Signpost/2011-10-17/In the news">The little online encyclopaedia that could</a>&#8220;. Hill later gave a shorter (ca. 12min) talk about the same topic at the &#8220;Digital commons&#8221; forum (see below): <a href="http://epicenter.media.mit.edu/~mako/digcom/hill-digcom-talking_head.ogv">video</a>, <a href="http://epicenter.media.mit.edu/~mako/digcom/hill-digcom-slides.ogv">slides</a>.</li>
<li><b>&#8220;Digital Commons&#8221; conference</b>: On October 29-30, the &#8220;<a href="http://wiki.digital-commons.net/Main_Page">Building Digital Commons</a>&#8221; conference took place in Barcelona, organized by Catalan Wikimedians and Wikimedia Research Committee member Mayo Fuster Morell, and supported by the Wikimedia Foundation. The <a href="http://www.digital-commons.net/program/">program</a> featured several presentations about Wikipedia research; further online documentation is expected to become available later.</li>
<li><b>Placement of categories examined</b>: A paper from two computer science researchers based at the <a href="http://en.wikipedia.org/wiki/Katholieke_Universiteit_Leuven" title="en:Katholieke Universiteit Leuven">Katholieke Universiteit Leuven</a> examines the order in which categories are placed on a Wikipedia article and reports on connections between a category&#8217;s position in this list and &#8220;its persistence within the article, age, popularity, size, and descriptiveness&#8221;.<sup id="cite_ref-15" class="reference"><a href="#cite_note-15">[16]</a></sup> The order in which categories are added is not determined by any explicit rule. However, the research found, older, more persistent and more exclusive categories are consistently placed in lower positions. Categories appearing at lower positions also tend to do so across all the articles they contain and they include articles that are more similar to each other in terms of category overlap.</li>
<li><b>Visualizing semantic data</b>: A team from the <a href="http://en.wikipedia.org/wiki/UCSB" title="en:UCSB">UCSB</a> Department of Computer Science recently presented<sup id="cite_ref-16" class="reference"><a href="#cite_note-16">[17]</a></sup> <a href="http://www.wigipedia-online.com">WiGiPedia</a>, a tool visualizing rich semantic data about WIkipedia articles, designed to &#8220;inform the user of interesting contextual information pertaining to the current article, and to provide a simple way to introduce and/or repair semantic relations between wiki articles&#8221;. The tool builds on structured data represented via templates, categories and infoboxes and queried via <a href="http://en.wikipedia.org/wiki/DBpedia" title="en:DBpedia">DBpedia</a>. By supporting collaborative editing of rich semantic data and one-click semantic updates of Wikipedia articles, the tool aims to bridge the gap between Wikipedia and DBpedia. The source code of the tool doesn&#8217;t appear to be publicly released.</li>
<li><b>Wikipedia literature review</b>: <a href="http://en.wikipedia.org/wiki/User:Owensmartin" title="en:User:Owensmartin">Owen S. Martin</a> posted to <a href="http://en.wikipedia.org/wiki/arXiv" title="en:arXiv">arXiv</a> a <a href="http://arxiv.org/abs/1110.5863">28-page Wikipedia literature review</a> towards his Ph.D. in statistics.<sup id="cite_ref-17" class="reference"><a href="#cite_note-17">[18]</a></sup> About half the paper gives an overview of Wikipedia&#8217;s database structure; the remainder reviews about 30 recent papers from the perspective of assessing their quality, trust, semantic extraction, governance, economic implications and epistemological implications.</li>
<li><b>Vandalism detection contest</b>: An &#8220;Overview of the 2nd International Competition on Wikipedia Vandalism Detection&#8221; has been published.<sup id="cite_ref-18" class="reference"><a href="#cite_note-18">[19]</a></sup></li>
<li><b>Matching Wikipedia articles to Geonames entries</b>: A four-page paper by two researchers from Hokkaido University<sup id="cite_ref-19" class="reference"><a href="#cite_note-19">[20]</a></sup> explored the problem of &#8220;merging Wikipedia&#8217;s Geo-entities and <a href="http://en.wikipedia.org/wiki/GeoNames" title="en:GeoNames">GeoNames</a>&#8221; to form a larger geographical database. This is already being done by the <a href="http://en.wikipedia.org/wiki/YAGO_(ontology)" title="en:YAGO (ontology)">YAGO</a> (Yet Another Great Ontology) database, but the paper uses additional data beyond the article name, such as categories and disambiguation pages on Wikipedia, in order to identify further matching pairs missed in YAGO (and in the process found several errors in GeoNames).</li>
<li><b>Attempt to examine evolution of key activities in Wikipedia</b>: A paper titled &#8220;Governing Complex Social Production in the Internet: The Emergence of a Collective Capability in Wikipedia&#8221;<sup id="cite_ref-20" class="reference"><a href="#cite_note-20">[21]</a></sup> (presented last month at the &#8220;Decade in Internet Time&#8221; symposium at the <a href="http://en.wikipedia.org/wiki/Oxford_Internet_Institute" title="en:Oxford Internet Institute">Oxford Internet Institute</a>) undertakes &#8220;an exploratory theoretical analysis to clarify the structure and mechanisms driving the endogenous change of [Wikipedia]&#8220;, using the framework of capability theory to construct six hypotheses such as &#8220;the membership in group(s) of contributors that take up governance tasks varies less than in those revolving on content production&#8221;. These are then tested empirically by applying a clustering algorithm to monthly snapshots of the English Wikipedia (until 2009) &#8220;to identify distinct groupings of contributors at each month&#8221;. However, the clustering algorithm leaves out a group of users &#8220;that covers all the observed domains of activity&#8221; and &#8220;despite its relatively small share of overall contributor population &#8230; provides the majority of the work&#8221;, which leads the authors to dub it &#8220;the core editors of Wikipedia&#8221;.</li>
</ul>
<h2 id="References">References</h2>
<div class="references-small">
<ol class="references">
<li id="cite_note-0"><a href="#cite_ref-0">↑</a> Ayers, Phoebe, and Reid Priedhorsky (2011). WikiLit: Collecting the wiki and Wikipedia literature. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 229. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038612"><b>DOI</b></a> • <a href="http://wikisym.org/ws2011/_media/proceedings:p229-ayers.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-1"><a href="#cite_ref-1">↑</a> Antin, Judd, Ed H. Chi, James Howison, Sharoda Paul, Aaron Shaw, and Jude Yew (2011). Apples to oranges? Comparing across studies of open collaboration/peer production. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 227. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038610"><b>DOI</b></a> • <a href="http://wikisym.org/ws2011/_media/proceedings:p227-yew.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-2"><a href="#cite_ref-2">↑</a> Schulenburg, Frank, LiAnna Davis, and Max Klein (2011) Lessons from the classroom.In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 231. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038613"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p231-schulenburg.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-3"><a href="#cite_ref-3">↑</a> Musicant, David R., Yuqing Ren, James A. Johnson, and John Riedl (2011). Mentoring in Wikipedia: a clash of cultures. In <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 173. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038586"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p173-musicant.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-4"><a href="#cite_ref-4">↑</a> Javanmardi, Sara, David W. McDonald, and Cristina V. Lopes (2011). Vandalism detection in Wikipedia. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 82. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038573"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p82-javanmardi.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-5"><a href="#cite_ref-5">↑</a> Livingstone, Randall M (2011). A scourge to the pillar of neutrality. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 209. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038597"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p209-livingstone.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-6"><a href="#cite_ref-6">↑</a> Livingstone, Randall M. (2011) Places on the map and in the cloud: representations of locality and geography in Wikipedia. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 211. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038598"><b>DOI</b></a> <a href="http://www.wikisym.org/ws2011/_media/proceedings:p211-livingstone.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-7"><a href="#cite_ref-7">↑</a> Geiger, R. Stuart, and Heather Ford (2011). Participation in Wikipedia&#8217;s article deletion processes. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 201. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038593"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p201-geiger.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-8"><a href="#cite_ref-8">↑</a> Gorbatai, Andreea D. (2011) Exploring underproduction in Wikipedia. In: <i>Proceedings of the 7th International Symposium on Wikis and Open Collaboration</i> &#8211; WikiSym &#8217;11, 205. New York, New York, USA: ACM Press, 2011. <a href="http://dx.doi.org/10.1145/2038558.2038595"><b>DOI</b></a> • <a href="http://www.wikisym.org/ws2011/_media/proceedings:p205-gorbatai.pdf"><b>PDF</b></a></li>
<li id="cite_note-9"><a href="#cite_ref-9">↑</a> Kupferberg, Natalie, and Bridget McCrate Protus (2011) Accuracy and completeness of drug information in Wikipedia: an assessment. <i>Journal of the Medical Library Association</i> 99(4): 310-3. <a href="http://dx.doi.org/10.3163/1536-5050.99.4.010"><b>DOI</b></a> • <a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3193353/"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-10"><a href="#cite_ref-10">↑</a> Turek, Piotr, Justyna Spychała, Adam Wierzbicki, and Piotr Gackowski (2011) Social Mechanism of Granting Trust Basing on Polish Wikipedia Requests for Adminship. In: <i>Social Informatics 2011</i>. Lecture Notes in Computer Science, 6984:212-225. <a href="http://dx.doi.org/10.1007/978-3-642-24704-0_25"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-11"><a href="#cite_ref-11">↑</a> Lewandowski, Dirk, and Ulrike Spree (2011) Ranking of Wikipedia articles in search engines revisited: Fair ranking for reasonable quality? <i>Journal of the American Society for Information Science</i> 62(1)): 117-132. <a href="http://dx.doi.org/10.1002/asi.21423"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a> • <a href="http://en.wikipedia.org/wiki/arxiv.org" title="en:arxiv.org">arxiv.org</a> <a href="http://arxiv.org/abs/1109.0916"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-12"><a href="#cite_ref-12">↑</a> Höchstötter, Nadine, and Dirk Lewandowski (2009). What users see – Structures in search engine results pages. <i>Information Sciences</i> 179 (12): 1796-1812 <a href="http://dx.doi.org/10.1016/j.ins.2009.01.028"><b>DOI</b></a> • <a href="http://hdl.handle.net/10760/16081"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-13"><a href="#cite_ref-13">↑</a> Lewandowski, Dirk (2008). The retrieval effectiveness of Web search engines: Considering results descriptions. <i>Journal of Documentation</i> 64(6), 915-937 <a href="http://hdl.handle.net/10760/11258"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-14"><a href="#cite_ref-14">↑</a> Ashok, Ashish Kumar (2011). <i>Predictive data mining in a collaborative editing system: the Wikipedia articles for deletion process</i>. <a href="http://hdl.handle.net/2097/12026"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-15"><a href="#cite_ref-15">↑</a> Gyllstrom, Karl, and Marie-Francine Moens (2011) Examining the &#8220;Leftness&#8221; Property of Wikipedia Categories. In: CIKM &#8217;11. <a href="https://lirias.kuleuven.be/bitstream/123456789/318601/1/GyllstromCIKM2011.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-16"><a href="#cite_ref-16">↑</a> Bostandjiev, Svetlin, John O&#8217;Donovan, Brynjar Gretarsson, Christopher Hall, and Tobias Hollerer (2011) WiGiPedia: Visual Editing of Semantic Data in Wikipedia. In: <i>Workshop on Visual Interfaces to the Social and Semantic Web</i> (VISSW2011), <a href="http://bostandjiev.com/Content/publications/bostandjiev.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-17"><a href="#cite_ref-17">↑</a> Martin, Owen S (2011) A Wikipedia Literature Review. <i>ArXiV</i>, October 17, 2011. <a href="http://arxiv.org/pdf/1110.5863v1"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-18"><a href="#cite_ref-18">↑</a> Potthast, Martin, and Teresa Holfeld (2011) Overview of the 2nd International Competition on Wikipedia Vandalism Detection. In: PAN 2011. <a href="http://www.uni-weimar.de/medien/webis/publications/papers/stein_2011u.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-19"><a href="#cite_ref-19">↑</a> Yiqi Liu, and Masaharu Yoshioka (2011) Construction of large geographical database by merging Wikipedia&#8217;s Geo-entities and GeoNames. <a href="http://sigswo.org/papers/SIG-SWO-A1102/SIG-SWO-A1102-03.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-20"><a href="#cite_ref-20">↑</a> Aaltonen, Aleksi, and Giovan Francesco Lanzara (2011) Governing Complex Social Production in the Internet: The Emergence of a Collective Capability in Wikipedia. In <i>Decade in Internet Time symposium</i>. <a href="http://ssrn.com/paper=1926138"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
</ol>
</div>
<p></p>
<div style="float:left; width: 99%; border-top: 1px solid #CCC; border-bottom: 1px solid #CCC;">
<p style="text-align:center; clear:left; background-color: #EEE; padding: .6em; font-size:140%; font-weight:normal;font-family: Georgia, Times, serif; color:#666"><i>Wikimedia Research Newsletter</i><br />
Vol: 1 • Issue: 4 • October 2011<br />
<span style="font-size:80%;">This newletter is brought to you by the <a href="http://meta.wikimedia.org/wiki/Research:Committee" title="Research:Committee">Wikimedia Research Committee</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Signpost" title="w:Wikipedia:Signpost">The Signpost</a><br />
<a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-10-31/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-10-31/Recent research">[signpost edition]</a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter">[contribute]</a> <a href="http://meta.wikimedia.org/wiki/Research:Index" title="Research:Index">[research index]</a></span></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.wikimedia.org/2011/11/02/wikimedia-research-newsletter-october-2011/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
<enclosure url="http://epicenter.media.mit.edu/~mako/digcom/hill-digcom-talking_head.ogv" length="17761668" type="video/ogg" />
<enclosure url="http://epicenter.media.mit.edu/~mako/digcom/hill-digcom-slides.ogv" length="23265037" type="video/ogg" />
		</item>
		<item>
		<title>Wikimedia Research Newsletter, September 2011</title>
		<link>http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/</link>
		<comments>http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#comments</comments>
		<pubDate>Thu, 29 Sep 2011 02:49:40 +0000</pubDate>
		<dc:creator>Tilman</dc:creator>
				<category><![CDATA[Highlights]]></category>
		<category><![CDATA[Wikimedia Research Newsletter]]></category>
		<category><![CDATA[WikiSym]]></category>

		<guid isPermaLink="false">http://blog.wikimedia.org/?p=6600</guid>
		<description><![CDATA[Vol: 1 • Issue: 3 • September 2011 [archives] Top female Wikipedians, reverted newbies, link spam, social influence on admin votes, Wikipedians&#8217; weekends, WikiSym previews With contributions by: Tbayer, Daniel Mietchen, DarTar and Jodi.a.schneider Contents 1 What the most active female editors contribute 2 Effects of reverts on wiki work 3 Further Wikipedia coverage at [...]]]></description>
			<content:encoded><![CDATA[<div style="float:left; width: 99%;">
<p><a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter"><img alt="WRN header.png" src="http://upload.wikimedia.org/wikipedia/commons/7/7d/WRN_header.png" width="554" height="86" /></a></p>
<p style="clear:left; background-color: #EEE; padding: .3em; font-size:200%; font-weight:normal; font-family: Georgia, Times, serif; color:#666">Vol: 1 • Issue: 3 • September 2011 <span style="font-size:75%; float:right; margin-right:1em; text-transform:uppercase"><a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a></span></p>
<p style="clear:left; padding: .3em; line-height: 130%; font-size:150%; font-weight:normal; font-style: italic; font-family: Georgia, Times, serif; color:#666">Top female Wikipedians, reverted newbies, link spam, social influence on admin votes, Wikipedians&#8217; weekends, WikiSym previews</p>
</div>
<p>
<b>With contributions by:</b> <a href="http://meta.wikimedia.org/wiki/User:Tbayer_(WMF)" title="User:Tbayer (WMF)">Tbayer</a>, <a href="http://meta.wikimedia.org/wiki/User:Mietchen" title="User:Mietchen">Daniel Mietchen</a>, <a href="http://meta.wikimedia.org/wiki/User:DarTar" title="User:DarTar">DarTar</a> and <a href="http://meta.wikimedia.org/wiki/User:Jodi.a.schneider" title="User:Jodi.a.schneider">Jodi.a.schneider</a></p>
<table id="toc" class="toc">
<tr>
<td>
<div id="toctitle">
<h2>Contents</h2>
</div>
<ul>
<li><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#What_the_most_active_female_editors_contribute"><span>1</span> <span>What the most active female editors contribute</span></a></li>
<li class="toclevel-1 tocsection-2"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#Effects_of_reverts_on_wiki_work"><span>2</span> <span>Effects of reverts on wiki work</span></a></li>
<li class="toclevel-1 tocsection-3"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#Further_Wikipedia_coverage_at_WikiSym_2011:_Social_dynamics_and_global_reach"><span>3</span> <span>Further Wikipedia coverage at WikiSym 2011: Social dynamics and global reach</span></a></li>
<li class="toclevel-1 tocsection-4"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#Link_spam_research_with_controversial_genesis_but_useful_results"><span>4</span> <span>Link spam research with controversial genesis but useful results</span></a></li>
<li class="toclevel-1 tocsection-5"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#How_social_ties_influence_admin_votes"><span>5</span> <span>How social ties influence admin votes</span></a></li>
<li class="toclevel-1 tocsection-6"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#Wikipedians.27_weekends_in_international_comparison"><span>6</span> <span>Wikipedians&#8217; weekends in international comparison</span></a></li>
<li class="toclevel-1 tocsection-7"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#In_brief"><span>7</span> <span>In brief</span></a></li>
<li class="toclevel-1 tocsection-8"><a href="http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/#References"><span>8</span> <span>References</span></a></li>
</ul>
</td>
</tr>
</table>
<p><span id="more-6600"></span></p>
<h3 id="What_the_most_active_female_editors_contribute">What the most active female editors contribute</h3>
<p>A paper addressing gender imbalance in Wikipedia (&#8220;Gender differences in Wikipedia editing&#8221;) by Judd Antin and collaborators won the &#8220;Best Short Paper&#8221; award at <a href="http://en.wikipedia.org/wiki/WikiSym" title="w:WikiSym">WikiSym</a>.<sup id="cite_ref-0" class="reference"><a href="#cite_note-0">[1]</a></sup> This follows the <a rel="nofollow" class="external text" href="http://www.wikisym.org/2011/09/21/best-paper-winners-for-wikisym-2011/">awarding</a> of &#8220;best full paper&#8221; to another study on the gender gap<sup id="cite_ref-1" class="reference"><a href="#cite_note-1">[2]</a></sup> already covered in <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/2011-08-29#Extensive_analysis_of_gender_gap_in_Wikipedia_to_be_presented_at_WikiSym_2011" title="Research:Newsletter/2011-08-29">previous editions of the research newsletter</a>. The study by Antin and collaborators sampled 256,190 users who created a new account on the English Wikipedia between September 2010 and February 2011 and qualitatively coded their contribution by category of wiki work. The results suggest that, whereas in the lower three quartiles by activity level men and women make roughly the same contributions in each category of wiki work, in the top quartile editors behave in a significantly different way. The researchers found that among the top 25% of Wikipedians by activity level:</p>
<ul>
<li>only 27% of all revisions are made by women;</li>
<li>women tend to make larger revisions than men;</li>
<li>top female editors make significantly larger revisions than men in at least two categories: &#8220;adding new content&#8221; and &#8220;rephrasing existing text&#8221;</li>
</ul>
<h3 id="Effects_of_reverts_on_wiki_work">Effects of reverts on wiki work</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:272px;"><a href="http://meta.wikimedia.org/wiki/File:Revert_effect_on_boldness.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/73/Revert_effect_on_boldness.png/270px-Revert_effect_on_boldness.png" width="270" height="127" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:Revert_effect_on_boldness.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Two plots show the changes in an editor&#8217;s boldness after being reverted.</p></div>
</div>
</div>
<p>Another WikiSym 2011 paper by <a href="http://en.wikipedia.org/wiki/GroupLens" title="w:GroupLens">GroupLens</a> researchers, including Summer of Research fellow <a href="http://meta.wikimedia.org/wiki/User:EpochFail" title="m:User:EpochFail">Aaron Halfaker</a> (&#8220;Don’t bite the newbies: how reverts affect the quantity and quality of Wikipedia work&#8221;), reports on the effects of reverts on the quality and quantity of Wikipedia editors, with a specific focus on newbies.<sup id="cite_ref-2" class="reference"><a href="#cite_note-2">[3]</a></sup> The study uses a number of key metrics to assess the quality of editor contributions (using <i>reverts per revision</i> and <i>Persistent Word Revisions</i> or <i>PWR</i>, to measure the survival across revisions of words added by an editor, other than stop-word) and changes in editor activity (using a <i>controlled activity delta</i> that calculates an editor&#8217;s variation of activity across weeks with respect to the week preceding the revert, normalized by the editor&#8217;s daily rate of activity). The results point at the same time at the important role of reverts as a learning and quality improvement process but also at their negative effects on new contributors. Below are highlights from this study:</p>
<ul>
<li>Compared with their activity prior to a revert, in the first week after a revert reverted editors decrease their activity by 0.1 standard deviations compared with an increase in a control group of non-reverted editors of about 0.3 standard deviations.</li>
<li>It matters who performs a revert: editors reverted by a registered editor do not recover to the average level of activity for at least one month, whereas editors reverted by anonymous users recover much faster.</li>
<li>Reverts affect the quality of one&#8217;s work: reverted editors are less likely to be reverted in the future (particularly in the week after the revert), whereas the probability of being reverted in the control group keeps growing every week. Reverted editors are also less likely to make important changes to an article after being reverted, compared with the control group. However, the productivity of reverted editors in the following weeks increases more rapidly than non-reverted editors.</li>
<li>Reverts affect newbies more negatively. Experienced editors are less affected by reverts on their average activity while newbies are significantly less likely to continue editing after a revert than experienced editors.</li>
</ul>
<p>These results are consistent with the findings by Summer of Research fellows on the effects of <a href="http://meta.wikimedia.org/wiki/Research:Wikimedia_Summer_of_Research_2011/Summary_of_Findings#Community_interaction_with_new_Wikipedians" title="m:Research:Wikimedia Summer of Research 2011/Summary of Findings">community interactions with new Wikipedians</a>.</p>
<h3 id="Further_Wikipedia_coverage_at_WikiSym_2011:_Social_dynamics_and_global_reach">Further Wikipedia coverage at WikiSym 2011: Social dynamics and global reach</h3>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:262px;"><a href="http://meta.wikimedia.org/wiki/File:WikiTrip_egyptian_revolution_screenshot.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/1/19/WikiTrip_egyptian_revolution_screenshot.png/260px-WikiTrip_egyptian_revolution_screenshot.png" width="260" height="172" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:WikiTrip_egyptian_revolution_screenshot.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p>Geographic location of edits for English Wikipedia article <a href="http://en.wikipedia.org/wiki/2011_Egyptian_revolution" title="w:2011 Egyptian revolution">2011 Egyptian revolution</a></div>
</div>
</div>
<ul>
<li>The two papers on gender gap mentioned above will be presented in a session titled <a rel="nofollow" class="external text" href="http://www.wikisym.org/2011/09/14/session-preview-understanding-wikipedia/">Understanding Wikipedia</a>, along with other original works some of which were already <a href="http://meta.wikimedia.org/wiki/Research:Newsletter/2011-08-29#Deleted_revisions_in_the_English_Wikipedia" title="Research:Newsletter/2011-08-29">reviewed</a> in the research newsletter, such as a study by researchers from the <a href="http://en.wikipedia.org/wiki/University_of_Pennsylvania" title="w:University of Pennsylvania">University of Pennsylvania</a> examining revision deletion in the English Wikipedia (see also the summary posted on <a rel="nofollow" class="external text" href="http://acawiki.org/What_Wikipedia_deletes:_Characterizing_dangerous_collaborative_content">AcaWiki</a>).<sup id="cite_ref-3" class="reference"><a href="#cite_note-3">[4]</a></sup></li>
<li>A second research session will be devoted to <a rel="nofollow" class="external text" href="http://www.wikisym.org/2011/09/20/session-preview-wikipedia-as-a-global-phenomenon/">Wikipedia as a global phenomenon</a>. It will feature two papers focusing on Wikipedia&#8217;s coverage of rapidly changing events such as the <a href="http://en.wikipedia.org/wiki/2011_Egyptian_revolution" title="w:2011 Egyptian revolution">2011 Egyptian revolution</a> and the <a href="http://en.wikipedia.org/wiki/2011_T%C5%8Dhoku_earthquake_and_tsunami" title="w:2011 Tōhoku earthquake and tsunami">2011 Tōhoku earthquake and tsunami</a>. The analysis of Wikipedia coverage of the Egyptian revolution, by a team of Italian researchers based in <a href="http://en.wikipedia.org/wiki/Trento" title="w:Trento">Trento</a> (from the same lab that released the <a rel="nofollow" class="external text" href="http://sonetlab.fbk.eu/wikitrip/">WikiTrip visualization</a>, previously <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-08-15/Women_and_Wikipedia#Brief_news" title="w:Wikipedia:Wikipedia Signpost/2011-08-15/Women and Wikipedia">covered in the Signpost</a>), is available as a preprint.<sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[5]</a></sup></li>
</ul>
<h3 id="Link_spam_research_with_controversial_genesis_but_useful_results">Link spam research with controversial genesis but useful results</h3>
<p>The &#8220;<a rel="nofollow" class="external text" href="http://www.wikisym.org/2011/09/19/session-preview-wiki-tools-and-interfaces/">Wiki tools and interfaces</a>&#8221; session at WikiSym will see the presentation of a paper titled &#8220;<a rel="nofollow" class="external text" href="http://www.cis.upenn.edu/~westand/docs/wikisym_11_spam_final.pdf">Autonomous link spam detection in purely collaborative environments</a>&#8220;. According to the five authors from the <a href="http://en.wikipedia.org/wiki/University_of_Pennsylvania" title="w:University of Pennsylvania">University of Pennsylvania</a>, link spam is currently &#8220;an annoying, but non-pervasive issue&#8221;, but could become a grave threat to Wikipedia if new spam techniques that were explored by some of them in another paper (see below) become more widespread.</p>
<p>Using the <a href="http://en.wikipedia.org/wiki/Wikipedia:STiki" title="w:Wikipedia:STiki">STiki</a> software by one of the authors, which is already widely used as an anti-vandalism tool on the English Wikipedia, the researchers collected mainspace edits adding external links and extracted a corpus of 5,962 link additions classified as either ham or spam, using criteria such as whether the edit had been rolled back (to determine spam), or whether it had been added by a user with rollback rights (to determine ham). From this, the researchers derived numerous features that indicate link spamming behavior, in three areas: On-wiki evidence (including very simple metrics such as the URL&#8217;s length – spam links tend to be shorter – or that older and more popular articles are more likely to be targeted), properties of the <a href="http://en.wikipedia.org/wiki/landing_page" title="w:landing page">landing page</a> that the link points to (these were found to be less useful), and classification from third-party sites, including Alexa and Google Safe Browsing. The backlinks data provided by Alexa proved to be most useful for the classifier that the authors went on to construct, and tested in a live implementation in the STiki tool. They conclude that &#8220;it is clear this work will benefit the Wikipedia community&#8221;.</p>
<p>In another paper, presented earlier this month at CEAS ‘11, five authors from the same university including two of the same researchers examine the possibility of &#8220;<a rel="nofollow" class="external text" href="http://www.cis.upenn.edu/~westand/docs/ceas_11_wiki_spam_final.pdf">Link spamming Wikipedia for profit</a>&#8220;. They picture spam detection on Wikipedia as a pipelined process, with the <a href="http://meta.wikimedia.org/wiki/MediaWiki:Spam-blacklist" title="MediaWiki:Spam-blacklist">MediaWiki spam blacklist</a> as the first stage (currently containing around 17000 regular expressions), recent changes patrollers (often aided by software tools) as the next – often reacting within seconds after an edit, watchlisters as the third (within minutes to days), and finally review by normal readers as the last stage. Based on a spam/ham corpus constructed as in the other paper, this paper contains some further analysis of the characteristics of link spam destinations and spamming accounts, and of the exposure spammed links receive before they are removed (determined by both the link&#8217;s lifespan and the popularity of the spammed page). The most sensitive part of the paper then leverages these results to &#8220;describe a novel and efficient spam model we estimate can significantly outperform status quo techniques&#8221;, e.g. by rapidly adding links to exploit the time lag of Wikipedia&#8217;s spam removal process, or targetting popular pages. In a nod to <a href="http://en.wikipedia.org/wiki/WP:BEANS" title="w:WP:BEANS">WP:BEANS</a>, the researchers admit that &#8220;there is the possibility that we have introduced previously unknown vectors&#8221;, but the &#8220;Ethical Considerations&#8221; section emphasizes that:</p>
<dl>
<dd>&#8220;It is in no way this research’s intention to facilitate damage to Wikipedia or any wiki host. The vulnerabilities discussed in this section have been disclosed to Wikipedia’s parent organization, the Wikimedia Foundation (WMF). Further, the WMF was notified regarding the publication schedule of this document and offered technical assistance.&#8221;</dd>
</dl>
<p>The authors also point to the implementation of the spam mitigation tool described in the WikiSym article.</p>
<p>However, the paper fails to mention that last year, one of its authors conducted actual, extensive tests of spamming techniques on the English Wikipedia that are very similar to those outlined in the paper. The spam attacks gained the attention of several IT security news websites, and even involved setting up a fake webshop to measure how many Wikipedia readers would have carried out an actual purchase of the penis enlargement pills advertised in the links. The case led to the researcher&#8217;s temporary ban as a Wikipedia user, later lifted by the arbitration committee, and informed the research guidelines drafted later that year by the Wikimedia Foundation&#8217;s <a href="http://meta.wikimedia.org/wiki/Research_Committee" title="Research Committee" class="mw-redirect">Research Committee</a>. See <i>Signpost</i> coverage: &#8220;<a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2010-08-16/Spam_attacks" title="w:Wikipedia:Wikipedia Signpost/2010-08-16/Spam attacks">Large scale vandalism revealed to be &#8216;study&#8217; by university researcher</a>&#8221; (includes a background interview with the researcher).</p>
<h3 id="How_social_ties_influence_admin_votes">How social ties influence admin votes</h3>
<p>A paper by three researchers from the <a href="http://en.wikipedia.org/wiki/University_of_the_Philippines_Diliman" title="w:University of the Philippines Diliman">University of the Philippines Diliman</a><sup id="cite_ref-5" class="reference"><a href="#cite_note-5">[6]</a></sup>, presented at the International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2011) two months ago, examined statistical relations between the voting behavior in <a href="http://en.wikipedia.org/wiki/WP:RfA" title="w:WP:RfA">requests for adminships (RfAs)</a> and the on-wiki social contacts of participants. The paper includes a brief review of existing literature (in particular two papers which already studied the relation with existing social networks<sup id="cite_ref-6" class="reference"><a href="#cite_note-6">[7]</a></sup><sup id="cite_ref-7" class="reference"><a href="#cite_note-7">[8]</a></sup>). Drawing from a January 2008 dump of the English Wikipedia, they analyzed 2,587 elections conducted between 2004 and 2008 (48% of them successful, with 7,231 users voting or running in at least one RfA, and 80% of the final non-neutral votes being supportive), and &#8220;1,097,223 instances of communication between 265,155 distinct pairs of users&#8221; who had run or voted in an RfA – from user talk page messages, an undirected <a href="http://en.wikipedia.org/wiki/social_graph" title="w:social graph">social graph</a> was generated. Their results concern three areas:</p>
<ul>
<li>&#8220;Factors that motivate participation&#8221;: As a first result, the researchers found that the number of a user&#8217;s contacts who already voted in an RfA, and (more strongly) whether the user had been in contact with the candidate, &#8220;contribute positively to the probability of a user’s participation in an election. This may be due to the fact that voters are inclined to support candidates with whom they are acquainted with.&#8221;</li>
<li>As &#8220;Factors that influence voting&#8221; (i.e. the support/oppose decision) the authors considered the numbers of &#8220;support&#8221; and &#8220;oppose&#8221; votes that a user&#8217;s contacts have already cast when the user votes, and whether the user had been in contact with the candidate before. All yielded regression coefficients with the expected sign (acquaintance with the candidate weighing positively), and the authors conclude that &#8220;we can already explain voting behavior by just examining the immediate neighborhood of a voter&#8221;, but note that &#8220;it is interesting to note that the presence of contacts who have voted negatively weighs more heavily compared with those who voted positively.&#8221;</li>
<li>Finally, the paper examined &#8220;Influential voters in the social network&#8221;, by calculating various well-known <a href="http://en.wikipedia.org/wiki/social_network" title="w:social network">social network</a> metrics for both the &#8220;support&#8221; and &#8220;oppose&#8221; camps in each election (&#8220;<a href="http://en.wikipedia.org/wiki/Degree_(graph_theory)" title="w:Degree (graph theory)">degree</a>, <a href="http://en.wikipedia.org/wiki/closeness_centrality" title="w:closeness centrality">closeness centrality</a>, <a href="http://en.wikipedia.org/wiki/betweenness_centrality" title="w:betweenness centrality">betweenness centrality</a>, <a href="http://en.wikipedia.org/wiki/HITS_algorithm" title="w:HITS algorithm">authority, hub</a>, <a href="http://en.wikipedia.org/wiki/PageRank" title="w:PageRank">PageRank</a>, <a href="http://en.wikipedia.org/wiki/clustering_coefficient" title="w:clustering coefficient">clustering coefficient</a>, and <a href="http://en.wikipedia.org/wiki/eigenvector_centrality" title="w:eigenvector centrality">eigenvector centrality</a>&#8220;, averaged over all voters in each camp, and combined into a weighted difference). Closeness, PageRank, and eigenvector centrality were found to have the largest regression coefficients in predicting the outcome of an RfA, suggesting to the authors &#8220;that decisions of influential nodes can affect the outcome of the RfA process. Although it was not studied in this paper, a possible explanation for this result is that influential users may sway other users to vote the same way and this aggregate behavior may have an impact on the result of the election&#8221;.</li>
</ul>
<h3 id="Wikipedians.27_weekends_in_international_comparison">Wikipedians&#8217; weekends in international comparison</h3>
<p>A paper titled &#8220;Temporal characterization of the requests to Wikipedia&#8221; examined how search requests, read accesses and edits on Wikipedia change over time, and relate to those at the entirety of Wikimedia sites (based on squid logs for the whole year of 2009, provided by the Wikimedia Foundation). Among findings are differences between language versions of Wikipedia, such as that the &#8220;the number of edits tends to raise in weekends&#8221; for the French, Japanese, Dutch and Polish Wikipedia, but not for other languages. Another paper, titled &#8220;Circadian patterns of Wikipedia editorial activity: A demographic analysis&#8221;<sup id="cite_ref-8" class="reference"><a href="#cite_note-8">[9]</a></sup>, similarly analyzed &#8220;34 Wikipedias in different languages [trying] to characterize and find the universalities and differences in temporal activity patterns of editors&#8221;, with the underlying data provided by the German Wikimedia chapter from the <a href="http://en.wikipedia.org/wiki/WP:TOOLSERVER" title="w:WP:TOOLSERVER">toolserver</a>. They found that &#8220;in contrast to diurnal [daily] pattern, which is universal to a great extent, weekly activity patterns of WPs show remarkable differences. We could, however, identify two main categories, namely &#8216;weekends&#8217; and &#8216;working days&#8217; active WPs.&#8221;<sup id="cite_ref-9" class="reference"><a href="#cite_note-9">[10]</a></sup></p>
<h3 id="In_brief">In brief</h3>
<ul>
<li><b>Gender bias in Wikipedia and Britannica</b>: An article by Joseph Reagle and Lauren Rhue titled &#8220;Gender bias in Wikipedia and Britannica&#8221; examines gender bias in biographical coverage, comparing the English Wikipedia and the <a href="http://en.wikipedia.org/wiki/Encyclopedia_Britannica" title="w:Encyclopedia Britannica">Encyclopedia Britannica</a>.<sup id="cite_ref-10" class="reference"><a href="#cite_note-10">[11]</a></sup> The study suggests that &#8220;Wikipedia provides better coverage and longer articles, and that it typically has more articles on women than Britannica in absolute terms, but we also find that Wikipedia articles on women are more likely to be missing than are articles on men relative to Britannica&#8221;. See the <a rel="nofollow" class="external text" href="http://reagle.org/joseph/blog/social/wikipedia/gender-bias-in-wp-eb">accompanying blog post</a> with the full datasets used in this study.</li>
<li><b>Wikipedia as a potlatch</b>: Spanish researcher Felipe Ortega compares Wikipedia to the <a href="http://en.wikipedia.org/wiki/potlatch" title="w:potlatch">potlatch</a>, a traditional gift-giving ceremony whose participants gain status based on the generosity of their gifting, in <a rel="nofollow" class="external text" href="http://blog.felipeortega.net/2011/09/20/the-digital-potlatch/">this blog post</a> summarizing his new book with Joaquín Rodríguez ( “El Potlatch Digital: Wikipedia y el Triunfo del Procomún y el Conocimiento Compartido” ["The Digital Potlatch: Wikipedia and the Triumph of Commons and Shared Knowledge"], published in Spanish by <a href="http://es.wikipedia.org/wiki/Ediciones_C%C3%A1tedra" title="es:Ediciones Cátedra">Ediciones Cátedra</a>.<sup id="cite_ref-11" class="reference"><a href="#cite_note-11">[12]</a></sup> Drawing from new qualitative research (interviews with editors of the Spanish Wikipedia) as well as existing quantitative research, the book concludes that recognizing the gifts Wikipedians make, through meritocracy and explicit acknowledgement, helps motivate participation.</li>
<li>
<div style="text-align:center;border:1px solid #ccc;margin:2px;float:right;clear:right;margin:0.5em 0 0.8em 1.4em;">
<div style="padding: 3px !important; border: 1px solid rgb(204, 204, 204); text-align: center; overflow: hidden; font-size: 94%; background-color: white; width:252px;"><a href="http://meta.wikimedia.org/wiki/File:PSBOCT.png" class="image"><img alt="" src="http://upload.wikimedia.org/wikipedia/commons/thumb/9/95/PSBOCT.png/250px-PSBOCT.png" width="250" height="207" style="border:1px solid #ccc;" /></a></p>
<div style="border:none;text-align:left;line-height:1.4em;padding:3px !important;font-size:94%;">
<div style="float:right;border:none !important;background:none !important;"><a href="http://meta.wikimedia.org/wiki/File:PSBOCT.png" class="internal" title="Enlarge"><img src="http://bits.wikimedia.org/skins-1.18/common/images/magnify-clip.png" width="15" height="11" alt="" /></a></div>
<p><a href="http://en.wikipedia.org/wiki/Bowel_obstruction" title="w:Bowel obstruction">Bowel obstruction</a> was one of the articles edited by the University of Texas students during the course.</div>
</div>
</div>
<p><b>How medical students edit Wikipedia</b>: A paper published last month by the Kansas Journal of Medicine asked &#8220;Are students able and willing to edit Wikipedia to learn components of evidence-based practice?&#8221; <sup id="cite_ref-12" class="reference"><a href="#cite_note-12">[13]</a></sup> In 2007 and 2008, two groups of senior medical students at the University of Texas Health Science Center at San Antonio participated in an exercise where they were asked &#8220;to place succinct summaries of [medical] studies in Wikipedia&#8221; (after a four hour introductory course on wikis). In a survey, 91% of them said that the project should be offered again in the next year, and 71% planned to edit Wikipedia again. (The authors caution that this group was self-selected.) The articles were examined two months after their edits, and 46% of the students had their contribution improved in some way, while &#8220;the pages edited by 62% of students had additional edits in response to incidental vandalism to the pages, but in no instance was the vandalism done to an edit by a student&#8221;.</li>
<li><b>Ethnography of wikiculture set free</b>: Joseph Reagle&#8217;s 2010 book on the cultural dynamics of Wikipedia, <i><a href="http://en.wikipedia.org/wiki/Good_Faith_Collaboration" title="w:Good Faith Collaboration">Good Faith Collaboration</a></i>, is now <a rel="nofollow" class="external text" href="http://reagle.org/joseph/2010/gfc/">freely available</a> to read online, having been released under an accommodating Creative Commons licence (CC BY-NC-SA 3.0).<sup id="cite_ref-13" class="reference"><a href="#cite_note-13">[14]</a></sup></li>
<li><b>Provenance for Wikipedia articles</b>: A (closed access) doctoral dissertation defended at the University of Arizona presents a &#8220;domain ontology of provenance for Wikipedia based on the W7 model&#8221;, building on the notion of the <a href="http://en.wikipedia.org/wiki/five_Ws" title="w:five Ws">five Ws</a>. The author applies this ontology to extract provenance for Wikipedia articles and to assess their quality, thereby identifying &#8220;several collaboration patterns that are preferable or detrimental for data quality&#8221;.<sup id="cite_ref-14" class="reference"><a href="#cite_note-14">[15]</a></sup></li>
<li><b>Geographies of the World&#8217;s Knowledge</b>: as already mentioned in <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-09-19/In_the_news" title="w:Wikipedia:Wikipedia Signpost/2011-09-19/In the news">last week&#8217;s Signpost</a>, the <a rel="nofollow" class="external text" href="http://www.floatingsheep.org/2011/09/two-of-floatingsheep-collective-have.html">floatingsheep collective</a>, in collaboration with the <a href="http://en.wikipedia.org/wiki/Oxford_Internet_Institute" title="w:Oxford Internet Institute">Oxford Internet Institute</a>, released a report titled &#8220;Geographies of the World&#8217;s Knowledge&#8221; visualizing the temporal and geographical distribution of Wikipedia articles.<sup id="cite_ref-15" class="reference"><a href="#cite_note-15">[16]</a></sup> Drawing from roughly 1.5 million articles in a 2010 database download, the report revealed among other findings that more articles had been written about Antarctica (7,800) than any South American or African nation, that the country with the most internet users (China) accounted for barely 1% of articles, that its biographical articles overwhelmingly geolocate to Western Europe and, from the 18th century on, North America, and that vastly more biographies per year were written for the 20th and particularly the 21st century compared to preceding time periods. The report is released under a Creative Commons BY-NC-ND license.</li>
<li><b>Wikipedia found to have grown until 2007</b>: A paper by a sociology researcher from the <a href="http://en.wikipedia.org/wiki/University_of_York" title="w:University of York">University of York</a>, titled &#8220;Measuring the Development of Wikipedia&#8221;<sup id="cite_ref-16" class="reference"><a href="#cite_note-16">[17]</a></sup>, explores the development of the number of edits and the number of participants on the English Wikipedia from 2002 to 2007 (curiously asserting that &#8220;there is only 6 years data&#8221;). As first result, the research &#8220;reveals that the number of edits and the total number of participants both increased in Wikipedia from 2002 to 2007&#8243;. The paper&#8217;s most tangible contribution appears to consist of histograms plotting the number of users with a particular edit count in each of the years 2002 to 2007, which the author finds &#8220;are similar with the <a href="http://en.wikipedia.org/wiki/Pareto_distribution" title="w:Pareto distribution">Pareto distribution</a> in the shape, [and therefore] we assume that the participation situation in Wikipedia is one type of the Pareto Distribution&#8221;. A large part of the four page paper (available for $26) is devoted to general explanations of this distribution. It also mentions the need to use a statistical method such as <a href="http://en.wikipedia.org/wiki/maximum_likelihood_estimation" title="w:maximum likelihood estimation">maximum likelihood estimation</a> to confirm the optical impression that the histograms follow the Pareto distribtion, but it remains unclear if the author actually carried this out. Also, despite emphasizing several times the importance of determining the changes in the <i>k</i> parameter over the years (a measure of the inequality associated with the postulated Pareto distribution) – calling it &#8220;vital to model the participation situation in Wikipedia&#8221; -, the actual values are never given. The abstract promises &#8220;an equation to predict future development trend of Wikipedia&#8221;, but it remains unclear to this reviewer which equation this refers to.</li>
</ul>
<h2 id="References">References</h2>
<div class="references-small">
<ol class="references">
<li id="cite_note-0"><a href="#cite_ref-0">↑</a> Antin, Judd, Raymond Yee, Coye Cheshire, and Oded Nov (2011). Gender Differences in Wikipedia Editing. <i>WikiSym 2011: Proceedings of the 7th International Symposium on Wikis</i>, 2011. <a rel="nofollow" class="external text" href="http://people.ischool.berkeley.edu/~coye/Pubs/Articles/GenderWikiSym2011.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-1"><a href="#cite_ref-1">↑</a> S.T.K. Lam, A. Uduwage, Z. Dong, S. Sen, D.R. Musicant, L. Terveen, and J. Riedl (2011). WP:Clubhouse? An Exploration of Wikipedia&#8217;s Gender Imbalance. In <i>WikiSym 2011: Proceedings of the 7th International Symposium on Wikis</i>, 2011. <a rel="nofollow" class="external text" href="http://grouplens.org/system/files/wp-gender-wikisym2011.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-2"><a href="#cite_ref-2">↑</a> Halfaker, Aaron, Aniket Kittur, and John Riedl (2011). Don&#8217;t Bite the Newbies: How Reverts Affect the Quantity and Quality of Wikipedia Work. <i>WikiSym &#8217;11: Proceedings of the 7th International Symposium on Wikis</i>. <a rel="nofollow" class="external text" href="http://www.grouplens.org/system/files/halfaker11bite.personal.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-3"><a href="#cite_ref-3">↑</a> A.G. West and I. Lee (2011). What Wikipedia Deletes: Characterizing Dangerous Collaborative Content. In <i>WikiSym 2011: Proceedings of the 7th International Symposium on Wikis</i>. <a rel="nofollow" class="external text" href="http://www.cis.upenn.edu/~westand/docs/wikisym_11_revdel_final.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-4"><a href="#cite_ref-4">↑</a> Ferron, Michela, and Paolo Massa (2011). Collective memory building in Wikipedia: The case of North African uprisings. <i>WikiSym 2011: Proceedings of the 7th International Symposium on Wikis</i>. <a rel="nofollow" class="external text" href="http://www.gnuband.org/papers/collective_memory_building_in_wikipedia_the_case_of_north_african_uprisings/"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-5"><a href="#cite_ref-5">↑</a> Cabunducan, Gerard, Ralph Castillo, and John Boaz Lee (2011). Voting behavior analysis in the election of Wikipedia admins. In: <i>2011 International Conference on Advances in Social Networks Analysis and Mining</i>, 545–547. IEEE <a rel="nofollow" class="external text" href="http://dx.doi.org/10.1109/ASONAM.2011.42"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-6"><a href="#cite_ref-6">↑</a> J. Leskovec, D. Huttenlocher, J. Kleinberg (2010) Predicting positive and negative links in online social networks. <i>ACM WWW International conference on World Wide Web</i> (WWW &#8217;10), 2010. <a rel="nofollow" class="external text" href="http://videolectures.net/www2010_leskovec_ppn/">video</a> <a rel="nofollow" class="external text" href="http://cs.stanford.edu/people/jure/pubs/signs-www10.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-7"><a href="#cite_ref-7">↑</a> J. Leskovec, D. Huttenlocher, J. Kleinberg (2010) Governance in Social Media: A case study of the Wikipedia promotion process. In: <i>AAAI International Conference on Weblogs and Social Media</i> (ICWSM &#8217;10). <a rel="nofollow" class="external text" href="http://videolectures.net/icwsm2010_leskovec_gsm/">video</a> <a rel="nofollow" class="external text" href="http://cs.stanford.edu/people/jure/pubs/voting-icwsm10.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-8"><a href="#cite_ref-8">↑</a> Yasseri, Taha, Sumi, Róbert, Kerétsz, János (2011). Circadian patterns of Wikipedia editorial activity: A demographic analysis, <i>ArXiV</i> (September 8, 2011). <a rel="nofollow" class="external text" href="http://arxiv.org/abs/1109.1746"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-9"><a href="#cite_ref-9">↑</a> Reinoso, Antonio J., Jesus M. Gonzalez-Barahona, Rocio Muñoz-Mansilla, and Israel Herraiz (2011). Temporal characterization of the requests to Wikipedia. In <i>Proceedings of the 5th International Workshop on New Challenges in Distributed Information Filtering and Retrieval</i> (DART 2011). ETSI Caminos, Canales y Puertos (UPM), September 13, 2011. <a rel="nofollow" class="external text" href="http://oa.upm.es/8836/1/temporalCharac.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-10"><a href="#cite_ref-10">↑</a> Reagle, Joseph, and Lauren Rhue (2011). Gender Bias in Wikipedia and Britannica. <i>International Journal of Communication</i> 5 (2011): 1138–1158. <a rel="nofollow" class="external text" href="http://ijoc.org/ojs/index.php/ijoc/article/view/777"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-11"><a href="#cite_ref-11">↑</a> José Felipe Ortega and Joaquín Rodríguez López (2011). <i>El potlatch digital. Wikipedia y el triunfo del procomún y el conocimiento compartido</i>, Catedra, September 2011. <a rel="nofollow" class="external text" href="http://www.catedra.com/cgigeneral/newFichaProducto.pl?obrcod=2885118"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-12"><a href="#cite_ref-12">↑</a> Badgett, Robert G, and Mary Moore (2011). Are students able and willing to edit Wikipedia to learn components of evidence-based practice? <i>Kansas Journal of Medicine</i> 4(3), August 30, 2011. <a rel="nofollow" class="external text" href="http://hdl.handle.net/2271/976"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-13"><a href="#cite_ref-13">↑</a> Reagle, Joseph M. (2010). <i>Good Faith Collaboration: The Culture of Wikipedia</i>. The MIT Press, 2010. <a rel="nofollow" class="external text" href="http://reagle.org/joseph/2010/gfc/"><b>HTML</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-14"><a href="#cite_ref-14">↑</a> Liu, J. (2011). <i>W7 model of provenance and its use in the context of Wikipedia</i>. PhD dissertation, The University of Arizona, 2011. <a rel="nofollow" class="external text" href="http://gradworks.umi.com/34/60/3460890.html"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-15"><a href="#cite_ref-15">↑</a> Graham, M., Hale, S. A. and Stephens, M. (2011) <i>Geographies of the World’s Knowledge</i>. Ed. Flick, C. M., London, Convoco! Edition. <a rel="nofollow" class="external text" href="http://www.oii.ox.ac.uk/publications/convoco_geographies_en.pdf"><b>PDF</b></a> <a href="http://en.wikipedia.org/wiki/Open_access" title="Open access"><img alt="Open access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/7/77/Open_Access_logo_PLoS_transparent.svg/12px-Open_Access_logo_PLoS_transparent.svg.png" width="12" height="19" /></a></li>
<li id="cite_note-16"><a href="#cite_ref-16">↑</a> He, Zeyi (2011). Measuring the Development of Wikipedia. In <i>2011 International Conference on Internet Technology and Applications</i>, IEEE <a rel="nofollow" class="external text" href="http://dx.doi.org/10.1109/ITAP.2011.6006393"><b>DOI</b></a> <a href="http://en.wikipedia.org/wiki/Closed_access" title="Closed access"><img alt="Closed access" src="http://upload.wikimedia.org/wikipedia/commons/thumb/c/c1/Closed_Access_logo_alternative.svg/12px-Closed_Access_logo_alternative.svg.png" width="12" height="19" /></a></li>
</ol>
</div>
<p></p>
<div style="float:left; width: 99%; border-top: 1px solid #CCC; border-bottom: 1px solid #CCC;">
<p style="text-align:center; clear:left; background-color: #EEE; padding: .6em; font-size:140%; font-weight:normal;font-family: Georgia, Times, serif; color:#666"><i>Wikimedia Research Newsletter</i><br />
Vol: 1 • Issue: 3 • September 2011<br />
<span style="font-size:80%;">This newletter is brought to you by the <a href="http://meta.wikimedia.org/wiki/Research:Committee" title="Research:Committee">Wikimedia Research Committee</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Signpost" title="w:Wikipedia:Signpost">The Signpost</a><br />
<a href="http://meta.wikimedia.org/wiki/Research:Newsletter/Archives" title="Research:Newsletter/Archives">[<!--  -->archives]</a> <a href="http://en.wikipedia.org/wiki/Wikipedia:Wikipedia_Signpost/2011-09-26/Recent_research" title="w:Wikipedia:Wikipedia Signpost/2011-09-26/Recent research">[signpost edition]</a> <a href="http://meta.wikimedia.org/wiki/Research:Newsletter" title="Research:Newsletter">[contribute]</a> <a href="http://meta.wikimedia.org/wiki/Research:Index" title="Research:Index">[research index]</a></span></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.wikimedia.org/2011/09/28/wikimedia-research-newsletter-september-2011/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Database Caching 1/27 queries in 0.030 seconds using memcached
Object Caching 322/371 objects using memcached

Served from: blog.wikimedia.org @ 2012-02-23 13:15:32 -->
