Wikipedia:Wikipedia Signpost/Single/2021-06-27

The Signpost
Single-page Edition
WP:POST/1
27 June 2021

 


2021-06-27

Elections, Wikimania, masking and more

Board of Trustees elections

2021 WMF Board elections timeline

Fourteen candidates have thrown their hats into the ring for the four vacant seats on the Wikimedia Foundation Board of Trustees. The winning candidates will serve three-year terms. As of press time, you have one more day to submit your candidacy.

You may also submit questions for the candidates, or endorse or comment on others' questions through June 29. So far 40 questions have been submitted.

The diverse group of candidates includes four women, one current board member, and three Africans. About a dozen nationalities are represented.

The elections were cancelled last year while plans to redesign the Board were in progress. Voting will be held August 4–17 with eligible voters casting single transferable votes.

Virtual Wikimania August 13–17

Wikimania, the annual conference hosted by the Wikimedia Foundation, will be a virtual event this year for the first time in its 15-year history.

Submissions for presentations are now closed, with almost 300 submissions received. Registration will open soon.

Masking IPs proposal

The WMF is starting to make its proposal to mask IP addresses more concrete. WMF Legal has stated that this type of proposal must be implemented for legal reasons, though the WMF is still gathering evidence and opinions on how best to implement it. Be sure to see the talk page for the most recent community feedback.

Another very long, and somewhat disorganized, discussion at Wikipedia:Village pump (WMF)/Archive 3#IP Masking Update seems to have significant support for simply banning IP editing if the IP masking is put into effect.

Board chair steps down, becomes WMF consultant

María Sefidari, Raystorm, has stepped down as chair of the WMF Board of Trustees. It was then announced on a Wikimedia mailing list that she would become a paid consultant. The reaction on the mailing list, starting with Chris Keating, was "flabbergasted". Michael Maggs, Philip Kopetzky and others joined in saying that it looked like a conflict of interest and a general governance mistake that the WMF would never allow WMF affiliates to get away with.

There is now also a discussion at The Village Pump (WMF), under the heading Self-Dealing on the WMF Board.

Brief notes




Reader comments

2021-06-27

Boris and Joe, reliability, love, and money

Boris gives Joe an outstanding present

A diplomatic exchange of gifts involving a photo on Commons (right) was thoroughly dissected by The Washington Post, The Independent, The Times and many other newspapers.

US President Joe Biden gave UK Prime Minister Boris Johnson a $6,000 bicycle. Boris gave Joe a photograph of a mural in Edinburgh of Frederick Douglass taken from Commons. An earlier version of the story in The Times suggested that Boris' present was not really adequate. Sorry, I didn't archive the earlier story in time, but in any case The Signpost disagrees. It's wonderful that Johnson recognized the importance of US–UK shared history—including Douglass's life and general race relations in both countries. It's even better that Johnson gave a freely licensed photo available on Commons. Why waste the taxpayers' money when such a thoughtful gift is libre as well as gratis.

The photographer Melissa Highton gives due credit to the muralist Ross Blair (AKA TrenchOne). Highton is a dual US–UK national with roots in Maryland and Scotland. She says that the "incident gained a wee bit of media coverage" and that it "just goes to show that serendipitous things happen when you share openly."

More prime ministers and presidents should give such gifts. We welcome readers' suggestions in the Comments section below on what Commons photos their political leaders should give to other countries' leaders. Be sure to include the filename, e.g. for Biden to France's Emmanuel Macron, include [[:File:New Colossus manuscript Lazarus.jpg]], with a colon in front of "File".

The most reliable source on the internet?

PC Magazine interviews Amy Bruckman, Professor at Georgia Tech, and a keynote speaker at the upcoming IntelliSys 2021 Conference, who also edits Wikipedia occasionally. She states:

The content of a popular Wikipedia page is actually the most reliable form of information ever created. Think about it—a peer-reviewed journal article is reviewed by three experts ... and then is set in stone.... A popular Wikipedia page might be reviewed by thousands of people. If something changes, it is updated.... On the other hand, a less popular Wikipedia page might not be reliable at all.

Her book Should You Believe Wikipedia? will be published in 2022 by Cambridge University Press.

Canada's Global News offers a slightly different perspective but concludes with similar advice.

Arabic Wikipedia grows in size and scope

Reuters reports that Arabic Wikipedia now has more than 1 million articles. The story, written by Mahmoud Mourad, notes that in 2020 the number of registered users to the site expanded by 44%, and is now at 136,000. A brief interview with a Wikipedian, Anass Sedrati, contains this quote which gets to the heart of Wikipedia: "We can consider this as an attempt to provide knowledge to those who have not had the same opportunities as us." In another example of the Wikipedia ethos (and some commentary on our notability policies!), a second Wikipedian shares: "...We must have an article on everything that exists and on everything related to our life." Happy editing to all those over at Arabic Wikipedia!

Wikipedia, love, and marriage

They met about 2010 in a "chat room" on Wikipedia, according to The New York Times among a community of "passionate moderators, writers, and editors". In 2013 they met in person at a Wiki-wedding in New Jersey. Wikimania 2014 in London was next. The relationship heated up a bit in 2016 and again in 2019. This Pi day, March 14, the two were married celebrating with lemon meringue and apple pies. In their spare time they've managed a total of over 30,000 edits. The Signpost sends our warmest congratulations to IShadowed and Shirik.

Apparent plagiarism leads Elsevier to retract periodic table book

Chemistry World reports that academic publisher Elsevier has retracted the book The Periodic Table: Nature’s Building Blocks: An Introduction to the Naturally Occurring Elements, Their Origins and Their Uses based on accusations that the book plagiarized Wikipedia. An occasional Wikipedia editor Thomas Rauchfuss, an inorganic chemistry professor at the University of Illinois Urbana-Champaign was notified by another Wikipedian of the apparent plagiarism, and checked it out in some detail. After contacting the book's three co-authors without receiving a satisfactory reply, Rauchfuss notified Elsevier who pulled the book.

Canadian students write articles about Indigenous people and social justice

Canadian editorial outlet Troy Media publishes the story "Students bring Indigenous perspectives to Wikipedia", which was originally published in the University of Alberta's magazine Folio. The story reports on Dr. Nykkie Lugosi-Schimpf and one of her students, Kris Cromwell, and how they used Wikipedia in Lugosi-Schimpf's course Colonialism and the Criminal Justice System.

In the course, students created Wikipedia articles about Indigenous people and the Canadian justice system. Lugosi-Schimpf was inspired to create the course after reading a story about Erin O'Neil, the Wikipedian in Residence for University of Alberta, and O'Neil's discussion about how Wikipedia can be used for social justice projects. Lugosi-Schimpf then contacted Wiki Education to set up the course. Previously the course had students work directly with organizations and communities to fulfill the service learning portion of the course. With the COVID-19 pandemic, such in-person work was not possible and Wikipedia presented an alternative. According to Lugosi-Schimpf, it also presented a great opportunity for students as well. "Indigenous peoples have been spoken for and talked about, and I think that for students to be able to speak for themselves, about themselves, rather than have someone else tell their stories is really important." Students peer-reviewed each other's work before the final articles were posted to Wikipedia, which gave them scholarly editing skills as well as a connection to each other during a difficult time socially due to the pandemic.

Kris Cromwell relates that she was excited by the opportunity presented by the course to impact how Wikipedia depicts racialized groups. "I'm a Black woman in the Faculty of Native Studies, and I thought it was important to have something from the perspective of a racialized person with education and training in critical Indigenous studies." She created Indigenous peoples and the Canadian criminal justice system and has continued to edit. She says "while not everyone can be a journalist, everyone can be a Wikipedia editor."

An Edit-a-thon for Eternity

The Australian Christian newspaper Eternity, working with the WikiProject 1000 Women in Religion, held an edit-a-thon to increase the coverage of Australian religious women.

Two more Indigenous Taiwanese language Wikipedias

On April 15, 2021, the Atayal and Seediq language Wikipedias were launched, as reported in The News Lens and Language Magazine. Both languages are indigenous to Taiwan, and are the second and third indigenous Taiwanese language Wikipedias to be created. The launch of these Wikipedias, along with the Sakizaya Wikipedia back in 2019 (see prior Signpost coverage), are part of a language revitalization project by the Taiwanese government. Atayal is spoken by only 90,000 people, and Seediq by only 10,000, and fluent speakers of both respective languages are far fewer. In 2018, UNESCO categorized Atayal as "vulnerable" to extinction and Seediq as "severely endangered". These languages belong to the Formosan family of Austronesian languages. Most speakers are elderly.

The lab leak hypothesis on Wikipedia

Once again CNET reports on the coverage of the COVID-19 pandemic on Wikipedia. This time, the focus is on the origins of the virus: "Inside Wikipedia's endless war over the coronavirus lab leak theory." The article, written by Jackson Ryan, is clear in its descriptions of Wikipedia processes, and features quotes from involved Wikipedians. The piece was well-received on a relevant talk page, with one Wikipedian writing: "it is rare to read an article about Wikipedia that gets things broadly right".

In brief

Videos and podcasts

  • Jimbo on section 230: (11 minutes) Jimmy Wales is interviewed on Yahoo! about possible changes to the U.S. law that shields the WMF and other platforms from liability caused by their contributors. "We can't make Facebook legally liable for everything that somebody’s crazy uncle types on the internet.” Jimmy then moves on to "your sweet grandmother" and "angry boys".
  • "Just the facts, ma'am": (8 minutes) Katherine Maher on April 13, her last day as CEO and Executive Director of the Wikimedia Foundation, was interviewed by Trevor Noah. Over nearly eight minutes, they discussed the growth of Wikipedia, neutrality and verifiability, and fund raising. The main surprise was that she repeated a quote associated with the TV show Dragnet.
  • Katherine Maher also spoke to the Atlantic Council on June 22, 2021.
  • The Nature of Wikipedia: (29 minutes) on podcaster The World According to Wikipedia starts with a discussion on cabals, and moves on to Wiki Loves Earth the international photo competition. Anastasiia Petrova, the international coordinator is featured.



Do you want to contribute to "In the media" by writing a story or even just an "in brief" item? Edit next month's edition in the Newsroom or leave a tip on the suggestions page.



Reader comments

2021-06-27

Croatian Wikipedia: capture and release

A decade-old case of "project capture" of the Croatian Wikipedia (Hr.WP) by nationalist administrators may have been resolved with the help of a report published this month on Meta by the Wikimedia Foundation titled "The Case of the Croatian Wikipedia". The report was authored anonymously, presumably to avoid harassment, and is an independent view of an expert on the subject matter.

The admins, led by Kubura, inserted disinformation and used sockpuppeting and other abusive tactics, according to a separate RfC which globally banned him last November. Blablubbs, who participated in the RfC, said that Kubura had an "army of socks". Blablubbs decided to help at the RfC "partially because of the whitewashing ... and partially because of draconian crackdowns on dissent inside the project".

The admins were linked by the report to Croatian nationalist positions by their downplaying the UN war crime convictions of Croatians who fought in the 1991–1999 Yugoslav Wars, their use of biased unreliable sources and by their support of the World War II era Nazi-puppet state, the Independent State of Croatia (NDH), as well as the military group, the Ustaše which the report calls "terrorist".

The report echos earlier accounts of administrator abuse including a 2019 article in The Signpost, "The curious case of the Croatian Wikipedia", Croatian and international news stories going back to 2010, and complaints by Wikipedia editors starting about 2007. The report concludes that "Hr.WP had been dominated by ideologically driven users who are misaligned with Wikipedia’s five pillars, confirming concerns about the project’s integrity from the global community."

Articles are being re-written and disaffected editors are rejoining the project. The report notes this progress but warns that the transformation is not complete and that the banned admins may use new accounts to try to recapture the project.

The report also observes that this case highlights a "significant weakness in the global Wikimedia community and – by extension – Wikimedia Foundation platform governance."

The report

The WMF began its planning for the report in November 2020 as the RfC on banning Kubura was in progress, but the author's investigation started in February 2021. He is an external expert on the subject matter and provides three recommendations to the WMF. Jan Eissfeldt, Global Head of Trust and Safety at the WMF told The Signpost that the author is a native speaker of Serbo-Croatian with "decades of relevant international experience analysing ... patterns of organised disinformation." The report states "opinions expressed in this report are those of the author and do not necessarily reflect the official policy or position of the Wikimedia Foundation."

The Croatian, Serbian, and Bosnian Wikipedias are unusual in that they all separated starting in 2003 from the Serbo-Croatian Wikipedia, which continued to exist. All these languages are mutually intelligible variants of Serbo-Croatian, which is termed "pluricentric".

The report states that

this structure enabled local language communities to sort by points of view on each project, often falling along political party lines in the respective regions. The report asserts, furthermore, it deprived the newly-created communities of editorial diversity that normally guides and underpins the traditionally successful process of editorial consensus in other pluricentric language projects.
— Report, p.2

The limited number and diversity of editors on the new Croatian Wikipedia allowed it to become politicized, and allowed Kubura, his sockpuppets, and followers to capture the administrative structure of the project.

Evidence of this capture is shown in the report's section on "Key findings and case studies" (pp. 15–35) which makes up one-third of the report. It includes subsections on

  • "Measuring disinformation," which uses a sample of articles on 32 people convicted of war crimes committed during the Yugoslav wars and compares how these crimes were covered in 8 Wikipedia language versions. Though the sample size is low, Hr.WP clearly covered Croatian war criminals differently than the other language versions.
  • "Illustrative examples," or case studies from the above subsection
  • "Propaganda," examining a dozen specific articles
  • "Questionable sources," which examines eleven sources used in the project and the frequency of their use, classifying them as self-published sites, those without any declared editorial standards, openly extremist sites, etc.

This section is the core of the analysis and may set the standard if any future reports of this type are needed.

Based on these findings the report makes three recommendations to the WMF and the Serbo-Croatian communities:

  1. that the Croatian community seek to "continue re-establishing a robust local governance system, requesting oversight and support from the rest of the Wikimedia movement as needed;"
  2. that they seek to unify the selection of admins and functionaries with other Serbo-Croatian communities; and
  3. that they explore a full reunification into the original Serbo-Croatian language project.

Adding some urgency to these recommendations, the report warns that, as currently constituted, Hr.WP is at risk of being recaptured by nationalist editors and admins.

An additional observation – strengthen global governance

Following the recommendations, the report's author makes a statement that goes beyond the Serbo-Croatian community and the Hr.WP disinformation problem.

The evident failure of the Meta RfC system to resolve the structural misalignment of Croatian language Wikipedia and the lack of an adequate alternative pathway to resolution, points at the significant weakness in the global Wikimedia community and – by extension – Wikimedia Foundation platform governance. This is a problem for public and regulatory confidence in the self-governance model provided for within the framework of the Foundation’s policies ... While devising possible pathways to address this identified bigger challenge is beyond the scope of this disinformation assessment, it strikes the author of this evaluation as increasingly important to resolve in the light of heightened regulatory scrutiny of user-generated platform models, including Wikipedia ...
— Report, p.14

The Signpost asked Jan Eissfeldt of the WMF for his reaction to the report's observation. He recognizes that disinformation is a growing problem, and emphasized that the WMF would work with the communities as openly as possible. For cases where safety is a potential problem, they might need to work through stewards or other trusted users. "We are investing in our movement's capacity to identify and respond to all kinds of influence operations, including those led by government actors, to ensure the accuracy of the information shared on Wikimedia projects. An example of this was the taskforce we put together ahead of the U.S. presidential election."

While not promising to start any new program to systematically evaluate disinformation problems, he said "the Foundation aims to conduct project evaluations, in collaboration with volunteers in the Wikimedia movement, to explore potential issues in projects openly and transparently."

How well did the WMF respond in the case of the Croatian Wikipedia? He says that the WMF "did not adequately understand some of the unique risks now identified in the report," in particular the risks of having separate Wikipedias for parts of pluricentric language communities.



Reader comments

2021-06-27

Feminist critique of Wikipedia's epistemology, Black Americans vastly underrepresented among editors, Wiki Workshop report

A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.

Reviewed by Markworthen

This paper[1] by Menking and Rosenberg, published in the journal Science, Technology, & Human Values, is a recondite article. Its depth is both a strength (diligent study of the article will likely enhance Wikipedians' understanding of potential problems, such as our assumptions about what constitutes a reliable source or our epistemological presumptions), and a weakness (most Wikipedians will not read it because it is so dense).

I tried (several times) but cannot improve on the authors' summary. Here, then, is an excerpt from the article abstract:

A repository for established facts, Wikipedia is also a social space in which the facts themselves are decided. As a community, Wikipedia is guided by the five pillars—principles that inform and undergird the prevailing epistemic and social norms and practices for Wikipedia participation and contributions. We contend these pillars lend structural support to and help entrench Wikipedia’s gender gap as well as its lack of diversity in both participation and content. In upholding these pillars, Wikipedians may unknowingly undermine otherwise reasonable calls for inclusivity, subsequently reproducing systemic biases. We propose an alternative set of pillars developed through the lens of feminist epistemology ... Our aim is not only to reduce bias, but also to make Wikipedia a more robust, reliable, and transparent site for knowledge production.

Context

Background reading that will enhance understanding of this Menking & Rosenberg (2021) article:

Talk page discussions about the article

The article has generated some engaging discussions on Wikipedia talk pages, for example:


African Americans are vastly underrepresented among US Wikimedians, but contribute motivated by "black altruism"

Reviewed by Tilman Bayer

Last month, the Wikimedia Foundation published the results of its annual "Community Insights", a global survey of 2,500 Wikimedians (including active editors and program leaders) conducted in September/October 2020.[2]

For the first time, the survey asked about race and ethnicity, confined to two countries where such categories are widely used and accepted: the US (195 responses) and the UK (67). Among US contributors, the findings shows striking gaps among Black/African American editors (0.5% compared to 13% among the general population) and American Indian/Alaska Native editors (0.1% vs 0.9%). Hispanic/Latino/a/x editors show a lesser but still large gap (5.2% vs. 18%). White/Caucasian (89% vs. 72%) and especially Asian Americans (8.8% vs. 5.7% among the general population) are over-represented among contributors in the US.

Survey findings about the race of contributors in the US, compared to the overall population (categories are overlapping and thus sum to more than 100%)

In the UK, the survey similarly found "significant underrepresentation" of Black or Black British editors (0.0% vs. 3.0% in the general population), whereas the percentage of white editors was close to the general population.

A racial or ethnic gap among Wikipedians in the US has long been anecdotally observed or conjectured (see e.g. this 2010 thread which also contained some informed speculation about possible reasons), but this marks the first time that it is backed by empirical survey data, related to the fact that the Wikimedia Foundation's annual surveys are global in nature and there are no internationally accepted definitions of race and ethnicity (or worse, survey questions of this nature would be considered offensive in many countries) [3][4].

Illustration from the 2018 "Pipeline of Online Participation Inequalities" paper

Correspondingly, there has been few research about possible reasons for such gaps. An exception is the 2018 paper "The Pipeline of Online Participation Inequalities: The Case of Wikipedia Editing"[supp 1], which we previously reviewed here with a more general focus, but it also contains some insights about reasons why African Americans contribute at a lower rate.

While this study did not find a significant racial disparity among the earlier parts of the pipeline (measuring whether survey respondents had heard of Wikipedia or had visited Wikipedia), when it comes to "know[ing] that Wikipedia can be edited [...] age, gender, and several racial/ethnic identity categories (Black, Hispanic, Other) emerge as salient explanatory factors where they did not before. Income no longer explains the outcome. Education level associates strongly with knowing Wikipedia can be edited." However, racial and ethnic background factors "do not associate with who contributes content" (i.e. the last part of the pipeline). This points to raising awareness of Wikipedia's editability as a potential strategy for reducing these gaps, although this would not address "the importance of education and Internet skills" gaps for closing knowledge gaps that the authors highlight in their overall conclusions.

Conversely, a 2020 paper answered the question "What drives Black contributions to Wikipedia?"[3][4] with the following conclusions, based on a survey of 318 Black Wikipedia editors in the US:

First, black altruism [measured via the survey question "Writing/editing Wikipedia is a way for me to help the Black community," "Sharing my knowledge through Wikipedia improves content on the Black community," and "It is important that Blacks write/edit Wikipedia"] indirectly influences Wikipedians’ content contribution through their perception of information quality regarding Black content in the Wikipedia universe. In other words, the higher their altruistic tendencies, the stronger their perceptions of information quality. [...] We argue that black altruism mediated through perceptions of information quality [...] helps extend black digital culture in the online encyclopedia. Therefore, we suggest one of the gratifications of Black Wikipedia contribution is likely the production and dissemination of Black cultural information content. [...]

Secondly, our findings suggest that Black Wikipedians’ perceived social presence, is also a significant driver of content contribution. That is to say, Wikipedia contribution uplifts self-esteem and feelings of self-enhancement among Black authors and editors. [...]

Lastly our findings, demonstrated that entertainment is not a factor that significantly motivates Black Wikipedians’ content contribution in the current study. We suggest this may be because Wikipedia content contribution, like long-form blogging, requires “more sustained time, editing, maintenance” (Steele, 2018 p. 116), intellectual endeavor, and a higher degree of technological proficiency than other types of social media activities. Nevertheless, our results are a sharp contrast to previous studies which suggest that amusement is an influential factor in Wikipedia contributions [...]"

Survey respondents were recruited in 2017 via Qualtrics "based on predefined characteristics such as individuals who identified as Black/African American, resided in the United States, and had made at least one edit/contribution to Wikipedia's English edition over the last three years". Interestingly, the resulting sample of 318 Black Wikipedia contributors was much larger than that of the WMF Community Insights survey, which (barring some extreme downward adjustments during the weighting process) appears to have consisted of a single Black respondent in the sample, considering the stated percentage of 0.5% among 195 US-based respondents.


Wikiworkshop 2021

Report by Tilman Bayer

The annual WikiWorkshop, part of The Web Conference, took place as an online event on April 14, 2021, featuring the papers listed below. The organizers reported that 78% of attendees were non-native English speakers, 66% first-time attended Wiki Workshop for the first time, 53% were academic researchers and 34% students.

"References in Wikipedia: The Editors' Perspective"

From the abstract:[5]

"we explore the creation and collection of references for new Wikipedia articles from an editor’s perspective. We map out the workflow of editors when creating a new article, emphasising on how they select references."

"Do I Trust this Stranger? Generalized Trust and the Governance of Online Communities"

From the abstract:[6]

" we hypothesize that administrators’ community governance policy might be influenced by general trust attitudes acquired mostly out of the Wikipedia context. We use a decontextualized online experiment to elicit levels of trust in strangers in a sample of 58 English Wikipedia administrators. We show that low-trusting admins exercise their policing rights significantly more (e.g., block about 81% more users than high trusting types on average). We conclude that efficiency gains might be reaped from the further development of tools aimed at inferring users’ intentions from digital trace data."

"Negative Knowledge for Open-world Wikidata"

From the abstract:[7]

"Like most major KBs [knowledge bases, Wikidata is] incomplete and therefore operates under the open-world assumption (OWA) – statements not contained in Wikidata should be assumed to have an unknown truth. The OWA ignores however, that a significant part of interesting knowledge is negative, which cannot be readily expressed in this data model. In this paper, we review the challenges arising from the OWA, as well as some specific attempts Wikidata has made to overcome them. We review a statistical inference method for negative statements, called peer-based inference, and present Wikinegata, a platform that implements this inference over Wikidata. ... Wikinegata is available at https://d5demos.mpi-inf.mpg.de/negation ."

"A Brief Analysis of Bengali Wikipedia's Journey to 100,000 Articles"

From the abstract:[8]

"This paper analyzes the various associating factors throughout this journey including the number of active editors, number of content pages, pageview, etc., along with the connection to outreach activities with these parameters."

"WikiShark: An Online Tool for Analyzing Wikipedia Traffic and Trends"

From the abstract:[9]

"This paper introduces WikiShark (www.wikishark.com) – an online tool that allows researchers to analyze Wikipedia traffic and trends quickly and effectively, by (1) instantly querying pageview traffic data; (2) comparing traffic across articles; (3) surfacing and analyzing trending topics; and (4) easily leveraging findings for use in their own research."

"Tracing the Factoids: the Anatomy of Information Re-organization in Wikipedia Articles"

From the abstract:[10]

"... we investigate the impact of gradual edits on the re-positioning and organization of the factual information in Wikipedia articles [...] we show that in a Wikipedia article, the crowd is capable of placing the factual information to its correct position, eventually reducing the knowledge gaps. We also show that the majority of information re-arrangement occurs in the initial stages of the article development and gradually decreases in the later stages."

"Wikidata Logical Rules and Where to Find Them"

From the paper[11] (an extended abstract):

"We are interested in soft (approximate) constraints expressed as dependencies (or logical rules), such as the constraint that “a person cannot be born after one of her children”. Such rules have proven to be useful for error detection [4], adding missing facts [3], executing queries faster, and reasoning [1]. Not only these rules are not stated in Wikidata, but, to the best of our understanding, a way to express them as constraints is still to be defined in the repository ... there are very few rules that are exact, i.e., true for each and every case. As an example, consider a rule stating that “a country has always one capital”. This is true for most countries, but there are 15 countries that have two or more capitals. Therefore, the rule has a very high confidence, but it is not exact ... The goal of our work is to create a large collection of rules for Wikidata with their confidence measure. In this abstract, we report on two directions we have been exploring to obtain such rules, our results, and how we believe the Wikimedia community could benefit from this effort."

"Simple Wikidata Analysis for Tracking and Improving Biographies in Catalan Wikipedia"

From the abstract:[12]

"we highlight the possibilities of taking advantage of structured data from Wikidata for evaluating new biographical articles, so facilitating users to get engaged into diversity challenges or track potential vandalism and errors"

Related code: https://github.com/toniher/wikidata-pylisting

"Structural Analysis of Wikigraph to Investigate Quality Grades of Wikipedia Articles"

From the abstract:[13]

"we present a novel approach based on the structural analysis of Wikigraph to automate the estimation of the quality of Wikipedia articles. We examine the network built using the complete set of English Wikipedia articles and identify the variation of network signatures of the articles with respect to their quality. Our study shows that these signatures are useful for estimating the quality grades of un-assessed articles with an accuracy surpassing the existing approaches in this direction."

"Towards Open-domain Vision and Language Understanding with Wikimedia"

From the abstract:[14]

"This work [i.e. research proposal] describes a project towards achieving the next generation of models, that can deal with open-domain media, and learn visio-linguistic representations that reflect data’s context, by jointly reasoning over media, a domain knowledge-graph and temporal context. This ambition will be leveraged by a Wikimedia data framework, comprised by comprehensive and high-quality data, covering a wide range of social, cultural, political and other type of events"

"Language-agnostic Topic Classification for Wikipedia"

From the abstract:[15]

"we propose a language-agnostic approach based on the links in an article for classifying articles into a taxonomy of topics that can be easily applied to (almost) any language and article on Wikipedia. We show that it matches the performance of a language-dependent approach while being simpler and having much greater coverage."

See also: online demo, data dumps, model details

"Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation"

From the abstract:[16]

"We evaluate the quality and time-savings of AI-generated formula and identifier annotation recommendations on a test selection of Wikipedia articles from the physics domain. Moreover, we evaluate the community acceptance of Wikipedia formula entity links and Wikidata item creation and population to ground the formula semantics. Our evaluation shows that the AI guidance was able to significantly speed up the annotation process by a factor of 1.4 for formulae and 2.4 for identifiers. Our contributions were accepted in 88% of the edited Wikipedia articles and 67% of the Wikidata items. The >>AnnoMathTeX<< annotation recommender system is hosted by Wikimedia at annomathtex.wmflabs.org. In the future, our data refinement pipeline will be integrated seamlessly into the Wikimedia user interfaces."

"ShExStatements: Simplifying Shape Expressions for Wikidata"

From the abstract:[17]

"Wikidata recently supported entity schemas based on shape expressions (ShEx). They play an important role in the validation of items belonging to a multitude of domains on Wikidata. [...] In this article, ShExStatements is presented with the goal of simplifying writing the shape expressions for Wikidata."

"Inferring Sociodemographic Attributes of Wikipedia Editors: State-of-the-art and Implications for Editor Privacy"

From the abstract:[18]

"we investigate the state-of-the-art of machine learning models to infer sociodemographic attributes of Wikipedia editors based on their public profile pages and corresponding implications for editor privacy. [...] In comparative evaluations of different machine learning models, we show that the highest prediction accuracy can be obtained for the attribute gender, with precision values of 82% to 91% for women and men respectively, as well as an averaged F1-score of 0.78. For other attributes like age group, education, and religion, the utilized classifiers exhibit F1-scores in the range of 0.32 to 0.74, depending on the model class."

"The Language of Liberty: A preliminary study"

From the paper:[19]

"We managed to align more than 37,000 articles across Wikipedia and Conservapedia; of these, about 28,000 pages share an identical title, while the remaining ones are aligned based on redirect pages. In total, the whole corpus contains 106 million tokens and 558,000 unique words. [...] We can notice marked differences in word usage in the two resources: Wikipedia authors tend to use more objective/neutral words (affordable care, american politician), in addition to many non-political terms. In Conservapedia [there] prevail derogatory terms such as rino, which stands for “Republican In Name Only", and Democrat Party, but also topics of high concern to the conservative community such as the homosexual agenda, communist manifesto, and fetal tissue."

"Information flow on COVID-19 over Wikipedia: A case study of 11 languages"

From the abstract:[20]

"we study the content editor and viewer patterns on the COVID-19 related documents on Wikipedia using a near-complete dataset gathered of 11 languages over 238 days in 2020. Based on the analysis of the daily access and edit logs on the identified Wikipedia pages, we discuss how the regional and cultural closeness factors affect information demand and supply."

"Towards Ongoing Detection of Linguistic Bias on Wikipedia"

From the abstract:[21]

"As part of our research vision to develop resilient bias detection models that can self-adapt over time, we present in this paper our initial investigation of the potential of a cross-domain transfer learning approach to improve Wikipedia bias detection. The ultimate goal is to future-proof Wikipedia in the face of dynamic, evolving kinds of linguistic bias and adversarial manipulations intended to evade NPOV issues."

Analysis of two million AfD (Article for Deletions) discussions

From the abstract:[22]

"[Wikipedia article deletion] decisions (which are known as “Article for Deletion”, or AfD) are taken by groups of editors in a deliberative fashion, and are known for displaying a number of common biases associated to group decision making. Here, we present an analysis of 1,967,768 AfD discussions between 2005 and 2018. We perform a signed network analysis to capture the dynamics of agreement and disagreement among editors. We measure the preference of each editor for voting toward either inclusion or deletion. We further describe the evolution of individual editors and their voting preferences over time, finding four major opinion groups. Finally, we develop a predictive model of discussion outcomes based on latent factors."

Among the findings are that "Editors who joined before 2007 tend to overwhelmingly belong to the more central parts of the network" and that "user preferences [for keep or delete] are relatively stable over time for ... more central editors. However, despite the overall stability of trajectories, we also observe a substantial narrowing of opinions in the early period of an AfD reviewer tenure. ... Strong deletionists exhibit the least amount of change, suggesting the possibility of lower susceptibility, or higher resistance, to opinion change in this group." Overall though, the authors conclude that "differences between inclusionists and deletionists are more nuanced than previously thought."

From the abstract:[23]

" we use general and health-specific features from Wikipedia articles to propose health-specific metrics. We evaluate these metrics using a set of Wikipedia articles previously assessed by WikiProject Medicine. We conclude that it is possible to combine generic and specific metrics to determine health-related content’s information quality. These metrics are computed automatically and can be used by curators to identify quality issues."

"Wikipedia Editor Drop-Off: A Framework to Characterize Editors' Inactivity"

A figure from the paper, showing "The different states of drop-off related to activity and their possible transitions"
From the abstract:[24]

"... we present an approach to characterize Wikipedia’s editor drop-off as the transitional states from activity to inactivity. Our approach is based on the data that can be collected or inferred about editors’ activity within the project, namely their contributions to encyclopedic articles, discussions with other editors, and overall participation. Along with the characterization, we want to advance three main hypotheses, derived from the state of the art in the literature and the documentation produced by the community, to understand which interaction patterns may anticipate editors leaving Wikipedia: 1) abrupt interactions or conflict with other editors, 2) excess in the number and spread of interactions, and 3) a lack of interactions with editors with similar characteristics."

The paper is part of an ongoing research project funded by a €83,400 project grant from the Wikimedia Foundation. Some related code can be found at https://github.com/WikiCommunityHealth/ .

Wikimedia Foundation Research Award of the Year

Besides presentations about the papers listed above, the Wiki Workshop event also saw the announcement of the first "Wikimedia Foundation Research Award of the Year" ("WMF-RAY", cf. call for nominations), with the following two awardees:

"Content Growth and Attention Contagion in Information Networks: Addressing Information Poverty on Wikipedia"[25] (also presented at last year's Wikiworkshop), a paper which according to the laudators

"demonstrates causal evidence of the relationship between increases in content quality in English Wikipedia articles and subsequent increases in attention. The researchers conduct a natural experiment using edits done on English Wikipedia via the Wiki Education Foundation program. The paper shows that English Wikipedia articles that were improved by students in the program gained more viewers than a group of otherwise similar articles. It also found that this effect spills over into a range of articles linked to from the improved articles."

"Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages"[26] and Masakhane (which describes itself as "A grassroots NLP community for Africa, by Africans"). The paper

describes a novel approach for participatory research around machine translation for African languages. The authors show how this approach can overcome the challenges these languages face to join the Web and some of the technologies other languages benefit from today."

While the research does not seem to have concerned Wikipedia directly, the laudators find it an "inspiring example of work towards Knowledge Equity, one of the two main pillars of the 2030 Wikimedia Movement Strategy" and expect the project's success

"will directly support a range of Wikimedia Foundation and Wikimedia Movement goals including the newly-announced Abstract Wikipedia which will rely heavily on machine translation too."

Consistent with its title, the paper features an impressive list of no less than 48 authors (with the cited eprint having been submitted to arXiv by Julia Kreutzer of Google Research).

Briefly

References

  1. ^ Menking, Amanda; Rosenberg, Jon (2021-05-01). "WP:NOT, WP:NPOV, and Other Stories Wikipedia Tells Us: A Feminist Critique of Wikipedia's Epistemology". Science, Technology, & Human Values. 46 (3): 455–479. doi:10.1177/0162243920924783. ISSN 0162-2439. Closed access icon
  2. ^ Rebecca Maung (Wikimedia Foundation): 2021 Community Insights Report, Meta-wiki, May 2021
  3. ^ Stewart, Brenton; Ju, Boryung (2019-02-01). "What drives Black contributions to Wikipedia?". Proceedings of the Association for Information Science and Technology. doi:10.1002/pra2.2018.14505501168.Closed access icon
  4. ^ Stewart, Brenton; Ju, Boryung (May 2020). "On Black Wikipedians: Motivations behind content contribution". Information Processing and Management: an International Journal. 57 (3). doi:10.1016/j.ipm.2019.102134. Closed access icon
  5. ^ Kaffee, Lucie-Aimée; Elsahar, Hady (2021-04-19). "References in Wikipedia: The Editors' Perspective" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 535–538. doi:10.1145/3442442.3452337. ISBN 9781450383134.
  6. ^ Hergueux, Jérôme; Algan, Yann; Benkler, Yochai; Fuster-Morell, Mayo (2021-04-19). "Do I Trust this Stranger? Generalized Trust and the Governance of Online Communities" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 539–543. doi:10.1145/3442442.3452338. ISBN 9781450383134.
  7. ^ Arnaout, Hiba; Razniewski, Simon; Weikum, Gerhard; Pan, Jeff Z. (2021-04-19). "Negative Knowledge for Open-world Wikidata" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 544–551. doi:10.1145/3442442.3452339. ISBN 9781450383134.
  8. ^ Dastider, Ankan Ghosh (2021-04-19). "A Brief Analysis of Bengali Wikipedia's Journey to 100,000 Articles" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 552–557. doi:10.1145/3442442.3452340. ISBN 9781450383134.
  9. ^ Vardi, Elad; Muchnik, Lev; Conway, Alex; Breakstone, Micha (2021-04-19). "WikiShark: An Online Tool for Analyzing Wikipedia Traffic and Trends" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 558–571. doi:10.1145/3442442.3452341. ISBN 9781450383134.
  10. ^ Verma, Amit Arjun; Dubey, Neeru; Iyengar, S.R.S.; Setia, Simran (2021-04-19). "Tracing the Factoids: the Anatomy of Information Re-organization in Wikipedia Articles" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 572–579. doi:10.1145/3442442.3452342. ISBN 9781450383134.
  11. ^ Ahmadi, Naser; Papotti, Paolo (2021-04-19). "Wikidata Logical Rules and Where to Find Them" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 580–581. doi:10.1145/3442442.3452343. ISBN 9781450383134.
  12. ^ Hermoso Pulido, Toni (2021-04-19). "Simple Wikidata Analysis for Tracking and Improving Biographies in Catalan Wikipedia" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 582–583. doi:10.1145/3442442.3452344. ISBN 9781450383134.
  13. ^ Chhabra, Anamika; Srivastava, Shubham; S. Iyengar, S. R.; Saini, Poonam (2021-04-19). "Structural Analysis of Wikigraph to Investigate Quality Grades of Wikipedia Articles" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 584–590. doi:10.1145/3442442.3452345. ISBN 9781450383134.
  14. ^ Semedo, David (2021-04-19). "Towards Open-domain Vision and Language Understanding with Wikimedia" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 591–593. doi:10.1145/3442442.3452346. ISBN 9781450383134.
  15. ^ Johnson, Isaac; Gerlach, Martin; Sáez-Trumper, Diego (2021-04-19). "Language-agnostic Topic Classification for Wikipedia" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 594–601. doi:10.1145/3442442.3452347. ISBN 9781450383134.
  16. ^ Scharpf, Philipp; Schubotz, Moritz; Gipp, Bela (2021-04-19). "Fast Linking of Mathematical Wikidata Entities in Wikipedia Articles Using Annotation Recommendation" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 602–609. doi:10.1145/3442442.3452348. ISBN 9781450383134.
  17. ^ Samuel, John (2021-04-19). "ShExStatements: Simplifying Shape Expressions for Wikidata" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 610–615. doi:10.1145/3442442.3452349. ISBN 9781450383134.
  18. ^ Brückner, Sebastian; Lemmerich, Florian; Strohmaier, Markus (2021-04-19). "Inferring Sociodemographic Attributes of Wikipedia Editors: State-of-the-art and Implications for Editor Privacy" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 616–622. doi:10.1145/3442442.3452350. ISBN 9781450383134.
  19. ^ Araque, Oscar; Gatti, Lorenzo; Kalimeri, Kyriaki (2021-04-19). "The Language of Liberty: A preliminary study" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 623–626. doi:10.1145/3442442.3452351. ISBN 9781450383134.
  20. ^ Jung, Changwook; Hong, Inho; Sáez-Trumper, Diego; Lee, Damin; Myung, Jaehyeon; Kim, Danu; Yun, Jinhyuk; Jung, Woo-Sung; Cha, Meeyoung (2021-04-19). "Information flow on COVID-19 over Wikipedia: A case study of 11 languages" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 627–628. doi:10.1145/3442442.3452352. ISBN 9781450383134.
  21. ^ Madanagopal, Karthic; Caverlee, James (2021-04-19). "Towards Ongoing Detection of Linguistic Bias on Wikipedia" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 629–631. doi:10.1145/3442442.3452353. ISBN 9781450383134.
  22. ^ Tasnim Huq, Khandaker; Ciampaglia, Giovanni Luca (2021-04-19). "Characterizing Opinion Dynamics and Group Decision Making in Wikipedia Content Discussions" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 632–639. doi:10.1145/3442442.3452354. ISBN 9781450383134.
  23. ^ Couto, Luís; Lopes, Carla Teixeira (2021-04-19). "Assessing the quality of health-related Wikipedia articles with generic and specific metrics" (PDF). Companion Proceedings of the Web Conference 2021. WWW '21. New York, NY, USA: Association for Computing Machinery. pp. 640–647. doi:10.1145/3442442.3452355. ISBN 9781450383134.
  24. ^ Marc Miquel-Ribé, Cristian Consonni and David Laniado: Wikipedia Editor Drop-Off: A Framework to Characterize Editors' Inactivity wikiworkshop.org, 2021
  25. ^ Zhu, Kai; Walker, Dylan; Muchnik, Lev (2018-06-05). "Content Growth and Attention Contagion in Information Networks: Addressing Information Poverty on Wikipedia". SSRN 3191128.
  26. ^ Nekoto, Wilhelmina; Marivate, Vukosi; Matsila, Tshinondiwa; Fasubaa, Timi; Kolawole, Tajudeen; Fagbohungbe, Taiwo; Akinola, Solomon Oluwole; Muhammad, Shamsuddeen Hassan; Kabongo, Salomon; Osei, Salomey; Freshia, Sackey; Niyongabo, Rubungo Andre; Macharm, Ricky; Ogayo, Perez; Ahia, Orevaoghene; Meressa, Musie; Adeyemi, Mofe; Mokgesi-Selinga, Masabata; Okegbemi, Lawrence; Martinus, Laura Jane; Tajudeen, Kolawole; Degila, Kevin; Ogueji, Kelechi; Siminyu, Kathleen; Kreutzer, Julia; Webster, Jason; Ali, Jamiil Toure; Abbott, Jade; Orife, Iroro; Ezeani, Ignatius; Dangana, Idris Abdulkabir; Kamper, Herman; Elsahar, Hady; Duru, Goodness; Kioko, Ghollah; Murhabazi, Espoir; van Biljon, Elan; Whitenack, Daniel; Onyefuluchi, Christopher; Emezue, Chris; Dossou, Bonaventure; Sibanda, Blessing; Bassey, Blessing Itoro; Olabiyi, Ayodele; Ramkilowan, Arshath; Öktem, Alp; Akinfaderin, Adewale; Bashir, Abdallah (2020-11-06). "Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages". arXiv:2010.02353 [cs].
Supplementary references and notes:
  1. ^ Shaw, Aaron; Hargittai, Eszter (2018-02-01). "The Pipeline of Online Participation Inequalities: The Case of Wikipedia Editing". Journal of Communication. 68 (1): 143–168. doi:10.1093/joc/jqx003. ISSN 0021-9916. Closed access icon (but still available via archive.org)




Reader comments

2021-06-27

So no one told you life was gonna be this way

This traffic report is adapted from the Top 25 Report, prepared with commentary by Igordebraga, Kingsif, SSSB, Benmite and Mcrsftdog.

The Signpost took a break last month, but given the weekly report on Wikipedia's most viewed didn't stop, we have two months to cover.

And the Oscar goes to... (April 25 – May 1)

Most Popular Wikipedia Articles of the Week (April 25 to May 1, 2021)
Rank Article Class Views Image Notes/about
1 93rd Academy Awards 1,571,338 The goddamned pandemic derailed the film industry, yet AMPAS still decided to award the best productions of 2020 (and early 2021). One of the major jibes before the ceremony got started was its choice of main location, a train station (pictured), and it only got worse from there. The Oscars, adamantly refusing to use Zoom (or similar) after the less than successful early awards ceremonies, was in-person only, though they added hubs in London and Paris so nominees that didn't want to (or, couldn't) go to Los Angeles could attend. It certainly felt more like the Golden Globes in a normal year with its audience at tables and such. Except they weren't drunk, nor particularly excited. The screen ban also meant there were no clips played before categories, and both the hosts and the In Memoriam seemed to be rushing to get home. So, between the average moviegoer having not seen the nominees and and a fairly uninteresting ceremony, it translated into the lowest ratings ever.
2 Mortal Kombat (2021 film) 1,563,890 24 years after the disastrous Mortal Kombat: Annihilation, the ultraviolent fighting game series returned to theaters (and HBO Max) giving everything fans hoped for, namely lots of people punching each other, good special effects, mostly accurate portrayals of their beloved characters, and dialogue featuring things such as "Fatality!", "Flawless Victory!" and "GET OVER HERE!" Hence Mortal Kombat already recouped its $55 million budget even with not as many screens available.
3 Nomadland (film) 1,449,019 One of the weirdest things in #1 was that the top category, Best Picture, was the third-to-last instead of the big prize at the end.[a] Its winner was the adaptation of a non-fiction book about a woman who decides to spend her days living in a van down by the river vandwelling. Two of the producers winning this prize also took home Best Director (#10) and Actress (#8).
4 Rohit Sardana 1,331,967 From Hollywood to India, where a local TV anchor died at just 41 from a heart attack.
5 2021 NFL Draft 1,309,263 American football got its latest college athletes, with the Jacksonville Jaguars using their #1 pick on Trevor Lawrence.
6 Shadow and Bone 1,267,141 Netflix has released a new series adapting a fantasy novel by Leigh Bardugo (pictured), following Alina Starkov (played by Jessie Mei Li) as she discovers she has the power to set her country free from the fold, a perpetually dark, barren strip of land cutting the country in two.
7 Shadow and Bone (TV series) 1,134,244
8 Frances McDormand 983,409 Best Actress at #1 came to this talented thespian for #3, a near-record third time (only behind Katherine Hepburn's four; counting the Supporting category, she now has as many Oscars as Meryl Streep!), following a pregnant policewoman from the middle of nowhere (where the director was husband Joel Coen) and an angry mother exploiting outdoor advertisement. McDormand also got a second statuette as the film's co-producer, and celebrated that win by howling! Here she's pictured with a different award she won thrice.
9 Deaths in 2021 960,124 Given the In Memoriam at #1 (speedily) used this song:
Until the dolphin flies and parrots live at sea (Always)
Until we dream of life and life becomes a dream
10 Chloé Zhao 925,572 For only the second time, Best Director went to a woman, the one responsible for #3. And Ms. Zhao's next movie will certainly show if she can do action as well as fellow winner Kathryn Bigelow, namely Marvel's Eternals.
  1. ^ Everyone would agree that the producers of the ceremony chose to close with Best Actor presuming Chadwick Boseman would win, ending the ceremony homaging the late actor. So what an anti-climax when the Oscar instead went to Anthony Hopkins, who wasn't even in attendance...

Our D-I-V-O-R-C-E becomes final today (May 2–8)

Most Popular Wikipedia Articles of the Week (May 2 to 8, 2021)
Rank Article Class Views Image Notes/about
1 Bill Gates 1,371,353 The billionaire couple announced their divorce this week, after 27 years of marriage and 34 years as a couple. The divorce appears to be amicable and they have said that they will continue to work together on their foundation.
2 Melinda French Gates 1,027,961
3 Deaths in 2021 926,892
Lay me down
Let the only sound
Be the overflow...
4 Invincible (TV series) 744,955 The final episode of the first series aired on April 29. It has proved so popular that a further two series of this family-unfriendly animated superhero show were confirmed on April 30.
5 Resident Evil Village 730,516 Capcom's seminal survival horror series returned for its eight mainline title (in fact, the cover art shows there's a "VIII" hidden in "Village"), where the unlucky bastard who sought his wife in the seventh game now tries to rescue his daughter from inhuman creatures. And given the setting is Transylvania, of course there's vampires, whose leader caused quite the impact upon her reveal in a demo.
6 Cinco de Mayo 703,801 The annual celebration held on May 5, celebrating Mexico's victory over France at the Battle of Puebla in 1862.
7 Elon Musk 695,271 #1 used to be the richest man in the world, now it's this guy, who hosted Saturday Night Live. Not the first entrepreneur to have done so, but most people would rather forget the previous one.
8 2021 West Bengal Legislative Assembly election 691,857 The election in this east Indian state was held April 27–29, with the results being announced this week. All India Trinamool Congress (AITC) increased their majority. The election was marred by violence, with opposition parties claiming the violence was conducted by the AITC against them.
9 Dogecoin 679,762 Showing how fickle the cryptocurrency market is, #7 made jokes about the memetic dog-themed one he champions on SNL (on his opening monologue, Musk brought his mom on stage, who said she expected her Mother's Day gift to not be Dogecoin; and on Weekend Update, Musk appeared as a cryptocurrency expert who was questioned thrice by host Michael Che about Dogecoin) and its value fell 37%! Perhaps to compensate, right after the week covered by this report, Musk's SpaceX announced a Dogecoin-funded mission.
10 Mamata Banerjee 673,257 Despite losing her seat in the West Bengal Legislative Assembly, her party won the most seats in #8 and she remains the Chief Minister of West Bengal.

Go to war again, blood is freedom's stain (May 9–15)

Most Popular Wikipedia Articles of the Week (May 9 to 15, 2021)
Rank Article Class Views Image Notes/about
1 Elon Musk 2,023,222 Billionaire Memeguy hosted Saturday Night Live on May 8. His fans thought his was the best episode ever; his detractors thought it was as unfunny as anything modern SNL puts out. Musk also managed to crash two cryptocurrencies in one week—doge- crashing after a (presumably failed) plug on SNL, and bit- crashing after Tesla announced it would no longer be accepted as payment.
2 State of Palestine 1,543,993
In this week, protests over the eviction of Palestinians from Sheikh Jarrah, East Jerusalem escalated into countless missile strikes on the Gaza Strip.
3 Israeli–Palestinian conflict 1,473,599
4 Israel 1,445,058
5 Hamas 1,206,073
6 Iron Dome 1,043,060
7 Gaza Strip 1,003,236
8 Jupiter's Legacy (TV series) 983,196
What if a superhero was morally grey? What if they, get this, killed people? That'd be crazy. Jupiter's Legacy, the first piece of media to ever explore this concept, premiered on Netflix on May 7.
9 Deaths in 2021 937,721
Well, I'll die as I stand here today
Knowing that deep in my heart
They'll fall to ruin one day
10 Radhe (2021 film) 924,247
India has sadly been going through a huge spike in cases of the pandemic, so the latest Bollywood blockbuster had to be released digitally (though foreign markets got it in theaters).

Sing, sing a song, sing out loud (May 16–22)

Most Popular Wikipedia Articles of the Week (May 16 to 22, 2021)
Rank Article Class Views Image Notes/about
1 Eurovision Song Contest 2021 1,595,667 The annual (if one forgets 2020, as many would like to do; at most there was that Will Ferrell comedy, that probably brought in bigger non-European interest this year) competition took place in Rotterdam this year, in a venue that housed a COVID field hospital. As ever, Eurovision was pretty crazy, with highlights including vaporwave backgrounds, some of the most poorly executed green screening, and some guy who wouldn't put a shirt on. The winner: Italy, who sent the alt rock band Måneskin (including topless dude). The loser: the UK, who managed to get nil points. Twice.
2 Halston 1,523,384 Ewan McGregor plays Halston (1932–1990), a mononymous fashion designer and eponymous subject of a Netflix miniseries that is proof that Ryan Murphy can produce shows about literally anything (as long as they include fashion).
3 State of Palestine 1,433,840 A second week of the conflict caused the State of Palestine to be third on our list, along with several other related articles.
4 Army of the Dead 1,206,852 Before his foray into comic book adaptations, Zack Snyder started his film career remaking Dawn of the Dead. So here's him back into zombies (in fact, it's apparently based on an idea he had while working on the remake), namely a Netflix original where Dave Bautista leads a team trying to steal a Las Vegas casino's vault before the city is nuked to contain its undead infestation.
5 Israel 1,120,843 A truce closed off the most recent flare-up of the decades-long conflict after 15 days.
6 Israeli–Palestinian conflict 1,026,543
7 Hamas 992,010
8 Deaths in 2021 886,710 Breathing, is the hardest thing to do
With all I've said
And all that's dead for you
9 The Woman in the Window (2021 film) 866,684 This schlocky modern retelling of Hitchcock's Rear Window, starring Amy Adams, made its way to Netflix on May 14.
10 Mare of Easttown 824,532 HBO continues to air this show about Kate Winslet as detective Mare, who returns to Easttown Township to investigate a murder.

I'll be there for you, 'cause you're there for me too (May 23–29)

Most Popular Wikipedia Articles of the Week (May 23 to 29, 2021)
Rank Article Class Views Image Notes/about
1 Army of the Dead 1,584,244 Zack Snyder's newest film, released to Netflix on May 21, stars Dave Bautista, Ella Purnell, and a digitally-superimposed Tig Notaro. The plot's about a crew breaking into a zombie-infested Las Vegas to retrieve millions of dollars before the place is nuked, but it's also about a father reconnecting with his estranged daughter—read into that what you will.
2 Eternals (film) 1,029,757 The first trailer for this MCU film—from Nomadland director Chloé Zhao—was released on Monday. If you're wondering how an Oscar winning director makes a Marvel movie, just take it from producer Kevin Feige:

We cut a little sample reel together, I remember, to show [Disney higher-ups]. And it was so beautiful, and I had to keep saying, “This is right out of a camera; there’s no VFX work to this at all!” Because it was a beautiful sunset, with perfect waves and mist coming up from the shore on this giant cliffside — really, really impressive stuff. (Variety)

3 Phil Mickelson 989,739 Mickelson won the 2021 PGA Championship on May 23; at the age of 50, he's the oldest person to ever win a men's major golf championship
4 Matthew Perry 989,188 In the years between Friends and The Reunion, Chandler starred in Studio 60 on the Sunset Strip, Mr. Sunshine, Go On, The Odd Couple, 17 Again and Fallout: New Vegas. He also tried to fend off the addiction to prescription drugs that hit him while doing the show.
5 Eurovision Song Contest 2021 945,167 After the pandemic ensured the only Eurovision Song Contest in 2020 was the one with Will Ferrell, the music extravaganza featuring the best and most bombastic of Europe (and Australia) made a glorious return.
6 Deaths in 2021 894,293 I won't stay quiet, I won't stay quiet
Cause staying silent's the same as dying
7 Mare of Easttown 819,701 John Oliver joked that this series about a murder investigation is the only thing as white and depressing as the current backdrop of his show. And now there's only episode left for it.
8 Måneskin 818,137 This Italian rock band won the Eurovision Song Contest with their song "Zitti e buoni."
9 Friends: The Reunion 736,524 This special, starring the cast of the really big 90s–00s sitcom, premiered on HBO Max on Thursday. The guestlist included people who had appeared on the original series, as well as Malala, BTS, Justin Bieber, and Lady Gaga.
10 Elliot Page 716,745 Page, star of Juno and The Umbrella Academy, came out as a trans man at the end of last year. On Monday, he posted his first post-top surgery shirtless photo.

No one could ever know me, no one could ever see me (May 30 – June 5)

Most Popular Wikipedia Articles of the Week (May 30 to June 5, 2021)
Rank Article Class Views Image Notes/about
1 Mare of Easttown 1,232,548 Writing/reading this Report, a pattern can be noted that articles on TV shows enter right as the season starts, and either remain steady or lose visits with more episodes, with the only thing that raises views being the season finale. This HBO drama starring Kate Winslet as a policewoman with a fractured home life investigating a murder managed to invert it, as Mare of Easttown only earned a slot by the third episode and has only grown since then, with the miniseries closer managing to the top the list – which is also a reflection of its actual ratings, with the seventh having nearly a million more viewers compared to the first!
2 Tulsa race massacre 1,178,663
On May 31 and June 1, 1921, the Black population of Tulsa, Oklahoma was devastated by attacks from a White mob. Hundreds were killed, thousands were left homeless, and the prosperous Greenwood District—known as "Black Wall Street"—was destroyed. The massacre was left out of American popular history until relatively recently; the revival probably owes a lot to HBO, as both the 2019 Watchmen miniseries and last year's Lovecraft Country both featured the events. 2020 also had the unfortunate coincidence of last year's George Floyd protests being at their height on the anniversary of the massacre.
3 Deaths in 2021 951,295 To the well-organized mind, death is but the next great adventure.
4 A Quiet Place Part II 874,861 Shh! The John Krasinski-directed horror flick A Quiet Place, about a family (led by himself and real life wife Emily Blunt) who must live their lives in silence to avoid killer aliens with amazing hearing, earned itself critical and audience acclaim as well as beaucoup bucks. It got itself a sequel, but does anyone else kind of hate it when they just slap "Part 2" in front of things? It could've been A Quieter Place, or A Quiet Place: Hold Your Breath, or A Quiet Place: Please Would You Kindly Shut The Heck Up. In any case, Jim Halpert seems to have avoided the sophomore slump with this one if online ratings are to be believed.
5 Cruella (film) 856,457 The newest Joker movie, starring Emma Stone as a campy performance artist/fashion artist/mastermind criminal that will one day skin Dalmatians, was released to theatres and Disney+ on May 28.
6 The Conjuring: The Devil Made Me Do It 830,910 This horror movie based on the trial of Arne Cheyenne Johnson was released to theatres and HBO Max on Friday.
7 Matthew Perry 742,353 Many viewers of the Friends reunion were worried about Perry after a withdrawn performance. During the show he admitted to anxiety surrounding how many laughs he would get. He has since announced that he and his fiancé had split after 4 years.
8 Naomi Osaka 709,447 Osaka was fined $15,000 for not giving a press conference after her first game in the French Open; the next day she withdrew, citing her mental health.
9 Frank Kameny 708,859 In 1957, Kameny was fired from the Army Map Service after his superiors learned of his homosexuality. He unsuccessfully fought this in the courts, and afterwards became a gay rights activist. The late Kameny, and pride month itself, was commemorated with a Google Doodle on Wednesday.
10 The Family Man (Indian TV series) 682,356 Prime Video has released this thriller starring Manoj Bajpayee as a middle-class man secretly working as an intelligence officer for India's counter-terrorist task force.

Someone to face the day with, make it through all the rest with (June 6–12)

Most Popular Wikipedia Articles of the Week (June 6 to 12, 2021)
Rank Article Class Views Image Notes/about
1 UEFA Euro 2020 1,725,754 After one year of delay, the 60th anniversary of Europe's football tournament between nations started on June 11. As of this report's cutoff, 4 games have been played. The only surprising result from these 4 is Finland beating Denmark (pictured is Joel Pohjanpalo, who scored the winning goal) However, given the circumstances (see below) the result is less surprising.
2 Christian Eriksen 1,553,804 The Danish international footballer suffered a suspected cardiac arrest in the tail end of the first half of a match against Finland, their opening match as part of #1. The match was immediately suspended while his team mates formed a protective buble around Eriksen and medics performed CPR. After approximately 15 minutes Erikson was transferred to hospital, where he is said to be stable, conscious and awaiting tests. The match resumed around 2 hours, Denmark's play was noticeably subdued, as were Finland's winning celebrations.
3 The Conjuring: The Devil Made Me Do It 1,079,590 The third sequel to 2013's The Conjuring and the whopping eighth installment in The Conjuring Universe is based on Gerard Brittle's book The Devil in Connecticut, written about the demonic trial of Arne Cheyenne Johnson (#9).
4 Ed and Lorraine Warren 995,545 Okay, so, if you ask me, the escapades recounted by these two ghostbusters (and the protagonists of the Conjuring films, in which they're portrayed by Patrick Wilson and Vera Farmiga, respectively) have about as much truth to them as when someone drops acid and then tells you they had a conversation with Julius Caesar. But you can't say the tall tales these two have Conjured up about supernatural spirits over the years haven't made for some perfect movie material, considering just how many films the studios have been able to milk from them, including the most recent Conjuring installment (#3).
5 Logan Paul 968,238 The slightly less controversial (though not that much less controversial) half of social media's Paul brothers has lived many lives both on and off of the internet, first as one of Vine's biggest stars, then as a YouTuber, an actor, a singer, a screenwriter, and a podcaster. Now, like his brother, he's made his triumphant return to boxing two years after his first professional match, going up against none other than Floyd Mayweather Jr. (#9). I didn't watch the fight, though according to the site you're reading this on, Mayweather may have gone a little easy on him.
6 Loki (TV series) 967,253 The MCU machine just keeps on pumping. In the the first episode of this semi-retconning sci-fi crime thriller, which was released on Disney+ this week, Tom Hiddleston reprises his role as the titular God of Mischief, introduced in 2011's Thor. He gets captured by the Time Variance Authority, an organization that monitors the timelines of the Multiverse, who deem one of the versions of Loki a threat to the "Sacred Timeline", in which no multiversal war breaks out, and Loki agrees to help stop the alternate, fugitive version of himself from messing up the timeline. Confused? I certainly am!
7 The Family Man (Indian TV series) 960,776 The second season of this Indian thriller, starring Manoj Bajpayee (pictured) as a middle-class man working for the National Intelligence Agency who must balance secretly trying to protect the country from terrorists with his tumultuous family life, was released last week on Amazon Prime Video. It continues to yield positive reviews and buzz.
8 Floyd Mayweather Jr. 929,958 A boxer who had a shining professional career, being champion of four weight classes and retiring undefeated after 50 fights. Mayweather returned for an exhibition fight with #5, where him holding back led to showers of boos from the audience.
9 Trial of Arne Cheyenne Johnson 895,314 Saying "a demon made me kill my landlord" is like a way more extreme version of "my dog ate my homework", but in 1981, that was the excuse given by Arne Cheyenne Johnson and his defense lawyers. You must acquit, I guess.

As told in #3, in a story that seems to have been based more on slightly convenient timing and coincidence than truth, the body of an 11-year-old boy named David Glatzel was supposedly being inhabited by a demon, so his family called in ghost hunters Ed and Lorraine Warren (#4), who then got the Church to perform an official exorcism on Glatzel. Months later, Johnson killed his landlord, apparently because the demon had relocated from Glatzel's body to Johnson's. Not only is this hilarious because it means the exorcism was a failure, but it also means that the demon waited months to actually do anything demonic.

10 Sweet Tooth (TV series) 810,631 Based on the comic book series of the same name by Jeff Lemire, this Robert Downey Jr.-produced, James Brolin-narrated Netflix fantasy series released last week takes place in a world where animal-human hybrids are being hunted down, and follows the adventures of a boy born part deer and part human named Gus as he travels across America with his human companion, Tommy, in search of Gus's mother.

Even at my worst I'm best with you, yeah! (June 13–19)

Most Popular Wikipedia Articles of the Week (June 13 to 19, 2021)
Rank Article Class Views Image Notes/about
1 Juneteenth 2,615,582 Saturday was Juneteenth, which commemorates the abolition of slavery in Texas following the Civil War. While most states have some recognition of the holiday, it took until this Thursday for President Biden to make it a federal holiday—the timing probably has a lot to do with the civil rights protest movement from last year.
2 UEFA Euro 2020 2,548,439 As of this report's cut-off, every team has played two of their three group games. Italy, Belgium and the Netherlands are already through, but will still try to win their group, for a theoretically easier round of 16 match (although the Netherlands have already won theirs) North Macedonia are already knocked out and now play for pride. However, for the other 20 teams still need to fight to stay in, with only 13 of those 20 teams making it through...
3 Cristiano Ronaldo 950,014 Ronaldo is currently playing for Portugal in the UEFA Euros. He's a pretty big deal, enough to be blamed for a stock value drop of The Coca-Cola Company after moving a bottle of Coke off the table to encourage viewers to drink water. (the true story is much more complicated)
4 Loki (TV series) 881,400 Thought WandaVision was a one time deal in "Disney+ Marvel shows that seem designed to leave the viewer confused every week"? Nope, Loki is here with its timeline variances to keep that void filled. There was even the reveal of the antagonist, known as "The Variant", an alternate Loki who not only is intent on breaking the flow of time, but is a woman!
5 Milkha Singh 872,744 Nicknamed "The Flying Sikh", a sprinter who was the last Indian man to get close to an Olympic medal in track and field with a 1956 fourth place (a woman got to the same position in 1984; the country has not had much Olympic success in running, jumping and throwing), and died of COVID-19 complications at the age of 91.
6 Christian Eriksen 866,489 Following the Danish international's cardiac arrest during his match against Finland national football team (June 12), he has been fitted with an implantable cardioverter defibrillator. Meanwhile, during Denmark's match against Belgium was paused in the 10th minute for a tribute (Eriksen wears the #10 shirt), with a banner reading "All of Denmark is with you, Christian".
7 Critical race theory 858,371 School boards around the United States are filled with talk about "critical race theory," a specific academic field that, to some critical eyes, encompasses everything from The 1619 Project to white genocide. In Washoe County, Nevada, parents even proposed equipping teachers with body cams to prevent the dastardly CRT. Liberals warn that bans on CRT—so far passed in Idaho and Florida—will have a chilling effect on discussions of racism, while conservatives boast that they would have a chilling effect on discussions of racism.
8 Deaths in 2021 830,867 When I die and they lay me to rest
Gonna go to the place that's the best
When I lay me down to die
Goin' up to the Spirit in the Sky
9 Novak Djokovic 814,589 Djkovic won the Men's Singles tournament at the French Open on June 13, taking the champion title from Stefanos Tsitsipas.
10 UEFA European Championship 691,184 Ever since 1960, the European national football teams have a tournament among them every four years. Although the goddamned pandemic ensured the latest edition (#2) had an extra year of wait.

Exclusions

  • These lists exclude the Wikipedia main page, non-article pages (such as redlinks), and anomalous entries (such as DDoS attacks or likely automated views). Since mobile view data became available to the Report in October 2014, we exclude articles that have almost no mobile views (5–6% or less) or almost all mobile views (94–95% or more) because they are very likely to be automated views based on our experience and research of the issue. Please feel free to discuss any removal on the Top 25 Report talk page if you wish.



Reader comments

2021-06-27

Searching for Wikipedia


Magnifying glass with focus on paper, Niabot, CC BY-SA 3.0
This article was originally published in the Wikimedia Techblog on June 7, 2021 CC BY-SA 4.0
How people use Search to access Wikipedia is a common question by researchers. Until now, however, there has been little data available about this relationship. To help address these questions, the Wikimedia Foundation is releasing a new, faceted dataset on search engine traffic to Wikipedia so you can ask questions like "What is the most common search engine in my country?" or "Which search engine is most-used by Android users?"

It's no secret that search engines ferry a great deal of traffic to Wikipedia. With every major change in how a search engine presents its results,[a] questions arise about how the change might affect Wikipedia traffic. Historically, there has been scant data about how search engine traffic varied by platform and region.

We are taking a small step towards shedding greater light on the relationship between Search and Wikipedia by releasing a new, daily dataset of Wikipedia pageviews referred directly from search engines split by Wikipedia language, search engine, operating system, and web browser.

What might you find combing through the data? Well, first, you'll discover there's a lot of data! In any given month, about eight billion pageviews to Wikipedia come directly from clicks on search engines. On any given day, this dataset showcases pageviews that come from about 220 different countries, 100 different languages of Wikipedia,[b] 50 browser families, 14 operating systems, and 20 search engines.[c]

The vast majority of those clicks—over 90%—come from Google Search (table; see Figure 1). The next closest competitor is Yahoo Search at 2% of views followed by Microsoft Bing, DuckDuckGo, and Yandex Search. While Google's search traffic is globally quite dominant, many of the smaller search engines see their share of search coming primarily from a single country—e.g., 70% of Yahoo!'s search comes from Japan; 90% of Yandex's search comes from Russia; almost 100% of Naver's search comes from South Korea (nested table).

The increasing dominance of mobile devices can be seen in this dataset as well but with slightly more variation between countries than between search engines. Android and iOS typically trade between the top two spots with Windows generally in a strong third place (heatmap). Browsers have similar dynamics but replace Android with Chrome Mobile, iOS with Safari, and add a few more desktop versions into the mix (heatmap).

Image credit: Wikipedia search referrals dashboard, Isaac Johnson, CC BY-SA 4.0
Figure 1. Global search traffic to Wikipedia in April 2021. The blue line at the top is Google, at approximately 250 million pageviews referred per day, and all the other search engines are at the bottom of the chart, at fewer than 6 million pageviews referred per day. (Link to data)

Visualizing the data

The multi-faceted nature of this new dataset also presented some new display challenges. Most datasets we release consist of a target metric—e.g., pageviews—and are composed of a single facet—e.g., language edition—or sometimes hierarchical facets—e.g., you can split by project family like Wikipedia or individual languages of Wikipedia. This dataset has five, non-hierarchical facets, all with many categories, as highlighted in the previous section.

Maybe you're interested in which search engine is dominant in a particular market? Or how Android users compare to iOS users? Or the distribution of language editions in a given country? Or, or, or…? This makes our standard public dashboards — Wikistats, Dashiki, Discovery — a poor fit for someone who might want to slice or aggregate the data as they primarily support a single dominant facet.

Luckily, Wikimedia has some experience with an open-source dashboarding platform called Turnilo that is a perfect fit. Turnilo allows for us to create quick, dynamic filters and aggregations, supports a variety of displays—e.g., tables, line graphs, or heatmaps—and makes it easy to share specific views of the data via URLs. We currently use Turnilo to showcase a number of private datasets, so we had some experience working with it but had never provided a publicly-viewable version. In just a few hours, we built a public Turnilo instance on our Cloud VPS infrastructure (code). We worked with the Turnilo team to improve support for flat files (as opposed to their more popular, but more complex Druid back-end). And now we have a strong use-case for expanding our public dataset dashboarding options (Phab)!

Go check it out at: https://wiki-search-referrals.wmcloud.org/ And if all the options are a bit overwhelming, here's a good place to start: search referrals from the previous month split by country and search engine (link).

See also

Footnotes

  1. ^ See, for example Google Panda, Google Penguin, Google Pigeon Signpost editors
  2. ^ Astute Wikipedians might notice that there are 300 language editions, not 100. The discrepancy arises from masking that we do for any pageview counts below 500 for privacy reasons — i.e. many other language editions (and countries and OSes and browsers) receive search traffic, but they would be represented as “other” in this dataset if they did not meet that threshold. See https://phabricator.wikimedia.org/T270140 for more details.
  3. ^ You can see more information on the search engines we track in this dataset here (https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake/Traffic/referrer_daily#Search_Engines). If you notice any major search engines missing, let us know!




Reader comments

2021-06-27

Wikipedia's best articles on the world's strangest things

Wikipedia is a haven for specialized knowledge, unevenly applied. While articles on topics as important as "The Sun in culture" sit little-noticed and little-loved, we have poured time and attention into producing extremely high-quality articles on some more obscure subjects. Here are a few of Wikipedia's featured articles on the bizarre, outlandish, or simply fascinating things the world produces from time to time.

The Brown Dog affair

With a title that seems lifted from a Tintin book, the Brown Dog affair actually involves a public hue and cry in Edwardian Britain. At about 5,600 words, the full story is well worth your time to read, but in short, to quote from the lead: "it involved the infiltration by Swedish feminists of University of London medical lectures; pitched battles between medical students and the police; police protection for the statue of a dog; a libel trial at the Royal Courts of Justice; and the establishment of a Royal Commission." Nominated for FA status by the great and much missed SarahSV, this article is a reminder that the past is a massive place.

Why don't animals have wheels?

Rotating locomotion in living systems answers a question that occurs to every five-year-old shortly after they learn about cars. Why haven't animals given up their ungainly legs, paws, and wings in exchange for the ability to roll? The article, nominated for FA status by swpb, is a perfect example of how to turn inquiry into knowledge.

Cats, the coronation cut

Continuing on our mammalian theme, we come to the Chief Mouser to the Cabinet Office. This article describes a cat with a lordly salary of £100 a year, whose role is to keep 10 Downing Street, a notoriously drafty and unwelcoming home, free of vermin. It's hard to say which part of the article better exemplifies Wikipedia's appreciation of the catholic nature of knowledge – the section on partisanship with relation to the cat, or the navigation box at the bottom containing article links to all the Chief Mousers of years past. Nominated for Featured List status by Matthew.

And now for something completely different...

Anyone who has suffered through a YouTube video beset with ads will understand the frustration of American football fans who tuned in to watch the Heidi Game. Played in 1968, when TV channels were few and football was just beginning its rise to dominant popularity in the United States, the regular-season game filled up its entire 3-hour timeslot. A few minutes before the Oakland Raiders began an epic comeback, NBC decided to switch to regular programming – specifically, Heidi, a made-for-TV movie not much remembered today. Interestingly, the film's score was composed by John Williams, one year after his first Academy Award nomination. Nominated for Featured Article status by The Writer 2.0.

Honorary mention

Finally, mention must be made of Extremely Online, recently brought to Good Article status by JPxG. An enjoyable read, the article does fail to answer one key question – how much time do you have to spend on Wikipedia before you become Extremely Online? Maybe it's best if we leave that one to the philosophers.



Reader comments

2021-06-27

WikiProject on open proxies interview

Hello to all! The best way to get something done at The Signpost is the same as everywhere else on Wikipedia: just do it. So, I'll take my chances with some interviews this year and see how it goes.

I enjoy Wikiproject interviews because they show the atmosphere that keeps me on this website. Editors interested in a broad or niche area contribute together, with a WikiProject often forming the nexus of this collaboration. These editors aren't always the most vocal, and they aren't always the ones running RfCs or in Arbcom, and might not be running for administratorship. Instead they are often part of the thousands of us editing content in the background. These interviews can give fascinating insights into different topic areas and the editors' motivations in those areas.

The atypical title first drew me to WikiProject on open proxies. My interest only grew when I saw its unique structure as a noticeboard. So – what is an open proxy? And why have a WikiProject about one? These and many more questions are hopefully answered below:

What is Wikipedia:WikiProject on open proxies?

  • MarioGom: A project to enforce Wikipedia:Open proxies. That is, blocking open proxies on Wikipedia.
  • zzuuzz: I'd describe it as part of a loose collaboration of users who aim to identify or confirm open proxies, and usually block them from editing. Requests are made by a wide range of editors, admins, and checkusers who deal with vandalism, spam, and sockpuppetry problems. At the core of the project is the task of confirming whether an IP address is an open proxy or not, so it also seeks to help unblock IP addresses which have been blocked as open proxies but are no longer open proxies. It can additionally provide other technical and diagnostic information, for example opinions about IP block lengths or collateral damage. Also, being an open proxy is not always a straightforward issue – a response might sometimes identify a public wifi, or a private server, or a commercial filtering service, which can be complicated areas in terms of both policy and potential abuse. Sometimes this information may be simply relayed to the person making the request so they or anyone else can make up their own mind.
  • GN: Depends who you ask. Mario and zzuuzz hit most of what I'd say – we're a loose group of people who work together to help enforce our policy on open proxies and to provide technical knowledge about open proxies to other editors.
  • SQL: The replies above cover it very well.
  • Blablubbs: I concur with the above.

It's interesting that you are a combined noticeboard and WikiProject. What do you see as the benefits (or downsides) or formatting yourselves as a a WikiProject?

  • MarioGom: I am relatively new to the WikiProject, and I am unaware of the original rationale. I think working as a noticeboard is well aligned with the work we do (similar to WikiProject Spam). A WikiProject, which is somewhat informal, allows for some flexibility that is probably needed in an area where a small number of users is involved.
  • zzuuzz: WP:OP was initially part of a now defunct cross-wiki project. While other 'chapters' have gradually become deprecated, the enwiki chapter has carried on in roughly the same format over the years. A rename has occasionally been mooted, but usually lacking any great alternatives or enthusiasm. I think the project label is not really a bad fit, as there is a membership of sorts – the list of users who can verify open proxies. Apart from this list, membership is highly informal. The noticeboard format is suitable for the task in hand, which is to process reports. On the downside, I would say a singular focus on reports may not be best suited for strategic or in-depth discussions.
  • GN: It's been like this since before I joined, and for the primary task (handling reports) it works well enough that I haven't really seen a need to reconsider the setup. On the rare occasion that I've been around and we've needed to have bigger discussions, the talk page has worked well enough.
  • SQL: It's been that way for as long as I can remember. At one point I was looking at rolling it into IPCheck, and using the resulting data to help train a Machine Learning algorithm.
  • Blablubbs: Our primary function arguably is as a noticeboard, but I don't think the name really matters. The current format gets the job done well.

What motivated you to become a member of WikiProject on open proxies?

  • MarioGom: I am involved in tracking and reporting undisclosed paid editing and other types of abuse. Through this work, especially on sockpuppet investigations, I learnt that a very high proportion of abusive users (spammers, harassers, vandals) routinely use VPN [Virtual private network] or proxy services. This facilitates block evasion and obstructs the work of CheckUsers. I noticed that VPN services often got their servers blocked individually. I thought this was somewhat pointless because if a single VPN server is blocked, the user can jump in a couple of clicks to any other servers offered by that same VPN service. I started participating on WPOP to report full VPN and proxy services at once. If I see abuse from an IP that belongs to a VPN service, instead of reporting that single IP, I report every IP for that service, effectively blocking the full service.
  • zzuuzz: Open proxies have always been responsible for a disproportionate amount of spam, vandalism, and other disruption, so I generally support blocking a lot of them. At the time I joined in 2007 there was a very messy approach to blocking open proxies, with large numbers of suspected proxies usually being blocked indefinitely and remaining blocked, sometimes on the weakest of evidence. I joined the project as I figured I could use my technical skills and experience to help bring some sense to the proxy-blocking process.
  • GN: I joined because WPOP looked fairly understaffed and I already had a good amount of the requisite technical knowledge. I also have come across several open proxies during my work as an SPI clerk.
  • SQL: I joined to help cut down on UPE [undisclosed paid editing].
  • Blablubbs: I don't recall what exactly triggered my interest in proxies – I had my fair share of encounters with them, both through UPE-related tasks and other anti-abuse work, and at some point I became interested in verifying them myself, started asking around and learning by doing, and it all sort of went from there.

What do you see as some of the biggest achievements of WikiProject on open proxies, and are there any contributions you are particularly proud of?

  • MarioGom: Making abuse a little bit harder. I am particularly fond of some tools I created to automatically list all unblocked IPs from VPN services, which are now my primary tool to create WPOP reports. This has led to some VPN services that were often used for abuse being fully blocked. That being said, WPOP and its participants are not the only ones tackling this problem. There are several bots for automated blocking, such as User:ST47ProxyBot on English Wikipedia. There is also a bot used to list IP ranges of certain big hosting providers that are often used by VPNs and proxies (see User:AntiCompositeBot/ASNBlock), as well as very dedicated stewards who do their part on meta (User:Jon Kolbert).
  • zzuuzz: It's difficult to separate the achievements of the WikiProject from other achievements made by the blocking policy, the open proxies policy, and the community in general. However, in line with the advances made elsewhere I would say the projects greatest success has been supporting the transition to dealing with open proxies fairly, proportionately, and accurately.
  • GN: In my opinion, our biggest achievement is just cutting down on abuse – there are several long-term abuse cases who are known to use proxies, and blocking those makes their lives harder.
  • Blablubbs: It's hard to quantify the impact we have in exact terms – I don't know how many logged-in users are on any given VPN range or proxy IP we find, and what they're actually doing on those proxies. What I do know is that while we'll never be able to block every proxy in the world, our work here at the very least forces people to spend a lot more time looking for unblocked ones, and that makes it worthwhile. In recent months, we've been able to knock out a very large number of VPN providers and webhost ranges (mostly thanks to MarioGom's ceaseless efforts) and discovered and integrated new tools and discovery methods (shoutout to the fine folks at spur!). I'm proud of that progress.

What does an average day at WikiProject on open proxies look like? What roles and tasks do you associate with your WikiProject, and how do you go about fulfilling them?

  • MarioGom: Overall, a report of comes in, a verified user confirms the report and adds additional information and an admin blocks IPs or ranges if appropriate. I am primarily involved in reports. I spend about half of my time in the project coding tools to find blockable IPs, and the other half on manual review of these results and preparing reports in a way that (hopefully) saves some time from verified users and admins.
  • zzuuzz: On a typical day, reports will come in to the noticeboard listing IP addresses used by vandals, spammers, block-evaders, and other undesirables. There may be observations from various people about the evidence and other connected data. Usually a verified user will then confirm whether the IPs are open proxies or not, along with any appropriate blocks, and a passing admin will take any appropriate action.
  • GN: My main role has just been working off the backlog. Someone reports an IP, I use various tools to determine whether that IP is an open proxy (or something else blockable, like a VPS), and then figure out what kind of action to take (hardblocks vs softblock, for example, or whether I need to block an IP range). Even if a proxy checker has signed off on the block, I always do spot checks before hitting the block button, just to be certain.
  • SQL: Reports come in, a verified user looks it over, and makes a recommendation, and an admin acts on it. Often times, I'll add the appropriate filters to ASNBlock.
  • Blablubbs: I'm usually alerted to new reports by a bot ping on IRC and tackle them once I find time – in the majority of actionable cases, the reports will be for single VPN IPs, since ST47ProxyBot does an excellent job at finding and blocking open proxies. After confirming any given individual IP, I'll go for deep dives into ranges and providers using SQL's excellent isprangefinder tool, branch out from there, and then recommend administrative action. Aside from this, I do behind the scenes work, running checks on IPs that people send to me privately and often bouncing thoughts back and forth with MarioGom; together, we also maintain a list of common VPN fingerprints that enables us to attribute IPs to specific proxy providers.

What are WikiProject on open proxies's most pressing needs, and how can a new contributor help?

  • MarioGom: We could use more help from: (1) Users with advanced networking skills willing to help in VPN and proxy discovery and verification. Discovery is about finding the IPs in the first place, and verification is about technical methods to make sure that a reported IP is a proxy. (2) Admins that can help processing reports. Or stewards who can bridge these efforts to meta. (3) The WMF could help with further automation and data. We could also use better coordination with Meta and other Wikimedia projects.
  • zzuuzz: The project can always use people with the technical skills and experience to identify and confirm open proxies.
  • GN: More folks with technical expertise, more people with new ideas (we have a status quo, and it works, but it's entirely possible that we're stuck doing things a particular inefficient way and we don't see a simpler way to do things), and I wouldn't mind if the WMF actually tried to help instead of making our job harder.
  • SQL: More technically inclined people. Maybe a better format than the current noticeboard format.
  • Blablubbs: I concur with what others have said – more technical expertise, admin help and WMF assistance would definitely be an asset. I'll also add that this project depends on people making reports; while spotting potential proxy use can be hard for people who aren't technically inclined, there are some telltale signs (an IP editor hopping from country to country, weird WHOIS outputs that point to something not being a normal residential ISP etc.), and people reporting such IPs are a huge asset to the project. I'd rather decline more checks than miss more proxies. It's easier than it looks, and I'm happy to provide advice to anyone who is interested in contributing in one way or another.

Anything else you'd like to add?

  • MarioGom: Thanks to all editors, admins and stewards who contribute to this area.
  • zzuuzz: In the intro you asked what an open proxy is, and it's a good question. Put simply, an open proxy is an IP address that anyone can use to edit Wikipedia. They can be used for many things in fact, but the key purpose of using an open proxy is to obscure the real origins of an action. This might be in order to improve privacy for the end user, or to evade sanctions. I suppose many users these days might better recognise the term virtual private network (VPN service), some of which operate on the same principles and fall under the same Wikipedia policy. These networks can be used to evade geolocation restrictions or other censorship, sometimes legally and/or ethically, and sometimes not. Wikipedia has a long standing policy of blocking lots of open proxies from editing. There is no explicit prohibition against open proxies, and flags such as block exemption can enable editing from blocked addresses, however we do block an enormous number of them because they are responsible for an enormous amount of abuse.
  • GN: My thanks to everyone who contributes to WPOP, but I specifically want to call out User:MarioGom and User:Blablubbs – the two of them have discovered/written several tools over the past year or so that make proxy-checking a whole lot easier.
  • SQL: Seconding what GN said above. In specific, Blablubbs and MarioGom have been a huge help lately.
  • Blablubbs: Thanks for all the shoutouts, and thanks to anyone who helps out with reports, checks, and food for thought. I'll specifically call out Malcolmxl5, who has been a huge help in knocking out the administrative backlog lately.

Why are we asking volunteers for technical expertise to a strongly identified problem, when there is a foundation dedicated to spending 100 million dollars on this sort of thing? – question from Bri.

  • MarioGom: I'm not sure this should be material for the Signpost article about WP:WPOP (maybe it will be appropriate for the comments once published?), but the upcoming IP masking will render WP:WPOP obsolete. That is ok, but that also means that the WMF will need to assume tasks and responsibilities that are currently done by the community. My opinion is that independently of IP masking, the WMF should have a dedicated anti-abuse team.
  • zzuuzz: I think it's fair to say there's still some confusion about both the proposed extent of IP masking, and the nature and quality of proposed proxy checks. It's also important to note that most proxy blocking is already done outside of this WikiProject. No automated check claims, or should claim, to be 100% effective. One proxy checking tool which is used regularly provides about 15 different pieces of information, at least some of which are usually contradictory, to help the checker make up their own mind. Even the Torblock extension, which has been used in production for over a decade and uses a list of IP addresses supplied directly from the Tor project, can't pick up all Tor proxies. Where this project has most value is in these edge cases. I've said many times that human skills and judgment are essential to proper proxy identification. Would the community want someone in employ of the WMF to decide whether an IP should be blocked or not? I'm not sure that would go down well.
  • MarioGom: I think the WMF can help us significantly from a technical perspective. For example, a small amount of ASNs account for high number of VPN servers. These are mostly blocked in enwiki, but global blocks are not so comprehensive. There is also the problem of residential proxies, where long blocks are useless (and even harmful), but it is theoretically possible to build an autoblock system that minimizes collateral damage, and that system would require access to a good deal of backend data. The WMF can also provide better and more accurate data and tools to admins and CheckUsers.
  • zzuuzz: No doubt there's more that could be done, though I don't think the vision of a comprehensive ban across all projects is realistic. In the meantime, and in any case, this project could still use volunteers with technical skills and experience.

Thanks to everyone who answered (or asked) these questions, and also to you, the reader. Please contribute some ideas for WikiProjects to review or any feedback in the comments below, or on my talk page.




Reader comments

2021-06-27

Is WMF fundraising abusive?

This column, the Forum, is intended for the frank discussion and sharing of opinions between Wikipedians, so please read the discussion below in that light.
Andreas Kolbe is a former co-editor-in-chief of The Signpost. He recently published an article in The Daily Dot, titled "Wikipedia is swimming in money—why is it begging people to donate?". Smallbones is the current editor-in-chief of The Signpost. The opinions expressed are those of the authors and do not necessarily reflect those of other Signpost contributors or those of the Wikimedia Foundation.

Smallbones: Andreas, you recently wrote a piece in the Daily Dot, criticizing the Wikimedia Foundation's fundraising. Your piece was reported on and followed up by several other news sources, among them The Next Web, The Telegraph and Clarín, Argentina's biggest newspaper. The overall criticism seems to be that:

  • The WMF's budget has grown exponentially – much too fast,
  • The WMF is pressuring donors with appeals that suggest that Wikipedia is in dire need of money and may be forced to close down,
  • It is wasting donors' money on projects that the community neither wants or needs, and
  • The WMF and the community of editors are at loggerheads on this and many other issues

Does this represent your view of the problem?

Andreas Kolbe: Hi Smallbones, thanks for inviting me! I don't think the WMF's budget increases are necessarily a bad thing. For example, I asked many questions on Meta in the run-up to this piece. I received answers because there were WMF staff members there watching the pages. As anyone who's been around a while knows, that's not how it would have been a few years ago. There are many things the roughly 500 Foundation staff and contractors are doing that are worthwhile, and if the money is there to pay them, then good.

What I do feel strongly about is that the revenue should be gathered honestly. That brings me to your second point. The WMF banners shown annually around the world always evoke a sense that Wikipedia is facing an imminent threat – that its independence or even continued presence online is imperiled by a lack of funds. Which at this point is frankly ludicrous. Just look at the graph!

As for wasting money on projects the community neither wants or needs, I'd have to ask you what you mean by community – the volunteer community, or the public in general?

Smallbones: Defining community might mean all the potential readers, or potential donors – that is the entire population of the world, or it might mean a few thousand people who make more than 100 edits per month on the English Wikipedia. Let’s talk about the overall group of English-language editors on Wikipedia since the major complaints I’ve seen are in the English-language press.

People in the U.S. may have a different experience with fundraising campaigns than people in other places. Being asked to donate to charities is an everyday occurrence here. For example, today my National Public Radio station announced that an unnamed donor will give the station $100,000 if 3,500 people call the station and make a pledge of any size. But you need to do it in the next three days or else the $100,000 will disappear!

Later this summer, I expect to see volunteer firefighters, wearing some of their firefighting gear, standing in the middle of the road near the traffic light in front of their station. They slow down traffic and extend their tall firefighting boots to collect money from drivers. Yes, I’d consider this pushy if I thought about it, but like many Americans, I’ve gotten used to it.

The Canadian and British experiences with charity fundraising are fairly close to the American experience, though their public broadcasting and firefighting are likely paid for directly by the government. I’ve been told that charity solicitors on London streets are called “chuggers”, short for “charity muggers”. Canadians and Brits likely have seen television adverts with disturbing pictures of animals or children in pain, soliciting money for the animal welfare or children's welfare societies.

I don’t object to the goals of these organizations, but I really don’t like these adverts. Other countries would fund these needs through the government. Still other countries might fund them through religious organizations, large donations from philanthropists, or, perhaps in poorer countries, not at all. Perhaps a narrowly tailored campaign should be designed by the WMF for each country.

The first example of a WMF fundraising pitch in your Daily Dot article reads: "This Thursday Wikipedia really needs you. This is the 10th appeal we’ve shown you. 98% of our readers don’t give; they look the other way … We ask you, humbly, don’t scroll away."

While I’m surprised that the "this is the 10th appeal" sentence would help raise funds, I don’t see anything objectionable in it. The WMF does need donors every year, not just every 2-3 years when they’d be starting to run out of money. The "98%" sentence just gives readers the assurance that they wouldn’t be alone in not donating. The last sentence just asks readers to think one more time about donating.

Where is the objectionable part? Where are the disappearing funds? The firefighter’s boot in the driver’s face? The chuggers? The pictures of children and animals in pain? I guess I just don’t get it. Could you give another example of an objectionable on-Wiki pitch and explain why you think so?

Andreas: Other phrases used this fiscal year have included "Help us keep Wikipedia online and growing," "We need you to make a donation to protect Wikipedia's independence," "We humbly ask you to defend Wikipedia's independence," "This Thursday we need you to make a donation so that we can continue to protect Wikipedia's independence." Here are some examples:

"This Wednesday we humbly ask you to defend Wikipedia's independence"


The way they sound almost tearful sometimes, which is so at odds with the financial realities, causes me and countless others much the same distaste you describe feeling above.

Smallbones: Let’s focus on your last example – they’re all very similar to each other and to the example I discussed above. Three additional points.

  • The WMF is asking for about US $2.00
  • They mentioned “independence” and “if Wikipedia became commercial.” They are not saying that Wikipedia will become dependent on commercial interests this year or next but just suggesting the possibility at some future date.
  • They mention “Wednesday” and “help keep Wikipedia online and growing”. I don’t know why it’s important to mention “Wednesday”, but clearly nobody is suggesting that Wikipedia will go offline on Thursday.

Andreas: Surely it's designed to enhance the sense of urgency, that Wikipedia "really needs" money, today. But let me come back to your four bullet points above.

You asked whether I thought the WMF "is wasting donors' money on projects that the community neither wants or needs". My concern is rather that the global community of readers and donors doesn't have a good grasp of what the WMF is doing, which appears to be at least in part by design. Maybe they'd love what the WMF is doing, but if they don't know about it, how can they have an opinion?

Smallbones: You’re dead-wrong on this one. The WMF is radically transparent compared to other nonprofits. Just go to the bottom of the main page. There’s a dozen Wikimedia projects listed and linked. Dozens of different language versions. Wikipedia:About, Contact Wikipedia. You can also go to the Wikimedia Foundation website or read the articles on Wikipedia and Wikimedia Foundation. The biggest problem is likely that there is too much information, or that it’s not organized as well as it could be.

But you are likely thinking of the WMF’s future plans. Any big changes will likely be discussed with a dozen communities and listed on Meta for general discussion. The general reader who clicks on a fundraising banner probably doesn’t want to read all of that, but they can. They also likely have a very firm grasp of what Wikipedia is – it’s an online encyclopedia.

Can you think of any nonprofit – or any organization that is more transparent?

Andreas: The fundraising banners always just ask for money for Wikipedia, which is not even the WMF's main expenditure item – it reportedly accounts for about 30% of spending.

The financial statements show that over the past decade,

  • WMF net assets have increased more than 10-fold, even as
  • expenses have increased more than 10-fold,
  • the WMF has increased staff numbers almost 10-fold, while
  • cost of "salaries and wages" has increased more than 15-fold – which tells us that
  • WMF salaries have risen at rates well above inflation – doubled actually, in the case of the ED/CEO.
  • In addition to the more than 10-fold increase in net assets, the WMF now also has $100 million with Tides and is in the process of launching a for-profit arm.

Yet throughout this period of growth, the fundraising banners have made people believe that the WMF "often struggles to have enough money to keep Wikipedia up and running," as Trevor Noah put it in his recent interview of Katherine Maher. Viewers were left with that notion – the phrase remained uncorrected.

Smallbones: You are confusing exponential growth with other issues. There’s no question that the WMF’s fundraising has been successful. That doesn’t mean that they are lying to potential donors. That doesn’t mean that they are wasting money.

As far as Maher’s interview with Trevor Noah, I can’t answer for either of them but will note that you’re now focusing on what wasn’t said in a TV interview, rather than what was said in a fundraising appeal. Noah asked a long question (almost a full minute of an 8 minute interview) about the importance of WMF being a nonprofit (starting at 3:50), Maher’s answer covered the main question in detail, but not the part that you focused on.

Andreas: Andrew Lih said five years ago in the Washington Post, "People will come up to me during fundraising season and ask if Wikipedia's in trouble. I have to reassure them that not only is Wikipedia not in trouble, but that it's making more money than ever before and is at no risk of going away."

The WMF is $200 million richer today. When people learn just how wealthy the WMF is, you see complaints like this one at Hacker News:

  • "It doesn't matter if they have big plans, the way they ask for money doesn't say that – it says they can't survive without my 10 bucks, which is disgusting. I'd rather be shown ads than this."

Smallbones: Surely you could find a better source than Hacker News. When the pseudonymous poster says the WMF told him that they couldn’t survive “without my 10 bucks”, you didn’t take that literally did you?

What Andrew Lih said over 5 years ago is likely more interesting. I’ll ping him to see what he thinks now.

Andreas: I could have quoted many more statements like that from social media. Here is a Twitter thread by Hector Martin, the Asahi Linux founder, that got nearly 1,000 retweets. He refers to the banners as "downright deceptive". The Next Web spoke of Wikipedia's "impassioned pitch for cash" that "gives the impression that the site is struggling to stay afloat". People out there agreed with me.

Moreover, many readers assume the WMF plays an active role in directing, moderating or fact-checking Wikipedia content. This is partly because the WMF has a media presence and takes credit for the project, whereas the volunteer community has no public voice, no organisation speaking for it, no PR agent. The WMF should fill this role, but the volunteers don't feel represented by it.

Smallbones: The WMF does a pretty good job mentioning Wikipedia volunteers at almost every chance. Perhaps they could do more. I also think that many reporters don’t know how to deal with the multi-headed hydra that is Wikipedia, and just credit the WMF for convenience. So how should the WMF represent the diverse views of the community? – that’s a hard one to answer.

Andreas: Half the board should be elected by volunteers who do not represent organisations financially dependent on the WMF – who have no financial conflict of interest. This brings me to your last bullet point about the WMF and the community. I was struck by the magnitude of the opposition when the WMF tried to rebrand itself as the Wikipedia Foundation last year – voices against the renaming were running at well over 90 per cent on Meta (540 opposed vs. 46 in favour). Volunteers spoke of attempted "identity theft". An open letter telling the WMF to stop bears the signatures of 73 Wikimedia affiliates and 998 volunteers.

I believe what drives this unease is in part the way the WMF has been absorbed by the US political and financial establishment.

A Counterpunch article I read the other day mentions that traditionally an estimated 80% of top foreign policy positions in the US government are held by members of the Council on Foreign Relations (CFR). It's a US think tank with around 5,000 members. They include ex-WMF CEO Katherine Maher and the WMF's most influential consultants, mentioned in its annual Form 990 disclosures as having received six-figure sums for their services. These consultants all have close ties to the Clintons:

  • Whitney Williams, whose williamsworks firm was in charge of the process that set the WMF strategic direction for 2030, is a CFR member. She was formerly Hillary Clinton's Trip Director.
  • Minassian Media, the WMF's PR consultancy for the past few years, was founded by Craig Minassian, ex-press secretary to Bill Clinton and today chief communications officer of the Clinton Foundation. Jove Oliver, a Minassian partner dealing with the WMF account, is a member of the CFR.
  • Trilogy Interactive's Josh Ross was Hillary Clinton's senior digital advisor …

Moreover, the various foundations that have traditionally given the WMF large sums of money – such as the Ford Foundation, Stanton Foundation, Alfred P. Sloan Foundation, Knight Foundation – are all headed by CFR members as well (or a member's spouse, in the case of the Stanton Foundation).

Wikipedians don't belong to this small elite. Wikipedia itself is an anti-elitist project. In most countries (with exceptions like Kazakhstan) volunteers celebrate their project's independence from government, even to the point of contrariness (recall the Pierre-sur-Haute incident). Think tanks aren't welcomed as sources – "Avoid all think tanks like the plague … The influence of think-tanks is pernicious …" was one veteran admin's verdict at WP:RSN.

Smallbones: The politics of Wikipedians and of the folks who run the WMF is always a good topic for speculation and even for conspiracy theories. About all I can say is that, after a period of strong libertarian influence, the general community seems to have become more liberal than the “average American”. Since this is an international project that shouldn’t be unexpected. But I hadn’t heard the one about the Clintons taking over the WMF before.

The Council on Foreign Relations is a highly respected organization. Nobody should be playing a game of "guilt by association" using their name.

Andreas: I think media organisations – including Wikimedia – should be at arm's length from government. As for the Clintons, these approaches were made quite openly as long as a decade ago. Hillary Clinton sent Richard Boly to Wikimania 2012. Boly was then in charge of "an ambitious State Department initiative that uses social media and online platforms to change the way employees communicate and reach outside their boundaries to advance U.S. foreign policy interests. He told Wikipedians we are "kindred spirits" and read out a letter of greeting from Clinton.

There is a growing culture gap. You now hear complaints even from ex-staffers that WMF leadership culture has become indistinguishable from general US corporate culture.

Now, your firemen example. So you come to a traffic light. Firefighters are preventing you from moving off, impressing on you that they are struggling to have enough money to keep the fire engines up and running, need your money today to continue to save lives. Alarmed, you give them $20 of your hard-earned cash. But then you find out that the fire service's purely administrative headcount has increased 10-fold, most of the money is spent on other things than fire engines, its net assets have grown by $100 million over the past five years, and it's also acquired a $100-million endowment in record time, partly funded by your donations. Moreover, its CEO now earns twice as much as ten years ago, and ten times as much as you.

I think anyone who's not a complete doormat would at this point feel slightly had.

And then you learn that the firemen even go to some of the city's poorest quarters, begging for money, making people fear they may have to do without a fire service if they don't give money today – even though the service has already surpassed its own revenue year goals by nearly $50 million.

Smallbones: Can I say that your firefighter example is a bit alarmist? You are asking me to assume all your conclusions into a hypothetical example. Or are you just trying to hose me?

Andreas: In April/May, the WMF was already around $50 million ahead of its revenue year goals. Yet it started fundraising in South American countries badly hit by the pandemic, using banners that implied its independence was under threat. I didn't think that was right.

Smallbones: There’s a very good argument for fundraising around the world. If fundraising is limited to just North America and Western Europe, the WMF will be perceived as just being an organization for the developed world. When hard financial choices have to be made, the developed world would be very influential. The WMF wants to be a world wide organization.

Andreas: Yes, indeed it does! All I have to say in conclusion is that I'd ask Wikipedians to read the Daily Dot article, and share it online if they agree.

If the banners continue to imply there's a financial emergency, or that money should be given to the WMF to "show the volunteers that their work matters", I'd suggest volunteers organise a social media campaign with the tagline "Not in our name" in time for the next fundraiser.

And volunteers should press – in the media, not just on Wikipedia – for non-Wikimedia-affiliated volunteers to be given 50% of the seats on the WMF board. That would be a partnership of equals, and it's the only way things will change.

Smallbones: To conclude, Andreas, you haven’t shown any examples of abusive fundraising adverts, certainly nothing comparable to what I see every day from other nonprofits. I’ll only ask our readers to check out the specific banners that the critics so vociferously complain about.

Sure, the WMF might better tailor their adverts to individual countries, or remove the word “Wednesday” from the banners. But that’s not what you are complaining about, is it? You’ve vacillated on whether WMF spending is a problem. I'm sure you understand that international fundraising makes sense for an international organization.

I still can't understand why you want to delay fundraising until the WMF is close to being broke. Most people will try to ensure that they have enough money before they need it.

That leaves the fact that the relationship between the WMF and editors in some project communities is often contentious. You’re promoting your own views on Wiki-politics above. Fine, but don’t confuse your politics with the proper way to raise funds. Your politics may be more controversial than you think.

The difficulties between the WMF and the editor communities can only be solved by all parties recognizing that they need each other. These difficulties won’t be solved by cutting fundraising.



Reader comments

2021-06-27

Reliability of WikiLeaks discussed

Over the past few months, the use of documents hosted on WikiLeaks as a source for facts has been a topic of discussion across several different noticeboards.

Editors discuss reliability of WikiLeaks

Several discussions began surrounding the reliability of WikiLeaks (see entry in perennial sources list), following a dispute regarding the use of the source in a stub-class biography article of Laotian politician Sisay Leudetmounsone. The biography article's creator, Ruling party, had inserted a WikiLeaks document purporting to be a diplomatic cable of the United States into the article while building the page. After JzG objected to the use of the source, a discussion began at the article's talk page regarding whether the use of the source was consistent with Wikipedia policy.

Discussions on the article's talk page soon spilled over to the biography of living persons (BLP) noticeboard, where editors debated in a now-archived discussion whether or not a specific leaked U.S. diplomatic cable available on WikiLeaks, which lists the biography's subject as a part of the 8th Central Committee of the Lao People's Revolutionary Party, is a reliable source. Editors discussed whether or not the source being available on WikiLeaks satisfies the policy requirement that reliable sources be published, whether WikiLeaks has a reputation for fact-checking and accuracy, and if the extent to which previous discussions that had taken place at the reliable sources noticeboard should play a role in evaluating the particular use of the source. Editors also discussed if other sources whose reliability is not contested existed for the information.

Further discussion occurred in a now-archived thread and a request for comments (RfC) on the reliable sources noticeboard regarding the general reliability of WikiLeaks in providing authentic primary source documents. Initial comments surrounded the use of WikiLeaks as a source for the biography, and the discussion initially focused on whether the cable in question had been reported on by reliable sources or subject to external verification. Many editors noted some skepticism of WikiLeaks' fact-checking and document verification process, though a clear consensus was not immediately reached.

The RfC proposed by Szmenderowiecki raised the question of WikiLeaks' reliability in general, asking editors to consider the extent to which editors believe that WikiLeaks is "per se reliable for the publication of genuine government documents" and whether or not there would be special considerations for cases in which "a reasonable editor may conclude that the coverage from RS [reliable sources] is likely to be minimal or absent on the subject". At the time of writing, the RfC has been archived without a closing summary statement.



Reader comments

2021-06-27

SarahSV

Graffiti art which has appeared on SarahSV's user pages for many years
This article was written by twenty-four Wikipedians.

The Wikipedia community was deeply saddened in early May to hear of the death of SlimVirgin, also known as SarahSV or Sarah, who began editing in November 2004 and became an integral, respected, and influential member of the community. She was foundational in developing many of Wikipedia's core policies, including biographies of living persons, no original research, and verifiability, and drew on her wide range of interests to write some of Wikipedia's finest content.

A prolific contributor, her edits, numbering more than 177,000, give insight into the project's often difficult evolution since its early days. In March 2005 she was nominated for adminship and passed with a majority of 77 to 1. SarahSV's intelligence, direct manner, broad range of policy knowledge, and lucid writing were highly regarded, and for many years her talkpage was the second-most-watchlisted, after Jimbo Wales'. Several editors, on learning of her passing, described her on her talk page memorial as "irreplaceable" to Wikipedia. Her final edit was on April 18, 2021.

Policy development

SarahSV was instrumental in establishing verifiability, not truth.[1] She helped to develop a number of policies that are at the core of the project. After the 2005 Seigenthaler biography incident she played a key role in formulating an early draft of the biographies of living persons policy and in negotiating community acceptance. She continued to watch over and hone the policy, and made nearly half of all edits to the page.

In 2006, SarahSV proposed that the verifiability and no original research policies be consolidated as an attribution policy.[2] Although that merger did not gain consensus after a vote of 424 supports to 354 opposes, her leadership led to significant improvement of both policies, which have served as templates to similar policies on more than 70 Wikipedias. In recent years she was active in simplifying, clarifying, and structurally improving the conflict of interest guideline and related pages.

SarahSV embraced controversial subjects. In her first week on Wikipedia she created the article Death of Jeremiah Duggan, related to the LaRouche movement, which led to her first arbitration case. Both on- and off-wiki she was subjected to extensive harassment, stalking, and misogyny, particularly in the early years. Her experiences helped to inform the community's policies on personal attacks, harassment, and the use of external attack sites. Over time, many of these principles have been incorporated into both Wikimedia Foundation and broader Wikimedia community policies; the current draft of the Wikimedia community universal code of conduct reflects many principles she championed.

Creation of the Gender Gap Task Force

Image from the Gender Gap Task Force

Since her first years at Wikipedia, SarahSV actively sought to highlight and counter systemic bias and to reduce the gender gap. In May 2013 she formed the Gender Gap Task Force[3] to gather information on the gender gap among both editors and content, and to reduce these by encouraging new editors and content development. The Task Force has grown to more than 170 editors who work closely with other WikiProjects in promoting content drives.

SarahSV participated on the Gender Gap Mailing List, promoted editing salons focused on relevant subjects, wrote and improved articles about women, and authored the essay Writing about women. Of particular note, she brought Female genital mutilation to featured article status in 2014. The article was described by the late Brianboulton as covering "a difficult and challenging topic", and "an important contribution to Wikipedia". J Milburn agreed, congratulating her "fortitude in tackling such a difficult and yet important subject". On February 6, 2015, on the International Day of Zero Tolerance for Female Genital Mutilation, it appeared on Wikipedia's homepage as Today's featured article. The article was later recognized by the community with a Million Award.

Featured and other article work

SarahSV was a meticulous researcher, an expert at evaluating sources, and a clear and concise writer. She had eclectic interests and worked on a broad range of topics—including justice, race relations, philosophy, literature, human and animal rights, the Holocaust, religion and faith, current events, and conspiracy theories. She was greatly respected in the FAC community, where she made important contributions to setting and improving standards for featured articles, not least through the quality of her nominations. She was an eloquent, facts-based, and demanding FAC reviewer, and many nominators found her insistence on the correct use of sources intimidating. A consensus is that her guidance was always constructive: on reviewing one biography, she advised editors to "bring out some of the colour, things that make him three-dimensional for the reader ... [to] bring him to life".

She brought Bernard Williams to featured-article status in December 2004, followed in 2006 by three further featured articles: Joel Brand, Rudolph Vrba, and Elie Wiesel's book Night. She nominated Brown Dog Affair in 2007 as a collaboration with other editors. In 2009 she successfully nominated Abu Nidal, Stanley Green, and Marshalsea, followed a year later by the Killing of Muhammad al-Durrah and Death of Ian Tomlinson. In 2014 she was part of a collaboration that brought the notoriously difficult and complex poet Ezra Pound to FA status.

SarahSV's editing reveals a focus of great breadth and depth. She uploaded almost 3000 files, created 4174 pages, and edited more than 22,000 articles; twelve of which she edited more than 1000 times. When working to achieve good article or featured status, she often expanded related articles, such as the Holocaust, Rudolf Vrba, and Christian Science. Other articles on typically difficult subjects that can in large part be credited to her include veganism, Bad Pharma, the Rotherham child sexual exploitation scandal, the disappearance of Madeleine McCann, and Chelsea Manning. She was continuing her work on the Holocaust and the Auschwitz concentration camp until her last edits in late April 2021.

An honest woman here lies at rest,
As e’er God with Her image blest:
The friend of all, the friend of truth;
The friend of age, and guide of youth:
Few hearts like hers, with virtue warm’d,
Few heads with knowledge so inform’d:
If there’s another world, she lives in bliss;
If there is none, she made the best of this.

— adapted from Robert Burns, Epitaph on My Own Friend


Citations



Reader comments

If articles have been updated, you may need to refresh the single-page edition.