Since a lot of researchers show up at these events, and many do not know the huge array of research tools we have. We'll introduce a set of datasources and explain what types of questions each datasource is best suited for. We'll also cover a host of tools and libraries designed to make analysis easier. Finally, we'll demonstrate an example project that makes use of multiple datasources. After the demonstration, participants will be encouraged to explore and start their own work. Organizers will be available to answer questions.
Tools
- Quarry
- pywikibot
- MediaWiki Utilities
- WikiBrainAPI
- ORES
Datasources
- LabsDB
- XML dumps
- MediaWiki API
- Curated datasets
- Scholarly citations
- Clickstream
- Article Feedback
- Teahouse corpus
- Reverts
Note that this workshop would serve a dual purpose. (1) to support a the work of Wikipedian Researchers (the type of researchers who come to Wikimania) (2) to identify pain points in using datasets and tools and prioritize solving them.