0% found this document useful (0 votes)

10 views16 pages

Syllabus

The course 'Data Wise: Data Science in Society' focuses on understanding data as evidence, its implications, and its role in shaping science, policy, and politics. Over six weeks, students will explore concepts such as data circulation, bias, and responsible usage, culminating in debates to apply their knowledge. The course aims to equip students with critical data skills necessary for evaluating data sets and understanding their societal impact.

Uploaded by

pcturnes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views16 pages

Syllabus

Uploaded by

pcturnes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 16

Center for Information Technology

Faculty of Behavioural and Social Sciences

Campus Fryslân

Data Wise:
Data Science in Society
Data as evidence

Course code: SOMIN04

Academic Year 2023-2024
Semester I a
Credits: 2.5 ECTS
Course coordinator: Dr. Raul R. Cordero
Content
0
Content 1
Introduction 2
Learning goals 2
Course structure 2
Practical Information 3
Assessment 3
Attendance & Absence 4

Academic Integrity 4

Proper Referencing 4

Contact information 4

Weekly schedule 5
Week 1 Introduction to Data as Evidence: Characteristics of Data and Big Data Myths 5

Week 2 Data, Evidence and Knowledge 6

Week 3 Data circulation 7
Week 4 Data bias and using data responsibly 8
Week 5 Debating Data as Evidence- Part 1 9
Week 6 Debating Data as Evidence- Part 2 10
Appendices 11
Appendix 1. Assignments and Assessment 11

Course guide Data Wise

Introduction
In today’s digital society, a critical understanding of data is essential for all graduates and
professionals. Knowledge about data is often split into areas of expertise, so that processes that span
algorithms, servers, users and institutions are rarely discussed coherently and accessibly. The topic of
‘data as evidence’ cuts across social, technical, and methodological areas and enables an integrated
approach to data.

This course introduces a set of concepts to assess how data shapes science, policy, and politics,
including how data is turned into metrics that are used to make decisions. Using a series of concrete
examples from areas such as sports, health and climate change, it connects data as a highly
technological practice to broad social questions of evidence.

In this course, we will focus specifically on the recent changes in how we use data as evidence. We
will analyze the recent growth of types and amount of data (datafication) and the different ways data
can be used as evidence. By the end of the course, you will master the following concepts: causation,
correlation, description, features, probability, sampling, model, population. You will also learn to
analyze data cycles and to map out knowledge production systems. A number of current issues will
also be examined from the perspective of data: fake news, bubbles, algorithmic discrimination, and the
prominence of data in the anti-pandemic measures. Through the combination of conceptual tools and
assignment, you will be well equipped to assess data sets and to address what you can and cannot do
with it.

In particular, the course will address the conceptual and analytic tools you need to develop good
critical data skills.

The elective course provides a theoretical basis for your Collaborative Data Projects and connect to
other courses through the centrality of data in all the questions addressed.

Learning goals
Upon the successful completion of this course, students will be able to:
1) Understand data production and data journeys and their implications
2) Understand technical, functional and epistemic dimensions of data and their interactions
3) Evaluate data as evidence in relation to knowledge needs
Course structure
The course runs for 6 weeks. There is one class of two hours every week.

The course accounts for 2.5, distributed across the following elements:
12 hrs lectures and in-class sessions
18 hrs reading and preparation for the lectures
18 hrs assignment 1
18 hrs assignment 2
4 hrs preparation of debate

Course guide Data Wise

Practical Information
Readings and Resources
Please see weekly schedule for precise reading assignments (for example, chapters from books)

Brightspace

We use the virtual learning environment “Brightspace” as the main platform for communication and
Blackboard Collaborate for the online classes. Here, you’ll find recommended literature, information
on assignments and your grades. Announcements regarding schedule -or content changes will also
be published in Brightspace.

All essential information about the course can be found in this syllabus. However, as we reserve the
right to change the syllabus, please keep track of Brightspace for the most up-to-date information.

Assessment
In this course, two assignments and one in-class activity will be assessed, as well as participation. All
aim to provide you with the opportunity to practice critical analysis skills and to receive feedback on
these. Such skills are essential to data work: they will enable you to perform really valuable work.
Many people can answer data questions, much fewer are able to make a strong case why a particular
data set is suitable to answer a particular question, or to improve the formulation of questions in
order to make sure you are asking the important questions, the right questions.

In all cases, the data and tasks may seem simple: this is deceptive! The assignment is not meant to
test your data crunching skills but to provide an opportunity to slow down and consider critically, in
detail, the assumptions, erasures, biases and constraints that arise in each step taken. Carefully
analyzing and reflecting on these aspects will enable you to develop basic skills that you will then be
able to apply to more complex situations.

The assessment will be done on the basis of two assignments each worth 30% and preparation and
performance in a debate 20% and a participation grade worth 20%. See appendix.

Detailed assignment and assessment descriptions can be found in Appendix 1.

Deadlines for the assignments are

Report Assignment #1: 15 October (23.59)
Report Assignment #2: 31 October (23.59)

Course guide Data Wise

The debates are scheduled during the sessions of 17 and 24 October. You must attend these
sessions in order to do fulfil the requirements for this assessment.

Attendance & Absence

If you are unable to attend a class, please inform your lecturer per email and add the educational
secretary in cc. All contact details are provided below.

Academic Integrity
Cheating and plagiarism are academic offences, with severe consequences. They are acts or
omissions by students to partly or wholly hinder accurate assessment. As per the Teaching and
Examination Regulations, cases of cheating and plagiarism are reported to Exam Board and the
Board will decided upon the consequences.

Proper Referencing
Use an accepted academic referencing system such as author-date, Vancouver-numeric, etc. Different
fields use different referencing systems because different systems are more suitable for the specific
kind of scientific communication (articles, book) and the particular type of sources in a field (artworks,
legal texts, books, articles, patents, images).

If you directly quote a source (for example, take over a piece of text or passage in an article), you
must indicate this with quotation marks at the beginning and end of the quotation, and refer to the
source, including page number. If you paraphrase a source (write up what it says in your own words),
you must also indicate the source in the text with a reference. If you put forth particular facts or
statements about situations or phenomena that are not ‘common knowledge’, you must also provide
a reference.

It is NEVER suitable to simply add a list of sources at the end of a text. Proper referencing has two
components:
(1) You must indicate IN THE TEXT whenever a source is used
2) You must provide a list of these sources

If you do not do this, you make it impossible for the reader to evaluate the originality and strength of
your work. If you do not properly reference your sources, this amounts to plagiarism and will be
treated as such. This means your assignment will be referred to the Exam Board. Penalties for
plagiarism are serious and can vary from (but are not limited to) an automatic fail for the assignment,
exclusion from the course, or the impossibility of graduating with distinction.

Here is a brief introduction: https://www.youtube.com/watch?v=AMWPTztsk2A. There are also many

tutorials, wizards and software packages that can support proper referencing. If in doubt, consult your
instructor.
Contact information
The course coordinator of Data as Evidence is Dr Raul R. Cordero – [email protected]
General questions or suggestions with regard to the course can be addressed to the educational
secretariat. Email: [email protected] / phone number: 050-3634766.

Course guide Data Wise

Weekly schedule
Week 1 Introduction to Data as Evidence: Characteristics of Data and Big Data
Myths

Objectives
Students will be able to
• Understand recent dynamics in modes of data production, (including technological,
epistemic, economic, cultural and institutional dimensions)
• Recognize different discourses on data (big data hype, etc.)

Class Thu 19/09 13:00-15:00

Preparation
Readings must be done BEFORE class. That way, we can have more in-depth discussions during our
sessions. There is a bit more required reading this week! Be sure to take enough time to do all of it
and start the course off in a position of strength.

Required reading
Chapter 3, Characteristics of Data, in Beaulieu, A., & Leonelli, S. (2021). A Critical Introduction to
Data and Society. SAGE Publications Ltd.
Boyd, D., & Crawford, K. (2011). Six Provocations for Big Data (SSRN Scholarly Paper No. ID
1926431). Retrieved from Social Science Research Network website:
https://papers.ssrn.com/abstract=1926431
Alaimo, Cristina, and Jannis Kallinikos. 2017. ‘Computing the Everyday: Social Media as Data
Platforms’. The Information Society 33 (4): 175–91.
https://doi.org/10.1080/01972243.2017.1318327.

Additional reading (highly recommended, but not compulsory)

Beer, D. (2015). Productive measures: Culture and measurement in the context of everyday
neoliberalism. Big Data & Society, 2(1), 205395171557895.
https://doi.org/10.1177/2053951715578951

In this first class, we will first briefly revisit the concept of datafication and the intensification of data in
many spheres of life from Introduction to Data. We will quickly move on to address data creation and
the characteristics of data. Finally, we will review a number of big data myths and critically assess
them. Assignment 1 will be explained.

Course guide Data Wise

Week 2 Data, Evidence and Knowledge

Objectives
Students will be able to:
• Analyze the context of data, platforms, creation and how it shapes meaning of data
• Apply concepts of traces versus observations and measurements & samples, populations,
bias to data they encounter
• Distinguish between Relational and representational views of data
• Be able to assess the limitations of data sets, including the issue of dark data and
phenomena such as Simpson’s paradox

Class Thu 26/09 13:00-15:00

Preparation
Required reading
There are three required readings for this week If you are quite new to data science (and actually, this
is a good refresher/overview for everyone) you might want to start by reading this document:
Data Science: A guide for society
https://www.askforevidence.org/articles/data-science-a-guide-for-society

Chapter 4, Data, Evidence and Knowledge in Beaulieu, A., & Leonelli, S. (2021). A Critical
Introduction to Data and Society. SAGE Publications Ltd.
Rieder, Gernot, and Judith Simon. 2016. ‘Datatrust: Or, the Political Quest for Numerical Evidence
and the Epistemologies of Big Data’. Big Data & Society 3 (1): 2053951716649398.
https://doi.org/10.1177/2053951716649398.

Additional reading (highly recommended, but not compulsory)

Chapter 2 of Schutt, R., & O’Neil, C. (2013). Doing Data Science. O’Reilly & Associates.

Content
We will explore what it means to use data as evidence. What are our criteria for evidence, and how
does data fulfill it? Are there other types of evidence? What are the differences between observation
and interpretation in knowledge production? We will further build on the insights about data creation
and provide concepts to name and navigate some of the ways in which data gets used and (features
of data sets) and valued as evidence (objectivity, quantification).

Also in this session, we will address key concept in using data for knowing and discuss how data
creation relates to what you want to do and can do with data—and vice versa. We will look at a
number of cases where we ‘know’ something on the basis of data and explore where trust in this
knowledge comes from. We will work towards a layered understanding of sampling and populations,
of the relation between technology and knowledge, of correlation and causation, prediction and
pattern detection, and of how black boxes play a role in data ecosystems (as basis for assignment 2).

Course guide Data Wise

Week 3 Data circulation

Objectives
Students will be able to
• Understand how data circulates and is put to work
• Apply concepts to describe and assess how data shapes science, policy, and politics
(causation, correlation, prediction, description, features, probability, sampling, model,
population, black boxes)
• Distinguish the role of models, conventions, infrastructure, visualization and curation in
turning data into evidence

Class I Thu 3/10 13:00-15:00

In our third week, we will focus on the importance of circulation for the meaning and value of data. A
number of concepts will be discussed that help us understand this circulation. The topic of data
visualization will also be an important topic, as a way of linking the issues of evidence and knowledge
we discussed in week 2 and the dynamics of circulation of data.

Preparation

Required reading
Chapter 5, Data Circulation in Beaulieu, A., & Leonelli, S. (2021). A Critical Introduction to Data and
Society. SAGE Publications Ltd.

Optional reading
Hand, David. 2019. ‘What Is the Purpose of Statistical Modelling?’ Harvard Data Science Review 1 (1).
https://doi.org/10.1162/99608f92.4a85af74.

Course guide Data Wise

Week 4 Data bias and using data responsibly

Objectives
Students will be able to:
1) Critically assess the way data shapes science, policy, and politics, including how data are turned
into metrics that are used for decisions

2) Discuss data justice and fairness in the face of data inequalities

3) Assess data sets for what you can and cannot do with them, how they might lead to outcomes that
can be negative (fake news, algorithmic discrimination) or positive (removing barriers, greater
efficiency and fairness)
4) Recognize how guidelines, codes of conducts and regulations apply to different data processes

Class Thu 10/10 13:00-15:00

Preparation

Required reading
Zook, M., Barocas, S., Boyd, D., Crawford, K., Keller, E., Gangadharan, S. P., … Pasquale, F. (2017).
Ten simple rules for responsible big data research. PLOS Computational Biology, 13(3),
e1005399. https://doi.org/10.1371/journal.pcbi.1005399

Course guide Data Wise

Week 5 Debating Data as Evidence- Part 1

Objectives
Students will be able to
1) Apply the models and concepts learned during the course to argue specific positions with
regards to data issues
2) Demonstrate their grasp of the complex dynamics that shape how we value data and the
processes that rely on data
3) Debate an assigned position, listen to dissenting arguments, and counter the points brought
forward in the course of the debate.

Class 17/10 13:00-15:00 PHYSICALLY CO-PRESENT SESSION

Preparation

Required reading
Artificial Intelligence and Gender Bias. (n.d.). Retrieved 24 August 2019, from Catalyst website:
https://www.catalyst.org/research/trend-brief-gender-bias-in-ai/
Hao, K. (n.d.). This is how AI bias really happens—And why it’s so hard to fix. Retrieved 24 June 2019,
from MIT Technology Review website: https://www.technologyreview.com/s/612876/this-is-how-ai-
bias-really-happensand-why-its-so-hard-to-fix/

The Correspondent, 2020. The original Big Tech is working closer than ever with governments to
combat coronavirus – with no scrutiny
https://thecorrespondent.com/621/the-original-big-tech-is-working-closer-than-ever-with-
governments-to-combat-coronavirus-with-no-scrutiny/81373317498-76dea099

The Correspondent, 2020. Coronavirus apps show governments can no longer do without Apple or
Google https://thecorrespondent.com/546/coronavirus-apps-show-governments-can-no-longer-do-
without-apple-or-google/19583175612-bc49b1e0

O’Neil, C. (2016). Weapons of Math Destruction: How Big Data Increases Inequality and Threatens
Democracy. Crown/Archetype. Chapter 1

Additional reading (optional)

Neff, G., Tanweer, A., Fiore-Gartland, B., & Osburn, L. (2017). Critique and Contribute: A Practice-
Based Framework for Improving Critical Data Studies and Data Science. Big Data, 5(2), 85–97.
https://doi.org/10.1089/big.2016.0050

In this class, we will hold a debate on topics 1 and 2. You must attend class and take part in the
debate to be assessed on this part of the course.

Course guide Data Wise

Week 6 Debating Data as Evidence- Part 2

Class Thu 24/10 13:00-15:00 PHYSICALLY CO-PRESENT SESSION

Preparation

Required reading (same as previous week)

Artificial Intelligence and Gender Bias. (n.d.). Retrieved 24 August 2019, from Catalyst website:
https://www.catalyst.org/research/trend-brief-gender-bias-in-ai/
Hao, K. (n.d.). This is how AI bias really happens—And why it’s so hard to fix. Retrieved 24 June 2019,
from MIT Technology Review website: https://www.technologyreview.com/s/612876/this-is-
how-ai-bias-really-happensand-why-its-so-hard-to-fix/
O’Neil, C. (2016). Weapons of Math Destruction: How Big Data Increases Inequality and Threatens
Democracy. Crown/Archetype. Chapter 1

Additional reading (optional)

Neff, G., Tanweer, A., Fiore-Gartland, B., & Osburn, L. (2017). Critique and Contribute: A Practice-
Based Framework for Improving Critical Data Studies and Data Science. Big Data, 5(2), 85–
97. https://doi.org/10.1089/big.2016.0050

In this class, we will hold a debate on topics 3 and 4. You must attend class and take part in the
debate to be assessed on this part of the course.

Course guide Data Wise

Appendices
Appendix 1. Assignments and Assessment

Assignment 1 Carbon Footprint Calculators as Black Boxes (30%)

Report Assignment #1: deadline 15 October (23.59)

This assessment will help consolidate these two learning outcomes:

1) Understanding technical, functional and epistemic dimensions of data and their interactions
2) Evaluate data as evidence in relation to knowledge needs

For this assignment, you will investigate two elements that go into using data as evidence and assess
which conclusions you can draw. In this report, use concepts from lectures and readings from the first
four weeks of the course.

The case at the heart of this assignment is the calculation of the carbon footprint of your latest
holiday. In this assignment, you will have to
(1) Generate data on the relevant aspects of your latest holiday
(2) Use at least three different carbon calculators to analyse this data
(3) Put forth a conclusion on your carbon footprint based this data and the carbon calculation

Please follow these steps for your investigation:

1. Write a description of your holiday.
2. Identify which elements have a significant carbon impact.
3. Calculate the carbon impact of your holidays, using 3 different carbon calculators. Consider
differences in the input requested and in the output.
4. For each of the calculators, investigate how the calculations are made using information from
the calculators’ website or other sources (like reviews of carbon calculators). Comparing the
outcomes is a useful way to investigate how the calculator is working.
5. Consider the outcomes of the three calculators and evaluate the results.
6. Select the results that you think are most valuable in answering the question ‘what is the
carbon footprint of my last holiday’. Articulate why you chose one over the other.

In your report include

• A brief description of your holiday
• A definition of what you consider a significant carbon impact and why
• A description of the three calculators, of the processes and outcomes (how calculations are
made, how they differ, what you found out about how they work)
• A discussion of how you evaluated each of the calculators and their outcomes
• A discussion of the insights you find most valuable in answering the question and an explicit
account of why you trust the results.
The report should be about 1500 words long. Please feel free to use annotated screenshots or other
supporting material.

Course guide Data Wise

Element in your assignment # points IMPORTANT
Formal elements: word count, clarity, 5 Please be sure to properly reference your
styles, referencing sources (see Course Guide and slides from
lecture)
Definition of evaluation criteria 5
(carbon footprint)
Description of the three calculators, 5 Be specific in how they differ and how these
the processes and outcomes differences matter.
An account of your evaluation of the 10 Provide clear explanations of how you
calculators evaluate the calculators; use your critical
thinking skills from assignment 1.
An account of what you find to be the 10 Show that you can think critically about black
most valuable insights in evaluating boxes.
(according to the criteria you defined!)
Provide an explanation of why you 10 Show you can explain your basis for trust in
trust (some of) the insights (or not) outcomes of data processes.

Course guide Data Wise

Assignment 2. Strengths, Limitations and Shaping of Data: Understanding data in context (30%)

Report Assignment #2: 31 October (23.59)

This assignment will help consolidate these two learning outcomes

1) Understanding data creation and data journeys
2) Understanding technical, functional and epistemic dimensions of data and their interactions

This assignment is the opportunity to develop your skills in assessing data sets for what you can and
cannot do with them, how they might lead to outcomes that can be negative (fake news, algorithmic
discrimination) or positive (removing barriers, greater efficiency and fairness) and what makes them
suitable to answer particular questions and be analyzed in particular ways.

There are many aspects of data that shape how it can and cannot be used. Your task in this
assignment is to discover the strengths and limitations of a data set. This assignment will enable you
to4.6 Conclusion:
develop critical data Data
analysisscience in a relational
skills regarding the productionperspective
of data and to understand how
decisions about technical and conceptual aspects shape the data and what can be done with it.
We can now compare the more abstract cycle of knowledge production to the model of
To do this assignment:
data journeys in data science that we discussed earlier in Chapter 3 (see Figure 4.4).
-Get to know your data set
-Use the cycle of research from Beaulieu and Leonelli, chapter 4

Figure
Figure 4.4 cycle
1 The The steps of data
of research journeys
according toin
thedata science
relational (O’Neil
view and Schutt, 2013)
on data.
superimposed on the model of the research process. Adapted from Leonelli (2019 [EJPS]).

This combination of the two models enables us to see how data analysis functions as a
type of knowledge production. It also helps illustrate the kinds of work needed13 in
different steps of the process of knowledge production. As we turn to the steps involved
in data journeys, we will see how these different steps are structured by infrastructures
and
Course conventions
guide Data Wise (among other elements). These make it possible to put data to work.
We will also explore the knowledge and skills needed to ensure the integration of these
activities, so that the cycle can be completed.
-Reflect on each of these steps, to understand how each step implies interaction between the analysis
and the data, between the researcher and her or his object of study.
-report on the steps above, using the graphic of the cycle to structure your analysis. For each step,
make explicit the assumptions that go into shaping the data and what you can do with it. If your
project is not yet so advanced that you have done all the steps, that is not a problem. Be thorough
and detailed in what you have done with the data so far, and what you have been able to find out
about the data creation (if you didn’t collect/create it yourself)

Examples of some of the assumptions to interrogate along the way:

• Populations (n=all)
• Sampling suitability
• Actors (Who is represented by the data, who will be using the data)
• Place, geography (Where does the data come from? Where has it travelled to?)
• Practices (What kinds of analyses will be done? How?)
• Infrastructure (Which kinds of tools of platforms were used? How did this shape the data
collection, the formats and the data?)
• Conventions used
• Awareness of data collection (Is data generated implicitly? Explicitly?)
• Material aspects (computing power available, speed, etc)
• Analysis (Which types of analysis can be done with this data? Which types cannot? How could
you check this?)
• And so on… All of the above are discussed in the first 4 weeks of the course.

Be sure to name and make explicit how each of these assumptions shapes how you interpret,
value and use data as evidence.

The report should be about 2000 words long. Please feel free to use annotated screenshots or other
supporting material.

Your assignment will be graded as follows:

Element in your assignment # points IMPORTANT

Formal elements: word count, clarity, 5 Please be sure to properly reference your
styles, referencing sources (see Course Guide and slides from
lecture)
Use of literature from the course (at 10 You must be specific in how you refer to the
least 6 different sources) course material. For example, “Boyd and
Crawford (2011) also discuss algorithms” is
NOT specific enough. Demonstrate that you
have read, understand and can apply the
concepts in the readings.
Clear application of analytic categories 10 Choose the most relevant steps for your data
and steps of scheme (addressing and select the best questions to address.
issues in at least 2 different steps)

Course guide Data Wise

Concrete explanation of technical, 10 Provide clear explanations of the different
functional and epistemic dimensions dimensions and think about how they relate
of data and their interactions (what the interactions between them are)
Analysis of how different assumptions 10 Show that you can think critically about how
and decisions shape how the data can data is fit for different purposes.
(or cannot) be used
Debate

This in-class activity will be assessed, for 20% of your grade. Both preparation and participation will
be graded. More details will be given in class.

Participation

Your grade will be based on your preparation for the classes (done the readings, etc), your
participation in the in-class activities and discussions, and of course your presence across the
sessions. This is worth 20% of your grade.