DSM020 Module Specification
DSM020 Module Specification
Module Specification
Key Information
Module title Data Programming in Python (Core Module)
Level 7 Credit value 15
Notional study hours
Member Institution Goldsmiths 150
and duration of course
Module lead author/ Sean McGrath
Subject matter expert
Matthew Yee-King
Module co-author
This module aims to provide you with the programming skills you will need to carry out programming
tasks you will encounter in the other modules in this programme. You will learn about general
programming techniques such as variables, functions and control flow. You will learn how to work with
different types of data structures such as arrays and dictionaries. You will develop data processing
pipelines, which allow you to convert raw data into data that you can analyse. You will apply
mathematical and statistical procedures to data. You will learn how to plot graphs of various types.
You will also familiarise yourself with an industry standard data science programming environment
which you can use throughout the programme.
Topics covered in this module:
The following is an indicative list of topics:
1. Data structures
2. Reading and writing data on the filesystem
3. Retrieving data from the web
4. Retrieving data from databases using query languages
5. Cleaning and restructuring data, part 1
6. Cleaning and restructuring data, part 2
7. Data plotting
8. Version control systems
9. Unit tests
10. Data processing pipelines
There will be regular quizzes to help you develop your learning and gauge your progress.
Assessment Activities
The table below lists the assessment activity types you might encounter taking the module. It also
states if that type of assessment can be automatically graded. For example, multiple choice quizzes
can be automatically graded, and so can some programming assignments. It also states if that type of
assessment will be found in the summative coursework. More details about the summative
assessments are provided below.
Assessment Can it be CW1 CW2
activity type automaticall
y graded
with
feedback in
some cases?
Writing task X X
Programmin X X X
g task
The table below shows what assessment elements will be used to assess if you achieved a particular
module learning output.
Learning outcome/ Assessment activity type Quiz Writing Programming Peer review
task task task
1. Write data analysis programs that use X
variables, functions and control flow.
Pass Mark
To pass this module you must get a mark of at least 50% in each element of assessment, subject to
the application of rules for compensation. Please see the programme regulations for more information.
Assessment Elements
The deadlines provided are indicative and subject to change. You should always refer to the
VLE for the most up to date information.
Additional Information
It is important that you take the time to refer to the overview of assessment rules on our website as
well as the detailed ‘Rules for taking written examinations’, ‘Plagiarism rules for coursework’ and the
‘Assessment Offences’ sections in the General Regulations. Further information on academic integrity
and good practice can be found in the Student Guide, and in the Study Skills module of the Orientation
Session in the Virtual Learning Environment.
Learning resources
Learning materials
Learning activities and formative assessments for this module are designed and developed by the
Lead Author/Module Leader
and incorporate the following:
• Short videos
• Learning activities
• Formative assessments
• Learning content.
The learning content is drawn from an essential reading or readings (see below). The materials are
available in a variety of e-formats to enable access on a variety of devices. The content is divided into
topics.
E-resources
All students will be given access to the University of London’s e-resources through the VLE and the
University of London Online Library. You will be directed to particular journals within each topic of
study and given other online resources to enhance your information literacy skills as part of the
induction.
Core texts
Python for Data Analysis, 2e (2017): Data Wrangling with Pandas, Numpy, and Ipython