0% found this document useful (0 votes)
22 views5 pages

DSM020 Module Specification

Uploaded by

Christopher Neo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
22 views5 pages

DSM020 Module Specification

Uploaded by

Christopher Neo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

MSc Data Science

Module Specification
Key Information
Module title Data Programming in Python (Core Module)
Level 7 Credit value 15
Notional study hours
Member Institution Goldsmiths 150
and duration of course
Module lead author/ Sean McGrath
Subject matter expert
Matthew Yee-King
Module co-author

Rationale for the module


This module will provide you with the programming skills necessary to carry out the programming
tasks in the rest of the programme.

Aims of the module

This module aims to provide you with the programming skills you will need to carry out programming
tasks you will encounter in the other modules in this programme. You will learn about general
programming techniques such as variables, functions and control flow. You will learn how to work with
different types of data structures such as arrays and dictionaries. You will develop data processing
pipelines, which allow you to convert raw data into data that you can analyse. You will apply
mathematical and statistical procedures to data. You will learn how to plot graphs of various types.
You will also familiarise yourself with an industry standard data science programming environment
which you can use throughout the programme.
Topics covered in this module:
The following is an indicative list of topics:

1. Data structures
2. Reading and writing data on the filesystem
3. Retrieving data from the web
4. Retrieving data from databases using query languages
5. Cleaning and restructuring data, part 1
6. Cleaning and restructuring data, part 2
7. Data plotting
8. Version control systems
9. Unit tests
10. Data processing pipelines

Learning outcomes for the module

By the end of this module, you will be able to:


1. Critically evaluate data sets and write programs that use appropriate programming techniques
to clean and analyse the data.
2. Critically evaluate data sets and write programs that use appropriate data structures for storing
the data.
3. Critically use database querying languages to interact with data from real world databases.

Learning and teaching strategy and methods


This module has 150 notional study hours. The module is organised into 10 topics, with between 1-2
hours of learning content and 10-12 hours of study required per topic. The remaining study time is
intended for coursework and exam preparation. The topics will contain the following types of learning
content:
The static information pertaining to a topic will be presented using a range of static learning content
types. Some static learning content types are detailed in the table below:
Static content Description
type
Video lesson A video involving a presenter explaining some aspect
of a topic.
Slides Slides presented during a video can be provided in
an electronic form.
Reading Text based material, for example, instructions about
a programming exercise, or material from a text book.
Computer Where appropriate, program code can be provided,
program code for example, example code developed during a video
lesson.
Media resources Media files that are relevant to the course material.
Data Data files that are relevant to the course material.
Web links Where appropriate, links can be provided to
externally hosted content, for example, programming
language documentation.
In addition to these static content types, there will be a range of interactive content types. Some of
these might form part of an assessment. Assessments are discussed further in the Assessment
Strategy section. The interactive content types that can appear in a module are listed in the table
below.
Interactive Description
content type
Discussion Students are given a prompt for a discussion. The
prompt discussion then takes placed in an online discussion
forum.
Quiz question Quiz questions typically involve a question followed by
a set of multiple-choice answers. They are typically
automatically marked.
Programming A programming task involves the provision of a
task specification for a computer program or a part of a
program that needs to be written. The program code
can then be uploaded to the virtual learning
environment. It might be automatically marked or it
might be marked by a tutor.
Writing task A writing task involves the provision of a specification
for a piece of writing that a student should complete.
The writing would typically be uploaded to the virtual
learning environment to be marked by a tutor.
Interactive An interactive program or simulation is a computer
program or program or simulation that the student can run and
simulation interact with. Sometimes, they might be required to
answer questions about it, or to complete a task within
it.
Peer review In a peer review activity, students are required to
activity submit a piece of work, which might be program code,
written work, images, videos etc. They then review the
work of other students.

Assessment strategy, assessment methods


Summative and Formative Assessments
The module will contain a range of summative and formative assessments. Summative assessments
are assessments which contribute directly towards your final grade. Formative assessments do not
count directly towards your final grade. Instead, they provide you with opportunities for low stakes
practice, and will often provide some sort of feedback about your progress. For example, a practice
quiz might provide you with feedback about why a particular answer was wrong.

There will be regular quizzes to help you develop your learning and gauge your progress.

Assessment Activities
The table below lists the assessment activity types you might encounter taking the module. It also
states if that type of assessment can be automatically graded. For example, multiple choice quizzes
can be automatically graded, and so can some programming assignments. It also states if that type of
assessment will be found in the summative coursework. More details about the summative
assessments are provided below.
Assessment Can it be CW1 CW2
activity type automaticall
y graded
with
feedback in
some cases?
Writing task X X
Programmin X X X
g task

The table below shows what assessment elements will be used to assess if you achieved a particular
module learning output.
Learning outcome/ Assessment activity type Quiz Writing Programming Peer review
task task task
1. Write data analysis programs that use X
variables, functions and control flow.

2. Identify and justify the use of appropriate X X


data structures for the storage of different types of
data.

3. Load data from the filesystem and web- X


based resources and prepare it for analysis by
cleaning and structuring it.

4. Use database querying languages to filter X X


and import data from databases.

5. Formulate and implement algorithmic X X X X


solutions to data analytic problems, including data
cleaning.

Pass Mark

To pass this module you must get a mark of at least 50% in each element of assessment, subject to
the application of rules for compensation. Please see the programme regulations for more information.

Assessment Elements

Coursework (100% weighting)


There are two items of coursework for this module. Each contributes to the final assessment mark for
this module as follows:
Coursework 1 30% (deadline – week 12 – see VLE for date)
Coursework 2 70% (deadline – week 21 – see VLE for date)

The deadlines provided are indicative and subject to change. You should always refer to the
VLE for the most up to date information.
Additional Information
It is important that you take the time to refer to the overview of assessment rules on our website as
well as the detailed ‘Rules for taking written examinations’, ‘Plagiarism rules for coursework’ and the
‘Assessment Offences’ sections in the General Regulations. Further information on academic integrity
and good practice can be found in the Student Guide, and in the Study Skills module of the Orientation
Session in the Virtual Learning Environment.
Learning resources
Learning materials

Learning activities and formative assessments for this module are designed and developed by the
Lead Author/Module Leader
and incorporate the following:
• Short videos
• Learning activities
• Formative assessments
• Learning content.
The learning content is drawn from an essential reading or readings (see below). The materials are
available in a variety of e-formats to enable access on a variety of devices. The content is divided into
topics.

E-resources

All students will be given access to the University of London’s e-resources through the VLE and the
University of London Online Library. You will be directed to particular journals within each topic of
study and given other online resources to enhance your information literacy skills as part of the
induction.

Core texts

Python for Data Analysis, 2e (2017): Data Wrangling with Pandas, Numpy, and Ipython

You might also like