Task 2P-1
1 Introduction
This task is related to Module 2 (Sections 2.1–2.4; see the Learning Resources on the unit site or, even
better, Chapters 2–3 of Minimalist Data Wrangling with Python).
This task is due in Week 3 (Sunday, 19th Jan). Start tackling it as early as possible. If we find your first
solution incomplete or otherwise incorrect, you will still be able to amend it based on the generous
feedback we will give you (allow 3–5 working days). In case of any problems/questions, do not hesitate
to attend our on-campus/online classes or use the Discussion Board on the unit site.
Submitting after the aforementioned due date might incur a late penalty. The cut-off date is Week 4
(Friday). There will be no extensions (this is a Week 2 task, after all) and no solutions will be accepted
thereafter. At that time, if your submission is not 100% complete, it will be marked as FAIL, without the
possibility of correcting and resubmitting. This task is part of the hurdle requirements in this unit. Not
submitting the correct version on time results in failing the unit.
All submissions will be checked for plagiarism. You are expected to work independently on your task
solutions. Never share/show parts of solutions with/to anyone.
2 Questions
Create a single Jupyter/IPython notebook (see the Artefacts section below for all the requirements – read
the whole task specification first!), in which you perform the following.
The use of pandas is forbidden. You can use scipy, though.
Do not use for loops or list comprehensions – this is an exercise on numpy.
Q1. Download the daily close BTC-to-USD data, from 2023-01-01 up to 2023-12-31, available at
https://finance.yahoo.com/quote/BTC-USD (the Historical Data tab).
Q2. Use numpy.genfromtxt or numpy.loadtxt to read the above BTC-to-USD data as a numpy
vector named rates.
Option: You can use a spreadsheet application such as LibreOffice Calc or MS Excel to manually
remove everything except the numeric values in the Close column. The column labels should also
be manually deleted. Export these observations to a CSV file (which should only contain numbers,
one per line). You can also use features in numpy.genfromtxt or numpy.loadtxt to remove
them.
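Hint (a minimal sketch of one possible approach; the file name btc_usd_2023_close.csv is hypothetical,
and skip_header=1 is only needed if the column label was kept in the exported CSV):
import numpy as np
# Read one closing price per line; drop skip_header if the file contains numbers only.
rates = np.genfromtxt("btc_usd_2023_close.csv", delimiter=",", skip_header=1)
print(rates.shape)  # expect (365,) - one closing price for each day of 2023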
Q3. For the fourth quarter of the year only (Q4 2023; days 274–365 inclusive), determine and display (in
a readable manner) the following aggregates:
• arithmetic mean,
• minimum,
• the first quartile,
• median,
• the third quartile,
• maximum,
• standard deviation,
• interquartile range.
Reference result from Q3 2023 (yours can be prettier):
## arithmetic mean: 28091.33
## minimum: 25162.65
## Q1: 26225.56
## median: 28871.82
## Q3: 29767.07
## maximum: 31476.05
## IQR: 3541.51
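Hint (a sketch of one possible approach, assuming rates[0] corresponds to 1 January, i.e., day 1, so that
days 274–365 are rates[273:365]):
# rates as read in Q2
q4 = rates[273:365]
q1_, q3_ = np.quantile(q4, [0.25, 0.75])
print(f"arithmetic mean:    {np.mean(q4):.2f}")
print(f"minimum:            {np.min(q4):.2f}")
print(f"Q1:                 {q1_:.2f}")
print(f"median:             {np.median(q4):.2f}")
print(f"Q3:                 {q3_:.2f}")
print(f"maximum:            {np.max(q4):.2f}")
print(f"standard deviation: {np.std(q4):.2f}")
print(f"IQR:                {q3_ - q1_:.2f}")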
Q5. Determine the day numbers (with 274 denoting 1 October) with the lowest and highest observed
prices in Q4 2023. Below is an example of the lowest and highest price days in Q3 2023.
## Lowest price was on day 254 (25162.65).
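Hint (a sketch using numpy.argmin/numpy.argmax; it assumes the same day-numbering convention as
above, so that index 0 within the Q4 slice corresponds to day 274):
q4 = rates[273:365]                 # Q4 2023, days 274-365
day_lowest = 274 + np.argmin(q4)    # convert the index within q4 back to a day number
day_highest = 274 + np.argmax(q4)
print(f"Lowest price was on day {day_lowest} ({np.min(q4):.2f}).")
print(f"Highest price was on day {day_highest} ({np.max(q4):.2f}).")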
Note that all packages must be imported and the data must be loaded at the beginning of the notebook (only once!).
Q6. Using matplotlib.pyplot.boxplot, draw a horizontal box-and-whisker plot for the Q4 2023
daily price increases/decreases as obtained by a call to numpy.diff.
Using an additional call to matplotlib.pyplot.plot, mark the arithmetic mean on the box
plot with a green “x”.
In your own words, explain what we can read from the plot. Below is a reference plot from Q3 2023.
[Reference plot: Distribution of BTC-to-USD daily price increases in Q3 2023 (horizontal box plot).]
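Hint (a minimal sketch; vert=False requests a horizontal box plot, and the single box is drawn at y = 1 by
default, which is where the mean marker is placed):
import matplotlib.pyplot as plt
diffs = np.diff(rates[273:365])       # day-to-day price changes within Q4 2023
plt.boxplot(diffs, vert=False)        # horizontal box-and-whisker plot
plt.plot(np.mean(diffs), 1, "gx")     # mark the arithmetic mean with a green "x"
plt.title("Distribution of BTC-to-USD daily price increases in Q4 2023")
plt.show()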
Q7. Count (programmatically,using the vectorised relational operators from numpy) how many outliers
the boxplot contains (for the definition of an outlier, consult Section 2.3 of our learning materials
on the unit site or Section 5.1 in the Book). In your own words, explain what such outliers might
mean in the current context.
## There are 16 outliers.
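Hint (a sketch assuming the standard 1.5 × IQR whisker rule, which is also matplotlib's default; verify it
against the definition given in the learning materials before relying on it):
diffs = np.diff(rates[273:365])                 # the Q4 2023 daily changes, as in the box plot
q1_, q3_ = np.quantile(diffs, [0.25, 0.75])
iqr = q3_ - q1_
is_outlier = (diffs < q1_ - 1.5 * iqr) | (diffs > q3_ + 1.5 * iqr)   # vectorised comparisons
print(f"There are {np.sum(is_outlier)} outliers.")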
3 Artefacts
The solution to the task must be included in a single Jupyter/IPython notebook (an .ipynb file) running
against a Python 3 kernel. The use of G**gle Colab is discouraged. Nothing beats a locally-installed
version where you have full control over the environment. Do not become dependent on third-party
middlemen/distributors. Choose freedom instead.
Make sure that your notebook has a readable structure; in particular, that it is divided into sections. Use
rich Markdown formatting (text in dedicated Markdown chunks – not just Python comments).
Do not include the questions/tasks from the task specification. Your notebook should read nicely and
smoothly – like a report from a data analysis that you designed yourself. Make the text flow naturally (e.g.,
First, let us load the data on… Then, let us determine… etc.). Imagine it is a piece of work that you
would like to show to your manager or clients — you certainly want to make a good impression. Check
your spelling and grammar. Also, use formal language.
At the start of the notebook, you need to provide: the title of the report (e.g., Task 42: How Much I Love
This Unit), your name, student number and email address.
Then, add 1–2 introductory paragraphs (an introduction/abstract – what the task is about).
Before each nontrivial code chunk, briefly explain what its purpose is. After each code chunk,
summarise and discuss the obtained results (in a few sentences).
Conclude the report with 1–2 paragraphs (summary/discussion/possible extensions of the analysis etc.).
Limitations of the ipynb-to-pdf renderer:
Ensure that your report, as seen in Olympus, is aesthetically pleasing. The ipynb-to-pdf renderer is imperfect. We work
with what we have. Here are the most common Markdown-related errors.
• Do not include any externally loaded images (via the ![alt text](URL) Markdown command), for they
lead to upload errors.
• Do not input HTML code in Markdown.
• Make sure you leave one blank line before and after each paragraph and bullet list. Do not use
backslashes at the end of the line.
• Currently, LaTeX formulae and Markdown tables are also not recognised; however, they do not cause errors.
Checklist:
1. Header, introduction, conclusion (Markdown chunks).
2. Text divided into sections, all major code chunks commented and discussed in your own words
(Markdown chunks).
3. Every subtask addressed/solved. In particular, all reference results that are part of the task
specification have been reproduced (plots, computed aggregates, etc.).
4. The report is readable and neat. In particular:
• all code lines are visible in their entirety (they are not too long),
• code chunks use consecutive numbering (select Kernel - Restart and Run All from the Jupyter
menu),
• rich Markdown formatting is used (# Section Title, * bullet list, 1. enumerated
list, | table |, *italic*, etc.),
• the printing of unnecessary/intermediate objects is minimised (focus on reporting the results
specifically requested in the task specification).
Submissions which do not fully (100%) conform to the task specification on the cut-off date will be
marked as FAIL.
Good luck!