0% found this document useful (0 votes)
16 views7 pages

Exercises 4

The document outlines exercises for studying and interpreting various datasets. It provides instructions for 4 core exercises analyzing bakery sales, US honey production, unemployment rates and YouTube channels. It also includes 3 optional advanced time series exercises and an extra exercise analyzing employee satisfaction survey data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
16 views7 pages

Exercises 4

The document outlines exercises for studying and interpreting various datasets. It provides instructions for 4 core exercises analyzing bakery sales, US honey production, unemployment rates and YouTube channels. It also includes 3 optional advanced time series exercises and an extra exercise analyzing employee satisfaction survey data.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 7

Exercise set 4 – studying and interpreting datasets

Place all your exercises in your Git-project including the data files!

For these exercises, download the zip-file containing all Exercise Set 4 –files
in Moodle.

Note: Remember that not all datasets provide clear patterns, phenomena or
correlations. Sometimes data is just too varied to find any clear indications!

1. Dataset: Bakery sales

Needed files in this exercise:


- bakery.csv
- Exercise4_1_bakery.ipynb

Study the Jupyter notebook and follow the instructions. All answers
need to be provided into the markdown cell under the questions.

You are allowed to code more plots and pandas sections if you want,
these usually grant extra points if they provide more insight into the
data.
2. Dataset: US Honey production

Needed files in this exercise:


- US_honey_dataset.csv
- Exercise4_2_honey.ipynb

Study the Jupyter notebook and follow the instructions. All answers need
to be provided into the markdown cell under the questions.

You are allowed to code more plots and pandas sections if you want, these
usually grant extra points if they provide more insight into the data.
3. Time series datasets

a. Time series dataset 1: Unemployment rate in USA

Needed files in this exercise:


- USUnemployement.csv
- Exercise4_3a_unemployment.ipynb

Study the Jupyter notebook and follow the instructions. All


answers need to be provided into the markdown cell under the
questions.

You are allowed to code more plots and pandas sections if you
want, these usually grant extra points if they provide more insight
into the data.
b. Time series dataset 2: Storms and hurricanes

Needed files in this exercise:


- storms.csv
- Exercise4_3b_storms.ipynb

Study the Jupyter notebook and follow the instructions. All


answers need to be provided into the markdown cell under the
questions.

You are allowed to code more plots and pandas sections if you
want, these usually grant extra points if they provide more insight
into the data.
c. Time series dataset 3: Wind power generation and coal
consumption

Needed files in this exercise:


- coalconsumption.csv
- nuclear.csv
- Exercise4_3c_energy.ipynb

Study the Jupyter notebook and follow the instructions. All


answers need to be provided into the markdown cell under the
questions.

You are allowed to code more plots and pandas sections if you
want, these usually grant extra points if they provide more insight
into the data.
4. Dataset: Popular YouTube -channels

Needed files in this exercise:


- youtube.csv
- Exercise4_4_youtube.ipynb

Study the Jupyter notebook and follow the instructions. All answers
need to be provided into the markdown cell under the questions.

You are allowed to code more plots and pandas sections if you want,
these usually grant extra points if they provide more insight into the
data.
Advanced extra exercises!

Note: These datasets can be a bit messy compared to the previous exercises.

1. Dataset: Employee satisfaction

Needed files in this exercise:


- HR_Employee_Data.xslx
- Exercise4_advanced_1_employeesatisfaction.ipynb

Study the Jupyter notebook and follow the instructions. All answers
need to be provided into the markdown cell under the questions.

You are allowed to code more plots and pandas sections if you want,
these usually grant extra points if they provide more insight into the
data.

You might also like