Lab - Basic Data Analysis
Lab - Basic Data Analysis
Objectives
Use very simple methods to describe existing data, fill-in missing data values and to make simple predictions.
Part 1: Learn how to Use Data as Information
Part 2: Plot data and predict values
Background/Scenario
Data is meaningless in and of itself. Information is meaningful and useful. Data only becomes information
when it used in context to answer specific questions. In this lab, you will use graphs of existing data to create
missing values and to predict values based on trends.
Required Resources
PC or mobile device with Internet access
Browser capable of playing a video from the Internet.
Audio capability to listen to video narration.
Data analysis can occur in many different ways. The ultimate goal is to discover something in the data that
gives insight into what has happened or to predict what may happen in the future. Descriptive statistics
summarizes what happened and provides the data in a numeric or graphical way. Predictive analytics
answers the question of what may happen in the future based on past data.
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 1 of 6
Lab – Explore Sources of Open Data
Predicting with Linear Models video provided by Khan Academy. Pause the video and complete the
activities along with the video instructor. Use the plot you created.
Watch the entire video. You will only work with the first dataset that the instructor discusses in this lab. In the
video, the instructor demonstrates how to use data points as information to create new estimated data points.
interpolation
____________________________________________________________________________
It refers to determining what occurred between two data points.
extrapolation
____________________________________________________________________________
It indicates that the last data points were and that trend would look like, and that trend would
continue, and that something may happen if that trend continued.
____________________________________________________________________________
What are two interesting observations that the instructor in the video makes regarding the trends in the
median age of marriage and the ages of the males and females who marry?
it is gotten smaller and smaller over time.
____________________________________________________________________________________
____________________________________________________________________________________
Woman Men
Missing Year
hours Hours
1970
1980
1990
c. Extrapolate values for the year 2020 by creating a line that best summarizes the values for the previous
five periods.
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 2 of 6
Lab – Explore Sources of Open Data
____________________________________________________________________________________
d. Another kind of information that can be derived from this data is about the gap between the number of
hours of housework for men versus the number of hours of housework for women. This will display
another trend regarding the equality between men and woman over this period. Complete the table below
by filling it in with the amount of time that women do house work subtracted from the amount of time that
men do housework.
Men Women
Woman housework
Date housework hours - Men
hours/week
hours/week hours
In the IoT, Big Data comes from many sources. Sometimes values are missing because a sensor temporarily
lost connectivity or data points were lost in transmission. Interpolation can serve as one strategy for replacing
missing data. Extrapolation is used to predict values for events that have not yet occurred. Because the IoT
yields so much data, predictive analytic models can be built that reliably see into the future by extrapolating
trends from historical data.
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 3 of 6
Worksheet 1
25
20
Median Age
15
10
0
1890 1910 1930 1950 1970 1990 2010
Year
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 4 of 6
Lab – Explore Sources of Open Data
Worksheet 2
35
30
25
20
15
10
0
1960 1970 1980 1990 2000 2010 2020
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 5 of 6
Lab – Explore Sources of Open Data
Worksheet 3
25
27.5
20 17.6
10.5
8.7
15 8.6
9.9
7.4
10
8
0
1960 1970 1980 1990 2000 2010 2020
© 2022 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page 6 of 6