Data 23 24 Resources

The document consists of various activity guides for data analysis, including creating bar charts, histograms, and scatter plots using different datasets. It emphasizes filtering and cleaning data, exploring relationships between variables, and visualizing findings to derive insights. Additionally, it includes a project component where students select a dataset, create visualizations, and analyze potential biases in their interpretations.

Uploaded by

henryszuster

Data ('23-'24)

Exploring One Column


Resources
Unit 5 Lesson 2
Name(s)_______________________________________________ Period ______ Date ___________________

Activity Guide - Exploring One Column

Bar Charts

Do This:
● Make a bar chart for each column in the "Dogs" dataset and categorize it as either "useful" or "not useful" by
placing that column name in the table below.

Useful Not Useful

Chart
Paste one of the charts that you believe is "useful" in the space below.

Questions

1. What is one piece of information someone could learn from the chart you pasted?

2. Do you notice any patterns in which charts are or are not useful?

Computer Science Principles 1



Histograms

Do This:
● Make a histogram for the "Max Weight" column of the "Dogs" dataset and choose the "Bucket Size" that you
think is most helpful for interpreting the data.

Chart

Questions

3. What bucket size did you choose? Why do you think this is the most helpful bucket size for this chart?

4. What is the most common range of maximum weights for dog breeds?

5. What is the least common range of maximum weights for dog breeds?

Do This:
● Make a histogram for one other column in the Dogs dataset and choose the "Bucket Size" that you think is
most helpful for interpreting the data.

Chart

Questions

6. What bucket size did you choose and why?

7. What is one question the chart you created could help answer?
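
As a rough illustration of what "Bucket Size" controls in the histogram activities above, the sketch below groups numbers into equal-width buckets and prints a text histogram for a few bucket sizes. It is written in Python rather than the Data Visualizer, the function name bucket_counts is hypothetical, and the weights are made up for illustration (they are not taken from the Dogs dataset).

```python
from collections import Counter


def bucket_counts(values, bucket_size):
    """Count how many values fall in each bucket [start, start + bucket_size)."""
    counts = Counter((v // bucket_size) * bucket_size for v in values)
    return dict(sorted(counts.items()))


# Made-up maximum weights in pounds, NOT taken from the Dogs dataset.
sample_weights = [6, 12, 18, 25, 27, 30, 44, 55, 70, 71, 75, 110]

# Print one text histogram per bucket size so the shapes can be compared.
for size in (10, 25, 50):
    print(f"bucket size {size}:")
    for start, count in bucket_counts(sample_weights, size).items():
        print(f"  {start}-{start + size}: {'#' * count}")
```

Running it with several sizes shows the trade-off: small buckets keep more detail but can look noisy, while large buckets smooth the shape but can hide differences.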



Data ('23-'24)
Filtering and Cleaning Data
Resources
Unit 5 Lesson 3
Name(s)_______________________________________________ Period ______ Date ___________________

Activity Guide - Filtering Data

Data Set: Female State Legislators

Do This:
● Filter the Female State Legislators dataset for Year by the year you were born.
● Create a histogram for Percentage of Females in Legislature. Choose a bucket size of 5.
● Copy/paste the chart below.

Chart 1

● Filter the Female State Legislators dataset for Year by the year 2019.
● Copy/paste the chart below.
● Answer the questions.

Chart 2

Questions

1. In the year you were born, how many states had between 25 and 30 percent of their legislatures made up of
female legislators?

2. In 2019, how many states had between 25 and 30 percent of their legislatures made up of female legislators?

3. What does this data show?

4. Why might that be the case?
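
The filter-then-count steps above can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the rows, state names, column names ("Year", "Percent"), and helper names (filter_rows, count_in_range) are all hypothetical and do not come from the Female State Legislators dataset.

```python
# Made-up rows for illustration only; NOT real data from the dataset.
rows = [
    {"State": "A", "Year": 2005, "Percent": 18.0},
    {"State": "B", "Year": 2005, "Percent": 26.5},
    {"State": "C", "Year": 2005, "Percent": 31.0},
    {"State": "A", "Year": 2019, "Percent": 27.0},
    {"State": "B", "Year": 2019, "Percent": 29.5},
    {"State": "C", "Year": 2019, "Percent": 41.0},
]


def filter_rows(rows, column, value):
    """Keep only the rows whose `column` equals `value`."""
    return [row for row in rows if row[column] == value]


def count_in_range(rows, column, low, high):
    """Count rows whose `column` value is at least `low` and below `high`."""
    return sum(1 for row in rows if low <= row[column] < high)


# Compare the same 25-30 percent bucket for two different years.
for year in (2005, 2019):
    filtered = filter_rows(rows, "Year", year)
    print(year, count_in_range(filtered, "Percent", 25, 30))
```

Filtering first and counting second mirrors the order of the activity: the filter narrows the table to one year, and the count corresponds to reading one bar of the histogram.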


Data Set: US Women Running for Elected Office in 2020

Do This:
● Filter the US Women Running for Elected Office in 2020 dataset to answer the following questions. The chart
type is up to you. Paste the chart below each answer.

Question: Which states had at least one woman run for Governor?

Answer:

Chart:

Question: Which states had more than 10 women win their primary race?

Answer:

Chart:



Data ('23-'24)
Exploring Two Columns
Resources
Unit 5 Lesson 4
Name(s)_______________________________________________ Period ______ Date ___________________

Activity Guide - Exploring Two Columns

Cross Tab

1. Crosstab - Use the “Words” dataset


Use: Length & Part of Speech
a. Chart - Copy and paste the chart you created in the space below

b. Which part of speech shows up most often in this dataset?

c. Which part of speech seems to be longest, on average?

d. Which part of speech seems to be shortest, on average?

e. About how long is the "typical" noun? How can you tell?

2. Crosstab - Use the “Favorite Classes” dataset


Use: Grade & Favorite Class
a. Chart - Copy and paste the chart you created in the space below

b. Which class do Seniors like the most?

c. Which grade likes History the most?

d. What is one other interesting pattern you can see in this diagram?
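
For readers curious how a crosstab is computed, the sketch below counts how often each pair of values from two columns appears together. It assumes Python instead of the Data Visualizer; the function name crosstab is hypothetical and the survey rows are made up, not taken from the Favorite Classes dataset.

```python
from collections import Counter


def crosstab(rows, row_key, col_key):
    """Count how many rows share each (row_key value, col_key value) pair."""
    return Counter((row[row_key], row[col_key]) for row in rows)


# Made-up survey answers for illustration only.
survey = [
    {"Grade": "Freshman", "Favorite Class": "Math"},
    {"Grade": "Freshman", "Favorite Class": "Art"},
    {"Grade": "Senior", "Favorite Class": "History"},
    {"Grade": "Senior", "Favorite Class": "History"},
    {"Grade": "Senior", "Favorite Class": "Math"},
]

table = crosstab(survey, "Grade", "Favorite Class")
grades = sorted({grade for grade, _ in table})
classes = sorted({name for _, name in table})

# Print the counts as a grid: one row per grade, one column per class.
print("Grade".ljust(10) + "".join(name.ljust(10) for name in classes))
for grade in grades:
    cells = "".join(str(table[(grade, name)]).ljust(10) for name in classes)
    print(grade.ljust(10) + cells)
```

Each cell of the printed grid is a count, which is the number the crosstab chart encodes as color or size.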


Scatter

1. Scatter - Use the “US States” dataset


Use: Median Household Income & Percent Adult College Graduates
a. Chart - Copy and paste the chart you created in the space below

b. What is the range of incomes on this chart?

c. What is the range of percentages of adult college graduates?

d. Do you see a relationship between income and percent of adult college graduates? What does the
relationship look like?

2. Scatter - Use the “US States” dataset


Use: You pick the columns to compare
In the US States table, find a pair of columns whose scatter plot you think shows an interesting pattern.

a. Chart - Copy and paste the chart you created in the space below

b. What is the pattern that you observe in this data?

c. Describe what this pattern might mean or indicate.

d. Usually we can't be sure whether the pattern we observe in a scatter chart (a correlation) means that
one of the columns is causing the other. Give one way you might do more research to try to investigate
the cause of the pattern you found.
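
One common way to put a number on the pattern seen in a scatter chart is Pearson's correlation coefficient, which ranges from -1 to 1. The sketch below is an illustration only: the activity itself does not ask for this calculation, the function name pearson_r is hypothetical, and the income and graduation figures are made up rather than taken from the US States dataset.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length lists of numbers.

    Raises ZeroDivisionError if either list has no variation.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / sqrt(spread_x * spread_y)


# Made-up values for illustration only.
incomes_thousands = [45, 52, 60, 68, 75]
percent_graduates = [22, 25, 31, 30, 38]

print(round(pearson_r(incomes_thousands, percent_graduates), 2))
```

A value near 1 or -1 means the points fall close to a straight line, but as question d points out, even a strong correlation does not show that one column causes the other.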



Data ('23-'24)
Big, Open, and Crowdsourced Data
Resources
Unit 5 Lesson 5
Name(s)_______________________________________________ Period ______ Date ___________________

Activity Guide - Big, Open, and Crowdsourced Data


Background
Today you will research one of three topics at the intersection of data analysis, computing, and society. These topics all
use the data analysis process in new and interesting ways to address problems.

Topic 1: Big Data
● Links:
○ AI and Parallel Systems
https://www.youtube.com/watch?v=1XGo8K1boH4
○ Big Data and Medicine Code.org
https://www.youtube.com/watch?v=bMrDHtGHFR4
● Key Vocabulary to Look For: Scalability, Parallel systems

Topic 2: Crowdsourced Data
● Links:
○ How Pokemon Inspired A Citizen Science Project...
https://www.npr.org/2018/04/20/597972310
○ What is Citizen Science?
https://youtu.be/81hhecI0p5k
● Key Vocabulary to Look For: Citizen science, Crowdsource

Topic 3: Open Data
● Links:
○ What is Open Data?
https://www.youtube.com/watch?v=qSD9ob8rGcs
○ The Case for Open Data
https://www.youtube.com/watch?v=iOrPK7p2AwI
● Key Vocabulary to Look For: Open data, Open access

What topic did you choose? ________________________________________________

As you watch the videos, keep notes on examples of how they complete each step of the data analysis process.

Collect or Choose Data:

Clean and/or Filter Data:

Visualize and Find Patterns:

New Information:

What is the core idea of your topic? What is it about?

Give two examples of the problems / questions your topic is being used to solve / answer.



Data ('23-'24)
Project - Tell a Data Story
Resources
Unit 5 Lesson 8
Name(s)_______________________________________________ Period ______ Date ___________________

Project Guide - Tell a Data Story


Background
Computing tools allow people from all different fields to learn from data. By transforming and visualizing data we can
find patterns that help us learn about our world.

For this project you will:

● Select a dataset from the Data Library
● Use the Data Visualizer to make a visualization that helps us learn something interesting about the data
● Write a response that explains:
○ what your visualizations show
○ new insights or decisions that can be made based on this visualization
○ possible problems with your analysis including potential bias in the data or your interpretation

Tell a Data Story

Data Analysis Process    Written Response

Select a dataset from the Data Library. Read the metadata to understand what information is
available in the table.

Dataset Name:

Short Description:

Did you filter or clean the data? Why or why not?

Create a visualization. Copy/paste the chart below.


What information does your visualization show:

What new insights or decisions can be made based on this visualization:

What are the possible problems with your analysis including potential bias in the dataset
or your interpretation:


Rubric

Each category is scored as Extensive Evidence, Convincing Evidence, Limited Evidence, or No Evidence.

Collect or Choose Data
● Extensive Evidence: Dataset is correctly identified and description is complete.
● Convincing Evidence: Dataset is correctly identified and description is mostly complete.
● Limited Evidence: Dataset is correctly identified and description is somewhat complete.
● No Evidence: Dataset is not identified or description is missing.

Clean/Filter Data
● Extensive Evidence: Explanation for cleaning and/or filtering is complete and reasonable.
● Convincing Evidence: Explanation for cleaning and/or filtering is complete and mostly reasonable.
● Limited Evidence: Explanation for cleaning and/or filtering is somewhat complete or somewhat reasonable.
● No Evidence: Explanation for cleaning and/or filtering is incomplete.

Visualize and Find Patterns
● Extensive Evidence: Visualization is readable and includes a title and citation.
● Convincing Evidence: Visualization is mostly readable and includes a title and citation.
● Limited Evidence: Visualization is somewhat readable and/or is missing a title or citation.
● No Evidence: Visualization is unreadable or missing.

New Information: Interpreting the visualization
● Extensive Evidence: The visualization is thoroughly explained.
● Convincing Evidence: The visualization is mostly explained.
● Limited Evidence: The visualization is somewhat explained.
● No Evidence: The visualization is not explained.

New Information: Insights or Decisions
● Extensive Evidence: Insights or decisions are reasonable and effectively linked to information displayed in the visualization.
● Convincing Evidence: Insights or decisions are mostly reasonable and effectively linked to information displayed in the visualization.
● Limited Evidence: Insights or decisions are mostly reasonable and somewhat effectively linked to information displayed in the visualization.
● No Evidence: Insights or decisions are missing.

New Information: Bias
● Extensive Evidence: Possible problems with analysis or potential bias are reasonable and thoughtfully addressed.
● Convincing Evidence: Possible problems with analysis or potential bias are mostly reasonable and thoughtfully addressed.
● Limited Evidence: Possible problems with analysis or potential bias are somewhat reasonable and addressed.
● No Evidence: Possible problems with analysis or potential bias are not addressed.
