0% found this document useful (0 votes)
2 views

Python

The document is a workbook focused on identifying different types of data and understanding problem statements within datasets. It includes exercises based on a solar system dataset and the Titanic dataset, guiding users to analyze and answer questions related to data types, features, and objectives. Prerequisite videos are recommended before attempting the exercises.

Uploaded by

vedhashreer3
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

Python

The document is a workbook focused on identifying different types of data and understanding problem statements within datasets. It includes exercises based on a solar system dataset and the Titanic dataset, guiding users to analyze and answer questions related to data types, features, and objectives. Prerequisite videos are recommended before attempting the exercises.

Uploaded by

vedhashreer3
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

Types of Data

Data Types & Features


Workbook
Index
1 Objectives ........................................................................................... 3
2 Overview ............................................................................................. 4
3 Prerequisite ......................................................................................... 5
4 Exercise .............................................................................................. 6
4.1 Problem 1 ...................................................................................................... 6
4.1.1 Scenario ............................................................................................. 6
4.2 Problem 2 ...................................................................................................... 7
4.2.1 Real Data Set ..................................................................................... 7
1 Objectives
After completing the assignment, you should be able to:
– Identify different types of data in a dataset
– Identify the objective of a problem statement in a dataset
2 Overview
This document has two exercises that has scenario-based questions on types of data.
Attempt both the exercises and answer the questions that follow. After completing the
same, work out the quiz-based exercises based on the topics.

Note: The links provided in the document navigate to the respective web sources and
open in the same window. The users can use the back arrow in the browser, which
navigates back to the beginning of the document.
3 Prerequisite
Please complete the following videos before going through the workbook.

1. Types of Data
2. Objective of Data Analysis and Feature Selection
4 Exercise
4.1 Problem 1

Answer the questions given below on the following dataset by applying your
knowledge on types of data.

4.1.1 Scenario
Given below is a solar system dataset, which gives information about the various
planets present in the solar system.

4.1.1.1 Study the table given below and answer the questions that follow:

Planet Distance from the Has global Mass No.of


name sun (106 km) magnetic field? (1024 kg) moons

Mercury 57.9 Yes 0.330 0

Venus 108.2 No 4.87 0

Earth 149.6 Yes 5.97 1

227.9
Mars No 0.642 2

778.6
Jupiter Yes 1898 79

1433.5
Saturn Yes 568 82

2872.5
Uranus Yes 86.8 27

4495.1
Neptune Yes 102 14

Table 1: Solar system data

1 Name the columns which have numeric data type from the given table?
2 Which column is continuous in nature?
3 Which of the given columns is discrete in nature?
4 Which column has a categorical data type?
5 How many categories do you see in the categorical column of the table? Is
the categorical feature ordinal or nominal in nature?

4.2 Problem 2
Answer the questions to understand data analysis and identify the objective of problem
statement from the following dataset.

4.2.1 Real Data Set


Read the instructions given below and answer the questions that follow.
On April 15, 1912, on her first voyage, the majestic ship named ‘Titanic’ sank after
hitting an iceberg, resulting in the death of thousands of people on-board.
While the factors for survival were possibly based on luck, yet there were some people
who had a greater survival rate as compared to others, such as upper class people,
women and children.
This dataset can be used to find out what determined the survival of a passenger. It
contains information about the passenger and whether the person survived or not?
Have a look at the first 10 rows of the dataset. (Optional: If you wish to see the
complete dataset, click here.)

Figure 1: Real dataset


4.2.1.1 Answer the questions that follow by referring various column descriptions:

Variable Definition Key

survival Whether a person survived? 0 = No, 1 = Yes


pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd
sex Sex
Age Age in years
sibsp # of siblings / spouses aboard the Titanic
parch # of parents / children aboard the Titanic
ticket Ticket number
fare Passenger fare
cabin Cabin number
embarked Port of Embarkation C = Cherbourg, Q = Queenstown, S =
Southampton
Table 2: Titanic dataset

The problem is to determine whether the survival rate depends on passenger


attributes such as age, sex or ticket class. After analysing the data by a team of data
scientists, it was found that the survival rate indeed depends on the passenger
attributes such as pclass, sex and age. Based on this result, answer the following
questions:

• What is the dependent feature in the dataset?


• What is the data type of the dependent feature?
• Which of the columns is ordinal in nature?
• Which of the columns is nominal in nature?
• If you are given data about a new passenger (not present in the dataset), can
you predict whether the person survived or not based on the relationship
between passenger attributes and survival? What would be the objective of the
problem in this case? Choose the correct option.
o Determining a continuous value
o Determining a categorical value

You have completed Data Types & Features workbook in Types of Data.
Copyright © 2021 Accenture
All rights reserved.
Accenture and its logo are trademarks of Accenture.

You might also like