0% found this document useful (0 votes)
372 views4 pages

Titanic Data Analysis-Report

The document analyzes the Titanic dataset containing information on 891 passengers from the Titanic shipwreck. It aims to understand passenger demographics and identify factors that affected survival rates by exploring age, gender, class, fare and other attributes. Specifically, it will analyze trends like class and gender differences in survival, average fares and ages, and embarkation ports of wealthier passengers. The objective is to gain insights through data wrangling, exploration and effective communication of conclusions.

Uploaded by

Ghamdan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
372 views4 pages

Titanic Data Analysis-Report

The document analyzes the Titanic dataset containing information on 891 passengers from the Titanic shipwreck. It aims to understand passenger demographics and identify factors that affected survival rates by exploring age, gender, class, fare and other attributes. Specifically, it will analyze trends like class and gender differences in survival, average fares and ages, and embarkation ports of wealthier passengers. The objective is to gain insights through data wrangling, exploration and effective communication of conclusions.

Uploaded by

Ghamdan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Titanic Data Analysis

Introduction

In the early morning hours of 15 April 1912, the Titanic ship drowns
by bumping with a Iceberg. There were an estimated 2,224
passengers and crew aboard the ship, and there were lifeboats for
only 1178 people. Therefore more than 1,500 died.

We have data of 891 people of titanic who drowned or survived


during mishap. So when there is name of some ‘Data’ there is a lot
interesting for ‘Data Scientists’. I have explored dataset and found a
lot interesting facts about Titanic.
Objective
The aim of this project is to analyze the Kaggle Titanic dataset, which
includes the following steps:

1. Defining the list of questions to be answered with data


analysis.
2. Data wrangling (including its cleansing and adding additional
fields) to allow for more convenient consumption of the data
and making analysis possible.
3. Data exploration and effective communication of the
exploratory steps.
4. Summarizing conclusions.

Defining the list of questions to be answered with


data analysis
There are many ways to explore the data and dozens of potential
questions to ask. After some brainstorming, I have come up with the
following:

1. What was the distribution of age across the passengers? Was it


different between men and women?
2. What was the distribution of ticket fares? What were the
average and the most expensive tickets? How much would this
average and the most expensive tickets worth now?
3. How many passengers embarked in different ports? Can we
detect the port where the richest passengers embarked? How
much would the median price for this subset of passengers
worth now?
4. How many passengers by gender were there on the ship? What
was the survival rate for men, women and children? Was sex
correlated to the survival rate?
5. What was the distribution of passengers among different
classes? What was the survival rate among different ticket
classes? Was ticket class correlated to the survival rate?
Dataset

First we have a dataset in which data of 891 people. Out data set
have 12 columns representing features. I will show you first ten rows
of dataset for just overview that how our data look like.

This is our dataset which have following features:

1. PassengerId: Id of every passenger.

2. Survived: This feature have value 0 and 1. 0 for not


survived and 1 for survived.

3. Pclass: There are 3 classes of passengers. Class1, Class2


and Class3.

4. Name: Name of passenger.
5. Sex: Gender of passenger.

6. Age: Age of passenger.

7. SibSp: Indication that passenger have siblings and


spouse.

8. Parch: Whether a passenger is alone or have family.

9. Ticket: Ticket no of passenger.

10. Fare: Indicating the fare.

11. Cabin: The cabin of passenger.

12. Embarked: The embarked category.

13. Initial: Initial name of passenger.

You might also like