0% found this document useful (0 votes)

430 views15 pages

Business Analytics Presentation: Titanic Survival Analysis and Prediction

The document is a presentation on analyzing the Titanic dataset using various machine learning algorithms like decision trees and KNN. It first describes exploratory data analysis including loading data, checking for missing values, data visualization and feature selection. Then it discusses building a decision tree model with train-test split and calculating accuracy. It also covers implementing K-fold cross validation with decision trees. Finally, it explains the working of KNN algorithm and shows the steps to find optimal K and predict survival values on test data.

Uploaded by

Rumani Chakraborty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

430 views15 pages

Business Analytics Presentation: Titanic Survival Analysis and Prediction

Uploaded by

Rumani Chakraborty

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

Business Analytics

Presentation
Titanic Survival Analysis and
Prediction

PREPARED BY - DIVYANSH SINGH - 20BM63030

PRANAV KUMAR - 20BM63064
RUMANI CHAKRABORTY - 20BM63076
SHUMI MITRA - 20BM63086
TALATI SAURABH RASHMIKANT - 20BM63096

TEAM NUMBER - 04
TEAM NAME - MAVERICKS
1
Exploratory Data
Analysis
An approach to analyzing data sets to summarize
their main characteristics, often with visual
methods.

2
Process Adopted
1. Loading &
getting detailed 2. Checking for
3. Visualization
statistics of the missing data
dataset

5. Critically
4. Filling the 6. Appending the
analyzing the
missing data modified fields
essential data

7. Generating the
final table

3
Initial table after loading the data set

Upon checking for missing data

4
• Since there is quite a lot of data in the dataset to be gone
through, visualizing will be a better tool for analysis
• 20% of data in Age column contains null, while too many values in
Cabin column are null
• Visualising and filling necessary data in the Age column
• Dropping the Cabin Column

5
Visualization
6
Final Table Obtained Upon Correlating

7
Decision Tree
[ with train-test break up]
Decision tree is the most powerful and popular tool for
classification and prediction.

8
Understanding Decisi
on Tree
1. LOADING THE LIBRARIES

2. FEATURE & TARGET SELECTION

3. SPLITTING THE DATASET INTO TRAINING SET AND TEST SET

4. CREATING & TRAINING DECISION TREE CLASSIFIER OBJECT

Flowchart like tree structure, where
5. PREDICTING THE RESPONSE FOR TEST DATASET 1. Each internal node denotes a
feature/ attribute
6. PRINTING THE ACCURACY
2. Each branch represents the decision rule
7. VISUALIZING THE DECISION TREE 3. Each leaf node (terminal node) represents
the outcome.

9
K Fold
Cross Partition • Partition the dataset into k equal sized partitions

Validation Select • Select 1 partition as the validation data

Use • Use the remaining k-1 as training data

Train • Train the model and determine the accuracy

Repeat • Repeat the process k times, selecting a different

partition each time

Average • Average the accuracy results

10
Result
Accuracy: 0.7574626865671642
11
KNN Implementation
KNN is a non-parametric and lazy learning algorithm

12
Understanding KNN
• K is the number of nearest neighbors. The number of neighbors is
the core deciding factor
• KNN has the following basic steps:
 Calculate distance
 Find closest neighbors
 Vote for labels

1 2 3 4 5 6
Finding accuracy for Plotting accuracy loading & displaying Copying the test dataset Predicting survival values Displaying the final
selected number of corresponding to value of the test dataset & analysing it to get the submission
neighbours K in KNN relevant columns

13
Results Obtained

14
THANK
YOU !!!

Nav Technical Interview Questions
20% (5)
Nav Technical Interview Questions
45 pages
Jorghi Inzaghi Tanson
100% (1)
Jorghi Inzaghi Tanson
12 pages
Chapter 9. Database Design
100% (1)
Chapter 9. Database Design
52 pages
Database Foundatins Exam
No ratings yet
Database Foundatins Exam
14 pages
Ivy Tech Community College: DBMS110 - M04 Lab Assignment (1 Question 40 Points Total)
No ratings yet
Ivy Tech Community College: DBMS110 - M04 Lab Assignment (1 Question 40 Points Total)
2 pages
Seminar PPT
80% (5)
Seminar PPT
18 pages
LFCE Exam v3.18 Revised Jan 2019
100% (1)
LFCE Exam v3.18 Revised Jan 2019
269 pages
Final Exam 23okt2020
No ratings yet
Final Exam 23okt2020
16 pages
Answer Key For Ilearning Oracle Academia
0% (4)
Answer Key For Ilearning Oracle Academia
4 pages
Midterm Oracle
No ratings yet
Midterm Oracle
12 pages
Review Your Answers
No ratings yet
Review Your Answers
53 pages
Dfo Section 2 Quiz
No ratings yet
Dfo Section 2 Quiz
22 pages
An Oracle Project ON: Airport Management System
No ratings yet
An Oracle Project ON: Airport Management System
12 pages
Dokumen - Tips Quiz-S8
No ratings yet
Dokumen - Tips Quiz-S8
12 pages
Mid Term Sem I 2016
No ratings yet
Mid Term Sem I 2016
13 pages
Dfo Section 1
No ratings yet
Dfo Section 1
3 pages
Final Exam Semester 1
100% (2)
Final Exam Semester 1
14 pages
Assesment Solution-Mscise Adbms
No ratings yet
Assesment Solution-Mscise Adbms
2 pages
Final Exam Hadi Nossair Data Base Seccion 4
No ratings yet
Final Exam Hadi Nossair Data Base Seccion 4
15 pages
Section 4 Dan 5
No ratings yet
Section 4 Dan 5
4 pages
Section
No ratings yet
Section
4 pages
Soal SBD
No ratings yet
Soal SBD
197 pages
Section 2 Quiz
No ratings yet
Section 2 Quiz
2 pages
Answers PDF
No ratings yet
Answers PDF
9 pages
Database Design 1-4: Major Transformations in Computing
No ratings yet
Database Design 1-4: Major Transformations in Computing
4 pages
Feature Extraction For Classifying Students Based On Their Academic Performance
No ratings yet
Feature Extraction For Classifying Students Based On Their Academic Performance
5 pages
Mid Term Exam Semester 2 Part 2
No ratings yet
Mid Term Exam Semester 2 Part 2
15 pages
Section 6 Quiz: 1st Normal Form. 2nd Normal Form. 3rd Normal Form. ( ) None of The Above, The Entity Is Fully Normalised
No ratings yet
Section 6 Quiz: 1st Normal Form. 2nd Normal Form. 3rd Normal Form. ( ) None of The Above, The Entity Is Fully Normalised
5 pages
Database Programming Section 4 Quiz
No ratings yet
Database Programming Section 4 Quiz
13 pages
Final Exam Oracle
No ratings yet
Final Exam Oracle
10 pages
Section 3 Quiz, Database Design ORACLE
No ratings yet
Section 3 Quiz, Database Design ORACLE
8 pages
MID Test Semester 1 Oracle Database Design Ujian Ke-2
No ratings yet
MID Test Semester 1 Oracle Database Design Ujian Ke-2
11 pages
Section 5
100% (1)
Section 5
17 pages
Relational Databases ASSIGNMENT - 2: Name: Kapila Ravichandran Student ID: 8918716 Section: 2
No ratings yet
Relational Databases ASSIGNMENT - 2: Name: Kapila Ravichandran Student ID: 8918716 Section: 2
3 pages
Database Programming Section 13 Quiz
No ratings yet
Database Programming Section 13 Quiz
13 pages
Peer-To-Peer File Sharing
No ratings yet
Peer-To-Peer File Sharing
6 pages
Final Exam Semester 1
No ratings yet
Final Exam Semester 1
21 pages
Oracle Semest 2 Part 1
No ratings yet
Oracle Semest 2 Part 1
20 pages
Heart Rate Animation With C
No ratings yet
Heart Rate Animation With C
7 pages
Oracle Final Exam Part I
No ratings yet
Oracle Final Exam Part I
22 pages
Quiz Session 6 Oracle
100% (1)
Quiz Session 6 Oracle
5 pages
Section 2 Quiz Database Design Oracle
No ratings yet
Section 2 Quiz Database Design Oracle
347 pages
Database Testbank
No ratings yet
Database Testbank
13 pages
Benny Hernanda Putra - If B 2020
No ratings yet
Benny Hernanda Putra - If B 2020
7 pages
Javadatabasse
No ratings yet
Javadatabasse
26 pages
Section 8 Quiz DD
No ratings yet
Section 8 Quiz DD
49 pages
Algebra Questions and Answers
0% (2)
Algebra Questions and Answers
6 pages
Database Programming Section 16 Quiz
No ratings yet
Database Programming Section 16 Quiz
11 pages
Final Exam Oracle
100% (1)
Final Exam Oracle
232 pages
Section 1: Test: Mid Term Exam Semester 1
No ratings yet
Section 1: Test: Mid Term Exam Semester 1
13 pages
Section 6 Quiz 1 l1 l4
No ratings yet
Section 6 Quiz 1 l1 l4
4 pages
Final
No ratings yet
Final
27 pages
Soal Oracle 1
No ratings yet
Soal Oracle 1
2 pages
Data Science Assignment 2
No ratings yet
Data Science Assignment 2
14 pages
2 - Preprocessing
No ratings yet
2 - Preprocessing
74 pages
Business Data Analytics Part 4
No ratings yet
Business Data Analytics Part 4
52 pages
PredictingTitanicSurvivorsusing by Applying Exploratory Data Anyltics and ML
No ratings yet
PredictingTitanicSurvivorsusing by Applying Exploratory Data Anyltics and ML
7 pages
Employee Performance Analysis
No ratings yet
Employee Performance Analysis
3 pages
Machine Learning
100% (2)
Machine Learning
30 pages
DA Caravan 6672064
No ratings yet
DA Caravan 6672064
26 pages
Decision Support
No ratings yet
Decision Support
21 pages
Gomez Jorge Project
No ratings yet
Gomez Jorge Project
9 pages
Final Research Paper
No ratings yet
Final Research Paper
3 pages
Genral Journal-Vogue Company
No ratings yet
Genral Journal-Vogue Company
7 pages
FAR Assignment 2
No ratings yet
FAR Assignment 2
7 pages
FAR Assignment 4
No ratings yet
FAR Assignment 4
10 pages
FAR Assignment 3 - Calculations
No ratings yet
FAR Assignment 3 - Calculations
5 pages
Assignment 3 - 17-09-2020
No ratings yet
Assignment 3 - 17-09-2020
144 pages
Beta Book Project: Estimate Firm and Industry Beta: BM6xxx2: Corporate Finance A. Chandra, 2021
No ratings yet
Beta Book Project: Estimate Firm and Industry Beta: BM6xxx2: Corporate Finance A. Chandra, 2021
1 page
Bharata's Rasa Sutra and The Theory of Rasa Dhvani: December 2016
100% (1)
Bharata's Rasa Sutra and The Theory of Rasa Dhvani: December 2016
11 pages
MIS Case Study - Digitalization at Siemens
0% (1)
MIS Case Study - Digitalization at Siemens
6 pages
MP 10 Block 04
No ratings yet
MP 10 Block 04
42 pages
Note 1873631 - Best Practices To Set SGA and PGA
No ratings yet
Note 1873631 - Best Practices To Set SGA and PGA
2 pages
App01 PDF
No ratings yet
App01 PDF
6 pages
Siemens Microwave Network
No ratings yet
Siemens Microwave Network
7 pages
New Text Document
No ratings yet
New Text Document
11 pages
Testing Tools and Measurements: (Any 4 Appropriate Limitations of Manual Testing - 4marks 1 Mark Each)
No ratings yet
Testing Tools and Measurements: (Any 4 Appropriate Limitations of Manual Testing - 4marks 1 Mark Each)
9 pages
Ad Hoc Testing
No ratings yet
Ad Hoc Testing
7 pages
Comp 2911 Cheat Sheet
No ratings yet
Comp 2911 Cheat Sheet
5 pages
Transport Management System
No ratings yet
Transport Management System
5 pages
BS-120&130&180&190&200&220 - Database Failure and Solutions - EN
No ratings yet
BS-120&130&180&190&200&220 - Database Failure and Solutions - EN
2 pages
Mayuri Sonawane: Objective
No ratings yet
Mayuri Sonawane: Objective
3 pages
Visvesvaraya Technological University: Hotel Management System
No ratings yet
Visvesvaraya Technological University: Hotel Management System
29 pages
Components of A Looker Purchase File en 16
No ratings yet
Components of A Looker Purchase File en 16
2 pages
Java J2EE Performance Tuning
No ratings yet
Java J2EE Performance Tuning
7 pages
Unit-3 Software: Need of Computer Software
No ratings yet
Unit-3 Software: Need of Computer Software
10 pages
User Access Permissions
No ratings yet
User Access Permissions
8 pages
MDVM Datasheet Aug 2023
No ratings yet
MDVM Datasheet Aug 2023
3 pages
How To Backup and Restore The Virtual I/O Server
No ratings yet
How To Backup and Restore The Virtual I/O Server
9 pages
Fall 2020 - Cloud Computing and Big Data - HW 1
No ratings yet
Fall 2020 - Cloud Computing and Big Data - HW 1
7 pages
SAP NetWeaver Developer Studio 7.30 Installation Guide
No ratings yet
SAP NetWeaver Developer Studio 7.30 Installation Guide
11 pages
Doubly Linked List in Python: Objective
No ratings yet
Doubly Linked List in Python: Objective
2 pages
IMS IA Exam Paper - JULY 2023 - Sample
100% (1)
IMS IA Exam Paper - JULY 2023 - Sample
7 pages
Notes Unit 5 OS
No ratings yet
Notes Unit 5 OS
16 pages
Digital Forensics - Getting Started With File Systems
No ratings yet
Digital Forensics - Getting Started With File Systems
38 pages
Mca16.4.2 Advanced Java & Web Technologies
No ratings yet
Mca16.4.2 Advanced Java & Web Technologies
13 pages
11 4 3 2+Lab+-+Disk+CLI+Commands
No ratings yet
11 4 3 2+Lab+-+Disk+CLI+Commands
8 pages
How AI Accelerates ML Development
No ratings yet
How AI Accelerates ML Development
6 pages
CS 360-Software Engineering-Hamid Abdul Basit
No ratings yet
CS 360-Software Engineering-Hamid Abdul Basit
4 pages

Business Analytics Presentation: Titanic Survival Analysis and Prediction

Uploaded by

Business Analytics Presentation: Titanic Survival Analysis and Prediction

Uploaded by

Business Analytics

PREPARED BY - DIVYANSH SINGH - 20BM63030

Upon checking for missing data

2. FEATURE & TARGET SELECTION

4. CREATING & TRAINING DECISION TREE CLASSIFIER OBJECT

Validation Select • Select 1 partition as the validation data

Use • Use the remaining k-1 as training data

Train • Train the model and determine the accuracy

Repeat • Repeat the process k times, selecting a different

Average • Average the accuracy results

You might also like