Business Analytics Presentation: Titanic Survival Analysis and Prediction
Business Analytics Presentation: Titanic Survival Analysis and Prediction
Presentation
Titanic Survival Analysis and
Prediction
TEAM NUMBER - 04
TEAM NAME - MAVERICKS
1
Exploratory Data
Analysis
An approach to analyzing data sets to summarize
their main characteristics, often with visual
methods.
2
Process Adopted
1. Loading &
getting detailed 2. Checking for
3. Visualization
statistics of the missing data
dataset
5. Critically
4. Filling the 6. Appending the
analyzing the
missing data modified fields
essential data
7. Generating the
final table
3
Initial table after loading the data set
5
Visualization
6
Final Table Obtained Upon Correlating
7
Decision Tree
[ with train-test break up]
Decision tree is the most powerful and popular tool for
classification and prediction.
8
Understanding Decisi
on Tree
1. LOADING THE LIBRARIES
3. SPLITTING THE DATASET INTO TRAINING SET AND TEST SET
9
K Fold
Cross Partition • Partition the dataset into k equal sized partitions
12
Understanding KNN
• K is the number of nearest neighbors. The number of neighbors is
the core deciding factor
• KNN has the following basic steps:
Calculate distance
Find closest neighbors
Vote for labels
1 2 3 4 5 6
Finding accuracy for Plotting accuracy loading & displaying Copying the test dataset Predicting survival values Displaying the final
selected number of corresponding to value of the test dataset & analysing it to get the submission
neighbours K in KNN relevant columns
13
Results Obtained
14
THANK
YOU !!!
15