Project Report
Project Report
objective
2.
4.Model Building
6.Data Visualizations
7.Deployment Frame
Exploratory Data Analysis (EDA) and
Feature Engineering
Data set details
4)The data set is having both the combination of categorical values and numeric values.So, we need
to convert categorical values to numeric
1)The above plot is the box plot for the numeric features in the data set. The main advantage of boxplot
is it shows outliers. An outlier is a data point that differs significantly from other observations.
Data visualization :
1)The above plot represents the scatter plot for numeric features in the data set. A pairplot plot a
pairwise relationships in a dataset. A “pairplot” is also known as a scatterplot, in which one variable in
the same data row is matched with another variable's value,.
Data set details
1)The above plot represents correlation plot for numeric features in the data set. A correlation matrix is
a table showing correlation coefficients between variables. Each cell in the table shows
the correlation between two variables. A correlation matrix is used to summarize data, as an input into
a more advanced analysis, and as a diagnostic for advanced analyses.
We have few features which are categorical so we have to convert them to numeric.
job','marital','education','default','housing','loan','contact','month','poutcome’ these are in categorical values
we need to convert some variable to dummy variable. Below are the count plots for categorical features
which we can get insights from it.
Data visualization :
Model Building
Model Building
1)It is simple to understand as it follows the same process which a human follow while
making any decision in real-life.
2)It can be very useful for solving decision-related problems.
3)It helps to think about all the possible outcomes for a problem.
4)There is less requirement of data cleaning compared to other algorithms.