Trainity Project-6
Trainity Project-6
Bank Loan
Case Study
Tushar Aseri | [email protected]
Descriptio
Contents Data Analytics Task
Insights
Summary
Description
As I am a Data Analyst at finance company, I need to draw meaningful insights from the dataset for
identifying factors behind loan default
Task Use Exploratory Data Analysis (EDA) to analyze patterns in the data and
ensure that capable applicants are not rejected.
Data Analysis
Identifying Missing Dat
Identifying Outlier
Analyze Data Imbalanc
Performing Variate Analysi
Identify Top Correlations
Data Analytics
Use various functions in excel to find blank cells in
Problem 1 Identifying Missing Data
data and impute them accordingly
Steps
name
Code-
Data Analytics
Detect and identify various outliers in the dataset
Problem 2 Identifying Outliers
using statistical functions
Steps
First I used TRANSPOSE to list all column name
Used QUARTILE.INC for finding quartile each for 25
and 75 and calculated IQ
Then I calculated lower & upper bound
Then I used Conditional formatting to identify
outliers and highlight them
Codes-
Data Analytics
Detect and calculate data imbalance ratio of
Problem 3 Analyzing Data Imbalance
loan application dataset
Codes-
Steps
First I used TRANSPOSE to list all column
name Total count
Then I used the formula for calculating
count of all zeros and ones for each of
the colum Class of 0
Then I calculated the ratio of 0 and
Finally used conditional formatting
Class of 1
Data Analytics
Analyze different variates using multiple
Problem 4 Perform Variate Analysis
excel functions
Codes-
Steps
First I used TRANSPOSE to list all column
name
Then by using each formula I calculated
values for each colum
Compared with the target variable and
created chart for it
Data Analytics
Identify the top correlations for each segment
Problem 5 Identifying Correlations
using excel functions
Steps
First I used TRANSPOSE to list all column name
Then I used filtered out 2 segments of target column i.e 1 and
Then I used CORREL function to find relation between each Codes- Ranking
variabl
Finally used RANK function to assign them values and you can see
the heatmap in the excel sheet that is attached Correlation
Insights