Gupta 2020
Gupta 2020
9th International Conference on System Modeling & Advancement in Research Trends, 4th–5th, December, 2020
Faculty of Engineering & Computing Sciences, Teerthanker Mahaveer University, Moradabad, India
Anshika Gupta1, Vinay Pant2, Sudhanshu Kumar3 and Pravesh Kumar Bansal4
1,2,3
Faculty of Engineering and Computing Sciences, iNurture, TMU, Moradabad
4
Department of Computer Science, Government Engineering College, Bharatpur
E-mail: [email protected], [email protected],
3
[email protected], [email protected]
Abstract—With the advancement in technology, there This system allows jumping on particular applications that
are so many enhancements in the banking sector also. The deserve to be approved on a priority basis.
number of applications is increasing every day for loan There are some features for the prediction like-
approval. There are some bank policies that they have to ‘Gender’, ‘Married’, ‘Dependents’, ‘Education’, ‘Self_
consider while selecting an applicant for loan approval. Based
Employed’, ‘ApplicantIncome’, ‘CoapplicantIncome’,
on some parameters, the bank has to decide which one is best
for approval. It is tough and risky to check out manually every ‘LoanAmount’, ‘Loan_Amount_Term’, ‘Credit_History’,
person and then recommended for loan approval. In this ‘Property_Area’, ‘Loan_Status’.
work, we use a machine learning technique that will predict II. Literature Survey
the person who is reliable for a loan, based on the previous
record of the person whom the loan amount is accredited A prediction is a statement about what someone thinks
before. This work’s primary objective is to predict whether will happen in the future. People make predictions all the
the loan approval to a specific individual is safe or not. time. Some are very serious and are based on scientific
Keyword: Loan Dataset, Logistic Regression, Random calculations, but many are just guesses. Prediction helps us
Forest, Django. in many things to guess what will happen after some time or
I. Introduction after a year or after ten years.
As the data are increasing daily due to digitization in Predictive analytics is a branch of advanced analytics
the banking sector, people want to apply for loans through that uses many techniques from data mining, statistics,
the internet. Artificial intelligence (AI), as a typical method modeling, machine learning, and artificial intelligence
for information investigation, has gotten more consideration to analyze current data to make predictions. “Adyan Nur
increasingly. Individuals of various businesses are utilizing Alfiyatin, Hilman Taufiq [2] and their friends work on the
AI calculations to take care of the issues dependent on house price prediction. They use regression analysis and
their industry information. Banks are facing a significant Particle Swarm Optimization (PSO) to predict house price”.
problem in the approval of the loan. Daily there are so many One other similar work on the Mohamed El Mohadab,
applications that are challenging to manage by the bank Belaid Bouikhalene [3] and Said Safi to predict the rank for
employees, and also the chances of some mistakes are high. scientific research paper using supervised learning. Kumar
Most banks earn profit from the loan, but it is risky to choose Arun, Garg Ishan and Kaur Sanmeet [1] work on bank loan
deserving customers from the number of applications. One prediction on how to bank approve a loan. They proposed
mistake can make a massive loss to a bank. a model with the help of SVM and Neural networks like
Loan distribution is the primary business of almost machine learning algorithms.
every bank. This project aims to provide a loan [1, 8] to a This literature review helps us carry out our work and
deserving applicant out of all applicants. An efficient and propose a reliable bank loan prediction model.
non-biased system that reduces the bank’s time employs
checking every applicant on a priority basis. The bank III. Proposed Methodology
authorities complete all other customer’s other formalities The process to predict the bank loan of the applicants is
on time, which positively impacts the customers. The best as shown in figure 1. There is a different phase in each step,
part is that it is efficient for both banks and applicants. which is described here.
Authorized licensed use limited to: California State University Fresno. Downloaded on June 21,2021 at 00:54:54 UTC from IEEE Xplore. Restrictions apply.
9th International Conference on System Modeling & Advancement in Research Trends, 4th–5th December, 2020
Faculty of Engineering & Computing Sciences, Teerthanker Mahaveer University, Moradabad, India
Authorized licensed use limited to: California State University Fresno. Downloaded on June 21,2021 at 00:54:54 UTC from IEEE Xplore. Restrictions apply.
Bank Loan Prediction System using Machine Learning
Here the task of the machine is to find the hidden pattern Heat map is a data visualization technique that shows
from the data by using cluster analysis. The dataset is the magnitude of a phenomenon as color in two dimensions
labelled so that here we used supervised learning approach. as shown in figure 4. Color intensity shows the relationship
In our work, we used a supervised learning approach. between each other. The color variation may be by hue or
intensity, giving an obvious visual to the reader about how
B. Algorithms used for Prediction
the phenomenon is clustered or varies over space. From this
1) Logistic Regression heat map, it is negative relation in Loan amount term with
It is a classification set of rules used to assign application come attribute.
observations to a discrete set of instructions. Logistic V. Experiment and Result Analysis
regression is also a predictive analysis, like other regression
In this section, we use a machine learning algorithm on
analyses methods. Logistic regression is basically used for
a loan prediction dataset and deploy the result using HTML,
define the relationship between dependent binary variable
CSS, Django at the local server. Figure 5 shows the loan
and nominal or other independent variable. Now a day’s
prediction system of the applicants based on the value enter
logistic regression is used in many research areas like
by the bank employee.
medical science, machine learning and social science. It
also used by many e-commerce applications to predict the
mind set of customer to buy the product.
2) Random Forest
Random Forest is a robust system learning algorithm
that is used for a ramification of responsibilities along
with classification and regression. Random forests method
overcome the over fitting issue of decision trees during
training. It is an ensemble method made up of a large
number of small decision trees [5,7] called estimators where
each tree produces the prediction. The random forest model
combines the predictions of the estimators to produce a
more accurate prediction.
C. Correlation between Parameters
Fig. 5: Final Layout
The first attribute is to select the gender of the applicant
either male or female. The second is the marital status then
dependents attributes mean that the applicant is dependent
financially on someone or not. Other attributes are the
education of the customer, employment status, applicant
income, loan amount term, credit history, residential area,
etc. of the applicant. Finally, it shows the status of the loan
i.e. it is safe or risky as shown in fig 6.
Authorized licensed use limited to: California State University Fresno. Downloaded on June 21,2021 at 00:54:54 UTC from IEEE Xplore. Restrictions apply.
9th International Conference on System Modeling & Advancement in Research Trends, 4th–5th December, 2020
Faculty of Engineering & Computing Sciences, Teerthanker Mahaveer University, Moradabad, India
Authorized licensed use limited to: California State University Fresno. Downloaded on June 21,2021 at 00:54:54 UTC from IEEE Xplore. Restrictions apply.