Industrial Report On Banking System
Industrial Report On Banking System
Submitted By:
SUBMITTED TO:
(Signature of student)
Bilal Shahid
1650810043
Date: 17-10-2019
ACKNOWLEDGEMENT
The satisfaction that accompanies on the successful completion of this project would
be incomplete without the mention of the people who made it possible, without whose
constant guidance and encouragement would have made our efforts go in vain. We
consider ourself privileged to express gratitude and respect towards all those who
guided us through the completion of this project.
We convey thanks to our project guide Mr. Abhijeet Mishra for providing
encouragement, constant support and guidance which was of a great help to complete
this project successfully.
ETL LABS
ETL is a Professional and educational institute for Programming & Basic concept in
computer educations. The institute provides you with focus on the different
programming languages. It provides a wide range of different language courses as a
result of the continuous effort to meet the needs of a good programmer as well as all
types of competitive computer exams & to improve your personality on the whole.
These courses are designed in a manner that systematically equips them with the
knowledge and a technique required for any of the exams and helps them in progressing
in their careers.
Directors of ETL Labs Private Limited are Amit Singh and Ankit Kumar.
ETL provide all types of computer training such as 2 months summer training/
internship for B.Tech students, BCA, MSc (IT), MCA final year projects are done in
our center. We teach Java, .NET, PHP, Web Designing, VB.NET, ASP.NET, C, C++,
Website Creation, Live Projects.
ETL Labs Private Limited is an Indian Non-Government Company. It's a private
company and is classified as 'company limited by shares.
TABLE OF CONTENTS
S.NO CONTENT
1. INTRODUCTION
2. OBJECTIVE
3. BACKGROUND
4. BENEFITS
5. KEYWORDS
7. DATA DESCRIPTION
8. PACKAGES
9. LOGISTIC REGRESSION
11. SNAPSOTS
12. CONCLUSION
13. REFERENCES
Introduction
This project is being created as a result of the data sets of banking
department for the profit maximization of Banking System.
In this project our main motive to maximize the profit and reduce the loss
of bank, We try to make module with the help of machine learning we
make a decision framework which help to maximize the profit of banking
system.
On the basis of previous year of dataset we create our module with the
help of Machine learning which give a decision framework to understand
the applicant who try to apply the loan ,we approve them or not we can
set a criteria where the applicant is good credit risk or bad credit risk,
which help in no loss in while repaying the loan by applicant.
OBJECTIVE
• The main objective of this project “Minimization of risk and
maximization of profit on behalf of the bank.”, on the basis of
previous data set of banking system.
• Objective of this project increasing per unit revenue, decreasing
unit cost or mix of both.
BENEFITS
This report provides information for people who are interested in the banking industry
as well as in the credit risk topic. It indicates different approaches used to tackle the
credit risk issue. The case bank is of Middle East, thus businessmen from foreign
countries can gain information of how a typical credit risk management system works
through this report. For the case bank, the report’s outcome supports the credit
institution in improving its internal credit risk control. The comparison between the case
bank’s currently in used methods and the knowledge base can generate additional
models that the bank can apply. From the report’s results, we will give
recommendations to develop the case bank’s credit risk management further. Regarding
the field of specialization and career, we expect to gain an insight of how banks protect
themselves from high-risk loans. In addition, during the making of this report, we can
practice and improve the skills in various areas such as communication, data analysis,
decision making, which will support us in our future profession.
KEYWORDS
• Machine learning
• Profit maximization
• Banking System
• Minimize losses Maximize profit
• Lower Risk
• Data Processing
• Module Based on Machine Learning
• Libraries (Pandas, Numpy, Matplotlib, Scikit learn, Seaborn, Plotly,
Logistic Regression, Decision tree)
• Data analytics
In this project we use Machine Learning through Python Language which will run
on jyputer notebook.
Python:
Python is an interpreted, high-level, general-purpose programming
language. Created by Guido van Rossum and first released in 1991, Python's design
philosophy emphasizes code readability with its notable use of significant whitespace.
Python Features:
Python provides lots of features that are listed below.
Machine Learning:
Machine learning is a type of artificial intelligence (AI) that
provides computers with the ability to learn without being explicitly programmed.
Machine learning focuses on the development of Computer Programs that can
change when exposed to new data.
DATA DESCRIPTION
The dataset contains 1000 entries where each entry represents a person who takes
credit by a bank. Each person is classified as a good or a bad credit risk according to
the set of attributes.
There are two types of variables used in this report :-
1.Numeric variables
2.Categorical variables
Packages
Pandas:
Pandas is the most popular python library that is used for data analysis. It
provides highly optimized performance with back-end source code is purely written in
C or Python.
Numpy:
NumPy is not another programming language but a Python extension
module. It provides fast and efficient operations on arrays of homogeneous data.
NumPy extends python into a high-level language for manipulating numerical data,
similiar to MATLAB.
Matplotlib:
Matplotlib tries to make easy things easy and hard things possible. You can
generate plots, histograms, power spectra, bar charts, errorcharts, scatterplots, etc.
Seaborn:
Seaborn is a Python data visualization library based on matplotlib. It
provides a high-level interface for drawing attractive and informative statistical
graphics.
Scikit learn:
A python library which contains very good and efficient implementations
of various machine learning algorithms.
It is Simple and efficient tools for data mining and data analysis, Accessible to
everybody, and reusable in various contexts.
Plotly:
Plotly Also a plotting library which creates interactive plots.
Logistic Regression
Logistic Regression is a Machine Learning classification algorithm that is used to
predict the probability of a categorical dependent variable. In logistic regression, the
dependent variable is a binary variable that contains data coded as 1 (yes, success,
etc.) or 0 (no, failure, etc.).
Advantages:
• is easily interpretable
Disadvantages:
• while working with Logistic regression you are not able to handle a large number
of categorical features/variables.
• it is vulnerable to overfitting
• it cant solve the non-linear problem with the logistic regression model that is why
it requires a transformation of non-linear features
• Logistic regression will not perform well with independent(X) variables that are
not correlated to the target(Y) variable.
Decision Tree
Decision tree learning is a method commonly used in Machine learning. It is also a
method for supervised learning. The goal is to create a model that predicts the value of
a target variable based on several input variables.
Disadvantage:
Snapsots
So on 30% split, we maximized our profit upto 3 times.
Conclusion
The main purpose of this project to maximize the profit of banking system and
remove the deficiency what we face in previous few year. On the basis of previous
year data we can mine the data according to our need and find the result on the basis
of different situation machine learning library and tools helps for data mining and we
plot the data as graph and with the help of graph we easily classify according to our
need.
The basic specialization on this project it help to understand the data and deficiency
what we face with previous customer we can easilyunderstand what is better option
for maximize the profit.
Under this era,banks should simultaneously enrich their statistical techniques in
order to accommodate the increase availability of data, and to exploit all possible
dimensions of information collected.
“It’s a statistical technique to improve the banking system on the basic of previous
data and maximize the profit”
REFRENCES
• https://www.kaggle.com/startupsci/titanic-data-science-solutions For model
analysis and visualization Data analytics and Machine learning tools.
• https://towardsdatascience.com/building-a-logistic-regression-inpython-
step-by-step-becd4d56c9c8
• https://www.geeksforgeeks.org/introduction-machine-learning-
usingpython/#targetText=Introduction%20To%20Machine%20Learning%2
0using%20Python,when%20exposed%20to%20new%20data.
• https://www.datacamp.com/community/tutorials/decision-treeclassification-
python#targetText=Decision%20Tree%20Algorithm,known%20as%2
0the%20root%20node.