Bankruptcy Prediction Using Machine Learning
Nanxi Wang
1. Introduction
Machine learning is a subfield of computer science. It allows computers to build analytical models of data and find hidden insights automatically, without being explicitly programmed. It has been applied to many aspects of modern society, ranging from DNA sequence classification, credit card fraud detection, and robot locomotion to natural language processing. Many of these applications are classification tasks, and bankruptcy prediction is a typical example of a classification problem.
Machine learning grew out of pattern recognition. Earlier work on this topic (machine learning for bankruptcy prediction) used models including logistic regression, genetic algorithms, and inductive learning.
Logistic regression is a statistical method that allows researchers to build a predictive function from a sample. The model is best used for understanding how several independent variables influence a single outcome variable [1]. Though useful in some ways, logistic regression is also limited.
The genetic algorithm is based on natural selection and evolution. It can be used to extract rules in propositional and first-order logic, and to choose appropriate sets of if-then rules for complicated classification problems [2].
The main category of inductive learning is the decision tree algorithm. It identifies patterns in training data or earlier knowledge and extracts generalized rules, which are then used in problem solving [2].
To see whether the accuracy of bankruptcy prediction can be further improved, we propose three recent models: support vector machine (SVM), neural network, and autoencoder.
The support vector machine is a supervised learning method that is especially effective in high-dimensional settings, and it is memory efficient because it uses only a subset of the training points in the decision function. It also allows different kernel functions to be specified for the decision function [3]. Its mathematical formulation is a convex optimization problem, which guarantees convergence to a single global optimum.
Neural networks, unlike conventional programs, are expressive models that learn from examples. They contain multiple hidden layers and are therefore capable of learning very complicated relationships between inputs and outputs, and they operate significantly faster than conventional techniques. However, with limited training data, overfitting can degrade the ultimate accuracy. To prevent this, a technique called dropout temporarily and randomly removes units (hidden and visible) from the network during training [4].
The autoencoder, also known as the Diabolo network, is an unsupervised learning algorithm that sets the target values equal to the inputs. By doing so, it can represent certain functions with far less computation, which improves accuracy, and it reduces the amount of training data required to learn these functions [5].
This paper is structured as follows. Section 2 describes the motivation for this idea. Section 3 describes relevant previous work. Section 4 formally describes the three models. In Section 5 we present our experimental results, including a parallel comparison among the three chosen models and a longitudinal comparison with the three older models. Section 6 concludes, and Section 7 lists the references.
2. Motivation
The three models we choose (SVM, neural network, autoencoder) are relatively newly developed but have already been applied in many fields.
SVM has been used successfully in many real-world problems such as text categorization, object tracking, and bioinformatics (protein classification, cancer classification). Text categorization is especially helpful in daily life: web searching and email filtering provide great convenience and work efficiency.
Neural networks learn from examples rather than from hand-crafted algorithms, so they have been widely applied to problems where algorithmic methods are hard or impossible to apply [6]. Fingerprint recognition is one exciting application: people can now use their unique fingerprints as keys to unlock their phones and payment accounts, free from troublesome long passwords.
3. Related Work
Machine learning enables computers to find insights in data automatically. The idea of using machine learning to predict bankruptcy was previously explored in Predicting Bankruptcy with Robust Logistic Regression by Richard P. Hauser and David Booth [1]. That paper uses robust logistic regression, which finds the maximum trimmed correlation between the samples remaining after overly influential samples are removed and the model estimated by logistic regression [1]. This model has its limitations: its value relies heavily on the researchers' ability to include the correct independent variables. In other words, if researchers fail to identify all the relevant independent variables, logistic regression will have little predictive value [7]. Its overall accuracy is 75.69% on the training set and 69.44% on the testing set.
Another work, The Discovery of Experts' Decision Rules from Qualitative Bankruptcy Data Using Genetic Algorithms (2003) by Myoung-Jong Kim and Ingoo Han, uses the same dataset as we do. They apply older models: inductive learning (decision trees), genetic algorithms, and neural networks without dropout. Since the length of the genomes in a genetic algorithm is fixed, a given problem cannot always be encoded easily, and the genetic algorithm gives no guarantee of finding the global optimum. The problem with inductive learning is its one-step-ahead node splitting without backtracking, which may generate a suboptimal tree. Decision trees can also be unstable, because small variations in the data may result in a completely different tree being generated [3]. And the absence of dropout in the neural network model increases the possibility of overfitting, which affects accuracy. The overall accuracies are 89.7%, 94.0%, and 90.3%, respectively.
The models we choose either contain a newly developed technique, such as dropout, or are entirely new models that have hardly been utilized in bankruptcy prediction.
4. Model Description
This section describes the proposed three models.
4.1. Support Vector Machine
Given training vectors $x_i$, $i = 1, \ldots, n$, in two classes, and a label vector $y \in \{1, -1\}^n$, the support vector machine solves the following primal problem:

$$\min_{\omega, b, \zeta} \; \frac{1}{2} \omega^T \omega + C \sum_{i=1}^{n} \zeta_i$$

subject to

$$y_i \left( \omega^T \phi(x_i) + b \right) \geq 1 - \zeta_i, \quad \zeta_i \geq 0, \quad i = 1, \ldots, n.$$

Its dual is

$$\min_{\alpha} \; \frac{1}{2} \alpha^T Q \alpha - e^T \alpha$$

subject to

$$y^T \alpha = 0, \quad 0 \leq \alpha_i \leq C, \quad i = 1, \ldots, n,$$

where $e$ is the vector of all ones, $C > 0$ is the upper bound, $Q$ is an $n \times n$ positive semidefinite matrix with $Q_{ij} \equiv y_i y_j K(x_i, x_j)$, and $K(x_i, x_j) = \phi(x_i)^T \phi(x_j)$ is the kernel. Here the function $\phi$ implicitly maps the training vectors into a higher-dimensional space.

The decision function is

$$\operatorname{sgn} \left( \sum_{i=1}^{n} y_i \alpha_i K(x_i, x) + \rho \right)$$ [3].
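To make the formulation concrete, the following is a minimal sketch of training such a classifier with scikit-learn's SVC [3]. The synthetic feature matrix, the RBF kernel, and the value of C are illustrative assumptions, not the exact configuration used in our experiments.

```python
# Minimal C-SVC sketch with scikit-learn; data and hyperparameters are assumptions.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(250, 6))            # placeholder for 6 encoded attributes
y = np.where(X.sum(axis=1) > 0, 1, -1)   # placeholder labels in {1, -1}

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# C is the upper bound on the dual variables alpha_i; the RBF kernel supplies K(x_i, x_j).
clf = SVC(C=1.0, kernel="rbf")
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```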
4.2. Neural Network with Dropout
Consider a neural network with $L$ hidden layers. Let $z^{(l)}$ denote the vector of inputs into layer $l$, $y^{(l)}$ the vector of outputs from layer $l$ (with $y^{(0)} = x$ the input), and let $w^{(l)}$ and $b^{(l)}$ be the weights and biases at layer $l$. The feed-forward operation can be described as

$$z_i^{(l+1)} = w_i^{(l+1)} y^{(l)} + b_i^{(l+1)},$$

$$y_i^{(l+1)} = f \left( z_i^{(l+1)} \right),$$

where $f$ is the activation function [4]. With dropout, the outputs $y^{(l)}$ are first multiplied elementwise by a vector of independent Bernoulli random variables, so that a random subset of units is temporarily removed at each training step [4].
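As a small illustration of these equations, the NumPy sketch below performs one feed-forward step with dropout; the layer sizes, the ReLU activation, and the dropout rate are assumptions made for the example.

```python
# One feed-forward step with dropout in NumPy; sizes, activation, and rate are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def dropout_forward(y_prev, W, b, rate=0.5, training=True):
    """Compute y^(l+1) = f(w^(l+1) y~(l) + b^(l+1)) with a Bernoulli dropout mask."""
    if training:
        r = rng.binomial(1, 1.0 - rate, size=y_prev.shape)  # keep each unit with prob 1 - rate
        y_prev = r * y_prev                                  # thinned outputs y~(l)
    else:
        y_prev = (1.0 - rate) * y_prev                       # scale outputs at test time [4]
    z = W @ y_prev + b                                       # z^(l+1)
    return np.maximum(z, 0.0)                                # f = ReLU (assumed)

y0 = rng.normal(size=6)                  # six input features
W1 = 0.1 * rng.normal(size=(10, 6))      # weights for a 10-unit hidden layer
b1 = np.zeros(10)
print(dropout_forward(y0, W1, b1))
```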
4.3. Autoencoder
Consider an n/p/n autoencoder.
In Figure 4, let $F$ and $G$ denote sets, $n$ and $p$ be positive integers with $0 < p < n$, $\mathcal{A}$ be a class of functions from $G^p$ to $F^n$, and $\mathcal{B}$ be a class of functions from $F^n$ to $G^p$.

Define $X = \{x_1, \ldots, x_m\}$ as a set of training vectors in $F^n$. When there are external targets, let $Y = \{y_1, \ldots, y_m\}$ denote the corresponding set of target vectors in $F^n$. And $\Delta$ is a distortion function (e.g. $L_p$ norm, Hamming distance) defined over $F^n$.

For any $A \in \mathcal{A}$ and $B \in \mathcal{B}$, the input vector $x \in F^n$ becomes the output vector $A \circ B(x) \in F^n$ through the autoencoder. The goal is to find $A \in \mathcal{A}$ and $B \in \mathcal{B}$ that minimize the overall distortion function:

$$\min E(A, B) = \min \sum_{t=1}^{m} E(x_t) = \min \sum_{t=1}^{m} \Delta \left( A \circ B(x_t), x_t \right)$$ [9].
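A minimal sketch of this optimization follows, assuming a linear n/p/n autoencoder and the squared L2 norm as the distortion $\Delta$; the sizes, learning rate, and step count are illustrative assumptions.

```python
# Linear n/p/n autoencoder trained by gradient descent on the squared-error distortion.
# Sizes, learning rate, and step count are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
n, p, m = 6, 2, 250                 # 0 < p < n, m training vectors
X = rng.normal(size=(m, n))         # placeholder training vectors x_t in F^n

B = 0.1 * rng.normal(size=(p, n))   # encoder B: F^n -> G^p
A = 0.1 * rng.normal(size=(n, p))   # decoder A: G^p -> F^n
lr = 1e-4

for step in range(5000):
    H = X @ B.T                     # codes B(x_t)
    R = H @ A.T - X                 # residuals A∘B(x_t) - x_t
    # E(A, B) = sum_t Delta(A∘B(x_t), x_t) with Delta = squared L2 norm
    gA = 2 * R.T @ H                # gradient of E with respect to A
    gB = 2 * (R @ A).T @ X          # gradient of E with respect to B (through the codes)
    A -= lr * gA
    B -= lr * gB

print("final distortion:", float(((X @ B.T @ A.T - X) ** 2).sum()))
```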
For the decision tree used in our comparison, let the data at node $m$ be $Q$ with $N_m$ samples. A candidate split $\theta$ partitions the data into subsets $Q_{\mathrm{left}}(\theta)$ (with $n_{\mathrm{left}}$ samples) and $Q_{\mathrm{right}}(\theta)$ (with $n_{\mathrm{right}}$ samples), and the quality of the split is computed using an impurity function $H$:

$$G(Q, \theta) = \frac{n_{\mathrm{left}}}{N_m} H \left( Q_{\mathrm{left}}(\theta) \right) + \frac{n_{\mathrm{right}}}{N_m} H \left( Q_{\mathrm{right}}(\theta) \right)$$

The split that minimizes the impurity is selected, $\theta^* = \operatorname{argmin}_\theta G(Q, \theta)$. Then recurse on the subsets $Q_{\mathrm{left}}(\theta^*)$ and $Q_{\mathrm{right}}(\theta^*)$ until reaching the maximum allowable depth, $N_m < \mathrm{min\_samples}$, or $N_m = 1$ [3].
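The split criterion can be illustrated with a short sketch; the toy feature values, the labels, and the choice of Gini impurity for $H$ are assumptions made for the example.

```python
# Evaluating candidate splits with G(Q, theta); toy data, Gini impurity assumed for H.
import numpy as np

def gini(labels):
    """Impurity H(Q) = 1 - sum_k p_k^2 over the class proportions in Q."""
    if labels.size == 0:
        return 0.0
    _, counts = np.unique(labels, return_counts=True)
    prob = counts / labels.size
    return 1.0 - float(np.sum(prob ** 2))

def split_quality(x, y, threshold):
    """G(Q, theta) = n_left/N_m * H(Q_left(theta)) + n_right/N_m * H(Q_right(theta))."""
    left, right = y[x <= threshold], y[x > threshold]
    return left.size / y.size * gini(left) + right.size / y.size * gini(right)

x = np.array([0.1, 0.4, 0.5, 0.8, 0.9])   # one feature at node m
y = np.array([0, 0, 1, 1, 1])             # class labels
candidates = (x[:-1] + x[1:]) / 2          # midpoints as candidate thresholds
theta_star = min(candidates, key=lambda t: split_quality(x, y, t))
print("theta* =", theta_star)              # argmin_theta G(Q, theta)
```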
5. Experimental Results
The data we use, shown in Table 1, is the Qualitative Bankruptcy database created by A. Martin, J. Uthayakumar, and M. Nadarajan in February 2014 [10]. The attributes are industrial risk, management risk, financial flexibility, credibility, competitiveness, and operating risk.
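As a sketch of the preprocessing, the snippet below loads and encodes the dataset. The file name and the P/A/N attribute codes and B/NB class labels follow the UCI page [10]; the integer encoding itself is an assumption of this example.

```python
# Loading and encoding the Qualitative Bankruptcy data (sketch; file name per UCI [10],
# integer encoding of the qualitative values is an assumption of this example).
import pandas as pd

cols = ["industrial_risk", "management_risk", "financial_flexibility",
        "credibility", "competitiveness", "operating_risk", "class"]
df = pd.read_csv("Qualitative_Bankruptcy.data.txt", header=None, names=cols)

value_map = {"P": 1, "A": 0, "N": -1}              # positive / average / negative
X = df[cols[:-1]].replace(value_map).to_numpy()
y = (df["class"] == "B").astype(int).to_numpy()    # 1 = bankruptcy, 0 = non-bankruptcy
print(X.shape, y.mean())
```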
Table 3. Accuracy of the Neural Network Model with and without Dropout.

variation                              accuracy
without dropout                        0.9867 (loss 0.0462)
with dropout (dropout rate = 0.1)      0.9867 (loss 0.0292)
with dropout (dropout rate = 0.3)      0.9933 (loss 0.0300)
with dropout (dropout rate = 0.4)      0.9933 (loss 0.0401)
with dropout (dropout rate = 0.5)      0.9933 (loss 0.0278)
with dropout (dropout rate = 0.7)      0.9933 (loss 0.0428)
with dropout (dropout rate = 0.8)      0.9867 (loss 0.0318)

Table 4. Accuracy of the Neural Network Model with Two, Three, and Four Layers.

variation                                       accuracy
two layers with dropout (dropout rate = 0.5)    0.9933 (loss 0.0278)

Table 5. Accuracy of the Neural Network Model with Truncate 50 or 100 and with Four Layers.

variation        accuracy
truncate = 50    0.9899
truncate = 100   0.9933

Table 6. Accuracy of the Neural Network Model Compared with SVM and Decision Tree.

model            accuracy
As shown in Table 4 and Table 5, we can conclude that adding layers increases accuracy. Figure 5 and Figure 6 plot the results in Table 5.
6. Conclusions
Support vector machines, neural networks with dropout, and autoencoders are three relatively new models applied to bankruptcy prediction problems. Their accuracies outperform those of the three older models (robust logistic regression, inductive learning, genetic algorithms). The improvements include better control of overfitting, a higher probability of finding the global optimum, and the ability to handle large feature spaces. This paper compared and summarized the progress of machine learning models in bankruptcy prediction, and examined the performance of relatively new models that have rarely been applied in this field.
However, the three models also have drawbacks. SVM does not directly provide probability estimates; obtaining them requires an expensive five-fold cross-validation.
Also, if the data sample is not big enough, especially when it is outnumbered by the number of features, SVM is likely to perform poorly [4]. With dropout, training a neural network takes two to three times longer than training a standard neural network. An autoencoder captures as much information as possible, not necessarily the most relevant information, which can be a problem when the most relevant information makes up only a small percentage of the input. Solutions to these drawbacks are yet to be found.
References
[1] Hauser, R.P. and Booth, D. (2011) Predicting Bankruptcy with Robust Logistic Regression. Journal of Data Science, 9, 565-584.
[2] Kim, M.-J. and Han, I. (2003) The Discovery of Experts' Decision Rules from Qualitative Bankruptcy Data Using Genetic Algorithms. Expert Systems with Applications, 25, 637-646.
[3] Pedregosa, F., et al. (2011) Scikit-Learn: Machine Learning in Python. Journal of Machine Learning Research, 12, 2825-2830.
[4] Srivastava, N., et al. (2014) Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15, 1929-1958.
[5] Dev, D. (2017) Deep Learning with Hadoop. Packt Publishing, Birmingham, 52.
[6] Nielsen, F. (2001) Neural Networks—Algorithms and Applications. https://www.mendeley.com/research-papers/neural-networks-algorithms-applications-5/
[7] Robinson, N. (n.d.) The Disadvantages of Logistic Regression. http://classroom.synonym.com/disadvantages-logistic-regression-8574447.html
[8] Sima, J. (1998) Introduction to Neural Networks. Technical Report No. 755.
[9] Baldi, P. (2012) Autoencoders, Unsupervised Learning, and Deep Architectures. Journal of Machine Learning Research, 27, 37-50.
[10] Martin, A., Uthayakumar, J. and Nadarajan, M. (2014) Qualitative Bankruptcy Data Set, UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/qualitative_bankruptcy