Working Paper Series: A New Approach To Early Warning Systems For Small European Banks
Disclaimer: This paper should not be reported as representing the views of the European Central Bank
(ECB). The views expressed are those of the authors and do not necessarily reflect those of the ECB.
Abstract
This paper describes a machine learning technique for the timely identification of cases of
individual bank financial distress. Our work represents the first attempt in the literature to
develop an early warning system specifically for small European banks.
We build a decision tree model using a dataset of official supervisory reporting,
complemented with qualitative banking sector and macroeconomic variables.
We propose a new and wider definition of financial distress, in order to capture bank
distress cases at an earlier stage than the existing literature on bank failures. Given the
rarity of bank defaults in Europe, this significantly increases the number of events on which
to estimate the model, thus increasing the model precision. It also identifies bank crises at
an earlier stage than the usual default definition, therefore leaving a time window for
supervisory intervention.
The Quinlan C5.0 algorithm we use to estimate the model also allows us to adopt a
conservative approach to misclassification: as we deal with bank distress cases, we consider
missing a distress event twice as costly as raising a false flag.
Our final model comprises 12 variables in 19 nodes, and outperforms a logit model
estimation, which we use to benchmark our analysis; validation and back-testing also suggest
that the good performance of our model is relatively stable and robust.
In this paper we build an early warning system to identify cases of financial distress at the level
of individual institutions in the European banking sector.
The motivation for this work stems from the desire to support the National Competent
Authorities in their daily supervisory work, as well as Divisions within the European Central
Bank conducting oversight functions, by providing them with a purely quantitative tool to
complement the expert knowledge component of the supervisory risk assessment work.
This model represents only one of the many tools in the hands of the supervisor, and its
role is solely that of helping to shape and prioritise efforts towards certain institutions,
whose situation might need to be followed more closely. In this respect, it should in no way be
thought of as the trigger of any direct or indirect supervisory action.
There is a tendency in the literature to build Early Warning Systems to predict bank defaults.
This is an undoubtedly useful theoretical exercise, but the identification happens too late for the
supervisor to intervene. The goal of our model is to identify an institution when in distress, with
a broader definition based on previous literature and regulation (Bank Recovery and Resolution
Directive, SSM Framework Regulation, Directive 2014/49/EU). With this change of definition,
we obtain a sample of 350 distress cases over a time span of only six quarters, allowing us to
estimate the model in a more precise manner.
The dataset we use is based on bank-specific variables coming from quarterly supervisory
data (mainly COREP and FINREP), complemented with banking-sector-specific variables (e.g.
whether a bank is a member of an Institutional Protection Scheme) and macro-economic
indicators. Our panel is particularly wide, as we collect data for around 3,000 institutions.
We estimate a decision tree model with a supervised machine learning technique, using
Quinlan's C5.0 algorithm. This methodology allows us to adopt a conservative approach
to misclassification: we consider missing a distress event twice as costly as raising a false
flag, given the supervisory framework in which we operate.
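The effect of this 2:1 cost ratio can be illustrated with a minimal sketch. It assumes that a classifier returns an estimated distress probability p for each bank; it does not reproduce how C5.0 uses misclassification costs internally. Under that assumption, flagging a bank minimises the expected cost whenever 2p > 1 - p, that is whenever p exceeds 1/3 rather than 1/2. The function names are illustrative and do not come from our implementation.

```python
def expected_cost(p_distress: float, flag: bool,
                  cost_fn: float = 2.0, cost_fp: float = 1.0) -> float:
    """Expected misclassification cost of one decision for one bank."""
    if flag:
        # A flag is costly only if the bank turns out to be healthy (false flag).
        return cost_fp * (1.0 - p_distress)
    # No flag is costly only if the bank turns out to be distressed (missed event).
    return cost_fn * p_distress


def should_flag(p_distress: float,
                cost_fn: float = 2.0, cost_fp: float = 1.0) -> bool:
    """Flag the bank when flagging has the lower expected cost."""
    return (expected_cost(p_distress, True, cost_fn, cost_fp)
            < expected_cost(p_distress, False, cost_fn, cost_fp))


def flag_threshold(cost_fn: float = 2.0, cost_fp: float = 1.0) -> float:
    """Probability above which flagging is cheaper: cost_fp / (cost_fp + cost_fn)."""
    return cost_fp / (cost_fp + cost_fn)
```

With symmetric costs the threshold returns to 1/2, so the asymmetry only shifts borderline cases towards a flag.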
We build our model in a forward-looking perspective, so that the identification of distress
cases is timely enough to leave a time window for the supervisor to intervene: we map
distress events at time t to explanatory variables at t-h, where h is the prediction horizon
of one quarter.
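The mapping of labels at t to predictors at t-h can be sketched as follows. This is an illustration under stated assumptions, not our data pipeline: the bank identifiers, the `leverage_ratio` values and the dictionary layout are all hypothetical.

```python
from typing import Dict, List, Tuple

Quarter = Tuple[int, int]   # (year, quarter), e.g. (2016, 1)
Key = Tuple[str, Quarter]   # (bank identifier, quarter)


def shift_quarter(q: Quarter, h: int) -> Quarter:
    """Return the quarter that lies h quarters before q."""
    year, quarter = q
    index = year * 4 + (quarter - 1) - h
    return index // 4, index % 4 + 1


def build_training_rows(features: Dict[Key, Dict[str, float]],
                        distress: Dict[Key, int],
                        h: int = 1) -> List[Tuple[str, Quarter, Dict[str, float], int]]:
    """Pair each distress label at t with the same bank's predictors at t-h."""
    rows = []
    for (bank, quarter), label in sorted(distress.items()):
        lagged = features.get((bank, shift_quarter(quarter, h)))
        if lagged is not None:   # drop observations with no lagged predictors
            rows.append((bank, quarter, lagged, label))
    return rows


# Hypothetical toy data: two banks, one predictor.
features = {
    ("bank_a", (2015, 4)): {"leverage_ratio": 0.05},
    ("bank_a", (2016, 1)): {"leverage_ratio": 0.03},
    ("bank_b", (2016, 1)): {"leverage_ratio": 0.06},
}
distress = {
    ("bank_a", (2016, 1)): 0,
    ("bank_a", (2016, 2)): 1,
    ("bank_b", (2016, 1)): 0,
}
rows = build_training_rows(features, distress, h=1)
```

In this toy example the distress event of `bank_a` in 2016Q2 is paired with its 2016Q1 predictors, while the `bank_b` observation is dropped because no 2015Q4 predictors exist for it.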
1
As defined by the European supervisor, according to a precise set of rules contained in the SSM Guide to
Banking Supervision:
https://www.bankingsupervision.europa.eu/ecb/pub/pdf/ssmguidebankingsupervision201411.en.pdf
Models for the early identification of bank financial distress represent a useful tool both for the
theoretical work of the researcher and the practical daily use of the supervisor. They help
the researcher understand what it is that drives a bank into distress and tailor the investigation
of bank crises, and they allow for the timely intervention of the supervisor and in most cases the
triggering of policy actions before the financial situation of an institution further deteriorates.
With this work, we propose an early warning model for the timely identification of distress
of financial institutions, based on a large sample of small European banks.
Existing and comparable models are usually based on conventional modeling techniques such
as multivariate logit models, and are calibrated using only a very small number of default (and
not just distress) events. We propose to innovate on the current theoretical framework along
two lines. First, we create a new definition of distress event inspired by literature and regulation,
and obtain a sample of distress events in our dataset of small European banks that is significantly
larger than in most other works in the literature, which focus on bank defaults. Second, we propose
a machine learning methodology to build a decision tree, which notably improves the predictive
performance with respect to the most usual modeling techniques (we benchmark our decision
tree against a logit estimation).
As said, our proposed approach to defining bank distress enlarges the usual sample size of
distress events and therefore improves the learning of the model. We propose to classify banks
as distressed on the basis of the BRRD's early intervention measures and of its criteria for
categorizing banks as failing or likely to fail. Since this definition does not constitute the final
stage of a bank's failure, the system will predict the distress event (footnote 2) early enough to
allow the supervisor to intervene and adopt preemptive measures to tackle the financial
deterioration.
This paper makes use of a decision tree model, a technique often applied in machine learning
for classification problems, with the goal of constructing a flexible and interpretable signalling
tool for banking supervision. The proposed early warning system (EWS) is able to
predict individual cases of bank distress and to identify the variables driving the financial
deterioration.
This theoretical framework is applied to a unique dataset of more than 3,000 small European
2
Whose definition is outlined in section 5.
3 Background
This paper describes an early warning system for the European banking sector. This is an explicit
choice that is both practical - as granular and frequent data is available for SSM institutions
- and theoretical - since wider samples covering wider geographical areas might have even
stronger heterogeneity problems, and it might therefore be difficult to build a one-size-fits-all
model for a global sample (Davis et al., 2011). The European banking system is extremely diverse
and comprises institutions of different size, scope and business model. Banks range from big globally
significant institutions to small local savings and cooperative banks, together with retail banks
of different sizes, investment banks, custodians and asset managers, among others.
In November 2014 the European Central Bank assumed responsibility for the supervision
of euro area banks (footnote 4) and centralised the direct supervision of around 120 significant
banking groups (footnote 5), and set up supervisory standards for the remaining 3,000 smaller
institutions (classified as less significant - LSIs from here on), the direct supervision of which
is left to the National Competent Authorities (but still conducted in close collaboration with the ECB).
In this framework, the strong fragmentation of the European banking system called for
a rigorous quantitative approach like the model we present in this paper to complement and
facilitate the daily work of the analyst. Moreover, only recently have there been attempts in
the literature to develop a model for the early detection of distress cases (see e.g. Rosa and
Gartner (2018)), as researchers tend to focus on bank failures or on systemic bank crises
(footnote 6) rather than single-bank distress. Although such models are an undoubtedly useful
theoretical exercise, their practical usefulness is limited, as once a bank is defaulting the
situation is often irreversible. The reason for this is also technical, as it is often the lack of
data that pushes
3
As defined by the Single Supervisory Mechanism of the European Central Bank.
4
https://www.ecb.europa.eu/press/pr/date/2014/html/pr141104.en.html
5
Data source: EU Banking Supervision Website.
6
On this see the extensive literature reviews by Kumar Ravi and Ravi (2006) and Davis and Karim (2008) for
banking crises and Gramlich et al. (2010) for systemic banking crises.
4 Literature Review
Economics is only one of the disciplines that make use of early warning systems, which are
used in a wide variety of fields, from disaster management for natural events, to medicine for the
timely identification of diseases, to the world of social media. Researchers in these fields often
borrow calculation and modelling techniques from other disciplines like physics and engineering,
7
These types of banks originated in Europe between the late 18th and early 19th centuries, with the goal of
offering banking services to farmers, workers and small entrepreneurs, who at the time were facing extreme
difficulties in accessing credit.
• Capital adequacy,
• Asset quality,
• Management,
• Earnings,
• Liquidity,
• Sensitivity to market risk,
and represent a useful starting point for the variable selection of any banking model.
Not surprisingly, the interest in the literature on predicting bank distress events peaked after
the 2007 financial crisis: Jin et al. (2011 and 2013) further developed the CAMELS approach
by complementing the six indicators with data on banks' internal controls on risk-taking and
audit quality variables, to find an improved predictive rate. Cole and White (2012) found that
measures of commercial real estate investments are also relevant for predicting bank distress.
Betz et al. (2013) also use the CAMELS indicators as a starting point to build their early
warning system on European banks.
A good recap of the variables most widely used in the literature is in Oet et al. (2013) who,
despite focusing on explanatory variables for systemic risk and financial distress, present a list
that contains many of the variables that also make up our model.
We partly rely on this literature to construct the initial dataset for our decision tree. We
build the model using three sets of variables: bank-specific indicators, banking-sector variables
and country-level macro-financial variables, using the CAMELS approach as one of our starting
points.
As the occurrence of a crisis - no matter how it is defined - can be easily described by a
dummy variable, a common approach in the literature is to use logit/probit models. Thomson
(1991) and Cole and Gunther (1998) estimate logit and probit models to show that vulnerability
Our model uses supervisory reporting, micro- and macro-economic data covering around 3,000
European less significant institutions over a period of four years. The sources of bank-level
data are the Common Reporting (COREP, containing information on capital adequacy and
risk-specific information), available on a quarterly basis since December 2014, and the Financial
Reporting (FINREP, which includes balance sheet items, the statement of profit and loss and
detailed breakdowns of assets and liabilities by product and counterparty), available with
different frequencies since December 2014 and on a quarterly basis since March 2017. COREP covers
the entire LSI universe since its inception; FINREP instead presents different reporting frequencies
and levels of granularity depending on the complexity and size of the reporting entity. For our
purposes it has therefore been integrated with an ad hoc data collection carried out by the Single
Supervisory Mechanism on a bi-annual basis. The macro-economic data are mainly obtained from
the ECB Statistical Data Warehouse, complemented by Eurostat and the OECD for the
regional data.
In order for an early warning system on bank distress to be useful in practice for the
supervisor and the policy maker, the recognition of the distress event must be timely enough
to allow a buffer of time for intervention. If we consider the failure or liquidation of a bank
as the triggering event, as often defined in the literature (see e.g. the literature review by Gissel
et al. (2007), or the more recent works by Jin et al. (2011) or Cole and White (2012)),
we lose the practical validity of the model, which would in turn be helpful only for ex post
calibrations. Moreover, bank failures in Europe are relatively rare, which makes the estimation
of such an early warning system even more challenging. We therefore relax the traditional
hypothesis of considering only bank crises or defaults as positive events in the sample (as e.g.
done by Kaminsky and Reinhart (1999), who mark the beginning of a banking crisis by a bank run
leading to closure, merging or take-over by the public sector of a bank, or large-scale government
• It is deemed to be failing or likely to fail within the meaning of Article 32 of the BRRD.
For categorizing a bank as failing or likely to fail, indicators assessing whether a bank has
breached the minimum capital requirements or capital buffers are constructed;
• It meets the conditions for early intervention pursuant to Article 27 of the BRRD. The
triggers used to meet the conditions of early interventions consist of indicators for assessing
if a bank is close to breaching minimum capital requirements;
• Its senior management or management body is removed, in its entirety or with regard to
individuals, in line with Article 28 of the BRRD;
• There is a rapid and significant deterioration of its financial situation according to Article
96 of the Framework Regulation. This is based on expert judgement by national central
banks and in-house qualitative and quantitative data;
• One of the four types of conventional bank distress events proposed by Betz et al (2013)
(i.e. bankruptcies, liquidations, state interventions and forced mergers) is met.
The distribution of distress events is not homogeneous across countries, and the frequency
of cases varies significantly among jurisdictions. This is mainly due to two factors: first,
the different levels of fragmentation of the banking system in Europe, with our sample of
institutions strongly polarised towards certain geographies; and second, the economic situation
of the countries, as distress cases are much more frequent in countries with a relatively
weak economic situation.
The current prediction horizon of only one quarter was chosen due to the availability of data.
A longer prediction horizon would in general be desirable, but it would result in a smaller
sample of distress events, which in turn would negatively affect the predictive performance of
the model (footnote 8).
6 Methodology
Data pre-processing steps are required to ensure that unreliable and noisy data as well as ir-
relevant and redundant information is eliminated prior to the modelling phase. As such, the
final training dataset used for the analysis is of high quality, thus increasing the efficiency and
performance of the final model. This represents a key step in our process, as we start from the
extremely vast dataset of supervisory reporting consisting of more than 3,000 variables, and end
up with a final set of only 12 variables.
As explained above, the extensive and detailed dataset we use to build our model relies
mainly on SSM supervisory reporting data. The available data is extensive and variables tend
to be highly correlated - if not linear combinations of others - therefore data pre-processing
represents a crucial step for the construction of our early warning system.
8
The extension of the prediction horizon will however be the subject of future developments of this work.
Our early warning system is based on a decision tree methodology. The predictive performance of
this technique is very high, both in and out of sample, as demonstrated by the data on accuracy
presented in section 6; moreover, this methodology handles missing values well, a common issue
in the early warning literature (Mitchell 1997), and one of the main flaws in our dataset. Finally,
the decision tree methodology is relatively transparent, and allows supervisors to interpret the
output tree and understand which indicators affect bank distress, thus minimizing the risk of
creating a black box model.
A decision tree is a classification technique commonly used in machine learning. The tree
recursively identifies the significant indicators and their respective thresholds which best split
the sample into the pre-determined classes (in our case distress and no distress).
In more detail, a decision tree is a hierarchical model that identifies local regions (leaf nodes)
via a sequence of recursive splits. Starting from the root of the tree, at each non-leaf node a test
$$H = -\sum_{i=1}^{n} P_i \log_2 P_i \tag{1}$$
which is linked to the underlying probability of occurrence of value i : high entropy indicates
that all classes are (nearly) equally likely (high impurity), while low entropy indicates that few
classes are likely, and others are rarely observed (high purity).
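As a worked illustration of equation (1), consider three two-class nodes with hypothetical class shares (these are not figures from our sample): an evenly mixed node, a node with an 80/20 split, and a pure node. By the usual convention, a class with zero probability contributes nothing to the sum.

```latex
\begin{align*}
H_{\text{mixed}}  &= -\left(0.5\log_2 0.5 + 0.5\log_2 0.5\right) = 1, \\
H_{\text{skewed}} &= -\left(0.8\log_2 0.8 + 0.2\log_2 0.2\right) \approx 0.722, \\
H_{\text{pure}}   &= -\left(1 \cdot \log_2 1\right) = 0 .
\end{align*}
```

The entropy of a two-class node therefore ranges from 0 (pure) to 1 (maximally impure).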
The concept that guides the choice of the split is the maximization of the information gain,
based on conditional entropy:
$$IG = H - (H_L \, p_L + H_R \, p_R) \tag{2}$$
where $H$ is the entropy of the parent node, $H_L$ and $H_R$ are the entropies of the left and
right child nodes, and $p_L$ and $p_R$ are the probabilities that a random input is sent to the
left and to the right node, respectively.
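Equations (1) and (2) can be turned into a short threshold search for a single numeric indicator. The sketch below is a simplified illustration using plain information gain exactly as written in equation (2); the C5.0 algorithm applies further refinements to its splitting criterion that are not reproduced here. The function names and the toy values in the usage note are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple


def entropy(labels: Sequence[int]) -> float:
    """Shannon entropy (base 2) of the class labels in a node, equation (1)."""
    n = len(labels)
    if n == 0:
        return 0.0
    h = 0.0
    for cls in set(labels):
        p = list(labels).count(cls) / n
        h -= p * math.log2(p)
    return h


def information_gain(parent: Sequence[int],
                     left: Sequence[int],
                     right: Sequence[int]) -> float:
    """Entropy of the parent minus the weighted entropy of the children, equation (2)."""
    n = len(parent)
    p_left, p_right = len(left) / n, len(right) / n
    return entropy(parent) - (entropy(left) * p_left + entropy(right) * p_right)


def best_threshold(values: Sequence[float],
                   labels: Sequence[int]) -> Tuple[Optional[float], float]:
    """Try midpoints between consecutive distinct values; keep the highest gain."""
    pairs = sorted(zip(values, labels))
    best: Tuple[Optional[float], float] = (None, 0.0)
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no split point between identical values
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left: List[int] = [lab for v, lab in pairs if v <= threshold]
        right: List[int] = [lab for v, lab in pairs if v > threshold]
        gain = information_gain(labels, left, right)
        if gain > best[1]:
            best = (threshold, gain)
    return best
```

For example, with indicator values `[-0.02, -0.01, 0.01, 0.03]` and labels `[1, 1, 0, 0]` (1 meaning distress), the search selects the threshold 0.0, which separates the two classes perfectly and yields a gain of 1 bit.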
The final output of this recursive classification technique is a tree, illustrating a set of if-then
rules (decision nodes) to reach a final decision on the classification (leaf nodes). In our case, for
each bank, the classification starts from the root decision node and, based on the values of the
predictors, follows a path along the tree until a leaf node is reached, which classifies the bank
as in distress or not.
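The path from the root to a leaf can be sketched in a few lines. The toy tree below is purely hypothetical: its structure, indicator names and thresholds are made up for illustration and are not those of our estimated tree, whose thresholds are not disclosed in this paper.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple, Union


@dataclass
class Leaf:
    label: str                      # final classification


@dataclass
class Node:
    feature: str                    # indicator tested at this decision node
    threshold: float                # split point (hypothetical values below)
    left: Union["Node", Leaf]       # branch taken when value <= threshold
    right: Union["Node", Leaf]      # branch taken when value > threshold


def classify(tree: Union[Node, Leaf],
             bank: Dict[str, float]) -> Tuple[str, List[str]]:
    """Follow the if-then rules from the root until a leaf is reached."""
    node = tree
    path: List[str] = []
    while isinstance(node, Node):
        go_left = bank[node.feature] <= node.threshold
        path.append(f"{node.feature} {'<=' if go_left else '>'} {node.threshold}")
        node = node.left if go_left else node.right
    return node.label, path


# Hypothetical two-level tree: loss-making banks go left, profit-making banks go right.
toy_tree = Node(
    feature="adjusted_profitability", threshold=0.0,
    left=Node(feature="leverage_ratio", threshold=0.03,
              left=Leaf("distress"), right=Leaf("no distress")),
    right=Leaf("no distress"),
)

label, path = classify(toy_tree, {"adjusted_profitability": -0.01,
                                  "leverage_ratio": 0.02})
```

The returned `path` lists the rules applied along the way, which is what makes the output of a tree readable for a supervisor.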
We employ Quinlan's C5.0 algorithm to build the classification tree model. The C5.0 algorithm
is one of the most commonly used, as it is relatively fast and accurate, as well as efficient in
• We impose a relatively short prediction horizon (1-3 months ahead of distress as starting
point), given the short term (<1 year) scope of this EWS. By considering pre-default
events as target variable, we ensure that the system has a forward looking perspective;
• To increase the robustness of simple decision trees (which is relatively low, as underlined
by Alessi and Detken (2014), who use a Random Forest method to overcome the problem),
we employ a boosting technique à la Freund et al. (1999) to identify which variables to
include in the final version of the tree (footnote 10).
• As there is no univocal rule to choose one particular tree among the estimated ones, we
use the boosting technique to simulate the creation of a large number of trees, and select
the 20 variables that rank highest in importance (measured in terms of how often they
appear in the trials).
• We complement the variable selection with both expert judgement and quantitative
measures: in particular, for evaluating the performance of the model, we rely on the area
under the Receiver Operating Characteristic curve (AUC) and Cohen's kappa statistic,
both standard measures of accuracy in the early warning system literature (e.g. Peltonen
et al. (2015)).
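The two performance measures named in the last bullet can be computed from their textbook definitions. The sketch below is a plain implementation of those definitions, not the code used for our results; the function names are illustrative, and labels are assumed to be coded 1 for distress and 0 otherwise.

```python
from typing import Sequence


def cohen_kappa(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Agreement between predictions and outcomes, corrected for chance agreement."""
    n = len(y_true)
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    classes = set(y_true) | set(y_pred)
    expected = sum((list(y_true).count(c) / n) * (list(y_pred).count(c) / n)
                   for c in classes)
    if expected == 1.0:
        raise ValueError("kappa is undefined when only one class is present")
    return (observed - expected) / (1.0 - expected)


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Share of (distress, non-distress) pairs ranked correctly; ties count one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one observation of each class")
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))
```

Kappa is informative here because distress events are rare: a model that never raises a flag can reach a high raw accuracy, yet its kappa is zero.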
The final tree is composed of 19 nodes, covering 12 different explanatory variables, and is
represented in Figure 1 (footnote 11).
9
For a literature review of data mining algorithms see Wu et al. (2008). The R implementation used in
this paper is that of Kuhn et al. (2015).
10
Boosting is a technique for generating and combining multiple classifiers to improve the predictive accuracy
of the model. Instead of using a single tree, n separate decision trees (trials) are grown and combined to make
predictions. The error rate of the boosted classifier is often substantially lower than that of single trees.
11
Please note that the trees represented in this version of the paper are partly anonymised, i.e. shown
without the precise splitting thresholds of each node.
• Adjusted profitability;
• Deficit-to-GDP ratio;
• GDP growth;
• Leverage ratio;
• Equity exposures;
• Exposures in default;
The indicator of the parent node is profitability, adjusted for the different accounting
standards of the banks in the sample. The node splits the banks between profit-making (right
branch) and loss-making (left branch) institutions. The remaining variables are a mix of
macro-economic indicators (deficit-to-GDP ratio and real GDP growth) and banking indicators
covering the most important risks: credit (non-performing loans ratio, non-performing loans
coverage ratio, exposures in default), liquidity (liquidity coverage ratio), market (captured by
the sum of trading financial assets and financial liabilities held for trading over total assets,
and net gains on financial assets and liabilities held for trading over total operating income),
capital (leverage ratio and equity exposures), together with the qualitative information of
whether a bank is a member of an institutional protection scheme (IPS).
In the framework of supervised learning, the role of our tree is not only to find an efficient
method to identify which banks are in financial distress, but also to suggest which variables are
significant and how they model the distress. In this perspective, it is interesting to analyse the
12
The sum of trading financial assets and financial liabilities held for trading over total assets, and net gains
on financial assets and liabilities held for trading over total operating income.
The supervised learning estimation framework of the decision tree allows us to perform a
precise ex post back-testing analysis. In the course of 2017 the model correctly identified 79% of
distress events. The missed financial deterioration cases were mainly triggered by qualitative
13
We build the correlation matrix and reduce pairwise correlation by identifying the pairs of variables
that are too highly correlated and, for each pair, eliminating the variable with the largest mean absolute correlation.
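One way to implement a filter of this kind is sketched below. It is a greedy variant written under assumptions: the cut-off of 0.9, the recomputation of correlations after each removal and the variable names are illustrative choices, not details taken from our procedure.

```python
import math
from typing import Dict, List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long series."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def drop_correlated(data: Dict[str, List[float]], cutoff: float = 0.9) -> List[str]:
    """Repeatedly take the most correlated pair above the cut-off and drop the
    member with the larger mean absolute correlation with the other variables."""
    kept = list(data)
    while len(kept) > 1:
        corr = {(a, b): abs(pearson(data[a], data[b]))
                for a in kept for b in kept if a != b}
        (a, b), worst = max(corr.items(), key=lambda item: item[1])
        if worst <= cutoff:
            break  # no remaining pair is too highly correlated

        def mean_abs(v: str) -> float:
            return sum(corr[(v, o)] for o in kept if o != v) / (len(kept) - 1)

        kept.remove(a if mean_abs(a) >= mean_abs(b) else b)
    return kept


# Hypothetical toy data: x and x_scaled are nearly collinear, z is not.
toy = {
    "x": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x_scaled": [2.0, 4.0, 6.0, 8.0, 10.5],
    "z": [5.0, 1.0, 4.0, 2.0, 3.0],
}
kept = drop_correlated(toy, cutoff=0.9)
```

In the toy data one of the two nearly collinear series is removed and `z` is retained.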
7 Conclusions
This paper develops an innovative model to identify cases of bank financial distress, using a
subsample of 3,000 small European institutions (footnote 14) for a time period of six quarters
(between 2014 and 2016).
We build a sample of distress cases based on European regulation, to detect future
cases of financial deterioration early rather than simply referring to banks that are already in or
close to default. With a broad definition of financial deterioration, our sample of distress events is
significantly larger than that of any other work in the literature, despite a relatively short time
series of data.
On this sample we construct a decision tree model, which accurately classifies banks as
distressed or non-distressed; the prediction horizon of the model is one to three months, a time
span that would in our view give the supervisor enough time to trigger supervisory action.
We find that the predictive power of our model is extremely high, and the decision tree
consistently outperforms the logit approach, the most widely used methodology for binary
classification, which we use as a benchmark.
As a final remark, further extensions of this work should go in the direction of increasing
the prediction horizon (currently 1-3 months), and could include the development of an ad hoc
tree for each business model. Unfortunately, the data does not yet allow us to do this: given
the polarisation of the dataset towards retail lenders, the sample of remaining business models
is simply not large enough to allow for a proper estimation.
A clear limitation of this work is the length of the time series, which we try to partly
overcome with the size of the panel and the granularity of the data. However, we plan to
re-estimate the model with a longer time series when it becomes available, to continue to back-test
the predictive performance of the model and to analyse the changes in the business models of the
banks; the latter in particular could be achieved in two ways: first, by re-estimating the model
to understand how the environment changed, and what new variables contribute to describing
the business of an institution; second, by conducting a case-by-case analysis for institutions for
which the model changes classification: in this way, it would be possible to get some insights on
14
Excluding the so called Significant Institutions, as defined by the SSM.
[1] Aldasoro I., Borio C. and Drehmann M., 2018, Early Warning Indicators of Banking Crises:
Expanding the Family, BIS Quarterly Review, March 18, 29-45.
[2] Alessi L. and Detken C., 2014, Identifying Excessive Credit Growth and Leverage, ECB
Working Paper n.1723.
[3] Alessi L., Antunes A., Babecky J., Baltussen S., Behn M., Bonfim D., Bush O., Detken C.,
Frost J., Guimaraes R., Havranek T., Joy M., Kauko K., Mateju J., Monteiro N., Neudorfer
B., Peltonen T., Rodrigues P., Rusnak M., Schudel W., Sigmund M., Stremmel H., Smidkova
K., van Tilburg R., Vasicek B. and Zigraiova D., Comparing different early warning systems:
Results from a horse race competition among members of the Macro-prudential Research
Network, MPRA Paper 62194, University Library of Munich, Germany.
[4] Alpaydin E., 2010, Introduction to Machine Learning, 2nd Edition, The MIT Press.
[5] Altman E. I., 1968, Financial Ratios, Discriminant Analysis and the Prediction of Corporate
Bankruptcy, The Journal of Finance, 23, 589-609.
[6] Altman E. I., Marco G., and Varetto F., 1994, Corporate Distress Diagnosis: Comparisons
Using Linear Discriminant Analysis and Neural Networks, Journal of Banking and Finance,
18, 505-529.
[7] Atiya A., 2001, Bankruptcy Prediction for Credit Risk using Neural Networks: A Survey and
new Results, Transactions on Neural Networks, 12(4), 929-935.
[8] Betz F., Oprică S., Peltonen T. A., and Sarlin P., 2014, Predicting Distress in European Banks,
Journal of Banking and Finance, 45, 225-241.
[9] Boritz, J. E. and D. B. Kennedy, 1995, Effectiveness of Neural Network Types for Prediction
of Business Failure, Expert Systems with Applications, 9(4), 95-112.
[10] Bülbül D., Schmidt R. H. and Schüwer U., 2013, Savings Banks and Cooperative Banks in
Europe, SAFE White Paper Series n.5.
[11] Cole, R. A., and J. Gunther, 1998, Predicting Bank Failures: A Comparison of on- and off-
site Monitoring Systems, Journal of Financial Services Research 13(2), 103-117.
[13] Davis P. E. and Karim D., 2008, Comparing Early Warning Systems for Banking Crises,
Journal of Financial Stability, 4(2), 89-120.
[14] Davis P. E., Karim D. and Liadze I., 2011, Should multivariate early warning systems for banking
crises pool across regions?, Review of World Economics, 147(4), 693-716.
[15] Drehmann M. and Juselius M., 2013, Evaluating Early Warning Indicators of Banking
Crises: Satisfying Policy Requirements, BIS Working Papers n.421.
[16] Ferriani F., Cornacchia W., Farroni P., Ferrara E., Guarino F. and Pisanti F., 2019, An
Early Warning System for Less Significant Italian Banks, Banca d’Italia Occasional Papers
n.480.
[17] Flannery M. J., 1998, Using Market Information in Prudential Bank Supervision: A Review
of the U.S. Empirical Evidence, Journal of Money, Credit and Banking, 30(3), 273-305.
[18] Freund, Y., Schapire, R., and Abe, N., 1999, A Short Introduction to Boosting, Journal-
Japanese Society For Artificial Intelligence, 14, 771-780, n.1612.
[19] Frydman H., Altman E. I. and Kao D. L., 1985, Introduction Recursive Partitioning for
Financial Classification: The Case of Financial Distress, Journal of Finance, 40(1), 269-291.
[20] Gaytán A. and Johnson C. A., 2002, A Review of the Literature on Early Warning Systems
for Banking Crises, Central Bank of Chile Working Papers n.183.
[21] Gepp A., Kumar K. and Bhattacharya S., 2010, Business Failure Prediction using Decision
Trees, Journal of Forecasting, 29(6), 536-555.
[22] Gepp A. and Kumar K., 2015, Predicting Financial Distress: a Comparison of Survival
Analysis and Decision Tree Techniques, 11th International Conference on Data Mining and
Warehousing.
[23] Gissel J. L., Giacomino D. and Akers M. D., 2008, A Review of Bankruptcy Prediction
Studies: 1930-Present, Journal of Financial Education, 33, 1-42.
[25] Gramlich D., Miller G. L., Oet M. V. and Ong S. J., 2010, Early Warning Systems for
Systemic Banking Risk: Critical Review and Modeling Implications, Banks and Bank Systems,
5(2), 199-211.
[26] Honohan P., 1997, Banking System Failures in Developing and Transition Countries: Di-
agnosis and Predictions, BIS Working Papers n.39.
[27] Jin, J., K. Kanagaretnam, and G. Lobo, 2011, Ability of Accounting and Audit Quality
Variables to Predict Bank Failure During the Financial Crisis, Journal of Banking & Finance
35(11), 2811-2819.
[28] Jin, J., K. Kanagaretnam, G. Lobo, and R. Mathieu, 2013, Impact of FDICIA Internal
Controls on Bank Risk Taking, Journal of Banking & Finance 37(2), 614-624.
[29] Joos P., Vanhoof K., Ooghe H. and Sierens N., 1998, Credit Classification: a Comparison
of LOGIT Models and Decision Trees, Proceeding Notes of the Workshop on Application of
Machine Learning and Data Mining in Finance, 59-72.
[30] Kaminsky G. L., 1999, Currency and Banking Crises: the Early Warning of Distress, IMF
Working Paper, WP/99/178.
[31] Kaminsky G. L. and Reinhart C. M., 1999, The Twin Crises: the Causes of Banking and
Balance-of-Payments Problems, The American Economic Review, v.89 n.3, 473-500.
[32] Kohavi R. and Quinlan R., Decision Tree Discovery, in Handbook of Data Mining and
Knowledge Discovery, 267-276, Klosgen & Zytkow, Oxford University Press, Oxford.
[33] Kuhn M., Weston S., Coulter N., and Culp M., 2015, C5.0 Decision Trees and Rule-Based
Models.
[34] Kumar Ravi P. and Ravi V., 2006, Bankruptcy prediction in banks and firms via statistical
and intelligent techniques - A review, European Journal of Operational Research, 180(1),
1-28.
[35] Lang J. H., Peltonen T. A. and Sarlin P., 2018, A Framework for Early-warning Modeling
with an Application to Banks, ECB Working Paper n.2182.
[37] Mitchell T., 1997, Machine Learning, 1st edition, McGraw-Hill, New York.
[38] Ng G. S., Quek C. and Jiang H., 2008, FCMAC-EWS: A Bank Failure Early Warning
System based on a Novel Localized pattern Learning and Semantically Associative Fuzzy
Neural Network, Expert Systems with Applications, 34(2), 989-1003.
[39] Oet M. V., Bianco T., Gramlich D. and Ong S. J., 2013, SAFE: An Early Warning System for
Systemic Banking Risk, Journal of Banking & Finance, 37(11), 4510-4533.
[40] Peltonen, T. A., Piloiu, A., and Sarlin, P., 2015, Network Linkages to Predict Bank Distress,
ECB Working Paper n.1828.
[41] Quinlan J.R., 1986, Induction of Decision Trees, Machine learning 1, 81-106.
[42] Roengpitya R., Tarashev N. and Tsatsaronis K., 2014, Bank Business Models, BIS Quarterly
Review, December 2014.
[43] Rosa P. S. and Gartner R. I., 2018, Financial Distress in Brazilian Banks: an Early Warning
Model, Revista Contabilidade & Finanças, 29(77), 312-331.
[44] Sinkey, J. F., 1975, A Multivariate Statistical Analysis of the Characteristics of Problem
Banks, The Journal of Finance, 30(1), 21-36.
[46] Stock J. H. and Watson M., 1989, New Indices of Coincident and Leading Economic Indi-
cators, National Bureau of Economic Research Macroeconomics Annual 1989, NBER.
[47] Tibshirani R., 1996, Regression Shrinkage and Selection via the LASSO, Journal of the
Royal Statistical Society Series B, 58(1), 267-288.
[48] Thomson, J. B., 1991, Predicting Bank Failures in the 1980s, Economic Review-Federal
Reserve Bank of Cleveland, 27(1), 9.
[49] Wu, X., Kumar, V., Quinlan, J. R., Ghosh, J., Yang, Q., et alii, 2008, Top 10 Algorithms
in Data Mining, Knowledge and Information Systems, 14(1), 1-37.
Michael Bräuning
European Central Bank, Frankfurt am Main, Germany; email: [email protected]
Despo Malikkidou
European Banking Authority, Paris, France; email: [email protected]
Stefano Scalone
European Central Bank, Frankfurt am Main, Germany; email: [email protected]
Giorgio Scricco
European Central Bank, Frankfurt am Main, Germany; email: [email protected]