Machine Learning in Anti-Money Laundering - Summary Report: October 2018
Machine Learning in Anti-Money Laundering - Summary Report: October 2018
1. Introduction
The prevention of money laundering and terrorist financing is a key topic in the entire financial sector. For years,
its importance in the eyes of financial institutions of all sizes and business models has grown exponentially, along
with the focus of lawmakers, regulators and supervisors. Preventing the financial system from being misused to
launder illicit funds or to fund terrorist attacks is a key feature in the global effort to reduce the devastating effects
of crime and terrorism.
Despite these efforts, the level of undetected illicit funds remains too high. Financial institutions are increasingly
turning to new technologies to address the issue, among them machine learning.
Machine learning techniques hold great promise in addressing some of the challenges financial institutions are
grappling with. They can be used to increase the efficiency of measures in the various elements of the AML
framework, for example to reduce false positive and improve the effectiveness of transaction monitoring.
For this study, the IIF surveyed 59 financial institutions (FIs) on their application of machine learning techniques
in combatting money laundering. These institutions comprised 54 banks and 5 insurance companies, represent-
ing firms from all continents and across a diverse range of business models and firm sizes. This followed an
earlier IIF study on the application of Machine Learning in Credit Risk.2
1The detailed report is available to employees of regulatory agencies and participant firms on request.
2 The IIF’s Summary Report Machine Learning in Credit Risk can be found on the IIF website at https://www.iif.com/publication/regu-
latory-report/machine-learning-credit-risk
The majority of financial institutions surveyed already use (35% of participants) or experiment (34% of partici-
pants) with machine learning techniques. Another 29% indicated that they are planning on applying new analyt-
ical techniques in the foreseeable future. However, the industry is still cautious in its application, as shown in
the use cases in Section 3.
Figure 1: Do you apply new analytical techniques and technologies in your AML-related analysis (58 firms)
2%
29% 35%
34%
2
Common to these use cases is that the human element of AML is preserved. The consistent aim is to free the
resources of a firm’s analysts to focus on higher risk cases, thus where they are most valuable. Financial institu-
tions are not currently looking to automate the decision-making process, but to support their analysts with in-
creasingly powerful technology and automating process steps that keep them from focusing on the relevant risks.
Figure 2: Stages in the AML process where FIs are applying or experimenting with new analytical techniques
0 5 10 15 20
Data Collection
Customer segmentation
Model development and calibration (rules, alerts and
thresholds)
Transaction/client activity monitoring
Yes, applying new techniques Yes, but only experimenting (pilot projects)
Firms also work on customer segmentation and risk modelling, building models that consider a variety of risk
factors and allow more flexibility by overcoming data gaps (for example using graph theory), ultimately resulting
in a more precise risk assessment of customers and across the institution. The lowest number of projects are run
to improve KYC processes, looking at overcoming data gaps and processing unstructured customer data. In this
context, we have identified a strong use of Natural Language Processing to collect insights from adverse media /
negative news sources. These projects may often be at an experimental stage, but it highlights the importance of
building consistent processes in this respect.
3.2. Benefits
While participants are evenly split between actively applying and experimenting with machine learning, how
advanced the industry is varies depending on the technique. Our analysis has shown that the expected benefits
were indeed realized by those who already apply machine learning. With more advancement in the field in the
future, we expect further benefits to arise.
3
Figure 3: Benefits of Machine Learning in AML
The most prominent benefit is increased speed and/or automation of analysis that allows the AML process to
respond to the latest development in money laundering methods. This increase in automation and speed of key
process steps will be beneficial to all other process steps built around them.
We have also found that the expected ability to identify completely new risk segments and patterns that might
point to money laundering or other types of illicit activity has indeed been realized. Institutions clearly leverage
the inherent capabilities of enhanced analytics to strengthen their defense systems. This new knowledge can be
used to build new typologies in existing risk models both for risk assessment and monitoring purposes. We have
also observed significant advancements in customer segmentation capabilities. By analyzing an entire customer
base for similarities using an unsupervised machine learning approach, firms are able to identify outliers and
investigate if further action is warranted, thus gaining an even more precise view of the risk they might be ex-
posed to.
Unsurprisingly, firms have also reported a reduction of false positive rates, i.e. alerts that are generated but do
not present any risk. Some of these reductions were considerable. We have also had reports of firms having gen-
erated alerts that would have remained undetected under the traditional rules-based approach, as well as an
increase of alerts that warrant investigating (even if they turn out not to indicate illicit activity). An increase in
the quality of generated alerts that leverages a higher level of automation in process steps does indeed allow to
focus the human experts on the most relevant cases. This is increased where firms have indicated that they have
been able to process information from various data sources, showing the potential of a holistic analysis.
It should be mentioned that while a reduction in false positives might occur when the monitoring itself is con-
ducted using machine learning techniques, it is unclear if there would be a heavy increase in conversion of alerts
to STRs/SARs if rolled out at scale. By leveraging more data sources and using a broader view, more unusual
behavior would be identified, but not all unusual behavior is systematically an indication of money-laundering.
A higher quality of alerts however could lead to more (precise) reports to the FIUs.
4
4. Challenges and Recommendations
An interesting observation was made in terms of the challenges encountered. Some of the issues known as bar-
riers to a stronger AML framework come back to haunt the industry in the application of machine learning tech-
niques. At the same time, machine learning itself and its application present their very own challenges that need
to be overcome.
4.1. Regulatory
Some uncertainty remains regarding the support of regulators for this technology as part of an adequate risk
mitigation framework. The IIF recommends a stronger statement of support for the application of new technol-
ogies in the prevention of money laundering and financial crime, as well as a cooperative approach between
public and private sector to determine best practices for the methodology in this context. Balancing financial
crime prevention measures and privacy rules can also be tricky. We recommend clarifying how institutions can
best navigate both frameworks.
Regulators should also be mindful to prevent regulatory fragmentation. If financial institutions faced a more
aligned global framework, they could leverage the enhanced capabilities of machine learning techniques more
effectively, and do not need to be tailored to the specific requirements of various jurisdictions. A stronger inter-
national cooperation is needed to address the evolution of technology for payment systems and standards and
how these should be treated from an AML perspective.
5
One participating institution insightfully identified that “KYC is an enabler for machine learning”. High quality
data (including KYC data) is key to an effective application of machine learning. We therefore reiterate the im-
portance of supporting every initiative related to the digitization of identity information in a consistent and port-
able way.
4.2. Explainability
The primary goal of machine learning is to get good out of sample predictions, and the use of regularization to
penalize excessive complexity. Many experts agree that there is a trade-off between the predictive accuracy of a
model, and model interpretability. Put simply, a linear regression is typically easier to interpret but does not
have sufficient predictive power. Whereas in the other extreme, a powerful neural net with millions of parameters
can give better predictions but is jarring to interpret.
The challenges around explainability of risk models and algorithms based on machine learning techniques can-
not be solved through one recommendation. A more sensible approach is to determine a number of contributing
factors, including:
• Acknowledgment by regulators and industry that the need for explainability depends on the use case, e.g.
only where decisions are automated;
• Cooperation within the financial industry and between the public and private sectors on determining the
expectations to an explanation;
• Devising a transitional model in cooperation between the public and private sectors for the implementa-
tion of machine learning based models, for example in transaction monitoring (gradual transfer of trans-
actions to new monitoring approaches, common review between firms and regulator/supervisor after
each phase);
• Assessing (in cooperation between public and private sectors) which alternatives can be explored that
would balance a loss of explainability in case this is necessary (e.g. demonstration of effectiveness of re-
sults and/or methodology in building the model).
The IIF will shortly be publishing a white paper on machine learning explainability that builds on our earlier
credit risk report, and will welcome further engagement between the private and public sectors on this topic.
3
See also the IIF paper Safeguarding Customer Data in the Financial Sector, July 2018, to be found at https://www.iif.com/publica-
tion/regulatory-report/safeguarding-customer-data-financial-sector
6
view of customers and risks. These restrictions are one of the most important barriers to a more effective
AML/CFT framework and it is crucial that these are reevaluated.
Despite the welcome steps taken by the FATF in the recent months,4 further efforts are needed to address chal-
lenges to operative sharing of AML/CFT information – including mitigating such issues as inconsistent legal
frameworks for data protection, management of SAR-type information, privacy, and bank secrecy - across dif-
ferent jurisdictions. Restrictions implemented at the domestic level, which lead to regulatory fragmentation and
weaken an effective AML risk mitigation must be overcome.
In practice, it should be clearly stated in the applicable AML/CFT laws that it is permissible to share and process
the relevant information (i.e. KYC information and monitoring results) among members of the same banking or
insurance group to prevent financial crime, even if there is no suspicion about a customer. The same must be
possible for a central generation of data features based on accessible data sources and processing such shared
information, as well as the flow of alerts and other investigations.
How information can be shared between different financial institutions to prevent financial crime needs to be
harmonized. It needs to be possible for a firm that has knowledge or is investigating concerns about a customer
to warn another where appropriate. This could be inferred by an incoming transaction. We also encourage all
national supervisors where such rules have already been implemented to promote and support these possibilities
more openly. The financial sector would benefit from more clarity that institutions sharing information among
them in this context will be accepted.
The cooperation between the public sector and the financial sector should be revised as well. The “feedback loop”
between financial institutions filing STRs/SARs and their outcome should be strengthened. The mechanisms set
out in the recently revised FATF Recommendation 2 should be implemented on a national level as well, accom-
panying a revised framework for information sharing for the private sector and a more comprehensive frame-
work for information flows between the private and the public sector.5
The IIF also highlights the importance of a consistent and coordinated approach to allowing these data sharing
flows to prevent ambiguities and regulatory fragmentation as much as possible.
While these issues arise, the cooperation between public and private sector in exploring these technologies is
encouraging so far. Regulators and firms seem to cooperate in many cases to gain more comfort with this tech-
nology and assess common benefits. This welcome development should be pursued further, and can be the basis
to address the highlighted challenge on a cooperative basis between public and private sector.
4 The Financial Action Task Force (FATF) Guidance, Private Sector Information Sharing November 2017
5 FATF, Outcomes FATF Plenary, 21-23 February 2018
7
financing to access information. This approach would pay tribute to the fact that both ends of the data transfer
pursue the same goal, i.e. to prevent the misuse of the financial system for criminal activity.
The tax-related information in particular is extremely valuable, as it can give insight into a (legal or natural)
person’s residency, occupation and declared income. It can be compared to the information held at a financial
institution, which can react in cases where there are doubts about its accuracy.
5. Conclusion
The application of machine learning and artificial intelligence is spreading in the financial industry and increas-
ingly includes the RegTech area as well. Particularly for the prevention of financial crime, we are seeing a growing
number of institutions exploring the capabilities of this technology. We fully expect this trend to continue, with
8
more firms reviewing their processes and launching projects in this space. This will add to the already consider-
able number of firms involved in these projects, bringing them to maturity and potentially identifying further
areas in which to apply these techniques.
The emergence and development of enhanced analytical capabilities in general and of machine learning tech-
niques in particular will not fundamentally change the way financial institutions tackle the fight against money
laundering and terrorist financing. The existing pillars of the framework will continue to dictate which elements
must be covered to build a resilient defense system. Rather, machine learning presents an opportunity to signif-
icantly enhance the effectiveness and efficiency of the existing measures. There is strong evidence that it this is
the case, as the reduction in false positive rates and already better transaction monitoring results have shown.
The same is true for every new pattern of suspicious activity or unusual behavior that is identified through these
means and was unknown before. Greater automation also grants more flexibility in how these measures are ap-
plied and to connect these much more thoroughly than before, such as feeding monitoring results into risk rat-
ings automatically. This allows financial institutions to build a stronger safeguarding framework without putting
unnecessary strain on staff resources.
However, some major work needs to be done on all fronts to enable these results. As we have shown, the major
challenges to overcome are not overly different from the ones the industry faces already today.
Financial institutions and regulators must work together to devise a harmonized, consistent and sustainable
framework around the usage of data. While existing safeguards must not be loosened to the detriment of the
protection of the customer or market integrity, financial institutions have demonstrated their expertise at imple-
menting and following these rules. This should make it possible to leverage the existing data in financial groups
by allowing them to share it, apply a holistic view of the customer and identify illicit activity. Illicit activity which
does not stop at borders can only be tackled efficiently if the information necessary to stop it can be aggregated
across these borders. At the same time, a broader exchange of information between the public and the private
sector, as well as building partnerships for AML purposes must be a priority.
These partnerships should also extend to exploring the capabilities of technology, ultimately developing the
framework together in which these techniques can be applied, and the algorithms used can be explained. Some
requirements that have been developed in a time where they were the only way to meet the goals of the overarch-
ing AML and CFT framework should be revisited.
The IIF applauds the significant progress that has already been made, and encourages all actors to continue on
this path, and to resolve the key issues outlined in this report. When this groundwork is completed, a real and
tangible benefit from these techniques that goes beyond the value for a single organization and towards a
stronger and healthier financial system could be the result.
9
10