0% found this document useful (0 votes)
20 views23 pages

Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review

Uploaded by

Dax Westerman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views23 pages

Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review

Uploaded by

Dax Westerman
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 23

International Journal of

Environmental Research
and Public Health

Review
Application of Natural Language Processing (NLP) in
Detecting and Preventing Suicide Ideation: A Systematic Review
Abayomi Arowosegbe 1,2, * and Tope Oyelade 3

1 Institute of Health Informatics, University College London, London NW1 2DA, UK


2 Division of Informatics, Imaging & Data Sciences, University of Manchester, Manchester M13 9PL, UK
3 Division of Medicine, University College London, London NW3 2PF, UK
* Correspondence: [email protected]; Tel.: +44-(0)7943535567

Abstract: (1) Introduction: Around a million people are reported to die by suicide every year, and
due to the stigma associated with the nature of the death, this figure is usually assumed to be an
underestimate. Machine learning and artificial intelligence such as natural language processing
has the potential to become a major technique for the detection, diagnosis, and treatment of people.
(2) Methods: PubMed, EMBASE, MEDLINE, PsycInfo, and Global Health databases were searched
for studies that reported use of NLP for suicide ideation or self-harm. (3) Result: The preliminary
search of 5 databases generated 387 results. Removal of duplicates resulted in 158 potentially suitable
studies. Twenty papers were finally included in this review. (4) Discussion: Studies show that
combining structured and unstructured data in NLP data modelling yielded more accurate results
than utilizing either alone. Additionally, to reduce suicides, people with mental problems must be
continuously and passively monitored. (5) Conclusions: The use of AI&ML opens new avenues for
considerably guiding risk prediction and advancing suicide prevention frameworks. The review’s
analysis of the included research revealed that the use of NLP may result in low-cost and effective
alternatives to existing resource-intensive methods of suicide prevention.

Keywords: natural language processing; NLP; text mining; suicide prevention; suicide-ideation;
mental health
Citation: Arowosegbe, A.; Oyelade,
T. Application of Natural Language
Processing (NLP) in Detecting and
Preventing Suicide Ideation: A 1. Introduction
Systematic Review. Int. J. Environ. Suicide is the world’s 13th leading cause of death, accounting for 5–6 percent of all
Res. Public Health 2023, 20, 1514. fatalities [1]. The likelihood of completing suicide varies by sociodemographic variables
https://doi.org/10.3390/ around the world, with young adults, teenagers, and males bearing the largest risks [2].
ijerph20021514 Every suicide is a tragedy that impacts families, towns, and whole nations, as well as the
Academic Editor: Paul B. Tchounwou individuals who are left behind by the deceased. Suicide occurs at any age and was the
fourth highest cause of death among 15–29 years old worldwide in 2019 [3]. Because of
Received: 8 December 2022 the COVID-19 pandemic, people all over the world have been suffering from the effects
Revised: 4 January 2023
of the financial crisis, mental health issues, and a sense of loneliness and isolation. These
Accepted: 11 January 2023
factors have heightened public awareness of the dangers of suicide. Suicidal behaviour
Published: 13 January 2023
is complicated, and no one explanation fits every case. However, many people commit
suicide on the spur of the moment, and having ready access to a means of suicide, such
as poisons or weapons, may make the difference between life and death [4]. Attempting
Copyright: © 2023 by the authors.
suicide by other ways, such as jumping in front of a speeding train or plunging from
Licensee MDPI, Basel, Switzerland. tall buildings, has also been reported [4]. Thus, removing the means of suicide may not
This article is an open access article significantly reduce the rate of suicide.
distributed under the terms and Suicide is a severe public health issue, but it is avoidable with early, evidence-based,
conditions of the Creative Commons and frequently low-cost measures. A robust multi-sectorial suicide prevention plan is
Attribution (CC BY) license (https:// required for national suicide interventions to be successful [3]. Innovative and cost-effective
creativecommons.org/licenses/by/ ways to collect and understand data for suicide prevention are important tools in the fight
4.0/). against suicide [5]. Approaches such as NLP combined with other machine learning

Int. J. Environ. Res. Public Health 2023, 20, 1514. https://doi.org/10.3390/ijerph20021514 https://www.mdpi.com/journal/ijerph
Int. J. Environ. Res. Public Health 2023, 20, 1514 2 of 23

techniques that utilise existing data from Electronic Medical Records (EMRs) and other
repositories have the capability to improve early identification of people at higher risk of
committing suicide. This is especially true given that these computational approaches can
provide a low-cost alternative to other costly methods [6]. Text mining approaches that
are now in use include information retrieval, text classification, document summarisation,
text clustering, and topic modelling. These approaches, on the other hand, concentrate
on collecting usable information from text documents using a range of techniques such
as keyword extraction, categorisation, topic modelling, and sentiment analysis [7]. These
approaches, in contrast to NLP, are more limited in scope and do not always focus on
comprehending the meaning of texts. NLP, on the other hand, can comprehend the meaning
and context of words, as well as the mood and emotion behind texts, phrases, and sentences.
This enables it to understand complicated texts more effectively and extract more relevant
insights than conventional text mining approaches [8].
Over the past several decades, there has been a significant expansion in the body of
knowledge about suicidal behaviour. For instance, research has revealed that the interaction
of biological, psychological, social, environmental, and cultural elements is an important
component in influencing suicide ideation [9]. At the same time, the field of epidemiology
has been instrumental in determining a wide variety of variables, both protective and
risky, that influence the likelihood of an individual committing suicide, both in the general
population and in specific susceptible groups [5,10]. It has also come to light that the risk
of suicide varies greatly among cultures, with culture playing a role both in elevating the
risk of suicidal behaviour and in providing some protection against it [10].
In terms of legislation, it is now known that 28 countries have national suicide preven-
tion policies, and World Suicide Prevention Day, which is celebrated annually on September
10 and is coordinated by the International Association for Suicide Prevention, is recognised
all over the world. In addition, a great number of research centres devoted to suicide have
been established, and there are academic programmes that concentrate on the prevention
of suicide [4]. Self-help groups for the bereaved have been created in several different loca-
tions, and trained volunteers are assisting with online and telephone counselling services
to provide practical assistance. Non-specialized health professionals are being used to
strengthen the evaluation and management of suicidal behaviours. Decriminalizing suicide
in many countries over the course of the last half-century has made it considerably simpler
for those who struggle with suicidal tendencies to get the assistance they need [4].
For suicide prevention strategies to be successful, there must be an improvement in
surveillance and monitoring of suicide and attempts at suicide. Healthcare providers and
treatment facilities need access to innovative tools that will help persons who are at risk of
committing suicide get mental health care and continue to be safe until they do [3]. Accord-
ing to the National Institute of Health NIH, there are two primary methods for identifying
who is at risk of committing suicide: first, “Universal Screening”, which, according to some
estimates, has the potential to identify more than three million adults who are at risk of
committing suicide annually. The second primary method for identifying who is at risk of
committing suicide is by “Predicting Suicide Risk using Electronic Health Records”. The
use of electronic medical records, including the unstructured text of patients’ medical notes
such as discharge summaries, is recognised as a vital resource for the provision of medical
treatment as well as for medical research [11].
The extraction of information and the discovery of new knowledge using NLP and
other machine learning methods have been successfully applied to electronic medical notes
and other text data in a variety of mental health areas such as depression [12] and post-
traumatic stress disorder (PTSD) [13]. An NLP model that recognises indicators of sadness
in free text, such as posts in internet forums like twitter and reddit, chat rooms, and other
such sites, has been developed. Machine learning and artificial intelligence approaches
were used to create this model. NLP was also used to extract emotional content from textual
material to identify patients with PTSD using sentiment analysis from semi-structured
Int. J. Environ. Res. Public Health 2023, 20, 1514 3 of 23

interviews; a machine learning (ML) model was trained on text data from the Audio/Visual
Emotion Challenge and Workshop (AVEC-19) corpus [14].
Suicides can be prevented, and there have been several measures and screening
methods that have been used in the past [4,11]. These include limiting access to the means
of suicide (such as pesticides, weapons, and certain medicines), training and education of
healthcare professionals in recognising suicidal behaviour, responsible media reporting,
raising awareness, and the use of mobile apps and online counselling tools, amongst
other potential solutions. However, the screening tools that are now available may not be
sensitive enough to enable person-centred risk detection consistently [15]. Consequently,
there is an urgent need for novel approaches that focus on the individual when identifying
people who may be at risk for suicide. To improve upon how things are done and to have
an impact on policy, the purpose of this project is to search for, analyse, and report on ways
suicide may be prevented using NLP.

1.1. Rationale
As this is a pressing challenge in the United Kingdom and around the world, it is
necessary that more research and studies be carried out to slow down the growing number
of individuals who take their own lives.
It is difficult to detect suicide ideation because people who are suicidal tend to isolate
themselves and are unwilling to communicate about their thoughts [16]. As a result, detect-
ing suicide ideation may be extremely challenging. Those who are at risk of committing
suicide need to be monitored constantly to identify when they are having suicidal thoughts
so that appropriate action may be taken. This may allow healthcare professional and
relevant experts to save lives through timely interventions.
According to the National Institutes of Health NIH, utilising electronic medical records
is one of the ways that suicide might be averted [11]. However, there hasn’t been enough of
work done in this area, especially using text analytics tools like NLP. The development of a
risk stratification tool via the use of electronic medical records, including both structured
and unstructured data, is one method that may be used to reduce the incidence of suicide.
To contribute to the growing research landscape that could aid in the development of a
suicide prevention tool, the purpose of this study is to investigate and consolidate essential
work that has been done on the use of NLP for detecting suicidal thoughts.

1.2. Research Question


Can natural language processing (NLP) and other text analytics methods be used in
identifying people with suicide ideation?

1.3. Aims & Objectives


Machine learning and artificial intelligence-based modelling, such as NLP and other
text analytics approaches, have the potential to become major techniques for the detection,
diagnosis, and treatment of people who are suffering from mental health issues [17].
This was demonstrated by the results of research in mental health such as depression,
post-traumatic stress disorder (PTSD), and homelessness [18–20]. The primary aims of
this research is to assess how NLP has been utilised in the field of suicide prevention
and its effectiveness as well as limitations. The goal is to provide recommendations for
improvement and suggest areas needing further research. The objectives of this research
are listed below.
• Conduct a comprehensive database search for research on the use of NLP for suici-
dal ideation.
• Collect essential information on the detection and treatment effectiveness of the NLP
approach, as well as its limitations.
• Synthesize, analyse, and report findings from included studies.
• Make future suggestions and identify prospective research areas.
• Formulate recommendations for future efforts based on the findings of the included studies.
Int. J. Environ. Res. Public Health 2023, 20, 1514 4 of 23

2. Methodology
This was a qualitative study with the goal of completing a review of studies that
had been conducted using NLP and other text analytics approaches in the identification
or detection of suicidal ideation. This systematic review was carried out in accordance
with the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-analyses)
standards to increase both the level of transparency and the quality of the reporting
on publications [21].

2.1. Inclusion & Exclusion


All peer-reviewed journal publications published during the last 10 years were in-
cluded. Also included are articles written or translated for publication in the English
language. In addition, studies addressing the use and application of NLP methods or other
text mining approaches for suicide, suicide ideation, and self-harm in any environment,
including mental health, acute, and community services, were included.
We excluded research that did not include NLP or other text mining techniques. In
addition, reviews and other secondary sources were excluded. All poster presentations,
non-full-text submissions, and full-text submissions in languages other than English were
also excluded.

2.2. Search Strategy


Using the following syntax, the Population, Phenomena of Interest, and Context (PICo)
framework was used for the search approach.
((NLP OR “natural language processing” OR “text mining” OR “text analytics” OR
“data mining” OR “information retrieval”)
AND
(“mental health” OR disorder OR depression OR suicide OR psychotic OR psychiatry
OR “self-harm” OR suicidal))

2.3. Databases
A search of the relevant literature was conducted for this investigation utilising five
scientific and medical databases: PubMed, MEDLINE, Embase, PsycINFO, and Global
Health through the OVID platform. Several papers discovered in the reference lists of the
included studies were also included.

2.4. Reference Management


Mendeley was used for reference management, paper collection, and organisation.
Mendeley was chosen for this study because it allows researchers to import and store
papers from a variety of databases and in a variety of formats. It can also be used to
remove duplicates, especially in the case of papers that appear in multiple databases, and
export papers to other applications, such as systematic review management systems such
as Covidence and Rayyan.
Covidence was utilised to manage the systematic review. It’s a web-based systematic
review management software that makes it easier to create systematic reviews and other
types of research reviews by screening citations and completing texts, assessing bias risk,
and extracting study features and findings for analysis. The Covidence system was chosen
because it speeds up the initial screening of abstracts and full texts, enabling the author
and the second reviewer to collaborate on the project and resolve any disagreements about
whether papers should be included or excluded.
Using Covidence, two independent researchers (AA and TO) reviewed the publi-
cations at the abstract and full-text stages in line with the inclusion criteria. Conflicts
were resolved at each stage of the review until consensus was obtained. Cohen’s Kappa
Coefficient, which measures the degree to which the data gathered reflects the variables
tested, was used to examine interrater reliability.
Int. J. Environ. Res. Public Health 2023, 20, 1514 5 of 23

2.5. Quality Assessment


The quality of the included publications in this research was evaluated using the
Mixed Method Appraisal Tool MMAT. The mixed-method evaluation instrument is used
to evaluate quantitative, qualitative, and mixed-method studies that are included in sys-
tematic reviews of mixed-studies [22]. With a focus on mixed-methods research, the tool
specifies a series of criteria and screening questions to obtain an overall quality score.
The MMAT evaluation tool was used since it is an appropriate evaluation instrument
for systematic reviews that include various study designs such as qualitative research,
randomised controlled trials, nonrandomized studies, quantitative descriptive studies, and
mixed methodologies research etc.

2.6. Databases
To obtain relevant data from the included studies, a template for data extraction was
designed. After being exported from Covidence software, the data were cleaned and
transformed in Microsoft Word and Excel before analysis. After the data had been exported
into the data processing software, it was examined and investigated to determine whether
the appropriate data had been collected. A matrix was then created to store the data, initial
codes were derived from the data, the codes were examined, revised, and combined into
themes, and, finally, the themes were refined and presented in a cohesive manner.

2.7. Analysis
Thematic analysis using the reflective approach was used to construct narratives and
discussions from the included papers, using codes and themes generated from the collected
data. These narratives and discussions were based on the findings of the included studies.
For the purposes of this research, specialised software was not required to carry out the
thematic analysis. Instead, tables were created in Microsoft Word, which serve as the
repository for the core themes as well as the secondary themes.
Reflective thematic analysis RFA was adopted for this research because of its widespread
use and reputation as one of the more accessible methods for those with little or no prior
experience in qualitative analysis [17]. In addition, reflective thematic analysis allows easy
identification and analysis of patterns or themes in a given data set and also provides a
simple and theoretically flexible interpretation of qualitative data [18,19].

2.8. Ethics
There are no ethical issues about the safety of the participants or the data collected in
this research. Full-text literature was obtained from several medical, health informatics, and
psychological sources available via the university library and other third-party databases.
Therefore, the data and information gathered by this study are already accessible in the
public and academic domains. The lead investigators of the included studies are expected
to have obtained consent from all persons, organisations, and subjects participating in their
investigations. As a result, no ethical approval is required for this systematic review.

3. Results
3.1. Study Selection
The preliminary search, which consisted of searching 5 separate databases with the
help of the OVID platform, produced a total of 387 results. After the processing of the
information in Mendeley reference management software, a total of 158 records were
produced after the deduplication and initial screening process.
The 158 data that had been pre-processed in Mendeley were then imported into Covi-
dence, which is a management system for systematic reviews, and here is where the screen-
ing processing was completed. Following the review of the full text, twenty (20) studies
were assessed and chosen for inclusion. The search procedure is shown in Figure 1 using
the PRISMA flow diagram shown below.
duced after the deduplication and initial screening process.
The 158 data that had been pre-processed in Mendeley were then imported into C
idence, which is a management system for systematic reviews, and here is where
screening processing was completed. Following the review of the full text, twenty
Int. J. Environ. Res. Public Health 2023,studies
20, 1514 were assessed and chosen for inclusion. The search procedure is shown
6 of 23 in Fig
1 using the PRISMA flow diagram shown below.

Figure 1.1.PRISMA
Figure PRISMADiagram.
Diagram.
3.2. Study Characterisitics
3.2. Study Characterisitics
The characteristics of the included (n = 20) studies are outlined in Table 1. Most
studies (n = 12, 60%) were
The characteristics of conducted in the(n
the included United
= 20) States,
studiesalthough four studies
are outlined were
in Table 1. Most s
conducted in the United Kingdom, two in Asia, one in Spain, and one
ies (n = 12, 60%) were conducted in the United States, although four studies in Brazil. A total of were c
50% of the included studies (n = 10) were done in a clinical context. A total of 15% of the
ducted in the United Kingdom, two in Asia, one in Spain, and one in Brazil. A total of
studies were conducted online or utilising mobile apps. Two studies were conducted in
of
anthe included
emergency studies (nsetting.
department = 10) were
Studiesdone in a clinical
including context.ofAEHR
the modelling totaldata
of 15% of the stu
(n = 8),
were conducted online or utilising mobile apps. Two studies
qualitative interviews (n = 2), and 10 experimental studies (n = 10). were conducted in an em
gency department setting. Studies including the modelling of EHR data (n = 8), qualita
interviews (n = 2), and 10 experimental studies (n = 10).
Int. J. Environ. Res. Public Health 2023, 20, 1514 7 of 23

Table 1. Study Characteristics.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
A total of 43% (n = 609) didn’t
report suicide ideation.
Participants who didn’t report
Applied NLP and ML
suicide ideation slept 7.32 h
(logistic regression)
compared with 6.86 of those
approach to text
who reported. Sleep quality is
messages. The
also higher for non-suicide.
intervention was
NLP using open-ended
delivered by text
Novel Use of The study aims to use questions had a reasonably high
messages, sent to
Natural Language NLP and machine The model was predictive value for suicidal
participants. The text
Processing (NLP) learning to predict evaluated using ideation. Data obtained from
Adults 18+ message included a link
to Predict Suicidal suicidal ideation and positive predictive free-text responses to general
discharged after to a questionnaire and a
Ideation and Benjamin L. heightened psychiatric value (PPV), questions about patients’ mental
Other: Suicide; self-harm from ED Participant mobile application to
Psychiatric Cook et al., symptoms among Adult sensitivity, and states could be used to predict
Spain Other or short home receive responses from
Symptoms in a 2016 [5] adults recently specificity on the suicidal ideation using NLP
hospitalisation < 7 the participants the
Text-Based Mental discharged from positive cases in the effectively.
days (n = 1453) mobile app used by
Health psychiatric inpatient or remaining 50% of It is possible to use NLP based
participants to report
Intervention in emergency room the sample. machine learning prediction
things such as sleep,
Madrid settings in Madrid. methods to predict suicide risk
appetite, anger etc.
as well as heightened psychiatric
STATA 14 for logistic
symptoms in free-text responses
regression prediction.
sent via mobile phone.
NLP algorithm used 50%
The use of novel NLP methods
(half) of the sample for
may create low-cost and
training.
effective alternatives to
traditional resource-heavy data
monitoring systems.
NLP demonstrating consistently
excellent PPV (>95% for both
outcomes).
An ideal solution for
A mixed-method of
ascertaining suicidal ideation
To demonstrate that evaluation: Manual
Improving and suicide attempt was
NLP methods can be Google’s word2vec review and
ascertainment of provided by psychiatric forms
developed to identify 3.4 million trained on 10 million compared to
suicidal ideation Cosmin A. when available in HER
United suicide phenotypes in patients, clinical notes from EH diagnostic codes
and suicide Bejan et al., Suicide Clinical This NLP system can be applied
States EHRs to enhance 200 million extraction of seed ICD10/11, PPV,
attempt with 2022 [23] to any unstructured clinical text
prevention efforts, clinical notes keywords ‘suicide’ and Recall, F1 score, the
natural language common in EHRs and is feasible
predictive models, and ‘suicidal’. area under the
processing to apply at scale (~200 M notes
precision medicine. receiver operator
here). This information retrieval
curve (AUROC)
approach would be portable to
other health systems and has
been used for the investigation
of social determinants of health.
Int. J. Environ. Res. Public Health 2023, 20, 1514 8 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
NLP analysis using
Invenio software.
Unstructured clinical
Identification of To develop and
notes were downloaded
suicidal behaviour evaluate a machine
from the year preceding
among learning algorithm
the index inpatient Sensitivity,
psychiatrically using natural language A moderate sensitivity and
admission. Natural specificity, positive
hospitalized Nicholas processing of electronic negative predictive value, a
United Adolescents language processing predictive value
adolescents using J.Carson et al., health records to Suicide 241 respondents Inpatient modest AUC, and accuracy
States 12–20 years identified phrases from (PPV), negative
natural language 2019 [9] identify suicidal below the most frequent
the notes associated with predictive value
processing and behavior among class baseline.
the suicide attempt (NPV), and accuracy
machine learning psychiatrically
outcome
of electronic hospitalized
random-forest
health records adolescents.
machine-learning
algorithm to develop a
classification model.
Use of a Natural
Data were drawn from
Language
the OM1 Real World Data Extraction of SI is feasible.
Processing-based
Cloud(OM1, Inc., Boston, Future efforts should assess the
Approach to
This study aimed to MA, USA), derived from reproducibility of this approach
Extract Suicide
Palmon N et al., United determine the 3.7 million deterministically linked, in other data sources and
Ideation and Suicide Clinical
2021 [24] States feasibility of extracting patient notes de-identified, examine the feasibility of
Behaviour from
SI from clinical notes. individual-level health classifying SI as passive or active
Clinical Notes to
care claims, EHR and using data contained within the
Support
other data from 2013 to clinical notes.
Depression
the present day.
Research
NLP, STRATA software
Analysed the data as an
event notes in the EHRs,
Strong and near perfect
irre-spective whether
Using natural agreement and when compared
they were created during
language with manual annotations
Patients an inpatient stay, during
processing to demonstrating the validity of
To determine risk diagnosed follow-up or a telephone
extract self-harm Manual annotations the tool.
factors for those with an appointment.
and suicidality Charlotte and calculating This study also highlights the
diagnosed with eating Suicide; eating The analysed cohort was
data from a Cliffe et al., UK 7188 patients Clinical precision (PPV) and potential use of EHR databases
disorders who report Other disorder in extracted via the Clinical
clinical sample of 2021 [16] recall (sensitivity) to further suicidality and SH
self-harm and South Record Interactive Search
patients with statistics research by using NLP
suicidality. London and (CRIS) system and
eating disorders: a techniques. These tools could
Maudsley comprised of individuals
retrospective potentially have use with further
who received an ICD-10
cohort study development in risk prediction
diagnosis of an ED (F50.0
within ED services.
and F50.9) within the
12-year observation
period.
Int. J. Environ. Res. Public Health 2023, 20, 1514 9 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
To evaluate the
performance of an
NLP/ML suicide risk
prediction model on The language-based suicide risk
newly collected model performed with good
language from the 37 suicidal and discrimination when identifying
Integration and South-eastern United 33 non-suicidal patients the language of suicidal patients
Validation of a States using models from two EDs were from a different part of the USA
AUC and Brier
Natural Language previously tested on interviewed to test a and later period than when the
scores
Processing language collected in previously developed model was originally developed
AUC of 0.81
Machine Learning the Midwestern US suicide risk prediction and trained
(95% CI = 0.71–0.91)
Suicide Risk Joshua To determine if the NLP/MLmodel. Model the study shows that integrating
United ED patients and a Brier score of
Prediction Model Cohen et al., interview process to Suicide 70 patients Clinical—ED performance was technology and procedures to
States 18–65 years 0.23 when predicting
Based on 2022 [25] collect language for an evaluated with the area collect language for a suicide
suicidal risk on the
Open-Ended NLP/ML model could under the receiver risk prediction model into the
70 patient interviews
Interview be integrated into two operating characteristic ED workflow is feasible.
collected in
Language in the EDs in the curve (AUC) and Brier A brief interview can be
this study.
Emergency South-eastern United scores. successfully implemented into
Department States, and (2) evaluate Interview for data two EDs and NLP/ML models
model performance on collection. can predict suicide risk from the
language from persons patient language with good
in a different discrimination.
geographic region than
where the original
model was developed
These machine learning
algorithms are of sufficiently
high accuracy to be fruitfully
used in an envisioned screening
The creation of an
system, but the remaining parts
automated model for
of the system are not yet ready
analysis and estimation
for implementation
of suicide risk from
Although the design of an
social media data.
intervention system powered by
An examination of how
algorithmic screening is
Natural Language this could be used to public self-stated data
10-fold technically possible, the cultural
Processing of Glen Copper- improve existing and using data donated
United cross-validation implications of implementation
social media as smith et al., screening for suicide Suicide 418 users Online through
States receiver operating are far from settled
Screening for 2018 [26] risk within the health OurDataHelps.org
characteristic (ROC) Currently, this technology is
Suicide Risk care system. Deep learning
only used for intervention for
An exploration of the
individuals who have opted in
ethical and privacy
for the analysis and intervention,
concerns of creating a
but the technology enables
system for suicide risk
scalable screening for suicide
screening not currently
risk, potentially identifying
in care.
many people who are at risk
preventively and prior to any
engagement with a health
care system.
Int. J. Environ. Res. Public Health 2023, 20, 1514 10 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
NLP/Deep learning The proposed tool demonstrated
An android virtual an ability to identify suicidal
keyboard can passively ideation from user texts, which
collect user texts and enabled it to be experimented
send them to a web with in studies with
Boamente: A To develop the service. We then professionals and their patients.
Natural Language Boamente tool, a developed a web The performance evaluation
Processing-Based solution that collects platform composed of a results of the model selected to
Evandro J S
Digital Other: textual data from service to receive texts 5-fold be deployed in the system
Diniz et al., Online
Phenotyping Tool Brazil users’ smartphones from keyboard cross-validation (BERTimbau Large) were
2022 [15]
for Smart and identifies the applications, a demonstrated to be promising.
Monitoring of existence of suicidal component with the DL Therefore, the Boamente tool can
Suicidal Ideation ideation. model deployed, and an be effective for identifying
application for data suicidal ideations from
visualization non-clinical texts, which enables
Twitter data, deep it to be experimented with in
learning and evaluation studies with professionals and
80 training and 20 testing their patients.
NLP approaches.
A rule-based approach to
classifying the presence
The good performance of the
of suicide ideation and a
two classifiers in the evaluation
hybrid machine learning
study suggests they can be used
Identifying and rule-based approach
to accurately detect mentions of
Suicide Ideation to identify suicide Manually annotated
suicide ideation and attempt
and Suicidal attempts in a psychiatric gold standard set
To develop NLP within free-text documents in
Attempts in a Andrea C Fer- clinical database. producing precision
approaches to identify this psychiatric database.
Psychiatric nandes et al., UK Suicide Clinical The Clinical Record and recall statistics
and classify suicide Two distinct NLP approaches
Clinical Research 2018 [27] Interactive Search (CRIS) sensitivity of 87.8%
ideation and attempts. are described to identify and
Database using system provides and a precision
classify suicide ideation and
Natural Language de-identified information of 91.7%
attempts, both of which
Processing sourced from South
performed well as indicated by
London and Maudsley
high precision and recall
(SLaM) NHS Trust
statistics.
Events and
Correspondence
document in CRIS EHR
A Controlled Trial To design a prospective NLP
Using Natural clinical trial to test the semi-supervised machine The results show that the
language hypothesis that learning methods, the machines accurately
processing to machine learning conversations of 30 distinguished between suicidal
John P
Examine the United methods can suicidal adolescents and and non-suicidal teenagers.
Pestian et al., Suicide Children 60 Clinical—ED
Language of States discriminate between 30 matched controls were The findings here support NLP
2016 [28]
suicidal the conversation of recorded and analysed. as a strong adjunct to existing
Adolescents in the suicidal and Questionnaire and methods of determining a
emergency non-suicidal interview for data potentially suicidal individual.
department individuals. gathering.
Int. J. Environ. Res. Public Health 2023, 20, 1514 11 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
To create an NLP tool
that can, with
Developing a acceptable precision
It is feasible to develop an NLP
Natural Language and recall, identify The evaluation was
tool that identifies, with
Processing tool to Karyn mentions of acts of done against a
Perinatal— NLP acceptable validity, mentions of
identify perinatal Ayre et al., UK perinatal self-harm Self-harm Clinical manually coded
18 years+ CRIS EHR peri-natal self-harm within
self-harm in 2021 [29] within EHRs. (2) To reference standard.
EHRs, although with limitations
electronic use this tool to identify Precision and recall.
regarding temporality.
healthcare records service-users who have
self-harmed perinatally,
based on their EHR.
NLP derived variables offered
small but significant predictive
improvement (AUC = 0.58) for
patients with longer treatment
To evaluate whether The area under the
duration. The small sample size
Natural language natural language EHR stored in Data curve (AUC) and
Veterans limited predictive accuracy.
processing of processing (NLP) of Warehouse, VA users confidence interval
Health Ad- Findings suggest leveraging NLP
clinical mental Maxwell psychotherapy note newly diagnosed with (95%) statistics were
United ministration derived variables from
health notes may Levis et al., text provides Suicide 246 cases Clinical PTSD calculated to
States VHA users psychotherapy notes offers an
add predictive 2020 [19] additional accuracy least absolute shrinkage determine the
diagnosed additional predictive value over
value to existing over and above and selection operator models’ predictive
with PTSD and above the VHA’s
suicide risk models currently used suicide (LASSO) accuracy using the
state-of-the-art structured
prediction models. c-statistic.
EMR-based suicide prediction
model. Replication with a larger
non-PTSD specific sample
is required.
Extracted diagnostic data
Use of natural Showed that mining
from both structured
language To develop algorithms unstructured clinical notes using
codified data and
processing in to identify pregnant NLP substantially improves the
unstructured clinical
electronic medical women with suicidal gold-standard detection of suicidal behaviour.
notes processed by NLP.
records to identify behaviour using validation 9331 women screened positive
Qui-Yue Clinical— We assessed the
pregnant women United information extracted AUC 0.83, PPV, NPV, for suicidal behaviour by either
Zhong et al., Suicide Pregnant 275,843 Clinical diagnostic validity of the
with suicidal States from clinical notes by and sensitivity for codified data (N = 196) or NLP
2019 [30] women algorithm against
behaviour: natural language performance (N = 9145).
gold-standard labels
towards a solution processing (NLP) in validation The addition of NLP resulted in
obtained from manual
to the complex electronic medical an 11-fold increase in the
chart reviews by
classification records. number of pregnant women
psychiatrists and a
problem with suicidal behaviour.
trained researcher.
Int. J. Environ. Res. Public Health 2023, 20, 1514 12 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
Analysis reveals that
identification of suicide among
undetermined death cases with
Natural language Black decedents can be greatly
processing and statistical improved when modelled using
Using natural
To improve the text analysis on race-specific death narratives;
language
accuracy of restricted-access case the rate is comparable with the
processing to
Nusrat classification of deaths narratives of suicides, prediction of suicide for White
improve suicide United 10 years and ROC curves and area
Rahman et al., of undetermined intent Suicide Other homicides, and undetermined death cases
classification States older under curve AUC
2022 [10] and to examine racial undetermined deaths in there is strong evidence that NLP
requires
differences in 37 states collected from and automated coding methods
consideration
misclassification. the National Violent could improve the detection of
of race
Death Reporting System indications for suicide and might,
(NVDRS). in particular, help detection in
settings where the death manner
is prone to biases due to the
decedent’s race.
To determine the
Improving
extent to which
Prediction of sociodemographic data,
incorporating natural
Suicide and billing codes, and
language processing of Automated tools to aid
Accidental Death Thomas H narrative hospital
United narrative discharge 845,417 clinicians in evaluating these
After Discharge McCoy Jr. et al., Suicide Other discharge notes for all AUC 0.73
States notes improves discharges risks may assist in identifying
from General 2016 [31] patients from the
stratification of risk for high-risk individuals
Hospitals with hospital’s EHRs.
death by suicide after
Natural Language NLP/statistical analysis
medical or surgical
Processing
hospital discharge.
Aim to predict
Using both structured and
first-time suicide
unstructured EHR data
attempts using a large
Natural language demonstrated accurate and
data-driven approach
processing and robust first-time suicide attempt
that applies natural
machine learning Used both unstructured prediction and has the potential
Fuchiang R language processing
of electronic United and structured data to be deployed across various
Tsui et al., (NLP) and machine Suicide 10–75 years 45,238 Clinical ROC and AUC
health records for States cTAKES NLP tool to populations and clinical settings.
2021 [32] learning (ML) to
prediction of process narrative notes. Using recently developed NLP
unstructured
first-time suicide analyses of unstructured textual
(narrative) clinical
attempts data in EHRs provided a
notes and structured
significant boost to the overall
electronic health record
accuracy of these ML models.
(EHR) data.
Int. J. Environ. Res. Public Health 2023, 20, 1514 13 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
Develop a
Simple NLP approaches can be
Identifying comprehensive manually
To evaluate a simple successfully used to identify
Suicidal annotated EHR reference
lexicon and rule-based patients who exhibit suicidal risk
Adolescents from Sumithra standard and assessed
NLP approach to behaviour, and the proposed
Mental Health Velupillai et al., UK Suicide Adolescents 200 Clinical NLP performance at both PPV, recall, f1-score
identify suicidal approach could be useful for
Records Using 2019 [33] document and
adolescents from a other populations and settings.
Natural Language patient-level on data
large EHR databases. The approach shows
Processing from 200 patients
promising results.
CRIS EHR.
The use of NLP substantially
To examine the
Screening improves the sensitivity of
comparative
pregnant women screening suicidal behaviour in
performance of
for suicidal EMRs. However, the prevalence
structured, diagnostic
behaviour in of confirmed suicidal behaviour
codes vs. natural
electronic medical Qui-Yue was lower among women who
United language processing Women
records: Zhong, Suicide 5880 Clinical NLP did not have diagnostic codes
States (NLP) of unstructured 10–64 years
diagnostic codes 2018 [34] for suicidal behaviour but
text for screening
vs. clinical notes screened positive by NLP.
suicidal behavior
processed by NLP should be used together
among pregnant
natural language with diagnostic codes for future
women in electronic
processing EMR-based phenotyping studies
medical records (EMRs).
for suicidal behaviour.
The proposed model
outperformed standard NLP
models in various experiments,
De-identified dataset demonstrating good
Detecting suicide To develop a domain from an emotional translational value and clinical
risk using knowledge-aware risk support system relevance.
knowledge-aware Zhongzhi assessment (KARA) established in Hong Precision, recall and The present study further
22,000
natural language Xu et al., Asia model to improve our Suicide Online Kong, comprising 5682 c-statistic confirmed that it is both possible
conversations
processing and 2021 [35] ability of suicide Cantonese conversations (ROC-AUC) and helpful to deploy an
counselling detection in online between help-seekers accurate, passive, and automatic
service data counselling systems. and counsellors suicide risk detection model for
NLP approach. alerting counsellors to the
presence of potential risk in a
user’s content during the
engagement process.
Int. J. Environ. Res. Public Health 2023, 20, 1514 14 of 23

Table 1. Cont.

Study Study Model


Study Title Authors, Year Country Aim of Study Sample Size Setting Methods Used Main Findings & Results
Category Population Evaluation Method
Electronic Medical
Records of inpatients
with mental disorders
were collected. The text
mining method was
Comparisons of
adopted to screen Findings provided a practical
different To compare the
suicidal behaviours. The way to automatically classify
classification performance of
performances of different patients with or without suicidal
algorithms while methods based on text
combinations of six Precision, recall, behaviours before admission to
using text mining H Zhu et al., mining screen suicidal
Asia Suicide 3600 Inpatient algorithms and two-term F1-value and the hospital, which potentially
to screen 2020 [36] behaviours according
weighting factors were accuracy led to considerable savings in
psychiatric to the chief complaint
compared under various time and human resources for
inpatients with of the psychiatric
training set sizes, which the identification of high-risk
suicidal inpatients
were assessed by patients and suicide prevention.
behaviours
precision, recall, F1-value
and accuracy
SVM, KNN, CART,
Logistic Regression, RF,
Adaboost
Int. J. J.
Int. Environ. Res.
Environ. Res.Public
PublicHealth
Health 2023,20,
2023, 20,1514
x20,
FOR PEER REVIEW 15
14ofof
23
Int. J. Environ. Res. Public Health 2023, x FOR PEER REVIEW 1422
of 22

TheFigure
The Figure
The 2 2below
Figure below depicts
2 below theincluded
depicts
depicts the included
the studies
included bysetting,
studies
studies by setting, with
by setting,
with research
with done
research
research done inina ain a
done
clinical
clinical context
clinical being
context
context the
beingbeing largest proportion
the largest
the largest (n
proportion
proportion = 10).
(n = 10).
(n = 10).

Figure 2. Studies
Figure by settings.
2. Studies by settings.
Figure 2. Studies by settings.

TheFigure
The Figure
The 3 3below
Figure below depicts
depicts
3 below thebreakdown
the
depicts breakdown
the ofofstudies
breakdown studiesbybycountry
of studies countryininwhich
by country which theythey
they
in which were
were were
conducted,
conducted, with
with thethe United
United States
States having
having the
the most
most (n (n
= = 12).
12).
conducted, with the United States having the most (n = 12).

Figure
Figure3.3.Studies
Studies bybycountry.
country.
Figure 3. Studies by country.
3.3. Screening in Emergency Departments
3.3.3.3.
Screening in Emergency Departments
The Screening
unexpected in nature
Emergency Departments
of suicide makes it a leading cause of death, which complicates
TheThe unexpected
unexpected nature
natureof suicide makes
of suicide makes it a leading
a leading cause of death,
cause of death,which compli-
efforts being made all over the world to prevent it [37]. In recent years, the which
abilitycompli-
to
cates efforts being made all over the world to prevent it [37]. In recent years, the the
ability to
analyse large datasets using machine learning and artificial intelligence (ML/AI) has been to
cates efforts being made all over the world to prevent it [37]. In recent years, ability
analyse
analyse
possible, large
which datasets
large using
datasets
results machine
using machine
in improved learning and artificial
learningPatients
risk detection. and artificialintelligence
intelligence
who attempt (ML/AI)
(ML/AI)
suicide has been
has been
may seek
possible,
help possible,which
from nearest results
which in improved
results in improved
emergency risk
department, detection.
riskand Patients
detection. who
Patientsofwho
their chances attempt
attempt
survival suicide may
aresuicide seekseek
may
dependent
help from nearest
help fromassessment
on successful emergency
nearest emergency department,
department,
and treatment. and
Indeed, their
andmost chances
theircompleted of survival
chances ofsuicides are
survivalare dependent
are results
dependentofon on
successful
successful
repeated assessment
attemptsassessment and
made by and treatment. Indeed,
treatment.
undetected andIndeed, most
untreated mostcompleted
completed
individuals suicides
suicides
[38,39]. are results
are results
Estimating of
there-
of re-
peated
peated
likelihood attempts
ofattemptsmade by
made by
multiple suicide undetected
undetected
attempts and untreated
andleft
is largely untreated individuals
to clinicalindividuals [38,39].
judgement[38,39]. Estimating the
Estimating the
in the Emergency
likelihood
Department,
likelihood ofwhere
multiple
of multiplesuicide
suicidal suicideattempts
patients often
attemptsis largely
appear left
is largely[38].toThus,
left clinical judgement
early
to clinical in the
recognition
judgement Emer-
inofthe
self-
Emer-
gency
harm Department,
presentations towhere suicidal
emergency patients
departments often
(ED) appear
may [38].
result Thus,
in
gency Department, where suicidal patients often appear [38]. Thus, early recognition more early recognition
prompt suicide of of
self-harm
ideation presentations to emergency departments (ED) may result
care. presentations to emergency departments (ED) may result in more prompt sui-
self-harm in more prompt sui-
cidecide
ideation care.
ideation care.
Int. J. Environ. Res. Public Health 2023, 20, 1514 16 of 23

The research investigated whether NLP/ML used on recorded interviews for sui-
cide risk prediction model can be implemented in two emergency departments in the
South-eastern United States. In the research, interviews were conducted with 37 suicidal
and 33 non-suicidal patients from two emergency departments to evaluate the NLP/ML
suicide risk prediction model [28,40]. The area under the receiver operating characteristic
curve (AUC) and Brier scores were used to assess the model’s performance. The research
demonstrates that it is viable to integrate technology and methods to gather linguistic data
for a suicide risk prediction model into the emergency department workflow. In addition,
a fast interview with patients may be used efficiently in the emergency department, and
NLP/ML models can reliably predict the patient’s suicide risk based on their comments.
Similar to [25], [28] performed a prospective clinical trial to examine whether machine
learning techniques may distinguish between suicidal and non-suicidal people by based on
their conversations. NLP and semi supervised machine learning techniques were used to
record and evaluate the discussions of 30 suicidal teenagers and 30 matched controls using
questionnaires and interviews as the data collection tools. The findings demonstrates that
the NLP model successfully differentiated between suicidal and non-suicidal teenagers.

3.4. Avoiding Perinatal Suicide


Neonatal fatalities decreased worldwide by 51%, from 5 million in 1990 to 2.5 million
in 2017 [41]. However, this drop has not been seen in low-income and middle-income
nations, which have the largest burden of neonatal deaths [42]. It has been shown that
prompt delivery of high-quality healthcare services and early identification of pregnant
women at risk for unfavourable maternal and perinatal outcomes throughout the prenatal
period enhance mother and neonatal survival [43]. In low-resource countries where the
bulk of perinatal fatalities take place at home, machine learning and artificial intelligence
models may be a crucial tool in assessing risk factors for perinatal mortality and triaging
pregnant women at high risk of severe postpartum depression, suicide ideation, and death.
An NLP tool developed by [29] was used to detect perinatal self-harm in electronic
health records with a sufficient level of recall and accuracy. Additionally, depending on
their EHR, authors identified service users who have self-harmed during pregnancy using
the NLP tool. The work demonstrates that it is possible to create an NLP tool that can
recognise instances of perinatal self-harm in EHRs with acceptable validity; however, there
are certain temporal restrictions.
Further, Zhong et al. [34], successfully created algorithms that used data from clinical
notes collected using NLP in electronic medical records to detect pregnant women who were
exhibiting suicidal behaviour. They used both structured, codified data and unstructured,
NLP-processed clinical notes to extract diagnostic information for their investigation, and
they evaluated the algorithm’s diagnostic validity in comparison to gold-standard labels
generated from manual chart checks by psychiatrists and a skilled researcher. The study
also demonstrated that using structured data and employing NLP to mine unstructured
clinical notes significantly enhances the ability to identify suicidal behaviour in pregnant
women. In addition, the approach led to an 11-fold increase in the number of pregnant
women whose suicide behaviours were identified.
Similarly, [30] compared the performance of predefined diagnostic codes vs. NLP of
unstructured text for detecting suicidal behaviour in pregnant women’s electronic health
data. Utilizing NLP significantly increases the sensitivity of screening for suicidal behaviour
in EHRs. Nevertheless, the proportion of verified suicidal behaviour was lower among
women who did not have diagnostic codes for suicidal behaviour.

3.5. Digital Applications for Suicide Detection


Regrettably, only a small percentage of suicidal patients actively participate in their
therapy, and this percentage is much lower for patients whose suicidal ideation is both
frequent and strong. However, although some individuals with a high suicide risk avoid
Int. J. Environ. Res. Public Health 2023, 20, 1514 17 of 23

face-to-face intervention, they may be more likely to try to get aid discreetly via technologi-
cal means [44].
Mobile health applications (MHA) have the potential to expand access to evidence-
based care for those who have suicidal thoughts by addressing some of the constraints that
are present in traditional mental health therapy [45]. These obstacles include stigmatisation,
the perception that expert treatment is not required, and inadequate time in an acute
suicidal crisis. The proliferation of smartphones has made MHA possible. As a result,
the MHA can deliver assistance in a timely manner, in a convenient manner, in a discrete
manner, and at a cheap cost, particularly in a severe crisis, since they are not constrained
by time or location [46].
In the study by [26], an automated algorithm for analysing and estimating the risk
of suicide based on social media data was developed. The research investigates how the
technique may be used to enhance current suicide risk assessment within the health care
system. It also explores the ethical and privacy considerations associated with developing
a system for screening undiagnosed individuals for suicide risk.
The research indicates that the technology can be used for intervention with people
who have decided not to opt in for interventional services. Indeed, technology allows
scalable screening for suicide risk, with the possibility to identify many people who are
at risk prior to their engagement with a health care system. However, although the
development of the intervention system based on algorithmic screening is technologically
possible, the cultural ramifications of its implementation are not yet decided.
Further, [15] developed the Boamente program, which gathers textual data from users’
smartphones and detects the presence of suicidal ideation. They created an Android
virtual keyboard that can passively gather user messages and transfer them to a web
service using NLP and Deep Learning. They then created a web platform that included a
service for receiving text from keyboard apps, a component with the deep learning model
implemented, and a data visualisation application. The technology exhibited the capacity
to detect suicidal thoughts from user messages, nonclinical texts, and data from third-party
social media apps such as Twitter, allowing it to be tested in trials with professionals and
their patients.
Like [15], [5] employed NLP and machine learning to predict suicide ideation and
elevated mental symptoms among adults recently released from psychiatric inpatient or
emergency hospital settings in Spain. They used NLP and ML (logistic regression) on
participant-sent text messages. The text message included a link to a questionnaire and a
mobile application for collecting participant replies. The research demonstrates that it is
feasible to apply NLP-based machine learning predictions algorithms to predict suicide
risk and elevated mental symptoms from free-text mobile phone answers.
A domain Knowledge Aware Risk Assessment (KARA) model is created in experi-
mental research by [35] to enhance suicide identification in online counselling systems. In
their research, they used NLP on a de-identified dataset of 5682 Cantonese talks between
help-seekers and counsellors from a Hong Kong emotional support system. The study
show that it is both feasible and beneficial to utilise an accurate, passive, and automated
suicide risk detection model to inform counsellors of potential risks in a user’s information
as they are engaging with the user. Additionally, the NLP model performed better than
traditional NLP models in several experiments, indicating strong clinical relevance and
translational utility.

3.6. Suicide Prevention Using Electronic Health Records EHR


Electronic health record (EHR) data, in addition to a clinical decision support system
(CDSS), may act as an “early warning system” to notify professionals about patients who
should be evaluated for suicide risk [47,48]. CDSS is a health information system that may
be incorporated into EHR system or healthcare workflow, allowing clinicians to utilise it
easily and effectively. Because of its capacity to deliver evidence-based healthcare to the
point of treatment, the usage of these technologies has increased in recent years [49].
Int. J. Environ. Res. Public Health 2023, 20, 1514 18 of 23

In four of the included studies [9,16,23,24], NLP was used on clinical notes obtained
from electronic health records (EHR), such as the Clinical Record Interactive Search (CRIS)
system, to identify patients who are at risk of suicidal ideation. Using NLP approaches,
these investigations demonstrated the potential application of EHR information to further
research on suicidality and self-harm. Accordingly, this technology also has the potential to
be useful in the expansion of risk prediction in several other areas of mental health such as
eating disorders and depression.
In addition to utilising clinical notes extracted from EHR, McCoy and colleagues [31]
used sociodemographic data, billing codes, and narrative hospital discharge notes for each
patient taken from the electronic health records (EHRs) of the hospital in order to enhance
suicide risk prediction. The research demonstrates that utilising textual data other than
clinical notes, such as demographic, diagnostic code, and billing data, might help clinicians
in assessing suicide risks and may help in identifying high-risk people with high precision.
Using psychotherapy and psychiatric data from EHRs might also potentially enhance
suicide risk prediction, as shown by [19,36]. Indeed, [36] extracted EHR data of hospitalised
patients and PTSD patients and applied NLP, SVM, KNN, CART, Logistic Regression, RF,
Adaboost, and LASSO for a suicide risk prediction tool. The results imply utilising NLP
and data from psychotherapy and psychiatric notes to automatically categorise patients
with or without suicide ideation before hospitalisation, could possibly result in significant
time and resource savings for the identification of high-risk patients and the prevention
of suicide.

3.7. Racial Disparity


Suicide prevention initiatives must be more carefully targeted if they are to be success-
ful. Racial and ethnic disparities in rates of suicidal thoughts, suicide attempts, and suicide
fatalities need to be better understood [50]. The likelihood of suicide varies across racial
and ethnic groups depending on their experiences with prejudice, past trauma, and the
availability of culturally appropriate mental health care. At the same time, under reporting
and other shortcomings in data collection methods restrict the understanding of racial and
ethnic disparities in suicide and suicidal behaviours [51,52].
For instance, studies have shown that young people and women are more likely to
engage in suicide behaviours [53,54]. The connection between race, ethnicity, and suicide
ideation, on the other hand, is far less well understood. Researchers [10] undertook a
study to enhance the accuracy of the categorization of undetermined-intent fatalities and
to evaluate racial disparities in misclassification. National Violent Death Reporting System
(NVDRS) restricted-access case narratives of suicides, murders, and undetermined deaths
in 37 states were subjected to NLP and statistical text analysis. Their analysis demonstrates
that the identification of suicide in deaths involving Black decedents can be significantly
improved by employing race-specific death narratives in modelling. In their experiment
employing race-specific narratives, the suicide prediction rate was significantly improved,
and prediction power was found to be comparable to that for unexplained deaths involving
white decedents. Overall, there is strong evidence that NLP and automated coding systems
might improve the detection of suicide warning signals. Also, including race as a variable
in the modelling method could result in improved predictive power and consequently a
reduction in suicide ideation and deaths, especially among groups at higher risk of suicide.

3.8. Quality Assessment


After an MMAT evaluation, the studies that were included were found to be sat-
isfactory. On the other hand, some studies included a very limited description of the
methodology, population, and settings in which their study was carried out.

4. Discussion
This is the first qualitative systematic review on the use of NLP for suicide prevention.
Using both structured and unstructured data in data modelling with NLP yielded much
Int. J. Environ. Res. Public Health 2023, 20, 1514 19 of 23

more accurate results, as compared to using either structured or unstructured data alone.
Multiple studies demonstrate that integrating structured data, such as diagnosis code,
demographics, and billing data, with unstructured data, such as narratives, increases the
performance and accuracy of detecting individuals with suicidal ideation.
Additionally, persistent and passive observation of individuals with a confirmed
diagnosis of mental health issues is essential and is shown to reduce suicide and self-harm
incidence [15,35]. It has been reported previously that up to ninety percent of suicides are
associated with mental health issues [55]; therefore, passively monitoring persons with
a confirmed diagnosis using an NLP or other ML/AI-based suicide risk assessment tool
might be useful and advantageous. However, ethical and privacy problems should be
examined regarding the use of patient data for suicide surveillance or monitoring, and
additional research is necessary to impact government policy in this area.
In addition, EHRs of ethnic minority patients have been reported previously to include
less notes and details than those of non-ethnic patients [51]. Data equality is vital for reduc-
ing and preventing suicidal ideation, maximising the potential of NLP and other machine
learning and artificial intelligence technologies, addressing health disparities, and ensuring
fair access to services. Further, the use of race-specific data in the development of suicide
risk or prediction systems might boost accuracy and performance and simultaneously
avoid racial bias, as shown by [10,15]. Researchers and software developers should be
conscious of this information, which has been found to improve the effectiveness of pre-
dictive tools [10]. Electronic health records have a variety of information, which is crucial
for building a suicide risk assessment tool. In addition to EHR, it is also feasible to utilise
social media and smartphone applications data to detect individuals with suicide ideation.
Sometimes, to escape the societal stigma associated with suicide thoughts, people may use
online platforms such as blogs, tweets, and forums to express themselves [45,46]. Therefore,
including smartphones and social data in NLP models may enhance suicide diagnosis.
This research validates the results of a previous systematic reviews by [56], which
concluded that NLP can be used in detecting and treating mental health issues including
suicide and self-harm. In addition, NLP techniques may provide insights from unexplored
data such as those from social media and wearable devices that are often inaccessible to
care providers and physicians. Indeed, although machine learning and artificial intelligence
solutions are not intended to replace clinicians in the prevention of suicide or other mental
health issues, they can be used as a supplement in all phases of mental health care, including
diagnosis, prognosis, treatment efficacy, and monitoring.
Natural language processing (NLP) is a powerful text mining method that has many
benefits over other text mining methods. One of the main advantages of NLP is its ability
to process and understand natural language. NLP algorithms are designed to identify
meaning and structure in unstructured text, which makes it easier for the algorithm to ac-
curately categorise and classify the data [57]. Additionally, NLP can interpret the context of
language and understand the nuances of human communication, including the use of slang,
sarcasm, and context-specific expressions. This makes it far more effective for extracting
meaningful insights from text than other text mining methods such as sentiment analysis,
information extraction, and text classification, among others. This advantage is probably a
reason for the wide use of NLP in clinical diagnosis, especially in mental health [58,59].
Despite the many benefits of NLP, it also has a few limitations when compared to other
text mining methods. One limitation is that NLP is limited to the language it is designed
for; it cannot process text written in a language other than the language it was designed
for. Additionally, NLP requires a large amount of data to be effective and can be difficult to
interpret due to its complexity. Finally, NLP algorithms are often computationally intensive,
requiring a considerable amount of computing power to process the data [60]. However,
NLP use in suicide ideation have shown continued progress and is likely to improve the
future of diagnosis and prevention of suicide ideation and related death.
Suicide ideation is increasingly being detected in social media postings, text messages,
and other digital sources using supervised text mining approaches [61]. Supervised ap-
Int. J. Environ. Res. Public Health 2023, 20, 1514 20 of 23

proaches categorise data using predetermined labels and can be used to find patterns in
text that may indicate suicidal intent. One advantage of supervised approaches is that they
may be used to find patterns that are not always visible and to uncover subtle nuances
in language that may signal suicidal ideation. Furthermore, because the specified labels
provide a more solid basis for assessment, supervised methods are more accurate than
unsupervised approaches [62].
Unsupervised text mining approaches, on the other hand, find patterns in text without
using predefined labels. The advantage of unsupervised approaches is that they may
be used to swiftly examine huge amounts of data and find subtle patterns that may not
be visible to the human eye. However, because there is no reliable basis for evaluation,
unsupervised approaches are less accurate than supervised methods and are more prone to
false positives [63].

Limitations
This study has some limitations. Firstly, preprint and unpublished paper were not
included in this study. Thus, grey studies and other data may exist which are not covered
herein. However, Ovid-based databases including MEDLINE and Embase were systemati-
cally searched, and retrieved studies were subjected to manual reference search. Secondly,
the methodologies utilised in the included research are too heterogeneous, and there are no
metrics available to assess their efficacy. As a result, meta-analysis could not be conducted.
Lastly, the efficacy of NLP in preventing suicide ideation and self-harm could not be quanti-
fied, as the included studies did not provide any metrics to the effect. Future study should
give this a high priority since it might provide additional information regarding the efficacy
of NLP in mental health.

5. Conclusions
According to the findings of this research work, NLP could help in the early detection
of individuals who have suicide ideation and allow timely implementation of preventive
measures. It is also found that passive surveillance via mobile applications, online activity,
and social media is feasible and may help in the early diagnosis and prevention of suicide
in vulnerable groups. However, before passive surveillance can be clinically useful, ethical
and security issues need to be addressed.
When modelling, employing race specific terminologies has been demonstrated to
boost both performance and accuracy among ethnic minority groups. This may boost
health equality and allow equitable access to healthcare services. Furthermore, combining
structured and unstructured data have been reported to enhance accuracy and precision
in suicide detection, which is important for developing an NLP model for predicting
suicide risk.
In summary, the application of artificial intelligence and machine learning offers new
prospects to significantly enhance risk prediction and suicide prevention frameworks.
Based on included studies, the use of NLP may be used to develop low-cost, resource-
efficient alternatives to conventional suicide prevention measures. Thus, there is significant
evidence that NLP is beneficial for recognising individuals with suicidal ideation, conse-
quently giving unique opportunities for suicide prevention.

Recommendations
Based on the results that were obtained from this review, the following recommenda-
tions have been made:
• Reducing suicide is a collective effort; the government should form a suicide preven-
tion task group under DHSC to explore technical solutions for early suicide detection.
• Since most people with suicide ideation seek help from ED first, integrating NLP-based
CDSS in ED workflow for suicide risk might help identify them early.
• Adequate training should be giving to staff to recognise unconscious racial bias when
using EHR systems to record patients’ data.
Int. J. Environ. Res. Public Health 2023, 20, 1514 21 of 23

• Include race-specific data in EHR systems and utilise them as a standard for developing
suicide risk prediction tools.
• More study is required to explore privacy issues and ethics of passive data surveillance
or monitoring, particularly on those with mental illness.

Funding: This research received no external funding.


Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.

References
1. Lozano, R.; Naghavi, M.; Foreman, K.; Lim, S.; Shibuya, K.; Aboyans, V.; Abraham, J.; Adair, T.; Aggarwal, R.; Ahn, S.Y. Global
and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: A systematic analysis for the Global Burden
of Disease Study 2010. Lancet 2012, 380, 2095–2128. [CrossRef] [PubMed]
2. Eaton, D.K.; Kann, L.; Kinchen, S.; Shanklin, S.; Flint, K.H.; Hawkins, J.; Harris, W.A.; Lowry, R.; McManus, T.; Chyen, D.; et al.
Youth risk behavior surveillance—United States, 2011. MMWR Surveill Summ. 2012, 61, 1–162. [PubMed]
3. World Health Organization. Suicide; World Health Organization: Geneva, Switzerland, 2021.
4. World Health Organization. Preventing Suicide Preventing Suicide; World Health Organization: Geneva, Switzerland, 2014.
5. Cook, B.L.; Progovac, A.M.; Chen, P.; Mullin, B.; Hou, S.; Baca-Garcia, E. Novel use of natural language processing (NLP) to
predict suicidal ideation and psychiatric symptoms in a text-based mental health intervention in Madrid. Comput. Math. Methods
Med. 2016, 2016, 8708434. [CrossRef]
6. Longhurst, C.A.; Harrington, R.A.; Shah, N.H. A ‘green button’for using aggregate patient data at the point of care. Health Aff.
2014, 33, 1229–1235. [CrossRef] [PubMed]
7. Munot, N.; Govilkar, S.S. Comparative study of text summarization methods. Int. J. Comput. Appl. 2014, 102, 33–37. [CrossRef]
8. Khurana, D.; Koli, A.; Khatter, K.; Singh, S. Natural language processing: State of the art, current trends and challenges. Multimed.
Tools Appl. 2022, 82, 3713–3744. [CrossRef]
9. Carson, N.J.; Mullin, B.; Sanchez, M.J.; Lu, F.; Yang, K.; Menezes, M.; Cook, B.L. Identification of suicidal behavior among
psychiatrically hospitalized adolescents using natural language processing and machine learning of electronic health records.
PLoS ONE 2019, 14, e0211116. [CrossRef]
10. Rahman, N.; Mozer, R.; McHugh, R.K.; Rockett, I.R.; Chow, C.M.; Vaughan, G. Using natural language processing to improve
suicide classification requires consideration of race. Suicide Life Threat. Behav. 2022, 52, 782–791. [CrossRef]
11. NIH. Suicide Prevention; NIH: Bethesda, MD, USA, 2021.
12. Vaci, N.; Liu, Q.; Kormilitzin, A.; De Crescenzo, F.; Kurtulmus, A.; Harvey, J.; O’Dell, B.; Innocent, S.; Tomlinson, A.; Cipriani, A.
Natural language processing for structuring clinical text data on depression using UK-CRIS. Evid.-Based Ment. Health 2020,
23, 21–26. [CrossRef]
13. Shiner, B.; Levis, M.; Dufort, V.M.; Patterson, O.V.; Watts, B.V.; DuVall, S.L.; Russ, C.J.; Maguen, S. Improvements to PTSD quality
metrics with natural language processing. J. Eval. Clin. Pract. 2022, 28, 520–530. [CrossRef]
14. Rodrigues Makiuchi, M.; Warnita, T.; Uto, K.; Shinoda, K. Multimodal fusion of bert-cnn and gated cnn representations for
depression detection. In Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, Nice, France,
21 October 2019; pp. 55–63.
15. Diniz, E.J.; Fontenele, J.E.; de Oliveira, A.C.; Bastos, V.H.; Teixeira, S.; Rabêlo, R.L.; Calçada, D.B.; Dos Santos, R.M.;
de Oliveira, A.K.; Teles, A.S. Boamente: A Natural Language Processing-Based Digital Phenotyping Tool for Smart Monitoring of
Suicidal Ideation. Healthcare 2022, 10, 698. [CrossRef] [PubMed]
16. Cliffe, C.; Seyedsalehi, A.; Vardavoulia, K.; Bittar, A.; Velupillai, S.; Shetty, H.; Schmidt, U.; Dutta, R. Using natural language
processing to extract self-harm and suicidality data from a clinical sample of patients with eating disorders: A retrospective
cohort study. BMJ Open 2021, 11, e053808. [CrossRef] [PubMed]
17. Karmen, C.; Hsiung, R.C.; Wetter, T. Screening internet forum participants for depression symptoms by assembling and enhancing
multiple NLP methods. Comput. Methods Programs Biomed. 2015, 120, 27–36. [CrossRef] [PubMed]
18. Sawalha, J.; Yousefnezhad, M.; Shah, Z.; Brown, M.R.; Greenshaw, A.J.; Greiner, R. Detecting presence of PTSD using sentiment
analysis from text data. Front. Psychiatry 2022, 12, 2618. [CrossRef]
19. Levis, M.; Westgate, C.L.; Gui, J.; Watts, B.V.; Shiner, B. Natural language processing of clinical mental health notes may add
predictive value to existing suicide risk models. Psychol. Med. 2021, 51, 1382–1391. [CrossRef] [PubMed]
20. Divita, G.; Workman, T.E.; Carter, M.E.; Redd, A.; Samore, M.H.; Gundlapalli, A.V. PlateRunner: A Search Engine to Identify
EMR Boilerplates. Stud. Health Technol. Inform. 2016, 226, 33–36.
Int. J. Environ. Res. Public Health 2023, 20, 1514 22 of 23

21. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.;
Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71.
[CrossRef]
22. Hong, Q.N.; Pluye, P.; Fàbregues, S.; Bartlett, G.; Boardman, F.; Cargo, M.; Dagenais, P.; Gagnon, M.-P.; Griffiths, F.; Nicolau, B.
Mixed Methods Appraisal Tool (MMAT), version 2018. BMJ Open 2021, 11, e039246.
23. Bejan, C.A.; Ripperger, M.; Wilimitis, D.; Ahmed, R.; Kang, J.; Robinson, K.; Morley, T.J.; Ruderfer, D.M.; Walsh, C.G. Improving
ascertainment of suicidal ideation and suicide attempt with natural language processing. medRxiv 2022, 12, 15146. [CrossRef]
24. Palmon, N.; Momen, S.; Leavy, M.; Curhan, G.; Boussios, C.; Gliklich, R. PMH52 Use of a Natural Language Processing-Based
Approach to Extract Suicide Ideation and Behavior from Clinical Notes to Support Depression Research. Value Health 2021,
24, S137. [CrossRef]
25. Cohen, J.; Wright-Berryman, J.; Rohlfs, L.; Trocinski, D.; Daniel, L.; Klatt, T.W. Integration and Validation of a Natural Language
Processing Machine Learning Suicide Risk Prediction Model Based on Open-Ended Interview Language in the Emergency
Department. Front. Digit. Health 2022, 4, 818705. [CrossRef] [PubMed]
26. Coppersmith, G.; Leary, R.; Crutchley, P.; Fine, A. Natural language processing of social media as screening for suicide risk.
Biomed. Inform. Insights 2018, 10, 1178222618792860. [CrossRef] [PubMed]
27. Fernandes, A.C.; Dutta, R.; Velupillai, S.; Sanyal, J.; Stewart, R.; Chandran, D. Identifying suicide ideation and suicidal attempts
in a psychiatric clinical research database using natural language processing. Sci. Rep. 2018, 8, 7426. [CrossRef]
28. Pestian, J.P.; Grupp-Phelan, J.; Bretonnel Cohen, K.; Meyers, G.; Richey, L.A.; Matykiewicz, P.; Sorter, M.T. A controlled trial using
natural language processing to examine the language of suicidal adolescents in the emergency department. Suicide Life Threat.
Behav. 2016, 46, 154–159. [CrossRef]
29. Ayre, K.; Bittar, A.; Kam, J.; Verma, S.; Howard, L.M.; Dutta, R. Developing a natural language processing tool to identify perinatal
self-harm in electronic healthcare records. PLoS ONE 2021, 16, e0253809. [CrossRef]
30. Zhong, Q.-Y.; Mittal, L.P.; Nathan, M.D.; Brown, K.M.; Knudson González, D.; Cai, T.; Finan, S.; Gelaye, B.; Avillach, P.;
Smoller, J.W. Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior:
Towards a solution to the complex classification problem. Eur. J. Epidemiol. 2019, 34, 153–162. [CrossRef]
31. McCoy, T.H.; Castro, V.M.; Roberson, A.M.; Snapper, L.A.; Perlis, R.H. Improving prediction of suicide and accidental death after
discharge from general hospitals with natural language processing. JAMA Psychiatry 2016, 73, 1064–1071. [CrossRef]
32. Tsui, F.R.; Shi, L.; Ruiz, V.; Ryan, N.D.; Biernesser, C.; Iyengar, S.; Walsh, C.G.; Brent, D.A. Natural language processing
and machine learning of electronic health records for prediction of first-time suicide attempts. JAMIA Open 2021, 4, ooab011.
[CrossRef]
33. Velupillai, S.; Epstein, S.; Bittar, A.; Stephenson, T.; Dutta, R.; Downs, J. Identifying Suicidal Adolescents from Mental Health
Records Using Natural Language Processing. Stud. Health Technol. Inf. 2019, 264, 413–417. [CrossRef]
34. Zhong, Q.-Y.; Karlson, E.W.; Gelaye, B.; Finan, S.; Avillach, P.; Smoller, J.W.; Cai, T.; Williams, M.A. Screening pregnant women for
suicidal behavior in electronic medical records: Diagnostic codes vs. clinical notes processed by natural language processing.
BMC Med. Inform. Decis. Mak. 2018, 18, 30. [CrossRef] [PubMed]
35. Xu, Z.; Xu, Y.; Cheung, F.; Cheng, M.; Lung, D.; Law, Y.W.; Chiang, B.; Zhang, Q.; Yip, P.S. Detecting suicide risk using
knowledge-aware natural language processing and counseling service data. Soc. Sci. Med. 2021, 283, 114176. [CrossRef]
36. Zhu, H.; Xia, X.; Yao, J.; Fan, H.; Wang, Q.; Gao, Q. Comparisons of different classification algorithms while using text mining to
screen psychiatric inpatients with suicidal behaviors. J. Psychiatr. Res. 2020, 124, 123–130. [CrossRef] [PubMed]
37. Bernert, R.A.; Hilberg, A.M.; Melia, R.; Kim, J.P.; Shah, N.H.; Abnousi, F. Artificial intelligence and suicide prevention: A
systematic review of machine learning investigations. Int. J. Environ. Res. Public Health 2020, 17, 5929. [CrossRef] [PubMed]
38. Pestian, J.; Matykiewicz, P.; Grupp-Phelan, J.; Lavanier, S.A.; Combs, J.; Kowatch, R. Using natural language processing to classify
suicide notes. In Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, Columbus, OH,
USA, 19 June 2008; pp. 96–97.
39. Lewinsohn, P.M.; Rohde, P.; Seeley, J.R. Psychosocial risk factors for future adolescent suicide attempts. J. Consult. Clin. Psychol.
1994, 62, 297. [CrossRef] [PubMed]
40. Pestian, J.P.; Sorter, M.; Connolly, B.; Bretonnel Cohen, K.; McCullumsmith, C.; Gee, J.T.; Morency, L.P.; Scherer, S.; Rohlfs, L.;
Group, S.R. A machine learning approach to identifying the thought markers of suicidal subjects: A prospective multicenter trial.
Suicide Life Threat. Behav. 2017, 47, 112–121. [CrossRef]
41. Mboya, I.B.; Mahande, M.J.; Mohammed, M.; Obure, J.; Mwambi, H.G. Prediction of perinatal death using machine learning
models: A birth registry-based cohort study in northern Tanzania. BMJ Open 2020, 10, e040132. [CrossRef]
42. Hug, L.; Alexander, M.; You, D.; Alkema, L.; UN Inter-Agency Group for Child Mortality Estimation. National, regional, and
global levels and trends in neonatal mortality between 1990 and 2017, with scenario-based projections to 2030: A systematic
analysis. Lancet Glob. Health 2019, 7, e710–e720. [CrossRef]
43. Kuhle, S.; Maguire, B.; Zhang, H.; Hamilton, D.; Allen, A.C.; Joseph, K.; Allen, V.M. Comparison of logistic regression with
machine learning methods for the prediction of fetal growth abnormalities: A retrospective cohort study. BMC Pregnancy
Childbirth 2018, 18, 333. [CrossRef]
44. Wilks, C.R.; Chu, C.; Sim, D.; Lovell, J.; Gutierrez, P.; Joiner, T.; Kessler, R.C.; Nock, M.K. User engagement and usability of suicide
prevention apps: Systematic search in app stores and content analysis. JMIR Form. Res. 2021, 5, e27018. [CrossRef]
Int. J. Environ. Res. Public Health 2023, 20, 1514 23 of 23

45. Sander, L.B.; Lemor, M.-L.; Van der Sloot, R.J.A.; De Jaegere, E.; Büscher, R.; Messner, E.-M.; Baumeister, H.; Terhorst, Y.
A Systematic Evaluation of Mobile Health Applications for the Prevention of Suicidal Behavior or Non-suicidal Self-injury. Front.
Digit. Health 2021, 3, 689692. [CrossRef]
46. Larsen, M.E.; Nicholas, J.; Christensen, H. A systematic assessment of smartphone tools for suicide prevention. PLoS ONE 2016,
11, e0152285. [CrossRef] [PubMed]
47. Warrer, P.; Hansen, E.H.; Juhl-Jensen, L.; Aagaard, L. Using text-mining techniques in electronic patient records to identify ADRs
from medicine use. Br. J. Clin. Pharmacol. 2012, 73, 674–684. [CrossRef]
48. Lu, H.-M.; Chen, H.; Zeng, D.; King, C.-C.; Shih, F.-Y.; Wu, T.-S.; Hsiao, J.-Y. Multilingual chief complaint classification for
syndromic surveillance: An experiment with Chinese chief complaints. Int. J. Med. Inform. 2009, 78, 308–320. [CrossRef]
49. Berrouiguet, S.; Billot, R.; Larsen, M.E.; Lopez-Castroman, J.; Jaussent, I.; Walter, M.; Lenca, P.; Baca-García, E.; Courtet, P.
An Approach for Data Mining of Electronic Health Record Data for Suicide Risk Management: Database Analysis for Clinical
Decision Support. JMIR Ment. Health 2019, 6, e9766. [CrossRef] [PubMed]
50. Suicide Prevention Resource Center. Racial and Ethnic Disparities; Suicide Prevention Resource Center: New York, NY, USA, 2020.
51. Kessler, R.C.; Borges, G.; Walters, E.E. Prevalence of and risk factors for lifetime suicide attempts in the National Comorbidity
Survey. Arch. Gen. Psychiatry 1999, 56, 617–626. [CrossRef] [PubMed]
52. Perez-Rodriguez, M.M.; Baca-Garcia, E.; Oquendo, M.A.; Blanco, C. Ethnic differences in suicidal ideation and attempts.
Prim. Psychiatry 2008, 15, 44.
53. Bridge, J.A.; Horowitz, L.M.; Fontanella, C.A.; Sheftall, A.H.; Greenhouse, J.; Kelleher, K.J.; Campo, J.V. Age-related racial
disparity in suicide rates among US youths from 2001 through 2015. JAMA Pediatr. 2018, 172, 697–699. [CrossRef]
54. Canetto, S.S. Women and suicidal behavior: A cultural analysis. Am. J. Orthopsychiatry 2008, 78, 259–266. [CrossRef]
55. Brådvik, L. Suicide risk and mental disorders. Int. J. Environ. Res. Public Health 2018, 15, 2028. [CrossRef]
56. Le Glaz, A.; Haralambous, Y.; Kim-Dufor, D.-H.; Lenca, P.; Billot, R.; Ryan, T.C.; Marsh, J.; Devylder, J.; Walter, M.; Berrouiguet, S.
Machine learning and natural language processing in mental health: Systematic review. J. Med. Internet Res. 2021, 23, e15708.
[CrossRef]
57. Liddy, E.D. Natural Language Processing; Marcel Decker, Inc.: New York, NY, USA, 2001.
58. He, Q.; Veldkamp, B.P.; Glas, C.A.; de Vries, T. Automated assessment of patients’ self-narratives for posttraumatic stress disorder
screening using natural language processing and text mining. Assessment 2017, 24, 157–172. [CrossRef]
59. Cohen, A.S.; Mitchell, K.R.; Elvevåg, B. What do we really know about blunted vocal affect and alogia? A meta-analysis of
objective assessments. Schizophr. Res. 2014, 159, 533–538. [CrossRef] [PubMed]
60. Chowdhary, K. Natural language processing. In Fundamentals of Artificial Intelligence; Springer: Berlin/Heidelberg, Germany,
2020; pp. 603–649.
61. Narynov, S.; Mukhtarkhanuly, D.; Kerimov, I.; Omarov, B. Comparative analysis of supervised and unsupervised learning
algorithms for online user content suicidal ideation detection. J. Theor. Appl. Inf. Technol. 2019, 97, 3304–3317.
62. Cheng, Q.; Lui, C.S.M. Applying text mining methods to suicide research. Suicide Life Threat. Behav. 2021, 51, 137–147. [CrossRef]
[PubMed]
63. Dang, S.; Ahmad, P.H. Text mining: Techniques and its application. Int. J. Eng. Technol. Innov. 2014, 1, 22–25.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.

You might also like