AP Final Project - Group 2
Submitted by
Name SAP ID Roll Number
Anujj Misra 80672300213 A011
We sincerely thank Dr. Siby Abraham for all his help during this research, including his
insightful comments, persistent support, and helpful counsel. His knowledge and support
have been crucial in forming this research.
We further thank Ms. Malvika Rao, Counsellor (Psychologist) at SVKM’s Narsee Monjee
Institute of Management Studies for her guidance and views to help us successfully navigate
through this sensitive topic.
Additionally, we would like to sincerely thank our classmates for their unwavering
cooperation and support. Their supportive comments and helpful critiques have helped us to
improve our concepts and methods.
We also appreciate Swaranka Pethe, a committed psychology student from the University of
Derby, for her insightful comments throughout the sessions. Her knowledge and perceptions
have improved our comprehension of the intricacies involved in mental health concerns.
Without the kind collaboration and participation of the students who generously offered their
experiences and viewpoints, this research would not have been feasible. Their readiness to
have direct, unbiased, and open conversations has been crucial in illuminating the variables
affecting students' mental health and suicide rates.
Abstract

In today's educational milieu, students confront unprecedented mental health challenges,
exacerbated by academic demands, social detachment, and the pervasive influence of digital
technology. The escalating prevalence of anxiety, depression, and burnout among students
underscores the pressing need for comprehensive support and resources to address this
critical issue. This research endeavors to construct a predictive analytics model aimed at
anticipating deteriorating mental health in students and averting instances of student
suicides. Leveraging an extensive dataset comprising past behavioral, academic, and
demographic information, the study incorporates numerous variables, encompassing social
interactions, academic attainment, and prior mental health history. Employing sub-fields of
Artificial Intelligence such as Machine Learning, Natural Language Processing (NLP), and
Multivariate Data Analytics (MDA), the objective is to develop a predictive model capable of
discerning patterns and trends indicative of heightened risks of suicidal ideation. By
leveraging Natural Language Processing to analyze extensive social media data, coupled with
the utilization of four machine learning techniques, the research distinguishes between
suicidal ideation and depressive thoughts. Through comparative analysis, the study
ascertains that artificial neural networks exhibit a predictive accuracy of 70%.
Additionally, employing random forests, a machine learning model is devised to predict
instances of self-harm with 69% accuracy, based on socio-economic and psychological
parameters extracted from student profiles. Furthermore, the research endeavors to identify
pivotal factors contributing to depressive ideation or inclination toward self-harm
utilizing MDA techniques like Factor Analysis and Regression. The underlying impetus for
this project is to provide timely and tailored support to students identified as at-risk
for suicide, to mitigate suicide risk within campus communities, and to enhance the overall
well-being of students in educational institutions.

Introduction

The surge in student suicides documented by the National Crime Records Bureau (NCRB)
highlights a concerning trend in India's educational sphere. With 13,089 student suicides
reported in 2021 alone, representing a staggering 70% increase from 2011, the gravity of
the situation cannot be overstated. On average, nearly 36 students took their own lives
each day throughout the year, shedding light on the profound mental health challenges faced
by young learners across the nation.

This alarming rise underscores the urgent need for a comprehensive approach to address the
complex interplay of factors contributing to deteriorating mental health within educational
settings. Academic pressures, social isolation, and the pervasive influence of digital
technology are among the myriad stressors exacerbating mental health issues among students.
The relentless pursuit of academic excellence, coupled with the pressures of navigating a
hypercompetitive environment, often takes a toll on students' psychological well-being.
Moreover, the omnipresence of social media platforms, while offering connectivity and
convenience, can foster feelings of inadequacy, comparison, and isolation, further
exacerbating mental health challenges.

Ultimately, the overarching goal of this research is to provide timely and tailored support
to students identified as at-risk for suicide, thereby mitigating the risk of self-harm
within campus communities and enhancing the overall well-being of students in educational
institutions. Through rigorous analysis and targeted interventions, we aspire to create a
safer and more supportive environment conducive to the holistic development and flourishing
of every student.

Literature Review & Research

... essential for future endeavours. However, the transitional phase into academia can pose
challenges, potentially leading to psychological distress among students. Moreover, the
competitive nature inherent in academic environments, marked by the pursuit of excellence
and overwhelming workloads, coupled with interpersonal conflicts, can exacerbate stress
levels, escalating the risk of mental health issues, including suicidal tendencies.

Suicidal behaviour, a multifaceted phenomenon influenced by various factors such as
biological, psychological, social, and environmental elements, presents a significant
global concern. It manifests in different forms, from ideation to attempted and completed
suicide, with profound impacts on individuals, families, and societies at large.

According to the World Health Organization (WHO) in August 2023, every year more than
700,000 people take their own lives, with suicide being the fourth leading cause of death
among 15–29-year-olds globally in 2019. Moreover, over 77% of global suicides occurred in
low- and middle-income countries in 2019. (World Health Organization, August 2023)
Fig.4.1.3
While there are similarities between the factors identified in the two datasets, such as
factors related to depression and anxiety, there are also notable differences. Dataset 1
emphasized broader domains like general well-being and social connectedness, while Dataset
2 focused more specifically on symptoms of depression and anxiety arousal. These
differences suggest that while there are common themes across student experiences, each
dataset offers unique perspectives on the factors influencing students' mental health and
academic performance. Together, these datasets provide comprehensive insights into the
complex interplay of psychosocial factors affecting students, informing the development of
targeted interventions and support systems tailored to address their specific needs.
b) Regression
Fig.4.1.5
The results of the regression analysis reveal several noteworthy findings. Firstly, anxiety
levels, self-esteem, and sleep quality emerge as significant predictors of depression, with
higher levels of anxiety and lower self-esteem associated with increased depressive
symptoms. This aligns with existing literature highlighting the bidirectional relationship
between anxiety and depression, as individuals with anxiety disorders are at heightened
risk for developing depression, and vice versa. Moreover, poor sleep quality has been
consistently linked to depression, underscoring the importance of addressing sleep
disturbances in mental health interventions.

Furthermore, academic stressors, such as study load and future career concerns, also exert
a significant influence on depression scores. The pressures and expectations associated
with academic performance and future aspirations can exacerbate feelings of inadequacy and
hopelessness, contributing to depressive symptoms among students. Similarly, challenges in
the teacher-student relationship and engagement in extracurricular activities are
associated with variations in depression scores, highlighting the impact of social support
and meaningful engagement on mental well-being.

One notable finding from the regression analysis is the positive relationship between
depression and factors like headache, blood pressure, and stress level. These physiological
markers of stress and arousal underscore the intricate interplay between psychological and
physiological processes in depression. Chronic stress and physiological arousal have been
implicated in the pathogenesis of depression, affecting neurotransmitter systems,
neuroendocrine function, and inflammatory processes implicated in depressive
symptomatology.

Importantly, the inclusion of depression as the dependent variable in this regression model
holds significant implications for understanding the link between depression and suicide.
Depression is a well-established risk factor for suicidal behavior, with individuals
experiencing depressive symptoms being at heightened risk for suicidal ideation, suicide
attempts, and completed suicide. The hopelessness, despair, and emotional pain
characteristic of depression can overwhelm individuals' coping mechanisms and lead to
suicidal thoughts and behaviors as a perceived means of escape from suffering.
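
To make the regression setup concrete, the following is a minimal sketch of a multiple linear regression with the depression score as the dependent variable. It is illustrative only: the data is synthetic, the three predictors (anxiety, self-esteem, sleep quality) are a subset chosen for brevity, and the helper `fit_ols` and the coefficients used to generate the data are assumptions, not the study's software or fitted estimates.

```python
# Minimal OLS sketch on synthetic data (illustrative only, not the study's data).

def fit_ols(rows, targets):
    """Return [intercept, b1, ..., bk] solving the normal equations (X'X) b = X'y."""
    X = [[1.0] + [float(v) for v in row] for row in rows]  # prepend intercept column
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(x[i] * x[j] for x in X) for j in range(k)]
        + [sum(x[i] * y for x, y in zip(X, targets))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot_row = max(range(col, k), key=lambda r: abs(A[r][col]))
        if abs(A[pivot_row][col]) < 1e-12:
            raise ValueError("predictors are collinear; the model cannot be fitted")
        A[col], A[pivot_row] = A[pivot_row], A[col]
        pivot = A[col][col]
        A[col] = [v / pivot for v in A[col]]
        for r in range(k):
            if r != col:
                factor = A[r][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][k] for i in range(k)]


# Hypothetical predictors per student: (anxiety, self_esteem, sleep_quality).
students = [(10, 20, 3), (5, 25, 4), (15, 10, 1), (8, 18, 2), (12, 22, 5), (3, 28, 4)]

# Synthetic depression scores generated from assumed coefficients, so the fit
# should recover them: higher anxiety raises the score, while higher
# self-esteem and better sleep lower it.
TRUE_COEFS = [5.0, 0.8, -0.3, -1.0]
depression = [
    TRUE_COEFS[0] + sum(c * v for c, v in zip(TRUE_COEFS[1:], s)) for s in students
]

coefs = fit_ols(students, depression)
for name, value in zip(["intercept", "anxiety", "self_esteem", "sleep_quality"], coefs):
    print(f"{name:>13}: {value:+.3f}")
```

The sign of each coefficient is what the discussion above interprets: a positive coefficient means the depression score rises with that predictor, a negative one means it falls. Significance testing, which the analysis also relies on, is not covered by this sketch.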
c) MDS
Fig.4.1.6
Dimension 1 - Observable symptoms
Dimension 2 - Instantly Addressable symptoms
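
A two-dimensional configuration of this kind is obtained from pairwise dissimilarities between symptoms. The sketch below uses classical (Torgerson) MDS with a simple power iteration; this is an assumption, since the report does not state which MDS variant or software was used. The symptom positions that generate the dissimilarity matrix are hypothetical and serve only to make the example self-checking.

```python
import math

def classical_mds(dissim, dims=2, iters=500):
    """Embed items in `dims` dimensions from a symmetric dissimilarity matrix."""
    n = len(dissim)
    # Double-centre the squared dissimilarities: B = -1/2 * J * D^2 * J.
    sq = [[d * d for d in row] for row in dissim]
    row_mean = [sum(row) / n for row in sq]
    grand_mean = sum(row_mean) / n
    B = [[-0.5 * (sq[i][j] - row_mean[i] - row_mean[j] + grand_mean)
          for j in range(n)] for i in range(n)]

    def times_B(v):
        return [sum(B[i][j] * v[j] for j in range(n)) for i in range(n)]

    coords = [[0.0] * dims for _ in range(n)]
    for d in range(dims):
        # Power iteration finds the dominant eigenvector of the (deflated) B.
        v = [float(i + 1) for i in range(n)]
        for _ in range(iters):
            w = times_B(v)
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient gives the matching eigenvalue.
        eigval = sum(a * b for a, b in zip(v, times_B(v))) / sum(a * a for a in v)
        if eigval <= 1e-12:
            break  # no further positive eigenvalue; remaining axes stay at zero
        for i in range(n):
            coords[i][d] = math.sqrt(eigval) * v[i]
        # Deflate so the next pass finds the next eigenvector.
        for i in range(n):
            for j in range(n):
                B[i][j] -= eigval * v[i] * v[j]
    return coords


symptoms = ["Fidgety", "Lack of Concentration", "Poor Appetite", "Feeling Tired"]
# Hypothetical 2-D positions, used only to build a consistent dissimilarity matrix.
positions = [(0.0, 0.0), (6.0, 1.0), (2.0, 3.0), (2.5, 3.5)]
dissim = [[math.dist(p, q) for q in positions] for p in positions]

embedding = classical_mds(dissim)
for name, (x, y) in zip(symptoms, embedding):
    print(f"{name:<22} dim1={x:+.2f} dim2={y:+.2f}")
```

The recovered axes are determined only up to rotation and reflection, which is why the dimensions of an MDS plot have no built-in meaning and any labels attached to them are interpretive.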
The MDS plot reveals distinct clusters based on symptom characteristics. Well-separated
points (e.g., Fidgety, Lack of Concentration) indicate high dissimilarity, while proximity
suggests greater similarity (e.g., Poor Appetite, Feeling Tired). Dimensions represent
abstract variations, not specific features. Dimension 1, tentatively labeled "Observable
Symptoms," groups readily apparent symptoms like Fidgety and Lack of Concentration.
Dimension 2, potentially "Instantly Addressable Symptoms," includes Poor Appetite, Feeling
Tired, and Lack of Concentration, suggesting potential for immediate intervention. However,
these interpretations are tentative and require further investigation for confirmation and
refinement.
4.2 NLP:

Following meticulous data processing and vectorization procedures, diverse machine-learning
algorithms were employed to delineate classifications between 'depressed' and 'suicidal'. A
comparative analysis of these models yielded the following results:

a. Logistic Regression: Within the framework of Natural Language Processing (NLP) for
classifying depression and suicidal behavior, textual features are leveraged to ascertain
the likelihood of each class, thereby enabling discrimination between depressed and
suicidal individuals. A model was meticulously constructed to undergo training on the
dataset, subsequently facilitating predictions regarding the classification of input text
as either 'depressed' or 'suicidal'. ... input, the model is also able to make an accurate
classification between 'depressed' and 'suicidal'.

b. Support Vector Machines (SVM): In NLP, distinguishing between depression and suicidal
behavior utilizes text features to determine an optimal decision boundary, effectively
separating the two classes. By analyzing textual data, SVM learns patterns in the feature
space, identifying linguistic cues indicative of depression or suicidal ideation. SVM seeks
to maximize the margin between classes while minimizing classification errors. It
transforms text samples into numerical representations, facilitating the creation of a
hyperplane that best separates depressed and suicidal individuals. Evaluation metrics like
accuracy, precision, and recall assess SVM's performance in classifying individuals based
on their textual expressions.
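
The pipeline described in this section (vectorize text, train a classifier, score it with accuracy, precision, and recall) can be sketched in a few lines. The example below is a minimal illustration under stated assumptions: it uses a plain bag-of-words count vector and a hand-written logistic regression, the eight training sentences are invented, and all function names are hypothetical. It is not the study's dataset, vectorizer, or model configuration.

```python
import math
from collections import Counter

def tokenize(text):
    """Lower-case and split on whitespace (a deliberately simple tokenizer)."""
    return text.lower().split()

def build_vocab(texts):
    """Map every distinct token in the training texts to a column index."""
    tokens = sorted({tok for text in texts for tok in tokenize(text)})
    return {tok: idx for idx, tok in enumerate(tokens)}

def vectorize(text, vocab):
    """Bag-of-words count vector; tokens outside the vocabulary are ignored."""
    vec = [0.0] * len(vocab)
    for tok, count in Counter(tokenize(text)).items():
        if tok in vocab:
            vec[vocab[tok]] = float(count)
    return vec

def score(x, w, b):
    """Linear decision score; a value >= 0 means class 1."""
    return b + sum(wj * xj for wj, xj in zip(w, x))

def train_logreg(X, y, lr=0.5, epochs=1000):
    """Full-batch gradient descent on the mean logistic loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = 1.0 / (1.0 + math.exp(-score(xi, w, b))) - yi
            grad_b += err
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def evaluate(y_true, y_pred, positive=1):
    """Accuracy, precision and recall, treating `positive` as the target class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    correct = sum(t == p for t, p in pairs)
    return {
        "accuracy": correct / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }

# Invented toy corpus: label 0 = 'depressed', label 1 = 'suicidal'.
train_texts = [
    "i feel sad and tired all day",
    "i cannot sleep and i feel empty",
    "nothing feels fun and i feel low",
    "i feel tired and sad about exams",
    "i want to end my life",
    "i do not want to live anymore",
    "i keep thinking about ending my life",
    "my life is not worth living",
]
train_labels = [0, 0, 0, 0, 1, 1, 1, 1]

vocab = build_vocab(train_texts)
X_train = [vectorize(text, vocab) for text in train_texts]
weights, bias = train_logreg(X_train, train_labels)

# Scored on the training texts only to show the mechanics; a real evaluation
# needs a held-out test set.
predictions = [1 if score(x, weights, bias) >= 0 else 0 for x in X_train]
metrics = evaluate(train_labels, predictions)
print(metrics)
```

Recall on the 'suicidal' class is the metric to watch in this setting, because a missed case (a false negative) is the costlier error when the goal is to flag students who need urgent care. An SVM would replace only the training step; the vectorization and evaluation stay the same.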
Fig.4.3.1
4.4 Tableau:

Utilizing Tableau, our study has derived insightful visualizations that offer a nuanced
comprehension of the dataset, revealing previously unnoticed patterns and inferences. These
visual representations serve as a pivotal component in elucidating complex data
relationships, facilitating a deeper understanding of the factors influencing student
mental health and well-being. By leveraging Tableau's advanced visualization capabilities,
we have unearthed valuable insights crucial for informing targeted interventions and
support strategies tailored to address the multifaceted challenges encountered by students
in educational settings.

Insights derived -

... levels of dependency and perceived expectations, potentially leading to increased
pressure. This dependency and pressure may contribute to elevated levels of stress and, in
some cases, escalate to more severe suicidal thoughts.

8.23% of students who are staying at private rented accommodation are rated 3 or 4.
Students living independently in private rented accommodation may experience feelings of
isolation, as they are away from the support systems provided by family or university
halls. Students managing their accommodation in private rentals often face additional
financial and academic stressors. The absence of a strong support network can contribute to
heightened emotional distress and an increased likelihood of more severe suicidal thoughts.
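
A share such as the percentage of students in private rented accommodation who are rated 3 or 4 is a grouped proportion. The study computed its figures in Tableau; the Python sketch below shows the same calculation in a minimal form. The records, the accommodation labels, and the function name `share_rated_high` are hypothetical, and the rating is assumed to be an ordinal severity score.

```python
from collections import defaultdict

def share_rated_high(records, high_ratings=(3, 4)):
    """Percentage of students per accommodation type whose rating is in `high_ratings`."""
    totals, high = defaultdict(int), defaultdict(int)
    for accommodation, rating in records:
        totals[accommodation] += 1
        if rating in high_ratings:
            high[accommodation] += 1
    return {acc: 100.0 * high[acc] / totals[acc] for acc in totals}


# Hypothetical (accommodation, rating) records, for illustration only.
records = [
    ("Private rented", 3), ("Private rented", 1),
    ("Private rented", 0), ("Private rented", 4),
    ("University halls", 0), ("University halls", 3),
    ("With family", 1), ("With family", 0),
]

shares = share_rated_high(records)
for accommodation, pct in shares.items():
    print(f"{accommodation:<17} {pct:.2f}% rated 3 or 4")
```

Dividing by the group size rather than by the whole sample is a design choice: it keeps the percentages comparable between accommodation types of different sizes.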
Fig.4.4.2
III. When we were initially plotting the data, no clear inference was apparent. However,
after some comparative analysis, a notable trend has emerged. In the age group of 10-14
years, the suicide percentage is lower compared to other age brackets, but it is showing
the most significant increase over time. This pattern is observed in many countries, with
similarities across various brackets. Notably, in several Asian countries, there is a
faster rate of increase in suicide percentages among 10-14-year-olds compared to other age
groups. Taking India as an example, it is evident that, except for the 10-14 age group, the
suicide percentage decreased from 2001 to 2009. From 2010 to 2017, it remained relatively
constant for all age brackets except the 10-14 group, where a notable increase in
percentage has been observed since 2005.
Fig.4.4.3
5. Conclusion
In conclusion, this research represents a significant stride toward addressing the
escalating mental health challenges confronting students in today's educational landscape.
By harnessing advanced analytical methodologies and artificial intelligence, our study
endeavors to construct predictive models capable of preempting deteriorating mental health
trajectories and mitigating instances of student suicides. Through a meticulous analysis of
diverse behavioral, academic, and demographic variables, alongside sophisticated techniques
such as machine learning, natural language processing, and multivariate data analytics,
we've gained invaluable insights into the complex factors contributing to suicidal ideation
and depressive episodes among students.

Through MDA, nuanced factors influencing students' well-being emerged. Factor analysis
unveiled "General well-being" and "Social Connectedness" in Dataset 1, while Dataset 2
revealed "General Depression Factor" and "Anxiety Arousal Factor." Regression showed
anxiety, self-esteem, and sleep quality as depression predictors, alongside academic
stressors, and future concerns.

We used NLP to make the distinction between depressive and suicidal thoughts to identify
students who require urgent intervention and care. We used 4 machine learning models for
classification. Out of the 4 models, comparative analysis shows that using logistic
regression gave the best classification results with an accuracy of 72%. This technique
helps us identify at-risk youth using their sentiments extracted from their social media
activities. We also analyze the holistic socio-economic and psychological profiles using
machine learning to predict the degree of self-harm that a student might be contemplating.
The data is extracted from real-life patients. Using this algorithm, we were able to
predict with an accuracy of 67.6%. We were also able to narrow down the factors that
affected our dependent variable of self-harm the most. Continued refinement and validation
of these models, along with collaborative engagement among stakeholders, will be vital in
translating research insights into actionable strategies that prioritize student mental
health and facilitate positive change.
6. Future Scope
Moving forward, the future scope of this project encompasses several key avenues for
further exploration and refinement. Firstly, ongoing efforts will focus on enhancing the
predictive accuracy of our models through the incorporation of additional data sources and
the refinement of algorithmic parameters. Collaborative partnerships with educational
institutions and mental health organizations will be pivotal in augmenting our datasets and
validating the efficacy of our predictive frameworks in diverse real-world settings. It's
important to note that the data used for this particular project came only from the
patients who consented to their information being released. Therefore, including more data
from a wider pool of individuals could yield even better results. Hence, gathering more
data will be required to ensure the robustness and generalizability of our predictive
models.

Furthermore, the integration of real-time monitoring capabilities and intervention
protocols will enable proactive identification and support for students at risk of mental
health crises, fostering a culture of preventive mental healthcare within educational
communities. Additionally, future iterations of this project will prioritize the
development of personalized interventions tailored to the unique needs and circumstances of
individual students, leveraging advanced analytics and artificial intelligence to deliver
targeted support resources and strategies. Through ongoing research, validation, and
stakeholder engagement, we aspire to catalyze a paradigm shift in student mental health
support, wherein proactive identification, intervention, and support mechanisms become
integral components of educational ecosystems, nurturing the holistic well-being and
resilience of students across diverse socio-cultural contexts. Ultimately, the future
trajectory of this project is anchored in a steadfast commitment to leveraging cutting-edge
technology and interdisciplinary collaboration to address the complex and pressing
challenges facing student mental health in the 21st century.
Declaration of Competing Interest
We have no competing interests to declare.
Data Availability
The authors do not have permission to
share data.
References
https://www.who.int/news-room/fact-sheets/detail/suicide
[2] Machado, C. D. S., Ballester, P. L., Cao, B., Mwangi, B., Caldieraro, M. A., Kapczinski, F., &
2996.
https://doi.org/10.1017/s0033291720004997
[3] De Oliveira Crispim, M., Santos, C. M. R. D., Da Silva Frazão, I., De Queiroz Frazão, C. M. F.,
https://doi.org/10.1590/1518-8345.5320.3495
[4] Bernert, R. A., Hilberg, A. M., Melia, R., Kim, J., Shah, N. H., & Abnousi, F. (2020). Artificial
17(16), 5929.
https://doi.org/10.3390/ijerph17165929
[5] Aseltine, R. H., Jr., & Schoenborn, C. A. (2016). Depression and suicidal thoughts among college
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3057910/
[6] Bhattacharya, P., & Chawla, N. (2020). Mental health and suicidal ideation among university
https://www.cambridge.org/core/journals/psychological-medicine/article/prevalence-of-suicidal-thoughts-and-behaviours-among-college-students-a-metaanalysis/F31360A7411B35C4AC3B1A8DA67FA016
[7] Garcia, N. M., & Calvete, E. (2019). Risk factors for suicidal ideation in university students: A
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0261785
[8] Racine, N. E., Cunningham, N. A., Liu, H., & Weems, C. F. (2019). Depression and suicidal
https://pubmed.ncbi.nlm.nih.gov/17559087/
[9] Eaton, N. R., Keyes, K. M., Krueger, R. B., & Blazer, D. G. (2010). Major depressive disorder in
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9483000/
[10] Liu, X., & He, J. (2016). Depression and suicidal ideation among Chinese college students: A
[11] Serafini, L., Amore, M., De Berardis, D., & Russo, E. (2019). Suicidal risk factors and protective
170-190.
https://www.scielo.br/j/rbp/a/3bYbDB7dXFr6jtvsdhc6bYb/?lang=en
[12] Eisenberg, D. (2011). Depression and anxiety disorders in college students. American
https://pubmed.ncbi.nlm.nih.gov/21543948/
[13] Prather, R. M., & John, D. M. (2014). The prevalence of suicidal thoughts and behaviors among
high school students in the United States: Population-based estimates from the 2011
https://www.cdc.gov/mmwr/preview/mmwrhtml/ss6104a1.htm