0% found this document useful (0 votes)

27 views6 pages

Suicidal Thought Detection Using NLPNatural Language Processing On Reddit Data

Uploaded by

vijaykambhampati79

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views6 pages

Suicidal Thought Detection Using NLPNatural Language Processing On Reddit Data

Uploaded by

vijaykambhampati79

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

2023 26th International Conference on Computer and Information Technology (ICCIT)

13-15 December, Cox’s Bazar, Bangladesh

Suicidal Thought Detection Using NLP(Natural

Language Processing) on Reddit Data
MD. Rafi Imam1 , Oishi Jyoti2 , Zakia Afrin3 , Md. Munawar Hossain4 , and Tamanna Hossain Mou5
1,2,3,4,5
Department of Electrical & Computer Engineering
1,2,3,4,5
Rajshahi University of Engineering & Technology
Rajshahi-6204, Bangladesh
[email protected] , [email protected] , [email protected] [email protected] ,and [email protected]

Abstract—Our study harnesses the power of NLP to contribute 3 outlines the methodology, emphasizing the use of NLP
to the identification of suicidal ideation in text data. In the realm approaches. In Section 4, the paper details the implementation
of mental health, identification of suicidal ideation at an early of the TF-IDF model. Section 5 discusses results, highlighting
stage is of paramount importance for suicide prevention efforts.
This paper presents a novel approach to suicide ideation detection the significance of NLP in addressing mental health challenges
using Natural Language Processing (NLP) techniques. Faced with through social media analysis. In conclusion (Section 6), the
a dearth of publicly available datasets for this critical task, research underscores the value of this approach for suicide
we have contributed a valuable resource by curating a dataset prevention initiatives in the digital age.
from the ”SuicideWatch” and ”depression” subreddits on the
Reddit platform, collected via the Pushshift API. Specifically, we II. R ELATED W ORKS
employ the LSTM and a Random Forest classifier separately
to achieve promising results in this vital area of research. In [7], TFIDF and BoW are used in tandem to precisely
This work not only advances the field of NLP-based suicide differentiate between positive and negative tweets. They dis-
ideation detection but also contributes a valuable dataset for covered that by utilizing the TF-IDF vectorizer, the precision
future investigations, potentially saving time and resources for of sentiment analysis can be greatly improved and simulation
researchers and professionals dedicated to the prevention of
suicide and the improvement of mental health. We could achieve results demonstrate the efficiency of our suggested method.
up to 93% accuracy in suicidal thought analysis using NLP Using the NLP approach, we achieved 85.25% accuracy in
techniques. sentiment analysis. In [8], The author provided an auto-
Index Terms—NLP, Reddit, Pushshiftf, API, LSTM, Random mated conversational platform that was utilized to identify
Forest Classifier. depression-related risks as a preliminary strategy. The platform
was designed to interpret conversations through NLP and
I. I NTRODUCTION machine learning. The suggested two-phased platform’s initial
In the contemporary digital landscape, social networking phase would examine discussion and sort related emotions
sites have emerged as global agora, reshaping communication into four categories: ”happy,” ”neutral,” ”depressive,” and
dynamics and offering unprecedented insights into human ”suicidal.” In [9], The authors suggested a categorization
emotions [1]–[4]. Among these platforms, Reddit stands out strategy, deep neural networks, Bi-LSTM, CNN, and self-
as a rich source for sentiment research due to its diverse sub- attention are used in this model. , which they demonstrated on
reddits, fostering candid discussions, including those related to several datasets. Furthermore, they contrast three pre-trained
mental health [5]. Addressing the pressing issue of suicide, this word-embeddings for word encoding. The optimistic findings
research employs Natural Language Processing (NLP) tech- achieved on cutting-edge datasets allow us to test the model’s
niques, specifically leveraging the TF-IDF model, to discern validity and examine the optimum word embeddings to use for
between posts expressing suicidal ideation and non-suicidal emotion identification. They suggest their model as a starting
sentiments. With over 47,000 lives lost annually to suicide in point for further research in the issue because deep learning
the United States alone, understanding and preventing suicidal is so significant in the academic world.
thoughts are critical [6]. In [10], the authors provided the findings of a thorough map-
This study aims to contribute to mental health research by ping research to arrange the available published information.
utilizing Reddit data for sentiment analysis. Section 2 reviews We looked for studies completed between 2015 and 2020 in
related studies on suicidal thought prediction, while Section electronic research databases of the scientific literature using a

979-8-3503-5901-5/23/$31.00 ©2023 IEEE

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.
stepwise PRISMA approach. They uncovered 92 relevant stud-
ies on sentiment analysis of student input in learning platform
environments out of 612 initially discovered. The mapping
results revealed that, despite the mentioned limitations, the
area is rapidly expanding, particularly in terms of the use
of DL, which is the most current trend. They outlined some
elements that must be considered in order to contribute to the
maturity of research and development in the area.
In [11], They demonstrated that using social media data to
identify people at risk of suicide is viable. They used NLP
and ML (specifically, deep learning) techniques that described
proposals for an automated method for predicting suicide risk
that may be utilized by persons without specialist mental
health training (for example, a primary care practitioner). They
also discussed the ethical implications of such technology and
the implications for privacy. This technique was only utilized
for those who ”opted in” for the analysis and intervention, Fig. 1. Architecture Overview of Proposed Methodology
but it permitted scalable screening for suicide risk, potentially
identifying many people at risk before they contacted a health
care system. method is the use of text-type data derived from the Reddit
This presented a significant cultural concern regarding the site. The process consists of a number of significant steps, each
trade-off between privacy and prevention—they have the tech- of which contributes to the thorough examination of Reddit
nology that has the potential to save lives, but it only reached data to find suicidal ideation.
a small percentage of the population who could be at risk due
Algorithm 1
to respect for their privacy.
In [12], They tested weakly supervised algorithms for de- Step 1: Data Preparation
tecting ”current” events. Suicidal ideation in electronic health 1. Import necessary libraries:
record (EHR) systems was derived from unstructured clinical • Import the required libraries such as WordCloud, numpy,
notes. Weakly supervised machine learning approaches use pandas, sklearn, neattext, matplotlib, plotly, keras, tensor-
imprecise labels for training, which reduces the effort of flow, and others.
producing a big dataset.
2. Load the dataset:
The dataset had been manually annotated. After identifying
• Load the dataset, which presumably contains text data
a cohort of 600 patients at risk for suicidal ideation, they
used a rule-based NLP technique to categorize the training along with labels indicating whether the text expresses
and validation notes (n = 600). Using this vast collection of suicidal thoughts or not.
clinical observations, they developed multiple ML statistical Step 2: Data Analysis and Visualization
models, including a logistic classifier, support vector machines 3. Explore the data:
(SVM), and a Naive Bayes classifier, as well as one deep learn- • Examine the distribution of classes in the dataset, i.e.,
ing model, a text classification convolutional neural network ”Suicide” and ”Not Suicide.”
(CNN), to be tested on a hand reviewed test set (n = 837). 4. Visualize the class distribution:
Our research is pulsing not merely with technological
• Create bar charts and pie charts to visualize the distribu-
prowess, but also with the tremendous potential to save
tion of suicide-related and non-suicidal content.
lives and alleviate suffering. We have given the scientific
community a rich dataset, a tapestry of human emotions that Step 3: Text Preprocessing
calls out to other researchers who share our dedication to the 5. Clean the text data:
improvement of mental health and the prevention of suicide. • Lowercase the text.
These findings, far from being simple statistics, reflect the • Remove special characters.
unspoken cries for assistance of people in need, and it is our • Remove stopwords.
serious duty to listen and respond. Step 4: Tokenization and Word Frequency
III. P ROPOSED M ETHODOLOGY 6. Tokenize the cleaned text:
• Tokenize the preprocessed text data.
This section summarizes our ideas and focuses on a method
for performing sentiment analysis on Reddit data. 7. Calculate word frequency:
Figure 1 gives an architectural breakdown of the entire • Create a word frequency DataFrame to analyze word
process and outlines the methods used in this study to identify frequencies.
suicidal thoughts. The underlying principle of the created Step 5: Word Cloud Visualization

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.
8. Generate a word cloud: • Remove the 'Unnamed: 0' column if present.
• Generate a word cloud visualization to gain insights into • Check and print basic information about the dataset.
the most frequent words in the dataset. Step 4: Data Visualization
Step 6: Data Splitting and Label Encoding • Visualize the class distribution using a countplot.
9. Split the data: Step 5: Text Length Analysis
• Split the data into training and testing sets. • Calculate the length (in terms of words) of each text.
10. Encode labels: • Analyze the text length statistics, including quantiles.

• Encode the class labels (”Suicide” and ”Not Suicide”) Step 6: Filter Texts Based on Length
using LabelEncoder. • Remove texts with a length exceeding a certain threshold
Step 7: Word Embedding with Pretrained Vectors (317 words in this case).
11. Load pretrained word embeddings: • Visualize the class distribution after filtering.

• Load pretrained word embeddings (GloVe) to create an Step 7: Word Frequency Analysis
embedding matrix for words in the dataset. • Tokenize and count the frequency of words in the text
Step 8: Model Building and Training data.
12. Define the model architecture: • Filter out rare and common words based on quantiles

• Create a Sequential model.

(0.99% and 99.9% quantiles).
• Visualize word frequency statistics.
• Add an Embedding layer with pretrained word embed-
dings. Step 8: Feature Engineering
• Add an LSTM layer with return sequences. • Create binary features for selected frequent words in the
• Add a GlobalMaxPooling1D layer. text data.
• Add Dense layers with ReLU activation. Step 9: Label Encoding
• Compile the model using binary cross-entropy loss and
• Encode the 'class' column, converting 'suicide' to 1 and
SGD optimizer. 'non-suicidal' to 0.
13. Train the model:
Step 10: Data Visualization (Optional)
• Train the model on the training data, monitoring valida-
• Visualize the relationship between the 'class' and the
tion performance.
length of texts.
• Utilize early stopping and learning rate reduction call-
backs. Step 11: Feature Correlation Analysis
• Calculate the Pearson correlation coefficient between
Step 9: Model Evaluation and Visualization
binary features and the target 'class'.
14. Visualize training history:
Step 12: Model Training and Evaluation
• Plot accuracy curves for training and validation data.
• Split the data into training and test sets (1% for training).
Step 10: Model Saving
• Initialize a Random Forest Classifier.
15. Keep the trained model:
• Perform hyperparameter tuning using GridSearchCV.
• Keep the learned model for further use. • Train the Random Forest model on the training data.
Step 11: Suicidal Thought Detection Function • Predict using both training and test data.
16. Create a function to detect suicidal thoughts: • Evaluate the model using accuracy and recall scores.

• Input: Text data.

• Preprocess the input text. The core of our study is the proposed technique, which
• Tokenize, pad, and process the text for prediction. directs our efforts to find and comprehend the existence of
• Use the trained model to predict whether the text contains suicidal ideas in the Reddit dataset. The findings and their
suicidal thoughts (based on a threshold). implications will be presented in the next sections of this
17. Call the detectSuicide function with sample text data for essay as they relate to studies on mental health and suicide
detection. prevention.
IV. I MPLEMENTATION
Algorithm 2
A. Data Collection
Step 1: Import Libraries
The first step entails gathering textual information from
• Import necessary libraries including NumPy, pandas, mat-
Reddit, concentrating on postings pertinent to suicidal and
plotlib, seaborn, warnings, and scikit-learn.
non-suicidal thoughts. This information, which includes posts
Step 2: Load Data from the ”SuicideWatch” and ”depression” subreddits as well
• Load the dataset from the specified file path using Pandas. as non-suicidal messages from r/teenagers, was gathered using
Step 3: Data Preprocessing the Pushshift API.

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.
binary features and the target ’class’. This innovative feature
extraction technique aimed to capture the distinctive patterns
associated with suicidal and non-suicidal posts, providing a
nuanced representation of the dataset. The binary features,
generated through this process, were then utilized for training
and evaluating machine learning models. This tailored feature
extraction methodology holds the potential to reveal intricate
Fig. 2. Dataset Used for Suicidal Thought Detection relationships between specific word occurrences and the clas-
sification of posts, fostering a more insightful understanding
of the dataset and facilitating improved model performance.
B. Data pre-processing
1) Data Extraction: The collected data is extracted from
the API and organized for further analysis.
2) Removal of E-mails Converting to Lower-Case: Firstly,
the text dataset underwent preprocessing where emails were
removed, and the entire dataset was converted to lowercase.
3) Tokenization: Text data is tokenized into individual
words or tokens to facilitate subsequent processing.

Fig. 3. Tokenized Data

4) Lemmatization: Text lemmatization techniques are ap- Fig. 5. Word Frequency

plied to reduce words to their root forms, aiding in the removal
of variations and enhancing feature extraction. This model determines a word’s significance in relation to
5) Stopwords Removal: Common stopwords, such as ”and,” the entirety of the dataset.
”the,” and ”in,” are removed to focus on meaningful content. 1) Word Embedding: For deep learning models like LSTM
and Bi-LSTM, we utilized GloVe and Word2Vec word embed-
dings. GloVe captures global co-occurrence statistics, offering
insights into semantic relationships, while Word2Vec focuses
on local context, enhancing syntactic understanding. This
dual embedding approach enriches our models with nuanced
representations, facilitating improved performance in tasks like
sentiment analysis and post-classification.
D. Algorithm Utilization
Unique algorithms and software packages are designed
expressly for the purpose of helping each step of the process
at hand. Stemming, lemmatization, and stopword elimination
are just a few of the NLP operations that are performed using
the Natural Language Toolkit (NLTK).

Fig. 4. Data Visualization

C. Feature Extraction
In a departure from the conventional TF-IDF method, our
Fig. 6. Separate Variables for Words Based on Their Presence
feature extraction process employed a custom approach to
enhance the interpretability and relevance of features. Specifi-
cally, we crafted binary features based on the frequency of E. Classifier Model
selected words within the text data. This method involved In this study, we investigate the performance of the Random
encoding the ’class’ column, assigning ’suicide’ a numerical Forest classifier and LSTM (Long Short-Term Memory) neural
value of 1 and ’non-suicidal’ a value of 0. Subsequently, networks as two different classifier models for detecting sui-
we calculated the Pearson correlation coefficient between the cidal thoughts. We have also applied Bi-lstm in this research.

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.
V. R ESULTS & D ISCUSSION in identifying complicated sequential relationships within text
This section contains the results of our experimentation data may be limited when compared to Bi-LSTM. This could
and goes into the analysis of the findings obtained by using account for the differences in recall and overall F1 score.
LSTM, Bi-LSTM and Random Forest Classifier models to The performance metrics obtained for the Bi-LSTM, LSTM
detect suicidal ideation in text data. We will first present a and Random Forest Classifier models are summarized in Table
brief assessment of the model’s efficiency in terms of accuracy, 1 below-
precision, recall, and F1 score, followed by a discussion of the
TABLE I
ramifications of these findings. C OMPARISON BETWEEN PARAMETERS
The Bi-LSTM model performs admirably overall, with an
Model Accuracy Precision Recall F1 Score
accuracy of 92.88%. This statistic represents the proportion
Bi-LSTM 92.88% 95.02% 90.50% 92.71%
of correctly classified cases in the test dataset. The model’s LSTM 92.62% 94.73% 90.26% 92.44%
precision score of 95.02% demonstrates its ability to correctly Random Forest
83.75% 86.15% 76.73% 81.17%
classify postings as expressing suicidal ideation while reducing Classifier
false positives. A recall score of 90.50% demonstrates the
model’s ability to recognize true suicidal ideation messages, The table 2 outlines various feature extraction, machine
reducing false negatives. The F1 score of 92.71% strikes a learning, and embedding techniques along with deep learning
good balance between precision and recall, indicating a solid algorithms applied in different studies. The first entry employs
and well-rounded model. The Bi-LSTM model’s exceptional TF-IDF for feature extraction, SVM for machine learning,
performance is due to its capacity to recognize sequential and Word2Vec for embedding, employing LSTM and CNN
dependencies within textual input. The model effectively in deep learning, achieving a notable accuracy of 90.3% as
evaluates the context and complex patterns in the text by reported in [5]. The second study utilizes TF-IDF, LIWC and
leveraging Bidirectional Long Short-Term Memory (LSTM) Sentiments for feature extraction, employing a range of ma-
units, allowing for accurate predictions. chine learning algorithms such as RF, SVM, LR, and ZeroR,
resulting in a high accuracy of 92%, as documented in [13].
Another approach combines TF-IDF, N-Gram, and LIWC for
feature extraction with various machine learning algorithms,
reaching an accuracy of 73.6%, as reported in [9]. Finally, the
current research employs binary feature correlation for feature
extraction, RF for machine learning, and Word2Vec and Glove
for embedding, with Bi-LSTM as the deep learning algorithm,
achieving an impressive accuracy of 93%, as presented in this
paper.
The findings highlight the importance of NLP approaches,
particularly LSTM-based models, in detecting suicidal ideation
in textual data. The LSTM model’s high accuracy, precision,
and recall reveal its ability to detect minor verbal clues
suggestive of suicidal ideation.
Future studies could look into combining multiple NLP
models, creating hybrid models, or including domain-specific
features to improve the efficacy of suicide ideation detection
systems. Efforts to reduce the model’s false positives and false
negatives should also be prioritized, as they have substantial
Fig. 7. Accuracy Curve ramifications in real-world applications. Finally, our research
highlights the significant prospects of NLP approaches, par-
ticularly LSTM models, in the detection of suicidal ideation.
While the Random Forest Classifier achieves a decent While the Random Forest Classifier produces decent results,
accuracy of 83.75%, there are significant discrepancies when the capacity of the LSTM model to catch intricate textual
compared to the Bi-LSTM model. The precision score of patterns is a significant leap in the field of mental health
86.15% indicates that the classifier has a moderately strong research and suicide prevention initiatives.
ability to reduce erroneous positives. The recall score of
76.73%, on the other hand, indicates a modest ability to detect VI. C ONCLUSIONS
true suicidal ideation posts, resulting in a slightly lower F1 In a world dominated by the silent battle of countless
score of 81.17%. people dealing with suicide ideation, our study shines as a
The Random Forest Classifier works on the ensemble learn- beacon of hope and creativity. Our path was distinguished
ing concept, combining predictions from numerous decision by unwavering determination, driven by a desire to apply
trees. While it is useful for a variety of tasks, its efficacy NLP to detect, comprehend and assist people in need. The

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.
TABLE II
COMPARISON WITH PREVIOUS WORKS

Feature Extraction Machine Learning Embedding Deep Learning Best Performing Metric &
Ref
Techniques Algorithms Techniques Algorithms Model Result
TF–IDF SVM Word2Vec LSTM, CNN LSTM-Attention CNN 90.3% [5]
TF-IDF, LIWC, Sentiment RF, SVM, LR, ZeroR NA NA SVM 92% [13]
TF–IDF, N-Gram, LIWC NB, SVM, KNN, RF NA NA NA 73.6% [9]
Binary Feature Correlation RF Word2Vec, Glove LSTM, Bi-LSTM Bi-LSTM 93% This paper

scope of our investigation immediately highlighted a daunting [7] E. J. Diniz, J. E. Fontenele, A. C. de Oliveira, V. H. Bastos, S. Teixeira,
challenge: the scarcity of publicly available datasets necessary R. L. Rabêlo, D. B. Calçada, R. M. Dos Santos, A. K. de Oliveira,
and A. S. Teles, “Boamente: A natural language processing-based
to our attempt. In response, we diligently curated a one- digital phenotyping tool for smart monitoring of suicidal ideation,” in
of-a-kind dataset culled from the candid and heartfelt ex- Healthcare, vol. 10, p. 698, MDPI, 2022.
pressions published on the Reddit platform’s ”SuicideWatch” [8] S. B. Hassan, S. B. Hassan, and U. Zakia, “Recognizing suicidal intent
in depressed population using nlp: a pilot study,” in 2020 11th IEEE
and ”depression” subreddits. This dataset, which spans over Annual Information Technology, Electronics and Mobile Communication
a decade of human emotions, is more than just a resource; Conference (IEMCON), pp. 0121–0128, IEEE, 2020.
it demonstrates our persistent commitment to the goals of [9] M. Polignano, P. Basile, M. de Gemmis, and G. Semeraro, “A compar-
ison of word-embeddings in emotion detection from text using bilstm,
suicide prevention and mental health promotion. Our research cnn and self-attention,” in Adjunct Publication of the 27th Conference
was built around a variety of Natural Language Processing on User Modeling, Adaptation and Personalization, pp. 63–68, 2019.
approaches. We set out on a journey to grasp the complexities [10] M. Kanakaraj and R. M. R. Guddeti, “Nlp based sentiment analysis on
twitter data using ensemble classifiers,” in 2015 3Rd international con-
of human language, the nuances of emotion, and the terrible ference on signal processing, communication and networking (ICSCN),
reality of suicidal ideation. Our method avoided the one-size- pp. 1–5, IEEE, 2015.
fits-all philosophy, instead employing a hybrid of two powerful [11] G. Coppersmith, R. Leary, P. Crutchley, and A. Fine, “Natural language
processing of social media as screening for suicide risk,” Biomedical
models: the LSTM (Long Short-Term Memory), Bi-LSTM informatics insights, vol. 10, p. 1178222618792860, 2018.
and the Random Forest Classifier. We produced exceptional [12] M. Cusick, P. Adekkanattu, T. R. Campion Jr, E. T. Sholle, A. Myers,
results through a dynamic integration of machine learning S. Banerjee, G. Alexopoulos, Y. Wang, and J. Pathak, “Using weak
supervision and deep learning to classify clinical notes for identification
algorithms. In sentiment analysis, the Bi-LSTM model, which of current suicidal ideation,” Journal of psychiatric research, vol. 136,
is known for its ability to capture sequential patterns and pp. 95–102, 2021.
contextual information, achieved an extraordinary accuracy [13] A. E. Aladağ, S. Muderrisoglu, N. B. Akbas, O. Zahmacioglu, and H. O.
Bingol, “Detecting suicidal ideation on forums: proof-of-concept study,”
rate of up to 93%. The Random Forest Classifier, on the Journal of medical Internet research, vol. 20, no. 6, p. e9840, 2018.
other hand, produced equally encouraging results, with a Test
Accuracy of 83.7%. Finally, this thesis represents more than
just a collection of studies; it represents the indomitable spirit
of human compassion and inventiveness. It demonstrates the
revolutionary power of NLP approaches, as demonstrated by
the LSTM and Random Forest Classifier, in addressing today’s
most important concerns. As we embark on an unknown
future, may our work serve as a spark for greater inquiry,
collaboration, and the creation of enhanced suicide prevention
measures.
R EFERENCES
[1] T. Nasukawa and J. Yi, “Sentiment analysis: Capturing favorability using
natural language processing,” in Proceedings of the 2nd international
conference on Knowledge capture, pp. 70–77, 2003.
[2] A. C. Fernandes, R. Dutta, S. Velupillai, J. Sanyal, R. Stewart, and
D. Chandran, “Identifying suicide ideation and suicidal attempts in a
psychiatric clinical research database using natural language processing,”
Scientific reports, vol. 8, no. 1, p. 7426, 2018.
[3] E. Yeskuatov, S.-L. Chua, and L. K. Foo, “Leveraging reddit for suicidal
ideation detection: A review of machine learning and natural language
processing techniques,” International journal of environmental research
and public health, vol. 19, no. 16, p. 10347, 2022.
[4] M. Guidère, “Nlp applied to online suicide intention detection,” in
HealTAC 2020, 2020.
[5] A. Rajput, “Natural language processing, sentiment analysis, and clinical
analytics,” in Innovation in health informatics, pp. 79–97, Elsevier, 2020.
[6] K. Brindha, S. Senthilkumar, A. K. Singh, and P. M. Sharma, “Sentiment
analysis with nlp on twitter data,” in 2022 International Conference on
Smart Generation Computing, Communication and Networking (SMART
GENCON), pp. 1–5, IEEE, 2022.

Authorized licensed use limited to: Zhejiang University. Downloaded on December 21,2024 at 10:34:22 UTC from IEEE Xplore. Restrictions apply.

LR - Farrukh Nadeem, DBA - 60421 Updated
No ratings yet
LR - Farrukh Nadeem, DBA - 60421 Updated
52 pages
Scratch Programming Playground Learn To Program by Making Cool Games 1st Edition Sweigart Download
No ratings yet
Scratch Programming Playground Learn To Program by Making Cool Games 1st Edition Sweigart Download
91 pages
A Suicidal Ideation Detection Framework On Social Media Using Machine Learning and Genetic Algorithms
No ratings yet
A Suicidal Ideation Detection Framework On Social Media Using Machine Learning and Genetic Algorithms
18 pages
Engine Speed Circuit Fault
No ratings yet
Engine Speed Circuit Fault
7 pages
Analyzing Social Media Texts For Suicidal Risk Identification Using Natural Language Processing
No ratings yet
Analyzing Social Media Texts For Suicidal Risk Identification Using Natural Language Processing
5 pages
Suicidal Ideation Detection On Social Media
No ratings yet
Suicidal Ideation Detection On Social Media
46 pages
BDCC 09 00016
No ratings yet
BDCC 09 00016
19 pages
BISMCA Suggestion
No ratings yet
BISMCA Suggestion
3 pages
Ai 1
No ratings yet
Ai 1
3 pages
s41870 023 01725 6
No ratings yet
s41870 023 01725 6
17 pages
Google 15
No ratings yet
Google 15
16 pages
1 s2.0 S1877050923001412 Main
No ratings yet
1 s2.0 S1877050923001412 Main
9 pages
Conf PPT
No ratings yet
Conf PPT
12 pages
Conference PPTT
No ratings yet
Conference PPTT
20 pages
2023 Stacked CNN LSTM Approach For Prediction of Suicidal Ideation
No ratings yet
2023 Stacked CNN LSTM Approach For Prediction of Suicidal Ideation
22 pages
Emotional Health
No ratings yet
Emotional Health
66 pages
Farukh Nadeem Concept Paper Detection of Suicidal Tendencies - 60421 - 20230218
No ratings yet
Farukh Nadeem Concept Paper Detection of Suicidal Tendencies - 60421 - 20230218
13 pages
Sleep Apnea Syndrome Breakthrough by Slidesgo
No ratings yet
Sleep Apnea Syndrome Breakthrough by Slidesgo
39 pages
M Synopsis
No ratings yet
M Synopsis
9 pages
Understanding Mental Health Content On Social Media and It's Effect Towards Suicidal Ideation
No ratings yet
Understanding Mental Health Content On Social Media and It's Effect Towards Suicidal Ideation
15 pages
A Comparative Analysis of Transformer and LSTM Models For Detecting Suicidal Ideation On Reddit
No ratings yet
A Comparative Analysis of Transformer and LSTM Models For Detecting Suicidal Ideation On Reddit
7 pages
Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review
No ratings yet
Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review
23 pages
Suicide Text Classification Using Machine Learning Tecniques
No ratings yet
Suicide Text Classification Using Machine Learning Tecniques
18 pages
Literature Paper
No ratings yet
Literature Paper
8 pages
Suicidal Ideation Detection: A Review of Machine Learning Methods and Applications
No ratings yet
Suicidal Ideation Detection: A Review of Machine Learning Methods and Applications
14 pages
1a+ (192 203) +Ensembled+Machine+Learning+Methods+and+Feature+Extraction+Approaches+for+Suicide Related+Social+Media
No ratings yet
1a+ (192 203) +Ensembled+Machine+Learning+Methods+and+Feature+Extraction+Approaches+for+Suicide Related+Social+Media
12 pages
Emotion Detection and Suicidal Intention Prediction of Differently Depressed Individuals Using Mach
No ratings yet
Emotion Detection and Suicidal Intention Prediction of Differently Depressed Individuals Using Mach
4 pages
Duplichecker Plagiarism Report
No ratings yet
Duplichecker Plagiarism Report
4 pages
Onion Routing
No ratings yet
Onion Routing
37 pages
Dsai Report
No ratings yet
Dsai Report
12 pages
Surveillance
No ratings yet
Surveillance
31 pages
WORK CYCLE 7.2 Web Application Configuration Guide
No ratings yet
WORK CYCLE 7.2 Web Application Configuration Guide
132 pages
Paper
No ratings yet
Paper
10 pages
Research Paper 2.0
No ratings yet
Research Paper 2.0
4 pages
BE Honours (Text, Web and Social Media Analytics
No ratings yet
BE Honours (Text, Web and Social Media Analytics
1 page
С1 Smartwatches Are They as Smart as We Think 1
No ratings yet
С1 Smartwatches Are They as Smart as We Think 1
15 pages
Sonica Eswar Resume
No ratings yet
Sonica Eswar Resume
1 page
Mental Health Analysis in Social Media Posts: A Survey: Muskan Garg
No ratings yet
Mental Health Analysis in Social Media Posts: A Survey: Muskan Garg
24 pages
3706durgam Cheruvu CADASTRAL
No ratings yet
3706durgam Cheruvu CADASTRAL
1 page
Sapramh Ijsr Paper 1
No ratings yet
Sapramh Ijsr Paper 1
7 pages
AI Task
No ratings yet
AI Task
17 pages
Mental Health and Suicide Ideation Assessment With Social Media
No ratings yet
Mental Health and Suicide Ideation Assessment With Social Media
1 page
Mental Health Prediction
No ratings yet
Mental Health Prediction
14 pages
Automatic Identification of Suicide Notes With A Transformer-Based Deep
No ratings yet
Automatic Identification of Suicide Notes With A Transformer-Based Deep
8 pages
Using Machine Learning Algorithms To Detect Suicide Risk Factors On Twitter
No ratings yet
Using Machine Learning Algorithms To Detect Suicide Risk Factors On Twitter
8 pages
B15-Content - Analysis - in - Social - Media (1) - Bbhavani
No ratings yet
B15-Content - Analysis - in - Social - Media (1) - Bbhavani
59 pages
Paper 2 Review
No ratings yet
Paper 2 Review
7 pages
Provide Excellent Office Multifunction Printer in UAE - Konica Minolta Dubai
No ratings yet
Provide Excellent Office Multifunction Printer in UAE - Konica Minolta Dubai
4 pages
IJCRT2106325 BBB
No ratings yet
IJCRT2106325 BBB
11 pages
Lecture 40 Final Review - F24
No ratings yet
Lecture 40 Final Review - F24
56 pages
A Machine Learning Based Depression Analysis
No ratings yet
A Machine Learning Based Depression Analysis
6 pages
2021 Clpsych-1 9
No ratings yet
2021 Clpsych-1 9
6 pages
(IJETA-V10I2P1) :dr. A. Manjula, D. Kalpana, M. Sai Prasad, G. Sanjana, B. Mahender, D. Manisha, M. Abhishek
No ratings yet
(IJETA-V10I2P1) :dr. A. Manjula, D. Kalpana, M. Sai Prasad, G. Sanjana, B. Mahender, D. Manisha, M. Abhishek
10 pages
Suicidal Ideation in Online Posts
No ratings yet
Suicidal Ideation in Online Posts
9 pages
IJNGC Latex Research Paper
No ratings yet
IJNGC Latex Research Paper
10 pages
Assembly Language For Intel - Based Computers, 4 Edition
No ratings yet
Assembly Language For Intel - Based Computers, 4 Edition
32 pages
Duplichecker Plagiarism Report
No ratings yet
Duplichecker Plagiarism Report
2 pages
AIT726 Team8 BVSBTK Checkpoint1
No ratings yet
AIT726 Team8 BVSBTK Checkpoint1
7 pages
Leveraging Machine Learning and NLP For Personalized Mental Health Analysis From Social Media Insights
No ratings yet
Leveraging Machine Learning and NLP For Personalized Mental Health Analysis From Social Media Insights
5 pages
OOPS Interview Questions and Answers 2022
100% (1)
OOPS Interview Questions and Answers 2022
6 pages
Exploratory Analysis of Social Media Prior To A Suicide Attempt
No ratings yet
Exploratory Analysis of Social Media Prior To A Suicide Attempt
12 pages
Mini Porject On Social Media Suicidal Content - Bhavani
No ratings yet
Mini Porject On Social Media Suicidal Content - Bhavani
21 pages
Constructing Depression Prediction Model Using ChatGPT and Machine Learning Algorithms
No ratings yet
Constructing Depression Prediction Model Using ChatGPT and Machine Learning Algorithms
4 pages
Hybrid Decision Tree-Based Machine Learning Models For Short-Term Water Quality Prediction.
No ratings yet
Hybrid Decision Tree-Based Machine Learning Models For Short-Term Water Quality Prediction.
14 pages
Data Sources
No ratings yet
Data Sources
141 pages
Social Media Crime Detection Using Machine Learning Algorithms
No ratings yet
Social Media Crime Detection Using Machine Learning Algorithms
11 pages
s41746 022 00589 7 PDF
No ratings yet
s41746 022 00589 7 PDF
13 pages
Suic Ide Pred Ictio N: Pres Ented To Abh Ijit Path Ak
No ratings yet
Suic Ide Pred Ictio N: Pres Ented To Abh Ijit Path Ak
8 pages
1 Agile Manifesto
No ratings yet
1 Agile Manifesto
39 pages
HP Scitex LX600 & LX 800 Printer Operator Training Guidelines and Checklist
No ratings yet
HP Scitex LX600 & LX 800 Printer Operator Training Guidelines and Checklist
7 pages
ICDSIS-2024 Conference-Template PDF
No ratings yet
ICDSIS-2024 Conference-Template PDF
8 pages
Lecture 09 - Sequential Quadratic Programming
No ratings yet
Lecture 09 - Sequential Quadratic Programming
4 pages
UM S7 Product Data Sheet
No ratings yet
UM S7 Product Data Sheet
2 pages
Suicidal Ideation Detection Using Colbert Project Report
No ratings yet
Suicidal Ideation Detection Using Colbert Project Report
14 pages
TESUP ATLAS7 Wind Turbine User Manual
No ratings yet
TESUP ATLAS7 Wind Turbine User Manual
31 pages
Predicting Stress, Anxiety, and Depression From Social Media Comments: A Holistic Multi-Modal Deep Learning and NLP Framework
No ratings yet
Predicting Stress, Anxiety, and Depression From Social Media Comments: A Holistic Multi-Modal Deep Learning and NLP Framework
6 pages
Installation Procedure
No ratings yet
Installation Procedure
9 pages
Mental Health 2
No ratings yet
Mental Health 2
1 page
Counters: "Registers" Section
No ratings yet
Counters: "Registers" Section
10 pages
Risk Assessment of It Security Possible Solutions and Mechanisms To Control It Security Risk Unit 8: Security
No ratings yet
Risk Assessment of It Security Possible Solutions and Mechanisms To Control It Security Risk Unit 8: Security
15 pages
AI For Generation of Images
No ratings yet
AI For Generation of Images
2 pages
Bacnet MS TP
100% (1)
Bacnet MS TP
44 pages
Deep Reinforcement Learning: An Essential Guide
From Everand
Deep Reinforcement Learning: An Essential Guide
Robert Johnson
No ratings yet
List of Colour
No ratings yet
List of Colour
13 pages
Ansible: Architecture
100% (1)
Ansible: Architecture
7 pages
Data Science Unveiled: A Practical Guide to Key Techniques
From Everand
Data Science Unveiled: A Practical Guide to Key Techniques
Ed A Norex
No ratings yet
HV 48V 80AH LiFeP04
No ratings yet
HV 48V 80AH LiFeP04
1 page
Inject-Concerning Transmitters and Receivers by Peter Neuthinger
No ratings yet
Inject-Concerning Transmitters and Receivers by Peter Neuthinger
5 pages
Informatics Practices Class 12
No ratings yet
Informatics Practices Class 12
8 pages
Explain Each of The Following Symmetric Key Algorithms in 50-100 and List at Least Two (2) Usages For Each of Symmetric Key Algorithms
100% (1)
Explain Each of The Following Symmetric Key Algorithms in 50-100 and List at Least Two (2) Usages For Each of Symmetric Key Algorithms
9 pages

Suicidal Thought Detection Using NLPNatural Language Processing On Reddit Data

Uploaded by

Suicidal Thought Detection Using NLPNatural Language Processing On Reddit Data

Uploaded by

2023 26th International Conference on Computer and Information Technology (ICCIT)

13-15 December, Cox’s Bazar, Bangladesh

Suicidal Thought Detection Using NLP(Natural

979-8-3503-5901-5/23/$31.00 ©2023 IEEE

• Create a Sequential model.

• Input: Text data.

Fig. 3. Tokenized Data

4) Lemmatization: Text lemmatization techniques are ap- Fig. 5. Word Frequency

Fig. 4. Data Visualization

You might also like