Revise 10
Revise 10
Revise 10
AND TECHNOLOGY
Anthonette Camosa-Azares
Capstone Adviser
AUGUST 2023
Introduction
The creation of social media platforms in recent years has changed the
educators, parents, and mental health experts about the rising occurrence
interest in order to effectively treat and alleviate these mental health crises.
Random Forest is a versatile and robust algorithm that has been shown to
social media and mental health by examining multiple data points and
issues among students has been highlighted in the existing research. Early
one’s psychological, emotional, and social well-being. It affects the way how
one thinks, feels, and acts. Mental health is very important at every stage of
health issues using several accuracy criteria and one of them is the Random
them and also obtained the most accurate one in Stacking technique based
with an accuracy of prediction 81.75. Also, Sun, Q., & Ding, H. (2023, July),
prevention. At the same time, it also puts forward the reflection and
Random Forest predictive analysis to social media data for mental health
The main purpose of this study is to predict the ASSCAT students who
researchers will use a predictive model that can detect early signs of mental
health issues based on social media usage patterns. This model can help in
early intervention and the provision of appropriate support to students who
may be at risk.
before it occurs.
college
This study aims to predict the ASSCAT students who are suffering
Forest Algorithm.
usage patterns.
Scope
The study will use Facebook posts and interviews as the primary data
The study will only focus on detecting early signs of Mental Health
support to student.
Delimitation
The study will not include data from other sources such as medical
records.
analysis.
Input Process
Output
Predictive analysis of early
plan
and Facebook use and interviews. Machine learning technique will use to
hybrid Random Forest and Artificial Neural Network (RF-ANN) model that
In the study of Macalli, M., Navarro, et al., (2021) Suicidal ideas and
traits, mental health, and drug use, to predict suicidal thoughts and
behaviors at follow-up. The area under the receiver operating curve (AUC),
higher levels of distress during a global health crisis. The goal of their
post-traumatic stress disorder were more common overall than they would
have been in the absence of a pandemic and at greater rates than those
observed among healthcare personnel and those who had survived severe
Huljanah, M., Rustam, Z., Utama, S., & Siswantining, T. (2019, June) Data
decision tree. The precision to be attained will improve as more trees are
used. The Random Forest Classifier can handle big sample sets and can
development, and lifespan spans more than 40 years, Van Agteren, J., et al.,
trees that are grown by bootstrapping samples of the original data. The final
provides the expected forecasting results given the series of events in the
DATA IDENTIFICATION
DATA COLLECTION
(Sci-kit Learn (Python)
DATA TRANSLATION
(QUILLBOT)
STATISTIC ANALYSIS
DATA CLEANING
(Pandas Python)
FEATURE EXTRACTION
(TF-IDF)
INTEGRATION EXTRACTION
RANDOM FOREST
ALGORITHM
PREDICTIVE ANALYSIS
PREVENTION PLAN
Figure 2. Overall Methodology
Data Identification
In this study, the researchers will collect the relevant data sources
Data Collection
Figure 3 above shows how the researchers will get the data from
posts.
Data Translation
Data Translated
Google
Collected Data
Translator
to translate the data collected. The researchers will use quillbot and select
the text that want to translate. Then, on the toolbar, select
Data Cleaning
values, remove duplicate, and normalize the data. This step ensures the
Missing values,
Translated remove
Data duplicate and
normalize data
Data cleaned
In this figure, the researchers will use Pandas. Data can be cleaned
using Pandas (Python). In this step, it will clean data to remove noise,
Feature Extraction
Feature extraction using TF-IDF involves transforming Facebook
terms within each post. These features will be used to train the predictive
model and identify potential mental health crises among ASSCAT students.
Integration Extraction
This will create a combined matrix where each row corresponds to a data
term.
and extract feature importance to understand which terms from the text
Prevention Plan
After will train the algorithm and evaluating the model’s performance,
accurate information, and avoids causing harm. The researchers will speak
with the guidance counselors and inquire about the best methods to align
our preventive strategy with the school’s prevention plan as part of the
methodology.