0% found this document useful (0 votes)
13 views22 pages

06 - Chapter 2

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views22 pages

06 - Chapter 2

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

2.

LITERATURE REVIEW AND RELATED WORKS

2.1 BACKGROUND

Data mining and information retrieval are interdisciplinary fields


focused on extracting hidden patterns and retrieving relevant information from
vast datasets. Data mining involves uncovering insights using techniques like
machine learning [23], clustering, and classification. At the same time,
information retrieval focuses on efficiently retrieving data based on user
queries through methods like indexing and ranking. These fields combine
mathematics, statistics, computer science, and information science principles,
employing tools such as databases, envision, and machine learning
algorithms to manage, analyze, and envisage structured and unstructured
data, thereby facilitating scientific discovery and practical applications [24].
Information extraction (IE) is the automated conversion of
unstructured text into structured data, like relational databases, by identifying
entities (concepts and instances) and relationships [25]. It is vital for text
grading, converting web content into semantic web data, and identifying
clinical trial candidates [26]. Challenges include handling the complexity of
natural language and ensuring the quality of input data, which can contain
inaccuracies and inconsistencies [27]. Advanced natural language processing
and domain-specific knowledge are needed to tackle these challenges
effectively [27].
Naïve Bayes classifiers are simple probabilistic classifiers used in
machine learning. They leverage probability theory, making them highly
scalable and adaptable [28]. They require a small number of parameters that
scale linearly with the number of features in the dataset, making them efficient
for various applications, especially in text classification [29].
Psychological stress serves as a foundational factor contributing to
various health issues and mental disorders. Prolonged exposure to chronic
stress significantly elevates the likelihood of developing numerous health
problems, including insomnia, obesity, heart disease, and cancer, among
others [30].
Figure 2.1: Effects of Stress on the Body

Figure 2.1 depicts the effects of stress on the bodies of human


beings. The Autonomic Nervous System (ANS) regulates many major
physiological functions, including as breathing, organ emission, circulatory
strain, and heart electrical activity [31]. The sympathetic nervous system
(SNS) and the parasympathetic nervous system (PNS) are the two divisions
that make up the autonomic nervous system (ANS). The SNS readies the
body's resources for action under stressful circumstances. . In contrast, the
PNS is responsible for relaxing the body and restoring it to a stable state [32].
In humans, the sympathetic nervous system (SNS) initiates the
stress response, which includes behavioral, physiological, and psychological
symptoms [33] [34]. The following definitions are applied to various answer
groups throughout this thesis: Psychological describes or has to do with the
mind or mental processes, not with carrying out an action [35]. Since
physiological reactions are an involuntary activity or reaction that makes up a
sizable amount of a living thing's everyday functioning, they might be complex
or impossible to observe with the unaided eye [36].

The dynamic process of persons or groups exchanging different


activities is known as social interaction. It was created by sociologist Erving
Goffman [15]. Social media interaction is defined as using an online platform
by people to develop social bonds with others who share their interests, goals,
backgrounds, or real-life connections [10].
In recent years, the pervasive use of social media platforms has created
an unprecedented opportunity to explore the intersection between technology
and mental health [38]. Psychological stress is a pervasive issue affecting
individuals globally, and its early identification is crucial for effective
intervention and support. The ubiquity of social media has transformed it into
a rich source of real-time, user-generated data that reflects individuals'
thoughts, emotions, and experiences. This data, if harnessed appropriately,
can be leveraged to detect and understand psychological stress, a prevalent
and often under diagnosed mental health condition [38].

Figure 2.2: Death from COVID-19

Figure 2.2 depicts the deaths that occurred during COVID-19.


Comparing experiences between nursing homes and hospitals, the study
sought to ascertain if end-of-life (EOL) conversations occurred and whether
patients were present during their last moments, particularly regarding COVID-
19 fatalities [37].
The prevalence of psychological stress, encompassing anxiety and
depression, is a major global concern recognized by organizations like the
World Health Organization (WHO) due to its significant impact on public health
[39]. The COVID-19 pandemic, declared a worldwide epidemic and a public
health emergency by the WHO, has led to profound mental health and
psychological consequences on a global scale [8].
Social media serves as a digital platform where users openly express
a spectrum of emotions, offering a rich repository of data reflecting their mental
states. Stress is notably conveyed through various forms of social media, such
as textual content, emoticons, and social interactions [40]. Recognizing the
shift towards digital mental health solutions, there's a growing consensus on
leveraging platforms like social media for unobtrusive and continuous
monitoring, thereby presenting an opportunity to address mental health
challenges in innovative ways [13][41].

2.2 TRADITIONAL STRESS DETECTION METHODS

Traditional methods of diagnosing psychological stress, including


surveys and clinical interviews, are hindered by scalability issues, timeliness
constraints, and limitations in capturing real-world dynamics [42]. However,
physiological computing presents a promising approach to analyzing users'
psychological states using psychophysiological indicators [43]. This is
augmented by advancements in human-computer interaction and the
expansion of communication bandwidth through smart technology
development, which enhance the analysis capabilities [44]. Researchers can
now detect stress more effectively using physiological sensors, thus gaining a
comprehensive understanding of user experiences with technology.
Integration of new applications empowers even novice users in
stress detection. Psychologists recognize the interconnectedness of emotions
and physiology. Techniques such as heart rate monitoring, electrodermal
activity, heart rate variability, electromyography, pupil diameter measurement,
and galvanic skin response are employed for stress detection [45]. Traditional
methods, like EEG signals, engage users in the system to reduce stress. For
mental stress detection, a hysteresis process analyzes signals like the
electrocardiogram (ECG), pupil diameter (PD), and photoplethysmogram
(PPG), extracting relevant features through soft computing techniques.
Genetic algorithms select augmented features loaded into a fuzzy support
vector machine (FSVM) for stress and relaxation classification [45] [46].
Analysis of social media interactions on platforms where users share
content offers insights into stress states and user activity durations. Deep
learning machine learning approaches analyze stress levels based on
individual tweets and weekly posting patterns, reframing stress analysis within
the context of social media behaviour [47].

2.3 ADVANCED STRESS DETECTION METHODS

The emergence of social media as a ubiquitous platform for


communication provides an opportunity to develop an innovative hybrid model
that can automatically analyze user-generated content to detect signs of
psychological stress.
Machine learning and natural language processing (NLP)
advancements have opened new possibilities for automated large-scale social
media data analysis. Hybrid models, combining various techniques, have
shown promise in addressing the complexities of mental health detection [27].
Early detection of psychological stress is critical for preventing the
exacerbation of mental health conditions. Social media offers a platform for
proactive intervention by identifying signs of distress in real time [48].
Successful stress detection model development requires collaboration
between psychology, data science, and machine learning experts. Combining
domain knowledge with technical expertise enhances the model's
effectiveness [49].
Social media content is not limited to text; it includes videos and
other forms of multimedia. A hybrid model can effectively integrate various
data modalities to capture the complexity of stress expressions [50]. Social
media is characterized by its dynamic and ever-evolving nature. A hybrid
model can adapt to user behaviour and platform dynamics changes, providing
a more robust and resilient solution [38]. Combining different machine learning
and NLP techniques in a hybrid model can enhance accuracy and
generalization, improving the model's ability to detect stress across diverse
user populations and cultural contexts [51]. A hybrid model can facilitate real-
time or near-real-time analysis, enabling timely intervention and support for
individuals exhibiting signs of psychological stress [52].
The background reveals the intersection of mental health
challenges, social media's expressive nature, technological advancements,
and the need for innovative solutions [17]. The motivation lies in leveraging a
novel hybrid model to bridge the gap between traditional diagnostic
approaches and the dynamic, real-world expressions of psychological stress
on social media [10].

2.4 OTHER STRESS DETECTION METHODS

In Modern times, psychological stress is starting to pose a


serious threat to people's Well-Being [53]. Recognizing and controlling stress
before it worsens and causes more significant issues is vital. Figure 2.3 shows
four techniques for identifying psychological stress [32].

Figure 2.3: Taxonomy of Stress Detection Methods


2.5 STRESS MANAGEMENT TECHNIQUES

Psychological reactions include a marked rise in unpleasant feelings


like anger, anxiety, irritation, and sadness [54]. Stress can be assessed using
a self-report questionnaire or an interview with a psychologist. [55]. The first
method, widely regarded as a reliable approach to measuring individual stress
levels, is extensively used. This includes tools such as the Stress Self Rating
Scale (SSRS), the Perceived Stress Scale (PSS), and, in some instances, the
Stress Response Inventory (SRI) [56]. However, these questionnaires only
provide information about the client's current anxiety or stress levels, not the
underlying stressors or the progression of stress. While these tests can be
administered periodically, they may not be effective for detecting subtle
changes that could signal the early stages of a major issue. Typically, they are
used only when the individual or those around them recognize or suspect the
severity of the situation, which, in most cases, is too late. Additionally,
questionnaires are subjective [57] and require the client's attention. Research
indicates that "individuals can experience memory lapses regarding the
emotional tone of a day in as little as 24 hours" [58], suggesting that people
may not always be aware of their true stress levels. Consequently, self-report
questionnaires can sometimes result in inaccurate stress level assessments
[57].
Psychological evaluation is critical in developing a stress detection
model from social media data. Understanding the intricacies of human
behaviour, language nuances, and emotional expressions is essential for
creating a hybrid model that accurately identifies psychological stress.

2.6 KEY ASPECTS OF PSYCHOLOGICAL EVALUATION IN THIS


CONTEXT

Incorporating principles of emotional intelligence is crucial. The


model should be trained to recognize and interpret emotional cues in social
media content, understanding the subtleties of stress-related expressions [59].
Deepening the model's psycholinguistic analysis capabilities enables it to
grasp the underlying emotions and sentiments in the text. This involves
understanding how language choices, tone, and linguistic patterns correlate
with psychological stress indicators [60]. Psychological stress often manifests
in specific contexts. The model should be equipped to contextualize social
media content, considering factors such as life events, relationships, and
recent experiences to assess stress [61] accurately.
Recognizing that individuals express stress differently is vital. The
model should be sensitive to diverse personality traits, coping mechanisms,
and communication styles, ensuring a nuanced evaluation that accounts for
individual differences [62]. Identifying changes in behavioural patterns is a key
aspect. The model should analyze shifts in posting frequency, engagement
with others, and content themes to detect deviations indicative of stress-
related changes. Cultural nuances significantly influence how stress is
expressed. A psychologically informed model should consider cultural
variations in language, expressions, and societal norms to avoid
misinterpretations and enhance cross-cultural applicability [63].
Analyzing how users interact with their social environment provides
valuable insights. The model should assess changes in social connections,
communication patterns, and engagement levels to understand the social
dynamics influencing stress [64]. Stress is a dynamic process that evolves.
The model should adopt a longitudinal perspective, considering historical data
and tracking changes in stress indicators, to comprehensively understand an
individual's mental well-being [65]. Ethical principles in psychological
evaluation are paramount. The model should respect user privacy, avoid
stigmatization, and prioritize the responsible use of mental health information
[66]. Collaboration with psychologists and mental health professionals is
crucial. Integrating domain expertise ensures the model aligns with
established psychological frameworks and ethical guidelines [67]. Establishing
a feedback mechanism for continuous improvement is essential. The model
should be designed to learn from user feedback and adapt to evolving
expressions of psychological stress on social media [68].
A psychologically informed approach to stress detection from social
media involves integrating principles of emotional intelligence, psycholinguistic
analysis, contextual understanding, and cultural sensitivity into the design of
the hybrid model [19]. By incorporating these psychological dimensions, the
model can achieve a more accurate and nuanced assessment of
psychological stress, contributing to early intervention and support.

2.7 PHYSIOLOGICAL INDICATORS

Stress can manifest through objectively measurable and observable


biomarkers using wearable physiological sensors. [70]. When sympathetic
nervous system (SNS) activity increases, it alters the body's hormonal levels
and triggers responses such as increased secretion production, elevated heart
rate, and heightened muscle activity. As a result, temperatures drop, and heart
rate variability (HRV) decreases [71, 69]. These alterations are indicators and
provide information via wearable sensors to identify stress. The following
categories apply to the physiological indicators of stress and the technology
that measures them.
 Activity of Brain: Electroencephalography (EEG), Functional Magnetic
Resonance Imaging (fMRI) [75].
 Activity of Heart: Electrocardiography (ECG) [72] [74].
 Response of Skin: Electrodermal Activity (EDA), Galvanic Skin Response
(GSR) [73] [76].
 Activity of Muscles: Electromyography (EMG) [77].
 Response of Respiratory: Respiratory rate monitors [78].
 Eye gaze, blinking, and pupil diameter (PD): infrared eye tracking systems
[79]
The most common methods, ECG and electrodermal activity, are
discussed below.
As discussed in this section, several physiological signals have been
employed in stress detection, with some proving effective in providing reliable
real-time information about individuals' stress levels. However, a significant
drawback of most methods is the requirement for additional hardware, which
can make them intrusive in everyday life. Despite efforts, such as those
highlighted in recent research [80], to develop wearable physiological
monitoring systems for simplicity, users are still required to continuously wear
this equipment, which can remain conspicuous and financially inaccessible for
some individuals.
Incorporating physiological signals along side social media data can
significantly enhance the accuracy and depth of psychological stress detection
[81]. Integrating physiological signals provides a holistic approach, capturing
both explicit expressions on social media and subtle physiological changes
indicative of stress. Here are key considerations for integrating physiological
signals into the hybrid model:
Integrating wearable devices, bio sensors, and cameras to collect
physiological signals such as heart rate variability (HRV), skin conductance,
electrodermal activity (EDA), and facial expressions can significantly enhance
stress detection models [82]. By understanding physiological changes
associated with stress, like increased heart rate and alterations in skin
conductance and facial expressions, we can implement real-time monitoring
systems that provide timely insights [75]. A hybrid model that processes and
analyzes physiological data concurrently with social media content enables
immediate stress detection. This multimodal fusion of physiological signals
and social media data offers a comprehensive understanding of an individual's
mental state by leveraging complementary information from both sources [83].
Effective feature extraction and representation of physiological signals
are crucial, transforming raw data into meaningful features for model
integration. Machine learning algorithms can then identify complex interactions
between social media content and physiological signals, improving stress
detection accuracy [19] and [50]. Ensuring user consent and privacy,
establishing individual physiological baselines, and aligning physiological
signals with contextual social media information is essential for ethical and
accurate stress interpretation [84].
Validating the model with ground truth data, such as self-reported stress
or clinically validated measures, ensures reliability. The model's dynamic
adaptability to changing physiological responses over time and
interdisciplinary collaboration with experts in physiology and biometrics further
enhance its effectiveness [85]. This integrated approach holds promise for
advancing stress detection and promoting holistic mental health monitoring by
combining explicit and implicit indicators of psychological stress.
By combining social media data with physiological signals, the hybrid
model gains a more nuanced understanding of an individual's psychological
stress, leveraging explicit and implicit indicators. This integrated approach
holds promise for advancing the field of stress detection and promoting holistic
mental health monitoring.

2.8 BEHAVIOURAL RESPONSES

Behaviour is how individuals or groups respond in specific situations,


guided by established protocols, behavioural norms, or accepted social
practices [86]. Stress significantly impacts behaviour, often leading to
observable changes such as increased aggression or irritability, although
these changes can be challenging to quantify easily [87]. Other potential
behavioural shifts can be explored by analyzing how individuals interact with
devices to establish reliable methods for measuring and confirming their
association with stress.
Comparing the advantages of measuring behavioural responses
with physiological measurements, behavioural assessments often offer a
subtle approach that does not necessitate expensive additional equipment
[32].
Understanding and analyzing behavioural responses exhibited on
social media is crucial for a comprehensive psychological stress detection
model. Behavioural cues can offer valuable insights into an individual's
emotional state, providing complementary information to text and physiological
signals [88]. Integrating behavioural responses enhances the hybrid model's
ability to detect psychological stress accurately [89]. Here are key
considerations:
Behavioural indicators should be identified to enhance psychological
stress detection, including changes in posting frequency, language patterns,
social interactions, engagement levels, and specific emojis or expressions
[53]. Sentiment analysis of social media content can gauge emotional tone,
detecting shifts that signal distress [60]. Analyzing social interaction patterns,
such as reduced engagement or withdrawal, and monitoring activity and
posting patterns, like unusual posting times, can reflect stress-induced
behavioural changes. Network analysis can reveal alterations in social
connections, providing additional behavioural cues. Considering temporal
dynamics in behaviour, such as abrupt or gradual changes, offers insights into
the evolving nature of stress.
Integrating behavioural responses with text analysis and
physiological signals through multimodal fusion enhances the model's ability
to capture complex stress manifestations. Assessing user engagement
metrics, such as response rates to comments or messages, and analyzing
linguistic patterns beyond sentiment, including specific words or phrases
associated with stress, contribute to more accurate detection. Machine
learning algorithms can be trained to recognize deviations from baseline
behaviour, improving the model's efficacy.
Establishing a feedback mechanism to refine the model based on
user input ensures it adapts to individual variations in stress responses. Ethical
considerations, such as respecting user privacy, avoiding stigmatization, and
ensuring responsible use of behavioural information, are paramount in the
analysis process. Combining behavioural indicators with other data modalities,
this comprehensive approach holds promise for advancing stress detection
and promoting holistic mental health monitoring.
Integrating behavioural responses into the hybrid model makes it
more adept at capturing how individuals manifest psychological stress on
social media. Combining behavioural cues with other data modalities, this
comprehensive approach contributes to a more nuanced and accurate
psychological stress detection system.

2.9 CATEGORIZATION OF BEHAVIORAL MEASURES

The following categories apply to behavioural stress assessments and


the technology that supports them:
o Mobile phone usage: analyzing information related to users [18] to detect
behavioural patterns indicative of stress.
o Analysis of Speech: Voice stress analysis [6, 24] can be used to determine
stress levels by analysing variations in vocal features.
o Facial expression: employing automated facial expression analysis (AFEA)
to interpret emotional cues through facial movements.
o Body gesture: utilizing automated gesture analysis (AGA) to detect stress-
related body language and movements.
o Typing rhythm: assessing stress levels through keystroke and mouse
dynamics, which can indicate changes in typing speed and accuracy.
These methods provide insights into an individual's stress response by
monitoring various behavioural cues and interactions in real time or through
retrospective analysis.
In contemporary stress detection methods, speech analysis and
mobile phone usage are widely studied techniques. Behavioural stress
recognition estimations are less prevalent than physiological processes and
may not have been thoroughly researched enough, resulting in less precise
stress detection outcomes. However, these behavioural methods show
promise because they yield promising results and do not require additional
hardware, making them accessible and convenient for users.

2.10 INFORMATION RETRIEVAL MODELS

Information retrieval (IR) is widely used in various research fields,


such as information retrieval modelling, query processing, ranking, indexing,
and searching [90]. Document modelling, web searching, digital libraries,
classification, clustering, retrieval assessment, and many other areas have
text in structured, unstructured, or semi-structured formats [91]. Information
extraction and data mining techniques are extensively used in social network
analysis and email mining. Moreover, the information exists in general text,
XML and structured formats, emoticons, audio video source code,
applications, and web services [92]. The Survey of the International Data
Corporation (IDC) confirms that unstructured data occupies > 80% of digital
space [93]. IDC forecasts that in 2017, unstructured data will account for
79.2% of competence shipped, with an estimated income of 57.3%. IR aims
to find the most relevant information to the given query [94].
Definition 1: Information retrieval is finding material (usually
documents) of an unstructured nature (usually text) that satisfies an
information need from within large collections (usually on local computer
servers or from cyberspace) [95].
Definition 2: An information retrieval model is defined as a quadruple
[D, Q, F, R (qi, dj)] [96], where each component serves a specific role in the
retrieval process.
D is a set composed of logical views for the documents in the collection.
Q is a set of logical views for the user's information needs. Such
representations are called queries.
F is a framework for modelling document representations, queries, and
relationships.
R (qi, dj) is a ranking function that associates a real number with a query
qi Q and a document representation dj D. Such ranking defines an order
among the documents. D represents a collection comprising logical views of
documents.
F serves as a framework for modelling how documents and queries are
represented and their interrelationships.
R (qi, dj) denotes a ranking function that assigns a real number to a
query qi from Q and a document representation dj from D. This ranking
function establishes an order of relevance among the documents relative to
the query about query q.

Figure 2.4: Framework of Information Retrieval Search Cycle Process


The information retrieval system performs the task of searching for
information from the past and the future. The steps involved in the search for
information, as seen by the naïve users, are depicted in Figure 2.4. However,
inside the IR model, before searching a set of documents, the pre-processing
task is executed on the Text (Query), which involves tokenization of text,
removing stop words such as articles (the, a, an), prepositions (on, at, im) and
pronouns (for example, a, about, above, according to, across, after,
afterwards. again, against, albeit, all, almost, alone, already, also, although,
always, among, as, at), performing stemming operation to find the base word
(stripping of suffixes and prefixes from the words, for example, ed, ing, ly, pre)
[97] [98], and finally the documents that are most closely matched with the
words of the query are retrieved. Documents are stored using a set of index
terms, as depicted in Figure 2.5(logical view) [94].
The traditional similarity Information Retrieval models are Boolean and
vector-space. Probabilistic and language models use the indexing process to
store documents and queries. The principles and characteristics of those
models are discussed in this section. Apart from this, other extended IR
models, such as set-theoretic models, algebraic models, and extended
probabilistic topic models, are grouped under the Representation and
Reasoning category, which is named a vertical taxonomy of IR models and
tools [99].
Figure 2.5: Logical View of Documents: From Full Text to a Set of
Index Terms

In the year 1958, Luhn introduced the idea of information filtering


known as “Selective Dissemination of Information” [100]. This section
examines the traditional retrieval models that serve as the cornerstone of IR
research, focusing particularly on classic retrieval models that continue to
shape modern Information Retrieval techniques. However, various extended
IR models use statistical and probabilistic mathematical theories.
Information retrieval (IR) models are systems and algorithms
designed to find relevant information from large collections of data [101].
These models are essential in search engines, digital libraries, and other
applications where retrieving relevant information efficiently is crucial. Here
are some key IR models:

 Boolean Model

The Boolean model uses Boolean logic to match documents with


queries, representing both as sets of terms. It employs operators such as AND,
OR, and NOT to combine terms. While this approach is simple and easy to
implement, it cannot rank results by relevance [102].

 Vector Space Model (VSM)

The Vector Space Model (VSM) conceptualizes documents and


queries as vectors in a multidimensional space. Each dimension represents a
term, and the weight of each term can be determined by its frequency. The
similarity between a document and a query is measured using cosine
similarity, which allows the ranking of documents by relevance based on the
angle between the vectors [102].

 Probabilistic Model

Based on probability theory, the probabilistic model estimates the


likelihood that a given document is relevant to a query. The most common
probabilistic model is the Binary Independence Model, which uses the
presence or absence of terms to estimate relevant probabilities [102].

 Language Models

The language model approach treats each document as a probability


distribution over terms, with queries also treated as distributions. The goal is
to find the document model that best explains the query. A common approach
within this framework is the query likelihood model, which estimates the
probability of generating the query from each document [102].

 Latent Semantic Analysis

Latent Semantic Analysis (LSA) employs singular value


decomposition (SVD) to reduce the dimensionality of the term-document
matrix. This technique captures hidden patterns in the data, revealing
relationships between terms that may not be apparent initially. Documents and
queries are subsequently represented in this lower-dimensional space to
uncover similarities [102].

 BM25 (Best Matching 25)

BM25 is a probabilistic model that extends the Binary Independence


Model by incorporating term frequency, document length, and other factors
into the relevance score. It is widely used in search engines due to its
effectiveness in ranking documents [102].

 Neural Network-based Models

Neural network-based models use deep learning techniques to


capture complex patterns in the data. Models such as Convolutional Neural
Networks (CNNs) and Recurrent Neural Networks (RNNs) are used for
information retrieval tasks. Transformers, like BERT and GPT, have shown
significant improvements in understanding context and semantics in queries
and documents. Although these models often require large amounts of data
and computational resources for training, they provide state-of-the-art
performance [102].

 Knowledge-based Models

Knowledge-based models leverage structured data and ontologies


to understand and retrieve relevant information. They use knowledge graphs
to represent entities and relationships, enhancing the retrieval process with
semantic understanding. These models are particularly useful in domains
where precise understanding of terms and relationships is crucial, such as
biomedical information retrieval [102].
Each of these models has its strengths and weaknesses, and the
choice of model often depends on the specific requirements of the retrieval
task, the nature of the data, and the computational resources available. In
practice, hybrid models that combine elements from different approaches are
also used to achieve better performance [103].

2.11 SOCIAL MEDIA INTERACTIONS

A study indicates that an individual's writing style offers insights into


their emotional state [37], suggesting that their writing can change according
to their stress levels. Stress can sometimes enhance a person's writing
abilities, resulting in higher-quality work and a more diverse vocabulary.
Conversely, mood can be directly reflected in the content, particularly
spontaneous messages. Therefore, analyzing text linguistics can provide
valuable information for a stress recognition system.
The interactions and dynamics within social media platforms provide
a rich source of information for understanding an individual's psychological
well-being. Integrating social media interactions into a hybrid model enhances
the overall efficacy of psychological stress detection. Here are key
considerations for leveraging social media interactions in the model:
Various aspects of user behaviour must be analysed to detect
psychological stress through social media interactions [104]. This includes
examining the frequency of interactions such as posting, commenting, liking,
or sharing content, as sudden changes in these rates can indicate shifts in
well-being. Engagement levels within social circles should also be assessed,
as changes in the intensity of interactions with friends, family, or online
communities may reflect shifts in social support or isolation linked to stress.
Evaluating the sentiment of interactions, including comments and replies, can
reveal increased negativity or decreased positivity, indicating stress-related
changes in social dynamics [105]. Applying social network analysis helps
understand the structure and dynamics of social connections, where changes
in network centrality, connection alterations, or communication pattern shifts
may be stress-related.
Identifying indicators of social isolation, such as reduced responses
to mentions or declining group participation, provides valuable insights into
stress-related behavioural changes [106]. Considering peer comparisons can
also be informative, as excessive comparisons or negative reactions to peers'
achievements may signal stress [107]. Temporal patterns of interactions, such
as late-night or early-morning activity, can be associated with stress-related
sleep disturbances [108]. Assessing changes in communication style,
including shifts from detailed to brief or cryptic messages, can further indicate
stress. Monitoring participation in online communities or forums, where
reduced engagement or changes in contributions suggest stress-related
avoidance behaviours, is also crucial.
Integrating social media interaction data with other modalities like
text analysis, physiological signals, and behavioural responses enhances the
model's ability to capture the multifaceted nature of psychological stress [56]
[109], thereby ensuring user privacy.
Numerous tools, such as SentiStrength [110] and TensiStrength
[111], are currently available to automatically analyse text features. These
tools can assess writing performance through lexical measures or by directly
analyzing the "emotions" conveyed in the text, which is their primary purpose.
Researchers [112] and [111] have leveraged this technology to analyze online
posts and determine user stress levels based on the content.

2.12 SEMANTIC WEB AND SOCIAL ENGINEERING

In 1994. Tim Berners-Lee discovered the World Wide Web


Consortium named W3C (Web 2.0). Later, the term "Semantic Web" is derived
that extends the current web from the web of Linked documents to the web of
linked data, which is supposed to be an extension of XML that has RDFS with
an identifier (rdf: ID) for binding the RDF data model nodes together to build a
graph (triplets) [113]. The Semantic data of the web is well-organized and
linked so that machines use web data (metadata) for automation, integration,
interoperability, and reuse across a range of applications in a specific contest
that resembles an intelligent knowledge-based tool [114]. The foundational
elements for constructing the semantic web include Description Logic (DL),
ontology languages such as RDF/S and OWL, ontologies, and knowledge
bases [25]. As well stated in two notable statements by Tim Berners-Lee et.al,
"adding logic to web provides means to use rules to make inferences" and
"logic must be powerful enough to describe complex properties of objects"
Based on the given statement, the hierarchy of semantics is illustrated as
shown in figure 2.6 from weak semantics to strong semantics [115].

Figure 2.6: Hierarchy of Levels of Semantics

Controlled Vocabulary is a repository that stores a list of terms, for


example. To order the item(s) online from the ebay.com website, you must
click, browse, and select an item(s). Taxonomy [116] is a subject-controlled
tree classification that arranges the terms of controlled vocabulary into the
hierarchy to avoid ambiguity between the terms [117]. Similarly, a Thesaurus
is a structured collection of terms in a controlled vocabulary, interconnected
by conceptual relationships such as synonyms, homonyms, and hyponyms,
akin to WordNet [118]. Ontologies, resembling taxonomies, go beyond by
incorporating tightly integrated semantic relationships between terms
governed by rigorous rules for defining terms and their interconnections.
Ontologies have traditionally been linked with logical inference and are now
being leveraged for applications in the Semantic Web.
As computing becomes increasingly ubiquitous and pervasive, a
vast amount of information is generated in structured, unstructured, and semi-
structured formats [119]. Peter Mika has investigated the challenges that the
Semantic Web encounters with integrating social networks, which constitute a
complex network including databases, computational linguistics, knowledge-
based systems, and service-oriented computing [120]. Peter Mika's pioneering
Semantic Web Use Case, which received an award, introduces a novel
approach by applying semantic web techniques to retrieve datasets from
social networks.
This semantic web use case can personalize users' search
processes by integrating information through service discovery and analyzing
social networks like Facebook and others [121].
"Friend of a Friend" (FOAF) is an ontology designed for machines to
represent individuals, their activities, and their connections to others and
objects. FOAF enables decentralized groups to describe social networks
without relying on a central database. Using FOAF, you can create a profile
with personal information, such as name, interests, and connections to others.
This information can then be shared and interconnected with other FOAF
profiles, facilitating the creation of a distributed social network [122]. Swoogle
is the semantic web search ontology crawler that extracts metadata
information and then computes the relationship that exists between the
semantic web documents, which are stored in RDF or OWL [123]. The
information retrieval metrics used to analyse social network data (microblogs)
to know the behaviour of online users and their dynamic interactivity with other
users are studied in [124].

You might also like