0% found this document useful (0 votes)

11 views9 pages

T E I: M S T LLM: HE Thics of Nteractions Itigating Ecurity Hreats in S

This paper examines the ethical challenges posed by security threats to Large Language Models (LLMs), highlighting the risks of prompt injection, jailbreaking, PII exposure, and the generation of harmful content. It emphasizes the urgent need for robust defensive strategies and ethical frameworks to mitigate these threats, ensuring that LLMs operate within societal norms. The authors propose developing an evaluative tool to guide developers in fortifying LLMs and assessing their ethical implications during testing.

Uploaded by

aya217486

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views9 pages

T E I: M S T LLM: HE Thics of Nteractions Itigating Ecurity Hreats in S

Uploaded by

aya217486

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

T HE E THICS OF I NTERACTIONS : M ITIGATING S ECURITY

T HREATS IN LLM S

Ashutosh Kumar, Shiv Vignesh Murthy, Sagarika Singh, Swathy Ragupathy

Rochester Insititute of Technology
Rochester, NY-14623, USA
{ak1825, sm2678, ss3028, sr7788}@rit.edu
arXiv:2401.12273v2 [cs.CR] 10 Jul 2024

A BSTRACT
This paper comprehensively explores the ethical challenges arising from security threats to Large
Language Models (LLMs). These intricate digital repositories are increasingly integrated into our
daily lives, making them prime targets for attacks that can compromise their training data and the
confidentiality of their data sources. The paper delves into the nuanced ethical repercussions of
such security threats on society and individual privacy. We scrutinize five major threats—prompt
injection, jailbreaking, Personal Identifiable Information (PII) exposure, sexually explicit content, and
hate-based content—going beyond mere identification to assess their critical ethical consequences and
the urgency they create for robust defensive strategies. The escalating reliance on LLMs underscores
the crucial need for ensuring these systems operate within the bounds of ethical norms, particularly
as their misuse can lead to significant societal and individual harm. We propose conceptualizing
and developing an evaluative tool tailored for LLMs, which would serve a dual purpose: guiding
developers and designers in preemptive fortification of backend systems and scrutinizing the ethical
dimensions of LLM chatbot responses during the testing phase. By comparing LLM responses with
those expected from humans in a moral context, we aim to discern the degree to which AI behaviors
align with the ethical values held by a broader society. Ultimately, this paper not only underscores the
ethical troubles presented by LLMs; it also highlights a path toward cultivating trust in these systems.

Keywords Large Language Models · Prompt Injection · Jailbreaking · Personally Identifiable Information (PII) ·
Ethical Policies

1 Introduction
1.1 Understanding LLMs

Large Language Models (LLMs) [1] are deep learning-based models, designed to process and generate text at scale,
leveraging advanced neural network architectures such as transformers. A few crucial processes are involved in how
LLMs operate. Initially, a sizable corpus of text data, including books, journals, and web pages, is used to train the
model, which involves exposing the model to a wide range of language tasks, enabling it to learn representations of
words, phrases, and sentences in a way that captures their meanings and relationships. The model is trained using
deep learning methods like neural networks, with several processing layers and an encoder-decoder architecture. They
comprise millions or even billions of parameters, which enable them to capture complex linguistic patterns and semantic
relationships. After training, the model can produce contextually relevant and coherent text responding to the input text.
LLMs can produce coherent and fluent content even when given incomplete or unclear input by employing the language
modeling technique, which involves determining the probability of each word in a sentence based on the ones that
came before it. Modern LLMs are capable of zero-shot and few-shot learning, meaning they can perform tasks without
specific training. This allows them to generalize to new domains and tasks with minimal additional training. LLMs,
such as GPT-4 by OpenAI, Claude, Llamma by Meta AI, Gemma by Google, and Phi by Microsoft, have demonstrated
advanced language understanding and generation capabilities, leading to widespread adoption in diverse applications,
including natural language understanding, text generation, code generation, information retrieval, conversational AI,
Application Description

Natural Language Processing (NLP) LLMs have revolutionized NLP tasks, including text classification,
named entity recognition, sentiment analysis, and language transla-
tion. Their ability to understand and generate human-like text has
enhanced the accuracy and performance of NLP applications.
Conversational AI and Chatbots LLMs are the foundation for developing sophisticated conversational
AI systems and chatbots. They enable more natural and contextually
relevant interactions, improving user experiences in customer support,
virtual assistants, and dialogue systems.
Information Retrieval and Question An- LLMs excel in information retrieval and question-answering tasks
swering by comprehensively understanding and processing complex queries.
They power search engines, recommendation systems, and knowl-
edge bases, enabling users to access relevant information efficiently.
Content Generation and Summarization LLMs are utilized for content generation, including automatic writ-
ing, summarization, and paraphrasing. They can produce coherent
and contextually relevant text, contributing to applications such as
content creation, news summarization, and document generation.
Code Generation and Programming As- LLMs have been integrated into code generation, completion, and
sistance programming assistance tools. They aid developers in writing code,
debugging, and understanding programming languages, enhancing
productivity and software development processes.
Personalization and Adaptive Systems LLMs enable personalized content recommendations, adaptive in-
terfaces, and tailored user experiences. They analyze user input and
behavior to deliver customized responses and services in content
platforms and e-commerce applications. They are also employed
in the education sector, where tools are being developed to produce
instructional content and give students personalized feedback. Cus-
tomer care chatbots that can converse with clients in natural language
and offer tailored responses are being created using LLMs.

Table 1: A detailed look at LLMs and their integration into contemporary applications

and more.
Although LLMs have demonstrated remarkable performance and a broad range of application possibilities as shown in
Table 1, there are several possible risks involved in using LLMs. One problem is that social biases in the training data
may be inherited and amplified by LLMs, which could result in unethical and unfair outcomes for the model. Another
risk is that LLMs may, on purpose or in response to particular cues, produce improper, deceptive, or harmful content.
This may have profound effects on people and society at large. The complexity and scale of LLMs have increased
privacy concerns, especially about data sharing and potential exploitation. More questions are raised if models are made
public after being trained on private data. LLMs frequently commit terms from their training sets to memory, which
could be used by a malicious party to obtain confidential or personal information and jeopardize individual privacy.
Understanding LLMs involves recognizing their capabilities in language processing, their potential impact on diverse
applications, and the need to address challenges related to responsible use and ethical considerations. As LLMs evolve,
ongoing research and development efforts are essential to harnessing their potential while mitigating risks and ensuring
their alignment with societal values.

1.2 Identifying Vulnerabilities in LLMs

1.2.1 Prompt Injection

A complex manipulation of input for Large Language Models (LLMs) is a critical concern within the realm of AI ethics.
Manipulating the input by injecting biased, false, or harmful prompts into these models leads to outputs that perpetuate
bias, encourage destructive behavior, or distort or leak sensitive information. Vulnerabilities stemming from prompt
injection evoke ethical concerns that warrant proactive measures.

2
Multiple avenues of LLMs are vulnerable to prompt injection:

• Data Poisoning: The integrity of LLMs relies heavily on the nature of their training data. Prompt manipulation
during fine-tuning can introduce biases, leading to unintended discrimination in real time. The consequences
amplify when these models undertake content moderation roles, potentially perpetuating stereotypes against
specific ethnic groups.
• Model Inversion Attacks: Proprietary and powerful language and learning models (LLMs) like ChatGPT
operate with undisclosed architectural details, withholding information from the general public. Attempts
to reverse engineer such models pose significant ethical concerns, as adversaries could reconstruct a potent
replica of the original model. This replicated version might be exploited for malicious purposes, effectively
becoming an ’evil twin’ of the authentic model. This scenario highlights the ethical dilemma surrounding the
transparency of LLMs. Adversaries might craft prompts designed to elicit specific responses that indirectly
reveal details about the model’s architecture.
• Bypassing Guardrails: Sophisticated LLMs, such as ChatGPT, are equipped with guardrails that detect
prompts or inputs suggesting the creation of fictional identities or personas. Detection algorithms could flag or
restrict outputs that heavily focus on generating detailed personal information about non-existent individuals.
Fabricating false personas could lead to misinformation, deceit, or the potential for malicious activities,
contradicting ethical usage. However, guardrails are only partially secure and carry some vulnerabilities that
adversaries hunt to exploit. Adversaries might gradually introduce fictional character details into conversations
or prompts to bypass detection algorithms. Starting with seemingly innocent queries before transitioning
progressively to more elaborate fictional details could evade detection. Employing coded language, allusions,
or indirect references to describe fabricated personas makes it harder for detection algorithms to flag such
inputs.

1.2.2 Jailbreaking LLMs

In the context of LLMs, jailbreaking [2] aims to circumvent the restrictions imposed by the model owner or the hosting
platform to achieve unauthorized access to the Language Models’ internal functionalities and protocols. Unrestricted
access through jailbreaking jeopardizes the model’s integrity. Attackers could exploit this to infiltrate and manipulate
critical algorithms or datasets, potentially leading to distortions, biases, or unauthorized output alterations. Furthermore,
Jailbreaking may pave the way for injecting malicious code, compromising the system’s integrity, and perpetuating
biases.

1.2.3 Personal Identity Information (PII) Leaks

LLMs are trained on vast volumes of data encompassing information from the World Wide Web. These datasets contain
confidential and sensitive information, which makes these models vulnerable to leakage. Several reasons make LLMs
susceptible to PII leakage. Although overfitting and memorization are common reasons, these can be mitigated relatively
easily by tuning the model to rely less on memorization and focus more on generalizability. Attackers leverage prompts
to analyze LLM behavior, uncovering sensitive informational tendencies. Real-world prompts tied to recent events can
elicit context-associated data. Patterns in model responses might inadvertently reveal personal identifiers like names,
phone numbers, SSNs, or financial data like credit card numbers. Exploiting these leaks can lead to identity theft,
financial fraud, and severe repercussions for affected individuals or organizations.

1.2.4 Sexual and Hateful Content

Data Poisoning and Bypassing Guardrails may provoke LLMs to generate controversial information, such as sexual
or hateful content, which presents several ethical and moral concerns. LLMs excel at developing natural language
that closely resembles the style of humans. These models can generate responses that flow naturally in conversations,
exhibiting nuances, humor, and contextually appropriate replies. This realism contributes to blurring the distinction
between machine-generated and human-generated content.
LLMs capable of generating sexually explicit content might influence unrealistic expectations about relationships or
intimacy, potentially manipulating young minds. Inappropriate outputs from LLMs might reinforce derogatory remarks
or stereotypes about genders, contributing to the perpetuation of societal biases and discrimination. Misinformation
or misguided advice generated by LLMs regarding sexual health or practices could lead to unsafe behaviors among
adolescents who lack comprehensive sex education. Exposure to unrealistic or inappropriate content might impact
children and adolescents’ emotional and psychological well-being, shaping their perceptions and behaviors in unhealthy
ways. Reinforcing stereotypes or derogatory remarks through LLM outputs can contribute to normalizing discriminatory
attitudes or behaviors among younger audiences.

3
Elderly individuals, influenced by societal changes, might hold onto outdated opinions regarding race, gender, or
social norms. LLMs might inadvertently validate or reinforce these obsolete perspectives. LLMs might reinforce
resistance to adopting new societal norms or technological advancements by generating outputs aligned with traditional,
possibly outdated, viewpoints. If LLM-generated content aligns with outdated biases, it could reinforce discriminatory
attitudes or hinder the acceptance of more inclusive and progressive societal norms. Elderly individuals relying on
LLM-generated content might face social isolation or be susceptible to misinformation if exposed primarily to content
reinforcing outdated opinions.

2 Why Ethics Matter in LLM Attacks?

A critical factor in the creation and use of LLMs is ethics. Because LLMs can produce information that might be
interpreted positively or negatively, it is essential to have proactive ethical frameworks and legislative procedures
to regulate their proper usage and hold them accountable for the results. Important ethical factors in LLMs include
interpretability and explainability. Understanding LLMs’ decision-making processes is difficult due to their "black-box"
nature, essential for gaining public acceptance and trust—especially in delicate areas. Their efficacy and reliability are
restricted by their lack of operational understanding, even with their sophisticated skills.
The rapid advancement and widespread adoption of LLMs make their potential compromise through malicious attacks
an urgent ethical concern. Attacks aim to deliberately manipulate LLM responses to spread misinformation, bias, hate
speech, or inappropriate content that could significantly impact public discourse and decision-making. It is, therefore,
critical to safeguard LLMs based on ethical norms. LLMs are being integrated into sensitive domains like healthcare,
education, law, and policymaking, where reliability and truthfulness are paramount. Compromised models that generate
convincing, misleading, or biased claims could undermine evidence-based decision-making and erode public trust. The
diffusion of toxic, discriminatory, and unreliable content threatens ethical values like wisdom, dignity, equality, and
social cohesion that underpin a just society. Furthermore, attacks to expose private data from an LLM’s training set or
breach its secure systems raise ethical issues around consent, privacy, identity theft, and surveillance. Such violations
contravene ethical duties to respect individual autonomy and prevent harm. On a broader level, manipulated LLMs that
falsely portray minorities or marginalized groups risk reinforcing structural oppression.
For instance, employees at Samsung Semiconductor [3] accidentally leaked company secrets through ChatGPT prompts,
according to a report by Business Today. In response, Samsung’s chief has warned staff against repeating such mistakes,
threatening to block access to ChatGPT on the company network if it happens again. This incident highlights the risks
of sharing sensitive data with large language models (LLMs) like ChatGPT [4]. Even though OpenAI claims to remove
personally identifiable information, it could be retained and reproduced once confidential data is submitted to these
models. Samsung Semiconductor is developing an internal AI assistant to avoid further data leaks. However, it will
have strict data constraints, only processing short prompts under 1,024 bytes (Business Today, 2023). LLMs are trained
on vast datasets, so leaking proprietary information makes it available to other users. Over time, this could empower
competitors with valuable insider knowledge. This demonstrates how organizations must weigh the benefits of AI tools
against data security risks through governance policies and access controls. The Samsung case underlines the need
for employees to be cautious when sharing confidential data with public chatbot systems despite their conversational
convenience.
Therefore, Safeguarding LLMs’ integrity is an ethical obligation to foster their purposeful development aligned with
social goods. Techniques like transparency requirements, controlled testing for emergent harms, and instituting recourse
mechanisms can help continually assess and address threats from attacks. Conceptualizing and governing LLM safety
through the lens of ethics provides the imperative and moral basis to motivate corporate accountability and remedy issues
as they emerge. Establishing such ethical foundations is indispensable to building public trust in LLMs’ productive
integration into society.

3 Potential Misuse and Security Concerns

Examining ethical principles for Large Language Models (LLMs) [5] is a crucial aspect of the evolution of AI technology.
It involves scrutinizing how these models handle sensitive topics, maintain privacy, and avoid biases, ensuring they
align with human values and societal norms. This scrutiny guides the responsible development and deployment of
LLMs and shapes public trust and acceptance, balancing technological advancement with ethical responsibility. As
LLMs become more integral to daily life, their ethical framework becomes more significant in defining their role and
impact on society.Table 2 describes the ethical concerns considering LLMs.

4
Misinformation Identity Theft Bias Amplification Economic Reper- Privacy Challenges
and Societal Impli- from Training cussions
cations Data

LLM-generated mis- PII extracted Biased training LLM manipulation Ability to recon-
information threat- from training data data and targeted in key sectors risks struct images and
ens evidence-based enables digital prompts can am- eroding credibility, match identities
decision-making on impersonation and plify discrimination value generation, from model outputs
critical issues like phishing, violating against groups and public trust in threatens privacy
climate change and individual auton- with less oversight system outputs. rights and consent.
public health. omy. power.

Disinformation cam- Incomplete Restorative steps Financial impacts Lack of data sourc-
paigns powered by anonymization complicated by may dispropor- ing transparency
LLMs risk skewing allows tracing data power imbalances; tionately affect hinders ethical
elections and erod- to original, non- consequences en- vulnerable com- review and risks
ing shared founda- consenting authors. trench demographic munities while exposing private
tions of truth. inequalities. obscuring account- conversations.
ability.
Table 2: Ethical Concerns in Large Language Models

4 Towards Ethical Mitigation: A Proposed Methodology

The tool is designed to enhance the guidelines and strategies for securing large language model (LLM) systems. It
hypothesizes a robust solution for identifying and mitigating unethical or harmful user interactions.

4.1 User Prompt Reception

The system initially receives the user’s prompt through the LLM user interface.

4.2 Prompt Classification Engine

• Prior Analysis: The prompt undergoes an initial analysis where it is scanned for characteristics of these five
categories of vulnerabilities: prompt injection attack, jailbreak attempt, personally identifiable information
(PII), sexual content, and hateful content.
• Probability or Likelihood Calculation: For each category, the system calculates the likelihood of the prompt
belonging to each category as [p1, p2, p3, p4, p5] using respective classification methods, like Natural
Language Processing (NLP) for sexual and hateful prompts. The probabilities are not necessarily an exact
emulation of the nature of the prompt because a significant number of attacks are based on a sequence of
prompts designed to break the LLM’s application structure.
• Primary Category Identification: The system identifies the primary attack category and launches the response
design phase based on the highest probability beyond a certain threshold. However, multiple types of attacks
can be carried out simultaneously using a combined response design because the ultimate purpose of this tool
is to build a shield against all those attacks.

4.3 Ethical and Security Compliance Check

• Based on the identified primary category, the system checks specific compliance rules and ethical guidelines
(if already in place) relevant to that category, or if the LLM system is still in the testing phase, the respective
ethical policies can be designed based on risk analysis using this tool and with the help of stakeholders
including ethicists, designers, and developers.
• This ensures that the response is tailored to mitigate the potential risks associated with the identified category.
The LLM system must respond sensitively to the incoming prompt so as not to trigger more enhanced attacks
on its application database and system architecture.

5
Categorical probabilities

P1 Prompt Injection

P2 Jailbreaking

Personal Identifying

P3 Prompt
Information (PII) Leaks

P4 Sexual

P5 Hateful

Threshold-based
Response

Template Design Response Output

Classification

Figure 1: A tool designed to mitigate LLM security risks

4.4 Response Design Phase

• Template Selection: A response template is chosen depending on the primary category. These templates are
pre-designed to handle specific categories of attacks. Designing a template for a combined attack is tricky, as
there might be hidden repercussions with the user interactions. Designers and developers must sit with ethical
scientists, sociologists, and psychology experts to formulate the designs and trigger keywords and thresholds
for each template.
• Customization and Filtering: The response is customized to the specific prompt, ensuring no unethical or
harmful content is included. This may involve filtering out sensitive information or reframing the answer
to avoid endorsing or propagating harmful content. The thing to note here is that these are response design
templates, not the actual responses.

4.5 Response Delivery

The ethically compliant and secure response is delivered to the user.

4.6 Monitoring and Feedback Loop

• The system continuously monitors its performance and the accuracy of its classifications, probably with the
help of manual oversight in its initial design and implementation phase.
• Feedback from these monitoring processes is used to refine the classification algorithms and response templates.

4.7 Applications

• Testing for ethical interactions at beta-level (test phase) LLM software and further designing ethical guidelines
• Integrating into existing LLM chatbots to monitor incoming prompts and update the security protocols to
ensure ethical interaction between the user and the LLM
• Being open-source, it can rely on constant updation of ethical and security bugs and create a feedback loop for
AI ethical security in general

6
4.8 Challenges

4.8.1 Contextual Assessment

Recognizing that a single prompt might fit within a specific attack category is essential, but this is only sometimes
the case. Such a prompt is often merely a fragment of a broader conversation, especially noticeable at the start of an
interaction. The challenge lies in determining the optimal point for activating the prompt classification mechanism. One
approach to consider is using a dictionary, which maps the sequences of user prompts to potential attack categories.
This would function like assigning a preliminary probability of an attack based on key-value pairs accumulated during
testing phases.

4.8.2 Shielded System Design

Safeguard the tool by concealing its inner workings, ensuring it remains secure. Implement separate, isolated components
within the large language model (LLM) backend and the tool interface. This separation prevents external reprogramming
and loss of control over the tool. Remember, increasing the number of features can also increase potential vulnerabilities
in the system.

4.8.3 Auto-Disable Functionality

If a node in the tool malfunctions, it should automatically deactivate, allowing the LLM to revert to its original, pre-tool
state with updated protocols. This modular design ensures the tool can be seamlessly detached from the LLM system
when necessary.

5 Preemptive Ethical Measures

Integrating your tool into designing and developing Large Language Models (LLMs) can ensure ethical compliance,
data integrity, user-friendly design, and robust security.
Some pre-emptive ethical measures organizations could take are:

5.1 Ethical Compliance and Transparency

• Ethical Guideline Development: Establish clear ethical guidelines for LLM usage, including respect for
privacy, non-discrimination, and avoiding harmful content.
• Transparent Decision-Making: Document and make the decision-making process behind the LLM’s design
transparent, especially regarding ethical considerations. The biases of template designers can creep into the
developmental stage, so it’s essential to maintain transparency.
• Stakeholder Engagement: Involve diverse stakeholders, including ethicists, in the development process to
ensure a wide range of perspectives and concerns are considered. Organizations should recruit ethical scientists
to ensure the development of guidelines and the overall impact of their decisions at each step of the process.
• Ethics Training for Developers: Provide ethics training for developers and designers to sensitize them to
potential ethical issues in LLM development. The entire procedure requires the tech stack and a whole set of
humanities to execute this.

5.2 User Interface Design

• Intuitive Reporting Mechanisms: Design user interfaces with easy-to-use reporting mechanisms for unethical
or problematic content generated by the LLM.
• User Consent and Control: Implement clear consent protocols for users, letting them know how their data is
used and giving them control over their interaction with the LLM.
• Accessible Design: Ensure the interface is accessible to diverse users, including those with disabilities, to
promote inclusivity.

5.3 Robust Security Protocols

• Regular Security Audits: Conduct regular security audits to identify and address vulnerabilities in the LLM
system.

7
• Advanced Threat Detection: Integrate advanced threat detection systems to identify and mitigate potential
security breaches or misuses preemptively.
• Data Protection Measures: Implement robust data protection measures, such as encryption and secure data
storage, to safeguard user data.

5.4 Continuous Monitoring and Evaluation

• Real-Time Monitoring: Use your tool to monitor LLM outputs in real-time, quickly identifying and addressing
ethical issues.
• Feedback Loops: Establish feedback loops where user feedback and monitoring insights are used continuously
to improve the LLM’s ethical compliance.
• Impact Assessment: Regularly assess the impact of the LLM on users and society to ensure it aligns with
ethical and societal values.

5.5 Training and Development

• Continuous Learning: Incorporate mechanisms for the LLM to learn from its interactions and improve its
ethical decision-making capabilities.
• Developer and User Education: Educate developers and users about the ethical use of LLMs and the
importance of data integrity and security.

By implementing these measures, organizations can ensure that their LLMs are not only ethically compliant but also
resilient to various challenges, thereby maintaining user trust and the integrity of their systems.

6 Ethical Response to LLM Attacks

The "AI response spectrum" refers to the range of possible responses an artificial intelligence (AI) system can generate
in response to various inputs, prompts, or queries. This spectrum encompasses the diversity of potential outputs that
an AI model, such as a Large Language Model (LLM), can produce, ranging from accurate and helpful responses to
biased, harmful, or incorrect outputs. Understanding the AI response spectrum is crucial for evaluating the capabilities
and limitations of AI systems and addressing ethical considerations and potential risks associated with their use. By
mapping human responses onto the AI response spectrum, researchers and developers can analyze how AI systems
interpret and process human input and work towards ensuring that the generated responses align with ethical standards,
human intentions, and societal values.
Efforts to manage and regulate the AI response spectrum involve fine-tuning AI models on instruction-formatted data,
aligning models using human feedback, and promoting ethical and responsible AI usage through explainability and
accountability measures (7, 30). These approaches aim to steer AI responses toward being helpful, honest, and harmless
while mitigating the risks of biased, harmful, or incorrect outputs. Human reactions to the diverse AI response spectrum
can vary significantly based on the nature of the AI-generated outputs.
Here are some potential human reactions corresponding to the possible AI response spectrum:

• Accurate and informative responses: Artificial intelligence is increasingly appreciated for its accurate and
insightful responses, especially in knowledge-seeking and assistance. This positive reception underscores the
growing reliance on technology for decision-making and information gathering, highlighting AI’s expanding
role in everyday life and professional settings.
• Biased or Misleading Responses: People often express concern and skepticism towards AI systems that
produce biased or misleading information, particularly when it perpetuates stereotypes or inaccuracies. This
caution highlights the importance of ethical and reliable AI practices in shaping societal perceptions and
decision-making processes.
• Harmful or inappropriate outputs: Users may feel discomfort, distress, or offense from toxic or improper
AI-generated content, leading to a potential loss of trust in the technology. This reaction underscores the need
for responsible AI development that prioritizes user well-being and trustworthiness.
• Incomplete or Incoherent Responses: Users often experience frustration and dissatisfaction with AI systems
that deliver incomplete or incoherent outputs, especially when they seek clear and comprehensive answers. It
highlights the importance of advancing AI to better meet user expectations for accuracy and relevance.

8
• Ethical and Trustworthy Responses: Humans tend to respond positively to AI-generated content that is
ethical and trustworthy, fostering a sense of trust in the technology and encouraging its responsible use. This
confidence is critical to integrating AI more deeply and beneficially into various aspects of life.
• Creative and Novel Outputs: Users often appreciate and engage with AI-generated content that is creative
and novel, recognizing the technology’s potential for innovative and imaginative outputs. This appreciation
fosters a more significant interaction and interest in AI’s possibilities for creativity and originality.

Understanding human reactions to the AI response spectrum is essential for evaluating the impact of AI-generated
outputs on users and society. It underscores the importance of steering AI systems toward producing ethical, accurate,
and helpful responses while addressing bias, harm, and misinformation concerns. Promoting transparency, explainability,
and user feedback can foster trust and align AI-generated content with human values and expectations.

References
[1] Yupeng Chang, Xu Wang, Jindong Wang, Yuan Wu, Linyi Yang, Kaijie Zhu, Hao Chen, Xiaoyuan Yi, Cunxiang
Wang, Yidong Wang, et al. A survey on evaluation of large language models. ACM Transactions on Intelligent
Systems and Technology, 15(3):1–45, 2024.
[2] Tony Y. Zhuo, Yujia Huang, Chen Chen, and Zhaojun Xing. Red teaming chatgpt via jailbreaking: Bias, robustness,
reliability and toxicity. ArXiv, 2023.
[3] Business Today. Samsung employees accidentally leaked company secrets via chatgpt: Here’s what happened,
April 2023.
[4] Eric Derner and Katja Batistič. Beyond the safeguards: Exploring the security risks of chatgpt. ArXiv, 2023.
[5] Junseong Bang, Byung-Tak Lee, and Pangun Park. Examination of ethical principles for llm-based recommendations
in conversational ai. In 2023 International Conference on Platform Technology and Service (PlatCon), pages
109–113, 2023.

Lovely Hearts Consulting Toolkit by Slidesgo
No ratings yet
Lovely Hearts Consulting Toolkit by Slidesgo
79 pages
Large Language Models and Their Use Cases
No ratings yet
Large Language Models and Their Use Cases
3 pages
LLM Seminar PDF
No ratings yet
LLM Seminar PDF
10 pages
Large Language Models
No ratings yet
Large Language Models
3 pages
Python BAKMR010399001
No ratings yet
Python BAKMR010399001
3 pages
Large Language Models
No ratings yet
Large Language Models
2 pages
100 Daysofcybersecurity
No ratings yet
100 Daysofcybersecurity
62 pages
《A Primer on Large Language Models and their Limitations
No ratings yet
《A Primer on Large Language Models and their Limitations
33 pages
LLMs
No ratings yet
LLMs
72 pages
LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures
No ratings yet
LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures
32 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
2024 NTU - Resaro - LLM - Security - Paper
No ratings yet
2024 NTU - Resaro - LLM - Security - Paper
19 pages
Training Large Language Models
No ratings yet
Training Large Language Models
7 pages
Techniques, Tricks & Frameworks
No ratings yet
Techniques, Tricks & Frameworks
143 pages
Attention Is All You Need.
No ratings yet
Attention Is All You Need.
5 pages
Large Language Models (LLMs) - Architecture, Training, Applications, And Challenges
No ratings yet
Large Language Models (LLMs) - Architecture, Training, Applications, And Challenges
5 pages
1st Note
No ratings yet
1st Note
3 pages
LLM
No ratings yet
LLM
3 pages
Large Language Models
No ratings yet
Large Language Models
2 pages
LLM Basics
No ratings yet
LLM Basics
3 pages
Exploring The Evolution of Large Language Models: Architectures, Applications, and Future Directions
No ratings yet
Exploring The Evolution of Large Language Models: Architectures, Applications, and Future Directions
11 pages
Large Language Model: Instructor Name: Shukdev Datta ML Developer at Innovative Skills
No ratings yet
Large Language Model: Instructor Name: Shukdev Datta ML Developer at Innovative Skills
22 pages
LLM Advancements Applications Challenges 20000 Words
No ratings yet
LLM Advancements Applications Challenges 20000 Words
3 pages
Privacy&Security For LLMs-Privacy-Preserving Techniques For Personalized AI - Lin
No ratings yet
Privacy&Security For LLMs-Privacy-Preserving Techniques For Personalized AI - Lin
64 pages
LLM - Seminar Report
No ratings yet
LLM - Seminar Report
13 pages
Llms
No ratings yet
Llms
3 pages
Day 17 Introduction To LLMs
No ratings yet
Day 17 Introduction To LLMs
7 pages
1 s2.0 S266729522400014X Main
No ratings yet
1 s2.0 S266729522400014X Main
21 pages
Large Language Models - PPTX 20250612 203058 0000
No ratings yet
Large Language Models - PPTX 20250612 203058 0000
12 pages
Large Language Models (LLMS) : Survey, Technical Frameworks, and Future Challenges
No ratings yet
Large Language Models (LLMS) : Survey, Technical Frameworks, and Future Challenges
51 pages
Fai Unit-5 TB
No ratings yet
Fai Unit-5 TB
7 pages
What Are LLMs
No ratings yet
What Are LLMs
3 pages
Pranay Report-1
No ratings yet
Pranay Report-1
36 pages
Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
100% (1)
Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
254 pages
The Need For Large Language Models
No ratings yet
The Need For Large Language Models
3 pages
A Survey On Responsible LLMS: Inherent Risk, Malicious Use, and Mitigation Strategy
No ratings yet
A Survey On Responsible LLMS: Inherent Risk, Malicious Use, and Mitigation Strategy
35 pages
Unlocking the Power of LLMs- Transformative Use Cases Across Industries (1)
No ratings yet
Unlocking the Power of LLMs- Transformative Use Cases Across Industries (1)
44 pages
afsa
No ratings yet
afsa
1 page
LLM and Security
No ratings yet
LLM and Security
4 pages
Understanding Large Language Models (LLMS) - A Mode
No ratings yet
Understanding Large Language Models (LLMS) - A Mode
3 pages
LLM
No ratings yet
LLM
1 page
What Are Large Language Models (LLMS) - IBM
No ratings yet
What Are Large Language Models (LLMS) - IBM
11 pages
Industrial Applications of Large Language Models
No ratings yet
Industrial Applications of Large Language Models
23 pages
IJRPR29621
No ratings yet
IJRPR29621
7 pages
A Review On Large Language Models Architectures Ap
No ratings yet
A Review On Large Language Models Architectures Ap
31 pages
Hullmi: Human vs. LLM Identification With Explainability
No ratings yet
Hullmi: Human vs. LLM Identification With Explainability
17 pages
2 Notes
No ratings yet
2 Notes
3 pages
A Review On Large Language Models Archit
No ratings yet
A Review On Large Language Models Archit
32 pages
Generative AI
No ratings yet
Generative AI
6 pages
Dokumen - Pub Quick Start Guide To Large Language Models Strategies and Best Practices For Using Chatgpt and Other Llms 9780138199425
No ratings yet
Dokumen - Pub Quick Start Guide To Large Language Models Strategies and Best Practices For Using Chatgpt and Other Llms 9780138199425
325 pages
LLM 1
No ratings yet
LLM 1
6 pages
Large Language Models A Comprehensive Survey of It
No ratings yet
Large Language Models A Comprehensive Survey of It
30 pages
Aeon - Co-Can Philosophy Help Us Get A Grip On The Consequences of AI
No ratings yet
Aeon - Co-Can Philosophy Help Us Get A Grip On The Consequences of AI
10 pages
LLM Report
No ratings yet
LLM Report
10 pages
LLM Survey
No ratings yet
LLM Survey
31 pages
Data Seminar
No ratings yet
Data Seminar
10 pages
Augmenting Human Potential:: The Role of Llms in Shaping The Future of Hci
No ratings yet
Augmenting Human Potential:: The Role of Llms in Shaping The Future of Hci
4 pages
An Analysis of Large Language Models: Their Impact and Potential Applications
No ratings yet
An Analysis of Large Language Models: Their Impact and Potential Applications
24 pages
Introduction to LLMs for Business Leaders: Responsible AI Strategy Beyond Fear and Hype: Byte-Sized Learning Series
From Everand
Introduction to LLMs for Business Leaders: Responsible AI Strategy Beyond Fear and Hype: Byte-Sized Learning Series
I. Almeida
No ratings yet
Unraveling the Magic of Large Language Models: A Journey into the Future of Communication
From Everand
Unraveling the Magic of Large Language Models: A Journey into the Future of Communication
Lila Hartney
No ratings yet
LangChain Applications in Modern LLM Development: The Complete Guide for Developers and Engineers
From Everand
LangChain Applications in Modern LLM Development: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Linesanity - A Journey From Simple To Complex Linetypes
No ratings yet
Linesanity - A Journey From Simple To Complex Linetypes
40 pages
Proof of Claim
No ratings yet
Proof of Claim
1 page
BRD2506A A03 Schematic
No ratings yet
BRD2506A A03 Schematic
13 pages
Unveiling The Horizons of Advanced Artificial Intelligence
No ratings yet
Unveiling The Horizons of Advanced Artificial Intelligence
6 pages
M2E11 MainDrive 04
No ratings yet
M2E11 MainDrive 04
14 pages
CS 4402-01learning Journal Unit 1
No ratings yet
CS 4402-01learning Journal Unit 1
2 pages
Description Box Suction (Vacuum Feed), Standard Rocker Arm: Model P-55U Pump Data Sheet
No ratings yet
Description Box Suction (Vacuum Feed), Standard Rocker Arm: Model P-55U Pump Data Sheet
3 pages
Noman Abid Resume
No ratings yet
Noman Abid Resume
1 page
All Practicals IWT
No ratings yet
All Practicals IWT
22 pages
Eight Simple Qigong Exercises For Health: The Eight Pieces of Brocade
No ratings yet
Eight Simple Qigong Exercises For Health: The Eight Pieces of Brocade
1 page
Sonder Notes
No ratings yet
Sonder Notes
5 pages
MScThesis PepijnKessels
No ratings yet
MScThesis PepijnKessels
142 pages
Math MCQ Probability
100% (2)
Math MCQ Probability
6 pages
Chapter 16 Auditing
No ratings yet
Chapter 16 Auditing
29 pages
Instructions For Programming The Control Electronics RE11: 1 Generating An Update File
No ratings yet
Instructions For Programming The Control Electronics RE11: 1 Generating An Update File
2 pages
1.5.7 Packet Tracer - Network Representation
No ratings yet
1.5.7 Packet Tracer - Network Representation
3 pages
Institutional Email Request Application Form: (To Send OTP For First Login)
No ratings yet
Institutional Email Request Application Form: (To Send OTP For First Login)
1 page
Dell Unity OE Revision Matrix - Dell UK
No ratings yet
Dell Unity OE Revision Matrix - Dell UK
4 pages
OEG Service Information Hard Disk Initialization Method For DNC-DT Function
100% (1)
OEG Service Information Hard Disk Initialization Method For DNC-DT Function
2 pages
Unified Council: Unified Cyber Olympiad - Uc 329
No ratings yet
Unified Council: Unified Cyber Olympiad - Uc 329
4 pages
IBM Deskstar 120GXP
No ratings yet
IBM Deskstar 120GXP
2 pages
BR SPG1700 PDF
No ratings yet
BR SPG1700 PDF
2 pages
SKMU MCA 2years Syllabus
No ratings yet
SKMU MCA 2years Syllabus
41 pages
Oop Chapter 05
No ratings yet
Oop Chapter 05
15 pages
Mini Project Report Format 6th Sem
No ratings yet
Mini Project Report Format 6th Sem
10 pages
Computer Aided Process Control System
No ratings yet
Computer Aided Process Control System
10 pages
Inspection Report - GBT Raft Foundation
No ratings yet
Inspection Report - GBT Raft Foundation
6 pages
Arts6 - Quarter2 - Module 7-Edited
No ratings yet
Arts6 - Quarter2 - Module 7-Edited
19 pages
Grade 7 To 8 Math Curriculum Guide
No ratings yet
Grade 7 To 8 Math Curriculum Guide
7 pages