Reflection Prompting Making LLM Think

The document outlines the Reflection framework for enhancing language-based agents through linguistic feedback, emphasizing its components such as Actor, Evaluator, and Self-Reflection. It discusses the advantages and limitations of using Reflection, including its suitability for trial and error scenarios and the challenges of hallucination in LLMs. Additionally, it highlights the evolving nature of agents that combine LLMs with capabilities for task planning, tool use, memory management, reasoning, and learning.

Uploaded by

pathakpritee20

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views15 pages

Reflection Prompting Making LLM Think

Uploaded by

pathakpritee20

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

Reflection Prompting

Making LLM Think

Logistics
● One week from now - you will have an exam
● It will be one hour - multiple choice questions
● Whatever covered until last week will be covered

● Projects - Next Wednesday is the deadline for picking your project topics and
teams
Recall
● We learned about ReAct framework and saw an example of how to make an
LLM reason - a bit unsuccessfully
● ReAct => Reason + Action
● Reasoning involves, LLM “thinking” through results of its actions
● Action involves calling tools that can take action on LLM’s direction with LLM
specified inputs and produce results
● We keep repeating until LLM “thinks” it has found the “answer” to user query
● This is one of the basic frameworks for building LLM Agent
Reflection
Like ReAct framework, we can also use Reflection framework.

“Reflection is a framework to reinforce language-based agents through

linguistic feedback” - Prompt Engineering Guide

Per the Paper Shinn et al. (2023) - "Reflexion is a new paradigm for ‘verbal‘
reinforcement that parameterizes a policy as an agent’s memory encoding paired
with a choice of LLM parameters."
Reflection Diagram

Actor: Takes an Action.

Environment: Provides feedback as
reward or observation
Trajectory: series of feedback and
action for a shorter duration are
stored as Short Term Memory.
Self-Reflection: LLM that processes
the external feedback and internal
feedback - evaluator’s evaluation of
the Trajectory of the conversation.
Experience: text from
Self-Reflection to store for future use
as Long Term Memory.
Reflection - Actor
Actor:
● The Actor takes an action in an environment and receives an observation
which results in a trajectory.
● Chain-of-Thought (CoT) and ReAct are used as Actor modalities.
○ These are prompting methods used to make the LLM “think”
● A memory component is also added to provide additional context to the agent
○ To provide more context in the prompt for the LLM to decide next action
○ Feedback from environment - user or another LLM is added as short term memory of the
conversation / transaction
○ We also provide memory of prior actions and their reflections to Actor as input as long term
memory
Reflection - Evaluator
Evaluator:

● Judges Actor’s actions based on external feedback and recent

action-feedback pairs (short term memory)
● This is a combination of LLM judgements or rules set that is applied to the
input
● The goal is to create internal feedback for the reflection LM to process the
external feedback with
Reflection - Self-Reflection
Self-Reflection LLM:

● Generates text to reinforce decision for the Actor in self-improvements

● This provides valuable feedback for the actor and is stored in long term
memory for future actions
● To generate this feedback, the model takes input, external rewards to Actor’s
actions, the evaluator’s judgement of this action-feedback pair and collections
of past such pairs.
Reflection - When To Use
Reflection is best when used for

● Trial and Error is acceptable

● Other ML techniques like Reinforcement Learning (RL) is not feasible
○ Training data is too expensive
○ Not enough experts
● Detailed feedback is useful
● Explainability / Traceability of decisions is important
○ Medical diagnosis
○ Learning from previous unrelated runs is important
Reflection - When NOT To Use
Reflection LLM -

● It is an LLM - It can “Hallucinate”...

● When your reflection based teacher is hallucinating -
○ You learn to hallucinate MORE!!

Long Term Memory -

● Is complicated to store and retrieve

Reflection - What is it good for?
● Answers follow a linear path - one decision leads to the next one
● Time is not an issue in answering
○ The back and forth takes time
● Decision needs reasoning
● Programming assignment - you can make LLMs write your assignment :-D …
○ IF you learn how to prompt them!!
Agent
Exciting and rapidly evolving area in artificial intelligence

● At their core, these agents are systems that combine large language models
(LLMs) with additional capabilities to perform tasks or solve problems more
effectively than a standard LLM alone
● "agent" is a system that can perceive its environment, make decisions, and
take actions to achieve specific goals. It's like a digital entity with a degree of
autonomy.
Agent
Key Features:
● Task Planning: They can break down complex tasks into smaller,
manageable steps.
● Tool Use: They can utilize various digital tools or APIs to gather information
or perform actions.
● Memory and Context Management: They can maintain context over longer
interactions and "remember" important information.
● Reasoning: They can apply logical reasoning to solve problems or make
decisions.
● Learning and Adaptation: Some advanced agents can learn from their
interactions and improve over time.
References
ReAct Prompting: https://www.promptingguide.ai/techniques/react

Reflection Prompting: https://www.promptingguide.ai/techniques/reflexion

LLM Reasoning: https://www.promptingguide.ai/research/llm-reasoning

Agent
To be continued…

IEEE Conference Template
No ratings yet
IEEE Conference Template
7 pages
Basics of Prompt Engineering
No ratings yet
Basics of Prompt Engineering
29 pages
Shift of Educational Focus From Content To Learning Outcomes
100% (1)
Shift of Educational Focus From Content To Learning Outcomes
25 pages
What We Learned From A Year of Building With LLMs (For True Epub) (Eugene Yan, Bryan Bischof, Charles Frye Etc.)
No ratings yet
What We Learned From A Year of Building With LLMs (For True Epub) (Eugene Yan, Bryan Bischof, Charles Frye Etc.)
90 pages
Graph of Thoughts: Solving Elaborate Problems With Large Language Models
No ratings yet
Graph of Thoughts: Solving Elaborate Problems With Large Language Models
13 pages
LLM's For Code Generation
No ratings yet
LLM's For Code Generation
31 pages
What We've Learned From A Year of Building With LLMs - Applied LLMs
No ratings yet
What We've Learned From A Year of Building With LLMs - Applied LLMs
37 pages
Reflexion: Language Agents With Verbal Reinforcement Learning
No ratings yet
Reflexion: Language Agents With Verbal Reinforcement Learning
18 pages
Language Agents With Verbal Reinforcement Learning
No ratings yet
Language Agents With Verbal Reinforcement Learning
19 pages
Agent Design Patterns
No ratings yet
Agent Design Patterns
17 pages
Exploring Augmentation and Cognitive Strategies For AI Based Synthetic Personae
No ratings yet
Exploring Augmentation and Cognitive Strategies For AI Based Synthetic Personae
9 pages
Dynasaur:: Large Language Agents Beyond Predefined Actions
No ratings yet
Dynasaur:: Large Language Agents Beyond Predefined Actions
15 pages
Fundamentals and Beyond
No ratings yet
Fundamentals and Beyond
12 pages
Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
No ratings yet
Planet, Code - PYTHON For LARGE LANGUAGE MODELS - A Beginners Handbook For Leveraging Llms Into Modern Development Workflows and Applications (2025)
254 pages
Applications of Generative AI - Somsuvra Chatterjee
No ratings yet
Applications of Generative AI - Somsuvra Chatterjee
35 pages
Chatgpt: A Technical Perspective: Presented by Teamx
No ratings yet
Chatgpt: A Technical Perspective: Presented by Teamx
18 pages
LLM Agents - Prompt Engineering Guide
No ratings yet
LLM Agents - Prompt Engineering Guide
16 pages
Omnireflect: Discovering Transferable Constitutions For LLM Agents Via Neuro-Symbolic Reflections
No ratings yet
Omnireflect: Discovering Transferable Constitutions For LLM Agents Via Neuro-Symbolic Reflections
24 pages
Lab Session1 25oct2024
No ratings yet
Lab Session1 25oct2024
29 pages
Fil 106 Ugnayang NG Wika, Kultura at Lipunan
No ratings yet
Fil 106 Ugnayang NG Wika, Kultura at Lipunan
7 pages
Generative Ai Terminology
67% (3)
Generative Ai Terminology
26 pages
$R1AJXVA
No ratings yet
$R1AJXVA
14 pages
LLM Fundamentals - 1 Introduction - 1 Neo4j and Genai
No ratings yet
LLM Fundamentals - 1 Introduction - 1 Neo4j and Genai
4 pages
LLM Powered Autonomous Agents - Lil'Log
No ratings yet
LLM Powered Autonomous Agents - Lil'Log
24 pages
Listening Comprehension
No ratings yet
Listening Comprehension
3 pages
Teaching LLMs To Think and Act - ReAct Prompt Engineering - by Bryan McKenney - Medium
No ratings yet
Teaching LLMs To Think and Act - ReAct Prompt Engineering - by Bryan McKenney - Medium
15 pages
A4.2 Advanced Problem Solving Nerority - Prompt-Engineering-Mastery Wiki GitHub
No ratings yet
A4.2 Advanced Problem Solving Nerority - Prompt-Engineering-Mastery Wiki GitHub
2 pages
Google REST
No ratings yet
Google REST
19 pages
Reflection-Bench: Probing AI Intelligence With Reflection
No ratings yet
Reflection-Bench: Probing AI Intelligence With Reflection
11 pages
Small Language Models (SLMS)
No ratings yet
Small Language Models (SLMS)
23 pages
20 Types Prompting Styles
No ratings yet
20 Types Prompting Styles
22 pages
Agentic Design1
No ratings yet
Agentic Design1
15 pages
LLM Applications
100% (1)
LLM Applications
1 page
Prompt Engineering Mastery
No ratings yet
Prompt Engineering Mastery
9 pages
State of AI - by Eduardo Mace - ScalePV 2023
No ratings yet
State of AI - by Eduardo Mace - ScalePV 2023
36 pages
Techniques, Tricks & Frameworks
No ratings yet
Techniques, Tricks & Frameworks
143 pages
Self-Improving LLM Architectures With Open Source
No ratings yet
Self-Improving LLM Architectures With Open Source
14 pages
TPTU: Task Planning and Tool Usage of Large Language Model-Based AI Agents
No ratings yet
TPTU: Task Planning and Tool Usage of Large Language Model-Based AI Agents
36 pages
PAPER Prompt Engineering For LLM
No ratings yet
PAPER Prompt Engineering For LLM
6 pages
LLM Agent Overview
No ratings yet
LLM Agent Overview
1 page
11 LPI 5 Naif+Adeeb+Alotaibi 7 5010
No ratings yet
11 LPI 5 Naif+Adeeb+Alotaibi 7 5010
22 pages
Fine Tuning Techniques For Large Language Models LLMs
No ratings yet
Fine Tuning Techniques For Large Language Models LLMs
15 pages
Summary Completion - Textbook 4.5
No ratings yet
Summary Completion - Textbook 4.5
4 pages
MISIC Theerroneouspracticeof6percentproration
100% (1)
MISIC Theerroneouspracticeof6percentproration
3 pages
Module 3
No ratings yet
Module 3
43 pages
What We Learned From A Year of Building With LLMs (Part I) - O'Reilly
No ratings yet
What We Learned From A Year of Building With LLMs (Part I) - O'Reilly
22 pages
Prompt Engineering
No ratings yet
Prompt Engineering
5 pages
Ge Elec 1 - Chapter 4
No ratings yet
Ge Elec 1 - Chapter 4
20 pages
02 - Embeddings, Prompting, & Moderation
No ratings yet
02 - Embeddings, Prompting, & Moderation
54 pages
Get The Best From AI
No ratings yet
Get The Best From AI
34 pages
Language Learning - March 1985 - o Malley - Learning Strategies Used by Beginning and Intermediate Esl Students
No ratings yet
Language Learning - March 1985 - o Malley - Learning Strategies Used by Beginning and Intermediate Esl Students
26 pages
Ob - Group 3
No ratings yet
Ob - Group 3
20 pages
LLM
No ratings yet
LLM
3 pages
CPCS335 1 Introduction
No ratings yet
CPCS335 1 Introduction
34 pages
Chapter 1
No ratings yet
Chapter 1
29 pages
Psychology: By: E/Cdt - Marc Augustus T. Garcia
No ratings yet
Psychology: By: E/Cdt - Marc Augustus T. Garcia
14 pages
Lesson No. 2
No ratings yet
Lesson No. 2
15 pages
The Alliance LD Toolkit
No ratings yet
The Alliance LD Toolkit
80 pages
LLMs Agents Guide
No ratings yet
LLMs Agents Guide
11 pages
2024 NTU - Resaro - LLM - Security - Paper
No ratings yet
2024 NTU - Resaro - LLM - Security - Paper
19 pages
Self Concept
No ratings yet
Self Concept
4 pages
Coping Skills Inventory
No ratings yet
Coping Skills Inventory
2 pages
Extempore Speech Action Plan
No ratings yet
Extempore Speech Action Plan
3 pages
Unit 5 Speaking
No ratings yet
Unit 5 Speaking
2 pages
Teacher Coaching and Development Process
No ratings yet
Teacher Coaching and Development Process
7 pages
GALLM Unit 5 Note
No ratings yet
GALLM Unit 5 Note
7 pages
Chapter 1
No ratings yet
Chapter 1
29 pages
2013 Psychology Exam
No ratings yet
2013 Psychology Exam
40 pages
《A Primer on Large Language Models and their Limitations
No ratings yet
《A Primer on Large Language Models and their Limitations
33 pages
Cewvbtyhtrh
No ratings yet
Cewvbtyhtrh
3 pages
Week 11 Chats
No ratings yet
Week 11 Chats
5 pages
Global Logic Interview Questions and Answers
No ratings yet
Global Logic Interview Questions and Answers
6 pages
ACTFLPerformance Descriptors-Interpersonal PDF
No ratings yet
ACTFLPerformance Descriptors-Interpersonal PDF
2 pages
Cte Ped 06 Module
No ratings yet
Cte Ped 06 Module
78 pages
Smart - Verbal Report of Habtamu Takele
No ratings yet
Smart - Verbal Report of Habtamu Takele
10 pages
Critical Thinking Mindset Skills
No ratings yet
Critical Thinking Mindset Skills
2 pages
A Semi Detailed Lesson Plan Education
No ratings yet
A Semi Detailed Lesson Plan Education
3 pages
An Investigation of Direct and Indirect Learning Strategies in Learning
No ratings yet
An Investigation of Direct and Indirect Learning Strategies in Learning
18 pages
AI Prompt Engineering Workshop - Week 3 Cheat Sheet
No ratings yet
AI Prompt Engineering Workshop - Week 3 Cheat Sheet
6 pages
Merged
No ratings yet
Merged
28 pages
Planejamento Bimestral 9º
No ratings yet
Planejamento Bimestral 9º
1 page
Language Acq Theories Chart
No ratings yet
Language Acq Theories Chart
2 pages
Class XI Mock Set 1 Psychology
No ratings yet
Class XI Mock Set 1 Psychology
3 pages
PERDEV
No ratings yet
PERDEV
12 pages
Prompt Engineering 201 Advanced Methods and Toolkits - AI, Software, Tech, and People. Not in That Order. by X
No ratings yet
Prompt Engineering 201 Advanced Methods and Toolkits - AI, Software, Tech, and People. Not in That Order. by X
2 pages
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
100% (2)
Llama3, LangGraph and Elasticsearch - Build A Local Agent For Vector Search - Search Labs
48 pages
Guide 4 Prompt Engineering
No ratings yet
Guide 4 Prompt Engineering
1 page
Enhancing AI Systems With Agentic Workflows Patterns in Large Language Model
No ratings yet
Enhancing AI Systems With Agentic Workflows Patterns in Large Language Model
6 pages
Strategic Project Management Made Simple: Practical Tools for Leaders and Teams
From Everand
Strategic Project Management Made Simple: Practical Tools for Leaders and Teams
Terry Schmidt
3.5/5 (8)
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
From Everand
Reinforcement Learning Explained - A Step-by-Step Guide to Reward-Driven AI
Luka Nikolic
No ratings yet