0% found this document useful (0 votes)
196 views17 pages

Mastering Agents

The e-book 'Mastering AI Agents' explores the use of AI agents powered by large language models (LLMs) to perform end-to-end tasks, enhancing decision-making and task execution. It covers various types of AI agents, their applications, frameworks for building them, evaluation methods, and common pitfalls in deployment. The book aims to guide leaders in integrating AI agents into their workflows effectively.

Uploaded by

Titty Mathew
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
196 views17 pages

Mastering Agents

The e-book 'Mastering AI Agents' explores the use of AI agents powered by large language models (LLMs) to perform end-to-end tasks, enhancing decision-making and task execution. It covers various types of AI agents, their applications, frameworks for building them, evaluation methods, and common pitfalls in deployment. The book aims to guide leaders in integrating AI agents into their workflows effectively.

Uploaded by

Titty Mathew
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 17

Mastering AI Agents

Preface
In our previous e-book, “Mastering RAG,” our goal was clear: building enterprise-grade
RAG systems, productionizing them, monitoring their performance, and improving them.
At the core of it, we understood how RAG systems enhance an LLM’s ability to work with
specific knowledge by providing relevant context.

In this e-book, we’re taking a step further and asking, “How do we use LLMs to
accomplish end-to-end tasks?” This singular question opens up a door: AI agents. A RAG
system helps an LLM provide accurate answers based on given context. An AI agent
takes that answer and actually does something with it — makes decisions, executes
tasks, or coordinates multiple steps to achieve a goal.

A RAG-enhanced LLM could help answer questions about policy details by pulling relevant
information. But an AI agent could actually process the claim end-to-end by analyzing the
documentation, checking policy compliance, calculating payments, and even coordinating
with other systems or agents when needed.

The ideas behind agents has existed for years. It can be a software program or another
computational entity that can accept input from its environment and take actions based
on rules. With AI agents, you’re getting what has never been there before: the ability to
understand the context without predefined rules, the capacity to tune decisions based on
context, and learning from every interaction. What you’re getting is not just a bot working
with a fixed set of rules but a system capable of making advanced decisions in real-time.

Companies have quickly adapted, adopted, and integrated AI agents into their workflows.
Capgemini’s research found that “10% of organizations already use AI agents, more than
half plan to use them in 2025 and 82% plan to integrate them within the next three years.”

2
Mastering AI Agents

This e-book aims to be your go-to guide for all things AI agents. If you’re a leader looking
to guide your company to build successful agentic applications, this e-book can serve
as a great guide to get you started. We also explore approaches to measuring how well
your AI agents perform, as well as common pitfalls you may encounter when designing,
measuring, and improving them.

The book is divided into five chapters:

Chapter 1 introduces AI agents, their optimal applications, and scenarios where they
might be excessive. It covers various agent types and includes three real-world use cases
to illustrate their potential.

Chapter 2 details three frameworks—LangGraph, Autogen, and CrewAI—with evaluation


criteria to help choose the best fit. It ends with case studies of companies using these
frameworks for specific AI tasks.

Chapter 3 explores the evaluation of an AI agent through a step-by-step example of a


finance research agent.

Chapter 4 explores how to measure agent performance across systems, task completion,
quality control, and tool interaction, supported by five detailed use cases.

Chapter 5 addresses why many AI agents fail and offers practical solutions for successful
AI deployment.

We hope this book will be a great stepping stone in your journey to build trustworthy
agentic systems.

- Pratik Bhavsar

3
Contents
Chapter 1: Chapter 2:
What are AI agents Frameworks for
Building Agents

7/27 28/43

Types of AI Agents 10 LangGraph vs. AutoGen vs. 30


CrewAI
When to Use Agents? 21
Practical Considerations 31
When Not to Use Agents? 22
What Tools and Functionalities Do 31
10 Questions to Ask Before You 23 They Support?
Consider an AI Agent
How Well Do They Maintain the 32
3 Interesting Real-World Use 25 Context?
Cases of AI Agents
Are They Well-Organized and Easy 33
to Interpret?

What’s the Quality of 34


Documentation?

Do They Provide Multi-Agent 34


Support?

What About Caching? 35


Looking at the Replay Functionality 35

What About Code Execution? 35

Human in the Loop Support? 37

Popular Use Cases Centered 40


Around These Frameworks
Mastering AI Agents

Chapter 3: Chapter 4:
How to Evaluate Agents Metrics for Evaluating
AI Agents

44/61 62/79

Requirements 44 Case Study 1: Advancing the 63


Claims Processing Agent
Defining the Problem 44
Case Study 2: Optimizing the Tax 66
Define the React Agent 45 Audit Agent
State Management 46 Case Study 3: Elevating the Stock 69
Analysis Agent
Create the Graph 47
Case Study 4: Upgrading the 72
Create the LLM Judge 54 Coding Agent
Use Galileo Callbacks 55 Case Study 5: Enhancing the Lead 75
Scoring Agent

5
Mastering AI Agents

Chapter 5:
Why Most AI Agents Fail &
How to Fix Them

80/95

Development Issues 81

LLM Issues 82

Production Issues 86

6
01
CHAPTER

WHAT ARE AI
AGENTS?
Mastering AI Agents

What are AI agents?


Let’s start by understanding what AI agents are and which tasks you should use them for
to maximize their potential.

AI agents are software applications that use large language models (LLMs) to
autonomously perform specific tasks, ranging from answering research questions to
handling backend services. They’re incredibly useful for tasks that demand complex
decision-making, autonomy, and adaptability. You might find them especially helpful in
dynamic environments where the workflow involves multiple steps or interactions that
could benefit from automation.

Salesforce estimates that salespersons spend 71% of their time on non-selling tasks (like
administrative tasks and manually entering data). Imagine the time that could have gone
into directly engaging with customers, developing deeper relationships, and ultimately
closing more sales. This is true across multiple domains and applications: finance, health
care, tech, marketing, sales, and more.

Let’s use an example to understand this better. Imagine you run an online retail business
and receive hundreds of customer inquiries every day about order statuses, product
details, and shipping information. Instead of answering each and every query yourself, you
can integrate an AI agent into your solution to handle these queries.

Here’s how it would typically work:

1. Customer Interaction
A customer messages your service asking, “When will my order ship?”

2. Data Retrieval
The AI agent accesses the order management system to find the specific order details.

3. Response Generation
Based on the data retrieved, the agent automatically provides an updates to the customer,
such as sending “Your order will ship tomorrow and you’ll receive a tracking link via email
once it’s on its way.”

8
Mastering AI Agents

The return to having an AI agent is multifold here:

• Super quick response time that keeps your customers happy


• Frees up your human staff to handle more complex queries and issues
• Improves your overall productivity and efficiency

Fig 1.1 is an example of how agents are leveraged for code generation.

Conversation

Switch to Backup User

Write <file> <content>

Eval Environment Run test AI Agent

Error <failure log>


Actions OpenAI GPT-4
Retrieval signature

Repository
Content result

Write <file> <fix>

Run test

Success

Fig 1.1: Automated AI-Driven Development using AI agents

9
Mastering AI Agents

Types of AI Agents
Now that we’re familiar with what AI agents are, let’s look at different types of AI
agents along with their characteristics, examples, and when you can use them.
See Table 1.1 below to get a quick idea of the types of AI agents and where and
when you can use them.

Name of the agent Key Characteristics Examples Best For

RPA, email Repetitive tasks,


Fixed Automation: The No intelligence, predictable
autoresponders, basic structured data, no need
Digital Assembly Line behavior, limited scope
scripts for adaptability

LLM-Enhanced: Email filters, content Flexible tasks, high-


Context-aware, rule-
Smarter, but Not moderation, support volume/low-stakes, cost-
constrained, stateless
Einstein ticket routing sensitive scenarios

Multi-step workflows, Travel planners, AI Strategic planning, multi-


ReAct: Reasoning
dynamic planning, basic dungeon masters, stage queries, dynamic
Meets Action
problem-solving project planning tools adjustments

High-stakes decisions,
External knowledge Legal research tools,
ReAct + RAG: domain-specific tasks,
access, low hallucinations, medical assistants,
Grounded Intelligence real-time knowledge
real-time data technical support
needs

Multi-tool integration, Complex workflows


Tool-Enhanced: The Code generation tools,
dynamic execution, high requiring multiple tools
Multi-Taskers data analysis bots
automation and APIs

Meta-cognition, Tasks requiring


Self-Reflecting: The Self-evaluating systems,
explainability, self- accountability and
Philosophers QA agents
improvement improvement

Memory-Enhanced: Long-term memory, Individualized


Project management AI,
The Personalized personalization, adaptive experiences, long-term
personalized assistants
Powerhouses learning interactions

Environment Active environment control, System control, IoT


AutoGPT, adaptive
Controllers: The World autonomous operation, integration, autonomous
robotics, smart cities
Shapers feedback-driven operations

Autonomous learning, Neural networks, swarm Cutting-edge research,


Self-Learning: The
adaptive/scalable, AI, financial prediction autonomous learning
Evolutionaries
evolutionary behavior models systems

Table 1.1: Types of agents and their characteristics

10
Mastering AI Agents

Fixed Automation –
The Digital Assembly Line
This level of AI agents represents the simplest and most rigid form of automation. These
agents don’t adapt or think—they just execute pre-programmed instructions. They are
like assembly-line workers in a digital factory: efficient but inflexible. Great for repetitive
tasks, but throw them a curveball, and they’ll freeze faster than Internet Explorer.
(See Table 1.2 below)

Feature Description

Intelligence No learning, adaptation, or memory.

Behavior Predictable and consistent, follows pre-defined rules.

Scope Limited to repetitive, well-defined tasks. Struggles with unexpected scenarios.

Best Use Cases Routine tasks, structured data, situations with minimal need for adaptability.

RPA for invoice processing, email autoresponders, basic scripting tools (Bash,
Examples
PowerShell).

Table 1.2: Characteristics of a fixed automation agent

The fixed automation workflow (See Fig 1.2) follows a simple, linear path. It begins when
a specific input (like a file or data) triggers the system, which consults its predefined
rulebook to determine what to do. Based on these rules, it executes the required action
and finally sends out the result or output. Think of it as a digital assembly line where
each step must be completed in exact order, without deviation.

Fixed Automation Agent

Predefined Execute Send Output /


Input Trigge
Rule Action Result

Fig 1.2: Workflow of a fixed automation agent

11
Mastering AI Agents

LLM-Enhanced –
Smarter, but Not Exactly Einstein
These agents leverage LLMs to provide contextual understanding and handle
ambiguous tasks while operating within strict boundaries. LLM-Enhanced Agents
balance intelligence and simplicity, making them highly efficient for low-complexity,
high-volume tasks. Take a look at their features below in Table 1.3.

Feature Description

Context-aware; leverages LLMs to process ambiguous inputs with contextual


Intelligence
reasoning.

Behavior Rule-constrained; decisions are validated against predefined rules or thresholds.

Scope Stateless; no long-term memory; each task is processed independently.

Tasks requiring flexibility with ambiguous inputs, high-volume/low-stakes


Best Use Cases
scenarios, and cost-sensitive situations where "close enough" is sufficient.

Examples Email filters, AI-enhanced content moderation, customer support classification.

Table 1.3: Characteristics of an LLM-enhanced agent

The workflow below (Fig 1.3) shows how these smarter agents process information:
starting with the input, the agent uses LLM capabilities to analyze and understand
the input context. This analysis then passes through rule-based constraints that keep
the agent within defined boundaries, producing an appropriate output. It’s like having
a smart assistant who understands context but still follows company policy before
making decisions.

LLM-Enhanced Agent

LLM Rule-based Output /


Input Data
(contextual analysis) Constraint Action

Fig 1.3: Workflow of a LLM-enhanced agent

12
Mastering AI Agents

ReAct –
Reasoning Meets Action
ReAct agents combine Reasoning and Action to perform tasks that involve strategic
thinking and multi-step decision-making. They break complex tasks into manageable
steps, reasoning through problems dynamically and acting based on their analysis.
These agents are like your type-A friend who plans their weekend down to the minute.
Table 1.4 lists their characteristics.

Feature Description

Reasoning and action; mimics human problem-solving by thinking through a


Intelligence
problem and executing the next step.
Handles multi-step workflows, breaking them down into smaller, actionable parts.
Behavior
Dynamically adjusts strategy based on new data.

Scope Assists with basic open-ended problem-solving, even without a direct solution path.

Strategic planning, multi-stage queries, tasks requiring dynamic adjustments, and


Best Use Cases
re-strategizing.

Language agents solving multi-step queries, AI Dungeon Masters, project planning


Examples
tools.

Table 1.4: Characteristics of a fixed ReAct agent

The ReAct workflow starts with an Input Query and then enters a dynamic cycle between
the Reasoning and Action Phase, as you’ll see in Fig 1.4. Unlike simpler agents, it can
loop between thinking and acting repeatedly until the desired outcome is achieved before
producing the final Output/Action. Think of it as a problem solver that keeps adjusting its
approach - analyzing, trying something, checking if it worked, and trying again if needed.

Fixed Automation Agent

Reasoning

Output /
Input Trigge repeat until desired outcome achieved
Action

Action Phase

Fig 1.4: Workflow of a ReAct agent

13
Mastering AI Agents

ReAct + RAG – Grounded Intelligence


Now, moving on to agents who are much more intelligent, we come to ReAct + RAG
agents that combine reasoning, action, and real-time access to external knowledge
sources. This integration allows them to make informed decisions grounded in accurate,
domain-specific data, making them ideal for high-stakes or precision-critical tasks
(especially when you add evaluations). These agents are your ultimate trivia masters with
Google on speed dial. See Table 1.5 to learn how this agent works.

Feature Description

Employs a RAG workflow, combining LLMs with external knowledge sources


Intelligence
(databases, APIs, documentation) for enhanced context and accuracy.
Uses ReAct-style reasoning to break down tasks, dynamically retrieving information
Behavior
as needed. Grounded in real-time or domain-specific knowledge.
Designed for scenarios requiring high accuracy and relevance, minimizing
Scope
hallucinations.
High-stakes decision-making, domain-specific applications, tasks with dynamic
Best Use Cases
knowledge needs (e.g., real-time updates).

Legal research tools, medical assistants referencing clinical studies, technical


Examples
troubleshooting agents.

Table 1.5: Characteristics of a ReAct + RAG agent

Starting with an Input Query, this advanced workflow combines ReAct’s reasoning-action
loop with an additional Knowledge Retrieval step. The agent cycles between Reasoning,
Action Phase, and Knowledge Retrieval (See Fig 1.5) — consulting external sources as
needed — until it reaches the desired outcome and produces an Output/Action. It’s like
having a problem solver who not only thinks and acts but also fact-checks against reliable
sources along the way.

ReAct + RAG Agent

Reasoning

Output /
Input Query repeat until desired outcome achieved
Action

Knowledge
Action Phase
Retrieval

Fig 1.5: Workflow of a ReAct + RAG agent

14
Mastering AI Agents

Tool-Enhanced – The Multi-Taskers


Tool-enhanced agents are versatile problem solvers that integrate multiple tools,
leveraging APIs, databases, and software to handle complex, multi-domain workflows.
They combine reasoning, retrieval, and execution for seamless, dynamic task
completion. Think of them as tech-savvy Swiss Army knives capable of combining
reasoning, retrieval, and execution seamlessly! (See Table 1.6)

Feature Description

Leverages APIs, databases, and software tools to perform tasks, acting as a multi-
Intelligence
tool integrator.
Handles multi-step workflows, dynamically switching between tools based on task
Behavior
requirements.
Automates repetitive or multi-stage processes by integrating and utilizing diverse
Scope
tools.
Jobs requiring diverse tools and APIs in tandem for complex or multi-stage
Best Use Cases
automation.

Code generation tools (GitHub CoPilot, Sourcegraph's Cody, Warp Terminal), data
Examples
analysis bots combining multiple APIs.

Table 1.6: Characteristics of tool-enhanced agents

Starting with an Input Query, the agent combines reasoning with a specialized tool loop.
After the initial reasoning phase, it selects the appropriate tool for the task (Tool Selection)
and then executes it (Tool Execution). This cycle repeats until the desired outcome is
achieved, leading to the final Output/Action. (See Fig 1.6)

Tool Enhanced Agent

Reasoning

Output /
Input Query repeat until desired outcome achieved
Action

Tool Execution Tool Selection

Fig 1.6: Workflow of tool-enhanced agents

15
Mastering AI Agents

Self-Reflecting – The Philosophers


These agents think about their thinking. Self-reflecting agents introduce meta-
cognition—they analyze their reasoning, assess their decisions, and learn from mistakes.
This enables them to solve tasks, explain their reasoning, and improve over time,
ensuring greater reliability and accountability. (See Table 1.7)

Feature Description

Exhibits meta-cognition, evaluating its own thought processes and decision


Intelligence
outcomes.
Provides explanations for actions, offering transparency into its reasoning. Learns
Behavior
from mistakes and improves performance over time.

Scope Suited for tasks requiring accountability and continuous improvement.

Quality assurance, sensitive decision-making where explainability and self-


Best Use Cases
improvement are crucial.

AI that explains its reasoning, self-evaluating learning systems, quality assurance


Examples
(QA) agents.

Table 1.7: Characteristics of self-reflecting agents

Starting with an Input Query, the agent goes through a cycle of Reasoning and Execution,
but with a crucial additional step: Reflection. After each execution, it reflects on its
performance and feeds those insights back into its reasoning process. This continuous
loop of thinking, doing, and learning continues until the desired outcome is achieved,
producing the final Output/Action. This is evident in Fig 1.7.

Execution

When
desired Output /
Input Query Reasoning Reflection
outcome Action
achived

Feedback Loop

Fig 1.7: Workflow of self-reflecting agents

16
Download the Full ebook

You might also like