Agents IA
Agents IA
Welcome to AI agents. After watching this video, you will be able to explain what AI agents are and
discuss the characteristics and types of AI agents. Additionally, you will be able to explore multi-agent
systems and the various applications of AI agents. Did you know? The global AI market is expected to
increase to $1.8 trillion within the next seven years with a staggering compound annual growth rate, or
CAGR, of 36.6%. According to Gartner, using AI agents in customer service will save companies around
$80 billion by reducing labor costs. Understanding AI agents is crucial as it enables various businesses to
use automation and data-driven decision making, further improving efficiency, innovation, and
competitiveness in the respective fields. Let's define what AI agents are. AI agents are software
programs that interact with their environment, collect and process data, and perform tasks on their own
to achieve the goals set by humans. They can make decisions, solve problems and adapt to new
information without needing constant human help. So how do AI agents work? Let's take the example of
a self-driving car. It includes several key phrases, perception, where AI agents use sensors like cameras
and radar in a car to gather data about the environment, spotting vehicles, pedestrians, and road signs.
By understanding phase algorithms process this data to identify objects, their speed and movement.
Then you have decision-making where the agent uses a knowledge base and logic and determines when
to accelerate, brake, or turn. Further, in action phase, actuators execute these decisions, controlling the
car's steering and brakes to navigate safely. Finally, learning phase, where the agent improves over time
using machine learning, analyzing past experiences to enhance performance. For a self-driving car, this
means continually learning from its driving experiences. AI agents possess several key characteristics
such as social ability, autonomy, reactiveness, and proactiveness. Social ability refers to an agent's
capacity to communicate and collaborate with other agents and entities in its environment. Autonomy is
the agent's ability to operate independently and make decisions without external help. Reactiveness
allows agents to respond immediately to changes in their environment, while proactiveness enables
them to take initiative and make decisions to achieve their objectives. Let's discuss the areas where each
of the characteristics can be used. The social ability feature of AI agents is extensively used in the
healthcare sector. In healthcare, chatbots use social skills to give medical advice, check symptoms, and
schedule appointments and reminders. Self driving cars use the autonomy feature to navigate roads,
make driving decisions, and operate without human assistance. Reactiveness is employed in
thermostats, which adjust home temperatures depending on real-time data. In industries, predictive
maintenance systems use the reactiveness feature to analyze equipment data and schedule
maintenance before breakdowns occur. In today's fast changing tech world, managing and coordinating
many independent systems is becoming more important. The need appears in different areas, like
managing resources spread over a large area, operating smart transportation systems, and creating
complex simulations. This is where multi-agent systems come in. In multi-agent systems, multiple agents
work together to achieve individual and collective goals. Multi-agent systems enable distributed
problem-solving, cooperative decision-making, and emergent behaviors from agent interactions. These
multi-agent systems are used in various fields, including online marketplaces, robotic coordination, and
traffic management. In online marketplaces, buyer and seller agents negotiate prices and manage
transactions for their users, facilitating efficient and automated commerce. In robotic coordination,
teams of robots collaborate on complex tasks such as warehouse logistics or search and rescue missions.
In traffic management, agents representing autonomous cars communicate to optimize traffic flow and
reduce congestion. In today's digital age, tech giants employ AI agents in their services and products.
Google integrates AI agents into YouTube for content recommendation and moderation, analyzing user
behavior to suggest videos and manage content. Additionally, AI agents help in detecting and removing
inappropriate content, enforcing community guidelines, and managing copyright issues. In Gmail, AI
agents power features like smart compose and smart reply, which suggest responses based on the
email's content. The AI agents also help organize emails, filter out spam, and find important messages.
Google Maps uses AI agents for real-time navigation, gathering real time-data from various sources to
provide accurate route recommendations, estimate travel times, and offer alternative routes to avoid
traffic jams. Amazon's Alexa uses AI for voice recognition, understanding natural language, and
controlling smart home devices. The Amazon.com e-commerce platform uses AI to recommend
products, improve search results, and manage inventory. Amazon Web Services, or AWS, offers AI
services for developers such as machine learning models, natural language processing, and computer
vision tools. As we enter a new era, AI agents are becoming more than just tools. They are partners in
making strategic decisions and engaging with customers. Their impact will continue to increase. With
this, you've reached the end of the video. In this video, you learned that agents in AI are software
programs that engage with surroundings, process data, and autonomously carry out tasks to meet
human-defined goals. You also learned about the key traits of AI agents. Social ability means agents can
talk and work with others, autonomy means they can work alone and decide things by themselves.
Reactiveness lets them quickly react to changes, and proactiveness means they can take charge and
make decisions to reach their goals. Finally, you learned that multi-agent systems facilitate distributed
problem-solving and cooperative decision-making, and how the same can be applied in varied areas
such as online marketplaces, robotic coordination, and traffic management.
What I can do is take my query, feed that into a model that can generate a response.
I think we can all expect that this answer will be incorrect because the model doesn't know who I am
and does not have access to this sensitive information about me. So they can help with summarizing
documents, they can help me with creating first drafts for emails and different reports I'm trying to do.
But the magic gets unlocked when I start building systems around the model and actually take the
model and integrate them into to the existing processes I have. So if we were to design a system to solve
this, I would have to give the model access to the database where my vacation data is stored. So that
same query would get fed into the language model. The difference now is the model would be
prompted to create a search query, and that would be a search query that can go into the database that
I have. So that would go and fetch the information from the database, output an answer, and then that
would go back into the model that can generate a sentence to answer. So Maya, you have ten days left
in your vacation database. So the answer that I would get here would be correct. This is an example of a
compound AI system, and it recognizes that certain problems are better solved when you apply the
principles of system design.
So what does that mean by the term system? You can understand there's multiple components, so
systems are inherently modular. I can have a model, I can choose between tuned models, large language
models, image generation models, but also I have programmatic components that can come around it.
So I can have output verifiers, I can have programs that can take a query and then break it down to
increase the chances of the answer being correct. I can combine that with searching databases, I can
combine that with different tools. So when we talking about a system approaches, I can break down
what I desire my program to do and pick the right components to be able to solve that. And this is
inherently easier to solve for than tuning a model. So that makes this much faster and quicker to adapt.
Okay, so the example I used below is an example of a compound AI system. You also might be popular
with retrieval augmented generation, which is one of the most popular and commonly used compound
AI systems out there. Most rag systems and the example I used below are defined in a certain way. So if I
bring a very different query, let's say I ask about the weather. In this example here, it's going to fail,
because the path that this program has to follow is to always search my vacation policy database, and
that has nothing to do with the weather. So when we say the path to answer a query, we are talking
about something called the control logic of a program.
So, compound AI systems, we said most of them have programmatic control logic. So that was
something that I defined myself as the human. Now let's talk about where do agents come in? One
other way of controlling the logic of a compound AI system is to put, put a large language model in
charge. And this is only possible because we're seeing tremendous improvements in the capabilities of
reasoning of large language models. So large language models, you can feed them complex problems,
and you can prompt them to break them down and come up with a plan on how to tackle it. Another
way to think about it is on one end of the spectrum, I'm telling my system to think fast, act as
programmed, don't deviate from the instructions I've given you. And on the other end of the spectrum,
you're designing your system to think slow. So create a plan. Attack each part of the plan, see where you
get stuck, see if you need to readjust the plan. So I might give you a complex question, and if you would
just give me the first answer that pops into your head, very likely that answer might be wrong, but you
have higher chances of success if you break it down. Understand, where do you need external help to
solve some parts of the problem, and maybe take an afternoon to solve it? And when we put LLMs in
charge of the logic, this is when we're talking about an agentic approach. So let's break down the
components of LLM agents. The first capability is the ability to reason, which we talked about. So this is
putting the model at the core of how problems are being solved. The model will be prompted to come
up with a plan and to reason about each step of the process along the way. Another capability of agents
is the ability to act, and this is done by external programs that are known in the industry as tools. So
tools are external pieces of the program, and the model can define when to call them and how to call
them in order to best execute the solution to the question they've been asked. So an example of a tool
could be search, searching the web, searching a database at their disposal. Another example could be a
calculator to do some math. This could be a piece of program code that maybe might manipulate a
database. This can also be another language model that maybe you're trying to do a translation task and
you want a model that can be able to do that. And there's so many other possibilities of what can do
there. So these could be APIs, basically any piece of external program you want to give your model
access to. Third capability, that is the ability to access memory. And the term memory can mean a
couple of things. So we talked about the models, thinking through the program, kind of how you think
out loud when you're trying to solve through a problem. So those inner logs can be stored and can be
useful to retrieve at different points in time. But also this could be the history of conversations that you
as a human had when interacting with the agent, and that would allow to make the experience much
more personalized. So the way of configuring AI agents, there's many ways to approach it. One of the
more most popular ways of going about it is through something called react, which as you can tell by the
name, combines the reasoning and act components of LLM agents. So let's make this very concrete.
What happens when I configure a react agent? You have your user query that gets fed into a model, so
an LLM. The LLM is given a prompt, so the instructions that's given is don't give me the first answer that
pops to you. Think slow, plan your work, and then try to execute something, try to act. And when you
want to act, you can define whether if you want to use external tools to help you come up with the
solution. Once you get, you call a tool and you get an answer, maybe it gave you the wrong answer or it
came up with an error. You can observe that. So the LLM would observe the answer, would determine if
it does answer the question at hand, or whether it needs to iterate on the plan and tackle it differently
up until I get to a final answer. So let's go back and make this very concrete again. Let's talk about my
vacation example. And as you can tell I'm really excited to go on one, so I want to take the rest of my
vacation days. I'm planning to go on to Florida next month. I'm planning on being outdoors a lot and I'm
prone to burning. So I want to know what is the number of two ounce sunscreen bottles that I should
bring with me. And this is a complex problem, there's a number of things to plan. One is how many
vacation days am I planning to take? And maybe that is information the system can retrieve from its
memory, because I asked that question before. Two is how many hours do I plan to be in the sun? I said
I plan to be in there a lot. So maybe that would mean looking into the weather forecast for next month
in Florida and seeing what is the average sun hours that are expected. Three is maybe going to a public
health website to understand what is the recommended dosage of sunscreen per hour in the sun. And
then four, doing some math to be able to determine how much of that sunscreen fits into two ounce
bottles. So that's quite complicated. But what's really powerful here is there's so many paths that can be
explored in order to solve a problem. So this makes the system quite modular, and I can hit it with much
more complex problems. So going back to the concept of compound AI systems, compound AI systems
are here to stay. What we're going to observe this year is that they're going to become more agentic.
The way that I like to think about it is you have a sliding scale of LLM autonomy,
and you would, the person defining the system would examine what trade offs they want in terms of
autonomy in the system for certain problems, especially problems that are narrow, well defined, so you
don't expect someone to ask about the weather when they mean to ask about vacations. So a narrow
problem set, you can define a narrow system like this one. It's more efficient to go the programmatic
route because every single query will be answered the same way. If I were to apply the agentic approach
here, there might be unnecessarily looping and iteration. So for narrow problems, programmatic
approach can be more efficient than going the agentic route. But if I expect to have a system accomplish
very complex tasks, like say, trying to solve GitHub issues independently and handle a variety of queries,
a spectrum of queries. This is where an agentic route could be helpful because it would take you too
much effort to configure every single path in the system. And we're still in the early days of agentic
systems. We're seeing rapid progress when you combine the effects of system design with agentic
behavior. And, of course, you will have a human in the loop in most cases, as the accuracy is improving.
Robotique et automatisation
Welcome to robotics and automation. After watching this video, you will be able to define the term
robotics and how robots work. You will further discuss how AI technologies are integrated into robotics.
You will also be able to examine how robotics enable automation in various industries. Lastly, you will
explore the concept of robotic process automation or RPA and its applications. Across the industrial
world, companies are investing heavily in robotics and automation. With the advancement of
technology, robots have become more capable of performing complex tasks that were previously done
by humans. This shift is driven by the need for higher productivity, improved quality, and the ability to
operate continuously. According to a recent global industrial robotic survey conducted by McKenzie,
many industries plan to spend 25% of their budgets on automated systems over the next four years. Let
us understand the term robotics and how it enables automation. Robotics involves designing,
constructing, and operating robots, machines that can perform tasks by themselves or with some help.
Robots are complex systems made up of several key components, sensors, actuators, and controllers.
Sensors act as the robots eyes and ears, gathering information about the environment. For instance, a
camera sensor helps a robot navigate by capturing images and detecting obstacles. While temperature
sensors ensure the robot operates within safe limits. Actuators are the robots muscles, enabling
movement and interaction with the environment. This includes electric motors which move robotic arms
and wheels, and hydraulic and pneumatic actuators, which control various parts of the robot.
Controllers act as the robot's brain and run software that determines the robot's actions. For instance, in
a robotic arm, the controller interprets sensor data and directs the motors to move the arm accurately.
Robots use these components and work around various AI technologies, such as artificial intelligence,
machine learning, and natural language processing or NLP to enable automation. Let's explore a few
such examples. A robot vacuum cleaner uses AI to clean your home efficiently. It has sensors to detect
dirt, walls, and furniture, and an AI controller to map your home and choose the best cleaning path. The
actuators adjust the vacuums wheels and brush to clean surfaces, such as carpets and hardwood floors.
A smart lawn mower uses machine learning to mow your lawn efficiently. It has sensors to detect grass
height, obstacles, and lawn boundaries. The controller learns the lawns layout and grass growth patterns
to optimize mowing routes and schedules. The actuators adjust the mowers wheels and blades to
ensure an even cut. A voice activated smart speaker robot uses NLP to understand and respond to your
spoken commands. It has microphones as sensors to capture your voice, an NLP controller to process
and understand language, and actuators to perform actions like playing music, providing updates, or
controlling smart home devices. Did you know that these devices act as self operating robots that do not
need human intervention? As they can figure things out on their own? Let's discuss more such
examples. Space exploration and deep sea exploration, find use of self operating robots. Space rovers,
like the ones on Mars, explore other planets, collect valuable information, and send it back to Earth all
by themselves. Similarly, underwater drones explore deep parts of the ocean, collect samples, and do
research in places that are hard for people to reach. Robots working directly alongside humans are
known as cobots or collaborative robots. Unlike traditional robots, cobots are designed to interact
directly with people, making automation more accessible and versatile. Cobots are equipped with
advanced sensors and AI technologies, allowing them to communicate and coordinate their actions to
complete tasks that require teamwork. For instance, in automobile manufacturing, one cobot can hold a
part in place while the other welds it, ensuring precision and speed. In logistics, cobots can work
together to sort and move packages efficiently. Robotics uses AI to enable automation in many
industries. In health care, surgical robots perform precise operations, reducing recovery times and
improving patient outcomes. Rehabilitation robots help patients recover mobility and strength, offering
personalized therapy. Agriculture robots plant, harvest, and monitor crops with precision, boosting
yields and reducing waste. They optimize irrigation, apply fertilizers accurately, and monitor crop health
in real time. In retail, self checkout kiosks and robots manage inventory and assist customers,
streamlining operations, and improving customer service. This reduces wait times and ensures stock
availability. Let's now discuss another concept called robotic process automation or RPA. RPA is a type of
computer software that helps create, use, and control virtual robots. These robots act like humans when
they work with digital systems and software. A PEGA system survey among 500 decision makers
employing a range of RPA solutions revealed, 63% view RPA as a major component of overall digital
transformation, 23% said it is the most important component, whereas 14% said it is a minor
component. RPA can automate repetitive and rule-based tasks across various industries. In finance and
accounting, RPA automates invoice processing by capturing, validating, and posting invoices. It also
manages payroll calculations, deductions, and payments, and generates financial reports by extracting
data from multiple sources. In human resources, RPA streamlines employ onboarding by collecting
documents, entering data, and setting up profiles, leave management by handling leave requests and
updating records, recruitment by screening resumes, scheduling interviews, and sending
communication. In this video, you learn that robotics involves designing, constructing, and operating
robots to perform tasks with or without humans. It utilizes sensors to observe processors for decision
making, actuators for movement, integrating technologies like AI, machine learning, and NLP for
automation. Further, you learned about cobots, which collaborate with humans using advanced sensors
and AI, communicating and coordinating tasks that require teamwork. Lastly, you learned that robotic
process automation or RPA automates repetitive tasks and helps create, use, and control virtual robots.
And these, in fact, are two behaviors that are often observed as problematic when interacting with large
language models. They are LLM challenges. Now, what would have happened if I'd taken a beat and first
gone and looked up the answer on a reputable source like NASA? Well, then I would have been able to
say, okay, so the answer is Saturn with 146 moons. And in fact, this keeps changing because scientists
keep on discovering more and more moons. So I have now grounded my answer in something more
believable. I have not hallucinated or made up an answer. And by the way, I didn't leak personal
information about how long ago it's been since I was obsessed with space. All right, so what does this
have to do with large language models? Well, how would a large language model have answered this
question? So let's say that I have a user asking this question about moons. A large language model
would confidently say, okay, I have been trained. And from what I know in my parameters during my
training, the answer is Jupiter.
The answer is wrong. But we don't know. The large language model is very confident in what it
answered. Now, what happens when you add this retrieval augmented part here? What does that
mean? That means that now, instead of just relying on what the LLM knows, we are adding a content
store. This could be open like the Internet, this could be closed like some collection of documents,
collection of policies, whatever. The point though now is that the LLM first goes and talks to the content
store and says, hey, can you retrieve from me information that is relevant to what the user's query was?
And now with this retriever augmented answer, it's not Jupiter anymore, we know that it is Saturn.
What does this look like? Well, first, user prompts the LLM with their question. They say, this is what my
question was. And originally, if we're just talking to a generative model, the generative model says, okay,
I know the response. Here it is. Here's my response. But now, in the RAG framework, the generative
model actually has an instruction that says, no, no, no, first go and retrieve relevant content. Combine
that with the user's question, and only then generate the answer. So the prompt now has three parts.
The instruction to pay attention to the retrieved content together with the user's question, now give a
response. And in fact, now you can give evidence for why your response was what it was. So now
hopefully you can see how does RAG help the two LLM challenges that I had mentioned before? So, first
of all, I'll start with the out of date part. Now, instead of having to retrain your model, if new
information comes up, like, hey, we found some more moons now to Jupiter again, maybe sadder again.
In the future, all you have to do is you augment your data store with new information, updated
information. So now, the next time that a user comes and asks the question, we're ready, we just go
ahead and retrieve the most up to date information. The second problem, source. Well, the large
language model is now being instructed to pay attention to primary source data before giving its
response, and in fact, now being able to give evidence. This makes it less likely to hallucinate or to leak
data because it is less likely to rely only on information that it learned during training. It also allows us to
get the model to have a behavior that can be very positive, which is knowing when to say, I don't know.
If the user's question cannot be reliably answered based on your data store, the model should say, I
don't know. Well, instead of making up something that is believable and may mislead the user. This can
have a negative effect as well, though, because if the retriever is not sufficiently good to give the large
language model the best, most highest quality grounding information, then maybe the user's query that
is answerable doesn't get an answer.
Tesla leverages AI in the automotive industry, specifically in autonomous driving technology. Tesla's
autopilot systems leverage AI algorithms, including computer vision and deep learning, to enhance the
performance of advanced driving assistance features. It analyzes data from cameras, radar, and sensors
to navigate, detect obstacles, and make real-time driving decisions, improving safety and efficiency. The
summon feature in Tesla's autopilot system uses AI to detect and react to its surroundings. It enables
drivers to use smartphone apps to remotely park vehicles within a specified range. Additionally, Tesla's
battery management system, or BMS, uses AI to predict energy requirements based on driving
conditions and driver behavior. Adjusting acceleration and climate control to conserve energy. It also
helps plan efficient routes with the navigation system, considering traffic and elevation. Lets now
explore a general roadmap consisting of various steps to adopt AI in business operations. Adopting AI is
a multifaceted process that requires careful planning and execution. From defining business goals and
identifying suitable use cases to data readiness and building AI capabilities, deploying AI solutions, and
monitoring and optimizing AI systems, each step is crucial for success. The journey begins with defining
the business goals. Businesses need to identify the specific problems they aim to solve with AI, engaging
stakeholders to understand their needs and outline clear objectives. It is crucial to ask questions like,
what are the pain points? And how can AI help achieve business goals?
After setting the business goals, the next step is to identify suitable use cases. Perform a thorough
analysis of your business processes to find specific areas where AI can be effectively applied. Focus on
tasks that are repetitive, time consuming, data driven or require complex decision making.
One of the most critical steps in adopting AI is data readiness. It is crucial for businesses to collect, clean
and organize their data to ensure it is ready for AI algorithms. Data readiness involves removing
inconsistencies, filling gaps, and ensuring the data is relevant to the problem. The next step is to build AI
capabilities within the organization. This includes training employees on AI concepts, tools, and
techniques. Investing in upskilling your workforce ensures they can effectively use and manage AI
technologies. The training program helps employees understand how AI can be applied to specific tasks.
Next, AI solutions are deployed, choosing the best AI tools and technologies that match your business
needs. The deployment is done in a phased manner, starting with pilot projects and gradually scaling up.
It is necessary to ensure that AI solutions are integrated seamlessly with the existing systems and
processes. In the final stage, after the AI solutions are deployed into the business environment, it
requires continuous monitoring and optimization to track performance regularly and identify and
resolve issues. This ensures that the AI models are updated and refined, making necessary adjustments
over time.
In this video, you learned that AI can be adopted in businesses to increase efficiency, enhance decision
making, improve customer service, and drive innovation. You also gained insights on a few real world
examples of companies that have adopted AI and obtained benefits from it. Finally, you learned general
steps for adopting AI in business. First, as part of the business requirement analysis, you set clear goals.
Next, you need to identify suitable use cases, identifying areas to apply AI, then prepare data by first
collecting it and then cleaning and organizing it. Further, build AI capabilities to train employees and
develop the necessary setup. Deploy AI solutions, integrate them with existing systems. And finally,
monitor and optimize AI systems to ensure they work well.
Les agents en IA sont des programmes logiciels qui s'engagent dans l'environnement, traitent
des données et exécutent des tâches de manière autonome pour atteindre des objectifs définis
par l'homme.
La robotique implique la conception, la construction et l'utilisation de robots pour effectuer des
tâches avec ou sans l'aide de l'homme.
Les cobots collaborent avec les humains en utilisant des capteurs avancés et l'IA, en
communiquant et en coordonnant les tâches qui nécessitent un travail d'équipe.
L'automatisation des processus robotiques (RPA) automatise les tâches répétitives et permet de
créer, d'utiliser et de contrôler des robots virtuels.
L'IA automatise les tâches répétitives pour les entreprises, comme la saisie de données, la
planification et la génération de rapports, et permet aux employés de se concentrer davantage
sur le travail créatif et stratégique.
L'IA génère des idées innovantes et surmonte les blocages créatifs en aidant les entreprises à
entrer en résonance avec le public cible, ce qui se traduit par un engagement et des ventes plus
élevés.
Les premières IA utilisaient des systèmes basés sur des règles mais manquaient de créativité.
Des décennies plus tard, le Deep learning a permis à l'IA d'apprendre et de s'adapter comme le
cerveau humain.
Les réseaux antagonistes génératifs (GAN) ont marqué une percée, générant des images, de la
musique, du design et de la médecine de haute qualité. Ils ont fait progresser l'IA générative
dans les domaines de la musique, de la conception et de la médecine.
Analyse de données : Analyse d'ensembles de données complexes pour identifier des modèles
et générer des rapports exploitables
L'IA peut être adoptée dans les entreprises pour accroître l'efficacité, améliorer la prise de
décision, améliorer le service à la clientèle et favoriser l'innovation.
Définir les objectifs de l'entreprise : Identifier les problèmes et fixer des objectifs clairs
Identifier les cas d'utilisation appropriés : Identifier les domaines dans lesquels appliquer l'IA
Préparation des données : Collecter les données, puis les nettoyer et les organiser
Renforcer les capacités en matière d'IA : Former les employés et mettre en place la
configuration nécessaire
Les outils d'IA peuvent aider les organisations à améliorer leur efficacité en automatisant
l'engagement des employés, le recrutement, l'intégration et la gestion des performances. Les
chatbots IA ont transformé les interactions avec les clients en automatisant les réponses et en
garantissant la cohérence.
Les professionnels de différents domaines tirent parti des outils et plateformes d'IA pour
améliorer la productivité et l'efficacité.
Copy.ai, Jasper et Synthesia : Aident les spécialistes du marketing et les créateurs de contenu
Éthicien de l'IA : Veiller à ce que les systèmes d'IA soient développés et utilisés de manière
éthique, en tenant compte des implications sociales et morales.
Stratèges en IA : Élaborent des stratégies à long terme pour la mise en œuvre de l'IA afin
d'atteindre les objectifs de l'entreprise.
Spécialistes du marketing de l'IA : Utilisent l'IA pour analyser les données des consommateurs
et optimiser les stratégies marketing.