Blogs Nvidia Com Blog What-Is-Retrieval-Augmented-Generation
Blogs Nvidia Com Blog What-Is-Retrieval-Augmented-Generation
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
u Share
Reading Time: 6 mins
g
NVIDIA and our partners useEditor’s
cookiesnote: This article
and other was
tools to updated
collect on September
information 23, 2024.
you provide as
f
well as your interaction with our websites for performance improvement, analytics, and to
To understand
assist in our marketing efforts. the
We also share latest
this advancewith
information in generative AI, imagine a
our social media,
courtroom.
advertising, and analytics partners. You can manage your cookie settings by clicking on
"Manage Settings". Please see our Cookie Policy for more information.
Judges hear and decide cases based on their general understanding
h
of the law. Sometimes a case — like a malpractice suit or a labor
Manage Settings
dispute — requires special expertise, so judges send Agree
court clerks to a
d
law library, looking for precedents and specific cases they can cite.
Patrick Lewis, lead author of the 2020 paper that coined the term, Waterways Wonder:
apologized for the unflattering acronym that now describes a Clearbot Autonomously
growing family of methods across hundreds of papers and dozens of Cleans Waters With
Energy-Efficient AI
commercial services he believes represent the future of generative
AI.
How Digital Twins Are
“We definitely would have put more thought into the name had we Driving Efficiency and
known our work would become so widespread,” Lewis said in an Cutting Emissions in
Manufacturing
interview from Singapore, where he was sharing his ideas with a
regional conference of database developers.
Get Ready to Slay:
“We always planned to have a nicer sounding name, but when it came ‘Dragon Age: The
Veilguard’ to Soar Into
time to write the paper, no one had a better idea,” said Lewis, who
GeForce NOW at Launch
now leads a RAG team at AI startup Cohere.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Retrieval-augmented generation (RAG) is a Productivity,’ NVIDIA
technique for enhancing the accuracy and CEO Says as Lenovo
Brings Smarter AI to
reliability of generative AI models with facts
Enterprises
fetched from external sources.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
What’s more, the technique can help models clear up ambiguity in a
user query. It also reduces the possibility a model will make a wrong
guess, a phenomenon sometimes called hallucination.
That makes the method faster and less expensive than retraining a
model with additional datasets. And it lets users hot-swap new
sources on the fly.
In fact, almost any business can turn its technical or policy manuals,
videos or logs into resources called knowledge bases that can
enhance LLMs. These sources can enable use cases such as
customer or field support, employee training and developer
productivity.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
elements users need to create their own applications with this new
method.
Once companies get familiar with RAG, they can combine a variety of
off-the-shelf or custom LLMs with internal or external knowledge
bases to create a wide range of assistants that help their employees
and customers.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
An example application for RAG on a PC.
PCs equipped with NVIDIA RTX GPUs can now run some AI models
locally. By using RAG on a PC, users can link to a private knowledge
source – whether that be emails, notes or articles – to improve
responses. The user can then feel confident that their data source,
prompts and response all remain private and secure.
The roots of the technique go back at least to the early 1970s. That’s
when researchers in information retrieval prototyped what they called
question-answering systems, apps that use natural language
processing (NLP) to access text, initially in narrow topics such as
baseball.
The concepts behind this kind of text mining have remained fairly
constant over the years. But the machine learning engines driving
them have grown significantly, increasing their usefulness and
popularity.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
In the mid-1990s, the Ask Jeeves service, now Ask.com, popularized
question answering with its mascot of a well-dressed valet. IBM’s
Watson became a TV celebrity in 2011 when it handily beat two
human champions on the Jeopardy! game show.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
The IBM Watson question-answering system became a celebrity when it won big on the TV
game show Jeopardy!
“I showed my supervisor and he said, ‘Whoa, take the win. This sort of
thing doesn’t happen very often,’ because these workflows can be
hard to set up correctly the first time,” he said.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
At a high level, here’s how an NVIDIA technical brief describes the
RAG process.
When users ask an LLM a question, the AI model sends the query to
another model that converts it into a numeric format so machines
can read it. The numeric version of the query is sometimes called an
embedding or a vector.
Finally, the LLM combines the retrieved words and its own response
to the query into a final answer it presents to the user, potentially
citing sources the embedding model found.
PDFmyURL converts web pages and even full websites to PDF easily and quickly.
Corporate Information Get Involved News & Events
Technical Training
PDFmyURL converts web pages and even full websites to PDF easily and quickly.