Cloud Google Com Use-Cases Retrieval-Augmented-Generation

Uploaded by

uma5b3

What is Retrieval-Augmented Generation (RAG)?

RAG (Retrieval-Augmented Generation) is an AI framework that combines the strengths of traditional
information retrieval systems (such as search and databases) with the capabilities of generative large
language models (LLMs). By combining your data and world knowledge with LLM language skills, grounded
generation is more accurate, up-to-date, and relevant to your specific needs. Check out this e-book to
unlock your “Enterprise Truth.”

Video: Grounding for Gemini with Vertex AI Search and DIY RAG (35:30)


How does Retrieval-Augmented Generation work?

RAGs operate with a few main steps to help enhance generative AI outputs:

Retrieval and pre-processing: RAG systems use search algorithms to query external data, such as web
pages, knowledge bases, and databases. Once retrieved, the relevant information undergoes pre-processing,
which can include tokenization, stemming, and removal of stop words.

Grounded generation: The pre-processed retrieved information is then incorporated into the input of the
pre-trained LLM. This augments the LLM's context, giving it more complete information about the topic
and enabling it to generate more precise, informative, and engaging responses.
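The pre-processing step named above (tokenization, stemming, stop-word removal) can be sketched roughly as follows. This is a minimal illustration, not how any particular product does it: the stop-word list, the `naive_stem` helper, and the `preprocess` function are all hypothetical, and a production system would normally use a proper stemmer and a much larger stop-word list.

```python
import re

# Tiny illustrative stop-word list (an assumption; real lists are much longer).
STOP_WORDS = {"a", "an", "and", "are", "is", "of", "the", "to", "in"}


def naive_stem(token: str) -> str:
    """Strip a few common English suffixes; a crude stand-in for a real stemmer."""
    for suffix in ("ing", "ed", "es", "s"):
        # Only strip when at least three characters of the word remain.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(text: str) -> list[str]:
    """Lowercase and tokenize, drop stop words, then stem what is left."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [naive_stem(t) for t in tokens if t not in STOP_WORDS]


print(preprocess("The retrieved documents are indexed and ranked"))
# ['retriev', 'document', 'index', 'rank']
```

The stems are not always real words ("retriev"), which is normal for suffix-stripping: the goal is only that related word forms map to the same token.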

RAG operates by first retrieving relevant information from a database using a query generated by the LLM. This
retrieved information is then added to the LLM's input prompt, enabling it to generate more accurate and
contextually relevant text. Retrieval is usually handled by a semantic search engine that uses embeddings
stored in vector databases, together with ranking and query rewriting features, so that the results are
relevant to the query and can answer the user’s question.
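The retrieve-then-augment flow described above can be sketched in a few lines. This is a toy under stated assumptions: `embed` uses bag-of-words counts in place of a real embedding model, the in-memory `DOCS` list stands in for a vector database, and the final LLM call is left out entirely. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy 'embedding': bag-of-words counts (stands in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]


def build_prompt(query: str, passages: list[str]) -> str:
    """Prepend the retrieved passages to the question to form the LLM input."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


DOCS = [
    "Vector databases store documents as embeddings.",
    "Stemming reduces words to their root form.",
    "A re-ranker scores search results by relevance.",
]
query = "How do vector databases store documents?"
top = retrieve(query, DOCS, k=1)
prompt = build_prompt(query, top)
print(prompt)
```

The resulting `prompt` string is what would be sent to the LLM; swapping `embed` for a real embedding model and `DOCS` for a vector index keeps the same overall shape.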

Why Use RAG?

RAG offers several advantages over traditional methods of text generation, especially when dealing
with factual information or data-driven responses. Here are some key reasons why using RAG can be
beneficial:

Access to fresh information

LLMs are limited to the data they were trained on, which can lead to outdated and potentially inaccurate
responses. RAG addresses this by supplying up-to-date information to the LLM at query time.

Factual grounding

LLMs are powerful tools for generating creative and engaging text, but they can sometimes struggle with
factual accuracy. This is because LLMs are trained on massive amounts of text data, which may contain
inaccuracies or biases.

Providing “facts” to the LLM as part of the input prompt can mitigate “gen AI hallucinations.” The crux of this
approach is ensuring that the most relevant facts are provided to the LLM, and that the LLM output is entirely
grounded on those facts while also answering the user’s question and adhering to system instructions and
safety constraints.
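One way to picture "providing facts as part of the input prompt" is a template that numbers the facts and instructs the model to stay within them. The wording of the instruction and the `grounded_prompt` helper below are illustrative assumptions, not a prescribed format, and an instruction alone does not guarantee grounded output.

```python
def grounded_prompt(question: str, facts: list[str]) -> str:
    """Build a prompt that asks the model to answer only from the supplied facts."""
    numbered = "\n".join(f"[{i}] {fact}" for i, fact in enumerate(facts, start=1))
    return (
        "Answer using only the numbered facts below. "
        "Cite the fact numbers you rely on. "
        "If the facts do not contain the answer, say you do not know.\n\n"
        f"Facts:\n{numbered}\n\n"
        f"Question: {question}"
    )


example = grounded_prompt(
    "What is hybrid search?",
    ["Hybrid search combines semantic search and keyword search."],
)
print(example)
```

Numbering the facts makes it possible to check afterwards which facts an answer claims to rely on.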

Using Gemini’s long context window (LCW) is a great way to provide source materials to the LLM. If you need to
provide more information than fits into the LCW, or if you need to scale up performance, you can use a RAG
approach that will reduce the number of tokens, saving you time and cost.
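The token saving comes from sending only the best-ranked chunks instead of every source document. A rough sketch of packing ranked chunks into a fixed budget is shown below; `approx_tokens` counts whitespace-separated words, which is only an assumption standing in for a real tokenizer, and `pack_chunks` is a hypothetical helper.

```python
def approx_tokens(text: str) -> int:
    """Rough token estimate: whitespace-separated words (real tokenizers differ)."""
    return len(text.split())


def pack_chunks(ranked_chunks: list[str], budget: int) -> list[str]:
    """Walk chunks in rank order and keep each one that still fits in the budget."""
    selected: list[str] = []
    used = 0
    for chunk in ranked_chunks:
        cost = approx_tokens(chunk)
        if used + cost <= budget:
            selected.append(chunk)
            used += cost
    return selected


chunks = ["alpha beta gamma", "delta epsilon zeta eta theta", "iota kappa"]
print(pack_chunks(chunks, budget=5))
# ['alpha beta gamma', 'iota kappa']
```

Here the second chunk is skipped because it would exceed the budget, while the smaller third chunk still fits. Stopping at the first chunk that does not fit is an equally reasonable policy.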

Search with vector databases and relevancy re-rankers

RAG systems usually retrieve facts via search, and modern search engines use vector databases to
retrieve relevant documents efficiently. Vector databases store documents as embeddings in a
high-dimensional space, allowing fast and accurate retrieval based on semantic similarity. Multi-modal
embeddings can be used for images, audio, video, and more, and these media embeddings can be
retrieved alongside text embeddings or multi-language embeddings.

Advanced search engines like Vertex AI Search use semantic search and keyword search together (called
hybrid search), plus a re-ranker that scores search results to ensure the top returned results are the most
relevant. Additionally, searches perform better with a clear, focused query without misspellings, so
sophisticated search engines transform the query and fix spelling mistakes prior to lookup.
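The text does not say how Vertex AI Search merges its semantic and keyword results. As an illustration of the general idea of hybrid search, the sketch below uses reciprocal rank fusion (RRF), a common technique swapped in here as an assumption: each document scores 1 / (k + rank) in every list it appears in, and the scores are summed. The document IDs and the `reciprocal_rank_fusion` helper are hypothetical.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document IDs into a single ranking."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


semantic = ["doc_b", "doc_a", "doc_c"]  # order from an embedding-based search
keyword = ["doc_a", "doc_d", "doc_b"]   # order from a keyword search
fused = reciprocal_rank_fusion([semantic, keyword])
print(fused)
# ['doc_a', 'doc_b', 'doc_d', 'doc_c']
```

Documents that rank well in both lists rise to the top. A re-ranker would then rescore this fused shortlist with a more expensive relevance model.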

Relevance, accuracy, and quality

The retrieval mechanism in RAG is critically important. You need the best semantic search on top of a curated
knowledge base to ensure that the retrieved information is relevant to the input query or context. If your
retrieved information is irrelevant, your generation could be grounded but off-topic or incorrect.

By fine-tuning or prompt-engineering the LLM to generate text entirely based on the retrieved knowledge, RAG
helps to minimize contradictions and inconsistencies in the generated text. This significantly improves the
quality of the generated text, and improves the user experience.

The Vertex Eval Service now scores LLM-generated text and retrieved chunks on metrics like “coherence,”
“fluency,” “groundedness,” “safety,” “instruction_following,” “question_answering_quality,” and more. These
metrics help you measure the grounded text you get from the LLM (for some metrics, that is a comparison to a
ground truth answer you have provided). Implementing these evaluations gives you a baseline measurement,
and you can then optimize for RAG quality by configuring your search engine, curating your source data,
improving source layout parsing or chunking strategies, or refining the user’s question prior to search. A
metrics-driven “RAG Ops” approach like this helps you hill-climb toward high-quality RAG and grounded generation.
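To make the idea of a baseline measurement concrete, here is a deliberately crude stand-in for a groundedness score: the fraction of answer tokens that also appear in the retrieved context. This is not the Vertex Eval Service metric, whose scoring method is not described here. `overlap_groundedness` is a hypothetical helper meant only to show the measure-then-tune loop.

```python
import re


def tokens(text: str) -> list[str]:
    """Lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def overlap_groundedness(answer: str, context: str) -> float:
    """Share of answer tokens that occur somewhere in the retrieved context."""
    answer_tokens = tokens(answer)
    if not answer_tokens:
        return 0.0
    context_vocab = set(tokens(context))
    supported = sum(1 for t in answer_tokens if t in context_vocab)
    return supported / len(answer_tokens)


context = "Vector databases store documents as embeddings."
print(overlap_groundedness("Vector databases store embeddings", context))  # 1.0
print(overlap_groundedness("Vector databases use magic", context))         # 0.5
```

Tracking even a rough score like this across a fixed set of questions shows whether a change to chunking, search configuration, or query rewriting moved quality up or down.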

RAGs, agents, and chatbots

RAG and grounding can be integrated into any LLM application or agent that needs access to fresh, private,
or specialized data. By drawing on external knowledge, RAG-powered chatbots and conversational agents
can provide more comprehensive, informative, and context-aware responses, improving the overall user
experience.

Your data and your use case are what differentiate what you are building with gen AI. RAG and grounding bring
your data to LLMs efficiently and scalably.

What Google Cloud products and services are related to RAG?

The following Google Cloud products are related to Retrieval-Augmented Generation:
Vertex AI Search: Vertex AI Search is Google Search for your data, a fully managed, out-of-the-box
search and RAG builder.

Vertex AI Vector Search: The ultra performant vector index that powers Vertex AI Search; it enables
semantic and hybrid search and retrieval from huge collections of embeddings with high recall at high
query rate.

BigQuery: Large datasets that you can use to train machine learning models, including models for
Vertex AI Vector Search.

Grounded Generation API: Gemini high-fidelity mode grounded with Google Search or inline facts or bring
your own search engine.

AlloyDB: Run models in Vertex AI and access them in your application using familiar SQL queries. Use
Google models, such as Gemini, or your own custom models.

LlamaIndex on Vertex: Build your own search engine for RAG and grounding using Google or open source
components and our fully managed orchestration system based on LlamaIndex.

Further reading
Learn more about using retrieval augmented generation with these resources.

Using Vertex AI to build next-gen search applications | Google Cloud Blog

RAGs powered by Google Search technology
RAG with databases on Google Cloud
Infrastructure for a RAG-capable generative AI application using Vertex AI
APIs to build your own search and Retrieval Augmented Generation (RAG) systems
How to use RAG in BigQuery to bolster LLMs
Code sample and quickstart to get familiar with RAG
