Gen AI Revision
1. In LangChain, which retriever search type is used to balance between relevancy
and diversity?
Maximal Marginal Relevance (MMR). Plain similarity search is incorrect, as it ranks purely by relevance without promoting diversity among the retrieved documents.
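As a rough illustration, an MMR retriever can be requested from any LangChain vector store via search_type="mmr". The FAISS store, fake embeddings, and sample texts below are placeholders (this assumes the faiss and langchain-community packages, and import paths vary by LangChain version):

    from langchain_community.vectorstores import FAISS
    from langchain_community.embeddings import FakeEmbeddings

    # Toy store; real usage would embed your own documents.
    store = FAISS.from_texts(
        ["OCI offers GPU compute.", "OCI offers GPUs.", "LangChain supports retrievers."],
        FakeEmbeddings(size=128),
    )

    # lambda_mult near 1.0 favours relevance; near 0.0 favours diversity.
    retriever = store.as_retriever(
        search_type="mmr",
        search_kwargs={"k": 2, "fetch_k": 3, "lambda_mult": 0.5},
    )
    print(retriever.invoke("What does OCI offer?"))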
2. What does a dedicated RDMA cluster network do during model fine-tuning and inference?
It provides high-bandwidth, low-latency networking between the GPUs in a dedicated AI cluster, allowing fine-tuning and inference workloads to scale across GPUs with minimal communication overhead.
3. Which role does a "model endpoint" serve in the inference workflow of the OCI
Generative AI service?
It serves as a designated point for submitting user requests and receiving the model's responses.
In summary, the model endpoint is the bridge between the trained model and live production usage, enabling seamless and efficient inference in the OCI Generative AI service.
4. What is a distinguishing feature of "Parameter-Efficient Fine-tuning (PEFT)" as opposed
to the classic "Fine-tuning" in Large Language Model training?
PEFT updates only a small number of existing or newly added parameters and uses labelled, task-specific data.
The key distinguishing feature of PEFT is its focus on updating only a small subset of the
model’s parameters or adding a few new ones, thereby achieving task-specific
adaptation with much lower computational and memory overhead compared to the
classic fine-tuning approach, which updates all the parameters of the model. This makes
PEFT particularly advantageous for adapting large pre-trained models to new tasks in a
resource-efficient manner.
5. How does the Retrieval-Augmented Generation (RAG) Token technique differ from RAG
Sequence when generating a model's response?
RAG Token retrieves relevant documents for each part of the response and constructs the answer incrementally.
6. Which component of Retrieval-Augmented Generation (RAG) evaluates and prioritizes
the information retrieved by the retrieval system?
Ranker
7. Which statement describes the difference between "Top k" and "Top p" in selecting
the next token in the OCI Generative AI Generation models?
Top k selects the next token based on its position in the list of most probable tokens, whereas Top p selects based on the cumulative probability of the top tokens.
In summary, Top k sampling limits the choice to a fixed number of top tokens, while
Top p sampling adapts the number of tokens based on their cumulative probability,
offering a balance between ensuring high-probability selections and allowing for greater
diversity.
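A minimal NumPy sketch of the two strategies over a made-up next-token distribution (for intuition only, not the service's actual decoder):

    import numpy as np

    def top_k_filter(probs, k):
        # Keep only the k most probable tokens, then renormalize.
        idx = np.argsort(probs)[::-1][:k]
        kept = np.zeros_like(probs)
        kept[idx] = probs[idx]
        return kept / kept.sum()

    def top_p_filter(probs, p):
        # Keep the smallest set of top tokens whose cumulative probability reaches p.
        order = np.argsort(probs)[::-1]
        cum = np.cumsum(probs[order])
        n_keep = int(np.searchsorted(cum, p)) + 1
        kept = np.zeros_like(probs)
        kept[order[:n_keep]] = probs[order[:n_keep]]
        return kept / kept.sum()

    probs = np.array([0.5, 0.3, 0.1, 0.07, 0.03])  # hypothetical distribution
    print(top_k_filter(probs, k=2))    # always exactly two candidates
    print(top_p_filter(probs, p=0.75)) # two candidates here: 0.5 + 0.3 >= 0.75

Note how Top k always keeps a fixed count, while Top p keeps however many tokens the cumulative threshold demands.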
8. Which statement is true about the "Top p" parameter of the OCI Generative AI Generation models?
"Top p" limits token selection to the smallest set of most probable tokens whose cumulative probability reaches the threshold p.
9. What is the primary function of the "temperature" parameter in the OCI Generative AI
Generation models?
It controls the randomness of the model's output, thereby affecting its creativity.
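Conceptually, temperature divides the logits before the softmax; a small sketch with made-up logits:

    import numpy as np

    def softmax_with_temperature(logits, temperature):
        # Lower temperature sharpens the distribution (near-greedy);
        # higher temperature flattens it (more random, more "creative").
        scaled = np.asarray(logits) / temperature
        exp = np.exp(scaled - scaled.max())  # subtract max for numerical stability
        return exp / exp.sum()

    logits = [2.0, 1.0, 0.5]  # hypothetical next-token logits
    print(softmax_with_temperature(logits, 0.2))  # peaked
    print(softmax_with_temperature(logits, 1.0))  # baseline
    print(softmax_with_temperature(logits, 2.0))  # flatter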
10. What distinguishes the Cohere Embed v3 model from its predecessor in the OCI Generative AI service?
Improved retrieval performance for Retrieval-Augmented Generation (RAG) systems.
11. What is the purpose of the "stop sequence" parameter in the OCI Generative AI
Generation models?
The stop sequence parameter in OCI Generative AI Generation models is used to define a
specific point at which the model should stop generating text. This ensures control over
the length and relevance of the generated output, enhances the usability of the
generated text in various applications, and helps prevent over-generation, making the
outputs more precise and contextually appropriate.
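The effect can be mimicked with a simple truncation function (in the actual service, decoding halts as soon as the sequence is emitted; this sketch only illustrates the outcome, and also previews question 43 below):

    def apply_stop_sequence(text: str, stop: str) -> str:
        # Cut the generated text at the first occurrence of the stop sequence.
        idx = text.find(stop)
        return text if idx == -1 else text[:idx]

    generated = "The sky is blue. The grass is green. The sun is bright."
    print(apply_stop_sequence(generated, "."))  # -> "The sky is blue"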
12. What does a higher number assigned to a token signify in the "Show Likelihoods"
feature of the language model token generation?
For example, if the token "sunny" has the highest likelihood (0.75), the model predicts "sunny" as the most probable next word in the sequence, while a lower-likelihood token such as "stormy" (0.02) is considered far less probable in that context.
Summary
In the "Show Likelihoods" feature, a higher number assigned to a token signifies that the
token has a higher probability of being selected as the next token in the sequence. This
reflects the model's higher confidence that the token is the most appropriate and
contextually relevant choice given the preceding text.
13. Which statement is true about string prompt templates and their capability regarding variables?
A prompt template supports any number of variables, including the possibility of having none.
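For instance, a LangChain template can declare two variables or none at all (the import path may differ across versions):

    from langchain_core.prompts import PromptTemplate

    # Two variables...
    two_vars = PromptTemplate.from_template("Translate the following {language} text: {text}")
    print(two_vars.format(language="French", text="Bonjour"))

    # ...or none: a fixed string is still a valid template.
    no_vars = PromptTemplate.from_template("Tell me a joke.")
    print(no_vars.format())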
14. Which is NOT a built-in memory type in LangChain?
ConversationImageMemory
16. "Given a block of code: qa = Conversational Retrieval Chain. from 11m (11m,
retriever-retv, memory-memory) when does a chain typically interact with memory
during execution?"
After user input but before chain execution, and again after core logic but before output
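An expanded, runnable version of that snippet, using fake stand-ins for the LLM and retriever so no credentials are needed (exact class locations depend on the LangChain version, and ConversationalRetrievalChain is deprecated in newer releases):

    from langchain.chains import ConversationalRetrievalChain
    from langchain.memory import ConversationBufferMemory
    from langchain_community.llms import FakeListLLM
    from langchain_community.vectorstores import FAISS
    from langchain_community.embeddings import FakeEmbeddings

    llm = FakeListLLM(responses=["MMR balances relevance and diversity."])
    retv = FAISS.from_texts(
        ["MMR reranks retrieved results to add diversity."], FakeEmbeddings(size=64)
    ).as_retriever()

    # The chain reads chat history from memory after the user's input arrives
    # (to assemble the prompt) and writes the new turn back after the core
    # logic runs, before returning the output.
    memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)
    qa = ConversationalRetrievalChain.from_llm(llm, retriever=retv, memory=memory)
    print(qa.invoke({"question": "What is MMR?"})["answer"])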
17. Which is NOT a category of pre-trained foundational models available in the OCI Generative AI service?
Translation models
Generation, summarization, and embedding models are the pre-trained foundational model categories available in the OCI Generative AI service.
18. How are fine-tuned customer models stored to enable strong data privacy and security in the OCI Generative AI service?
They are stored in OCI Object Storage and encrypted by default.
19. Why is normalization of vectors important before indexing in a hybrid search system?
It standardizes vector lengths for meaningful comparison using metrics such as Cosine Similarity.
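A quick NumPy check of why this matters: once vectors are unit length, the plain dot product is the cosine similarity:

    import numpy as np

    def normalize(v):
        # Scale to unit length so dot product equals cosine similarity.
        return v / np.linalg.norm(v)

    a = np.array([3.0, 4.0])   # length 5
    b = np.array([0.3, 0.4])   # same direction, length 0.5
    print(np.dot(a, b))                        # 2.5 -- dominated by magnitude
    print(np.dot(normalize(a), normalize(b)))  # 1.0 -- pure orientation match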
20. How does the architecture of dedicated AI clusters contribute to minimizing GPU memory overhead for T-Few fine-tuned model inference?
By sharing base model weights across multiple fine-tuned models on the same group of
GPUs
21. "You create a fine-tuning dedicated AI cluster to customize a foundational model with
your custom training data. How many unit hours are required for fine-tuning if the cluster
is active for 10 hours?"
20 unit hours. A fine-tuning dedicated AI cluster requires 2 units, so 2 units × 10 hours = 20 unit hours.
22. Which Oracle Accelerated Data Science (ADS) class can be used to deploy a Large
Language Model (LLM) application to OCI Data Science model deployment?
ChainDeployment. The ChainDeployment class in ADS is designed to deploy LangChain applications to OCI Data Science Model Deployment; GenerativeAI is an LLM wrapper, not a deployment class.
23. "Given the following prompts used with a Large Language Model, classify each as
employing the Chain-of- Thought, Least-to-most, or Step-Back prompting technique. 1.
Calculate the total number of wheels needed for 3 cars. Cars have 4 wheels each. Then,
use the total number of wheels to determine how many sets of wheels we can buy with
$200 if one set (4 wheels) costs $50. 2. Solve a complex math problem by first
identifying the formula needed, and then solve a simpler version of the problem before
tackling the full question. 3. To understand the impact of greenhouse gases on climate
change, let's start by defining what greenhouse gases are. Next, we'll explore how they
trap heat in the Earth's atmosphere."
25. What does "k-shot prompting" refer to when using Large Language Models for task-
specific applications?
Explicitly providing k examples of the intended task in the prompt to guide the model
output
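A hypothetical k=2 example, built as a plain prompt string for a sentiment task:

    examples = [
        ("The food was amazing!", "positive"),
        ("Service was slow and rude.", "negative"),
    ]
    query = "The view from the table was stunning."

    prompt = "Classify the sentiment of each review.\n\n"
    for text, label in examples:              # the k in-context examples
        prompt += f"Review: {text}\nSentiment: {label}\n\n"
    prompt += f"Review: {query}\nSentiment:"  # the model completes this line
    print(prompt)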
26. Which technique involves prompting the Large Language Model (LLM) to emit
intermediate reasoning steps as part of its response?
Chain-of-thought
27. Which is the main characteristic of greedy decoding in the context of language model word prediction?
It picks the most likely word (the token with the highest probability) at each step of decoding.
30. How does the integration of a vector database into Retrieval-Augmented Generation
(RAG)-based Large Language Models (LLMs) fundamentally alter their responses?
It shifts the basis of their responses from pretrained internal knowledge to real-time data
retrieval
31. How do Dot Product and Cosine Distance differ in their application to comparing text
embeddings in natural language processing?
Dot Product measures both the magnitude and the direction of vectors, whereas Cosine Distance focuses on orientation regardless of magnitude.
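A small NumPy demonstration: scaling a vector changes the dot product but leaves the cosine distance untouched:

    import numpy as np

    a = np.array([1.0, 2.0, 3.0])
    b = 10 * a  # same direction, ten times the magnitude

    dot = np.dot(a, b)  # grows with vector length
    cos_sim = dot / (np.linalg.norm(a) * np.linalg.norm(b))
    print(dot)           # 140.0
    print(1 - cos_sim)   # ~0.0 -- orientation identical, magnitude ignored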
32. What is a cost-related benefit of using vector databases with Large Language Models
(LLMs)?
They offer real-time updated knowledge bases and are cheaper than fine-tuned LLMs.
33. An AI development company is working on an advanced AI assistant capable of
handling queries in a seamless manner. Their goal is to create an assistant that can
analyze images provided by users and generate descriptive text, as well as take text
descriptions and produce accurate visual representations. Considering the capabilities,
which type of model would the company likely focus on integrating into their AI
assistant?
A diffusion model that specializes in producing complex outputs.
34. Which statement best describes the role of encoder and decoder models in natural
language processing?
Encoder models convert a sequence of words into a vector representation and decoder
models take that vector representation and generate output sequences based on it.
35. What issue might arise from using small data sets with the Vanilla fine-tuning method
in the OCI Generative AI service?
Overfitting
Fine-tuning on a small dataset may lead to overfitting. The model becomes too
specialized in replicating the training data, resulting in limited variety and poor
generalization to new data. To mitigate this, it’s crucial to strike a balance between
capturing patterns from the training data and maintaining the ability to generate diverse
and novel content
37. When should you use the T-Few fine-tuning method for training a model?
When the data set is small, on the order of a few thousand samples or fewer.
38. Which is a key advantage of using T-Few over Vanilla fine-tuning in the OCI
Generative AI service?
Faster training time and lower cost
39. How does the utilization of T-Few transformer layers contribute to the efficiency of the fine-tuning process?
By restricting weight updates to only a specific group of transformer layers, which reduces the number of parameters that must be trained and stored.
40. "What does ""Loss"" measure in the evaluation of OCI Generative AI fine-tuned
models? The difference between the accuracy of the model at the beginning of training
and the accuracy of the deployed model"
The level of incorrectness in the models predictions, with lower values indicating better
performance.
41. Which is a distinctive feature of GPUs in Dedicated AI Clusters used for generative AI
tasks?
The GPUs allocated for a customer’s generative AI tasks are isolated from other GPUs.
42. What is the purpose of frequency penalties in language model outputs?
To penalize tokens that have already appeared, based on the number of times they have been used.
43. What happens if a period (.) is used as a stop sequence in text generation?
The model stops generating text after it reaches the end of the first sentence, even if the
token limit is much higher.
44. What is the main advantage of using few-shot model prompting to customize a Large
Language Model (LLM)?
It provides examples in the prompt to guide the LLM to better performance with no
training cost.
45. What is the purpose of embeddings in natural language processing?
To create numerical representations of text that capture the meaning and relationships between words or phrases.