
Introduction to Large Language Models

Assignment- 9

Number of questions: 10    Total marks: 10 × 1 = 10


_________________________________________________________________________

QUESTION 1: [1 mark]

Which of the following statements best describes why knowledge graphs (KGs) are
considered more powerful than a traditional relational knowledge base (KB)?

a. KGs require no schema, whereas KBs must have strict schemas.
b. KGs store data only in the form of hypergraphs, eliminating redundancy.
c. KGs allow flexible, graph-based connections and typed edges, enabling richer
relationships and inferences compared to KBs.
d. KGs completely replace the need for textual sources by storing all possible facts.

Correct Answer: c

Explanation:

• Traditional relational knowledge bases enforce strict schemas (rows, columns,
tables). In contrast, KGs store entities as nodes with typed edges (relations) between
them, allowing richer and more flexible relationships.

• KGs can represent complex connections (e.g., multi-edges, different relation types)
and support inference by traversing these connections.

• While some knowledge graphs can be partially schema-less or schema-flexible,
choice (c) specifically highlights the flexibility in graph connections and typed edges,
which is what makes KGs generally more powerful for representing rich, interlinked
data.
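To make the typed-edge idea concrete, here is a small illustrative sketch (not part of the assignment) of a KG stored as (subject, relation, object) triples, with a simple traversal over typed edges; the entity and relation names are invented for the example.

```python
# Toy knowledge graph: entities are nodes, relations are typed edges.
# All entity and relation names are hypothetical and only illustrate the idea
# of typed, traversable connections.
triples = [
    ("Marie_Curie", "born_in", "Warsaw"),
    ("Warsaw", "capital_of", "Poland"),
    ("Marie_Curie", "field", "Physics"),
]

def neighbours(entity, relation=None):
    """Objects connected to `entity`, optionally filtered by relation type."""
    return [o for (s, r, o) in triples
            if s == entity and (relation is None or r == relation)]

# A simple two-hop traversal over typed edges (born_in, then capital_of) --
# the kind of flexible inference a rigid row/column schema does not express as naturally.
for city in neighbours("Marie_Curie", "born_in"):
    print(city, "->", neighbours(city, "capital_of"))   # Warsaw -> ['Poland']
```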

_______________________________________________________________________

QUESTION 2: [1 mark]

Entity alignment and relation alignment are crucial when linking KGs in different languages.
Which of the following factors contribute to effective alignment?

a. Aligning relations solely by their lexical similarity, ignoring semantic context
b. Transliteration or language-based string matching for entity labels
c. Ensuring all language aliases are represented identically in each KG
d. Matching neighbours, or connected entities, across different KGs

Correct Answer: b, d

Explanation:
• Transliteration or language-based string matching (b): For multilingual KGs,
matching entity labels across languages (e.g., transliterating names or matching
synonyms) is key to identifying equivalent entities.

• Matching neighbours (d): Alignment goes beyond just matching labels; you also
look at the graph structure: if two entities are neighbours of, or connected to, the same
concept in each KG, that supports the hypothesis that they are the same or related.

• Why not a or c?

(a) Lexical similarity alone (ignoring context) is insufficient, as terms might match
lexically but differ in meaning across languages.

(c) It is often not possible or necessary to have identical labels for all language
aliases if they mean the same thing in context; the important aspect is linking them,
not forcing identical representations.
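As an illustration of combining (b) and (d), the following sketch scores a candidate cross-lingual entity pair by mixing label string similarity with neighbour overlap; the entity names, the 0.5 weighting, and the toy KGs are assumptions made up for the example.

```python
# Hedged sketch: cross-KG entity-alignment score combining
# (b) label string matching and (d) neighbour overlap.
from difflib import SequenceMatcher

def label_similarity(label_a, label_b):
    # String matching on (already transliterated) labels.
    return SequenceMatcher(None, label_a.lower(), label_b.lower()).ratio()

def neighbour_overlap(neigh_a, neigh_b, alignment):
    # Jaccard overlap between neighbours, mapped through already-known alignments.
    mapped = {alignment.get(n, n) for n in neigh_a}
    return len(mapped & neigh_b) / max(1, len(mapped | neigh_b))

def alignment_score(label_a, label_b, neigh_a, neigh_b, alignment, w=0.5):
    return w * label_similarity(label_a, label_b) + \
           (1 - w) * neighbour_overlap(neigh_a, neigh_b, alignment)

# Toy example: "Warszawa" (Polish KG) vs. "Warsaw" (English KG),
# with one neighbour pair already known to be aligned.
known = {"Polska": "Poland"}
print(alignment_score("Warszawa", "Warsaw", {"Polska"}, {"Poland"}, known))
```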

_________________________________________________________________________

QUESTION 3: [1 mark]

In the context of knowledge graph completion (KGC), which statement best describes the
role of the scoring function 𝑓(𝑠, 𝑟, 𝑜)?

a. It determines whether two entities refer to the same real-world concept.
b. It produces a raw confidence score indicating how plausible a triple (𝑠, 𝑟, 𝑜) is.
c. It explicitly encodes only the subject’s embedding, ignoring the relation and object
embeddings.
d. It ensures that every negative triple gets a higher score than any positive triple.

Correct Answer: b

Explanation:

• A raw confidence score for plausibility (b): In KGC, the scoring function evaluates
a triple (𝑠, 𝑟, 𝑜) (subject, relation, object) to determine how likely it is to be true
according to the learned embeddings. A higher (or lower, depending on the
convention) score indicates higher plausibility.

• Why not the others?

(a) That relates to entity resolution, not necessarily the KGC scoring function.

(c) The scoring function typically factors in all three embeddings (subject, relation,
and object).
(d) While in training we often prefer valid triples to have higher scores than negative
ones, the scoring function itself just produces a plausibility value. It doesn’t guarantee
all negative triples score higher or lower unconditionally.
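For concreteness, here is a minimal sketch of one widely used scoring function, DistMult, which combines all three embeddings into a single raw plausibility value. The random embeddings and dimension are placeholders, and the assignment itself does not prescribe this particular model.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 8
# Placeholder embeddings for subject, relation, object (learned during training in practice).
e_s, e_r, e_o = rng.normal(size=(3, dim))

def distmult_score(e_s, e_r, e_o):
    # DistMult: f(s, r, o) = sum_i e_s[i] * e_r[i] * e_o[i];
    # a single raw plausibility value that uses all three embeddings.
    return float(np.sum(e_s * e_r * e_o))

print(distmult_score(e_s, e_r, e_o))
```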

_________________________________________________________________________

QUESTION 4: [1 mark]

One key difference between the differentiable KG approach and the semantic interpretation
approach to KGQA is:

a. Differentiable KG approaches are fully rule-based, while semantic interpretation is
purely neural.
b. Differentiable KG approaches do not require any graph embeddings, relying instead
on explicit logical forms.
c. Semantic interpretation is more transparent or interpretable, whereas differentiable
KG is end-to-end trainable but less interpretable.
d. Both approaches use logical forms; the primary difference is the type of question they
can answer.

Correct Answer: c

Explanation:

• Semantic interpretation is more interpretable, differentiable KG is more end-to-end (c):
o Semantic interpretation methods typically rely on building an explicit logical
form of the question, which is transparent and can be easily explained.
o Differentiable KGQA uses neural embeddings and end-to-end
backpropagation over the graph, making it powerful but less human-interpretable.

• Why not the others?

(a) Differentiable KG approaches are not fully rule-based — in fact, they’re more
neural and less rule-based.

(b) Differentiable approaches definitely do use graph embeddings.

(d) While both can handle complex questions, the key difference highlighted in (c) is
interpretability vs. end-to-end trainability.
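To illustrate the interpretability contrast, here is a hypothetical example of the kind of explicit logical form a semantic interpretation pipeline might produce; the question, predicate names, and dictionary layout are invented for the illustration.

```python
# Hypothetical logical form for "Which city was Marie Curie born in?".
# Every step of such a structure can be read and checked by a human,
# which is what makes the semantic interpretation approach transparent.
logical_form = {
    "select": "?city",
    "where": [("Marie_Curie", "born_in", "?city")],
}
print(logical_form)

# A differentiable KGQA model would instead map the question and the graph to
# dense vectors and score candidate nodes end to end, with no such
# human-readable intermediate structure to inspect.
```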

________________________________________________________________________

QUESTION 5: [1 mark]

Considering the differentiable KG approach, which elements are typically learned jointly
when training an end-to-end KGQA model?
a. The textual question representation (e.g., BERT embeddings)
b. The graph structure encoding (e.g., GCN or transformer-based graph embeddings)
c. Predefined logical forms to ensure interpretability
d. The final answer selection mechanism that identifies which node(s) in the graph
satisfy the question

Correct Answer: a, b, d

Explanation:

• Textual question representation (a): The system learns how to embed the input
question (often with a neural model like BERT).
• Graph structure encoding (b): It also learns how to encode nodes and relations in
the knowledge graph (using a graph neural network, attention, etc.).
• Final answer selection (d): Finally, the model learns how to map from the question
and graph embeddings to the correct node(s) in the KG.
• Why not (c)? Predefined logical forms are more typical of semantic-parsing-based
KGQA, not differentiable KGQA approaches. Differentiable KGQA is usually end-to-
end and does not require manually crafted logical forms.
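A hedged sketch of the three jointly trained pieces is shown below: a question encoder (a), a graph/node encoder (b), and an answer-selection scorer (d). The layer choices (a plain embedding table with mean pooling instead of BERT, a bilinear scorer instead of a GCN-based pipeline) and all sizes are simplifying assumptions, not the assignment's prescribed architecture.

```python
import torch
import torch.nn as nn

class TinyKGQA(nn.Module):
    def __init__(self, vocab_size, num_entities, dim=64):
        super().__init__()
        self.question_emb = nn.Embedding(vocab_size, dim)   # (a) question representation
        self.entity_emb = nn.Embedding(num_entities, dim)   # (b) graph/node encoding
        self.answer_scorer = nn.Bilinear(dim, dim, 1)       # (d) answer selection

    def forward(self, question_token_ids, candidate_entity_ids):
        q = self.question_emb(question_token_ids).mean(dim=0)       # pooled question vector
        cands = self.entity_emb(candidate_entity_ids)                # candidate node vectors
        q_rep = q.unsqueeze(0).repeat(cands.size(0), 1)              # one copy per candidate
        return self.answer_scorer(q_rep, cands).squeeze(-1)         # score each candidate

model = TinyKGQA(vocab_size=100, num_entities=50)
scores = model(torch.tensor([1, 7, 3]), torch.tensor([4, 10, 23]))
# A single loss (e.g., cross-entropy over the candidates) backpropagates through
# all three components, which is what "learned jointly" means here.
print(scores.shape)  # torch.Size([3])
```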

_________________________________________________________________________

QUESTION 6: [1 mark]

Uniform negative sampling can have high variance and may require a large number of
samples. Why is that the case?

a. Because the margin-based loss cannot converge without big mini-batches.
b. Because randomly picking negative entities does not guarantee close or challenging
negatives, causing unstable training estimates.
c. Because negative sampling must ensure every possible negative triple is covered.
d. Because the number of relations in the KG is too large for a small number of samples.

Correct Answer: b

Explanation:

• High variance arises when negatives are not challenging (b): If negative
examples are chosen completely at random, many will be too easy for the model to
distinguish, providing limited learning signal. The model sees fewer “borderline” cases,
so estimates of how well the model can separate real vs. fake facts fluctuate
significantly.

• Why not the others?

(a) Margin-based losses can converge with or without large mini-batches if sampling
is done carefully.
(c) We don’t need to cover every possible negative triple, just enough meaningful
ones for training.

(d) The number of relations in a KG might be large, but that alone doesn’t necessarily
drive the variance issue.
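A minimal sketch of uniform negative sampling by tail corruption is shown below; the entity count, ids, and toy triple are placeholders made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
num_entities = 10_000
positive = (42, 3, 157)  # (subject, relation, object) ids, hypothetical

def corrupt_tail(triple, num_entities, rng):
    # Replace the object with a uniformly random entity (re-draw if it hits the true one).
    s, r, o = triple
    o_neg = int(rng.integers(num_entities))
    while o_neg == o:
        o_neg = int(rng.integers(num_entities))
    return (s, r, o_neg)

negatives = [corrupt_tail(positive, num_entities, rng) for _ in range(5)]
print(negatives)
# Most uniformly drawn objects are "easy" negatives, far from the true one,
# which is why purely uniform sampling gives a noisy (high-variance) training
# signal and often needs many samples.
```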
_________________________________________________________________________

QUESTION 7: [1 mark]

In testing embedding and score quality for KG completion, mean rank and hits@K are typical
metrics. What does hits@K specifically measure in this context?

a. The percentage of queries for which the correct answer appears in the top-K of the
ranked list.
b. The reciprocal of the rank of the correct answer.
c. The probability of the correct answer appearing as the highest scored candidate.
d. The margin of the correct triple score relative to all negative triples.

Correct Answer: a

Explanation:

• Hits@K = The percentage of queries for which the correct entity (or triple) is in
the top-K predictions (a). This means if the correct answer is within the first K
results in the ranking, we call it a “hit.” We then compute how many queries achieve
this, divided by the total.

• Why not b, c, or d?

(b) That is more like Mean Reciprocal Rank (MRR), not hits@K.

(c) If K=1, that might coincide with hits@1, but the metric hits@K is about the top-K in
general, not exclusively the top candidate.

(d) That describes a margin-based idea, not hits@K.
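For concreteness, a small sketch computing mean rank, hits@10, and MRR from a list of per-query ranks is shown below; the ranks are made-up numbers for illustration.

```python
import numpy as np

# rank = position of the correct entity in the model's sorted candidate list, per query.
ranks = np.array([1, 3, 12, 2, 58, 7])   # hypothetical ranks of the correct answers

mean_rank = ranks.mean()
hits_at_10 = np.mean(ranks <= 10)        # fraction of queries with the answer in the top 10
mrr = np.mean(1.0 / ranks)               # mean reciprocal rank, the quantity in option (b)

print(mean_rank, hits_at_10, mrr)
```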

_________________________________________________________________________

QUESTION 8: [1 mark]

In the TransE model, the scoring function for a triple (𝑠, 𝑟, 𝑜) is typically defined as

𝑓(𝑠, 𝑟, 𝑜) = ‖𝑒_𝑠 + 𝑒_𝑟 − 𝑒_𝑜‖

where 𝑒_𝑠, 𝑒_𝑟, 𝑒_𝑜 are the embeddings of the subject, relation, and object, respectively. Which
statement best explains what a low value of 𝑓(𝑠, 𝑟, 𝑜) indicates in this context?

a. That (𝑠, 𝑟, 𝑜) is an invalid triple according to the learned embeddings.
b. That 𝑒_𝑠 and 𝑒_𝑜 must be orthogonal.
c. That the relation embedding 𝑒_𝑟 is zero.
d. That (𝑠, 𝑟, 𝑜) has a high likelihood of being a true fact in the knowledge graph.

Correct Answer: d

Explanation:

• A low distance = a high plausibility (d): In TransE, the model is trained such that
𝑒_𝑠 + 𝑒_𝑟 ≈ 𝑒_𝑜 for a valid triple. If the norm ‖𝑒_𝑠 + 𝑒_𝑟 − 𝑒_𝑜‖ is small, it means the subject,
relation, and object embeddings line up well, indicating that triple is likely true.

• Why not a, b, c?

(a) A high value would correspond to an invalid triple.

(b) Orthogonality is not directly indicated by a small distance in TransE.

(c) Zero relation embedding is not required for plausibility; the relation embedding
can be non-zero and still yield a low distance.
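A minimal numerical sketch of the TransE score is shown below; the embeddings are random placeholders, and the "good" relation vector is constructed by hand so that 𝑒_𝑠 + 𝑒_𝑟 lands exactly on 𝑒_𝑜.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 8
e_s, e_o = rng.normal(size=(2, dim))

def transe_score(e_s, e_r, e_o):
    # f(s, r, o) = ||e_s + e_r - e_o||; a small value means the triple lines up well.
    return float(np.linalg.norm(e_s + e_r - e_o))

e_r_good = e_o - e_s              # makes e_s + e_r land exactly on e_o
e_r_bad = rng.normal(size=dim)    # an unrelated relation vector

print(transe_score(e_s, e_r_good, e_o))   # ~0.0 -> highly plausible triple
print(transe_score(e_s, e_r_bad, e_o))    # larger -> less plausible
```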

_________________________________________________________________________

QUESTION 9: [1 mark]

In RotatE, if a relation 𝑟 is intended to be symmetric, how would that typically manifest in the
complex plane?

a. The relation embedding 𝑒_𝑟 must always equal zero.
b. The angle of 𝑒_𝑟 must be 𝜋/2.
c. The relation embedding 𝑒_𝑟 is its own inverse (i.e., a 180° rotation, which gives the
identity when squared).
d. The magnitude of 𝑒_𝑟 must be greater than 1.

Correct Answer: c

Explanation:

• Relation embedding is its own inverse (c): In RotatE, each relation is modeled as
a rotation in the complex plane. For a relation to be symmetric, applying that relation
twice would yield the original entity, so 𝑟² = 1. A 180° rotation (i.e., 𝜋 radians) is its
own inverse because rotating twice by 180° brings you back to the same orientation.

• Why not a, b, or d?

(a) A zero embedding is not characteristic of symmetry in RotatE.

(b) An angle of 𝜋/2 does not make the embedding its own inverse: applying a 90°
rotation twice gives a 180° rotation, not the identity, so it does not satisfy 𝑟² = 1.

(d) The magnitude constraint (often magnitude = 1 in RotatE) is not specifically about
symmetry.
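The following small sketch, using one complex dimension per embedding, illustrates why a 180° rotation is its own inverse while a 90° rotation is not; the entity value is an arbitrary placeholder.

```python
import cmath

r_symmetric = cmath.exp(1j * cmath.pi)      # 180° rotation: r * r = 1
r_quarter = cmath.exp(1j * cmath.pi / 2)    # 90° rotation: r * r = -1, not its own inverse

e_s = 0.3 + 0.4j                            # toy subject embedding (one complex dimension)
e_o = e_s * r_symmetric                     # apply the relation: (s, r, o) holds

# Applying the same relation to o brings us back to s, so (o, r, s) also holds.
print(abs(e_o * r_symmetric - e_s))              # ~0.0 -> symmetric
print(abs((e_s * r_quarter) * r_quarter - e_s))  # not ~0 -> a 90° rotation is not symmetric
```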

_________________________________________________________________________

QUESTION 10: [1 mark]

Which main advantage do rotation-based models (like RotatE) have over translation-based
ones (like TransE) when it comes to complex multi-relational patterns in a KG?

a. Rotation-based models cannot model any symmetry or inverse patterns, so they are
simpler.
b. Rotation-based models handle a broader set of relation properties (symmetry, anti-
symmetry, inverses, composition) more naturally.
c. Rotation-based models have no hyperparameters to tune, unlike TransE.
d. Rotation-based models are guaranteed to yield perfect link prediction.

Correct Answer: b

Explanation:

• Rotation-based models can capture more complex relational properties (b): By
representing relations as rotations in the complex plane, RotatE naturally supports
symmetric relations (𝑟² = 1), anti-symmetric relations, inverses (rotations in the
opposite direction), and composition (cumulative rotations).

• Why not a, c, or d?

(a) In fact, they can model symmetry, inverses, etc.

(c) They do have hyperparameters (e.g., embedding dimension, learning rate); it is
not hyperparameter-free.

(d) No model is guaranteed to be perfect for all link prediction tasks.
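To illustrate how inverses and composition fall out of the rotation view, here is a small sketch with two hypothetical relations represented as unit complex numbers; the angles are chosen arbitrarily for the example.

```python
import cmath

r1 = cmath.exp(1j * cmath.pi / 6)           # hypothetical relation: 30° rotation
r2 = cmath.exp(1j * cmath.pi / 3)           # hypothetical relation: 60° rotation
r_composed = r1 * r2                        # composing relations = adding angles (90°)

r_inverse = r1.conjugate()                  # inverse relation = rotating the other way

print(abs(r_composed - cmath.exp(1j * cmath.pi / 2)))  # ~0.0: composition holds
print(abs(r1 * r_inverse - 1))                          # ~0.0: the inverse undoes r1
```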

_________________________________________________________________________
