Understanding LLMs Solberg-2025
Understanding LLMs Solberg-2025
*
Staff Editor, Georgetown Law Technology Review, Volume 9;
Editorial Board, Liberty University Law Review, Volume 18; LL.M.
Candidate in Technology Law and Policy, Georgetown University Law
Center (2025); J.D., Liberty University School of Law (2024); A.L.M. in
Extension Studies, Concentration: Data Science, Harvard University
(2021); B.S. Financial Mathematics and Statistics, University of
California, Santa Barbara (2018).
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 257
I. INTRODUCTION
1
What Are Large Language Models (LLMs)?, ELASTIC,
https://www.elastic.co/what-is/large-language-models [perma.cc/N2KR-
WMU4] (last visited Dec. 29, 2024).
2
Id.
3
See A Complete Guide to Natural Language Processing,
DEEPLEARNING.AI, https://www.deeplearning.ai/resources/natural-
language-processing/ [perma.cc/BP5A-P55Z] (last updated Jan. 11,
2023) .
258 Understanding LLMs [Vol. 9
4
Id.
5
Deepak Bhatt, Natural Language Processing: Bridging the Gap
Between Humans and Machines, GLOB. TECH. REV. (July 7, 2024),
https://www.globaltechnologyreview.com/post/natural-language-
processing-bridging-the-gap-between-humans-and-machines
[perma.cc/E7VW-M5P7].
6
Alexander S. Gillis, Ben Lutkevich & Ed Burns, What is Natural
Language Processing (NLP)?, TECHTARGET (last updated Aug. 2024),
https://www.techtarget.com/searchenterpriseai/definition/natural-
language-processing-NLP [perma.cc/KF22-7YYT].
7
Seyed Saeid Masoumzadeh, From Rule-Based Systems to
Transformers: A Journey Through the Evolution of Natural Language
Processing, MEDIUM (June 19, 2023),
https://medium.com/@masoumzadeh/from-rule-based-systems-to-
transformers-a-journey-through-the-evolution-of-natural-language-
9131915e06e1# [perma.cc/67WJ-QXEM].
8
Id.
9
Id.
10
GenAI vs. LLMs vs. NLP: A Complete Guide, SCRIBBLEDATA,
https://www.scribbledata.io/blog/genai-vs-llms-vs-nlp-a-complete-guide/
[perma.cc/SEX3-SXVP] (last visited Dec. 29, 2024).
11
Id.
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 259
A. TRANSFORMER MODELS
12
Id.
13
What is Natural Language Processing (NLP)?, AWS,
https://aws.amazon.com/what-is/nlp/# [perma.cc/5SPG-LXAX] (last
visited Dec. 29, 2024).
14
What Are Large Language Models (LLMs)?, supra note 1.
15
Id.
16
Rick Merritt, What is a Transformer Model?, NVIDIA (Mar. 25,
2022), https://blogs.nvidia.com/blog/what-is-a-transformer-model/
[perma.cc/C4UC-ZTX9].
17
Tyler Au, An Introduction to the Transformer Model: The Brains
Behind Large Language Models, LYRID.IO (May 1, 2024),
https://www.lyrid.io/post/an-introduction-to-the-transformer-model-the-
brains-behind-large-language-models# [perma.cc/B2RF-ZYYB].
18
Id.
19
What Are Large Language Models (LLMs)?, supra note 1.
260 Understanding LLMs [Vol. 9
1. Embedding Layer
20
Au, supra note 17; Pradeep Menon, Introduction to Large
Language Models and the Transformer Architecture, MEDIUM (Mar. 9,
2023), https://rpradeepmenon.medium.com/introduction-to-large-
language-models-and-the-transformer-architecture-534408ed7e61
[perma.cc/TQ32-QRUB].
21
Au, supra note 17; Menon, supra note 20.
22
Au, supra note 17.
23
Josep Ferrer, How Transformers Work: A Detailed Exploration of
Transformer Architecture, DATACAMP (Jan. 9, 2024),
https://www.datacamp.com/tutorial/how-transformers-work
[https://perma.cc/6PW9-3WEN].
24
Id.
25
Id.
26
What Are Large Language Models (LLMs)?, supra note 1.
27
Id.
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 261
2. Feedforward Layer
3. Recurrent Layer
28
What is Embedding Layer: LLMs Explained, CHATGPT GUIDE (last
updated June 12, 2024), https://www.chatgptguide.ai/2024/02/29/what-is-
embedding-layer-llms-explained/ [perma.cc/SA6F-ZV2N].
29
Punyakeerthi BL, Understanding Feed Forward Networks in
Transformers, MEDIUM (Apr. 29, 2024),
https://medium.com/@punya8147_26846/understanding-feed-forward-
networks-in-transformers-77f4c1095c67 [perma.cc/9BQU-25KT].
30
Ian Goodfellow, Yoshua Bengio & Aaron Courville, Chapter 6:
Deep Feedforward Networks in DEEP LEARNING BOOK 164–223 (2016),
https://www.deeplearningbook.org/contents/mlp.html [perma.cc/6TFX-
LYPH]; Sandaruwan Herath, The Feedforward Network (FFN) in The
Transformer Model, MEDIUM (Apr. 19, 2024), https://medium.com/image-
processing-with-python/the-feedforward-network-ffn-in-the-transformer-
model-6bb6e0ff18db [perma.cc/JSR2-X88M].
31
Feedforward Neural Network, GEERKSFORGEEKRS (last updated
June 20, 2024), https://www.geeksforgeeks.org/feedforward-neural-
network/ [perma.cc/5RWF-8CAT].
32
See What Are Large Language Models (LLMs)?, supra note 1; see
also Andrej Karpathy, The Unreasonable Effectiveness of Recurrent
Neural Networks, GITHUB.IO (May 21, 2015),
https://karpathy.github.io/2015/05/21/rnn-effectiveness/ [perma.cc/WJQ9-
XRJ3].
262 Understanding LLMs [Vol. 9
4. Attention Mechanism
33
See Karpathy, supra note 32; see also Large Language Model
(LLM), GROWTHLOOP (last updated Feb. 28, 2024),
https://www.growthloop.com/university/article/llmv [perma.cc/7YGV-
RSVA].
34
See Karpathy, supra note 32; see also Large Language Model
(LLM), supra note 33.
35
See Merritt, supra note 16.
36
See id.
37
See Menon, supra note 20.
38
Catherine Breslin, What’s a Parameter in an LLM?, MEDIUM (Jan.
6, 2024), https://catherinebreslin.medium.com/what-is-a-parameter-
3d4b7736c81d [perma.cc/QH9K-CHTZ]; Sean Michael Kerner, What are
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 263
1. Fine-Tuning
2. Prompt Tuning
47
Id.
48
Id.
49
Id.
50
Id.
51
Prompt Tuning vs. Fine-Tuning, supra note 41.
52
Id.
53
Dimitri Didmanidze, Understanding Prompt Tuning: Enhance Your
Language Models with Precision, DATACAMP (May 19, 2024),
https://www.datacamp.com/tutorial/understanding-prompt-tuning
[perma.cc/CBS4-C8H5].
54
Id.
55
Id.
56
Id.
57
Id.
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 265
or “error.”58 This loss serves as the guiding metric for improving the
soft prompts. Backpropagation, a standard optimization method in
neural networks, is used to update parameters.59 However, in prompt
tuning, only the parameters of the soft prompts are adjusted; the
model’s core weights remain untouched.60 The errors are propagated
backward through the network, and the soft prompts are fine-tuned
to better align the model’s output with the desired outcome.61
The cycle of forward passes, loss evaluation, and
backpropagation is repeated over multiple epochs.62 With each
iteration, the soft prompts adapt further, learning how to shape the
input in a way that minimizes errors and maximizes task-specific
performance.63 Over time, this iterative process allows the model to
become highly specialized for the task at hand while preserving its
general-purpose functionality.
58
Id.
59
Id.
60
Id.
61
Id.
62
Id.
63
Id.
64
Pooja Choudhary, Benefits And Limitations Of LLM, AIThority
(June 18, 2024), https://aithority.com/machine-learning/benefits-and-
limitations-of-llm/ [perma.cc/3JE8-4PNM].
65
Shyam Achuthan, How AI-Powered Language Models are
Transforming the Customer Support Landscape, LINKEDIN (Mar. 19,
2024), https://www.linkedin.com/pulse/how-ai-powered-language-
models-transforming-customer-support-shyam-njbsf [perma.cc/Z9V3-
6QBJ].
266 Understanding LLMs [Vol. 9
B. CHALLENGES
66
Shanthi Kumar V, How AI LLMs Transform Social Media
Interactions, LINKEDIN (Jan. 31, 2024),
https://www.linkedin.com/pulse/how-ai-llms-transform-social-media-
interactions-shanthi-kumar-v--fuoff/ [perma.cc/D4QA-GJK6].
67
The Role of Large Language Models in eCommerce & Retail
Industry in 2024, AMPLEWORK SOFTWARE (Sept. 30, 2024),
https://www.amplework.com/blog/large-language-models-in-ecommerce-
and-retail/ [perma.cc/E9G8-DQYJ].
68
LLMs in Banking, PACIFIC DATA INTEGRATORS (Oct. 1, 2024),
https://www.pacificdataintegrators.com/blogs/llms-in-banking-enhance-
fraud-detection-risk-assessment [perma.cc/PPK3-WU86].
69
How to Use Large Language Models for Marketing, KIRAN VOLETI
(Aug. 17, 2024), https://kiranvoleti.com/how-to-use-large-language-
models-llms-for-marketing [perma.cc/2KMV-RPJ2].
70
Shuroug A. Alowais, Sahar S. Alghamdi, Nada Alsuhebany, Tariq
Alqahtani, Abdulrahman I. Alshaya, Sumaya N. Almohareb, Atheer
Aldairem, Mohammed Alrashed, Khalid Bin Saleh, Hisham A. Badreldin,
Majed S. Al Yami, Shmeylan Al Harbi & Abdulkareem M. Albekairy,
Revolutionizing Healthcare: The Role of Artificial Intelligence in Clinical
Practice, 23 BMD MED. ED., 2023 at 1–15,
https://doi.org/10.1186/s12909-023-04698-z.
71
A Guide to Large Language Models (LLMs) For Enterprises, DAVE
AI (Aug. 2024), https://www.iamdave.ai/blog/a-guide-to-large-language-
modelsllms-for-enterprises/ [perma.cc/389V-DLVR].
No. 1] GEORGETOWN LAW TECHNOLOGY REVIEW 267
V. CONCLUSION
72
5 Biggest Challenges with LLMs and How to Solve Them,
TENEO.AI, https://www.teneo.ai/blog/5-biggest-challenges-with-llms-and-
how-to-solve-them [perma.cc/B35S-5XZW] (last visited Dec. 29, 2024).
73
Id.
74
Id.
75
Id.
76
Id.