
LoRA: Low-Rank Adaptation of

Large Language Models

Umar Jamil
License: Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0):
https://creativecommons.org/licenses/by-nc/4.0/legalcode

Not for commercial use

Umar Jamil - https://github.com/hkproj/pytorch-lora


How do neural networks work?
[Diagram: Input → Hidden Layer 1 → Hidden Layer 2 → Output; the Output is compared against the Target to compute the Loss.]


Fine Tuning
[Diagram: the same network as before: Input → Hidden Layer 1 → Hidden Layer 2 → Output, compared against the Target to compute the Loss.]

Fine-tuning means training a pre-trained network on new data to improve its performance on a specific task. For example, we may take an LLM that was pre-trained on many programming languages and fine-tune it for a new dialect of SQL.
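The full-network training described above can be sketched in a few lines of PyTorch. This is a toy stand-in model for illustration, not code from the pytorch-lora repo; note that every parameter of the "pre-trained" network receives gradients and is updated:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-in for a pre-trained network.
model = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 2))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.MSELoss()

# "New data" for the specific task we are fine-tuning on.
x, target = torch.randn(8, 4), torch.randn(8, 2)

losses = []
for _ in range(20):
    opt.zero_grad()
    loss = loss_fn(model(x), target)  # compare output against target
    loss.backward()                   # gradients flow to ALL parameters
    opt.step()                        # ALL parameters are updated
    losses.append(loss.item())
```

The fact that every weight is trained and every weight must be checkpointed is exactly what the next slide identifies as the problem.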
Problems with fine-tuning

1. We must train the full network, which is computationally expensive for the average user
when dealing with Large Language Models like GPT.
2. Storage requirements for the checkpoints are expensive, as we need to save the entire
model on disk for each checkpoint. If we also save the optimizer state (which we
usually do), the situation gets even worse!
3. If we have multiple fine-tuned models, we need to reload all the weights of the model
every time we want to switch between them, which can be expensive and slow. For
example, we may have one model fine-tuned for helping users write SQL queries and
another for helping users write JavaScript code.


Introducing LoRA
[Diagram: the pre-trained weights W ∈ ℝ^(d×k) of a hidden layer are frozen 🥶; a trainable 🦾 pair of low-rank matrices B ∈ ℝ^(d×r) and A ∈ ℝ^(r×k), with r ≪ min(d, k), is added alongside W. Input → Output, compared against the Target to compute the Loss.]
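The slide above can be sketched as a small PyTorch module. This is a minimal illustration, not the repo's actual implementation; the name `LoRALinear` and the layout (layer computes x·Wᵀ, B initialized to zero so training starts from the pre-trained behavior) are assumptions for this sketch:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen pre-trained linear layer plus a trainable low-rank update B @ A."""

    def __init__(self, d: int, k: int, r: int):
        super().__init__()
        # 🥶 frozen pre-trained weights W ∈ ℝ^(d×k)
        self.W = nn.Parameter(torch.randn(d, k), requires_grad=False)
        # 🦾 trainable low-rank factors: B ∈ ℝ^(d×r), A ∈ ℝ^(r×k), r ≪ min(d, k)
        self.B = nn.Parameter(torch.zeros(d, r))  # zero init: B @ A = 0 at the start
        self.A = nn.Parameter(torch.randn(r, k))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (..., k) → output: (..., d), using W + BA instead of W
        return x @ (self.W + self.B @ self.A).T

layer = LoRALinear(d=8, k=4, r=2)
x = torch.randn(3, 4)
print(layer(x).shape)  # torch.Size([3, 8])
```

Only A and B appear in the gradient computation; W stays untouched, which is where the savings in the next slide come from.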


What are the benefits?

1. Fewer parameters to train and store: if 𝑑 = 1000 and 𝑘 = 5000, then (𝑑 × 𝑘) = 5,000,000; using
𝑟 = 5, we get (𝑑 × 𝑟) + (𝑟 × 𝑘) = 5,000 + 25,000 = 30,000, less than 1% of the
original.
2. Fewer parameters = lower storage requirements.
3. Faster backpropagation, as we do not need to evaluate the gradient for most of the
parameters.
4. We can easily switch between two different fine-tuned models (one for SQL generation
and one for JavaScript code generation) just by swapping the parameters of the A and B
matrices instead of reloading the W matrix.
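The arithmetic in point 1 can be checked directly:

```python
d, k, r = 1000, 5000, 5

full = d * k           # parameters in the full W matrix
lora = d * r + r * k   # parameters in B (d×r) plus A (r×k)

print(full)            # 5000000
print(lora)            # 30000
print(lora / full)     # 0.006, i.e. 0.6% of the original
```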


Why does it work?

It basically means that the W matrix of a pre-trained model contains many parameters that convey
the same information as others (i.e. they can be obtained as a combination of the other weights), so
we can get rid of them without decreasing the performance of the model. Matrices of this kind
are called rank-deficient: they do not have full rank.
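Rank deficiency can be seen concretely with NumPy. This is a toy example (not from the slides): the third row is just the sum of the first two, so it conveys no new information and the matrix does not have full rank:

```python
import numpy as np

# 3×3 matrix whose third row equals row 1 + row 2.
M = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 6.0],
              [5.0, 7.0, 9.0]])

print(np.linalg.matrix_rank(M))  # 2, not the full rank of 3
```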


A brief tutorial on the rank of a matrix…

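As a taste of what such a tutorial covers: for a rank-deficient matrix, the SVD recovers exactly the kind of thin B and A factors that LoRA trains. This is a sketch with made-up sizes, not material from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, r = 10, 8, 2

# Build a matrix W of rank r as a product of two thin matrices.
W = rng.standard_normal((d, r)) @ rng.standard_normal((r, k))

# Decompose it and keep only the top-r singular values.
U, S, Vt = np.linalg.svd(W)
B = U[:, :r] * S[:r]  # d×r
A = Vt[:r, :]         # r×k

# W is recovered exactly from d*r + r*k = 36 numbers instead of d*k = 80.
print(np.allclose(B @ A, W))  # True
```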


Thanks for watching!
Don’t forget to subscribe for
more amazing content on AI
and Machine Learning!

