
Intro to LLM Fine Tuning

Amber Liu
2023/09
Difference Between Pre-training and Supervised Fine-tuning

Stage     | Pretraining                                | Supervised Fine-tuning
Algorithm | language modeling: predict the next token  | same: predict the next token
Dataset   | raw internet text, ~trillions of words     | carefully curated text, ~10-100K (prompt, response) pairs
          | low quality, large quantity                | high quality, low quantity
Resource  | 1000s of GPUs, months of training          | 1-100 GPUs, days of training
          | ex: GPT, LLaMA, PaLM                       | ex: Vicuna-13B

https://build.microsoft.com/en-US/sessions/db3f4859-cd30-4445-a0cd-553c3304f8e2
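Both stages share the same training objective. A minimal PyTorch sketch of that next-token objective (`model` here stands for any causal LM that maps token ids to logits over the vocabulary; the function name is illustrative):

```python
import torch.nn.functional as F

# Minimal sketch of the shared objective: given tokens t_0..t_{n-1},
# predict t_1..t_n and score the predictions with cross-entropy.
def next_token_loss(model, token_ids):        # token_ids: (batch, seq_len)
    logits = model(token_ids[:, :-1])         # (batch, seq_len - 1, vocab)
    targets = token_ids[:, 1:]                # the "next token" at each position
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),  # flatten batch and positions
        targets.reshape(-1),
    )
```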
Pretrained Models are NOT Assistants
• Base model does not answer questions
• It only wants to complete internet documents
• Language models are not aligned with user intent

For example, given the prompt "Write a poem about bread and cheese.", a base model may simply continue the document with more prompt-like lines instead of writing the poem:

Write a poem about someone who died of starvation.
Write a poem about angel food cake.
Write a poem about someone who choked on a ham sandwich.
Write a poem about a hostess who makes the ...
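You can reproduce this behavior with any base checkpoint; the sketch below uses GPT-2 as a small stand-in. The sampled continuation will vary, but it will rarely be an answer to the instruction:

```python
from transformers import pipeline

# GPT-2 as a small stand-in for a base (non-instruction-tuned) model.
generator = pipeline("text-generation", model="gpt2")

out = generator("Write a poem about bread and cheese.\n",
                max_new_tokens=40, do_sample=True)
print(out[0]["generated_text"])
# Typically continues the *document* (often with more prompt-like text)
# rather than answering, since the model was only trained to predict
# the next token of internet text.
```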
When do you want Fine-Tuning?
1. Vanilla fine-tuning
• Gain knowledge for a specific downstream task
2. Prompt engineering
• Precise control over the output
• No computing resources needed
3. Instruction tuning
• Make the LLM adhere to human instructions (a data-format sketch follows below)

[Diagram: the methods plotted along a spectrum from "Gain Knowledge" to "Behavior Change"]
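Instruction tuning trains on (prompt, response) pairs rendered into plain training strings. The exact template varies by project; the Alpaca-style markers and the pair below are both made up for illustration:

```python
# Hypothetical (prompt, response) pair and one common rendering template.
example = {
    "prompt": "Write a poem about bread and cheese.",
    "response": "Golden crust, a wheel of brie, ...",
}

TEMPLATE = "### Instruction:\n{prompt}\n\n### Response:\n{response}"
print(TEMPLATE.format(**example))
```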
When do you want Fine-Tuning?
4. Retrieval Augmented Generation (RAG)
• Feed retrieved documents to the LLM at inference time instead of fine-tuning the knowledge in
5. Parameter-Efficient Fine-Tuning (PEFT)
• Update only a small subset of parameters (defined below)
6. Reinforcement Learning from Human Feedback (RLHF)
• Align with human preference
RLHF
• A supervised fine-tuned model is still not aligned with human preference; RLHF optimizes it further against human preference judgments.

Challenges of full fine-tuning:
1. Memory capacity intensive
2. Computation intensive

Parameter-Efficient Fine-tuning (PEFT): a class of methods that adapt LLMs by updating only a small subset of model parameters.

https://arxiv.org/pdf/2307.10169.pdf
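To make "memory intensive" concrete, a common back-of-envelope for full fine-tuning with Adam in mixed precision is ~16 bytes per parameter (2 for fp16 weights, 2 for gradients, 4 for fp32 master weights, 4+4 for Adam's two fp32 moments), so a 7B-parameter model needs on the order of 112 GB before activations. PEFT sidesteps this because only trainable parameters carry gradients and optimizer state; a small helper (names illustrative) to check that fraction:

```python
import torch.nn as nn

def report_trainable(model: nn.Module) -> None:
    # Only parameters with requires_grad=True receive gradients and
    # optimizer state, so this fraction drives the memory footprint.
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    total = sum(p.numel() for p in model.parameters())
    print(f"trainable: {trainable:,} / {total:,} "
          f"({100 * trainable / total:.2f}%)")
```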
PEFT Taxonomy

(Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning)

Additive: Adapters
• Add additional, learnable layers into the Transformer architecture and train only those (~3% of parameters)
(Parameter-Efficient Transfer Learning for NLP)
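A minimal sketch of the bottleneck adapter from the cited paper (Houlsby et al.): down-project, nonlinearity, up-project, plus a residual connection; the bottleneck size and activation here are illustrative choices:

```python
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: small layers inserted after Transformer
    sublayers; only these weights get trained."""
    def __init__(self, d_model: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(d_model, bottleneck)
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck, d_model)

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))  # residual connection
```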


Selective: BitFit
• Only fine-tune the biases of the network (<1% of parameters)
• Falls behind full fine-tuning when the model size is large
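BitFit is one line of PyTorch in practice; a minimal sketch:

```python
import torch.nn as nn

def apply_bitfit(model: nn.Module) -> None:
    # BitFit: make only the bias terms trainable; freeze everything else.
    for name, param in model.named_parameters():
        param.requires_grad = "bias" in name
```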


Reparametrization-based: LoRA

- Freeze the pretrained weights; only update a pair of low-rank matrices
- Up to 10,000x fewer trainable parameters (on GPT-3 175B, per the LoRA paper)
- ~3x lower GPU memory requirement
- Applies to any linear layer
- No inference overhead (the low-rank update can be merged into the weights)
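A minimal sketch of the reparametrization on a single linear layer, following the LoRA paper's initialization (A random, B zero):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Sketch of LoRA: the pretrained weight W stays frozen; only the
    low-rank pair (A, B) is trained, so the effective weight is
    W + (alpha / r) * B @ A."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False                  # freeze W (and its bias)
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # zero init
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)
```

Because B is zero-initialized, training starts from the pretrained behavior, and after training the product B @ A can be merged into W, which is why there is no inference overhead.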
QLoRA
- Quantize the frozen base model to 4-bit (NF4) and train LoRA adapters on top of it
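One way to set this up with the Hugging Face stack (transformers + bitsandbytes + peft); the checkpoint name and LoRA hyperparameters below are illustrative, not prescribed by the paper:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load the frozen base model with 4-bit NF4 weights (QLoRA's quantization).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",   # placeholder checkpoint; any causal LM works
    quantization_config=bnb_config,
)
model = prepare_model_for_kbit_training(model)

# Train only LoRA adapters on top of the quantized, frozen weights.
lora = LoraConfig(r=16, lora_alpha=32,
                  target_modules=["q_proj", "v_proj"],
                  task_type="CAUSAL_LM")
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```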
Fine-tuning Libraries
1. PyTorch
2. Hugging Face PEFT
3. Lamini
4. OpenAI Fine-tuning API
References
1. LoRA: Low-Rank Adaptation of Large Language Models
2. Prefix Tuning: P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
3. Prompt Tuning: The Power of Scale for Parameter-Efficient Prompt Tuning
4. P-Tuning: GPT Understands, Too
5. Parameter-Efficient Transfer Learning for NLP
6. Challenges and Applications of Large Language Models
7. QLoRA: Efficient Finetuning of Quantized LLMs
8. Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning
Source
https://build.microsoft.com/en-US/sessions/db3f4859-cd30-4445-a0cd-553c3304f8e2
https://web.stanford.edu/class/cs224n/slides/cs224n-2023-lecture11-prompting-rlhf.pdf
https://www.bilibili.com/video/BV1Tu4y1R7H5/?spm_id_from=333.788.recommend_more_video.0&vd_source=39940709d86c95c61be9bec979dfb187
https://www.youtube.com/watch?v=dA-NhCtrrVE
