LLM Fine Tuning
LLM Fine Tuning
Amber Liu
2023/09
Difference Between Pre-training
Stage Pretraining Supervised Fine-tuning
Language modeling
Algorithm
predict the next token
https://build.microsoft.com/en-US/sessions/db3f4859-cd30-4445-a0cd-553c3304f8e2
Pretrained Models are NOT Assistants
• Base model does not answer questions
• lt only wants to complete internet documents
• Language models are not aligned with user intent
Gain Behavior
Knowledge Change
When do you want Fine-Tuning?
4. Retrieval Augmented Generation (RAG) LLM
https://arxiv.org/pdf/2307.10169.pdf
PEFT Taxonomy