Distributed fine-tuning with the Hugging Face Transformers API
This notebook provides an example of how to fine-tune a Hugging Face model using the Transformers API (https://huggingface.co/blog/pytorch-ddp-accelerate-transformers) and the TorchDistributor API (https://github.com/apache/spark/blob/master/python/pyspark/ml/torch/distributor.py). The fine-tuning guidance in this article is based on this Hugging Face blog post (https://huggingface.co/blog/sentiment-analysis-python#a-fine-tuning-model-with-python).
Requirements
Databricks Runtime ML 13.0 and above
(Recommended) GPU instances

Define the number of GPUs to use


In this example you use a cluster with 4 worker nodes. If you are using a different
cluster configuration, update NUM_WORKERS accordingly.
import torch

NUM_WORKERS = 4

def get_gpus_per_worker(_):
    import torch
    return torch.cuda.device_count()

# Query the GPU count on one worker via a trivial Spark job; sc is the
# SparkContext that Databricks notebooks provide.
NUM_GPUS_PER_WORKER = sc.parallelize(range(4), 4).map(get_gpus_per_worker).collect()[0]
USE_GPU = NUM_GPUS_PER_WORKER > 0


Preprocess your data


Initialize the tokenizer and collator for preprocessing the data.
import torch
from transformers import TrainingArguments, Trainer
from transformers import AutoTokenizer, DataCollatorWithPadding
from sklearn.model_selection import train_test_split

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
data_collator = DataCollatorWithPadding(tokenizer=tokenizer)
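
As a quick illustration (this cell is an addition, not from the original notebook), DataCollatorWithPadding pads each batch dynamically to the longest sequence in that batch, rather than to a fixed global length:

# Hypothetical inputs, for illustration only.
example_batch = data_collator([
    tokenizer("a short review"),
    tokenizer("a much longer and more detailed movie review"),
])
print(example_batch["input_ids"].shape)  # both rows padded to the longer sequence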

Import and preprocess the IMDB dataset


One key difference between the Hugging Face blog and this notebook is that this
example uses all of the IMDB data, not just 3,000 data points.


from datasets import load_dataset  # import the Hugging Face Datasets library
import pandas as pd

imdb = load_dataset("imdb")
train = pd.DataFrame(imdb["train"])
test = pd.DataFrame(imdb["test"])

texts = train["text"].tolist()
labels = train["label"].tolist()

# Hold out 20% of the training split for validation.
train_texts, val_texts, train_labels, val_labels = train_test_split(texts, labels, test_size=0.2)

train_encodings = tokenizer(train_texts, truncation=True)
val_encodings = tokenizer(val_texts, truncation=True)

class ImdbDataset(torch.utils.data.Dataset):
    def __init__(self, encodings, labels):
        self.encodings = encodings
        self.labels = labels

    def __getitem__(self, idx):
        item = {key: torch.tensor(val[idx]) for key, val in self.encodings.items()}
        item['labels'] = torch.tensor(self.labels[idx])
        return item

    def __len__(self):
        return len(self.labels)

tokenized_train = ImdbDataset(train_encodings, train_labels)
tokenized_test = ImdbDataset(val_encodings, val_labels)
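
As a quick sanity check (again an addition, not from the original notebook), indexing the dataset yields the dict of tensors that the Trainer consumes:

sample = tokenized_train[0]
print(sample["input_ids"].shape)  # 1-D tensor of token IDs for one review
print(sample["labels"])           # scalar tensor: 0 (negative) or 1 (positive)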

Set up the training function


The TorchDistributor API supports both single-node multi-GPU training and multi-node training.

When you wrap the single-node code in the train() function, Databricks recommends including all of the import statements inside the train() function to avoid library pickling issues. You can return any picklable object from train_model(), but you can't return the Trainer itself, since it isn't picklable without an active process group. Instead, return the best checkpoint path and use it outside the function.


import numpy as np
from datasets import load_metric
from transformers import AutoModelForSequenceClassification

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)

def compute_metrics(eval_pred):
    load_accuracy = load_metric("accuracy")
    load_f1 = load_metric("f1")

    logits, labels = eval_pred
    predictions = np.argmax(logits, axis=-1)
    accuracy = load_accuracy.compute(predictions=predictions, references=labels)["accuracy"]
    f1 = load_f1.compute(predictions=predictions, references=labels)["f1"]
    return {"accuracy": accuracy, "f1": f1}

output_dir = "/dbfs/rithwik-db/imdb/finetuning-sentiment-model-v1"  # Save to DBFS (required)

def train_model():
    from transformers import TrainingArguments, Trainer

    training_args = TrainingArguments(
        output_dir=output_dir,
        learning_rate=2e-5,
        per_device_train_batch_size=16,
        per_device_eval_batch_size=16,
        num_train_epochs=2,
        weight_decay=0.01,
        save_strategy="epoch",
        report_to=[],  # REMOVE MLFLOW INTEGRATION FOR NOW
        push_to_hub=False,  # DO NOT PUSH TO MODEL HUB FOR NOW
        load_best_model_at_end=True,  # RECOMMENDED
        metric_for_best_model="eval_loss",  # RECOMMENDED
        evaluation_strategy="epoch",  # RECOMMENDED
    )

    trainer = Trainer(
        model=model,
        args=training_args,
        train_dataset=tokenized_train,
        eval_dataset=tokenized_test,
        tokenizer=tokenizer,
        data_collator=data_collator,
        compute_metrics=compute_metrics,
    )
    trainer.train()
    return trainer.state.best_model_checkpoint


# Databricks recommends creating a separate local Trainer from the pretrained
# checkpoint instead of reusing the Trainer from distributed training.
def test_model(ckpt_path):
    model = AutoModelForSequenceClassification.from_pretrained(ckpt_path, num_labels=2)
    local_trainer = Trainer(
        model=model,
        eval_dataset=tokenized_test,
        tokenizer=tokenizer,
        data_collator=data_collator,
        compute_metrics=compute_metrics,
    )
    return local_trainer.evaluate()

def test_example(ckpt_path, inputs):
    from transformers import pipeline
    model = AutoModelForSequenceClassification.from_pretrained(ckpt_path, num_labels=2)
    p = pipeline(task="sentiment-analysis", model=model, tokenizer=tokenizer)
    outputs = p(inputs)
    # IMDB label 1 is the positive class, so the pipeline's "LABEL_1" maps to "Positive".
    return ["Positive" if item["label"] == "LABEL_1" else "Negative" for item in outputs]

Run local training


single_node_ckpt_path = train_model()

test_model(single_node_ckpt_path)
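
If the run succeeds, test_model() returns the dictionary from Trainer.evaluate(); given the compute_metrics defined above, expect keys such as eval_loss, eval_accuracy, and eval_f1.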

Run distributed training on a single node with multiple GPUs


The TorchDistributor with local_mode=True runs the train() function directly on the driver node of the Spark cluster.

To configure how many GPUs to use in total for this run, pass num_processes=N to the TorchDistributor, where N is the number of GPUs you want to use on the driver node. Note that you don't need to make any changes to your training code.


from pyspark.ml.torch.distributor import TorchDistributor

NUM_PROCESSES = torch.cuda.device_count()
print(f"We're using {NUM_PROCESSES} GPUs")
single_node_multi_gpu_ckpt_path = TorchDistributor(
    num_processes=NUM_PROCESSES, local_mode=True, use_gpu=USE_GPU
).run(train_model)

test_model(single_node_multi_gpu_ckpt_path)

Run distributed training on multiple nodes


The TorchDistributor with local_mode=False (the default) runs the train() function on the worker nodes of the Spark cluster.

To configure the number of GPUs to use in total for this run, pass num_processes=TOTAL_NUM_GPUS to the TorchDistributor. Note that you don't need to make any changes to your training code.
from pyspark.ml.torch.distributor import TorchDistributor

NUM_PROCESSES = NUM_GPUS_PER_WORKER * NUM_WORKERS


print(f"We're using {NUM_PROCESSES} GPUs")
multi_node_ckpt_path = TorchDistributor(
    num_processes=NUM_PROCESSES, local_mode=False, use_gpu=USE_GPU
).run(train_model)

test_model(multi_node_ckpt_path)

Test the model with the Transformers pipeline API
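
The source page is cut off here, so the final cell isn't shown. A minimal sketch of this step, assuming the test_example() helper defined above and the multi-node checkpoint from the previous section (the example inputs are hypothetical):

sample_inputs = [
    "I love this movie!",              # expected: Positive
    "This was a waste of two hours.",  # expected: Negative
]
test_example(multi_node_ckpt_path, sample_inputs)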

