DVC Cheatsheet
DVC is a version control system for machine learning projects, allowing you to track, share, and
---
1. Getting Started
Initialize DVC:
dvc init
Commit Changes:
git commit -m "Track data with DVC"
Push Data to Remote Storage:
dvc push
Pull Data from Remote Storage:
dvc pull
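The commands above fit together roughly as follows. This is a minimal sketch, assuming Git is installed and a Git identity is configured; the file data/data.csv, the remote name localremote, and the storage path /tmp/dvc-storage are placeholders, not names from this cheatsheet.

```shell
# Create a small placeholder dataset (hypothetical file name).
mkdir -p data
printf 'id,value\n1,10\n2,20\n' > data/data.csv

# The DVC steps run only when both tools are available.
if command -v git >/dev/null 2>&1 && command -v dvc >/dev/null 2>&1; then
  git init -q                      # DVC normally lives inside a Git repository
  dvc init                         # creates .dvc/ and stages its config in Git
  dvc add data/data.csv            # writes data/data.csv.dvc and data/.gitignore
  git add data/data.csv.dvc data/.gitignore
  git commit -q -m "Track dataset with DVC"

  # Placeholder local directory acting as remote storage.
  dvc remote add -d localremote /tmp/dvc-storage
  git add .dvc/config
  git commit -q -m "Configure DVC remote"

  dvc push                         # upload cached data to the remote
  dvc pull                         # fetch data, e.g. in a fresh clone
fi
```

Git stores only the small .dvc pointer file; the data itself travels through dvc push and dvc pull.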
---
2. Tracking Experiments
Run an Experiment:
dvc exp run
Track Parameters:
Specify parameters in a params.yaml file and link them to stages in the pipeline.
Example params.yaml:
learning_rate: 0.01
batch_size: 32
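One way to link these parameters to a stage is a params section in dvc.yaml. The sketch below is illustrative only: the stage name train and the files train.py, data/train.csv, and model.pkl are hypothetical and do not come from this cheatsheet.

```shell
# Write the parameter file shown above.
cat > params.yaml <<'EOF'
learning_rate: 0.01
batch_size: 32
EOF

# Hypothetical stage that depends on both parameters.
# Keys under "params" are read from params.yaml by default.
cat > dvc.yaml <<'EOF'
stages:
  train:
    cmd: python train.py
    deps:
      - train.py
      - data/train.csv
    params:
      - learning_rate
      - batch_size
    outs:
      - model.pkl
EOF

# Compare parameter values against the last commit
# (only inside an initialized DVC repository).
if command -v dvc >/dev/null 2>&1 && [ -d .dvc ]; then
  dvc params diff
fi
```

With this layout, changing learning_rate or batch_size marks the train stage as outdated, so the next dvc repro reruns it.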
---
3. Pipelines
Visualize the Pipeline Graph:
dvc dag
Run the Entire Pipeline:
dvc repro
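As a rough illustration of what dvc dag and dvc repro operate on, the sketch below defines a two-stage pipeline. The stage names and the files prepare.py, train.py, data/raw.csv, data/prepared.csv, and model.pkl are assumptions made for the example; the scripts would have to exist for dvc repro to succeed.

```shell
# Hypothetical two-stage pipeline: the output of "prepare"
# is a dependency of "train", which is how DVC orders the stages.
cat > dvc.yaml <<'EOF'
stages:
  prepare:
    cmd: python prepare.py
    deps:
      - prepare.py
      - data/raw.csv
    outs:
      - data/prepared.csv
  train:
    cmd: python train.py
    deps:
      - train.py
      - data/prepared.csv
    outs:
      - model.pkl
EOF

# Run only inside an initialized DVC repository.
if command -v dvc >/dev/null 2>&1 && [ -d .dvc ]; then
  dvc dag      # print the stage graph (prepare before train)
  dvc repro    # rerun only the stages whose inputs changed
fi
```

Because DVC records hashes of dependencies and outputs in dvc.lock, a second dvc repro with no changes skips both stages.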
---
4. Metrics and Plots
Log Metrics:
{
  "accuracy": 0.95,
  "loss": 0.05
}
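A minimal sketch of how such a metrics file might be written and inspected. The file name metrics.json is an assumption; for DVC to report it, the file would also need to be listed under metrics in dvc.yaml.

```shell
# Write the metrics as a JSON file (hypothetical file name).
cat > metrics.json <<'EOF'
{
  "accuracy": 0.95,
  "loss": 0.05
}
EOF

# Inspect metrics only inside an initialized DVC repository.
if command -v dvc >/dev/null 2>&1 && [ -d .dvc ]; then
  dvc metrics show    # print current metric values
  dvc metrics diff    # compare against the last commit
fi
```

In practice the training script would normally write this file itself at the end of each run.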
Visualize Plots:
dvc plots show
---
5. Versioning Data
Restore Data for the Current Git Revision:
dvc checkout
---
6. Sharing Projects
Push Code and Data:
git push
dvc push
Pull Data After Cloning:
dvc pull
---
7. Useful Commands
Remove Unused Cache Files:
dvc gc --workspace
---
8. Remote Storage
- SSH: ssh://user@server:/path
---
9. Useful Links