huggingface · abidlabs · Jul 29, 2025 · Jul 17, 2025 · Jul 17, 2025 · Jul 17, 2025
diff --git a/_blog.yml b/_blog.yml
@@ -6450,4 +6450,15 @@
   tags:
     - cli
     - huggingface_hub
-    - python
+    - python
+
+- local: trackio
+  title: "Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face"
+  thumbnail: /blog/assets/trackio/thumbnail.gif
+  author: abidlabs
+  date: July 29, 2025
+  tags:
+    - research
+    - gradio
+    - open-source
+
diff --git a/assets/trackio/thumbnail.gif b/assets/trackio/thumbnail.gif
diff --git a/trackio.md b/trackio.md
@@ -0,0 +1,202 @@
+---
+title: "Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face"
+thumbnail: /blog/assets/trackio/thumbnail.gif
+authors:
+- user: abidlabs
+- user: znation
+- user: nouamanetazi
+- user: sasha
+- user: qgallouedec
+---
+
+# Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face
+
+If you have trained your own machine learning model, you know how important it is to be able to track metrics, parameters, and hyperparameters during training and visualize them afterwards to better understand your training run.
+
+Most machine learning researchers use specific experiment tracking libraries to do this. However, these libraries can be paid, require complex setup, or lack the flexibility needed for rapid experimentation and sharing.
+
+## Why We Switched to Trackio
+
+At Hugging Face, our science team has started using [Trackio](https://github.com/gradio-app/trackio) for our research projects, and we've found several key advantages over other tracking solutions:
+
+**Easy Sharing and Embedding**: Trackio makes it incredibly simple to share training progress with colleagues or embed plots directly in blog posts and documentation using iframes. This is especially valuable when you want to showcase specific training curves or metrics without requiring others to set up accounts or navigate complex dashboards.
+
+**Standardization and Transparency**: Metrics like GPU energy usage are important to track and share with the community so we can have a better idea of the energy demands and environmental impacts of model training. Using Trackio, which directly gets information from the `nvidia-smi` command, makes it easy to quantify and compare energy usage and to add it to [model cards](https://huggingface.co/docs/hub/model-cards).
+
+**Data Accessibility**: Unlike some tracking tools that lock your data behind proprietary APIs, Trackio makes it straightforward to extract and analyze the data being recorded. This is crucial for researchers who need to perform custom analysis or integrate training metrics into their research workflows.
+
+**Flexibility for Experimentation**: Trackio's lightweight design allows us to easily experiment with new tracking features during training runs. For instance, we can decide when to move tensors from GPU to CPU when logging tensors while training, which significantly improves training throughput when you need to track model/intermediate states without impacting performance.
+
+## Using Trackio
+
+Ok then, so what is Trackio and how do you use it? Trackio is an [open-source Python library](https://github.com/gradio-app/trackio) that lets you track any metrics and visualize them using a local [Gradio](https://gradio.dev/) dashboard. You can also sync this dashboard to Hugging Face Spaces, which means you can then share the dashboard with other users simply by sharing a URL. Since Spaces can be private or public, this means you can share a dashboard publicly or just within members of your Hugging Face organization.
+
+## Installing
+
+You can install `trackio` using pip:
+
+```bash
+pip install trackio
+```
+
+Or, if you prefer using `uv`:
+
+```bash
+uv pip install trackio
+```
+
+## Usage
+
+`trackio` is designed to be a drop-in replacement for experiment tracking libraries like `wandb`. The API is compatible with `wandb.init`, `wandb.log`, and `wandb.finish`, so you can simply import `trackio` as `wandb` in your code.
+
+```diff
+- import wandb
++ import trackio as wandb
+```
+
+Here is an example:
+
+```python
+import trackio
+import random
+import time
+
+runs = 3
+epochs = 8
+
+def simulate_multiple_runs():
+    for run in range(runs):
+        trackio.init(project="fake-training", config={
+            "epochs": epochs,
+            "learning_rate": 0.001,
+            "batch_size": 64
+        })
+        for epoch in range(epochs):
+            train_loss = random.uniform(0.2, 1.0)
+            train_acc = random.uniform(0.6, 0.95)
+            val_loss = train_loss - random.uniform(0.01, 0.1)
+            val_acc = train_acc + random.uniform(0.01, 0.05)
+            trackio.log({
+                "epoch": epoch,
+                "train_loss": train_loss,
+                "train_accuracy": train_acc,
+                "val_loss": val_loss,
+                "val_accuracy": val_acc
+            })
+            time.sleep(0.2)
+    trackio.finish()
+
+simulate_multiple_runs()
+```
+
+### Visualizing Results
+
+After logging your experiments, you can launch the dashboard to visualize your results. Run the following command in your terminal:
+
+```bash
+trackio show
+```
+
+Or, launch it from Python:
+
+```python
+import trackio
+trackio.show()
+```
+
+You can also specify a project name:
+
+```bash
+trackio show --project "my project"
+```
+
+Or in Python:
+
+```python
+trackio.show(project="my project")
+```
+
+![example trackio dashboard](https://github.com/user-attachments/assets/c004aa61-a86a-4ece-be13-a01b640f216e)
+
+
+### Sharing with 🤗 Spaces
+
+To sync your local dashboard to Hugging Face Spaces, simply pass a `space_id` to `init`:
+
+```python
+trackio.init(project="fake-training", space_id="org_name/space_name")
+```
+
+If you are hosting your dashboard on Spaces, you can simply share the URL or embed it anywhere using an iframe:
+
+```html
+<iframe src="https://org_name-space_name.hf.space/?project=fake-training&metrics=train_loss,train_accuracy&sidebar=hidden" width=600 height=600 frameBorder="0"></iframe>
+```
+
+<iframe src="https://trackio-documentation.hf.space/?project=fake-training&metrics=train_loss,train_accuracy&sidebar=hidden" width=600 height=600 frameBorder="0"></iframe>
+
+
+Since Spaces can be private or public, this means you can share a dashboard publicly or just within members of your Hugging Face organization — all for free!
+
+When you sync your Trackio dashboard to Hugging Face Spaces, the data is logged to an ephemeral Sqlite database on Spaces. Because this database is reset if your Space restarts, Trackio also converts the Sqlite database to a Parquet dataset and backs it up to a Hugging Face Dataset every 5 minutes. This means you can visualize your logged metrics in a Hugging Face dataset at any time easily:
+
+<img width="100%" alt="image" src="https://github.com/user-attachments/assets/5d3e9db3-a7f6-4851-a779-bd4f3d9f73e6" />
+
+Tip: you can set the name of this dataset by passing in a `dataset_id` to `trackio.init()`.
+
+### Integrated with 🤗 Transformers and 🤗 Accelerate
+
+Trackio integrates natively with Hugging Face libraries like `transformers` and `accelerate`, so you can log metrics with minimal setup.
+
+**With `transformers.Trainer`:**
+
+```python
+import numpy as np
+from datasets import Dataset
+from transformers import Trainer, AutoModelForCausalLM, TrainingArguments
+
+# Create a fake dataset
+data = np.random.randint(0, 1000, (8192, 64)).tolist()
+dataset = Dataset.from_dict({"input_ids": data, "labels": data})
+
+# Train a model using the Trainer API
+trainer = Trainer(
+    model=AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-0.6B"),
+    args=TrainingArguments(run_name="fake-training", report_to="trackio"),
+    train_dataset=dataset,
+)
+trainer.train()
+```
+
+**With `accelerate`:**
+
+```python
+from accelerate import Accelerator
+
+accelerator = Accelerator(log_with="trackio")
+accelerator.init_trackers("fake-training")
+
+...  # Prepare the model, dataloader, etc.
+
+for step, batch in enumerate(dataloader):
+    ...  # Your training logic here
+    accelerator.log({"training_loss": loss}, step=step)
+
+accelerator.end_training()
+```
+
+No extra setup needed—just plug it in and start tracking.
+
+## Design Principles
+
+- API compatible with popular experiment tracking libraries, making migration both to and from Trackio seamless.
+- Local-first: logs and dashboards run and persist locally by default, with the option to host on Hugging Face Spaces.
+- Lightweight and extensible: the core codebase is under 1,000 lines of Python, making it easy to understand and modify.
+- Free and open-source: all features, including hosting on Hugging Face, are free.
+- Built on top of 🤗 Datasets and Spaces for robust data handling and visualization.
+
+## Next Steps
+
+Trackio is intentionally lightweight and is currently in beta. Some features found in other tracking tools, such as artifact management, or complex visualizations, are not available yet. If you'd like to have these features, please create issues here: https://github.com/gradio-app/trackio/issues
+
+Given Trackio's lightweight and open-source nature, we'd love to work with the machine learning community to design an experiment tracking product that works for all of us!