ViqusViqus
Navigate
Company
Blog
About Us
Contact
System Status
Enter Viqus Hub

Hugging Face Introduces Mutable Storage Buckets for ML Artifacts

ML Artifacts Storage Buckets Hugging Face Hub S3-like Storage Data Pipelines Xet Deduplication
March 10, 2026
Viqus Verdict Logo Viqus Verdict Logo 8
Streamlining the Machine Learning Lifecycle
Media Hype 7/10
Real Impact 8/10

Article Summary

Hugging Face is introducing Storage Buckets, a fundamentally new approach to object storage specifically tailored for the dynamic nature of modern machine learning workflows. Traditionally, the Hugging Face Hub primarily served as a repository for final, immutable artifacts like trained models and datasets. However, the reality of ML development involves a continuous stream of intermediate files – checkpoints, optimizer states, processed shards, logs, traces, and more – that frequently change and often require version control. Storage Buckets address this gap by providing mutable, S3-like object storage directly accessible from the Hub, allowing developers to seamlessly manage these transient artifacts. Built on Hugging Face’s Xet backend, these Buckets leverage chunk-based storage and deduplication to optimize bandwidth, transfer speeds, and storage efficiency. This is particularly crucial for large-scale training pipelines and distributed workloads. The key benefits include reduced bandwidth consumption, faster transfers, and improved storage utilization. Furthermore, Buckets offer global storage by default, combined with pre-warming capabilities to bring frequently accessed data closer to compute resources, minimizing latency. The launch is supported by a private beta program with key launch partners, and offers programmatic access via the Hub, CLI, Python client, fsspec, and JavaScript client, facilitating integration with existing workflows and tools. The feature builds on the seamless model/dataset workflow offered by the Hub, aiming to simplify the entire ML artifact lifecycle.

Key Points

  • Storage Buckets are a new, mutable object storage solution on the Hugging Face Hub designed for managing intermediate ML artifacts.
  • They are built on Hugging Face’s Xet backend, utilizing chunk-based storage and deduplication to improve efficiency.
  • Buckets offer global storage with pre-warming capabilities for optimal performance in distributed training pipelines.

Why It Matters

The introduction of Storage Buckets represents a significant improvement in the ML ecosystem by addressing a long-standing pain point: the management of constantly evolving artifacts. Previously, developers were forced to manage these intermediate files using disparate solutions, introducing complexity and inefficiencies. By providing a centralized, scalable, and optimized storage layer directly within the Hugging Face Hub, this innovation streamlines the ML workflow, reduces operational overhead, and directly impacts developer productivity. It’s not simply a new feature; it’s a fundamental shift towards a more robust and practical approach to ML development.

You might also be interested in