Blog

Around Saturn Cloud

Technical guides, platform updates, and engineering insights from the team.

The Open Source AI Framework Landscape in 2026: A Map for AI Engineers

A categorized guide to the OSS frameworks AI engineers use in 2026, across agent orchestration, retrieval, serving, training, …

What an LLM Inference Stack Actually Looks Like

The layers of a production LLM inference stack, why each one exists, and which parts you get from open source versus build yourself.

The Complete Guide to GPU Cloud Infrastructure

The architecture, operations, and failure modes of running a GPU cloud in 2026. Written for the people building them.

Speeding Up Kubernetes Image Pulls with Spegel

How we deployed Spegel, a cluster-local OCI registry mirror, to cut cold-start image pull times on our Nebius deployment, and the …

Why GPU Clouds Need a Platform Layer

GPU clouds that sell only compute hours are losing enterprise customers to hyperscalers. Enterprise AI teams don't evaluate GPU clouds …

Should GPU Cloud Operators Build or Buy a Platform Layer?

In this article, we’ll discuss what it actually takes to build a platform layer in-house, what it costs, and where the decision tips …

Using our Claude Code plugin to set up our team's website workspace

Our website is built with Hugo. Some of our contributors aren't developers, and installing the toolchain on a laptop is enough friction …

Designing a Claude Code Plugin for AI Infrastructure

Three design principles we learned building a Claude Code plugin for Saturn Cloud: treat the live API as ground truth, treat the skill …

Building an ML Pipeline on Saturn Cloud with the Claude Code Plugin

How we used the saturn-cloud Claude Code plugin to build a reproducible ML demo end-to-end: ingestion, dataset versioning, feature …