Blog
AI/ML

Running SLURM on Kubernetes with Nebius
Why HPC teams want SLURM semantics even when they have Kubernetes, and how to get both on Nebius AI Cloud.

Validating Multi-Node GPU Clusters with NCCL Tests
How to run NCCL all_reduce benchmarks to verify your GPU cluster's interconnect performance before running production training.

Multi-Node GPU Training Infrastructure on Crusoe with Terraform
Provisioning multi-GPU clusters with InfiniBand and NVLink using the Crusoe Terraform provider for distributed training workloads.

Saturn Cloud on Crusoe: Platform Architecture
How to deploy Saturn Cloud on Crusoe for teams that need H100, H200, and GB200 GPUs without hyperscaler quota constraints.

A Field Guide to Crusoe InfiniBand with Terraform
Practical answers to the questions you'll have when provisioning InfiniBand-connected GPU clusters on Crusoe.

GPU Cloud Comparison: 17 Neoclouds for AI in 2025
A technical comparison of GPU cloud providers beyond AWS, GCP, and Azure, covering pricing, InfiniBand networking, storage options, and …
