← Library
End-to-end Slurm training + vLLM inference demo on Nebius
A community reference project that provisions a 2-node, 16x H100 Soperator (Slurm-on-Kubernetes) cluster with Terraform, runs SFT and LoRA fine-tuning via sbatch, then serves the model with single- and multi-node vLLM — with reported accuracy gains from 2% to 88%.soperator
aicloud
The full write-up lives on the original source — use the link above to read it.