Rafay Systems has achieved NVIDIA AI Cloud-Ready validation, confirming that its platform meets NVIDIA's software standard for operating production-grade AI cloud infrastructure. Rafay is among the first independent software vendors to earn this designation, joining a select group of platforms validated to deliver the API-driven, multi-tenant capabilities that AI factories require to serve neocloud and sovereign workloads at scale.
What the validation means
The NVIDIA AI Cloud-Ready initiative sits alongside the NVIDIA Cloud Partners (NCP) program, part of the NVIDIA Partner Network (NPN). While the NCP program defines the hardware standard for AI cloud infrastructure, the AI Cloud-Ready initiative defines how cloud AI factories should deliver use cases. Together, they form a full-stack recipe for cloud AI factories.
With the Rafay Platform having undergone NVIDIA AI Cloud-Ready validation, Rafay customers instantly meet four key market requirements:
- API-driven infrastructure access: Kubernetes, virtual machines, SLURM, and bare metal delivered as services, accessible programmatically via published APIs and consumable on demand.
- Hard and soft multi-tenancy: Workload isolation at the hardware and platform levels, with quota enforcement and policy governance across teams, applications, and geographies.
- Production AI workload support: Validated compatibility with the NVIDIA accelerated compute stack, including orchestration, networking, and AI platform software, with support for token-metered NVIDIA NIM microservices, NVIDIA NeMo libraries, and AI Blueprints.
- Enterprise-grade operational controls: Lifecycle management, security, compliance, self-service workflows, and real-time monitoring that large buyers require before committing capacity.
Technical integration details
Rafay's AI Cloud-Ready designation reflects a multi-year technical relationship with NVIDIA. The Rafay Platform works in concert with the NVIDIA Infra Controller, which handles rack-scale provisioning of NVIDIA Grace Blackwell systems, while Rafay provides the orchestration, governance, and service-delivery layer above it. Together with NVIDIA Cloud Providers, they form a complete stack from bare metal to AI services.
The Rafay Platform also:
- Complies with the NVIDIA Enterprise AI Factory validated design for Blackwell-based enterprise deployments.
- Natively supports NVIDIA BlueField-3 DPUs and RTX PRO 6000 Blackwell Server Edition for full-stack GPU cloud orchestration, with planned support for future NVIDIA BlueField-4 DPUs.
- Is available via NVIDIA DSX Air for customers looking to validate their deployments from metal to model.
- Published a reference architecture for GPU PaaS with NVIDIA accelerated computing and NVIDIA AI Enterprise software.
Deployment examples
The Rafay Platform powers sovereign and neocloud AI deployments across six continents:
- Yotta, India's premier AI Cloud, is running its Shakti Cloud Platform on NVIDIA and Rafay.
- Cassava Technologies is deploying Africa's first NVIDIA-powered AI factories on the Rafay Platform.
- Firmus Technologies has integrated Rafay's PaaS capabilities into its green-energy-powered Australian AI Cloud.
- TELUS is building a sovereign AI Studio in Canada with Rafay and NVIDIA.
- Additional deployments span the Middle East, Latin America, and Southeast Asia.
Bottom line
For operators evaluating their path to AI cloud readiness, Rafay provides the validated software stack out of the box, without the cost, complexity, or time required to build or assemble a platform in-house. The company's platform delivers a suite of capabilities—from metal to model—that AI factories can monetize quickly, complete with token-based consumption options.