SoftBank has unveiled ‘Infrinia AI Cloud OS,’ a software stack built for AI data centers. It is designed to help operators run Kubernetes and AI workloads at scale without the complexity of managing every layer themselves. The system supports Kubernetes as a Service and Inference as a Service, letting users deploy Large Language Models via APIs and run GPU cloud services more efficiently. The goal is to reduce operational burden and cut costs compared with building custom solutions in-house. Operators can automate everything from BIOS and GPU drivers to networking and storage, and scale clusters dynamically based on workload. The software also handles secure multi-tenancy and automated maintenance, including monitoring and failover. Also Read: TGES Joins CBRE & La Clé du Joie for Data Centers SoftBank will first deploy Infrinia…
Sign in to your account