CPU-only Deployment

AI Services supports CPU-only deployments as an alternative to accelerator-based deployments. This enables users to run AI workloads on standard CPU hardware without requiring specialized accelerators such as IBM Spyre™.

By following the setup instructions, users can configure AI Services and launch the catalog. The catalog UI provides an interface to create, manage, and monitor applications within AI Services. CPU-only deployments are supported only with the Podman runtime. Containers deployed use default security contexts and do not require special SELinux policies or device access.

Use Cases for CPU-Only Deployments
  • Development & Testing: Quick setup for development environments.
  • Small-Scale Deployments: Suitable for low-volume production workloads.
  • Cost-Sensitive Scenarios: Ideal when specialized hardware is not available or not cost-effective.
  • Proof of Concept: Useful for evaluating AI Services before investing in accelerated infrastructure.
  • Edge Deployments: Enables running on standard server hardware without GPU requirements.
  • Fit-for-purpose deployments: Some AI Services run optimally as CPU-only deployment.