Troubleshooting

Guidance to diagnose and resolve common issues

Before reporting an issue, we recommend reconfiguring your environment and verifying the setup. Try running - ai-services bootstrap - in your terminal. If everything is configured correctly, the output should look like this.

~ % ai-services bootstrap
Configuring the LPAR
✔ podman already installed
✔ Podman already configured
⠹ Checking spyre card configurationservicereport 2.2.5

Spyre configuration checks                          PASS

  VFIO Driver configuration                         PASS
  User memlock configuration                        PASS
  sos config                                        PASS     Auto Fixed
  sos package                                       PASS
  VFIO udev rules configuration                     PASS
  User group configuration                          PASS     Auto Fixed
  VFIO device permission                            PASS
  VFIO kernel module loaded                         PASS
  VFIO module dep configuration                     PASS

Memlock limit is set for the sentient group.
Spyre user must be in the sentient group.
To add run below command:
    sudo usermod -aG sentient <user>
    Example:
    sudo usermod -aG sentient abc
    Re-login as <user>.



✔ Spyre cards configuration validated successfully.
LPAR configured successfully
-------------------------------------------------------------------
Re-login to the shell to relect necessary permissions assigned to vfio cards
Validating LPAR
✔ Current user is root
✔ LPAR affinity score is above the threshold: 70
✔ Operating system is RHEL with version 9.6
✔ System is running on IBM Power11 (ppc64le)
✔ System is registered with RHN
✔ IBM Spyre Accelerator is attached to the LPAR
⠹ Validating servicereport ...servicereport 2.2.5

Spyre configuration checks                          PASS

  VFIO Driver configuration                         PASS
  User memlock configuration                        PASS
  sos config                                        PASS
  sos package                                       PASS
  VFIO udev rules configuration                     PASS
  User group configuration                          PASS
  VFIO device permission                            PASS
  VFIO kernel module loaded                         PASS
  VFIO module dep configuration                     PASS


✔ ServiceReport tool has successfully run on the LPAR
All validations passed
LPAR boostrapped successfully