IBM Support

LI83089: POSTGRES PODS RESTARTS FREQUENTLY DUE TO LIVENESS TIMEOUT

Subscribe to this APAR

By subscribing, you receive periodic emails alerting you to the status of the APAR, along with a link to the fix after it becomes available. You can track this item individually or track all items by product.

Notify me when this APAR changes.

Notify me when an APAR for this component changes.

 

APAR status

  • Closed as program error.

Error description

  • Postgres pods restart frequently. The describe of the pod
    complains of timeout error:
    
    Liveness probe failed: command
    "/opt/crunchy/bin/postgres-ha/health/pgha-liveness.sh" timed out
    
    The version of yq delivered with out database vendor - version
    3.3.0 has a known issue which is causing yq to take longer to
    run, and eventually times out.  Fix is to increase the liveness
    timeout of postgres pods from 10 seconds to 30 seconds.
    

Local fix

  • 1) For all the postgres deployments, edit timeoutSeconds from 10
    to 30 for liveness probe
    
    Steps:
    -Find the name of the deployments using
    kubectl get deploy | grep postgres
    - Edit the deployment
    kubectl edit deploy
    -Search for liveness, should look similar to below:
    
    livenessProbe:          exec:
               command:
               -
    /opt/crunchy/bin/postgres-ha/health/pgha-liveness.sh
             failureThreshold: 3
             initialDelaySeconds: 30
             periodSeconds: 15
             successThreshold: 1
             timeoutSeconds: 10
    
    -Edit the timeoutSeconds: 10 to timeoutSeconds: 30
    

Problem summary

  • Postgres pods restart frequently. The describe of the pod
    complains of timeout error:
    
    Liveness probe failed: command
    "/opt/crunchy/bin/postgres-ha/health/pgha-liveness.sh" timed out
    

Problem conclusion

  • Fix is added in APIC release v10.0.5.6. Increased the liveness
    timeout of postgres pods from 10 seconds to 30 seconds.
    

Temporary fix

Comments

APAR Information

  • APAR number

    LI83089

  • Reported component name

    API CONNECT ENT

  • Reported component ID

    5725Z2201

  • Reported release

    A0X

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2023-11-13

  • Closed date

    2024-02-20

  • Last modified date

    2024-02-20

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    API CONNECT ENT

  • Fixed component ID

    5725Z2201

Applicable component levels

[{"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSMNED","label":"IBM API Connect"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"A0X","Line of Business":{"code":"LOB67","label":"IT Automation \u0026 App Modernization"}}]

Document Information

Modified date:
20 February 2024