IBM Support

IV01042: PE 5.2 HANGS WITH MPI PERSISTENT COMMUNICATION

 

APAR status

  • Closed as program error.

Error description

  • The customer upgraded to PE 5.2 and now their
    applications hangs. The problem is that the MPI
    persistent communication has a problem in PE 5.2
    in an internal function which computes the rank to
    task id and does not handle  MPI_PROC_NULL correctly.
    Somehow it gets set to 0, so we have other task
    sitting on a persistent call waiting for task 0
    but task 0 is sitting on a MPI_Barrier call
    where all tasks should participate.
    

Local fix

Problem summary

  • The application calls persistent functions with
    MPI_PROC_NULL. When we upgrade from 5.1.1 to 5.2.1, MPI
    internal function that computes the rank to task id does not
    handle MPI_PROC_NULL correctly, somehow it gets set to 0, so
    we have other task on a persistent call waiting for task 0
    but task 0 is on a MPI_Barrier call where all tasks should
    participate.
    

Problem conclusion

  • When MPI_PROC_NULL is passed to the computation function of
    rank to task id, it should be set to MPCI_PROC_NULL, then
    the communication layer will handle correctly.
    

Temporary fix

Comments

APAR Information

  • APAR number

    IV01042

  • Reported component name

    PARALLEL ENVIR.

  • Reported component ID

    5765PEA00

  • Reported release

    522

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Submitted date

    2011-05-26

  • Closed date

    2011-06-10

  • Last modified date

    2013-05-02

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    PARALLEL ENVIR.

  • Fixed component ID

    5765PEA00

Applicable component levels

  • R522 PSY U859445

       UP13/05/02 I 1000

PTF to Fileset Mapping

[{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SG6MU7","label":"Parallel Environment for AIX"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"522","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}},{"Business Unit":{"code":"BU058","label":"IBM Infrastructure w\/TPS"},"Product":{"code":"SG6MU7","label":"Parallel Environment for AIX"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"522","Edition":"","Line of Business":{"code":"LOB08","label":"Cognitive Systems"}}]

Document Information

Modified date:
21 March 2022