Topic
7 replies Latest Post - ‏2010-05-13T13:28:18Z by SystemAdmin
SystemAdmin
SystemAdmin
2105 Posts
ACCEPTED ANSWER

Pinned topic parallel job fork error when running, who knows the reason ? thanks.

‏2008-09-28T02:59:07Z |
node_node1: Fatal Error: Unable to start ORCHESTRATE process on node node1 (IBM-L3AA3XA): APT_PMPlayer::APT_PMPlayer: fork() failed, Resource unavailable; try again

main_program: The Section Leader on node node1 has terminated unexpectedly.

main_program: Fatal Error: Unable to start ORCHESTRATE job: APT_PMwaitForPlayersToStart failed while waiting for players to confirm startup. This likely indicates a network problem.
Status from APT_PMpoll is 0; node name is node1
Updated on 2010-05-13T13:28:18Z at 2010-05-13T13:28:18Z by SystemAdmin
  • SystemAdmin
    SystemAdmin
    2105 Posts
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2008-10-08T07:08:10Z  in response to SystemAdmin
    Did you find a solution on this?

    Thank you :)
  • SystemAdmin
    SystemAdmin
    2105 Posts
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2009-05-13T09:16:29Z  in response to SystemAdmin
    My job have same problem ? can you help me to fix it please ?
  • SystemAdmin
    SystemAdmin
    2105 Posts
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2009-05-13T09:23:26Z  in response to SystemAdmin
    IBM does not have experts to resolve these issues ? Many requests but no one answered ???
  • IIS
    IIS
    1 Post
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2010-01-12T20:15:43Z  in response to SystemAdmin
    The number of process per user is too low.
    logon as root, go to smit/system environments/ change show characteristics of operating system

    change the "Maximum user PROCESSES allowed per user" to 8912.

    This solved my problem.
  • SystemAdmin
    SystemAdmin
    2105 Posts
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2010-03-20T01:02:29Z  in response to SystemAdmin
    Please check the current values for APT_PM_CONDUCTOR_TIMEOUT and APT_PM_PLAYER_TIMEOUT.If not set ,the default is 60 .Set it some high values like 320 and 120

    Thanks
  • krysno
    krysno
    3 Posts
    ACCEPTED ANSWER

    Re: parallel job fork error when running, who knows the reason ? thanks.

    ‏2010-03-21T15:28:54Z  in response to SystemAdmin
    I have a similar problem but nowhere in their environment have not found these environment variables. Documentation also did not find a description of these environment variables.

    My error message:
    main_program:
    Fatal Error: Unable to start ORCHESTRATE job:
    APT_PMwaitForPlayersToStart failed while waiting for players to confirm startup.
    This likely indicates a network problem.
    Status from APT_PMpoll is 0; node name is node1
    • SystemAdmin
      SystemAdmin
      2105 Posts
      ACCEPTED ANSWER

      Re: parallel job fork error when running, who knows the reason ? thanks.

      ‏2010-05-13T13:28:18Z  in response to krysno
      if you are running DS job ,add these variables into administrator .
      If you are running some orchadmin command from a script,simpy export these variables with values