IBM Support

Job submission failed with different error codes

Troubleshooting


Problem

Jobs were failed with exit code=1 in Platform Process Manager. Error messages were different in two jfd.log files, and randomly happened on different accounts. However, all the flows and jobs were successfully executed by other retries.
Exit code 14, exit value 125, return code 105 and 115 could be found in these jfd.log files.

Symptom

 Jobs were failed to submit at a certain time with a certain account, and successful by other retries on different time with the same account . Although the errors were occurred with different accounts, two main accounts took most of these cases.

We got two jfd.log files generated in different time, which recorded different kinds of error messages.

......
3 JFLSFExternalExecution::runCommand: nb_read_fix() read failed; read=-1
7 JFJobExecutionAgent::checkReturnStatus: exit value: 14
3 JFJobExecutionAgent::checkReturnStatus: Failed to execute command<"......"> Exited with <14>.
7 JFJobExecutionAgent::appendHistory: returning returncode: <105>
......

......
7 JFLSFExecutionAgent::_submitToLSF: user: <......>, command:<"......">
7 JFJobExecutionAgent::_executeCommand: Unable to get JS_SU_COMMAND
7 JFJobExecutionAgent::_executeCommand: su command to execute </bin/su user_account -c /tmp/JS_1YuZJB
7 FLSFExternalExecution::runCommand: Exit value of the command is 125
JFJobExecutionAgent::checkReturnStatus: exit value: 0
7 JFJobExecutionAgent::checkReturnStatus: system command succeeded; exited with <0>.
7 JFJobExecutionAgent::checkReturnStatus: exit value: -1
7 JFJobExecutionAgent::_executeCommand: returncode: <115>
3 JFLSFExecutionAgent::_submitToLSF: Line: 746, _executeCommand() failed: 115.
7 JFJobExecutAgent::executeJobs: requeing 7487450:user_account:CurdRptJob:CurdRptJ
......

There were no obvious errors or warning messages in /var/log/messages, which were related to jfd errors.

We issued "jhist" command to retrieve events for a failed flow submission, and got "resource error", and job submission with JobId=0.

Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch      Submission failed 7487479:oybatch:OY_SOTI:soti
Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch      Exception 7487479:oybatch:OY_SOTI:soti
                                                StartFailed - Resource error
Fri Sep 13 04:31:59 GMT-04:00 2019 oybatch      Finished job 7487479:oybatch:OY_SOTI:soti
                                                JobId=0
                                                State=Exit
                                                Status=1
......

All flows and jobs run successfully at last by retrying on other time.

Document Location

Worldwide


Operating System

Cross Brand:Linux


[{"Business Unit":{"code":"BU048","label":"IBM Software"},"Product":{"code":"SSZSHQ","label":"IBM Spectrum LSF Process Manager"},"Component":"","Platform":[{"code":"PF016","label":"Linux"}],"Version":"All Versions","Edition":"","Line of Business":{"code":"LOB77","label":"Automation Platform"}}]

Log InLog in to view more of this document

This document has the abstract of a technical article that is available to authorized users once you have logged on. Please use Log in button above to access the full document. After log in, if you do not have the right authorization for this document, there will be instructions on what to do next.

Document Information

More support for:
IBM Spectrum LSF Process Manager

Software version:
All Versions

Operating system(s):
Linux

Document number:
1103571

Modified date:
21 November 2019

UID

ibm11103571

Manage My Notification Subscriptions