Troubleshooting
Problem
Jobs failed with exit code 1 in Platform Process Manager. The two jfd.log files that were collected contained different error messages, and the failures occurred at random under different accounts. However, all flows and jobs ran successfully on a later retry.
Exit code 14, exit value 125, and return codes 105 and 115 appear in these jfd.log files.
Symptom
Job submission failed at a particular time under a particular account, then succeeded when retried at a different time under the same account. Although the errors occurred under several accounts, two main accounts were responsible for most of the cases.
Two jfd.log files, generated at different times, recorded different kinds of error messages.
......
3 JFLSFExternalExecution::runCommand: nb_read_fix() read failed; read=-1
7 JFJobExecutionAgent::checkReturnStatus: exit value: 14
3 JFJobExecutionAgent::checkReturnStatus: Failed to execute command<"......"> Exited with <14>.
7 JFJobExecutionAgent::appendHistory: returning returncode: <105>
......
......
7 JFLSFExecutionAgent::_submitToLSF: user: <......>, command:<"......">
7 JFJobExecutionAgent::_executeCommand: Unable to get JS_SU_COMMAND
7 JFJobExecutionAgent::_executeCommand: su command to execute </bin/su user_account -c /tmp/JS_1YuZJB
7 FLSFExternalExecution::runCommand: Exit value of the command is 125
JFJobExecutionAgent::checkReturnStatus: exit value: 0
7 JFJobExecutionAgent::checkReturnStatus: system command succeeded; exited with <0>.
7 JFJobExecutionAgent::checkReturnStatus: exit value: -1
7 JFJobExecutionAgent::_executeCommand: returncode: <115>
3 JFLSFExecutionAgent::_submitToLSF: Line: 746, _executeCommand() failed: 115.
7 JFJobExecutAgent::executeJobs: requeing 7487450:user_account:CurdRptJob:CurdRptJ
......
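When comparing several jfd.log files, it can help to tally the exit values and return codes they contain. The following is a minimal sketch in Python, not part of the product: the function name summarize_codes and both patterns are assumptions derived only from the message shapes in the excerpts above, so other jfd.log messages may need different patterns.

```python
import re
from collections import Counter

# Patterns inferred from the excerpt only (assumption): they cover
# "exit value: N", "Exit value of the command is N" and "returncode: <N>".
EXIT_VALUE = re.compile(r"exit value(?: of the command is)?:?\s*(-?\d+)", re.IGNORECASE)
RETURN_CODE = re.compile(r"returncode:\s*<(-?\d+)>")


def summarize_codes(lines):
    """Count the exit values and return codes seen in jfd.log lines."""
    exit_values = Counter()
    return_codes = Counter()
    for line in lines:
        match = EXIT_VALUE.search(line)
        if match:
            exit_values[int(match.group(1))] += 1
        match = RETURN_CODE.search(line)
        if match:
            return_codes[int(match.group(1))] += 1
    return exit_values, return_codes


# Sample lines copied from the excerpts above.
SAMPLE = [
    "7 JFJobExecutionAgent::checkReturnStatus: exit value: 14",
    "7 JFJobExecutionAgent::appendHistory: returning returncode: <105>",
    "7 FLSFExternalExecution::runCommand: Exit value of the command is 125",
    "7 JFJobExecutionAgent::checkReturnStatus: exit value: -1",
    "7 JFJobExecutionAgent::_executeCommand: returncode: <115>",
]

if __name__ == "__main__":
    exits, returns = summarize_codes(SAMPLE)
    print("exit values:", dict(exits))
    print("return codes:", dict(returns))
```

Because summarize_codes accepts any iterable of lines, an open file object for a jfd.log file can be passed to it directly.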
/var/log/messages contained no obvious error or warning messages related to the jfd errors.
Running the "jhist" command to retrieve the events for a failed flow submission returned "Resource error" and a job submission with JobId=0.
Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch Submission failed 7487479:oybatch:OY_SOTI:soti
Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch Exception 7487479:oybatch:OY_SOTI:soti
StartFailed - Resource error
Fri Sep 13 04:31:59 GMT-04:00 2019 oybatch Finished job 7487479:oybatch:OY_SOTI:soti
JobId=0
State=Exit
Status=1
......
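Because the failures clustered on certain accounts and at certain times, saved jhist output can be tallied per account and per hour. The following Python sketch is illustrative only: the event-line layout is an assumption based on the flattened excerpt above (real jhist output may be laid out differently), and the names failed_submissions and failures_per_account are hypothetical.

```python
import re
from collections import Counter

# Assumed layout of one event line, taken from the excerpt only:
#   <weekday> <month> <day> <hh:mm:ss> <zone> <year> <user> Submission failed <flowId:user:flow:job>
FAILED = re.compile(
    r"^\w{3} \w{3} +\d{1,2} (?P<hour>\d{2}):\d{2}:\d{2} \S+ \d{4} "
    r"(?P<user>\S+) Submission failed (?P<job>\S+)$"
)


def failed_submissions(lines):
    """Return (hour, user, job) for every 'Submission failed' event line."""
    found = []
    for line in lines:
        match = FAILED.match(line.strip())
        if match:
            found.append((int(match.group("hour")), match.group("user"), match.group("job")))
    return found


def failures_per_account(lines):
    """Count failed submissions per submitting account."""
    return Counter(user for _, user, _ in failed_submissions(lines))


# Sample lines copied from the excerpt above.
SAMPLE = [
    "Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch Submission failed 7487479:oybatch:OY_SOTI:soti",
    "Fri Sep 13 04:31:47 GMT-04:00 2019 oybatch Exception 7487479:oybatch:OY_SOTI:soti",
    "StartFailed - Resource error",
    "Fri Sep 13 04:31:59 GMT-04:00 2019 oybatch Finished job 7487479:oybatch:OY_SOTI:soti",
    "JobId=0",
]

if __name__ == "__main__":
    for hour, user, job in failed_submissions(SAMPLE):
        print(hour, user, job)
    print(dict(failures_per_account(SAMPLE)))
```

Grouping on the hour field in the same way would show whether the failed submissions concentrate at particular times of day.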
All flows and jobs eventually ran successfully when retried at a later time.
Document Location
Worldwide
Document Information
More support for:
IBM Spectrum LSF Process Manager
Software version:
All Versions
Operating system(s):
Linux
Document number:
1103571
Modified date:
21 November 2019
UID
ibm11103571