Troubleshooting
Problem
bjobs -l reports PGID. Sometimes, for interactive jobs, PGID has already exited on execution host, but bjobs -l still report it.
Why does bjobs -l report PGID that has already exited?
Symptom
User submit an interactive job. When the job runs for a while, PGID which bjobs -l reports exit on execution host, but bjobs -l still reports it.
$ bjobs -l
Job <4110>, User <usrA>, Project <default>, Status <RUN>, Queue <interactive>,
Interactive pseudo-terminal shell mode, Job Priority <50>,
Command <export DISPLAY=localhost:11.0; xterm>, Share grou
p charged </ usrA >
Wed Jul 4 10:58:51: Submitted from host <hostA>, CWD </tmp>;
Wed Jul 4 10:58:51: Started 1 Task(s) on Host(s) <hostA>, Allocated 1 Slot(s)
on Host(s) <hostA>;
Wed Jul 4 11:09:50: Resource usage collected.
The CPU time used is 2 seconds.
MEM: 0 Mbytes; SWAP: 0 Mbytes; NTHREAD: 2
PGID: 23999; PIDs: 24000 24001 <<<<<<<<<<<<< PGID exit. Two PIDs still run on execution host.
MEMORY USAGE:
MAX MEM: 9 Mbytes; AVG MEM: 7 Mbytes
......
Log InLog in to view more of this document
Was this topic helpful?
Document Information
Modified date:
12 August 2018
UID
ibm10717883