IBM Support

Why is rocks-update out of sync on a couple of compute nodes?

Troubleshooting


Problem

Why is rocks-update out of sync on a couple of compute nodes?

Resolving The Problem

Why is rocks-update out of sync on a couple of compute nodes, and seems unable to bring them up to date?Basicly, rocks-update * will call yum to do the update,
for some reason, when yum do the update, it need to
try all the mount point, by strace the yum process:

 

[root@compute-0-0 ~]# ps -ef|grep yum
root 18732 18562 0 16:39 pts/2 00:00:02 /usr/bin/python /usr/bin/yum -y update
root 20748 20464 0 17:06 pts/7 00:00:00 grep yum
[root@compute-0-0 ~]# /tmp/strace -p 18732
Process 18732 attached - interrupt to quit
--- SIGSTOP (Stopped (signal)) @ 0 (0) ---
--- SIGSTOP (Stopped (signal)) @ 0 (0) ---
--- SIGINT (Interrupt) @ 0 (0) ---
rt_sigreturn(0x3b8fc618a0) = -1 EINTR (Interrupted system call)
--- SIGSTOP (Stopped (signal)) @ 0 (0) ---
--- SIGSTOP (Stopped (signal)) @ 0 (0) ---

And if yum hang, that's because the automount problem,
Need go those machines and kill hanging yum process.
after that, rocks-update works.

* This product includes software developed by the Rocks Cluster Group at the San Diego Supercomputer Center at the University of California, San Diego and its contributors.

[{"Product":{"code":"SSZUCA","label":"IBM Spectrum Cluster Foundation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Component":"--","Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"4.4.0","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}},{"Product":{"code":"SSZUCA","label":"IBM Spectrum Cluster Foundation"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Component":null,"Platform":[{"code":"","label":""}],"Version":"","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
11 September 2018

UID

isg3T1014043