Jiravit
9 Posts

Pinned topic Agent status - Missing software scan - I-series

‏2012-06-25T05:11:55Z |
I have an agent on IBM i (iSeries) whose status is 'Missing software scan'. The Information Center recommends running the tlmagent -p and tlmagent -sw commands as troubleshooting steps. However, these commands do not seem to apply to IBM i. Are there equivalent commands or procedures for IBM i?
Updated on 2012-11-09T16:07:00Z by MaksKowalik
  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-06-25T07:48:55Z  
    Hi,

    The equivalent commands for i5/OS are:
    CALL QITLM/QITLMAGENT PARM('-p')
    CALL QITLM/QITLMAGENT PARM('-s')

    Please note that the option is '-s', not '-sw'.

    However, these commands only tell the agent to run the plugin and the software scan. If the agent did not upload any software scan results, there might be a different problem. Please check whether user QITLM has any jobs in OUTQ state and list them here.

    Best regards,
    Maks Kowalik
    The postings on this site are my own and don't necessarily represent IBM's positions, strategies or opinions.
  • Jiravit
    9 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-06-27T09:35:29Z  
    Hi Maks,

    Here's the message in the OUTQ.
    5770SS1 V7R1M0 100423 Job Log TH1PIDEV 25/06/12 10:25:14 Page 1
    Job name . . . . . . . . . . : SH User . . . . . . : QITLM Number . . . . . . . . . . . : 942147
    Job description . . . . . . : QITLMJOBD Library . . . . . : QITLM
    MSGID TYPE SEV DATE TIME FROM PGM LIBRARY INST TO PGM LIBRARY INST
    CPF1124 Information 00 24/06/12 23:50:29.606346 QWTPIIPP QSYS 04C0 *EXT *N
    Message . . . . : Job 942147/QITLM/SH started on 24/06/12 at 23:50:29 in
    subsystem QSYSWRK in QSYS. Job entered system on 24/06/12 at 23:50:29.
    CPC1224 Completion 50 25/06/12 10:25:14.178944 QWTPITP2 QSYS 0636 *EXT *N
    Message . . . . : Job ended abnormally.
    Cause . . . . . : A SIGKILL signal was received for the job. The action for
    the signal was to terminate the job.
    CPF1164 Completion 00 25/06/12 10:25:14.509959 QWTMCEOJ QSYS 014A *EXT *N
    Message . . . . : Job 942147/QITLM/SH ended on 25/06/12 at 10:25:14; .016
    seconds used; end code 30 .
    Cause . . . . . : Job 942147/QITLM/SH completed on 25/06/12 at 10:25:14
    after it used .016 seconds processing unit time. The job had ending code
    30. The job ended after 1 routing steps with a secondary ending code of 0.
    The job ending codes and their meanings are as follows: 0 - The job
    completed normally. 10 - The job completed normally during controlled ending
    or controlled subsystem ending. 20 - The job exceeded end severity (ENDSEV
    job attribute). 30 - The job ended abnormally. 40 - The job ended before
    becoming active. 50 - The job ended while the job was active. 60 - The
    subsystem ended abnormally while the job was active. 70 - The system ended
    abnormally while the job was active. 80 - The job ended (ENDJOBABN command).
    90 - The job was forced to end after the time limit ended (ENDJOBABN
    command). Recovery . . . : For more information, see the Work management
    topic collection in the Systems management category in the IBM i Information
    Center, http://www.ibm.com/systems/i/infocenter/.
  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-07-09T14:27:22Z  
    Hi,

    This only indicates that the QSH jobs (the agent submits QSH, which then submits the scan jobs) were stopped by SIGKILL.
    The most probable reason for stopping them is that the software scan had been running for a very long time without completing.
    How big is the file system (DB and IFS) of this iSeries, in GB and approximate number of files?
    Also, please paste the content of /etc/cit/cit.ini here.
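
    To see how long a killed job actually ran, the start and end messages (CPF1124 and CPF1164) in a job log can be compared. The following is only a minimal sketch: summarize_job_log is a hypothetical helper, not part of ILMT or IBM i, and it assumes the DD/MM/YY date format and the message wording seen in the job log posted in this thread.

```python
import re
from datetime import datetime

# Hypothetical helper for illustration; it is not part of ILMT or IBM i.
# Assumption: dates in the job log are DD/MM/YY, as in the log posted above.
DATE_FORMAT = "%d/%m/%y %H:%M:%S"
STARTED = re.compile(r"started on (\d\d/\d\d/\d\d) at (\d\d:\d\d:\d\d)")
ENDED = re.compile(r"ended on (\d\d/\d\d/\d\d) at (\d\d:\d\d:\d\d)")
END_CODE = re.compile(r"end code (\d+)")


def summarize_job_log(text):
    """Return elapsed time, end code, and SIGKILL flag for one job log."""
    flat = " ".join(text.split())  # undo the line wrapping of the spool file
    started = STARTED.search(flat)
    ended = ENDED.search(flat)
    code = END_CODE.search(flat)
    if not (started and ended and code):
        raise ValueError("job log lacks a start, end, or end-code message")
    begin = datetime.strptime(" ".join(started.groups()), DATE_FORMAT)
    finish = datetime.strptime(" ".join(ended.groups()), DATE_FORMAT)
    return {
        "elapsed": finish - begin,
        "end_code": int(code.group(1)),
        "sigkill": "SIGKILL" in flat,
    }


# Excerpt of the job log for job 942147/QITLM/SH from this thread.
SAMPLE = """
Message . . . . : Job 942147/QITLM/SH started on 24/06/12 at 23:50:29 in
subsystem QSYSWRK in QSYS. Job entered system on 24/06/12 at 23:50:29.
Cause . . . . . : A SIGKILL signal was received for the job. The action for
the signal was to terminate the job.
Message . . . . : Job 942147/QITLM/SH ended on 25/06/12 at 10:25:14; .016
seconds used; end code 30 .
"""

summary = summarize_job_log(SAMPLE)
print(summary["elapsed"], summary["end_code"], summary["sigkill"])
```

    For job 942147 this reports an elapsed time of 10:34:45 with end code 30, which fits a scan that ran for many hours before being killed.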

    Best regards,
    Maks Kowalik


  • Jiravit
    9 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-07-17T11:07:40Z  
    Hi Maks,

    Thanks for your response. I'm not familiar with the iSeries at all, but here is the information I received from the iSeries admin team. The file system is about 3 TB in size and holds about 184,700 files. The content of /etc/cit/cit.ini is as follows:

    CIT_Version = 2.7.0.1016
    CIT_BuildDate = 11/12/03
    CIT_HomeDirectory = /QSYS.LIB/QTIVCIT.LIB
    CIT_Exploiters = ume:5724-D33:

    Attached is the log from around the time the last software scan ran. It seems to me that the job ran successfully, but the status on the ILMT server still shows 'Missing Software Scan' for this agent. Please advise.

    Thanks!
  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-07-24T15:31:19Z  
    Hi,

    The history log shows that the agent ran no software scan jobs between 15/07/12 09:50:07 and 15/07/12 11:00:00. To troubleshoot this, please do the following:

    1. Stop the agent using
    ENDTCPSVR *ITLMAGENT

    2. Using
    WRKUSRJOB USER(QITLM)
    make sure that no jobs belonging to QITLM remain in the system. If any do, end them or, for the ones in OUTQ, delete their spool files. Deleting the spool files is not required, but it will make further investigation easier.

    3. Delete the agent logs and cache:
    /QIBM/UserData/tivoli/common/COD
    /QIBM/UserData/QITLM/cache/agent.properties
    /QIBM/UserData/QITLM/cache/cache.properties

    4. Reconfigure agent logging by setting trace_size = 16000000
    in the file /QIBM/UserData/QITLM/conf/tlmagent.ini

    5. Start the agent using
    STRTCPSVR *ITLMAGENT
    and wait a few hours. The agent will plug in to the server, download the catalog, and perform all the initial scans. After that, check the QITLM user jobs again and paste the output here, together with the zipped /QIBM/UserData/tivoli/common/COD directory.
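
    The edit in step 4 can also be scripted. The following is only a sketch under assumptions: set_ini_value is a hypothetical helper, and it assumes that tlmagent.ini holds plain "key = value" lines, which should be verified against the actual file before use. The other key in the example is a placeholder, not a real agent setting.

```python
import re


def set_ini_value(text, key, value):
    """Set `key = value` in ini-style text; append the line if key is absent.

    Assumption: settings are stored one per line as `key = value`.
    Lines where the key is commented out are left untouched.
    """
    pattern = re.compile(
        rf"^([ \t]*{re.escape(key)}[ \t]*=[ \t]*).*$", re.MULTILINE
    )
    if pattern.search(text):
        # Keep the original `key = ` prefix and replace only the value.
        return pattern.sub(lambda m: m.group(1) + str(value), text, count=1)
    separator = "" if not text or text.endswith("\n") else "\n"
    return f"{text}{separator}{key} = {value}\n"


# Placeholder content; the real tlmagent.ini has different settings.
example = "some_other_key = 1\ntrace_size = 1000000\n"
print(set_ini_value(example, "trace_size", 16000000))
```

    To apply it, read /QIBM/UserData/QITLM/conf/tlmagent.ini into a string while the agent is stopped, pass it through set_ini_value, and write the result back.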

    Best regards,
    Maks Kowalik

  • Jiravit
    9 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-08-03T09:30:57Z  
    Hi Maks,

    I followed the steps you described, but the problem persists. Attached are the job output and the files from the COD directory.

    Thanks!


  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-08-07T15:26:32Z  
    Hi,

    The logs show that the scan starts but takes longer than the allowed 6 hours. The agent then retries it again and again. A 6-hour scan is not normal (on most systems it should take up to 30 minutes).

    Since the timeout already seems to be set to twice the default, I suggest trying the excludedir command on the ILMT server. The scanned machine may have an unusually large directory (in terms of number of files) that does not need to be scanned.

    After excluding the directory on the server, it is a good idea to stop the agent, make sure none of its jobs are active, delete its cache, and start it again. Otherwise you have to wait hours for the agent to download the new scan configuration and retry the scan.
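
    To find candidate directories for exclusion, the number of files under each top-level directory can be counted. The following is a minimal sketch only: count_files_by_top_dir and largest_dirs are hypothetical helpers, and running Python against the IFS of the scanned machine is an assumption. The result only suggests where to look; it does not call excludedir.

```python
import os
from collections import Counter


def count_files_by_top_dir(root):
    """Count files below each first-level directory of `root`.

    Files directly in `root` are counted under ".". Symbolic links to
    directories are not followed (the os.walk default).
    """
    counts = Counter()
    for dirpath, _dirnames, filenames in os.walk(root):
        relative = os.path.relpath(dirpath, root)
        top = "." if relative == "." else relative.split(os.sep)[0]
        counts[top] += len(filenames)
    return counts


def largest_dirs(root, n=10):
    """Return the `n` first-level directories holding the most files."""
    return count_files_by_top_dir(root).most_common(n)
```

    Calling largest_dirs("/") returns a list of (directory, file count) pairs, largest first. A directory with a very high count that holds no licensed software is a reasonable first candidate to exclude.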

    Best regards,
    Maks Kowalik


  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-08-07T15:29:41Z  
    One more thing came to mind: does this iSeries machine have NFS shares mounted somewhere in its IFS?


  • SystemAdmin
    340 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-09-05T08:07:11Z  
    Hi Maks,

    I have tried this, but without success. Can you give me some advice?

    5761SS1 V6R1M0 080215 Job Log CGKAIPA1 05/09/12 14:58:39 Page 1
    Job name . . . . . . . . . . : SH User . . . . . . : QITLM Number . . . . . . . . . . . : 403836
    Job description . . . . . . : QITLMJOBD Library . . . . . : QITLM
    MSGID TYPE SEV DATE TIME FROM PGM LIBRARY INST TO PGM LIBRARY INST
    CPF1124 Information 00 05/09/12 14:30:19.860610 QWTPIIPP QSYS 04C0 *EXT *N
    Message . . . . : Job 403836/QITLM/SH started on 05/09/12 at 14:30:19 in
    subsystem QSYSWRK in QSYS. Job entered system on 05/09/12 at 14:30:19.
    CPC1224 Completion 50 05/09/12 14:58:39.709651 QWTPITP2 QSYS 061A *EXT *N
    Message . . . . : Job ended abnormally.
    Cause . . . . . : A SIGKILL signal was received for the job. The action for
    the signal was to terminate the job.
    CPF1164 Completion 00 05/09/12 14:58:39.710392 QWTMCEOJ QSYS 0148 *EXT *N
    Message . . . . : Job 403836/QITLM/SH ended on 05/09/12 at 14:58:39; .009
    seconds used; end code 30 .
    Cause . . . . . : Job 403836/QITLM/SH completed on 05/09/12 at 14:58:39
    after it used .009 seconds processing unit time. The job had ending code
    30. The job ended after 1 routing steps with a secondary ending code of 0.
    The job ending codes and their meanings are as follows: 0 - The job
    completed normally. 10 - The job completed normally during controlled ending
    or controlled subsystem ending. 20 - The job exceeded end severity (ENDSEV
    job attribute). 30 - The job ended abnormally. 40 - The job ended before
    becoming active. 50 - The job ended while the job was active. 60 - The
    subsystem ended abnormally while the job was active. 70 - The system ended
    abnormally while the job was active. 80 - The job ended (ENDJOBABN command).
    90 - The job was forced to end after the time limit ended (ENDJOBABN
    command). Recovery . . . : For more information, see the Work management
    topic collection in the Systems management category in the i5/OS Information
    Center, http://www.ibm.com/systems/i/infocenter/.
  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-09-18T08:52:45Z  
    Hi,

    If my guess is right that the scan is being interrupted by the timeout, then your actions (excluding the biggest directories, tuning the timeout) should help. You can try increasing the timeout even further (e.g. to 12 or 24 hours), but in my opinion scans should take that long only on huge systems: a few TB in millions of files.

    If you still have unfinished scans, ask for official support. I can think of many other things that need to be checked, but that requires access to the system.

    Best regards,
    Maks Kowalik


  • Jiravit
    9 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-11-01T10:27:11Z  
    Hi Maks,

    How do I request official support for ILMT? Is there a channel I can go to directly for this?

    Thanks.
  • MaksKowalik
    78 Posts

    Re: Agent status - Missing software scan - I-series

    ‏2012-11-09T16:07:00Z  
    Hi,

    You can raise a PMR at https://www.ibm.com/support/servicerequest

    Best regards,
    Maks Kowalik

