How to get started with troubleshooting
Troubleshooting the issues that are reported in the system is easier when you follow the process step-by-step.
When you experience some issues with the system, go through the following steps to get started with the troubleshooting:
- Check the events that are reported in various nodes of the cluster by using the mmhealth cluster show and mmhealth node show commands.
- Check the user action corresponding to the active events and take the appropriate action. For more information on the events and corresponding user action, see Events.
- If you are facing a deadlock issue, see Managing deadlocks to know how to resolve the issue.
- Check for events that happened before the event you are trying to investigate. They might give you an idea about the root cause of problems. For example, if you see an event nfs_in_grace and node_resumed a minute before you get an idea about the root cause why NFS entered the grace period, it means that the node resumed after a suspend.
- Collect the details of the issues through logs, dumps, and traces. You can use various CLI commands and the For more information, see Collecting details of the issues. GUI page to collect the details of the issues reported in the system.
- Based on the type of issue, browse through the various topics that are listed in the troubleshooting section and try to resolve the issue.
- If you cannot resolve the issue by yourself, contact IBM® Support. For more information on how to contact IBM Support, see Support for troubleshooting.