Question & Answer
Question
Answer
Question: What is the difference between requesting a problem solution (or resolution) and requesting RCA?
RCA, as previously explained, is an assessment after the systems and applications are restored to a working operation, to understand the sequence of events that lead to the issue.
Question: Is RCA provided after business hours and on weekends?
In today's global environment, the following factors have to be considered when providing support:
- The country where the support contract was created.
- The country where the machine is location.
- The countries where the system administrators work.
Question: Is RCA included with standard software maintenance (also known as, support) agreements?
https://www.ibm.com/support/pages/node/796206
https://www.ibm.com/support/pages/node/740855
https://www.ibm.com/support/pages/node/738889
https://www.ibm.com/support/pages/node/739151
Question: Is RCA considered a severity 1 and/or system down situation?
https://www.ibm.com/support/pages/node/739151
Question: What is IBM's SLA (Service-Level Agreement) or Responsive Time Requirement for providing RCA?
Per the IBM Support Guide, there is no SLA or Response Time Requirement when or if RCA can be provided.
- A simple interpretation of application, process, command, or system messages.
- One or more events which occur during the same period and can be re-created.
- One or more events which occur during the same period and cannot be re-created.
- An extremely evasive, low-level event that occurs within millions of other events across multiple devices.
- Release levels of the products and features involved with the solution.
- Complexity of environment (more hardware and software features exponentially increase complexity)
- Frequency of the issue occurring (higher frequency increases opportunity to capture useful data).
- Ability to clients to capture the correct data at the time the original event occurs, not after the event.
- Need to collect, analyze, and correlate data from multiple applications, systems, and devices at the time of (or leading up to) the issue.
- Uniqueness of the workload to the client specific environments.
- Type of issue (for example, crash, hang, error message, unpredictable behavior, timeout, performance).
- Clients following the troubleshooting procedures provided by the IBM Remote Technical Support.
- If the issue occurs with our product, with another vendor product, or custom solution.
Question: What are the common types of RCA?
The common types of RCA are:
- A factual assessment
An assessment based on facts provided by the data provided by clients. This assessment can only be provided when clients provide the complete set of data requested by IBM Remote Technical Support. Only when a complete set of data has been provided, will IBM Remote Technical Support be capable of explaining the sequence of events leading up, during, and after the event occurred. A complete set of data may include a single data file or may require several iterations of data collections, based on the complexity and frequency of the issue. - A hypothetical assessment
A reasonable effort assessment when no data or incomplete data is provided by clients. This assessment will be provided when clients are not able to re-create the reported issue and/or is unable to provide all the requested data. Being a "reasonably effort" assessment, IBM Remote Technical Support will be limited in the time and effort providing the assessment. - No assessment
In some cases, and due to lack of data and information provided by the client, low frequency of occurrences of the issues, and/or the issues are related to events with products or features outside the scope of coverage for the support team products, IBM Remote Technical Support may not be capable of providing either a factual or hypothetical assessment.
Question: Is RCA provided for "End of Service" products?
Question: One or more snap or perfpmr data collections were uploaded. Why can IBM not identify the problem and/or provide RCA?
A factual assessment can only be provided when clients have provided all the required data that captures adequate details leading up to and at the time the issue or problem event occurs.
- Issue was one time event and could not be re-created.
- The data provided did not capture enough historical information to come to either a hypothetical or factual assessment.
- The data was captured at a time when the event did not occur (for example, before or after the event).
- The issue is related to events occurring outside the supported products.
- The issue is specific to the clients\' custom workload or custom solution.
Question: Why does IBM keep asking for more and more data?
Most modern computer environments consist of many connected computing, network, and storage devices, creating extremely complex solutions or infrastructures. This complexity exists whether these devices are provided by one vendor or multiple vendors. Complex issues occur when one or more microsecond events occur within a sequence of million events based on client's unique business workload being processed through these client custom configured and multiple layers of connected devices. For performance reasons, all debug and tracing features can not be enabled across all the connected devices in order to capture any, or all, microsecond events that may trigger a specific issue. There are an infinite number of possible execution paths across all connected devices trying to service unique client workload to implement a complete or comprehensive hooks into all products. Therefore, the default data collection procedures are not likely to capture one of these microsecond events (which likely occurred before the data collection) and simply display a nice message such as "PROBLEM OCCURRED HERE".
- Get an accurate problem description or impact to the business and get a complete and accurate details of any recent changes to the configuration, the environment, the infrastructure, and workload. This information will help the support team to isolate the focus to specific products and/or specific features in the products. Many initial delays are contributed to clients not providing accurate or complete details when the case is first opened.
- Review the initial data collection to understand if the problem may be related to a known defect, to understand the configuration and understand if the configuration could be a contributing factor and review recent error messages. In some situations, the problem can be identified from the initial set of data. In more complex situations, the initial data will provide background information and help to further identify how to implement a targeted data collection. Many delays are contributed to clients not uploading the correct data, uploading more than the requested data, uploading data other than the requested data, and uploading incomplete data, or uploading data after a significant amount of time after the event occurred (for example, capturing data after having rebooted a system).
- For complex issues, performance issues, in-frequent issues, time-out issues, data corruption issues, clients should expect multiple data collections (when the issue does occur) will be required and will be requested by IBM Remote Technical Support. This is required further isolate and drill into specific areas of the features and code associated with the microsecond events leading up to the issue.
Question: Why can IBM not re-create my problem?
Most modern computer environments consist of many connected computing, network, and storage devices, creating extremely complex solutions or infrastructures. This complexity exists whether these devices are provided by one vendor or multiple vendors.
Question: A problem occurred only once and can not be re-created, can IBM provide RCA?
Refer to the Question "What are the common types of RCA?" to understand the types of RCA based on the accuracy and completeness of information and data provided by clients.
Question: What can clients do to help IBM?
Use the following step-by-step instructions to contact IBM to open a case for software with an active and valid support contract.
1. Document (or collect screen captures of) all symptoms, errors, and messages related to your issue.
2. Capture any logs or data relevant to the situation.
3. Contact IBM to open a case:
-For electronic support, visit the IBM Support Community Page to View and Open Cases:
https://www.ibm.com/mysupport
-If you require telephone support, see the web page:
https://www.ibm.com/planetwide/
4. Provide a clear, concise description of the issue.
- For guidance, provide these answers in your description:
- What is your preferred communication method?
- List any error messages (provide full output)
- Describe the unexpected behavior
- List all applications possibly associated with the problem
- List all steps to reproduce the problem if possible
- When did the problem begin?
- Were any changes applied to the product, or system, which seemed to introduce this problem?
5. If the system is accessible, collect a system snap, and upload all the details and data for your case.
- For guidance, select the Mustgather page for your product:
Question: An issue occurred is a result of a known or new defect, can we get RCA as to why it happened now or just to specific machines?
If a known or new defect has been identified as the underlying cause of the reported issue, technically IBM has provided the RCA, which is a code defect contributing to the events that occurred. Not all defects all systems and applications, even when similar configurations are used. Issues that occur are due to a combination of the overall custom infrastructure, the custom configuration of the the devices in the infrastructure, and the client unique business transactions or workload flowing across these multiple devices. Although configurations and workloads may appear similar, there is uniqueness in the overall execution of the millions of events that may be occurring that will contribute to defects occurring for some systems and applications, and not others.
Question #2:This is a test 2
here here
Was this topic helpful?
Document Information
Modified date:
19 June 2024
UID
ibm16595181