Enabling and managing trace for a deployed integration server

To aid with problem determination and troubleshooting, you can enable and manage trace on an integration server that is deployed in the App Connect Dashboard. Enabling trace is useful if you cannot get enough information about a particular problem from the entries that are available in the log. Trace provides more details about what is happening while code runs, and can be analyzed to discover the cause of your problem.

Availability: Tracing is available only for integration servers whose spec.version value resolves to 12.0.2.0-r2 or later, and integration runtimes whose spec.version value resolves to 12.0.7.0-r4 or later.

About this task

Two types of trace are available for an integration server or integration runtime:

User trace: Enable user trace to help you debug your integration solutions; you can trace integration servers or integration runtimes and their deployed message flows.
Service trace: Enable service trace only when you receive an error message that instructs you to do so, or when you are directed to do so by IBM® support. When active, service trace performs more comprehensive integration server or integration runtime tracing and tracks the execution of commands.

When you start user or service trace, additional processing occurs for activities in the integration server or integration runtime that you are tracing, so there might be some effect on performance while the trace is active. You can limit this additional processing by restricting how long the trace is active for.

After you deploy an integration, you can start, stop, download, and clear (reset) user or service trace by using a set of menu options. If you want to capture trace on or before startup of an integration, you can use the server.conf.yaml file or an environment variable to start (and subsequently stop) trace. Only one trace can be enabled at a time on an integration server.

Restriction: In a 12.0.10.0-r2 or later Dashboard, the dashboard-admin role is required to start and read trace. In a 12.0.10.0-r1 or earlier Dashboard, the trace menu options are displayed only if you are assigned the operator or admin role for accessing the Dashboard. For more information, see Roles and permissions for the App Connect Dashboard on Red Hat OpenShift.

Enabling user or service trace before deploying an integration
Enabling user or service trace after deploying an integration
Stopping an active user or service trace
Downloading the user or service trace log files
Clearing user or service trace information

Enabling user or service trace before deploying an integration

You can enable user or service trace on startup of an integration server by using the server.conf.yaml file. If you want to capture trace from the earliest possible time, including the period during which the server.conf.yaml file is read, you can instead set an environment variable within the integration server custom resource (CR) before you deploy the integration.

Procedure

To enable trace before you deploy an integration, complete one of the following steps:

If you want to capture trace from the earliest possible time, including the period during which the server.conf.yaml file is read, set the MQSI_FORCE_DEBUG_TRACING environment variable to a non-null value within the CR that you want to use to create the integration server. You can enable only service trace by using the MQSI_FORCE_DEBUG_TRACING environment variable.
Use the spec.env parameter to set the environment variable as follows:
```
spec:
  env:
    - name: MQSI_FORCE_DEBUG_TRACING
      value: '1'
```
While using the App Connect Dashboard to deploy the integration, you can add the spec.env settings by switching to the YAML editor while in the Properties view.

After you create the integration server, its tile on the Servers page of the App Connect Dashboard displays a status of Ready (trace enabled).

To download the trace information that is captured for the deployed integration, you can run a set of commands as described in Downloading the trace log files.
Note:
- When you enable trace by using the MQSI_FORCE_DEBUG_TRACING environment variable, a trace object is not created like it is when you use one of the Start trace menu options on the tile.
- To stop the trace, you must edit the integration server CR to remove the spec.env setting. The (trace enabled) notation is removed from the tile when tracing stops. (You might need to refresh the browser tab or window to see this update.) For information about editing the CR, see Editing the settings for a deployed integration server. You cannot use the Stop trace menu option on the tile to stop the trace.
If you require user or service trace on startup, you can deploy an integration server with a configuration object of type server.conf.yaml, which defines settings to enable trace.
1. Before you create the integration server, use a server.conf.yaml file to create a configuration object of type server.conf.yaml.
  1. Enable trace in the server.conf.yaml file by using the trace or userTrace properties.
    The following example shows the trace and userTrace properties in a server.conf.yaml file in which user trace is enabled with a debug value.
    
    ... #trace: 'none' # choose 1 of : none|debug|debugTree|diagnostic|diagnosticTree #traceSize: '1G' # Set the service trace size #traceNodeLevel: true # Enable or disable message flow 'Trace nodes' userTrace: 'debug' # choose 1 of : none|debug|debugTree userTraceSize: '1G' # Set the user trace size ...
  2. Save the file and then use it to create a configuration object of type server.conf.yaml. For more information, see server.conf.yaml type and Configuration reference.
2. Define the settings for the integration server that you want to create and ensure that you specify the name of the configuration object of type server.conf.yaml. To do so, you need to select the configuration object's checkbox in the Configuration view. Then, complete the steps to deploy the integration server.
  For more information, see Creating an integration server to run your BAR file resources.
3. From the Servers page of the App Connect Dashboard, locate the tile for the integration server.
  
  The status on the tile is displayed as Ready (trace enabled). In the options menu, a Stop trace menu option is also displayed for the type of trace that is enabled, and no Start trace menu option is displayed for the alternative trace type. The following example shows the tile for an integration server for which user trace is enabled on startup, and the trace menu options.
  
  To download the trace information that is captured for the deployed integration, see Downloading the user or service trace log files.
  Note:
  - When you enable trace on startup by using a configuration object of type server.conf.yaml, a trace object is not created like it is when you use one of the Start trace menu options on the tile.
  - To stop the trace, you must update the configuration object of type server.conf.yaml to set trace or userTrace to none, or to comment out the properties. This configuration update triggers a restart of the integration server and the (trace enabled) notation is removed from its tile when you refresh (or reload) the Servers page. For information about updating a configuration object, see Managing configuration objects from the Configuration page. You cannot use the Stop trace menu option on the tile to stop the trace.

Enabling user or service trace after deploying an integration

You enable user or service trace on a running integration server by starting the trace. You can start a trace only if no other trace is currently enabled.

Note: If a pod is restarted or a new replica is added while trace is active, trace is started on that pod within 60 seconds because the trace reconcile loop is scheduled to run every minute. However, there is a possibility that some trace data might be lost during this interval.

Procedure

To start user or service trace, complete the following steps:

From the Servers page of the App Connect Dashboard, locate the tile for the relevant integration server.
The integration server should be in a Ready state.
Open the options menu on the tile. If trace is not yet enabled, you should see the Start service trace and Start user trace options.
Click the Start option for the trace that you want to enable, as shown in the following example.

A notification is displayed indicating that tracing has started (for example, Success: Started user trace), and the Ready status on the integration server tile changes to Ready (trace enabled).

Tip: The Start action automatically creates a trace object with the same name as the integration server. You can view this object from the Red Hat OpenShift web console or CLI, or the CLI for a Kubernetes environment. For more information, see Trace reference.

If you open the options menu again on the tile, you should notice that the Start option for the trace that you enabled (for example, user trace) has been replaced by a Stop option. Also note that the Start option for the alternative trace type (for example, service trace) is no longer visible because only one trace can be enabled per integration server.

If you want to start an alternative trace, you will need to stop the active trace that is currently enabled. For example, if user trace is enabled and you want to switch to service trace, you must stop the user trace and then start the service trace.

Note: If an integration server container is crashing, you should be able to collect trace as normal by setting an environment variable to prevent the container from stopping. For more information, see Collecting trace from an integration server or integration runtime container that is crashing.

Stopping an active user or service trace

After collecting trace information for analysis, you can stop the trace.

Procedure

To stop an active user or service trace, complete the following steps:

From the Servers page of the App Connect Dashboard, locate the tile for the relevant integration server.
Open the options menu on the tile and locate the Stop option for the active trace. For example, if user trace is currently enabled, you should see a Stop user trace option. Similarly, if service trace is enabled, you should see a Stop service trace option.

A notification is displayed indicating that tracing has stopped (for example, Success: Stopped user trace), and the Ready (trace enabled) status on the integration server tile reverts to Ready.

Tip: The Stop action automatically deletes the associated trace object that was created when the trace was started, which means that the object will no longer be visible from the Red Hat OpenShift web console or CLI, or the CLI for a Kubernetes environment. The trace log files are however retained in the system. Also note that if you delete an integration server, any trace object which references that integration server (in spec.integrationServerName) is also automatically deleted. For more information, see Trace reference.

Downloading the user or service trace log files

When you have collected some trace information, you can download the trace to your computer for analysis. You can download trace while it is running or after it is disabled. The download speed will depend on the volume of trace data and the amount of CPU that is allocated to the integration server.

Trace information is written to the trace log files in plain text format within the workdir/config/common/log directory, where workdir is the work directory (/home/aceuser/ace-server) of the integration server.

Procedure

To download the user or service trace log files, complete the following steps:

From the Servers page of the App Connect Dashboard, locate the tile for the relevant integration server.
Open the options menu on the tile and click the relevant Download trace option.

Depending on your browser, the file might automatically be downloaded to a configured download location or you might be prompted to choose a location.
Wait for the download to complete.

By default, the file is named namespaceName-integrationServerName-traceType-identifier.zip; for example, ace-she-is-fd-toolkit-user-1636954324499.zip or ace-she-is-fd-toolkit-service-1636954543441.zip.

Extract the contents of the ZIP file to a preferred location.

Trace type	Extracted contents
User trace	For each replica pod, you obtain a user-trace-files.zip file in a separate directory. This ZIP file contains one or more TXT files to which user trace information is written. The TXT files are named in the format integration_server.`integrationServerName`.userTrace.`n`.txt; for example, integration_server.is-fd-toolkit.userTrace.0.txt. The value of `n` is an integer in the range 0 through 4, enabling trace to be written to five trace log files in a rotational manner. When user trace is started, trace information is written to the file ending in 0.txt. When the 0.txt file becomes full, trace information is then written to the 1.txt file. The process continues until the 4.txt file is full, at which point the existing trace information in 0.txt is overwritten with new information. The process continues in this rotational manner until user trace is stopped.
Service trace	For each replica pod, you obtain the following files in a separate directory: pod-logs.zip: Contains logs from all containers in the pod service-trace-files.zip: Contains the following files: One or more TXT files to which service trace information is written These files are named in the format integration_server.`integrationServerName`.trace.`n`.txt; for example, integration_server.is-fd-toolkit.trace.0.txt. The value of `n` is an integer in the range 0 through 3, and is used as a numbering mechanism for a series of four files into which the trace log is divided. The exception log (ending in exceptionLog.txt) An env.txt output of environment settings

Trace type

Extracted contents

User trace

For each replica pod, you obtain a user-trace-files.zip file in a separate directory.

This ZIP file contains one or more TXT files to which user trace information is written. The TXT files are named in the format integration_server.integrationServerName.userTrace.n.txt; for example, integration_server.is-fd-toolkit.userTrace.0.txt.

The value of n is an integer in the range 0 through 4, enabling trace to be written to five trace log files in a rotational manner. When user trace is started, trace information is written to the file ending in 0.txt. When the 0.txt file becomes full, trace information is then written to the 1.txt file. The process continues until the 4.txt file is full, at which point the existing trace information in 0.txt is overwritten with new information. The process continues in this rotational manner until user trace is stopped.

Service trace

For each replica pod, you obtain the following files in a separate directory:

pod-logs.zip: Contains logs from all containers in the pod
service-trace-files.zip: Contains the following files:
- One or more TXT files to which service trace information is written
  These files are named in the format integration_server.integrationServerName.trace.n.txt; for example, integration_server.is-fd-toolkit.trace.0.txt. The value of n is an integer in the range 0 through 3, and is used as a numbering mechanism for a series of four files into which the trace log is divided.
- The exception log (ending in exceptionLog.txt)
- An env.txt output of environment settings

The following example shows partial contents of a user trace log file.

Clearing user or service trace information

You can use the reset option to erase the trace information that has been written to an integration server's trace log files. You might choose to clear the log before you start a new trace, to ensure that all the logged information is unique to the new trace.

Procedure

To clear user or service trace information, complete the following steps:

From the Servers page of the App Connect Dashboard, locate the tile for the relevant integration server.
Open the options menu on the tile and click the relevant Reset trace option.
You should see a notification confirming that the trace has been reset (for example, Success: Reset user trace).