Event counter
The messages displayed
in the log file concerning the event counter
table are of three types:
- Messages that confirm the successful event counter initialization. No action is needed.
- Messages that the event counter reports related to problems not related to it. For example, they could reveal that the workstation received a message out of sequence. If action is required it does not impact the event counter.
- Messages that indicate that the event counter has failed. User action is needed to restore the counter.
This section concerns itself with this third type of messages.
Two processes
can display this kind of error message:
- Writer
- When an error message of this
type is received from writer,
the event counters stops. All messages received from the workstation
which asked netman to activate writer, and from all its children,
are ignored. This can lead to two situations:
- The workstation talking to writer is switched to a new manager. In this case the new manager asks for a counter table and receive a corrupt counter table. The replay protocol proceeds following the default behavior.
- Before the switchmgr operation can be performed, writer fails and is automatically restarted. In this case the counter mechanism partially repairs itself. New messages received by the process are stored in the counter, but the messages received by the writer from the moment the error message was displayed up to the point at which writer restarted are not tracked. The situation for a given workstation might be considered as reset only when the new instance of writer receives a message from it.
The situation is recovered after the next scheduled JnextPlan. If you need to recover more urgently, run JnextPlan -for 0000 to refresh the Symphony file.
- Mailman
- When an error message of this
type is received from mailman, the event counters stops. Mailman sets
the IDs of all messages to 0. This means that there is a risk of duplication,
because without the event counter, mailman is unable to properly sequence
and process messages. When the switchmgr is performed, and the new domain manager commences the replay protocol mechanism, for each message in the ftbox it looks at the position of the target workstation with respect to its own position in the tree:
- If the position of the target workstation in the workstation tree is higher than the new domain manager's (the workstation is either the domain manager or a full-status member of the parent domain of the domain where the switchmgr operation took place), the message is sent.
- If the position of the target workstation in the workstation tree is lower than the new domain manager's (the workstation either belongs to the domain where the switchmgr operation took place and it is not the new domain manager or is the domain manager or a full-status member of one of the child domains), the message is not sent.
The situation is recovered after JnextPlan.