FTO HUBS need to be kept up to date with each other.
Have had a number of pmrs recently when there has been issues with a failover to the second TEMs in a FTO pair.
Now there are some known issues with the synchronization of these machines, but there are fixes available;
and I would recommend that you are at 6.3 FP06 to get all the fixes that are out there at the moment for FTO pairs.
However in more than one case the issues were more fundamental than that.
The FTO pair had not been kept up to date with each other.
This was usually due to the secondary HUB not normally being up and running and being kept as just a backup machine.
This is not the way a FTO pair should be kept, they should both be running at all times, as they then sychronize changes in information and so it does
not matter which of the two machines is the primary and there is a smooth transfer if one server fails or needs to be stopped.
The issues I have seen as not being kept up to date are issues like:
1) Application support; either not applied to one of the machines or it has not been updated to the same level on both machines.
This does not result in records being lost from the TEMS database, but it does mean that attributes are not recognized and situations not being handled correctly from one of the servers.
Queries from the TEPS to the TEMS may not be recognized as the attribute is not known.
This can be checked by getting a cinfo list from both machines and checking that the application support is to the same level on both machines,
and all application support is on both machines.
2) If it is used the xml file created by tedgen (output.xml) for the sending of events to an Event Server is not the same on both machines,
if separate TEPS are used for each TEMS.
This can result in events not being able to be sent to the Event Server when the machine missing some definitions is the primary server, as the definitions are not recognized.
The location of the file can be found by checking for the attribute KFW_MCS_XML_FILES=<tedgen_output_file> in the TEPS config file.
Then a quick check of the size and date of the file, should show if the files are the same.
Again changes in application support effect this file, and it should be updated when new application support added, and then the file populated to any other TEPS.
3) Configuration changes being applied to one but not both machines.
Have seen issues like
a) KMS_SDA=N on one machine and KMS_SDA=Y on the other.
If the self-describing agent function is set on at one HUB it should be set as on for both HUBs.
b) missing lines such as:
c) Make sure any tuning parameters (such as MAX_CTIRA_RECURSIVE_LOCKS) are also on both servers, if they are needed on one they will be needed
on the other machine.
4) Both HUB FTO's should be kept at the same level of ITM and this Tech Note gives the best steps for upgrading a FTO pair.
5) Even with a FTO pair backups should be done and this blog gives good details of this:
Subscribe and follow us for all the latest information directly on your social feeds: