IBM Support

Improving Db2 HADR resiliency through extended network data integrity checking and retransmission

Troubleshooting


Problem

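In a system configured for High Availability Disaster Recovery (HADR), transaction log data is shipped over the network from the primary database to the standby database.
If the transaction log data is corrupted during network transmission, the standby database can show a variety of error symptoms, which usually include abnormal termination and shutdown of the standby database.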

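Because some integrity checks are already performed asynchronously on the standby database as part of log record replay, db2diag.log typically contains the error message
"Log page checksum mismatch. pageLso xxxxxx, CheckSum xxxxxx":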

2017-08-29-14.45.16.809825-240 I99805E781 LEVEL: Error
PID : <pid#> TID : <tid#> PROC : db2sysc
INSTANCE: <instname> NODE : <node#> DB : <db2name>
HOSTNAME: hostname1
EDUID : 316 EDUNAME: db2lfr.0 (<dbname>)
FUNCTION: DB2 UDB, High Availability Disaster Recovery, hdrGetBlock, probe:40100
MESSAGE : ZRC=0x87800148=-2021654200=HDR_ZRC_BAD_LOG
"HADR standby found bad log"
DATA #1 : <preformatted>
Log page checksum mismatch. pageLso xxxxxx, CheckSum xxxxxx
 

Other possible error messages include:

"Bad page detected. The bytecount is incorrect."

"Bad page detected. Checksum mismatch. pageLso / Checksum / Calculated Checksum"

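The messages quoted above can be searched for in db2diag.log with a short script. The following is a minimal sketch, not an IBM-provided tool: the function name `find_corruption_messages` is hypothetical, the pattern list contains only the message fragments quoted in this document, and the log path is whatever the caller supplies.

```python
import re
import sys
from typing import Iterable, List, Tuple

# Message fragments quoted in this document; the list is not exhaustive.
CORRUPTION_PATTERNS = [
    re.compile(r"HDR_ZRC_BAD_LOG"),
    re.compile(r"Log page checksum mismatch"),
    re.compile(r"Bad page detected\. The bytecount is incorrect"),
    re.compile(r"Bad page detected\. Checksum mismatch"),
]


def find_corruption_messages(lines: Iterable[str]) -> List[Tuple[int, str]]:
    """Return (line number, text) for each line matching a known fragment."""
    hits = []
    for lineno, line in enumerate(lines, start=1):
        if any(p.search(line) for p in CORRUPTION_PATTERNS):
            hits.append((lineno, line.rstrip("\n")))
    return hits


if __name__ == "__main__" and len(sys.argv) > 1:
    # The path to db2diag.log is passed as the first argument.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as log:
        for lineno, text in find_corruption_messages(log):
            print(f"{lineno}: {text}")
```

A match only indicates that one of these messages was logged; the surrounding db2diag.log entry (timestamp, EDUNAME, FUNCTION) is still needed to confirm that it came from the HADR standby.
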
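More importantly, because the corrupted log data may already have been flushed to disk on the standby database, subsequent restarts of the standby database fail with the same condition. This is typically resolved by manually replacing the corrupted log files in the active log path of the standby database with valid copies from the primary database.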
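
The manual replacement described above could be scripted roughly as follows. This is only a sketch under stated assumptions: the standby database is already deactivated, a valid copy of the log file has already been transferred from the primary, and the function name `replace_log_file`, the quarantine directory, and all paths are hypothetical. It is not a documented IBM procedure.

```python
import shutil
from pathlib import Path


def replace_log_file(valid_copy: Path, active_log_dir: Path,
                     quarantine_dir: Path) -> Path:
    """Set the damaged log file aside, then copy the valid one into place.

    Assumes the valid copy has the same file name as the damaged file
    and that the standby database is not running.
    """
    target = active_log_dir / valid_copy.name
    quarantine_dir.mkdir(parents=True, exist_ok=True)
    if target.exists():
        # Keep the damaged file for later analysis instead of deleting it.
        shutil.move(str(target), str(quarantine_dir / target.name))
    shutil.copy2(valid_copy, target)
    return target
```

Keeping the damaged file in a separate directory, rather than overwriting it, preserves evidence in case the corruption has to be investigated later.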

Product: Db2 for Linux, UNIX and Windows
Platform: AIX, Linux, Windows
Version: 10.5 Fix Pack 9 and Version 11.1 Mod Pack 3 and above

Document Information

Modified date:
01 May 2025

UID

ibm10734357