IBM InfoSphere Master Data Management, Version 11.3Stopping and restarting batch jobs are essential functions when processing large volumes of data.
The batch processor can restart the following types of stopped batch jobs:
To restart a stopped task, update the task status to Pending by either:
The batch processor will then pick up the pending request and try to restart it from where it stopped.
The batch processor is able to restart a batch job at the correct place by using three detailed status files that are kept for each task: a Stage file, a Result file, and a Restart file.
The first line of the Stage file is a title. Each line after the title stores a message ID and record. The message ID is a unique, sequential number generated by the batch processor at runtime to identify each record involved in the job. The types of information for each record in the Stage file depend on what is defined in the METADATA_KEY of the CDMETADATAINFOTP code table for the task.
For example:
MessageID,ENTITY_ID,ENTITY_TYPE
1,100000000000000001,mdmper
2,100000000000000002,mdmper
3,100000000000000003,mdmper
4,100000000000000004,mdmper
Alternate example:
MessageID,NO_TITLE_LINEENTITY_ID
1,<?xml version="1.0" encoding="UTF-8"?><TCRMService ...
2,<?xml version="1.0" encoding="UTF-8"?><TCRMService ...
3,<?xml version="1.0" encoding="UTF-8"?><TCRMService ...
4,<?xml version="1.0" encoding="UTF-8"?><TCRMService ...
The Result file stores the unique message ID of each record in the batch job along with a result category to represent the outcome of the processing for that record:
Each line in the Result file represents a different record. For example:
1,S
2,F
3,S
4,S
The batch processor determines whether to mark a processing outcome as a success or failure depending on the result categorizer class, as defined in the Batch.properties file.
resultCategorizer=com.ibm.mdm.batchframework.message.BatchMessageCategorizer
The BatchMessageCategorizer determines the message outcome based on whether the transaction results in a DWLResponseException message. If so, the outcome is a failure (F); otherwise, the outcome is a success (S).
resultCategorizer=com.ibm.mdm.batchframework.bulkprocessing.restart.ResultCodeMessageCategorizer
The ResultCodeMessageCategorizer determines the message outcome based on the value of the <ResultCode> tag from its response output. If the value is SUCCESS, then the outcome is S; otherwise, the outcome is F.
The Restart file has the same format as the Stage file. The Restart file contains a subset of the Stage file, and is made up of the entities in the Stage file, minus a subset of the entities in the Result file.
The batch processor uses the Restart file as an input file to process the remaining entities in the restarted batch job.