IBM Support

JR51052: XML STAGE'S DISK-BASED OPERATIONS FOR HJOIN, SORT AND REGROUP STEPS HAVE VERY LARGE MEMORY FOOT PRINT.

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as program error.

Error description

  • XML Stage's disk-based operations for HJoin, Sort and Regroup
    steps have very large memory foot print.
    
    The solution is be to fix the the disk-based implementation,
    reducing it's memory consumption.
    

Local fix

  • 1. Sort the data records first in the DataStage and then use the
     optimized regroup operation. In this case, if the Regroup step
    is not the first step in the stage the developer should
    guarantee that no re-ordering of the data occurs prior to the
    Regroup that would cause it to become unsorted.
    
    2. Sort the data records first in the DataStage using the
    DataStage operator instead of using the Hierarchical Data stage
    sort step.
    
    3. Use the Wave Generator in DataStage to divide the data into
    waves and run the job for each wave of data. This approach only
    works for Parallel Engine jobs, not for DataStage Server jobs.
    
    4. Break a job run with a large data set into multiple job runs.
    Each run processes a subset of the large data set.
    

Problem summary

  • XML STAGE'S DISK-BASED OPERATIONS FOR HJOIN, SORT AND REGROUP
    STEPS HAS VERY LARGE MEMORY FOOT PRINT
    

Problem conclusion

  • Increased the default thread as 4
    

Temporary fix

Comments

APAR Information

  • APAR number

    JR51052

  • Reported component name

    INFO SRVR PLATF

  • Reported component ID

    5724Q3612

  • Reported release

    850

  • Status

    CLOSED PER

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt

  • Submitted date

    2014-08-19

  • Closed date

    2015-12-09

  • Last modified date

    2015-12-09

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

  • Fixed component name

    INFO SRVR PLATF

  • Fixed component ID

    5724Q3612

Applicable component levels

  • R850 PSY

       UP

[{"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSZJPZ","label":"IBM InfoSphere Information Server"},"Component":"","ARM Category":[],"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"8.5","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
09 December 2015