IBM Support

PH07586: QUERIES AGAINST PARQUET FILE FORMAT TABLES WITH COMPLEX TYPES (ARRAY, ROW or STRUCT) FAIL WITH ERROR

Subscribe

You can track all active APARs for this component.

 

APAR status

  • Closed as fixed if next.

Error description

  • After querying a table which includes a complex type (ARRAY,
    ROW or STRUCT), all subsequent queries against PARQUET file
    format tables will use the non-optimized reader, which will
    result in poor performance.   You might also see a trap similar
    to the following when selecting from a PARQUET file format
    table:
    db2diag.log:
    FUNCTION: DB2 UDB, BSU Java support, sqlejLogExceptionToDiag,
    probe:1519
    DATA #1 : String, 403 bytes
    COM.ibm.db2.app.DB2ExtTableException:
    DeprecatedParquetHiveInput : size of object differs. Value size
    :  0, Current Object size : 21
        at
    com.ibm.biginsights.bigsql.dfsrw.DfsException.mapToDb2Exception(
    DfsException.java:161)
        at
    com.ibm.biginsights.bigsql.dfsrw.DfsException.mapAndLogException
    (DfsException.java:245)
        at
    com.ibm.biginsights.bigsql.dfsrw.reader.DfsBaseReader.read(DfsBa
    seReader.java:703)
    bigsql.log:
    2018-12-27 15:48:44,079 ERROR
    com.ibm.biginsights.bigsql.dfsrw.reader.DfsBaseReader
    [Master-17-S:21.17.1000001.175.0.23282] : [BSL-17-7f16d302e]
    Exception raised by Reader at node: 17 Scan ID:
    S:21.17.1000001.175.0.23282 Table: bigsql.testtable Spark:
    false VORC: false VPQ: false VAVRO: false VTEXT: false VRCFILE:
    false VANALYZE: false
    Exception Label: UNMAPPED(java.io.IOException:
    DeprecatedParquetHiveInput : size of object differs. Value size
    :  0, Current Object size : 21)
    java.io.IOException: DeprecatedParquetHiveInput : size of
    object differs. Value size :  0, Current Object size : 21
        at
    org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWra
    pper.next(ParquetRecordReaderWrapper.java:216)
        at
    org.apache.hadoop.hive.ql.io.parquet.read.ParquetRecordReaderWra
    pper.next(ParquetRecordReaderWrapper.java:61)
        at
    com.ibm.biginsights.bigsql.dfsrw.reader.DfsHiveSplit2Batch.split
    2Batch(DfsHiveSplit2Batch.java:110)
        at
    com.ibm.biginsights.bigsql.dfsrw.jaro.DfsSplitManager$SplitRunna
    ble.run(DfsSplitManager.java:123)
        at
    java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExec
    utor.java:1160)
        at
    java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExe
    cutor.java:635)
        at java.lang.Thread.run(Thread.java:785)
        at
    com.ibm.biginsights.bigsql.dfsrw.jaro.DfsSplit2BatchThread.run(D
    fsSplit2BatchThread.java:58)
    

Local fix

  • - Recycle Big SQL to restore performance
    - Use ORC file format tables
    

Problem summary

  • Please see problem description
    

Problem conclusion

Temporary fix

Comments

APAR Information

  • APAR number

    PH07586

  • Reported component name

    IBM BIG SQL

  • Reported component ID

    5737E7400

  • Reported release

    503

  • Status

    CLOSED FIN

  • PE

    NoPE

  • HIPER

    NoHIPER

  • Special Attention

    NoSpecatt / Xsystem

  • Submitted date

    2019-01-22

  • Closed date

    2020-09-09

  • Last modified date

    2020-09-09

  • APAR is sysrouted FROM one or more of the following:

  • APAR is sysrouted TO one or more of the following:

Fix information

Applicable component levels

[{"Line of Business":{"code":"LOB10","label":"Data and AI"},"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSCRJT","label":"IBM Db2 Big SQL"},"Platform":[{"code":"PF025","label":"Platform Independent"}],"Version":"503"}]

Document Information

Modified date:
10 September 2020