IBM Support

java.lang.ClassNotFoundException: Class org.apache.hive.hcatalog.data.JsonSerDe not found when running zeppelin noteblook in DSXL

Troubleshooting


Problem

In zeppelin when trying to  query using hive the following error is thrown: 

java.lang.ClassNotFoundException: Class org.apache.hive.hcatalog.data.JsonSerDe not found

Example:

%livy.pyspark3

from pyspark.sql.functions import *

# df = spark.sql(" SELECT * FROM submissionengine_raw_access.vw_submissionstrings_parsed WHERE Receiveddate = '2018-09-14'")
df = spark.sql(" SELECT * FROM submissionengine_raw_access.submissionstrings_parsed WHERE Receiveddate = '2018-09-14'")
df.show()

Cause

The  HIVE_AUX_JARS_PATH are being overwritten.

Resolving The Problem

Review this Knowledge Article:

https://community.hortonworks.com/questions/105431/orgapachehivehcatalogdatajsonserde-not-found.html

It references the error. 

Further, it is liekly the  HIVE_AUX_JARS_PATH are being overwritten.

The jar files should be available from HDP and this Knowledge Article talks about how to How to permanently add SerDe JARs to Hive.

https://community.hortonworks.com/questions/2390/methods-to-add-jars-to-hive.html

Attached is a print screen to the case from Ambari Console showing the HIVE_AUX_JARS_PATH path.

[{"Business Unit":{"code":"BU059","label":"IBM Software w\/o TPS"},"Product":{"code":"SSKKD2","label":"IBM Data Science Experience Local"},"Component":"","Platform":[{"code":"PF016","label":"Linux"}],"Version":"All Versions","Edition":"","Line of Business":{"code":"LOB10","label":"Data and AI"}}]

Document Information

Modified date:
12 November 2018

UID

ibm10732942