Troubleshooting
Problem
The processing time of a single batch in the pipeline is longer than the maximum polling interval (
max.poll.interval.ms) in the Kafka stage.max.poll.interval.mscan be configured under Configuration Properties in the Connection tab in the Kafka Multitopic Consumer. If it is not configured, it defaults to300000(5 minutes) per the Kafka documentation.This causes the pipeline to not poll the consumer within the maximum interval, which causes the consumer to be removed from the consumer group, leading to the failure of the pipeline.
Additionally, because the consumer has been removed from the group, it is unable to write the offset to the Kafka broker, and therefore the offset is lost (as, due to the nature of Kafka consumer groups, in Kafka pipelines, the offset is maintained by Kafka, not by SDC)
Symptom
A Kafka Multitopic Consumer pipeline which is processing large amounts of data per batch fails.
The pipeline is unable to retain the offset after the pipeline fails.
Logs contain the following message:
ERROR ProductionPipelineRunnable - An exception occurred while running the pipeline, com.streamsets.pipeline.api.StageException: KAFKA_29 - Error fetching data from Kafka: org.apache.kafka.clients.consumer.CommitFailedException: Offset commit cannot be completed since the consumer is not part of an active group for auto partition assignment; it is likely that the consumer was kicked out of the group.
Resolving The Problem
The processing time for any single batch of data in the pipeline must be shorter than the configured value of max.poll.interval.ms in the Kafka Multitopic Consumer. This can be accomplished in three ways:
Reduce the batch size in order to reduce the processing delay.
Optimize the pipeline’s data processing in order to redcue the processing delay.
Configure a value for
max.poll.interval.mswhich is longer than the longest expected processing time for a single batch of data.For example, if the longest expected processing time for a batch of data is 10 minutes,
max.poll.interval.msmust be configured to a value which is larger than600000(be sure to allow some margin for variance in processing times).
Document Location
Worldwide
Was this topic helpful?
Document Information
Modified date:
15 March 2025
UID
ibm17186223