Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Won't Fix
-
3.0.0
-
None
Description
The java kafka consumer creates multiple collections during a single poll command: in my test system i have a consumer that polls a topic with 100 partitions and even though no messages are coming through, the code allocates around 100M per 5 minutes.
I've investigated the allocations and the biggest ones can be easily avoided by moving them to the instance level, something that can be done as KafkaConsumer is not thread safe. Purpose of this Jira is to get rid of most of them applying either this or a similar approach.
Attachments
Issue Links
- is related to
-
SPARK-36837 Upgrade Kafka to 3.1.0
- Resolved
- links to