question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Unable to consume messages, need to re-subscribe client

See original GitHub issue

This is similar to issue 2013. We have several shared subscriptions of partitioned topics and we found some cases where a subscription’s client is not receiving messages, but it’s not related to a specific topic partition. In this case, if we re-subscribe the client (PulsarClient.subscribeAsync), messages start to be delivered as expected.

Client logs this warn:

[log_time:16:17:42.377] [thread:pulsar-timer-6-1] [level:WARN ] [logger:UnAckedMessageTracker] - [ConsumerBase{subscription='picking-consumer-prod.fbm-wms-picking', consumerName='i-08e97d991d75e9d07-10.73.125.156', topic='persistent://fury/global/wms-wave-feed-prod.fbm-wms-picking/wms-wave-feed-prod.fbm-wms-picking'}] 1 messages have timed-out

And we don’t see any error from brokers.

Any idea? Any log or metric that may be useful to debug this problem?

System configuration

Pulsar version (brokers): 1.20.0 Pulsar clients version: 1.22.0 Bookkeeper version: 4.7.1

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:14 (8 by maintainers)

github_iconTop GitHub Comments

3reactions
Poluectcommented, Dec 27, 2019

Hello! We started using Pulsar instead of Kafka in our system, and currently facing the same issue, when few subscriptions (like 1 - 3 of 30) stop receiving messages, and they get stuck in a backlog, until either a subscriber, or pulsar proxy restarted.

We are using version 2.4.2, with 3 brokers in a cluster. Issue appears pretty often, like 3 - 6 times in a day. Stats also attached. Hope this issue can be fixed or at least hope there can be any workaround except of force deleting kube pods with subscriptions to make them reconnect. We used Kafka to handle some critical parts of our system, unfortunately we can’t rely on Pulsar without having simplest consumer - producer scenario work well. Thank you!

topic-stats-internal.txt topic-stats.txt

0reactions
sijiecommented, Jul 2, 2020

@Poluect thank you for your update! I would close this issue for now. Please re-open one if there is still an issue in the latest release.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Consumer not receiving messages, kafka console, new ...
I my MAC box I was facing the same issue of console-consumer not consuming any messages when used the command kafka-console-consumer --bootstrap-server ...
Read more >
Messaging - Apache Pulsar
When a consumer fails to consume a message and needs to re-consume it, the consumer sends a negative acknowledgement (nack) to the broker,...
Read more >
Chapter 4. Kafka Consumers: Reading Data from Kafka
During a rebalance, consumers can't consume messages, so a rebalance is basically a short window of unavailability of the entire consumer group. In...
Read more >
Troubleshoot Amazon SNS publishing and subscription issues
An AWS Identity and Access Management (IAM) resource or identity can't publish or subscribe to an Amazon SNS topic without the required ......
Read more >
Consuming Messages - KafkaJS
When a consumer fails the load is automatically distributed to other members of the ... If your broker has topic-A and topic-B ,...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found