numEntriesScannedInFilter > numDocsScanned in certain scenarios


When running a query like the following:

SELECT COUNT(*) FROM table WHERE id = x AND time BETWEEN a AND b

numEntriesScannedInFilter shows a larger number than numDocsScanned. Why is this the case? Shouldn't it scan only the records we have already selected?
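
One plausible reading of these two counters, sketched as a toy model in Python. This is not Pinot code: the function, the row layout, and the assumption that `id` is resolved through an index while `time` is evaluated by scanning values are all hypothetical. It only illustrates that values read while evaluating a filter are counted separately from the documents that survive the filter, so the first count can be larger than the second.

```python
from dataclasses import dataclass


@dataclass
class ScanStats:
    # Names mirror the Pinot counters, but this is only a toy model.
    num_entries_scanned_in_filter: int = 0
    num_docs_scanned: int = 0


def count_matching(rows, target_id, start, end):
    """Toy COUNT(*) with `id = target_id AND time BETWEEN start AND end`.

    Assumption: `id` is answered by an index (no values read),
    while `time` has no usable index and must be read row by row.
    """
    stats = ScanStats()
    # Stand-in for an index lookup: candidates are found without counting scans.
    candidates = [i for i, row in enumerate(rows) if row["id"] == target_id]
    matched = 0
    for i in candidates:
        # Every `time` value read to evaluate the range predicate is one entry scanned.
        stats.num_entries_scanned_in_filter += 1
        if start <= rows[i]["time"] <= end:
            # Only documents that pass the whole filter count as docs scanned.
            matched += 1
            stats.num_docs_scanned += 1
    return matched, stats


# 100 rows for id 1 and 100 rows for id 2, with time values 0..99 each.
rows = [{"id": 1, "time": t} for t in range(100)]
rows += [{"id": 2, "time": t} for t in range(100)]

matched, stats = count_matching(rows, target_id=1, start=10, end=19)
print(matched, stats)
```

In this toy run, 100 `time` values are read to find the 10 rows that fall in the range, so the filter counter is ten times the document counter.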

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 12 (12 by maintainers)

Top GitHub Comments

2 reactions
lfernandez93 commented, Oct 28, 2021

In that example query, we went from 50 ms to 10 ms.

1 reaction
lfernandez93 commented, Oct 28, 2021

After upgrading Pinot and making the above change, I can confirm that the above query is now not only faster, but numEntriesScannedInFilter also went from more than a million to 0.

Thank you so much @richardstartin and @kishoreg


Top Results From Across the Web

[GitHub] [pinot] lfernandez93 commented on issue #7642
[GitHub] [pinot] lfernandez93 commented on issue #7642: numEntriesScannedInFilter > numDocsScanned in certain scenarios.

Troubleshooting Pinot - Apache Pinot Docs
Specifically look at numEntriesScannedInFilter and numDocsScanned. ... If numDocsScanned is very high, that means the selectivity for the query is low and ......

Low Latency (OLAP) Movie Rating Analytics Using Apache Pinot
Use Apache Pinot to stream ingesting movie rating events across countries from a Kafka topic and make them available for immediate querying ...

Apache Pinot: Exploring range queries - Mark Needham
We loaded this data into Apache Pinot and then analysed the numEntriesScannedInFilter and ... Now it's time to do some range querying....
