Star-tree Optimizations
Problems:
1. Certain predicates (e.g. NOT_EQUAL) can select too many branches, limiting the performance of star-tree
2. Star-tree does not support filters with OR
Potential solutions for 1:
- During aggregation, subtract the aggregation results of the non-selected branches from the result of the star node branch. This could become complicated with multiple predicates.
- Create a derived column for the predicate result, and apply star-tree on top of it. This only works for a fixed predicate that is known in advance.
- First use the inverted index to do the filtering, then use pre-aggregated docs to cover the contiguous ranges of selected docs
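As a rough illustration of the first idea, the sketch below models a single tree level as a map from dimension value to pre-aggregated sum, plus a star entry holding the sum over all values. Everything here (`Level`, `build_level`, `sum_not_equal`, the sample rows) is hypothetical and is not Pinot's actual star-tree structure or API. It also assumes an aggregation that can be undone by subtraction, such as SUM or COUNT; the same trick does not work for MIN or MAX.

```python
from dataclasses import dataclass, field


@dataclass
class Level:
    """Toy stand-in for one star-tree level (hypothetical, not Pinot's layout)."""
    children: dict = field(default_factory=dict)  # dimension value -> pre-aggregated sum
    star: int = 0                                 # pre-aggregated sum over all values


def build_level(rows):
    """Pre-aggregate (dimension value, metric) rows into child sums and a star sum."""
    level = Level()
    for value, metric in rows:
        level.children[value] = level.children.get(value, 0) + metric
        level.star += metric
    return level


def sum_not_equal_naive(level, excluded):
    """Visit every branch except the excluded one (many branches selected)."""
    return sum(total for value, total in level.children.items() if value != excluded)


def sum_not_equal(level, excluded):
    """Read the star branch once and subtract the single non-selected branch."""
    return level.star - level.children.get(excluded, 0)


rows = [("US", 10), ("CA", 5), ("US", 2), ("MX", 4)]
level = build_level(rows)
result = sum_not_equal(level, "US")
```

With one excluded value, the subtraction touches two entries no matter how many distinct values the dimension has, while the naive version touches all but one. Combining several such predicates across dimensions is where the bookkeeping gets harder, as noted above.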
Potential solution for 2:
- Use a bitmap to store the selected docs for each predicate under OR, then union the bitmaps
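A minimal sketch of the bitmap idea follows, using a plain Python integer as the bitmap (bit i set means doc i is selected). This is an assumption made to keep the example self-contained; a real implementation would more likely use a compressed bitmap. The helper names and sample docs are hypothetical. The point it shows is that a doc matching both sides of the OR appears once in the union, so it is aggregated only once.

```python
def bitmap_for(docs, predicate):
    """Build a bitmap with bit i set when docs[i] satisfies the predicate."""
    bits = 0
    for doc_id, doc in enumerate(docs):
        if predicate(doc):
            bits |= 1 << doc_id
    return bits


def doc_ids(bits):
    """Yield the doc ids whose bits are set, in ascending order."""
    doc_id = 0
    while bits:
        if bits & 1:
            yield doc_id
        bits >>= 1
        doc_id += 1


# Hypothetical docs; in the star-tree case these would be pre-aggregated docs.
docs = [
    {"country": "US", "browser": "chrome", "clicks": 3},
    {"country": "CA", "browser": "chrome", "clicks": 4},
    {"country": "US", "browser": "firefox", "clicks": 5},
    {"country": "MX", "browser": "safari", "clicks": 6},
]

# One bitmap per predicate under the OR.
us = bitmap_for(docs, lambda d: d["country"] == "US")
chrome = bitmap_for(docs, lambda d: d["browser"] == "chrome")

# Union the bitmaps, then aggregate each selected doc exactly once.
selected = us | chrome
total_clicks = sum(docs[i]["clicks"] for i in doc_ids(selected))
```

Doc 0 satisfies both predicates; summing the two predicate results separately would count its clicks twice, whereas the union keeps a single bit for it.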
Top GitHub Comments
You may read the star-tree doc to get a high-level idea of how star-tree works: https://docs.pinot.apache.org/basics/indexing/star-tree-index?q=star+tree+index
@jpohanka Actually, that would require fundamental changes to the structure of the star tree index, since the current structure would lead to a double counting of certain values for multiple dimensions.
Let me talk to @Jackie-Jiang and get back