Rerun benchmark with elasticsearch 7.5 or above
See original GitHub issueIn ES 7.5, we made some improvements to the performance of Elasticsearch dense_vector
operations (https://github.com/elastic/elasticsearch/pull/46294). Although I still expect the QPS to be significantly worse than Vespa’s, it would be helpful to rerun the benchmarks against ES 7.5 to get an up-to-date comparison.
Issue Analytics
- State:
- Created 4 years ago
- Comments:14 (9 by maintainers)
Top Results From Across the Web
Benchmarking and sizing your Elasticsearch cluster for logs ...
In this post, we'll tackle performance Elasticsearch benchmarking and sizing questions like the above. We'll go beyond “it depends” to equip ...
Read more >Rally Documentation
You want to benchmark Elasticsearch? Then Rally is for you. It can help you with the following tasks: • Setup and teardown of...
Read more >Elasticsearch: Adventures in scaling a multitenant platform
The original plan was to either benchmark from the existing production cluster or to use our staging platform. However, not wanting to impact...
Read more >A benchmark-based evaluation of search-based crash ...
To that end, we devise a new benchmark of real-world crashes, ... 36.9% for Defects4J, to 7.5% for XWiki, and only 3% for...
Read more >Vdbench performance test on raw device - FlamingBytes
Master and Slave: Vdbench runs as two or more Java Virtual Machines (JVMs). ... The parameters include General, Host Definition (HD), Replay ......
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
@jtibshirani the vector is not returned with the result, if that was the case yes - I would have spotted it.
Sample response from ES
On cpu architectures, yes it’s explained by us using avx512 instructions See
Will soon update with results using our HNSW implementation for approximate nearest neighbor search, some sample data with gist data set:
@jobergum I’m sorry for the late reply. I’m not sure why your benchmarking results aren’t lining up with @mayya-sharipova’s. The only other difference that comes to mind is that we always make sure to omit the returning the full document source in results by setting
_source: false
in the search request body: https://www.elastic.co/guide/en/elasticsearch/reference/7.6/search-request-body.html#request-body-search-source-filtering. Otherwise ES will load and return the whole stored vector for the top 10 results, whereas we are just interested in the document IDs.Thanks! The ‘Ivy Bridge’ numbers make sense to me, based on the previous results and the performance improvements in ES. However the Haswell numbers are more surprising – do you know why Vespa shows a latency improvement of ~2x between the Ivy Bridge and Haswell processors?