Query with another index/ graph?
See original GitHub issueThis is a question/ feature request.
My (cursory) understanding of the NNDescent algorithm makes me think it should be possible to increase query speed for a test set if we already had an index built on it. Is this the case?
If so, would this query be in scope for this project?
# a: NNDescent, b: NNDescent
a.query(b)
Issue Analytics
- State:
- Created 4 years ago
- Comments:5 (4 by maintainers)
Top Results From Across the Web
Search multiple data streams and indices - Elastic
To search multiple data streams and indices, add them as comma-separated values in the search API's request path. The following request searches the...
Read more >Indexing Query Graphs to Speedup Graph Query Processing
The supergraph index on the other hand is a new index to swiftly determine supergraph status between new and previous queries.
Read more >[Workshop] Indexing and Querying with The Graph - YouTube
[Workshop] Indexing and Querying with The Graph. 1.8K views Streamed 2 years ago. Chainlink. Chainlink. 49.7K subscribers. Subscribe.
Read more >Indexing for Graph Query Evaluation | SpringerLink
Given a graph, an index is a data structure supporting a map from a collection of keys to a collection of elements in...
Read more >Modifying relevance score in elasticsearch using results from ...
If i understad your problem, you have your primary results comming from another source (graph DB) and that score is highly dependent of ......
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I think this is possible, but tricky. The catch is that you want to to restrict the potential neighbors to be only those in the subset. This is hard to do and still end up with an adequte number of neighbors for each sample by simply subselecting from an existing index. I don’t actually see any easy ways to enable this.
In general, I think I’d be interested in contributing an implementation here. I’ve had some time to look over the paper (thanks for recommending that!), and had a couple questions – if you don’t mind. First, I’d like to make sure I’m on the same page about what I’d like to be able to do, and what’s described in the paper, then I had some questions about potential modifications.
The problem I would like to address is finding K neighbors from one dataset in another, essentially creating a bipartite graph. My understanding of this paper is that it only deals with building a joint graph containing all samples where all nodes have out-degree K. Is this right?
These main idea of using the predefined structure for faster search as in P-merge is definitely shared. I had a couple naive ideas for modification towards my ends: (1) Change the final step of P-merge. Instead of merging G- and H- into U, you could take the between dataset neighbors of U (i.e. 𝑈∩(𝐺+∪𝐻+)) replacing within dataset neighbors with sampled points, then optimize until convergence again. (2) Change the initialization so that you’ve got at least k samples from each dataset to be searched. Then remove all intra dataset edges and take top k remaining edges. Do you think these approaches might be worth pursuing?