Query with another index/graph?

This is a question/feature request.

My (cursory) understanding of the NNDescent algorithm makes me think it should be possible to increase query speed for a test set if we already had an index built on it. Is this the case?

If so, would this query be in scope for this project?

# a: NNDescent, b: NNDescent
a.query(b)

Issue Analytics

  • State: open
  • Created 4 years ago
  • Comments: 5 (4 by maintainers)

Top GitHub Comments

lmcinnes commented, Jun 23, 2020

I think this is possible, but tricky. The catch is that you want to restrict the potential neighbors to only those in the subset. It is hard to do that by simply subselecting from an existing index and still end up with an adequate number of neighbors for each sample. I don’t actually see any easy ways to enable this.
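To illustrate the shortfall described above, here is a minimal pure-Python sketch. It assumes an exact brute-force neighbor graph as a stand-in for an existing index; `knn_graph` and `restrict_to_subset` are hypothetical helpers written for this example, not part of any library.

```python
import heapq
import math

def knn_graph(points, k):
    """Exact k-nearest-neighbor lists (self excluded); a stand-in for an existing index."""
    graph = []
    for i, p in enumerate(points):
        dists = ((math.dist(p, q), j) for j, q in enumerate(points) if j != i)
        graph.append([j for _, j in heapq.nsmallest(k, dists)])
    return graph

def restrict_to_subset(graph, subset):
    """Keep only subset members, and only their neighbors that are also in the subset."""
    allowed = set(subset)
    return {i: [j for j in graph[i] if j in allowed] for i in subset}

# Six 1-D points: a tight cluster near 0 and a pair near 10.
points = [(0.0,), (0.1,), (0.2,), (0.3,), (10.0,), (10.1,)]
k = 2
subset = [0, 4, 5]

restricted = restrict_to_subset(knn_graph(points, k), subset)
# Number of surviving neighbors per subset member, to compare against k.
counts = {i: len(neighbors) for i, neighbors in restricted.items()}
```

In this toy data every subset member ends up with fewer than `k` neighbors after filtering, because most of its nearest neighbors in the full graph lie outside the subset.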

ivirshup commented, Sep 20, 2019

In general, I think I’d be interested in contributing an implementation here. I’ve had some time to look over the paper (thanks for recommending that!), and had a couple questions – if you don’t mind. First, I’d like to make sure I’m on the same page about what I’d like to be able to do, and what’s described in the paper, then I had some questions about potential modifications.

The problem I would like to address is finding K neighbors from one dataset in another, essentially creating a bipartite graph. My understanding of this paper is that it only deals with building a joint graph containing all samples where all nodes have out-degree K. Is this right?
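As a reference point for the target output, the bipartite graph described above can be written as a brute-force search. This is only a sketch of the desired result, not of how NNDescent or P-merge would compute it; `bipartite_knn` is a hypothetical name and Euclidean distance is an assumption.

```python
import heapq
import math

def bipartite_knn(queries, references, k):
    """For each query point, return its k nearest reference points as (index, distance) pairs."""
    graph = []
    for q in queries:
        dists = ((math.dist(q, r), j) for j, r in enumerate(references))
        graph.append([(j, d) for d, j in heapq.nsmallest(k, dists)])
    return graph

a = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]  # dataset to search in
b = [(0.1, 0.0), (4.0, 5.0)]              # dataset to search from

# Every edge goes from a point in b to a point in a; there are no within-dataset edges.
graph = bipartite_knn(b, a, k=2)
```

Each row of `graph` has exactly `k` entries, all pointing into the other dataset. This costs one distance computation per query-reference pair, which is the cost an index-to-index query would aim to avoid.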

The main idea of using the predefined structure for faster search, as in P-merge, is definitely shared. I had a couple of naive ideas for modifications towards my ends: (1) Change the final step of P-merge. Instead of merging G- and H- into U, you could take the between-dataset neighbors of U (i.e. 𝑈∩(𝐺+∪𝐻+)), replace the within-dataset neighbors with sampled points, then optimize until convergence again. (2) Change the initialization so that you’ve got at least k samples from each dataset to be searched. Then remove all intra-dataset edges and take the top k remaining edges. Do you think these approaches might be worth pursuing?
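A rough sketch of the bookkeeping in idea (2), under stated assumptions: candidates are drawn at random, the NN-descent refinement that would normally run between initialization and filtering is omitted, and `init_candidates` and `cross_dataset_top_k` are hypothetical helper names. It only shows that sampling k candidates from each dataset guarantees k cross-dataset edges survive the filter.

```python
import math
import random

def init_candidates(points, labels, k, rng):
    """For every node, sample up to k candidates from each dataset (the node itself excluded)."""
    by_label = {}
    for idx, label in enumerate(labels):
        by_label.setdefault(label, []).append(idx)
    candidates = {}
    for i in range(len(points)):
        picked = []
        for members in by_label.values():
            pool = [j for j in members if j != i]
            picked.extend(rng.sample(pool, min(k, len(pool))))
        candidates[i] = [(math.dist(points[i], points[j]), j) for j in picked]
    return candidates

def cross_dataset_top_k(candidates, labels, k):
    """Drop intra-dataset edges, then keep the k closest remaining edges per node."""
    return {
        i: sorted((d, j) for d, j in edges if labels[j] != labels[i])[:k]
        for i, edges in candidates.items()
    }

rng = random.Random(0)
points = [(float(x), 0.0) for x in range(8)]
labels = ["G"] * 4 + ["H"] * 4  # two datasets of four points each
k = 2

# A real implementation would refine the candidates before this filtering step.
graph = cross_dataset_top_k(init_candidates(points, labels, k, rng), labels, k)
```

Without the refinement step the surviving edges are just the closest of a random sample, so this is not a nearest-neighbor result; it only demonstrates that the out-degree stays at `k` after intra-dataset edges are removed.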
