Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Query with another index/ graph?

See original GitHub issue

This is a question/ feature request.

My (cursory) understanding of the NNDescent algorithm makes me think it should be possible to increase query speed for a test set if we already had an index built on it. Is this the case?

If so, would this query be in scope for this project?

# a: NNDescent, b: NNDescent
a.query(b)

Issue Analytics

State:
Created 4 years ago
Comments:5 (4 by maintainers)

Top GitHub Comments

1reaction

lmcinnescommented, Jun 23, 2020

I think this is possible, but tricky. The catch is that you want to to restrict the potential neighbors to be only those in the subset. This is hard to do and still end up with an adequte number of neighbors for each sample by simply subselecting from an existing index. I don’t actually see any easy ways to enable this.

1reaction

ivirshupcommented, Sep 20, 2019

In general, I think I’d be interested in contributing an implementation here. I’ve had some time to look over the paper (thanks for recommending that!), and had a couple questions – if you don’t mind. First, I’d like to make sure I’m on the same page about what I’d like to be able to do, and what’s described in the paper, then I had some questions about potential modifications.

The problem I would like to address is finding K neighbors from one dataset in another, essentially creating a bipartite graph. My understanding of this paper is that it only deals with building a joint graph containing all samples where all nodes have out-degree K. Is this right?

These main idea of using the predefined structure for faster search as in P-merge is definitely shared. I had a couple naive ideas for modification towards my ends: (1) Change the final step of P-merge. Instead of merging G- and H- into U, you could take the between dataset neighbors of U (i.e. 𝑈∩(𝐺⁺∪𝐻⁺)) replacing within dataset neighbors with sampled points, then optimize until convergence again. (2) Change the initialization so that you’ve got at least k samples from each dataset to be searched. Then remove all intra dataset edges and take top k remaining edges. Do you think these approaches might be worth pursuing?

Top Results From Across the Web

Search multiple data streams and indices - Elastic

To search multiple data streams and indices, add them as comma-separated values in the search API's request path. The following request searches the...

Indexing Query Graphs to Speedup Graph Query Processing

The supergraph index on the other hand is a new index to swiftly determine supergraph status between new and previous queries.

[Workshop] Indexing and Querying with The Graph - YouTube

[Workshop] Indexing and Querying with The Graph. 1.8K views Streamed 2 years ago. Chainlink. Chainlink. 49.7K subscribers. Subscribe.

Indexing for Graph Query Evaluation | SpringerLink

Given a graph, an index is a data structure supporting a map from a collection of keys to a collection of elements in...

Modifying relevance score in elasticsearch using results from ...

If i understad your problem, you have your primary results comming from another source (graph DB) and that score is highly dependent of ......