question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Index my own data

See original GitHub issue

Even though this issue is a duplicate from 102 I was wondering if it was still possible to index my own data from pyserini.

102 was solved by exposing JIndexCollection in the pyclass.py file at the time. It has been removed ever since and is not accessible anymore.

The only way I found to index data is to call the main within pyserini/index with command line arguments but I might be missing something.

There is also a hidden exposure of IndexCollection in JCounters within pyserini/index/__base.py but not accessible from the code.

Is there still a programmatic way to index my data using pyserini ?

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:7 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
lintoolcommented, Mar 25, 2021

tbh, I don’t remember the design rationale… but nothing comes to mind as to why we hid the functionality you need…

Seems like this is a relatively small fix? Would you be willing to send a PR and we can discuss more concretely?

0reactions
lintoolcommented, May 1, 2021

Hearing no follow up, closing.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Indexing Data to a Common Starting Point - Dallasfed.org
Indexing numerical data is useful in a variety of contexts. It shows up all the time in economic, financial and business analysis. Equity...
Read more >
Exercise 3 Index Your Own Data - Apache Solr
For this last exercise, work with a dataset of your choice. This can be files on your local hard drive, a set of...
Read more >
​Custom Index | Refinitiv
Access a vast range of data for custom index creation. Build indices, track your investment strategies, and benefit from 40 years' experience with...
Read more >
Create and use an index to improve performance
Access uses indexes in a table as you use an index in a book: to find data, Access looks up the location of...
Read more >
Preparing your data for indexing - Algolia
An index is a collection of records. When you perform a search, you look into the records of an Algolia index. An important...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found