question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

More flexibility in the processing pipeline

See original GitHub issue

Hi! I like a lot the simplicity of this library. I see that v2 is planned to have a caseSensitive option. In my opinion, the preprocessing of the tokens (and the query) could be more general. For example, lunr.js offers the possibility to define an arbitrary chain of functions for this.

For this library, the following two lines could be replaced by a function passed by the user. https://github.com/fergiemcdowall/search-index/blob/421c8213958ef76bd139f0900c85c88968848be8/src/write.js#L44-L45 The same function should be also applied to the query tokens.

My use-case is the need for phonetic encoders (e.g. https://github.com/nextapps-de/flexsearch#encoders) in languages other than English. For example, a token like Björn could be matched by the user query bjorn (notice that both case-insensitiveness and phonetic normalization are needed).

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Reactions:3
  • Comments:5 (2 by maintainers)

github_iconTop GitHub Comments

1reaction
eklemcommented, Oct 23, 2020

Create two building blocks that people can choose to use?

  • Empty function/step for indexing side
  • Empty function/step for query side

And an easy way to communicate and use these two with existing functions / building blocks? Could be external as long as it’s easy to plug in.

0reactions
eklemcommented, Jan 23, 2021

So this one should maybe be an example for search-index and for libraries that builds on search-index, it can be actual (empty) functions.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Improving stability and flexibility of ML pipelines at Amazon ...
Pipelines allows you to reduce the number of steps across the entire ML workflow and is flexible enough to allow data scientists to...
Read more >
Building Data Pipelines for Greater Control and Flexibility
Building your own data pipeline and centralizing the collection, processing, and storage of your data with an in-house team ensures that you ...
Read more >
ArrayPipe: a flexible processing pipeline for microarray data
Flexibility in the selection and arrangement of analysis modules allows tailoring of the process to many scenarios that differ in experimental set-ups and ......
Read more >
A flexible cross-platform single-cell data processing pipeline
Single-cell RNA-sequencing analysis to quantify the RNA molecules in individual cells has become popular, as it can obtain a large amount of ...
Read more >
Adding Workflow Management Flexibility to LSST Pipelines ...
Abstract: Data processing pipelines need to be executed at scales ranging from small runs up through large production data release runs ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found