Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

More flexibility in the processing pipeline

See original GitHub issue

Hi! I like a lot the simplicity of this library. I see that v2 is planned to have a caseSensitive option. In my opinion, the preprocessing of the tokens (and the query) could be more general. For example, lunr.js offers the possibility to define an arbitrary chain of functions for this.

For this library, the following two lines could be replaced by a function passed by the user. https://github.com/fergiemcdowall/search-index/blob/421c8213958ef76bd139f0900c85c88968848be8/src/write.js#L44-L45 The same function should be also applied to the query tokens.

My use-case is the need for phonetic encoders (e.g. https://github.com/nextapps-de/flexsearch#encoders) in languages other than English. For example, a token like Björn could be matched by the user query bjorn (notice that both case-insensitiveness and phonetic normalization are needed).

Issue Analytics

State:
Created 3 years ago
Reactions:3
Comments:5 (2 by maintainers)

Top GitHub Comments

1reaction

eklemcommented, Oct 23, 2020

Create two building blocks that people can choose to use?

Empty function/step for indexing side
Empty function/step for query side

And an easy way to communicate and use these two with existing functions / building blocks? Could be external as long as it’s easy to plug in.

0reactions

eklemcommented, Jan 23, 2021

So this one should maybe be an example for search-index and for libraries that builds on search-index, it can be actual (empty) functions.

Top Results From Across the Web

Improving stability and flexibility of ML pipelines at Amazon ...

Pipelines allows you to reduce the number of steps across the entire ML workflow and is flexible enough to allow data scientists to...

Building Data Pipelines for Greater Control and Flexibility

Building your own data pipeline and centralizing the collection, processing, and storage of your data with an in-house team ensures that you ...

ArrayPipe: a flexible processing pipeline for microarray data

Flexibility in the selection and arrangement of analysis modules allows tailoring of the process to many scenarios that differ in experimental set-ups and ......

A flexible cross-platform single-cell data processing pipeline

Single-cell RNA-sequencing analysis to quantify the RNA molecules in individual cells has become popular, as it can obtain a large amount of ...

Adding Workflow Management Flexibility to LSST Pipelines ...

Abstract: Data processing pipelines need to be executed at scales ranging from small runs up through large production data release runs ...