Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[Search] Improve and normalizes the search data model

See original GitHub issue

Things to keep in mind:

Normalize text inputs fields: text, inputs, words must be normalized and use a common pattern for all tasks
Several es analyzers for text fields: standard and whitespace(?) for fine tuning searches. Default as standard
What about text fields in metadata ? For now, only terms queries are supported. It’s mean that metadata fields with large content are not enabled to be queries as full text search.
Created indices should contain mapping info only for its fields. A text classification index should not include mapping info for tokens or text predicted (text2text).
Review filter fields and align with UI names (if any)
What about nested fields? like token or metrics info for token classification, or label and its score for text classification. As default, query string dsl does not support nested queries, but it could be nice include some minimal support for that kind of queries.

@dvsrepo @dcfidalgo Anything to include here?

Tasks

To achieve to do the work, we need tackle following tasks (that will be created as separated issues and linked here)

[Datasets] Avoid using global template for all indices
[Datasets] Dataset migration mechanisms for each release
[Datasets] New es document model per task with backward compatibility fields
[Datasets] Apply migration to new es doc model
[Datasets] Build searches and aggregations using new doc model

Issue Analytics

State:
Created 2 years ago
Comments:11 (11 by maintainers)

Top GitHub Comments

1reaction

frascuchoncommented, Jan 20, 2022

Not, really. The only “problem” is that you cannot select with predicted sentence you use. It will search in all of them. But i think we can assume that

0reactions

frascuchoncommented, May 26, 2022

Note: PR recognai/rubrix#1018 introduces breaking changes to version <0.9.0. So we cannot include those changes until v0.11.0 in order to keep compatibility at lease 2 version prior to release