Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Token classification weak labelling

See original GitHub issue

As in the text classification task, the “Weak labeling” mode in token classification must allow tag entities by defining a query and entity label (the rule).

Given a rule, the Weak labeling mode for token classification will tag entities based on the matched tokens/words in the search results returned by the API.

The way the entity will be tagged from the matched token will be determined by a labeling function provided as an attribute of the rule. For now, just one single labeling function will be supported, the exact_match, where all matched tokens/words will be tagged as the provided rule.

For example, given a labeling rule with the query Par*, the label PLACE, and the matched record Paris is the city of light, the labeling function will tag the token Paris as a PLACE.

An important behavior of this feature is to provide a visualization of the tagged entities in the visible records from the UI.

Issue Analytics

State:
Created 2 years ago
Comments:6 (5 by maintainers)

Top GitHub Comments

1reaction

Amelie-Vcommented, Nov 8, 2022

Decision Notes

Record list

Maintain view
Improve query visibility into records

Module to set rules

Improve information architecture: 1. Start with a query (this first step is too shy for now)
Apply changes on the main view. removing or adapting components inherited from annotation/explore mode. e.g: remove the label legend, add information colors on labels buttons…

0reactions

davidberenstein1957commented, Dec 22, 2022

Also, think about including docs and reference TextClassification usecases too #1986

Top Results From Across the Web

Weak Supervision: Labeling Your Data Without Actually ...

Weak supervision combines the outputs of many labeling functions to programmatically ... We'll use snorkel for a multilabel classification task.

Weakly Supervised Token Labelling for Toxic Spans Detection

In the second approach, we perform weak supervision with soft attention to learn token level labels from sentence labels. Our experiments show improvements ......

Token classification - Hugging Face

Token classification assigns a label to individual tokens in a sentence. One of the most common token classification tasks is Named Entity Recognition...

Weak supervision - Rubrix 0.18.0 documentation

This guide gives you a brief introduction to weak supervision with Rubrix. Rubrix currently supports weak supervision for multi-class text classification use ...

Named Entity Recognition with Small Strongly Labeled and ...

ble with CRF, we use the token-wise classification model for the Stage II training. • Partial WSL: WSL with non-entity weak labels.