question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Token classification weak labelling

See original GitHub issue

As in the text classification task, the “Weak labeling” mode in token classification must allow tag entities by defining a query and entity label (the rule).

Given a rule, the Weak labeling mode for token classification will tag entities based on the matched tokens/words in the search results returned by the API.

The way the entity will be tagged from the matched token will be determined by a labeling function provided as an attribute of the rule. For now, just one single labeling function will be supported, the exact_match, where all matched tokens/words will be tagged as the provided rule.

For example, given a labeling rule with the query Par*, the label PLACE, and the matched record Paris is the city of light, the labeling function will tag the token Paris as a PLACE.

An important behavior of this feature is to provide a visualization of the tagged entities in the visible records from the UI.

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:6 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
Amelie-Vcommented, Nov 8, 2022

Decision Notes

Record list

  • Maintain view
  • Improve query visibility into records

Module to set rules

  • Improve information architecture: 1. Start with a query (this first step is too shy for now)
  • Apply changes on the main view. removing or adapting components inherited from annotation/explore mode. e.g: remove the label legend, add information colors on labels buttons…
0reactions
davidberenstein1957commented, Dec 22, 2022

Also, think about including docs and reference TextClassification usecases too #1986

Read more comments on GitHub >

github_iconTop Results From Across the Web

Weak Supervision: Labeling Your Data Without Actually ...
Weak supervision combines the outputs of many labeling functions to programmatically ... We'll use snorkel for a multilabel classification task.
Read more >
Weakly Supervised Token Labelling for Toxic Spans Detection
In the second approach, we perform weak supervision with soft attention to learn token level labels from sentence labels. Our experiments show improvements ......
Read more >
Token classification - Hugging Face
Token classification assigns a label to individual tokens in a sentence. One of the most common token classification tasks is Named Entity Recognition...
Read more >
Weak supervision - Rubrix 0.18.0 documentation
This guide gives you a brief introduction to weak supervision with Rubrix. Rubrix currently supports weak supervision for multi-class text classification use ...
Read more >
Named Entity Recognition with Small Strongly Labeled and ...
ble with CRF, we use the token-wise classification model for the Stage II training. • Partial WSL: WSL with non-entity weak labels.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found