question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

SetFit for a large number of classes

See original GitHub issue

Hi there, thanks for releasing such an interesting library.

I am curious if any experiments have been run using SetFit in the extreme multiclass setting, say as n_classes>=100?

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (2 by maintainers)

github_iconTop GitHub Comments

5reactions
PhilipMaycommented, Nov 4, 2022

At the moment I am training on a German dataset with ~90 very unbalanced classes. The minority class has 20 samples. The majority class has 183 samples.

It works very good.

1reaction
PhilipMaycommented, Dec 1, 2022

Do you have a multiclass or multilabel task?

I have a normal multi class task.

Do you use a sklearn head?

yes

Are there any minority classes that are fully “ignored” (show 0 recall)?

no

Read more comments on GitHub >

github_iconTop Results From Across the Web

SetFit: Efficient Few-Shot Learning Without Prompts
SetFit dispenses with prompts altogether by generating rich embeddings directly from a small number of labeled text examples. Fast to train: ...
Read more >
Sentence Transformer Fine-Tuning (SetFit): Outperforming ...
We validated SetFit by applying it to the RAFT (Real-World Few-Shot Text ... but they require a large amount of labeled training data....
Read more >
Outperform OpenAI GPT-3 with SetFit for text-classification
In this blog, you will learn how to use SetFit to create a text-classification model with only a 8 labeled samples per class,...
Read more >
SetFit – A New Text-Classification Model That Outperforms ...
The team has generated a high-performing text-classification model with 8 samples per class or only 32 labeled samples using the new approach. “ ......
Read more >
Prompt-free Efficient Few Shot Learning | What is SetFit?
This requires a very large amount of computational resources (GPUs) for the training process, and for inference time processes.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found