`T5ForSequenceClassification`
🚀 Feature request
Add the ability to classify sequences with T5 by using only the T5 encoder plus a classification head.
Motivation
This gives the benefits of fine-tuning a model with no maximum sequence length (useful for long-sequence tasks) without having to load the decoder weights into memory or treat classification as a generative task.
Your contribution
I already have working code for this, and I have seen requests for it in other forums (Slack, the PyTorch forums, the Hugging Face forums), so if it's a welcome addition I'd be happy to add it to the library.
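For context, a minimal sketch of what such a model could look like, built on the existing `T5EncoderModel` class in transformers. The class name, mean-pooling strategy, and linear head below are my own illustrative assumptions, not the proposed implementation:

```python
# Hedged sketch: classify sequences with only the T5 encoder plus a head.
# The class name and pooling choice are illustrative, not a final API.
import torch
import torch.nn as nn
from transformers import T5EncoderModel, T5Tokenizer

class T5EncoderForSequenceClassification(nn.Module):
    def __init__(self, model_name: str, num_labels: int):
        super().__init__()
        # T5EncoderModel loads only the encoder stack, so the decoder
        # weights never enter memory.
        self.encoder = T5EncoderModel.from_pretrained(model_name)
        self.classifier = nn.Linear(self.encoder.config.d_model, num_labels)

    def forward(self, input_ids, attention_mask):
        hidden = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state                      # (batch, seq_len, d_model)
        # T5 has no [CLS] token, so mean-pool over non-padding positions.
        mask = attention_mask.unsqueeze(-1).to(hidden.dtype)
        pooled = (hidden * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1e-9)
        return self.classifier(pooled)           # (batch, num_labels)

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5EncoderForSequenceClassification("t5-small", num_labels=2)
batch = tokenizer(["an example document"], return_tensors="pt", padding=True)
logits = model(batch["input_ids"], batch["attention_mask"])
```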
Issue Analytics
- Created: 2 years ago
- Reactions: 5
- Comments: 17 (2 by maintainers)
This seems like a useful addition, especially considering the EncT5 paper.
Hi @osainz59, I think one really interesting dataset would be CoNLL-2003 (see https://huggingface.co/datasets/conll2003).
When testing the mT5 model series, WikiANN (Rahimi splits, from https://huggingface.co/datasets/wikiann) is also very interesting: train on the English split only and test on the other languages, for comparison with the mT5 paper 😃
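For reference, both datasets are on the Hugging Face Hub and can be loaded with the `datasets` library; a minimal sketch, assuming the dataset names from the links above:

```python
from datasets import load_dataset

# English NER benchmark suggested above.
conll = load_dataset("conll2003")

# English split of WikiANN (Rahimi splits); swap "en" for other
# language codes when evaluating cross-lingual transfer.
wikiann_en = load_dataset("wikiann", "en")

example = conll["train"][0]
print(example["tokens"][:5], example["ner_tags"][:5])
```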