Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

How to generate data using beam search from a custom gpt2 model?

See original GitHub issue

❓ Questions & Help

Details

I have a custom model with classification and an LM head. `

    self.config = AutoConfig.from_pretrained("gpt2", num_labels=3)
    self.base_model = AutoModel.from_pretrained("gpt2", config=self.config)
    self.classifier = nn.Sequential(
        nn.Linear(self.config.hidden_size, self.config.num_labels),
    )

    self.lm_head = nn.Linear(self.base_model.config.n_embd, self.base_model.config.vocab_size, bias=False)`

I want to generate the sentences using this model (given the initial prefix) via beam search. How can I achieve that?

I know that LM with double head exists but it’s not fit for my usecase

Issue Analytics

State:
Created 3 years ago
Comments:7 (3 by maintainers)

Top GitHub Comments

3reactions

LysandreJikcommented, Oct 5, 2020

I would recommend you check the source code for the generate method and see how the beam search is implemented. It is not trivial, however.

Maybe @sshleifer and @patrickvonplaten have better tips on how to best do this.

0reactions

talha1503commented, Jun 14, 2022

@nrjvarshney Did you find any suitable way to use generate function for a custom model? I am facing a similar issue with a model of mine, and would be really grateful if you could let me know how to solve the issue.