Different behaviour when extending this project to Bart
See original GitHub issueHello there. This is a really fantastic project. I’m trying to extend your work to Bart but I’ve run into some strange behaviour.
I’ve made a Colab notebook to illustrate the problem. Specifically when converting Bart to ONNX, the encoder_hidden_states
input does not get included in the ONNX model’s graph. As you can see from the notebook though, it works perfectly for T5.
I realise this is out of scope for the fastT5 project but thought someone who comes across this issue might have experienced a similar problem and be able to help. This may also be useful to know in case you have plans to expand this project to include models like Bart in the future.
Issue Analytics
- State:
- Created 2 years ago
- Comments:35 (6 by maintainers)
Top Results From Across the Web
Travel Characteristics of Transit-Oriented Development in ...
TOD is generally considered to be moderate- to high-density mixed-use development located within an easy walk of a major transit stop.
Read more >Projects & Plans
Below are some key projects in the works that will help make BART better, safer, ... Authority (VTA) is managing a project to...
Read more >Transit-Oriented Development Guidelines
The BART TOD Policy favors long-term ground leases of no more than. 66 years - rather than sale of property - as the...
Read more >BART Progressive Policing: Stakeholder Engagement ...
homelessness, behavioral health issues and substance use. ... In February 2020, BART launched a pilot project of 10 unarmed.
Read more >BART ARTS MASTER PLAN
Over time, the Arts Program will also support other ... a framework for infusing the arts into BART projects by ... expanding ridership....
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hey, I also came across this when trying to adapt the fastT5 code for converting (m)BART to onnx and I think it is due to the fact that the
encoder_hidden_states
are passed askey_value_states
into theMBartAttention
down the line and are not used in casepast_key_value
is given.So my guess was, that it is not included in the exported graph for that reason. I’ll try to share my progress in a Colab notebook tomorrow.
Very odd error that I also get sometimes and my only answer at this point is… have you tried running the cell again? That usually sorts it