the input of the hallucinator?
See original GitHub issueDear author
I am a little bit confused on hallucinator.
In paper (Figure 2.), it says “We also train a hallucinator h that takes a single image feature phi_t and learns to hallucinate its temporal representation”. However, in the source code, the input is the whole sequence image features. is it contradict? Thanks in advance.
` def fc2_res(phi, name=‘fc2_res’): “”" Converts pretrained (fixed) resnet features phi into movie strip.
This applies 2 fc then add it to the orig as residuals.
Args:
phi (B x T x 2048): Image feature.
name (str): Scope.
Returns:
Phi (B x T x 2048): Hallucinated movie strip.
"""
`
Issue Analytics
- State:
- Created 4 years ago
- Comments:5
Top Results From Across the Web
The hallucination results. (a) The input 24×32 low-resolution ...
A learning-based face hallucination method is proposed in this paper for the reconstruction of a high-resolution face image from a low-resolution ...
Read more >Jue Wang on Twitter: "Hallucination is powerful but slow ...
Hallucination is powerful but slow, taking 10 min/design. We noticed that RF takes both seq+struc as inputs and outputs, even though we usually...
Read more >Tensor Feature Hallucination for Few-Shot Learning
This dataset is used to learn the pa- rameters θ of a mapping fθ : X → Rd×h×w from an input image space...
Read more >arXiv:2105.01294v1 [cs.CV] 4 May 2021
The hallucinator takes as input available training examples and generates hal- lucinated examples. The set of hallucinated examples Sgen.
Read more >A Simple Recipe towards Reducing Hallucination in ... - Vimeo
... for data refinement with self-training iterations to effectively induce strong equivalence between the input data and the paired text.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
That is correct. My pleasure.
So basically, for a specific frame t, {Phi}_t will be 1x1x2048 as well? I mean, {Phi}_t and \tilde{\Phi} should be in the same dimension in order to calculate the loss function in formula (3).
Thanks for your help!