MSA tensor format
See original GitHub issueIn the Usage, there is code like
seq = torch.randint(0, 21, (1, 128)).cuda()
msa = torch.randint(0, 21, (1, 5, 64)).cuda()
.
If I have a a3m msa file, how to encode the file to this tensor? And why the seq length is 128 but the msa is 5 times 64 (5 timeshalf the length of seq?).
Could you give an example of how to use that or how to generate that msa tensor?
Issue Analytics
- State:
- Created 2 years ago
- Reactions:1
- Comments:5 (3 by maintainers)
Top Results From Across the Web
A-Prot: protein structure modeling using MSA transformer
An MSA feature tensor and row attention maps are extracted and converted into 2D residue-residue distance and dihedral angle predictions for ...
Read more >TensorFlow Basics: Tensor, Shape, Type, Sessions & Operators
A tensor is a vector or matrix of n-dimensions that represents all types ... In TensorFlow, all the operations are conducted inside a...
Read more >Diffusion tensor imaging in the characterization of multiple ...
Two forms of MSA are currently recognized: a parkinsonian form (MSA-P, formerly striatonigral degeneration) characterized by parkinsonian ...
Read more >A-Prot: Protein structure modeling using MSA transformer
In this study, we propose a new protein 3D structure modeling method, A-Prot, using MSA Transformer, one of the state-of-the-art protein ...
Read more >protein structural property prediction using MSA transformer
Through the token and position embedding of the MSA Transformer, the matrix is embedded into an \mathrm{r}\times \mathrm{c}\times 768 tensor, ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
No, I’m working with Free Modelling mode
I use the combined sidechainnet data which does not contain the MSA and we run hhblits on CASP data to get the MSA files. I want to combine those two to be a new dataset. And the MSA and the primary sequence are of the same length