MSA tensor format #56

panganqi · 2021-03-31T02:14:38Z

In the Usage, there is code like
seq = torch.randint(0, 21, (1, 128)).cuda()
msa = torch.randint(0, 21, (1, 5, 64)).cuda().
If I have a a3m msa file, how to encode the file to this tensor? And why the seq length is 128 but the msa is 5 times 64 (5 timeshalf the length of seq?).
Could you give an example of how to use that or how to generate that msa tensor?

The text was updated successfully, but these errors were encountered:

lucidrains · 2021-03-31T17:23:32Z

@panganqi Hi! I just wanted to demonstrate that the MSA and the primary sequence does not have to be the same length (although they would probably be aligned in practice)

The framework is in a good enough place that I'll start thinking about how to tackle data preprocessing! (I'd like to make it as seamless and easy as possible) How is the data laid out in your directory at the moment?

lucidrains · 2021-03-31T17:23:49Z

@panganqi are you working with templates by any chance?

panganqi · 2021-04-01T09:44:58Z

@panganqi Hi! I just wanted to demonstrate that the MSA and the primary sequence does not have to be the same length (although they would probably be aligned in practice)

The framework is in a good enough place that I'll start thinking about how to tackle data preprocessing! (I'd like to make it as seamless and easy as possible) How is the data laid out in your directory at the moment?

I use the combined sidechainnet data which does not contain the MSA and we run hhblits on CASP data to get the MSA files. I want to combine those two to be a new dataset. And the MSA and the primary sequence are of the same length

panganqi · 2021-04-01T09:46:55Z

@panganqi are you working with templates by any chance?

No, I'm working with Free Modelling mode

lucidrains · 2021-04-01T16:56:34Z

@panganqi do you want to chat about this in Discord? we have an alphafold2 channel

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MSA tensor format #56

MSA tensor format #56

panganqi commented Mar 31, 2021 •

edited

Loading

lucidrains commented Mar 31, 2021

lucidrains commented Mar 31, 2021

panganqi commented Apr 1, 2021

panganqi commented Apr 1, 2021

lucidrains commented Apr 1, 2021

MSA tensor format #56

MSA tensor format #56

Comments

panganqi commented Mar 31, 2021 • edited Loading

lucidrains commented Mar 31, 2021

lucidrains commented Mar 31, 2021

panganqi commented Apr 1, 2021

panganqi commented Apr 1, 2021

lucidrains commented Apr 1, 2021

panganqi commented Mar 31, 2021 •

edited

Loading