New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Update multi-chain permutation and permutation unittest #406

Merged

jnwei merged 12 commits into aqlaboratory:main from dingquanyu:update-permutation-unittest

May 11, 2024

Contributor

dingquanyu commented Feb 15, 2024 •

edited

Hi @christinaflo and @jnwei

Sorry I forgot to update the unittest for multi-chain permutations after the major updates on the functions. Here I have added necessary steps to prepare fake test data for testing these functions. Now all the 3 tests can run successfully. Hope it helps.

BTW I'm now adding typing to the functions in multi_chain_permutation.py and fixing some comments in that file as well. These can however go to another PR if you prefer?

Dingquan

dingquanyu added 6 commits

February 15, 2024 18:04


          make sure padded asym_id won't affect permutation steps

df96b58


          fixed bugs in unittests for multi-chain permutation. now working on e…

d74b09c

…xtra subtests


          remove unnecessary lines

aa18a56


          restore to the verison on main

2c56566


          added typing hints and fixed some comments

7df201e


          make sure no padded features are going to be selected as anchors

170d9c5

jnwei requested changes

View reviewed changes

Collaborator

jnwei left a comment

Thank you so much for expanding the docstrings in multi_chain_permutation.py and for fixing the tests.

There is a lot of non-trivial code associated with determining the best permutations, and the docstrings will go a long way towards making the code more approachable.

I have some suggestions to help further improve the clarity of the code, but overall, really nice work!

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated

    
            @@ -88,7 +103,7 @@ def get_optimal_transform(
          
                  return r, x

              def get_least_asym_entity_or_longest_length(batch, input_asym_id):

              def get_least_asym_entity_or_longest_length(batch:dict, input_asym_id:list)->Tuple[torch.Tensor, List[torch.Tensor]]:

Collaborator

jnwei Feb 23, 2024

nit: please add 1 space between the argument and the type.

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py

+                      pred_ca_pos: predicted positions of c-alpha atoms from the results of model.forward()
+                      pred_ca_mask: a boolean tensor that masks pred_ca_pos
+                      true_ca_poses: a list of tensors, corresponding to the c-alpha positions of the ground truth structure. e.g. If there are 5 chains, this list will have a length of 5
+                      true_ca_masks: a list of tensors, corresponding to the masks of c-alpha positions of the ground truth structure. If there are 5 chains, this list will have a length of 5

Collaborator

jnwei Feb 23, 2024

Could you explain what relationship (if any) there is between true_ca_masks and pred_ca_mask? Is this an indication of which residues between chains are expected to align.

If you think this is sufficiently defined elsewhere in the multimer codebase, then maybe a simple addition will suffice here.

openfold/utils/multi_chain_permutation.py Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved


          fixed typing errors; added more comments

8dfe77e

Contributor Author

dingquanyu commented Mar 21, 2024

@jnwei Many thanks for your suggestions and reviews :D I've updated the PR in the new commit


          added comments

2dbc8c0

jnwei requested changes

View reviewed changes

Collaborator

jnwei left a comment

Thanks for updating the docstrings Dingquan! Just a few more minor comments

openfold/utils/multi_chain_permutation.py Outdated Show resolved Hide resolved

openfold/utils/multi_chain_permutation.py Outdated

-              def calculate_input_mask(true_ca_masks, anchor_gt_idx, anchor_gt_residue,
-                                       asym_mask, pred_ca_mask):
+              def calculate_input_mask(true_ca_masks: List[torch.Tensor], anchor_gt_idx: torch.Tensor,
+                                       anchor_gt_residue: list,

Collaborator

jnwei Apr 19, 2024

nit: The docstring the type is a Tensor, is this a list or a Tensor?

openfold/utils/multi_chain_permutation.py Outdated

-                                              asym_mask,
-                                              pred_ca_pos):
+              def calculate_optimal_transform(true_ca_poses: List[torch.Tensor],
+                                              anchor_gt_idx: int, anchor_gt_residue: list,

Collaborator

jnwei Apr 19, 2024

Same thing here, is anchor_gt_residue a list or a tensor?

tests/test_permutation.py Outdated

+                      fake_input_features['all_atom_mask'] = pad_features(true_atom_mask, nres_pad=nres_pad, pad_dim=1)
+                      # NOTE
+                      # batch: simulates ground_truth features

Collaborator

jnwei Apr 19, 2024

nit: replace gonna with going to

tests/test_permutation.py Outdated

                                                                 batch)
-                      print(f"##### aligns is {aligns}")
                       possible_outcome = [[(0, 1), (1, 0), (2, 3), (3, 4), (4, 2)], [(0, 0), (1, 1), (2, 3), (3, 4), (4, 2)]]

Collaborator

jnwei Apr 19, 2024

Just a reminder here, a comment explaining why you expect the given possible outcome, and why the wrong_outcome is bad would be very helpful.

To help explain the examples, you could even break up the examples into different variables with acceptable / not acceptable cases. For example:

chain_a_permuted = [(0, 1), (1, 0), (2, 2), (3, 3), (4, 4)]
chain_b_permuted = [(0, 0), (1, 1), (2, 3), (3, 4), (4, 2)]
chains_a_and_b_permuted = [(0, 1), (1, 0), (2, 3), (3, 4), (4, 2)]
no_permutation = [(0, 0), (1, 1), (2, 2), (3, 3), (4, 4)]

possible_outcome = [chain_a_permuted, chain_b_permuted]
wrong_outcome = [chain_a_and_b_permuted, no_permutation]

Although in this example, I still don't understand why chain_a_and_b_permuted would be under wrong_outcome

dingquanyu added 3 commits

May 10, 2024 16:00


          update comments;fixed typos

61191bf


          Update tests and comments

5f78237


          fixed typing error of anchor_gt_residue

15113dc

dingquanyu requested a review from jnwei

May 10, 2024 15:29


          Update test_permutation.py

55c293c

Fixed a small typo in permutation unit test docstring

jnwei approved these changes

View reviewed changes

Collaborator

jnwei commented May 11, 2024

Thanks for the additions to the docstring Dingquan! The explanation for the tests are much clearer now.

jnwei merged commit 9d88b8e into aqlaboratory:main

2 checks passed

dingquanyu deleted the update-permutation-unittest branch

May 11, 2024 13:17

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment