[Community Sprint] Improving Code Coverage 🚀 #6528

rusty1s · 2023-01-27T10:08:47Z

🚀 The feature, motivation and pitch

We are kicking off our third community sprint!

This community sprint revolves around improving test coverage across the PyG code base.
Currently, our tests cover 85.68% of all code in PyG. The goal of the community sprint is to bump this number into the high 90s (and to get yourself more familiar with the various parts of the code base). Each individual contribution is designed to only take around 30 minutes to two hours to complete.

The sprint begins Friday, January 27th, and will last 2 weeks. If you are interested in helping out, please also join our PyG Slack channel #code-coverage-sprint for more information.

You can assign yourself to the test you are planning to work on here (code-coverage tab).

🚀 Improving Code Coverage

Example

Take a look at the current code coverage report of PyG. For example, we can see that we never test the copy() function of the InMemoryDataset class, see here:

As such, we create a test_in_memory_dataset_copy() function in test/data/test_dataset.py to add a corresponding test:

import torch
from torch_geometric.data import Data

# `MyTestDataset` is assumed to be the `InMemoryDataset` subclass that is
# already available in test/data/test_dataset.py.


def test_in_memory_dataset_copy():
    data_list = [Data(x=torch.randn(5, 16)) for _ in range(4)]
    dataset = MyTestDataset(data_list)

    copied_dataset = dataset.copy()

    # Test that we actually get a new object back:
    assert copied_dataset is not dataset

    # Test that the copied dataset holds the same number of objects:
    assert len(copied_dataset) == len(dataset) == 4

    # Test that the data is identical:
    for copied_data, data in zip(copied_dataset, dataset):
        assert torch.equal(copied_data.x, data.x)

Furthermore, we see in the code coverage report that copy() utilizes different code paths, depending on whether the dataset should be filtered before copying. As such, we test this functionality as well:

def test_in_memory_dataset_copy():
    ...
    
    copied_dataset = dataset.copy([1, 2])
    assert len(copied_dataset) == 2
    assert torch.equal(copied_dataset[0].x, data_list[1].x)
    assert torch.equal(copied_dataset[1].x, data_list[2].x)

We can check that everything works by running pytest test/data/test_dataset.py -k test_in_memory_dataset_copy:

test/data/test_dataset.py .

========================== 1 passed, 9 deselected in 0.07s =========================
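To illustrate why both calls are needed to cover copy(), here is a minimal, self-contained sketch. TinyDataset is a hypothetical stand-in written for this example only; it is not PyG's InMemoryDataset and only mimics the idea of a copy() method with two code paths (with and without filtering).

```python
from copy import deepcopy
from typing import Optional, Sequence


class TinyDataset:
    """Hypothetical stand-in for a dataset; NOT PyG's InMemoryDataset."""
    def __init__(self, items: Sequence[int]):
        self.items = list(items)

    def __len__(self) -> int:
        return len(self.items)

    def __getitem__(self, idx: int) -> int:
        return self.items[idx]

    def copy(self, idx: Optional[Sequence[int]] = None) -> "TinyDataset":
        if idx is None:
            # Code path 1: copy all items.
            return TinyDataset(deepcopy(self.items))
        # Code path 2: filter by index first, then copy.
        return TinyDataset([deepcopy(self.items[i]) for i in idx])


def test_tiny_dataset_copy():
    dataset = TinyDataset([10, 20, 30, 40])

    # Covers code path 1 (no filtering):
    full = dataset.copy()
    assert full is not dataset
    assert len(full) == len(dataset) == 4

    # Covers code path 2 (filtering before copying):
    subset = dataset.copy([1, 2])
    assert len(subset) == 2
    assert subset[0] == 20
    assert subset[1] == 30
```

A test that only calls copy() without arguments would leave the filtering branch unexecuted, and a coverage report would flag those lines as missed.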

Guide to contributing

See here for a basic example to follow.

Ensure you have read our contributing guidelines.
Claim the test you want to improve here (code coverage tab).
Implement the test changes as in [Code Coverage] InMemoryDataset #6523. To do so, look closely at the parts of the model or function you want to cover, and think about test cases that would increase the coverage. If you stumble upon a bug in an untested code path, try to fix it on your own, create a GitHub issue, or discuss it with us in our PyG Slack channel #code-coverage-sprint.
Open a PR to the PyG repository and name it: "[Code Coverage] {model_name/function_name}". Afterwards, add your PR number to the "Improved code coverage" line in CHANGELOG.md.

Tips for making your PR

If you are unfamiliar with how the current test pipeline works, you can read more about it here. We use pytest to run all tests.
The corresponding tests of PyG models and functions can be found in the test/ directory. For example, tests for torch_geometric/utils/isolated.py can be found in test/utils/test_isolated.py. You can run individual test files via pytest test/utils/test_isolated.py. You can run individual test functions via pytest test/utils/test_isolated.py -k test_contains_isolated_nodes.
You can use @pytest.mark.parametrize('arg_name', [1, 2, 3]) to test different configurations inside your test. See here for an example.
There exist special test decorators in torch_geometric/testing/decorators.py, e.g., the @withPackage('networkx') decorator to only run a test if specific packages are installed.
For code paths that are nearly impossible to test, consider adding a # pragma: no cover comment, e.g., for @overload routines.
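The tips above can be sketched in one small, self-contained example. Note the assumptions: double() and with_package() are hypothetical helpers written for illustration. In particular, with_package() is only a rough approximation of what a package-guard decorator could look like when built on pytest.mark.skipif; PyG's actual @withPackage implementation may differ.

```python
import importlib.util
from typing import Union, overload

import pytest


def with_package(name: str):
    """Hypothetical package guard: skip the test if `name` is not installed."""
    missing = importlib.util.find_spec(name) is None
    return pytest.mark.skipif(missing, reason=f"Package '{name}' not installed")


# The bodies of @overload stubs are never executed, so they are excluded
# from the coverage report via `# pragma: no cover`:
@overload
def double(value: int) -> int:  # pragma: no cover
    ...


@overload
def double(value: str) -> str:  # pragma: no cover
    ...


def double(value: Union[int, str]) -> Union[int, str]:
    return value * 2


# Runs the same test body once per listed value:
@pytest.mark.parametrize('value', [1, 2, 3])
def test_double(value):
    assert double(value) == 2 * value


# Only runs if the (standard library) `json` package can be found:
@with_package('json')
def test_needs_json():
    import json
    assert json.loads('[1, 2]') == [1, 2]
```

Placing the pragma on the def line excludes the whole stub from coverage measurement, which keeps the report focused on code that can actually run.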

Tests to update

This list may be incomplete. If you find a function with missing code coverage that is not listed, please let us know or add it on your own.

Improve coverage for `tests_inits.py` (#6528) --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Matthias Fey <matthias.fey@tu-dortmund.de>

Part of #6528, improves typing and code coverage for "[SchNet: A Continuous-filter Convolutional Neural Network for Modeling Quantum Interactions](https://arxiv.org/abs/1706.08566)" --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

Part of #6528, improves typing and code coverage for DimeNet. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

Part of #6528. Completes #6799.

… homogeneous graphs (#7807) Fixes `edge_label_time.size() == (2*batch_size,)` to have `(batch_size,)`. Adds a test case for #7791. Part of #7796 and #6528. --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

Part of #6528. IMO, exceptions are also part of the public API so we should measure the test coverage over them, but feel free to close this PR if you think otherwise ;) --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

Part of #6528. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

…mer_conv.py` (#7968) Part of #6528. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

…conv.py` (#8047) - Part of #6528 - Fix `AttributeError: 'function' object has no attribute 'pop'` when calling `remove_edge_index` Not sure if I am misunderstanding, please take a look:) --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

rusty1s added feature 0 - Priority P0 test roadmap labels Jan 27, 2023

wsad1 pinned this issue Jan 27, 2023

zechengz mentioned this issue Feb 2, 2023

[Code Coverage] HeteroConv #6568

Merged

SauravMaheshkar mentioned this issue Feb 8, 2023

[Code Coverage] test_inits.py #6645

Merged

SauravMaheshkar mentioned this issue Feb 21, 2023

[Code Coverage] models/schnet.py #6763

Merged

SauravMaheshkar mentioned this issue Feb 23, 2023

[Code Coverage] models/dimenet.py #6781

Merged

rusty1s added a commit that referenced this issue Feb 24, 2023

[Code Coverage] models/dimenet.py (#6781)

d336d13

Part of #6528, improves typing and code coverage for DimeNet. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

akihironitta mentioned this issue Apr 17, 2023

[Code Coverage] data/datapipes.py #7195

Merged

rusty1s pushed a commit that referenced this issue Apr 18, 2023

[Code Coverage] data/datapipes.py (#7195)

3e6fafa

Part of #6528. Completes #6799.

akihironitta self-assigned this Jul 24, 2023

This was referenced Jul 24, 2023

Verify LinkNeighborLoader supports temporal homogeneous graph with an example #7796

Open

Fix LinkNeighborLoader producing double-sized edge_label_time for homogeneous graphs #7807

Merged

akihironitta mentioned this issue Jul 31, 2023

Measure test coverage of exceptions #7823

Merged

akihironitta mentioned this issue Aug 7, 2023

[Code Coverage] loader/utils.py #7857

Merged

rusty1s added a commit that referenced this issue Aug 10, 2023

[Code Coverage] loader/utils.py (#7857)

e95cdaf

Part of #6528. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

akihironitta mentioned this issue Sep 3, 2023

[Code Coverage] loader/temporal_dataloader.py and nn/conv/transformer_conv.py #7968

Merged

rusty1s added a commit that referenced this issue Sep 4, 2023

[Code Coverage] loader/temporal_dataloader.py and `nn/conv/transfor…

6db1453

…mer_conv.py` (#7968) Part of #6528. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>

xnuohz mentioned this issue Sep 17, 2023

[Code Coverage] data/data.py & data/hetero_data.py & nn/conv/eg_conv.py #8047

Merged

JakubPietrakIntel pushed a commit that referenced this issue Sep 27, 2023

[Code Coverage] loader/temporal_dataloader.py and `nn/conv/transfor…

ec82d0c

…mer_conv.py` (#7968) Part of #6528. --------- Co-authored-by: rusty1s <matthias.fey@tu-dortmund.de>
