[Feature Request] Saving and loading memmap tensordicts / tensorclasses #176

vmoens · 2023-01-23T11:38:39Z

Motivation

We can easily save tensordicts using torchsnapshot. However, as MemmapTensors store a file on disk, it would be fairly easy to save a tensordict that contains only MemmapTensors (is_memmap() returns True).
Ideally, the saved tensordict structure would follow the one of the original tensordict (+ metadata)

tensordict = TensorDict({"a": torch.randn(3, 4), {"b": torch.randn(3, 4)}}, [3, 4])
tensordict.memmap_()
save_memmap_tensordict(tensordict, "/path/to/save")

would result in

/path/to/save/metadata.pt
/path/to/save/a.memmap
/path/to/save/b/metadata.pt
/path/to/save/b/c.memmap

(we'd need metadata for each subtensordict since they may have a different device / batch_size).

Loading from such file would also be easy (and would not create a copy):

tensordict_loaded = load_from_memmap("/path/to/save/", mode="r+")
assert tensordict_loaded.is_memmap()
assert (tensordict_loaded == tensordict).all()

This should work for tensordict and tensorclass, but a little extra work may be needed for the latter.

TensorDict
tensorclass

@sreevasthav

The text was updated successfully, but these errors were encountered:

vmoens added the enhancement New feature or request label Jan 23, 2023

vmoens self-assigned this Jan 23, 2023

vmoens mentioned this issue Jan 23, 2023

[Feature Request] More arguments for td.memmap_() #178

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Saving and loading memmap tensordicts / tensorclasses #176

[Feature Request] Saving and loading memmap tensordicts / tensorclasses #176

vmoens commented Jan 23, 2023

[Feature Request] Saving and loading memmap tensordicts / tensorclasses #176

[Feature Request] Saving and loading memmap tensordicts / tensorclasses #176

Comments

vmoens commented Jan 23, 2023

Motivation