[NOMERG, WIP, POC] Auto-nested TensorDict #201

tcbegley · 2023-02-06T11:39:05Z

Description

This PR adds support for auto-nesting inside TensorDict. This is a proof-of-concept with missing features. Supporting auto-nested values is challenging because of the large number of methods in the TensorDict class and its children which employ recursion. Checking for cycles during iteration also inevitably introduces some overhead. These trade-offs still need to be varefully benchmarked and evaluated.

Here's a summary of the state of this branch and outstanding issues. We have implemented the following:

New functionality in _TensorDictKeysView that can detect a cycle ad raise an error or continue (internal usage only)
A new function _apply_safe which can safely map any function onto all entries of the TensorDict, preserving auto-nesting if detected.

The updated keys view is useful for iterating over all values in the TensorDict and applying some in-place operation, or aggregating some computed quantities. For example, zeroing all values in the TensorDict

for key in _TensorDictKeysView(
    self, include_nested=True, leaves_only=True, error_on_loop=False
):
    value = self.get(key)
    value.zero_()

or alternatively, in the implementation of any

any(
    self.get(key).any()
    for key in _TensorDictKeysView(
        self, include_nested=True, leaves_only=True, error_on_loop=False
    )
)

On the other hand, _apply_safe can be used to reimplement any function which returns a TensorDict of the same structure as the input. For example, implementing to_tensordict is as simple as

_apply_safe(lambda _, value: value.clone(), self)

Fixed so far

apply_: implemented with _apply_safe
expand: implemented with _apply_safe
__eq__: implemented with _apply_safe
__ne__: implemented with _apply_safe
to_tensordict: implemented with _apply_safe
zero_: implemented with _TensorDictKeysView
clone: implemented with _apply_safe
__repr__: fixed manually, without either paradigm
all: implemented with _TensorDictKeysView when dim is not specified, and _apply_safe when it is
any: implemented with _TensorDictKeysView when dim is not specified, and _apply_safe when it is
lock: implemented with _TensorDictKeysView
unlock: implemented with _TensorDictKeysView
_index_tensordict: implemented with _apply_safe
masked_fill_: implemented with _TensorDictKeysView

Outstanding bugs

Tests to refactor

Open questions

select: what should happen when we select keys from an auto-nested tensordict.
detect_loop: we don't actually use this in any of our implementations, should we have such a public method?

Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

#209) Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

ruleva1983 · 2023-02-10T10:44:38Z

test_chunk: fails for auto-nested due to usage of cat function. Tentatively disable it for that specific case
test_lock_write: corrected as suggested
test_apply: using keys view and also calling the apply_ function instead which in turn calls apply_safe

Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

…ested-tensordicts

…219) Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

vmoens · 2023-02-15T09:58:56Z

We're putting this PR on hold for now.
The changes are extensive and substantially reduce code readability.
The plan is either to finish this PR at some point in the future, or adopt another strategy (e.g. a specialized class for auto-nesting).

tcbegley · 2023-02-15T10:19:24Z

For the benefit of anyone who picks this up in future, copying my comment from #220 about the reasons for some outstanding test failures:

The following tests fail because assert_allclose_td does not support auto-nested values

test_from_empty
test_masking
test_getitem_ellipsis
test_getitem_range

The following tests fail because we can't instantiate a TensorDict from a Python dict with auto-nested values

test_broadcast
test_equal_dict
test_nested_dict_init

Finally test_nestedtensor_stack is failing because LazyStackedTensorDict.contiguous is broken. I think a few other methods for LazyStackedTensorDict could be broken but not caught by the tests. The issues here largely stem from the fact that values are computed lazily, and hence use of id to check for repeated values is brittle.

tcbegley and others added 6 commits February 3, 2023 16:28

PoC

f61f94e

Incorporate _items_safe into _TensorDictKeysView

17062a6

Off-by-one error

ea6b268

Fix __ne__

6d7732b

Loop check and iter check for TensorDictKeysView (#200)

de5cc92

Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

Formatting and linting fixes

f09d292

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 6, 2023

tcbegley and others added 12 commits February 7, 2023 10:29

Fix TensorDict indexing in presence of auto-nesting

1003a06

Test masked fill and test locks adapted to autonesting (#202)

02ba936

Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

Fix lint issue

8d9df79

Merge branch 'main' into auto-nested-tensordicts

52dc3b6

[BugFix][Auto-nested] Fix to_dict method (#207)

af4c9fd

Disabled tests for autonested case for flatten keys, select and memmap (

40da2c0

#209) Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

[Test, Bugfix] skip test_outputsize_vmap if no functorch (#204)

0c1b3f5

[CI] Temporarily disable torchrec tests (#208)

f1c8860

[Test] MemmapTensor should be cast to tensor and viceversa (#206)

3b6b1ff

[BugFix] Fix _getitem_batch_size in various edge cases. (#211)

6fe9382

Fix test_batchsize_reset

7a05d2c

Support instantiation of _TensorDictKeysView from SubTensorDict

34a0275

tcbegley force-pushed the auto-nested-tensordicts branch from 441b521 to 34a0275 Compare February 9, 2023 11:03

Fix recursive setitem with index

10f2a82

ruleva1983 and others added 7 commits February 13, 2023 11:10

Solving tests (#214)

b879907

Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

Format

3d06e42

Make stack and cat robust to auto-nesting (#217)

3503412

Merge branch 'main' of github.com:pytorch-labs/tensordict into auto-n…

018d1c2

…ested-tensordicts

Test suit adapted to changes in code base for autonested Tensordicts (#…

a87bd04

…219) Co-authored-by: Tom Begley <[email protected]> Co-authored-by: Ruggero Vasile <[email protected]>

Test fixes

d3529cf

Fix all / any with dim

1b26892

tcbegley changed the title ~~[WIP] Auto-nested TensorDict~~ [WIP, POC] Auto-nested TensorDict Feb 14, 2023

tcbegley mentioned this pull request Feb 14, 2023

Add recursion guard and tidy tests #220

Merged

vmoens mentioned this pull request Feb 15, 2023

[BugFix] Allowing for auto-nested tensordict #119

Closed

4 tasks

vmoens changed the title ~~[WIP, POC] Auto-nested TensorDict~~ [NOMERG, WIP, POC] Auto-nested TensorDict Feb 15, 2023

Add recursion guard and tidy tests (#220)

f0eede7

apbard force-pushed the main branch from c270a5e to 05ee720 Compare April 7, 2023 15:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NOMERG, WIP, POC] Auto-nested TensorDict #201

[NOMERG, WIP, POC] Auto-nested TensorDict #201

tcbegley commented Feb 6, 2023 •

edited

Loading

ruleva1983 commented Feb 10, 2023 •

edited

Loading

vmoens commented Feb 15, 2023

tcbegley commented Feb 15, 2023

[NOMERG, WIP, POC] Auto-nested TensorDict #201

Are you sure you want to change the base?

[NOMERG, WIP, POC] Auto-nested TensorDict #201

Conversation

tcbegley commented Feb 6, 2023 • edited Loading

Description

Fixed so far

Outstanding bugs

Tests to refactor

Open questions

ruleva1983 commented Feb 10, 2023 • edited Loading

vmoens commented Feb 15, 2023

tcbegley commented Feb 15, 2023

tcbegley commented Feb 6, 2023 •

edited

Loading

ruleva1983 commented Feb 10, 2023 •

edited

Loading