[Feature] Add RecorderHook #1300

Xinyu302 · 2023-08-10T15:17:52Z

A GLCC summer camp project.

Motivation

Users often want to visualize the output of an arbitrary layer of the network in a visualization hook: not only the return value of nn.Module.forward, but also intermediate variables. Today this means invasively modifying the model code to write those values into the message hub, then reading them back from the message hub in the visualization hook. Instead of modifying the code by hand, we want a scheme that obtains the output of any layer non-intrusively. It should also be able to read attributes of a specified instance.

Modification

In short, RecorderHook uses the ast module to rewrite the forward function of runner.model.
Here is a simple example model.

class ToyModel(BaseModel):

    def __init__(self, data_preprocessor=None):
        super().__init__(data_preprocessor=data_preprocessor)
        self.linear1 = nn.Linear(2, 2)
        self.linear2 = nn.Linear(2, 1)

    def forward(self, inputs, data_samples, mode='tensor'):
        if isinstance(inputs, list):
            inputs = torch.stack(inputs)
        if isinstance(data_samples, list):
            data_sample = torch.stack(data_samples)
        outputs = self.linear1(inputs)
        outputs = self.linear2(outputs)

        if mode == 'tensor':
            return outputs
        elif mode == 'loss':
            loss = (data_sample - outputs).sum()
            outputs = dict(loss=loss)
            return outputs
        elif mode == 'predict':
            return outputs

The API of RecorderHook is like this:

cfg.custom_hooks = [
    dict(
        type='RecorderHook',
        recorders=[
            dict(type='FunctionRecorder', ...),  # Function recorder1
            dict(type='FunctionRecorder', ...),  # Function recorder2
            dict(type='AttributeRecorder', ...),  # AttributeRecorder1
            ...
        ],
    )
]

RecorderHook uses FunctionRecorder and AttributeRecorder instances to record different kinds of values inside the forward method.

FunctionRecorder

function

Records the return value and intermediate variables of the specified function or method. If the function assigns to the same variable name several times, index selects which of those assignments to record, counting from 0.

case

    custom_hooks=[
        dict(
            type='RecorderHook',
            recorders=[
                dict(type='FunctionRecorder', target='outputs', index=[1])
            ],
            save_dir='./work_dir',
            print_modification=True)
    ]

Forward method after modification

def forward(self, inputs, data_samples, mode='tensor'):
    from mmengine.logging import MessageHub
    import copy
    message_hub = MessageHub.get_current_instance()
    if isinstance(inputs, list):
        inputs = torch.stack(inputs)
    if isinstance(data_samples, list):
        data_sample = torch.stack(data_samples)
    outputs = self.linear1(inputs)
    outputs = self.linear2(outputs)
    message_hub.update_info('runner_model:forward:outputs@1', outputs)
    if mode == 'tensor':
        return outputs
    elif mode == 'loss':
        loss = (data_sample - outputs).sum()
        outputs = dict(loss=loss)
        return outputs
    elif mode == 'predict':
        return outputs
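The rewrite shown above can be approximated with a small ast.NodeTransformer. This is a minimal sketch rather than the PR's implementation: AssignRecorder, instrument, and the plain dict _records (standing in for the message hub) are hypothetical names, and the toy forward function is not the model above.

```python
import ast
import textwrap

SOURCE = textwrap.dedent('''
    def forward(x):
        out = x + 1
        out = out * 2
        return out
''')


class AssignRecorder(ast.NodeTransformer):
    """Hypothetical transformer: record selected assignments to `target`."""

    def __init__(self, target, indices, store_name='_records'):
        self.target = target
        self.indices = set(indices)
        self.store_name = store_name
        self.count = 0  # assignments to `target` seen so far (zero-based)

    def visit_Assign(self, node):
        names = [t.id for t in node.targets if isinstance(t, ast.Name)]
        if self.target not in names:
            return node
        index = self.count
        self.count += 1
        if index not in self.indices:
            return node
        key = f'{self.target}@{index}'
        # Build `_records['out@1'] = out` and place it right after the assignment.
        record = ast.parse(
            f'{self.store_name}[{key!r}] = {self.target}').body[0]
        return [node, record]


def instrument(source, target, indices):
    """Parse `source`, insert recording statements, and return the new tree."""
    tree = AssignRecorder(target, indices).visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return tree


records = {}
namespace = {'_records': records}
exec(compile(instrument(SOURCE, 'out', [1]), '<instrumented>', 'exec'),
     namespace)
result = namespace['forward'](3)
```

With index [1], only the second assignment to out is recorded, which mirrors the outputs@1 key in the modified forward method above.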

AttributeRecorder

function

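Records the value of the specified attribute. The recording code is inserted at the very beginning of the function, and the value is copied before it is stored (Tensor.detach().clone() for tensors, copy.deepcopy otherwise).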

case

    custom_hooks=[
        dict(
            type='RecorderHook',
            recorders=[
                dict(type='AttributeRecorder', target='self.linear1.weight')
            ],
            save_dir='./work_dir',
            print_modification=True)
    ]

Forward method after modification

def forward(self, inputs, data_samples, mode='tensor'):
    from mmengine.logging import MessageHub
    import copy
    message_hub = MessageHub.get_current_instance()
    if isinstance(self.linear1.weight, torch.Tensor):
        _deep_copy_self_linear1_weight = self.linear1.weight.detach().clone()
    else:
        _deep_copy_self_linear1_weight = copy.deepcopy(self.linear1.weight)
    message_hub.update_info('runner_model:forward:self.linear1.weight', _deep_copy_self_linear1_weight)
    if isinstance(inputs, list):
        inputs = torch.stack(inputs)
    if isinstance(data_samples, list):
        data_sample = torch.stack(data_samples)
    outputs = self.linear1(inputs)
    outputs = self.linear2(outputs)
    if mode == 'tensor':
        return outputs
    elif mode == 'loss':
        loss = (data_sample - outputs).sum()
        outputs = dict(loss=loss)
        return outputs
    elif mode == 'predict':
        return outputs
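The same idea for attributes can be sketched by prepending a snapshot statement to the function body. This is an illustrative sketch under assumptions, not the PR's code: prepend_attribute_snapshot, the Toy class, and the dict _records (standing in for the message hub) are hypothetical, and only copy.deepcopy is used because the example has no tensors.

```python
import ast
import copy
import textwrap

SOURCE = textwrap.dedent('''
    class Toy:
        def __init__(self):
            self.weight = [1.0, 2.0]

        def forward(self, x):
            self.weight[0] = x  # mutation happens after the snapshot
            return sum(self.weight)
''')


def prepend_attribute_snapshot(source, method, attribute,
                               store_name='_records'):
    """Insert a deep-copy snapshot of `attribute` at the top of `method`."""
    tree = ast.parse(source)
    key = f'{method}:{attribute}'
    snapshot = ast.parse(
        f'{store_name}[{key!r}] = copy.deepcopy({attribute})').body
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name == method:
            # Assumes the method has no docstring; a docstring would need
            # to stay as the first statement.
            node.body[0:0] = snapshot
    ast.fix_missing_locations(tree)
    return tree


records = {}
namespace = {'_records': records, 'copy': copy}
tree = prepend_attribute_snapshot(SOURCE, 'forward', 'self.weight')
exec(compile(tree, '<instrumented>', 'exec'), namespace)
toy = namespace['Toy']()
total = toy.forward(5.0)
```

Because the snapshot is a copy taken before the body runs, the recorded value is unaffected by the later in-place update of the attribute.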

A more complicated case

Users can also specify which sub-model and which method to record, through the model and method fields of a recorder.

class MMResNet50(BaseModel):

    def __init__(self):
        super().__init__()
        self.resnet = torchvision.models.resnet50()

    def forward(self, imgs, labels, mode):
        x = self.resnet(imgs)
        if mode == 'loss':
            return {'loss': F.cross_entropy(x, labels)}
        elif mode == 'predict':
            return x, labels

    custom_hooks=[
        dict(
            type='RecorderHook',
            recorders=[
                dict(
                    model='resnet',
                    method='_forward_impl',
                    type='FunctionRecorder',
                    target='x', index=[0,1,2])
            ],
            save_dir='./work_dir',
            print_modification=True)
    ]

The _forward_impl method of resnet after modification

def _forward_impl(self, x: Tensor) -> Tensor:
    from mmengine.logging import MessageHub
    import copy
    message_hub = MessageHub.get_current_instance()
    x = self.conv1(x)
    message_hub.update_info('resnet:_forward_impl:x@0', x)
    x = self.bn1(x)
    message_hub.update_info('resnet:_forward_impl:x@1', x)
    x = self.relu(x)
    message_hub.update_info('resnet:_forward_impl:x@2', x)
    x = self.maxpool(x)
    x = self.layer1(x)
    x = self.layer2(x)
    x = self.layer3(x)
    x = self.layer4(x)
    x = self.avgpool(x)
    x = torch.flatten(x, 1)
    x = self.fc(x)
    return x
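One way to picture how a model name and a method name could select the function to replace, and how the original could be put back afterwards, is sketched below. Everything here is an assumption for illustration: resolve, patched_method, the Backbone and Wrapper classes, and the calls list (standing in for the message hub) are hypothetical and are not taken from the PR.

```python
import types
from contextlib import contextmanager
from functools import reduce


def resolve(root, dotted_name):
    """Follow a dotted attribute path such as 'backbone.stem' from `root`."""
    return reduce(getattr, dotted_name.split('.'), root)


@contextmanager
def patched_method(root, model, method, replacement):
    """Temporarily bind `replacement` as `method` on the selected sub-model."""
    owner = resolve(root, model) if model else root
    had_instance_attr = method in vars(owner)
    original = vars(owner).get(method)
    setattr(owner, method, types.MethodType(replacement, owner))
    try:
        yield owner
    finally:
        # Restore the previous state so later calls use the original method.
        if had_instance_attr:
            setattr(owner, method, original)
        else:
            delattr(owner, method)


class Backbone:
    def _forward_impl(self, x):
        return x + 1


class Wrapper:
    def __init__(self):
        self.resnet = Backbone()


calls = []


def recording_forward(self, x):
    y = x + 1
    calls.append(('resnet:_forward_impl:x@0', y))
    return y


wrapper = Wrapper()
with patched_method(wrapper, 'resnet', '_forward_impl', recording_forward):
    inside = wrapper.resnet._forward_impl(1)
outside = wrapper.resnet._forward_impl(1)
```

Patching the instance rather than the class keeps other instances of the same class untouched, and the finally block makes sure the original method is back even if the wrapped call raises.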

TODO

Add unit test
Store the recorded data in the message hub and provide visualizer support
Add docstring and type hint.

zhouzaida · 2023-09-19T15:59:34Z

mmengine/hooks/recorder_hook.py

+class FunctionRecorderTransformer(ast.NodeTransformer):

Please add docstring and type hint.

Xinyu302 added 16 commits July 26, 2023 19:45

Add recorder_hook and use ast to print assign node

823d238

use messagehub to store information

85cde71

use message_hub.update_scalar

074ee1e

use message_hub.update_scalar

f4dcc13

design class Recorder

e8144c9

add recover forward logic

c60b934

FunctionRecord actually should be AttributeRecorder because we find attribute by var name

54412dc

add FunctionRecorder

c7df8bb

add update2 messagehub logic

e4351ba

clean up code

9fe0e7d

add comment and registry for AttributeRecorder and FunctionRecorder

fd6b8e4

fix commit verify

25b2415

do some clean up

ce0bfbe

add recorder_hook_test.py

9ef4e44

redesign FunctionRecorder and AttributeRecorder

4b396aa

modify recorder_hook_test.py

2d48fae

Xinyu302 requested review from C1rN09, HAOCHENYE and zhouzaida as code owners August 10, 2023 15:17

Xinyu302 changed the title ~~[Feature] Add RecorderHook~~ [WIP][Feature] Add RecorderHook Aug 11, 2023

Xinyu302 added 10 commits September 4, 2023 13:44

modify attribute recorder

9a6ff6f

store function recorder in a format of assign_name@index

4c5d27b

modify function recorder index: start from 0

2d8b64b

use torch.save to dump data; handle when index is int

7bdf2c0

add default value for FunctionRecorder's index

9fa6c94

add copy.deepcopy to collect weight in layer

4102fa2

rename var name

f72c7b1

add model select in recorder

33dd386

refactor: modify AttributeRecorderTransformer; modify _get_model; modify restore forward logic

a995399

add deepcopy, if var is Tensor, use Tensor.detach().clone()

963b54e

Xinyu302 added 2 commits September 17, 2023 10:11

refactor about store var name

4f434e4

delete useless lines

1f54cfc

Xinyu302 changed the title ~~[WIP][Feature] Add RecorderHook~~ [Feature] Add RecorderHook Sep 17, 2023

Xinyu302 added 3 commits September 17, 2023 14:52

add appoint specify method

581d668

update test script

10e447f

use MessageHub.get_instance

2d5447b

zhouzaida reviewed Sep 19, 2023

Xinyu302 added 8 commits September 20, 2023 13:26

add docs

e7e439d

try to add type hint

b58540b

add type hint

ea29bfa

add type ignore

06fabbe

add recorder_hook test

d4406d6

modify test_recorder_hook

9f5f35a

delete modification option

4e81004

add save to messagehub

ec757ba
