Mode 'parallel' for EncSALayer to speed up infer on ONNX #191

KakaruHayate · 2024-05-12T05:04:20Z

'transformer-parallel' is widely used in GPT-J-6B and has been proven to have the same effect as traditional transformer.
It can be simplified as:

This saves a skip link and a LayerNorm.This can bring a slight improvement in training speed on Diffsinger.

After experimentation, this modification has shown a more significant improvement on ONNX.
The following are the experimental parameters and results. The benchmark was performed using infer_acoustic.py, and the backbone of the model used lynxnet, without using shallow diffusion.

run_parallel
...20/20 [00:20<00:00,  1.00s/it]
run_series
...20/20 [00:23<00:00,  1.15s/it]

run_parallel
...20/20 [00:20<00:00,  1.01s/it]
run_series
...20/20 [00:22<00:00,  1.13s/it]

run_parallel
...20/20 [00:21<00:00,  1.07s/it]
run_series
...20/20 [00:23<00:00,  1.17s/it]

run_parallel
...20/20 [00:20<00:00,  1.03s/it]
run_series
...20/20 [00:22<00:00,  1.13s/it]

run_parallel
...20/20 [00:20<00:00,  1.04s/it]
run_series
...20/20 [00:23<00:00,  1.16s/it]

On average, the inference speed has increased by 8%.
This change has been applied to yousaV1.42ReFlow and there have been no reports of any issues yet.

COPY

This reverts commit 0f27a4b.

KakaruHayate and others added 5 commits May 12, 2024 12:32

Mode 'parallel' for EncSALayer to speed up infer on ONNX

608bc34

Mode 'parallel' for EncSALayer to speed up infer on ONNX

4d3d921

COPY

0f27a4b

Merge pull request #1 from KakaruHayate/master

9211038

COPY

Revert "COPY"

55f5166

This reverts commit 0f27a4b.

KakaruHayate closed this May 16, 2024

KakaruHayate deleted the patch-1 branch May 16, 2024 02:13

KakaruHayate restored the patch-1 branch May 16, 2024 02:16

KakaruHayate deleted the patch-1 branch May 16, 2024 02:17

KakaruHayate restored the patch-1 branch May 18, 2024 10:51

KakaruHayate reopened this May 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mode 'parallel' for EncSALayer to speed up infer on ONNX #191

Mode 'parallel' for EncSALayer to speed up infer on ONNX #191

KakaruHayate commented May 12, 2024 •

edited

Mode 'parallel' for EncSALayer to speed up infer on ONNX #191

Are you sure you want to change the base?

Mode 'parallel' for EncSALayer to speed up infer on ONNX #191

Conversation

KakaruHayate commented May 12, 2024 • edited

KakaruHayate commented May 12, 2024 •

edited