
SSL w/o torchaudio dependency #5537

Open · wants to merge 69 commits into base: master

Conversation

@wanchichen (Contributor) commented Nov 7, 2023

What?

This PR enables HuBERT pre-training without the torchaudio dependency, allowing more customization and the use of different ESPnet components. It also introduces some tricks to better support large-scale training.

Features:

  • Flash Attention (by @pyf98)
  • Activation checkpointing in E-Branchformer (see the first sketch after this list)
  • Convolutional feature extractor as frontend
  • Convolutional positional embeddings (see the second sketch after this list)
  • Efficient batch sampler for large-scale training
  • Efficient multi-shard iterator for large-scale training
  • WavLM-style noise augmentation
  • More efficient distributed training
  • Recommended tunable DDP args for multi-node training
  • Cross-entropy loss for HuBERT SSL
  • Intermediate losses for HuBERT SSL
  • HuBERT SSL using filterbank input features
  • More detailed GPU memory reporting

So far, only HuBERT with Transformer and E-Branchformer encoders is supported.
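
As a reference for the activation checkpointing item above, here is a minimal, self-contained sketch of the technique with torch.utils.checkpoint. The encoder class, layer sizes, and head count are illustrative placeholders, not the PR's E-Branchformer code:

```python
import torch
from torch.utils.checkpoint import checkpoint


class CheckpointedEncoder(torch.nn.Module):
    """Illustrative encoder stack with activation checkpointing."""

    def __init__(self, num_layers: int = 12, d_model: int = 256, nhead: int = 4):
        super().__init__()
        self.layers = torch.nn.ModuleList(
            torch.nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
            for _ in range(num_layers)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            # The layer's intermediate activations are not stored during the
            # forward pass; they are recomputed on the fly during backward,
            # trading extra compute for a large cut in peak memory.
            x = checkpoint(layer, x, use_reentrant=False)
        return x


# Peak activation memory now scales with one layer instead of all twelve.
enc = CheckpointedEncoder()
out = enc(torch.randn(2, 100, 256, requires_grad=True))
out.sum().backward()
```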

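For the convolutional positional embeddings item, a minimal sketch of the wav2vec 2.0-style approach: a grouped temporal convolution whose output is added back to the features, so position is encoded by the convolution's receptive field. Kernel size, group count, and model width below are illustrative defaults, not the PR's configuration:

```python
import torch


class ConvPositionalEmbedding(torch.nn.Module):
    """Grouped temporal convolution used as a relative positional signal."""

    def __init__(self, d_model: int = 512, kernel_size: int = 128, groups: int = 16):
        super().__init__()
        self.conv = torch.nn.Conv1d(
            d_model, d_model, kernel_size, padding=kernel_size // 2, groups=groups
        )
        # An even kernel with symmetric padding yields one extra frame.
        self.trim = 1 if kernel_size % 2 == 0 else 0
        self.act = torch.nn.GELU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, d_model)
        pos = self.conv(x.transpose(1, 2))
        if self.trim:
            pos = pos[:, :, : -self.trim]
        return x + self.act(pos).transpose(1, 2)


x = torch.randn(2, 50, 512)
assert ConvPositionalEmbedding()(x).shape == x.shape
```
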
To do:

  • Fbank training configs
  • Conformer and Branchformer encoders
  • Check in hubert.sh which implementation to use

mergify bot commented Nov 7, 2023

This pull request is now in conflict :(

@sw005320 (Contributor) commented Jan 4, 2024

@wanchichen, can you restart this PR?

mergify bot commented Feb 6, 2024

This pull request is now in conflict :(

@sw005320 (Contributor) commented

@wanchichen, let’s finish this PR.
There are a lot of conflicts now. So, please resolve them.

@simpleoier, please also review this PR.

espnet2/train/trainer.py
@@ -512,11 +520,6 @@ def train_one_epoch(
        ):
            assert isinstance(batch, dict), type(batch)

-            if distributed:
@wanchichen (Contributor, Author) replied:

I ran several experiments (ASR, SSL) and found that this block was unnecessary. But we may need to experiment with other batch samplers to make sure.

espnet2/train/trainer.py
@@ -709,7 +731,7 @@ def train_one_epoch(
                for iopt, optimizer in enumerate(optimizers):
                    if optim_idx is not None and iopt != optim_idx:
                        continue
-                    optimizer.zero_grad()
+                    optimizer.zero_grad(set_to_none=True)
@wanchichen (Contributor, Author) replied:

This is now the setting recommended by PyTorch; it is both faster and more memory-efficient. However, it may also slightly affect the learning curve.
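
For illustration, a minimal self-contained example of the new behavior (plain PyTorch, not ESPnet code):

```python
import torch

model = torch.nn.Linear(8, 8)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

model(torch.randn(4, 8)).sum().backward()
opt.step()

# set_to_none=True frees the .grad tensors instead of overwriting them
# with zeros: it skips one memset per parameter and releases the memory
# until the next backward() lazily re-allocates the gradients.
opt.zero_grad(set_to_none=True)
assert all(p.grad is None for p in model.parameters())
```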

(Outdated review thread on espnet2/train/trainer.py, resolved)
@simpleoier (Collaborator) left a comment

Thanks! I made some comments.
One question I have is about the espnet2/ssl/mask module. What is the benefit of a new mask module? If it is only used in HuBERT models during pre-training, a single function in the HuBERT model would be enough.
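
For context, the module in question implements HuBERT-style span masking of frames. A generic sketch of the idea follows; the function name and defaults are illustrative, not the actual espnet2/ssl/mask API:

```python
import torch


def compute_span_mask(
    batch: int, frames: int, mask_prob: float = 0.08, mask_length: int = 10
) -> torch.Tensor:
    """Return a (batch, frames) boolean mask of random contiguous spans.

    Roughly mask_prob of all frames end up covered; each masked region is
    a span of mask_length frames, as in HuBERT/wav2vec 2.0 pre-training.
    """
    mask = torch.zeros(batch, frames, dtype=torch.bool)
    num_spans = max(1, int(mask_prob * frames / mask_length))
    for b in range(batch):
        # Sample distinct span starts, then expand each to a full span.
        starts = torch.randperm(frames - mask_length)[:num_spans]
        for s in starts.tolist():
            mask[b, s : s + mask_length] = True
    return mask


# During pre-training, the prediction loss is computed on masked frames only.
mask = compute_span_mask(batch=4, frames=400)
```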

Outdated review threads (resolved) on:
  • egs2/ml_superb/asr1/local/get_ssl_weights.py
  • espnet/nets/pytorch_backend/transformer/attention.py
  • espnet/nets/pytorch_backend/transformer/embedding.py
  • espnet2/asr/encoder/e_branchformer_encoder.py
  • espnet2/s2t/espnet_model.py
  • espnet2/ssl/loss/hubert_loss_ce.py
  • espnet2/tasks/abs_task.py
  • espnet2/train/preprocessor.py
  • espnet2/train/trainer.py
mergify bot added the conflicts label on May 8, 2024
mergify bot removed the conflicts label on May 8, 2024