Why is MBD not allowed for stereo? #9

ajayarora1235 · 2024-01-10T15:54:12Z

The docs specify how to do MBD for stereo; here's how I've implemented it. Happy to make a PR for this as well.

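```python
import torch
from einops import rearrange

# `model_2` (the stereo model), `mbd` (the MBD decoder), and `description`
# are assumed to be defined earlier.
output_2 = model_2.generate(
    descriptions=[description],
    progress=True,
    return_tokens=True,
)

# With return_tokens=True, the tokens are the second element of the output.
tokens = output_2[1]
# Split the stereo codes into left/right and stack them along the batch axis.
left, right = model_2.compression_model.get_left_right_codes(tokens)
tokens = torch.cat([left, right])
outputs_diffusion_2 = mbd.tokens_to_wav(tokens)
assert outputs_diffusion_2.shape[1] == 1  # output is mono
# Fold the two mono halves back into stereo: (2*b, 1, t) -> (b, 2, t).
outputs_diffusion_2 = rearrange(outputs_diffusion_2, '(s b) c t -> b (s c) t', s=2)
```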
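For anyone unsure what the final `rearrange` pattern does, here is a minimal sketch on toy data. It assumes plain NumPy as a stand-in for `einops`, and `merge_stereo` is a hypothetical helper name, not part of any library. Because left and right are concatenated along the batch axis, the first half of the batch is the left channel and the second half is the right channel.

```python
import numpy as np


def merge_stereo(mono_batch: np.ndarray) -> np.ndarray:
    """NumPy sketch of rearrange(x, '(s b) c t -> b (s c) t', s=2).

    Hypothetical helper for illustration only.
    """
    two_b, c, t = mono_batch.shape
    assert two_b % 2 == 0, "expected left and right halves stacked on axis 0"
    b = two_b // 2
    # Split the leading axis into (side, batch): side 0 = left, side 1 = right.
    x = mono_batch.reshape(2, b, c, t)
    # Move the side axis next to the channel axis: (b, side, c, t).
    x = x.transpose(1, 0, 2, 3)
    # Merge side and channel into one stereo channel axis.
    return x.reshape(b, 2 * c, t)


# Toy data: 3 clips of 4 samples; left is all zeros, right is all ones.
left = np.zeros((3, 1, 4))
right = np.ones((3, 1, 4))
stacked = np.concatenate([left, right])  # shape (6, 1, 4), like torch.cat
stereo = merge_stereo(stacked)
print(stereo.shape)
```

Channel 0 of each output clip holds the left signal and channel 1 the right, which is why the mono assertion has to pass before the rearrange.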

rsxdalv mentioned this issue Feb 7, 2024

IndexError: index 4 is out of range error when checked Multi-band Diffusion rsxdalv/tts-generation-webui#275

Closed
