content提取方式的比较？ #16

hongwen-sun · 2023-03-13T03:35:43Z

您好，这个项目做的很好。
我看你用了第九层的hubert，出于什么考虑呢？如何权衡内容信息丢失、音色泄漏的问题，您有对比过其他层或者whisper这种方式吗？

hongwen-sun · 2023-03-13T08:33:49Z

还有个疑问：

logits = model.extract_features(**inputs)
feats = model.final_proj(logits[0])

这部分代码提取的第9层的信息，又经过了final_proj的结构，这样是bug还是有意为之？我的理解是最后一层经过它才是合理的

leng-yue · 2023-03-24T04:26:57Z

我个人倾向于第 9 层是故意的, 但是 final proj 是不小心的...
有不少论文讨论了不同层的 feature 是有区别的, 第 9 层可能是从这些论文来的.
原理上不应该经过 final proj, 但是可能经过了也不影响, 最多丢点信息. 具体得问开发第一版本的佬了...

leng-yue · 2023-03-31T08:10:59Z

根据 content vec 原作者的信息, final_proj 是一个错误的用法, 但是不幸的是我们现在所有的模型都在用 (

w-okada · 2023-04-03T07:47:44Z

So, does officail svc-develop-team have any plan to fix this usage of content vec?

Likkkez · 2023-04-04T18:00:09Z

Fix when? pls

MuruganR96 · 2023-04-04T19:14:14Z

Fix when? pls

https://huggingface.co/lengyue233/content-vec-best

Likkkez · 2023-04-04T21:30:22Z

Fix when? pls

https://huggingface.co/lengyue233/content-vec-best

Sorry I'm a bit confused. What do i need to do with that to apply the fix to so-vits-svc?

MuruganR96 · 2023-04-04T22:12:00Z

This is fixed in https://github.com/34j/so-vits-svc-fork.

Check out this issue: voicepaw/so-vits-svc-fork#213
Check out this PR: voicepaw/so-vits-svc-fork#197

in utils.py. get_hubert_content https://github.com/svc-develop-team/so-vits-svc/blob/4.0/utils.py#L225

    with torch.no_grad(), timer() as t:
        params = {"output_layer": 9} if legacy_final_proj else {}
        c: torch.Tensor = cmodel.extract_features(audio, **params)[0]
        if legacy_final_proj:
            warnings.warn("legacy_final_proj is deprecated")
            assert hasattr(cmodel, "final_proj")
            assert isinstance(cmodel.final_proj, torch.nn.Module)
            c = cmodel.final_proj(c)
        c = c.transpose(1, 2)

I haven't tried yet.

Likkkez · 2023-04-04T22:51:57Z

This is fixed in https://github.com/34j/so-vits-svc-fork.

Check out this issue: 34j/so-vits-svc-fork#213 Check out this PR: 34j/so-vits-svc-fork#197

in utils.py. get_hubert_content https://github.com/svc-develop-team/so-vits-svc/blob/4.0/utils.py#L225
    with torch.no_grad(), timer() as t:
        params = {"output_layer": 9} if legacy_final_proj else {}
        c: torch.Tensor = cmodel.extract_features(audio, **params)[0]
        if legacy_final_proj:
            warnings.warn("legacy_final_proj is deprecated")
            assert hasattr(cmodel, "final_proj")
            assert isinstance(cmodel.final_proj, torch.nn.Module)
            c = cmodel.final_proj(c)
        c = c.transpose(1, 2)
I haven't tried yet.

Alright, I'll try. thanks!

MuruganR96 · 2023-04-16T08:55:40Z

@Likkkez refer this options -> https://github.com/yxlllc/DDSP-SVC/blob/master/ddsp/vocoder.py#L114

Likkkez · 2023-04-18T15:58:45Z

@Likkkez refer this options -> https://github.com/yxlllc/DDSP-SVC/blob/master/ddsp/vocoder.py#L114

A ye thanks! I think now theres also a branch here that does the same thing right? The '4.0-Vec768-Layer12'.

Geraint-Dou added the help wanted The issue author is asking for help label Mar 19, 2023

Miuzarte removed the help wanted The issue author is asking for help label Apr 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

content提取方式的比较？ #16

content提取方式的比较？ #16

hongwen-sun commented Mar 13, 2023

hongwen-sun commented Mar 13, 2023

leng-yue commented Mar 24, 2023

leng-yue commented Mar 31, 2023

w-okada commented Apr 3, 2023

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 4, 2023

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 4, 2023 •

edited

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 16, 2023

Likkkez commented Apr 18, 2023

content提取方式的比较？ #16

content提取方式的比较？ #16

Comments

hongwen-sun commented Mar 13, 2023

hongwen-sun commented Mar 13, 2023

leng-yue commented Mar 24, 2023

leng-yue commented Mar 31, 2023

w-okada commented Apr 3, 2023

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 4, 2023

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 4, 2023 • edited

Likkkez commented Apr 4, 2023

MuruganR96 commented Apr 16, 2023

Likkkez commented Apr 18, 2023

MuruganR96 commented Apr 4, 2023 •

edited