The question about training CLAP RVQ #27

Open
yangdongchao opened this issue Jul 17, 2023 · 4 comments

Comments

@yangdongchao

It seems that the ClapRVQTrainer code does not run any gradient backward pass. How should I understand this?

@zhvng
Owner

zhvng commented Jul 17, 2023

Hey, thanks for your interest in this repo! The RVQ parameters are updated via EMA (exponential moving average) rather than by backpropagation. See this relevant thread.
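
For readers wondering how a codebook can train without a backward pass, here is a minimal sketch of one EMA update step for a single quantizer layer. It is not the repository's ClapRVQTrainer code: the function names, the `decay` and `eps` values, and the plain-list representation are assumptions made for illustration, and the update rule is the common VQ-VAE-style EMA scheme, which may differ in detail from what the repo's quantizer library does.

```python
def nearest_code(vector, codebook):
    """Return the index of the codebook entry closest to `vector` (squared L2)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))


def ema_codebook_update(codebook, cluster_size, embed_sum, batch, decay=0.99, eps=1e-5):
    """Apply one EMA step in place and return the updated state.

    codebook:     list of code vectors (hypothetical plain-list layout)
    cluster_size: running EMA of how many vectors each code received
    embed_sum:    running EMA of the sum of vectors assigned to each code
    No gradients or optimizer are involved anywhere in this function.
    """
    num_codes, dim = len(codebook), len(codebook[0])
    counts = [0.0] * num_codes
    sums = [[0.0] * dim for _ in range(num_codes)]

    # Assign every input vector to its nearest code and accumulate statistics.
    for vector in batch:
        idx = nearest_code(vector, codebook)
        counts[idx] += 1.0
        for d in range(dim):
            sums[idx][d] += vector[d]

    for i in range(num_codes):
        # Blend the batch statistics into the running averages.
        cluster_size[i] = decay * cluster_size[i] + (1 - decay) * counts[i]
        for d in range(dim):
            embed_sum[i][d] = decay * embed_sum[i][d] + (1 - decay) * sums[i][d]
        # Each code becomes the running mean of the vectors assigned to it.
        # Clamping by eps is a simplification that avoids division by zero.
        denom = max(cluster_size[i], eps)
        codebook[i] = [s / denom for s in embed_sum[i]]

    return codebook, cluster_size, embed_sum
```

In a residual VQ, the same kind of step would run once per layer, with each layer quantizing the residual left over by the previous one. Since the codes move toward the running mean of their assigned embeddings, no loss gradient is needed to train them.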

@yangdongchao
Author

> Hey, thanks for your interest in this repo! The RVQ parameters are updated via ema. See this relevant thread

Thanks for your reply. How can I judge whether the training is successful? When can we stop the training?

@zhvng
Owner

zhvng commented Jul 18, 2023

I stopped when the eval loss stopped going down. That seems to work well enough.
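
As a rough illustration of that stopping rule, here is a small patience-based check. The function name `should_stop` and the `patience` and `min_delta` parameters are hypothetical and not part of the repo. This is a sketch of one common way to detect a plateau in the eval loss, not a description of what was actually run.

```python
def should_stop(eval_losses, patience=3, min_delta=0.0):
    """Return True when the last `patience` evals did not beat the earlier best.

    eval_losses: eval loss values in the order they were measured
    patience:    how many recent evals may pass without improvement
    min_delta:   smallest decrease that counts as an improvement
    """
    if len(eval_losses) <= patience:
        return False  # not enough history to judge yet
    best_before = min(eval_losses[:-patience])
    recent_best = min(eval_losses[-patience:])
    # Stop if the recent evals failed to improve on the earlier best by min_delta.
    return recent_best >= best_before - min_delta
```

Calling this after each evaluation with the full loss history, and saving a checkpoint whenever a new best loss appears, gives a simple way to pick the model to keep.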

@yangdongchao
Author

> I stopped when the eval loss stopped going down. seems to work good enough.

Thanks for your reply. Lastly, I want to ask whether you use only audio to extract the embeddings and quantize them with RVQ during training. Is the text used only at the inference stage?
