New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CQCC size? #61
Comments
The resulting cqcc features should be a 2d array with the shape (num_frames x num_ceps). |
yes,but i got the same size cqcc with different duration audio samples,they have the same num_frames with 66 |
This could be a bug, try to play with the frame length, the frame hop and the number of ceps. If the error persists, please provide a small reproduce-able example in Python that displays the error and I will try to review the code this weekend. If you have a possible solution feel free to open a PR. |
I ran into the same issue, I think it is due to an incorrect shape handling here. The output of A simple unit test to reproduce this (you can run it in Google Colab),
I'm not too familiar with cqcc, but if these lines are working as intended (i.e., resampling the frequency bins), then the fix is simply changing this line to @SuperKogito could you please confirm if I understood correctly? Thanks! |
thank you both for reporting this.
https://github.com/SuperKogito/spafe/blob/b6b1428df52694c95bb295a6ec291ae442053fcc/spafe/features/cqcc.py#L286C27-L286C43 I still need to review the litterature and update the docs before publishing this. I will try to do this on the weekend or next week. In the mean time you can use #63 |
Why after extracting cqcc features, the time dimension becomes 66, not the duration of the original audio?
The text was updated successfully, but these errors were encountered: