Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Example of fine-tuning the audio sub-network. #91

Open
mattiacampana opened this issue Jul 16, 2022 · 1 comment
Open

Example of fine-tuning the audio sub-network. #91

mattiacampana opened this issue Jul 16, 2022 · 1 comment

Comments

@mattiacampana
Copy link

I want to perform the fine-tuning of the audio subnetwork to fit my audio classification problem.
To this aim, I plan to use the _construct_linear_audio_network, _construct_mel128_audio_network, and _construct_mel256_audio_network functions to load the pre-trained Keras model and then append one or more fully-connected layers to perform the classification.

However, I don't understand the Input shape of such models. According to the models.py, the input shape is input_shape = (1, asr * audio_window_dur), where asr= 48000 and audio_window_dur=1; what's asr and why it has that value? Can you please provide an example of using the Keras model from the .wav file?

I really appreciate any help you can provide.

@sreenivasaupadhyaya
Copy link

Hi @mattiacampana Could you please tell me how you got the pre trained keras weights for the audio sub network or any code to read the model and load the pre trained weights?
Thanks you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants