I am trying to train on the GTZAN dataset. The accuracy is around 50%, which seems very low. I have a question: the mel spectrogram has a dimension of 96x2584 for each audio sample. Should I use the whole spectrogram as one "image" for the CNN, or do I need to split the audio file into shorter segments (for example, 2048 samples each) and run the CNN on each segment?
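To make the second option concrete, here is a minimal sketch of cutting one 96x2584 mel spectrogram into fixed-width segments along the time axis. It is only an illustration, not this repository's method: the function name `split_spectrogram`, the segment width of 128 frames, and the hop of 64 frames are assumptions chosen for the example, and the segmentation is done in spectrogram frames rather than raw audio samples. The random array stands in for a real spectrogram.

```python
import numpy as np

def split_spectrogram(mel, segment_frames=128, hop_frames=64):
    """Cut a (n_mels, n_frames) spectrogram into overlapping fixed-width segments.

    segment_frames and hop_frames are illustrative values, not tuned settings.
    Trailing frames that do not fill a whole segment are dropped.
    """
    n_mels, n_frames = mel.shape
    starts = range(0, n_frames - segment_frames + 1, hop_frames)
    # Result shape: (n_segments, n_mels, segment_frames)
    return np.stack([mel[:, s:s + segment_frames] for s in starts])

# Placeholder data with the shape mentioned above (96 mel bands, 2584 frames).
mel = np.random.rand(96, 2584).astype(np.float32)
segments = split_spectrogram(mel)
print(segments.shape)  # (39, 96, 128)
```

Each segment would carry the genre label of the track it came from. If this route is taken, one option is to average the per-segment predictions to get a single prediction per track, and to keep all segments of a track in the same train or test split so that the evaluation is not inflated.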