GTZAN dataset #37

sarbjit-longia · 2018-05-12T05:06:26Z

I am trying to train on GTZAN dataset. The accuracy is around 50% which seems very low. I have a question, the mel spectrogram has dimension of 96x2584 for each audio sample. Shall i use the whole sample as one "image" for the CNN network or do I need to divide the audio file into samples like 2048 and use CNN on that one.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GTZAN dataset #37

GTZAN dataset #37

sarbjit-longia commented May 12, 2018 •

edited

GTZAN dataset #37

GTZAN dataset #37

Comments

sarbjit-longia commented May 12, 2018 • edited

sarbjit-longia commented May 12, 2018 •

edited