[BUG] Loss drops, model still produces gibberish? #23
Comments
@MichelNivard try training it now and see what happens; I've made many optimizations.
Okay, digging into it later today, thanks!
Hi, I trained the model to completion using the train.py script, although I used a larger batch size and fewer epochs because I trained on a different GPU. However, the model produces gibberish.
The validation line was:
Could we add proper checkpointing to the training loop in train.py? I've tried torch.save({}), but the model can't be opened with Netron for validation. I'm obviously missing something.
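A minimal sketch of what that checkpointing could look like, assuming train.py has `model` and `optimizer` objects in scope (the helper names, `it`, and the `ckpt.pt` path are hypothetical):

```python
import torch

def save_checkpoint(model, optimizer, it, loss, path="ckpt.pt"):
    # Bundle everything needed to resume training into one pickled dict.
    torch.save({
        "iteration": it,
        "model_state_dict": model.state_dict(),
        "optimizer_state_dict": optimizer.state_dict(),
        "loss": loss,
    }, path)

def load_checkpoint(model, optimizer, path="ckpt.pt"):
    # Restore weights and optimizer state before resuming or sampling.
    ckpt = torch.load(path, map_location="cpu")
    model.load_state_dict(ckpt["model_state_dict"])
    optimizer.load_state_dict(ckpt["optimizer_state_dict"])
    return ckpt["iteration"], ckpt["loss"]
```

On the Netron point: a torch.save checkpoint is a pickled Python dict, not a computation graph, so Netron has nothing to render; exporting the model to a graph format such as ONNX (via torch.onnx.export) or TorchScript is usually what Netron expects.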
Describe the bug
After 5300 iterations the loss is near 2.7; is it still supposed to spit out near-gibberish?
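As a rough sanity check (assuming the reported loss is mean per-token cross-entropy in nats), a loss of 2.7 corresponds to a perplexity of about 15, i.e. the model is still roughly 15-way uncertain at every token, so mostly garbled output at this stage isn't surprising:

```python
import math

loss = 2.7             # reported mean cross-entropy in nats (assumed)
print(math.exp(loss))  # perplexity ≈ 14.88 — still ~15 plausible tokens per step
```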
To Reproduce
Running on CPU (MacBook Air M2), omitting the model.cuda() line.
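Rather than deleting model.cuda() by hand, a device-agnostic setup would cover the CUDA, Apple-silicon, and CPU cases uniformly; a sketch, assuming a `model` variable as in train.py (the `mps` backend needs PyTorch 1.12+):

```python
import torch

# Pick the best available device instead of hard-coding model.cuda().
if torch.cuda.is_available():
    device = torch.device("cuda")
elif torch.backends.mps.is_available():   # Apple-silicon GPU (M1/M2)
    device = torch.device("mps")
else:
    device = torch.device("cpu")

model = model.to(device)  # `model` assumed to come from train.py
# Inside the training loop, move each batch the same way:
# x, y = x.to(device), y.to(device)
```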
Expected behaviour
Some kind of convergence on sentences that are at least English-ish?
Screenshots
Additional context
Maybe my expectations are just off and I should train way, way more?