Running out of ram #1

jesuistay · 2017-10-01T20:56:47Z

I couldn't get HTK to work properly, possibly due to a bad installation, but librosa seemed to work fine.

However, when it gets to '===> Reading audio files...', it seems like the for loop going over the audio paths just fills up my 8 GB of RAM and swap. And this is only on the 28539 files from train-clean-100, and it doesn't produce any files at this stage.
Is there a trick I am missing to get the preprocessor going without reading everything into RAM all at once?
The ETA was over 1 hour, and it broke down after 54% of the train-clean-100 dataset.

hirofumi0810 · 2017-10-01T23:20:25Z

Hi, @jesuistay

Only 1 file is loaded in each loop iteration for memory efficiency, so I don't know why this happens.
Which loop do you mean?
There are 3 loops in librispeech/inputs/input_data.py.

jesuistay · 2017-10-02T10:08:00Z

The first one: for i, audio_path in enumerate(tqdm(audio_paths)):
To me it looks like it traverses the entire dataset and builds the dict along the way, in order to calculate the mean and std, I assume.

For now I've just skipped all the normalization and write the npy files right after I get input_data_utt from librosa.
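
If the memory growth really does come from keeping every utterance's features in a dict until the mean and std are known, one possible workaround is to accumulate running sums instead and normalize in a second pass. The following is only a rough sketch of that idea, not the code in librispeech/inputs/input_data.py: `RunningStats`, `normalize_streaming`, `extract_features`, and `save_path_for` are hypothetical names introduced here, and the sketch assumes each utterance yields an array of shape (num_frames, feature_dim).

```python
import numpy as np


class RunningStats:
    """Accumulate per-dimension mean and std without storing any frames."""

    def __init__(self, feature_dim):
        self.count = 0
        self.total = np.zeros(feature_dim, dtype=np.float64)
        self.total_sq = np.zeros(feature_dim, dtype=np.float64)

    def update(self, features):
        # features: (num_frames, feature_dim) array for a single utterance
        features = np.asarray(features, dtype=np.float64)
        self.count += features.shape[0]
        self.total += features.sum(axis=0)
        self.total_sq += (features ** 2).sum(axis=0)

    def finalize(self):
        mean = self.total / self.count
        # Clamp at zero to guard against tiny negative values from rounding
        var = np.maximum(self.total_sq / self.count - mean ** 2, 0.0)
        return mean, np.sqrt(var)


def normalize_streaming(audio_paths, extract_features, save_path_for, feature_dim):
    """Two passes over the files; only one utterance is in memory at a time.

    extract_features and save_path_for are hypothetical callables supplied
    by the caller (e.g. a librosa-based extractor and a path mapper).
    """
    stats = RunningStats(feature_dim)
    for path in audio_paths:  # pass 1: statistics only
        stats.update(extract_features(path))
    mean, std = stats.finalize()
    std = np.where(std == 0, 1.0, std)  # avoid division by zero

    for path in audio_paths:  # pass 2: normalize and write each file
        feats = extract_features(path)
        np.save(save_path_for(path), ((feats - mean) / std).astype(np.float32))
    return mean, std
```

The trade-off is that features are extracted twice, so this spends CPU time to keep memory flat. Saving the unnormalized features to disk in the first pass and reloading them in the second would be another option.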

jesuistay · 2017-10-07T10:04:40Z

I managed to get HTK working, but the RAM problem still confuses me. I had to increase my swap partition to 16 GB (+ 8 GB of RAM) just to manage to preprocess train-clean-100.


wolverineq pushed a commit to wolverineq/asr_preprocessing that referenced this issue Feb 14, 2018

Merge pull request hirofumi0810#1 from hirofumi0810/master

4a8e5ee
