
SRNN Datapreprocessing script #124

Open
wants to merge 3 commits into
base: harsha/reorg

Conversation

pushkalkatara
Contributor

@pushkalkatara commented Aug 21, 2019

Hi @harsha-simhadri,
this is a quick implementation of the script process_google.py.
I have also checked SRNN; the accuracy results are in the .ipynb included in this PR.
Solves issue #122.

@pushkalkatara changed the title from SRNN Datapreprocessing script #122 to SRNN Datapreprocessing script on Aug 21, 2019
@metastableB
Contributor

@pushkalkatara is there any reason you preferred h5py over numpy.memmap?

@pushkalkatara
Contributor Author

@metastableB numpy.memmap does not store the dims and dtypes, so we would have to hard-code the train, test, and val dims and dtypes in SRNN_example.py. Also, I have generally seen h5py or pandas used for this purpose. We can switch to numpy.memmap if the extra dependency is an issue.
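
For context, a minimal sketch of the difference (not part of the PR; file names, shapes, and dtypes below are hypothetical): numpy.memmap needs the shape and dtype supplied again at read time, whereas h5py stores them with the dataset.

```python
import numpy as np
import h5py

# Hypothetical training split: 1000 examples, 99 timesteps, 32 features.
x_train = np.random.rand(1000, 99, 32).astype(np.float32)

# numpy.memmap: the file is raw bytes, so the shape and dtype must be
# repeated in SRNN_example.py (or shipped in a separate metadata file).
mm = np.memmap("train.dat", dtype=np.float32, mode="w+", shape=x_train.shape)
mm[:] = x_train
mm.flush()
x_back = np.memmap("train.dat", dtype=np.float32, mode="r", shape=(1000, 99, 32))

# h5py: shape and dtype are stored with the dataset and recovered on read.
with h5py.File("train.h5", "w") as f:
    f.create_dataset("x_train", data=x_train)
with h5py.File("train.h5", "r") as f:
    x_back = f["x_train"][:]
```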

@metastableB
Contributor

@pushkalkatara Yes, I am apprehensive about adding an extra dependency just for one script, though I must admit I don't have a sense of how complex the code would become with plain numpy. Let's use pandas instead? It's already part of the requirements here.

@harsha-simhadri
Collaborator

@metastableB are you able to fix this using pandas?

@metastableB
Contributor

@pushkalkatara do you want me to take over or are you working on this?

@pushkalkatara
Contributor Author

I can work on it. We would need to save the pandas DataFrame in some format: CSV, pickle, or HDF5. Which one should I use?

@metastableB
Contributor

Thanks!

Ah, I did not think this through. CSV will cause file sizes to bloat. It seems pickle is the best route, as numpy.load (here) also supports loading from pickled files.

We might have to change the scripts to reflect the new file names.
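
A minimal sketch of the pickle round trip being discussed (the file name, label column, and shapes are hypothetical, not from the PR):

```python
import numpy as np
import pandas as pd

# Hypothetical split: 1000 examples, 99 timesteps, 32 features, 12 classes.
x_train = np.random.rand(1000, 99, 32).astype(np.float32)
y_train = np.random.randint(0, 12, size=1000)

# Flatten each example so it fits a 2-D DataFrame, then pickle it.
df = pd.DataFrame(x_train.reshape(len(x_train), -1))
df["label"] = y_train
df.to_pickle("train.pkl")

# numpy.load falls back to pickle when allow_pickle=True and the file is
# not an .npy/.npz archive, so it returns the pickled DataFrame as-is.
# Note: unpickling a DataFrame still requires pandas to be importable.
obj = np.load("train.pkl", allow_pickle=True)
y_back = obj["label"].to_numpy()
x_back = obj.drop(columns=["label"]).to_numpy().reshape(-1, 99, 32)
```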

@metastableB
Contributor

@pushkalkatara Any updates?

@pushkalkatara
Contributor Author

@metastableB Yes, I'll make the changes today.

@harsha-simhadri force-pushed the harsha/reorg branch 6 times, most recently from 2120eb9 to 7f90603 on October 20, 2019