Error with large number of columns #39

Open
nabarunaguha opened this issue Sep 10, 2019 · 1 comment
Labels
question Further information is requested


nabarunaguha commented Sep 10, 2019

Hi @ManuelAlvarezC !

I am excited to work with TGAN; this is also my first project with a GAN of any kind.

Recently I trained it on tabular data with 1000+ columns and got the following error:
InvalidArgumentError: Cannot serialize protocol buffer of type tensorflow.GraphDef as the serialized size (3015923402bytes) would be larger than the limit (2147483647 bytes)
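
For context, the limit in this message equals 2^31 - 1 bytes, the largest signed 32-bit integer, which is consistent with the roughly 2 GB ceiling on a serialized protocol buffer. A minimal sketch of the arithmetic follows; the two sizes are copied from the error above, and the variable names are only illustrative.

```python
# Sizes copied verbatim from the error message; names are illustrative.
GRAPH_BYTES = 3015923402   # serialized size of the GraphDef
LIMIT_BYTES = 2147483647   # limit reported by TensorFlow

INT32_MAX = 2**31 - 1      # largest signed 32-bit integer

exceeds_limit = GRAPH_BYTES > LIMIT_BYTES
overshoot_bytes = GRAPH_BYTES - LIMIT_BYTES
ratio = GRAPH_BYTES / LIMIT_BYTES

print("limit is 2**31 - 1:", LIMIT_BYTES == INT32_MAX)
print("graph exceeds limit:", exceeds_limit)
print("overshoot in bytes:", overshoot_bytes)
print(f"graph size / limit: {ratio:.2f}")
```

The graph is about 1.4 times the allowed size, so the model would need to shrink substantially, not marginally, before it could be serialized.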

I understand that I would need dimensionality reduction to train TGAN on this particular dataset, but a technique like PCA would lose the correlations among the columns.

Is there any way to use TGAN with a large number of columns?

Regards,
Nabaruna

csala added the question label Sep 16, 2019

leix28 (Contributor) commented Sep 17, 2019

TGAN uses a separate hidden layer to generate each column. These hidden layers consume a large amount of memory, so it is not easy to adapt TGAN to datasets with 1000+ columns. We developed a new framework called CTGAN, which can work on datasets with 1000+ columns.

Thanks,
Lei
