Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to prevent overfitting? #15

Open
smita-09 opened this issue Jun 23, 2022 · 1 comment
Open

How to prevent overfitting? #15

smita-09 opened this issue Jun 23, 2022 · 1 comment
Assignees

Comments

@smita-09
Copy link

Hello there,

I am working on a very similar area, that is using reinforcement learning for optimization of the semiconductor manufacturing process and your implementation is easy to follow and has been very helpful I would say. Just wanted to ask a question here, how are preventing overfitting?

I can see that you are training the model on just one instance from your instances folder and iterating it for a while, so how are making sure that the model would not overfit? I believe, it should be trained on multiple instances to make it more stable. If you are already training it on multiple instances, ignore my question but please help me by pointing me how did you achieve that in your implementation?

Cheers,
Smita

@ingambe
Copy link
Collaborator

ingambe commented Nov 24, 2022

Hi,

Sorry for the late answer.
This is a good question, or the first approach was just a prototype, but if you want a model that generalizes, training on multiple instances is a must, IMO.
Also, the architecture of our neural network is not so good. You can get way better results with a Transformer and get a size-invariant network.

@ingambe ingambe self-assigned this Nov 24, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants