This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Question about dataprocess of SeeKeR #5087

Open
ahxchzt opened this issue Oct 24, 2023 · 1 comment

Comments


ahxchzt commented Oct 24, 2023

I'd like to know how you handle the training data, given that SeeKeR has three different tasks (search query, knowledge, response). During training, do you mix the data for all three tasks together, or do you train on the tasks sequentially, starting the next one only after the previous one is finished? If it's the latter, I'm curious how you address catastrophic forgetting.

@mojtaba-komeili
Contributor

It is actually the first. We usually multi-task them together.
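
For readers unfamiliar with the term, here is a minimal sketch of what multi-tasking the three tasks together can look like: every batch draws examples from all task datasets, so no task is trained in isolation and then forgotten. This is an illustration only, not SeeKeR's or ParlAI's actual code. The function name `mixed_batches`, the toy datasets, and the equal sampling weights are all assumptions made for the example.

```python
import random
from typing import Dict, Iterator, List, Tuple


def mixed_batches(
    tasks: Dict[str, List[str]],
    weights: Dict[str, float],
    batch_size: int,
    num_batches: int,
    seed: int = 0,
) -> Iterator[List[Tuple[str, str]]]:
    """Yield batches whose examples are sampled across all tasks.

    Hypothetical helper: each example's task is picked at random in
    proportion to `weights`, then one example is drawn from that task.
    """
    rng = random.Random(seed)
    names = list(tasks)
    probs = [weights[name] for name in names]
    for _ in range(num_batches):
        batch = []
        for _ in range(batch_size):
            name = rng.choices(names, weights=probs, k=1)[0]
            batch.append((name, rng.choice(tasks[name])))
        yield batch


# Toy stand-ins for the three task datasets (made-up examples).
toy_tasks = {
    "search_query": ["context A -> query A", "context B -> query B"],
    "knowledge": ["docs A -> knowledge A", "docs B -> knowledge B"],
    "response": ["knowledge A -> reply A", "knowledge B -> reply B"],
}
toy_weights = {"search_query": 1.0, "knowledge": 1.0, "response": 1.0}

batches = list(mixed_batches(toy_tasks, toy_weights, batch_size=8, num_batches=50))
```

Because a single model sees all three example types interleaved throughout training, there is no final stage that overwrites what an earlier stage learned.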
