Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mulite processing on batch #135

Open
peterdudfield opened this issue Jan 5, 2023 · 2 comments
Open

Mulite processing on batch #135

peterdudfield opened this issue Jan 5, 2023 · 2 comments

Comments

@peterdudfield
Copy link
Contributor

Instead of looping on the batch, could we use multi processing?

code is here

@jacobbieker
Copy link
Member

Each worker thread is the multiprocessing though, and each example is emitted one by one by the datapipes before it, so not sure multiprocessing that code would speed it up at all. Adding more workers seems like probably the best solution to it, I would think.

@jacobbieker
Copy link
Member

Unless we want to break that each datapipe is independent of other ones, in which case we could maybe make a function or datapipe that loads multiple examples per step at once, and then gives them all to the next datapipe, still not sure that would be faster though

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants