Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

perf(runner): add 'DEEPCACHE' optimization method #83

Merged
merged 1 commit into from
May 14, 2024

Conversation

rickstaa
Copy link
Contributor

@rickstaa rickstaa commented May 14, 2024

This pull request gives Orchestrators the ability to enable the DeepCache optimization method. This method can provide a 50% inference speed up for multi-step inference (See https://livepeer-ai-spe.productlane.com/roadmap?id=899de8b9-812f-461c-9866-820af7e27438).

@rickstaa rickstaa added the enhancement New feature or request label May 14, 2024
@rickstaa rickstaa force-pushed the add_deep_cache_optim branch 2 times, most recently from f182f4b to 8c97a15 Compare May 14, 2024 10:55
This commit enables Orchestrators to utilize the
[DeepCache](https://github.com/horseee/DeepCache) optimization method.
DeepCache has demonstrated the potential to achieve a 50% inference
speedup for multi-step inference processes (referenced from
https://livepeer-ai-spe.productlane.com/roadmap?id=899de8b9-812f-461c-9866-820af7e27438).

Furthermore, the commit includes minor optimizations and cleanup within
the relevant modules.
@rickstaa rickstaa merged commit 8bd32b0 into main May 14, 2024
1 check passed
@rickstaa rickstaa deleted the add_deep_cache_optim branch May 14, 2024 11:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant