[WIP] AI video prototype #2959

yondonfu · 2024-01-23T22:15:55Z

Opening this draft PR to kick off CI build process for the WIP ai-video branch. This PR should not be merged as-is and the code on this branch will likely be refactored + cleaned up separately later on.

yondonfu · 2024-01-24T17:15:02Z

d32b1b5 temporarily disables Linux arm64 builds because they fail with an error about zlib not being found during ffmpeg compilation. This error doesn't occur for amd64 builds, so I suspect the issue is related to the amd64 -> arm64 cross-compilation process. I noticed that we compile an arm64-specific version of x264 before compiling ffmpeg - perhaps we have to do something similar with zlib?

zlib has been a required dependency since 133050d#diff-4ae778054809274731b9da0c6a5a869c0bd214e92f954a5c9c39181748c2f175, which enabled the png decoder and image2 muxer. These are used to demux + decode a sequence of PNG files so they can be encoded into an mp4 file. Ideally, we would replace the PNG demux/decode component by passing the tensors (that represent frames) output by a model directly from GPU memory to NVENC using torchaudio.StreamWriter. However, torchaudio.StreamWriter doesn't support RGB -> YUV conversion on the GPU yet. It can still encode a larger, less streaming-friendly RGB output (my understanding is that yuv420p is preferred for streaming), but I haven't implemented this yet because of that limitation. Until this replacement happens, zlib remains a required dependency to support the temporary PNG demux + decode component.

iameli · 2024-01-24T18:19:53Z

@yondonfu Weird - on release go-livepeer right now zlib is dynamically linked. I'll have a look.

* feat(ai): add pipelines optimization flags

This commit adds a new `OptimizationFlags` field to the `aiModels` config so that users can forward optimization environment variables to the [ai-worker](git@github.com:livepeer/ai-worker.git). For more information, see livepeer/ai-worker#61.

* chore: update ai-worker to latest commit

This commit ensures that the https://github.com/livepeer/ai-worker dependency is on the latest commit that includes the new optimization flags feature (see livepeer/ai-worker#61).

* refactor: improve OptFlags logging

This commit ensures that the `optimzation flag not supported` warning is shown for each model that is not loaded warm.

* feat(ai): enable AI orchestrator discovery

This commit uses the AIServiceRegistry contract address in place of the conventional ServiceRegistry contract address. This makes it possible to discover AI Orchestrators within the AI Subnet. While this approach is a quick workaround that enables the feature without extensive code changes, it may disrupt the existing transcoding discovery mechanism. We have to fix this if we want to merge the two networks in the future.

* docs(ai): improve discovery documentation

This commit ensures that people are aware that they have to interact with the `AIServiceRegistry` using their main Orch wallet.

* fix: fix 'AIServiceRegistry' devnet and testnet issue

This commit ensures that the hardcoded `AIServiceRegistry` contract doesn't break the go-livepeer binary on local devnets or testnets.

* fix(ai): improve selection algorithm

This commit modifies the selection algorithm to continue retrying for a duration of one second instead of stopping after four attempts. This change addresses issues encountered with the current algorithm's performance in environments with 15 nodes on the network, ensuring more robust and reliable operation until further optimizations can be implemented.

* refactor(ai): enhance selection algorithm retry logic

This commit replaces the time-based for-loop in the selection algorithm's retry logic with a more context-aware approach.
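The context-aware retry described above could look roughly like the following. This is a minimal sketch, not the code from this branch: `selectWithRetry`, `selectFunc`, `errNoOrchestrator`, and the backoff value are hypothetical names and parameters chosen for illustration. The only details taken from the commit messages are the one-second retry window and the use of a context instead of a fixed attempt count.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// errNoOrchestrator is a hypothetical sentinel error used only in this sketch.
var errNoOrchestrator = errors.New("no orchestrator available")

// selectFunc stands in for a single selection attempt (hypothetical).
type selectFunc func(ctx context.Context) (string, error)

// selectWithRetry keeps calling attempt until it succeeds or until the
// timeout derived from the parent context expires. Because the retry context
// is a child of parent, cancelling parent also stops the loop.
func selectWithRetry(parent context.Context, timeout, backoff time.Duration, attempt selectFunc) (string, error) {
	ctx, cancel := context.WithTimeout(parent, timeout)
	defer cancel()

	for {
		orch, err := attempt(ctx)
		if err == nil {
			return orch, nil
		}
		select {
		case <-ctx.Done():
			// The loop always terminates here once the deadline passes.
			return "", fmt.Errorf("selection gave up: %w (last error: %v)", ctx.Err(), err)
		case <-time.After(backoff):
			// Wait briefly, then try again.
		}
	}
}

// demoRetry fails twice and then succeeds on the third attempt.
func demoRetry() (string, int, error) {
	calls := 0
	orch, err := selectWithRetry(context.Background(), time.Second, 10*time.Millisecond,
		func(ctx context.Context) (string, error) {
			calls++
			if calls < 3 {
				return "", errNoOrchestrator
			}
			return "orch-a", nil
		})
	return orch, calls, err
}

func main() {
	orch, calls, err := demoRetry()
	fmt.Println(orch, calls, err)
}
```

Deriving the retry deadline from the caller's context means the loop cannot outlive the request that started it, which is the property a fixed attempt counter does not give.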

* fix(ai): handle insufficient capacity payments

This commit enhances the Orchestrator's capacity handling by returning an error prior to processing payments when capacity is insufficient. This prevents the Gateway from overpaying for requests.

* chore(ai): update ai-worker dependency

This commit updates the ai-worker dependency to the latest version.
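The ordering that the capacity fix relies on can be illustrated as follows. This is a sketch under stated assumptions: the `orchestrator` type, its fields, and `handleRequest` are hypothetical stand-ins and do not mirror the go-livepeer types. The point is only that the capacity check runs before any payment is accepted.

```go
package main

import (
	"errors"
	"fmt"
)

// errInsufficientCapacity is a hypothetical sentinel error for this sketch.
var errInsufficientCapacity = errors.New("insufficient capacity")

// orchestrator is a hypothetical stand-in, not the go-livepeer type.
type orchestrator struct {
	freeSlots int   // remaining capacity
	credited  int64 // sum of payments accepted so far
}

// handleRequest checks capacity first, so a rejected request never results
// in an accepted payment.
func (o *orchestrator) handleRequest(payment int64) error {
	if o.freeSlots <= 0 {
		return errInsufficientCapacity
	}
	o.freeSlots--
	o.credited += payment
	return nil
}

// demoCapacity sends two requests to an orchestrator with a single free slot.
func demoCapacity() (credited int64, rejected bool) {
	o := &orchestrator{freeSlots: 1}
	_ = o.handleRequest(100)    // accepted, payment credited
	err := o.handleRequest(100) // rejected before the payment is processed
	return o.credited, errors.Is(err, errInsufficientCapacity)
}

func main() {
	credited, rejected := demoCapacity()
	fmt.Println(credited, rejected)
}
```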

This commit introduces latency consideration into the orchestrator selection process, addressing two key issues. Firstly, it resolves a minor bug where the algorithm consistently selected known orchestrators due to a condition that never evaluated to true (see [this condition](https://github.com/livepeer/go-livepeer/blob/1239b4e56133003fe6a98a863cce6bdd6b5f2532/server/selection.go#L110)). Secondly, this change ensures that, once all orchestrators have been evaluated, the one with the fastest response time for a specific job is chosen. While the current method for calculating latency is somewhat basic, it sets the foundation for more sophisticated enhancements in the future.

Co-authored-by: Brad P <0xb79orch@gmail.com>
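One way to picture latency-aware selection is the sketch below. It is illustrative only: the `candidate` type and `pick` function are hypothetical, and the rule of trying unmeasured orchestrators before comparing latencies is an assumption made here, not something the commit message specifies. What it does take from the message is that, once every orchestrator has been evaluated, the fastest one is chosen.

```go
package main

import (
	"fmt"
	"time"
)

// candidate is a hypothetical record used only in this sketch.
type candidate struct {
	url     string
	latency time.Duration // 0 means "not measured yet" (assumption)
}

// pick returns the first unmeasured candidate if there is one; otherwise it
// returns the candidate with the lowest recorded latency. ok is false when
// the pool is empty.
func pick(pool []candidate) (best candidate, ok bool) {
	for _, c := range pool {
		if c.latency == 0 {
			return c, true
		}
		if !ok || c.latency < best.latency {
			best, ok = c, true
		}
	}
	return best, ok
}

// demoPick selects once while orch-b is unmeasured and once after it has
// been measured.
func demoPick() (string, string) {
	pool := []candidate{
		{url: "orch-a", latency: 900 * time.Millisecond},
		{url: "orch-b"}, // not measured yet
		{url: "orch-c", latency: 300 * time.Millisecond},
	}
	first, _ := pick(pool)
	pool[1].latency = 1200 * time.Millisecond // orch-b has now been evaluated
	second, _ := pick(pool)
	return first.url, second.url
}

func main() {
	first, second := demoPick()
	fmt.Println(first, second)
}
```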

* Remove -pricePerUnit requirement for orchestrator with -AIWorker flag

* refactor: add PricePerUnit comment

This commit reintroduces the previously omitted comment for the PricePerUnit variable, improving code readability and maintainability.

* refactor: simplify PricePerUnit flag check condition

This commit simplifies the condition used to check whether the `PricePerUnit` flag is needed.

Co-authored-by: Rick Staa <rick.staa@outlook.com>

This commit updates the https://github.com/livepeer/ai-worker dependency to the latest version so that Orchestrators can enable the [DeepCache](https://github.com/horseee/DeepCache) optimization. This optimization will provide a 50% speedup for multi-step inference requests.

yondonfu changed the title ~~[WIP] AI Video prototype~~ [WIP] AI video prototype Jan 23, 2024

yondonfu force-pushed the ai-video branch 5 times, most recently from feb3f3e to 563d199 Compare January 24, 2024 16:53

yondonfu force-pushed the ai-video branch from 39c4aad to ac4729c Compare January 25, 2024 01:37

yondonfu force-pushed the ai-video branch from 650aae7 to 4cecce3 Compare February 8, 2024 18:43

yondonfu force-pushed the ai-video branch from a436dc3 to c8350af Compare February 20, 2024 17:07

yondonfu force-pushed the ai-video branch from f1718fa to 9f7270e Compare March 14, 2024 21:00

yondonfu added 18 commits March 25, 2024 13:40

server: Add unimplemented AI handler

3ede48f

multi: Add /text-to-image for O

6f29698

core+server: Add /image-to-image to O

e2735cd

core+server: Add /image-to-video for O

8883824

multi: Transcode PNG -> mp4 for image-to-video

92bfa74

server: Impl B -> O image-to-video

be72c37

server: Impl B -> O text-to-image

7cd7913

server: Set Content-Type header on B /image-to-video

abe1b5a

server: Impl B -> O image-to-image

7ec59b3

mod: Bump go-tools to v0.3.5

1822db5

server: Upload to OS for all AI endpoints

52785b2

mod: Bump ai-worker + go-tools

332ecbd

cmd: Add -aiModels to load models

14deb6a

server: Log oapi validation error

289cb49

temp disable CI tests

48f560c

ci+docker: Use go1.21.5

24c1623

docker: Install zlib

d60b801

temp disable CI arm64 builds

b49d503

rickstaa added 2 commits April 16, 2024 16:19

docs(ai): add optimization flags to docs (#3014)

cea4e94

This commit adds a new section explaining the new `optimization_flags` that were enabled in #3013.

rickstaa marked this pull request as ready for review April 16, 2024 14:41

rickstaa marked this pull request as draft April 16, 2024 14:42

rickstaa and others added 8 commits April 16, 2024 16:43

ci(ai): temporary change build action branch to ai-video

bc629b7

This commit temporarily changes the push branch of the `build.yml` to the `ai-video` branch, since the `ai-video` branch has conflicts with the `master` branch that prevent the containers from being built.

ci(ai): temporary change docker action branch to ai-video

6aa0b00

This commit temporarily changes the push branch of the `docker.yml` to the `ai-video` branch, since the `ai-video` branch has conflicts with the `master` branch that prevent the containers from being built.

ci(ai): fix pull request config warning (#3018)

cecd3a5

This commit gets rid of the Pull request labeler configuration file warning.

fix: flush writer when encoding AI results (fix invalid PNG) (#3020)

2a782ed

This commit flushes the data in the image writer to ensure that all data gets written to the PNG.
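The effect of the missing flush can be reproduced with the standard library alone. The sketch below assumes the image is written through a `bufio.Writer`; the actual writer used on this branch is not shown in the PR text, and `encodePNG` is a hypothetical helper. Without the `Flush` call, bytes still held in the buffer never reach the destination and the resulting PNG is truncated.

```go
package main

import (
	"bufio"
	"bytes"
	"fmt"
	"image"
	"image/png"
)

// encodePNG writes img through a buffered writer and flushes it before
// returning, so the returned bytes contain the complete PNG.
func encodePNG(img image.Image) ([]byte, error) {
	var buf bytes.Buffer
	w := bufio.NewWriter(&buf)
	if err := png.Encode(w, img); err != nil {
		return nil, err
	}
	// Without this Flush, data still buffered in w would be missing from buf.
	if err := w.Flush(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

// demoEncode round-trips a small image and returns the decoded dimensions.
func demoEncode() (int, int, error) {
	img := image.NewRGBA(image.Rect(0, 0, 4, 2))
	data, err := encodePNG(img)
	if err != nil {
		return 0, 0, err
	}
	decoded, err := png.Decode(bytes.NewReader(data))
	if err != nil {
		return 0, 0, err
	}
	b := decoded.Bounds()
	return b.Dx(), b.Dy(), nil
}

func main() {
	w, h, err := demoEncode()
	fmt.Println(w, h, err)
}
```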

ci(ai): add myself as branch CODE OWNER

e1db239

ci(ai): run labeler also on 'pull_request_target'

1643a1e

This commit ensures that the labeler action also runs on a 'pull_request_target' to ensure pull requests from forks are correctly labeled.

ci(ai): cleanup labeler actions

5bb92fa

ci(ai): auto assign AI issues and feature requests

23fdfcb

This commit ensures that all AI related issues and feature requests are assigned to the AI team.

rickstaa force-pushed the ai-video branch from 2db40f7 to 23fdfcb Compare April 18, 2024 17:16

rickstaa and others added 17 commits April 18, 2024 20:23

refactor(ai): add extra devtool input arguments (#3026)

bfccbc4

This commit adds extra devtool input arguments allowing developers to spin up multiple Os on the ETH devnet.

chore: improve devtool documentation and add scripts

865314d

This commit improves the devtool documentation and adds a helpful script if developers want to create multiple Os at the same time.

refactor: log advertised capabilities and price on startup (#3031)

ea82cde

This commit logs the advertised capabilities and price on startup if users have their logging verbosity level set to 6 or higher.

feat(ai): enforce 'aiModels' flag requirement (#3032)

93caa3b

This commit ensures that an error is thrown when users don't specify the 'aiModels' flag but have the 'aiWorker' flag set.
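A flag check of this kind might be sketched as below. Only the flag names `aiWorker` and `aiModels` come from the commit message; the flag types, help strings, error text, and the `parseAIFlags` helper are assumptions made for illustration and are not taken from the go-livepeer source.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
)

// parseAIFlags parses a hypothetical subset of the node's flags and rejects
// the combination "-aiWorker set, -aiModels missing".
func parseAIFlags(args []string) (aiWorker bool, aiModels string, err error) {
	fs := flag.NewFlagSet("livepeer-sketch", flag.ContinueOnError)
	fs.BoolVar(&aiWorker, "aiWorker", false, "run as an AI worker (assumed help text)")
	fs.StringVar(&aiModels, "aiModels", "", "AI models config (assumed help text)")
	if err = fs.Parse(args); err != nil {
		return false, "", err
	}
	if aiWorker && aiModels == "" {
		return aiWorker, "", errors.New("-aiModels must be set when -aiWorker is enabled")
	}
	return aiWorker, aiModels, nil
}

func main() {
	_, _, err := parseAIFlags([]string{"-aiWorker"})
	fmt.Println("missing models:", err)

	_, models, err := parseAIFlags([]string{"-aiWorker", "-aiModels", "models.json"})
	fmt.Println("with models:", models, err)
}
```

Failing at startup keeps a misconfigured worker from advertising itself without any models to serve.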

refactor(ai): improve orch select retry ctx logic (#3039)

72dced7

This commit refines context handling in the orchestrator selection loop for idiomatic Go and enhanced propagation of parent cancellations.

refactor(ai): improve orch retry timeout msg

6fc1afd

This commit improves the orchestrator selection retry ctx timeout msg.

ci(ai): add temporary ai-video latest binary url upload

fb9764b

This commit ensures that the `upload_build.sh` script uploads the latest binary that is deployed to the `ai-video` branch under one URL. This is done to simplify binary installation.

chore(ai): remove temporary AI subnet docs

40a40a5

This commit removes the temporary AI subnet docs now that the final docs have been deployed on https://docs.livepeer.ai/ai/introduction.

fix(ai): fix infinite loop when no Os are found (#3042)

ebd5045

This commit prevents the orchestrator selection goroutine from staying in an infinite loop when no Orchestrators can be found.

chore(ai): update 'ai-worker' dependency

45cf167

This commit updates the 'ai-worker' dependency to the latest commit.

feat: add '-gateway' and deprecate '-broadcaster' (#3048)

180041d

This commit adds the `gateway` flag and deprecates the `broadcaster` flag per core team decision (details: https://discord.com/channels/423160867534929930/1051963444598943784/1210356864643109004).
