Long_Video_Generation

A pipeline for generating long videos from a text prompt
Xinchen Zhang
Tsinghua University

Pipeline

Gallery

A spectacular waterfall	A car driving down the road

Astronauts traveling in space	A cat looking out the window

Inference

Before inference, use an LLM to split the prompt into segments and to write a detailed description of each segment.

We provide a template in template.txt. Copy and paste the template into ChatGPT to get the generated prompts.
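Preparing the text to paste can be scripted. The sketch below is an illustration under a loud assumption: the layout of template.txt is not described here, so the hypothetical `compose_request` helper simply appends the user prompt after the template text. Adapt it if the template marks a specific spot for the prompt.

```python
from pathlib import Path


def compose_request(template_text, user_prompt):
    """Join the template and the user's prompt into one message (assumed layout)."""
    return template_text.rstrip() + "\n\n" + user_prompt.strip() + "\n"


if __name__ == "__main__":
    template_path = Path("template.txt")  # file name taken from this README
    if template_path.exists():
        # The prompt is one of the gallery captions, used only as an example.
        template_text = template_path.read_text(encoding="utf-8")
        print(compose_request(template_text, "A spectacular waterfall"))
```

The printed text can then be pasted into ChatGPT as described above.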

We offer two ways to generate a long video. If you choose I2VGen-XL as the backbone, run:

python pipeline_i2vgenxl.py --seed 1234 --fps 16

If you choose SVD as the backbone, run:

python pipeline_svd.py --seed 1234 --fps 16
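The two commands differ only in the script name, so a small launcher can select the backbone. The sketch below is hypothetical and not part of the repository: `build_command`, `run_pipeline`, and the `BACKBONE_SCRIPTS` keys are illustrative names, and it uses only the `--seed` and `--fps` flags shown above.

```python
import subprocess
import sys

# Script names come from the commands above; the dictionary keys are illustrative.
BACKBONE_SCRIPTS = {
    "i2vgenxl": "pipeline_i2vgenxl.py",
    "svd": "pipeline_svd.py",
}


def build_command(backbone, seed=1234, fps=16):
    """Return the argument list for the chosen backbone (hypothetical helper)."""
    if backbone not in BACKBONE_SCRIPTS:
        raise ValueError(f"unknown backbone: {backbone!r}")
    script = BACKBONE_SCRIPTS[backbone]
    return [sys.executable, script, "--seed", str(seed), "--fps", str(fps)]


def run_pipeline(backbone, seed=1234, fps=16):
    """Run the pipeline script and raise if it exits with a non-zero status."""
    subprocess.run(build_command(backbone, seed, fps), check=True)


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects.
    print(" ".join(build_command("svd")))
```

Calling `run_pipeline("i2vgenxl")` from the repository root would launch the I2VGen-XL script with the default seed and frame rate.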

After that, we use EMA-VFI to interpolate additional frames between the frames of the generated video.
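Frame interpolation changes the frame count and, if the clip duration is to stay the same, the playback rate. The arithmetic below is a sketch under assumptions: the interpolation factor and the clip length are illustrative values that this README does not state, and the function names are hypothetical. It does not call EMA-VFI.

```python
def interpolated_frame_count(num_frames, factor):
    """Frames after inserting (factor - 1) new frames between each adjacent pair."""
    if num_frames < 1 or factor < 1:
        raise ValueError("num_frames and factor must be positive")
    return (num_frames - 1) * factor + 1


def playback_fps(fps, factor):
    """Frame rate that keeps the clip duration roughly unchanged after interpolation."""
    return fps * factor


if __name__ == "__main__":
    # Illustrative values; only the frame rate of 16 appears in the commands above.
    frames, fps, factor = 64, 16, 2
    print(interpolated_frame_count(frames, factor), playback_fps(fps, factor))
```

With these assumed values, a 64-frame clip at 16 fps becomes 127 frames, to be played at 32 fps.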
