Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Would you like to train a manim code generation model yourself ? #4

Open
Yangget opened this issue Apr 14, 2023 · 10 comments
Open

Would you like to train a manim code generation model yourself ? #4

Yangget opened this issue Apr 14, 2023 · 10 comments

Comments

@Yangget
Copy link

Yangget commented Apr 14, 2023

Hello macky.

These days I am thinking about how to make a more general path of text -> mainm code -> video.
Since the gpt code and weights are not open, I plan to train a model by myself.
We can work together if you want.

@360macky
Copy link
Owner

Hello @Yangget! Sorry about delayed answer. Sure, I'd like to hear your idea. Feel free to share it in a new reply for this issue, or on my personal email: hello@marceloarias.com.

@Yangget
Copy link
Author

Yangget commented Apr 18, 2023

I have sent an email

@waizwafiq
Copy link

Hi @Yangget and @360macky ! I'm currently working on a project that involves converting text queries to Manim-specific codes, just like @Yangget mentioned. Then, I stumbled upon this repository. Hahaha.

I would really love to collaborate with you two regarding this project and bring it to life. It would help increase the precision and flexibility of a no-code to low-code programmatic animation platform.

I'll send you guys an email soon so we can discuss on this further! 😄

@Yangget
Copy link
Author

Yangget commented May 4, 2023 via email

@360macky
Copy link
Owner

360macky commented May 4, 2023

Hi @waizwafiq!

That sounds great. It's a good idea that I'm working on.

You can add me to Discord, if it's okay for you: 360macky • Marcelo#7552

@360macky 360macky closed this as not planned Won't fix, can't repro, duplicate, stale Aug 26, 2023
@360macky 360macky reopened this Mar 18, 2024
@Yangget
Copy link
Author

Yangget commented Mar 19, 2024

Let's create a Discord Server !

@360macky @waizwafiq

@Puiching-Memory
Copy link
Contributor

Hi!
Can you tell me your latest news? I am very curious about this project

@360macky
Copy link
Owner

360macky commented May 7, 2024

Hi! Can you tell me your latest news? I am very curious about this project

Hi @Puiching-Memory!

We are currently testing an implementation of a flow to generate code inspired by LangGraph:
image

Our Discord server is small, I invite you to join us. I am updating this experiment continuously these weeks.

@Puiching-Memory
Copy link
Contributor

This link shows that the invitation is invalid, it could be that the link has expired, or there is a problem with my network.
Your idea inspired me. This is my first contact with LangGraph, and I can't help you for the time being.

Recently I have been working on a similar task, hoping to use GPT to drive manim to generate animations. Existing GPT's still make frequent errors. I've heard that there are ways to fine-tune an LLM to turn it into an expert model. I wonder if this method is suitable for the present situation.

If so, what data sets do we need to prepare and which fine-tuning framework should we choose?

@360macky
Copy link
Owner

360macky commented May 9, 2024

This link shows that the invitation is invalid, it could be that the link has expired, or there is a problem with my network. Your idea inspired me. This is my first contact with LangGraph, and I can't help you for the time being.

Recently I have been working on a similar task, hoping to use GPT to drive manim to generate animations. Existing GPT's still make frequent errors. I've heard that there are ways to fine-tune an LLM to turn it into an expert model. I wonder if this method is suitable for the present situation.

If so, what data sets do we need to prepare and which fine-tuning framework should we choose?

Thanks for the heads up on the Discord link. It's updated!

I'm glad to read that the idea inspires you too. I think there are many systems to explore, so I couldn't determine which is best at this point. We started with Zero-Shot Learning, but Fine-Tuning and LangGraph are ideas we explored. Then we will evaluate which is better.


For the datasets: We would need datasets that contain the following properties: prompt, code, type.

Basically describe as a human would, a video scene or an image scene for the prompt, categorize it into what would be a video or an image, and the code itself. After this I plan to transform all the code and the prompt (which, in addition to the instruction, will specifically include the type of media to be generated) into the JSONL format that is used for fine tuning.

We have 27 Python files (from Manim Community), usually at least 50+ are preferred (at this point: more is better).

Feel free to propose new ideas, add new files to our recent small dataset, make new things on this workflow or follow the project closely. I'm happy to answer if you have more questions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants