Transformer Day: Get miniGPT up and running #177

Open
6 tasks
ramon-astudillo opened this issue Jun 6, 2023 · 0 comments

Objective: Integrate minGPT (https://github.com/karpathy/minGPT) into the toolkit.

  • Add a basic notebook labs/notebooks/transformers.ipynb that loads a model with minGPT's from_pretrained and runs decoding on an example
    • Port only the minGPT code needed (e.g. under labs/lxmls/miniGPT).
      • Less code is better for clarity. Code is already pretty compact so it may be ok as is.
      • Add License and ensure we attribute properly
    • Load a basic model (GPT-2 large? XL?)
    • Check that a basic text completion exercise works, where we can give different prefixes and the completions make more or less sense
    • Other proposals @venelink @gonmelo ?
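
As a starting point for the notebook, the decoding step above could look roughly like the sketch below. It assumes the upstream package layout (`mingpt.model.GPT`, `mingpt.bpe.BPETokenizer`); the import path would change once the code is ported under labs/lxmls/miniGPT. The helper names `load_mingpt`, `complete` and `compare_prefixes` are hypothetical, and the model size is left as a parameter since the choice (GPT-2 large or XL) is still open.

```python
def load_mingpt(model_type="gpt2"):
    """Load pretrained GPT-2 weights through minGPT.

    Assumes the upstream `mingpt` package is importable; this needs torch,
    transformers, and network access to fetch weights and BPE files.
    """
    # Imported lazily so the helpers below do not require minGPT at import time.
    from mingpt.model import GPT
    from mingpt.bpe import BPETokenizer

    model = GPT.from_pretrained(model_type)  # e.g. "gpt2", "gpt2-large", "gpt2-xl"
    model.eval()  # inference mode: disables dropout
    return model, BPETokenizer()


def complete(prefix, model, tokenizer, max_new_tokens=20, do_sample=True, top_k=40):
    """Return the prefix followed by its generated continuation as text."""
    x = tokenizer(prefix)  # token ids with a leading batch dimension of 1
    y = model.generate(x, max_new_tokens=max_new_tokens,
                       do_sample=do_sample, top_k=top_k)
    return tokenizer.decode(y[0])  # decode the single sequence in the batch


def compare_prefixes(prefixes, model, tokenizer, **kwargs):
    """Run `complete` on several prefixes so their completions can be compared."""
    return {p: complete(p, model, tokenizer, **kwargs) for p in prefixes}


# Hypothetical usage in the notebook (downloads weights on first run):
#   model, tokenizer = load_mingpt("gpt2")
#   for prefix, text in compare_prefixes(
#           ["The capital of Portugal is", "Once upon a time"],
#           model, tokenizer).items():
#       print(repr(prefix), "->", text)
```

Keeping the model and tokenizer as explicit arguments means the same helpers work unchanged whichever GPT-2 size is picked, and they can be exercised with small stand-in objects before the real weights are downloaded.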

Branch: https://github.com/LxMLS/lxmls-toolkit/tree/transformer-day

Expected finishing date: ideally before the June 12 meeting; if not, during that week.

More ambitious objectives would include

  • Add a fine-tuning example