March Madness AI

Train the OpenAI API on the results of every March Madness game since 1985, then use the model to predict the outcome of this year's tournament. Great for learning how OpenAI's fine-tuning works.

Credits to danvk for providing the data necessary to train the model.

Getting Started

Install Node.js, Yarn, & other system dependencies.
Install OpenAI's CLI (OpenAI's CLI) locally. This is so you can train your own model. Make sure to set your OpenAI API key as instructed in the article linked above.
Clone this repo, and run yarn to install package dependencies.

Training the Model

In order to train a model based on Davinci, we'll need to create a JSONL file containing the training data. This is done by running yarn run prepare. This will create a file called training-data.jsonl in the root directory with the results of every March Madness game since 1985.

Now that the training data is ready, we can train the model by running yarn run train. This will take a while, but timing depends on the size of the queue at the time of training– it could take around 90 minutes during peak hours. You can read more about the training process here.

Once your model is fully trained, you'll be given a model ID that looks something like davinci:ft-personal-0000-00-00-00-00-00.

Setting Environment Variables

In order to get predictions, you'll need to set environment variables in the .env file. You can do this by running cp .env.example .env and then filling in the values.

OPENAI_API_KEY: Your OpenAI API key. You can find this in your OpenAI dashboard.
OPENAI_MODEL_ID: The model ID you were given when your model finished training, as described above.
DEBUG_MODE: Set this to true if you want to avoid calling the OpenAI API and instead use a random number generator to generate predictions. This is useful for testing the app without using up your API credits.

Running Predictions

Now that you have a trained model and have set your local environment variables, you can run predictions by running yarn run predict. This will generate predictions for every game in this year's tournament. It will begin to output the results of each round as they are completed.

Finally, the results are saved to a file called bracket.txt. You can view the results by running cat bracket.txt.

Based on my results, here is a screenshot of the final bracket:

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
docs		docs
src		src
.editorconfig		.editorconfig
.env.example		.env.example
.eslintignore		.eslintignore
.eslintrc		.eslintrc
.gitignore		.gitignore
.node-version		.node-version
.prettierignore		.prettierignore
README.md		README.md
bracket.txt		bracket.txt
package-lock.json		package-lock.json
package.json		package.json
training-data.jsonl		training-data.jsonl
tsconfig.json		tsconfig.json
yarn.lock		yarn.lock

torreyleonard/march-madness-ai

Folders and files

Latest commit

History

Repository files navigation

March Madness AI

Getting Started

Training the Model

Setting Environment Variables

Running Predictions

About

Resources

Stars

Watchers

Forks

Languages