Autogrammer

Autogrammer is a library that solves the problem of controlling and constraining the output of language models (LLMs) to generate syntactically valid code, JSON, or SQL.

By leveraging pre-defined patterns called grammars, Autogrammer ensures that the LLM generates output that adheres to the desired structure and syntax, even with smaller models. This enables interesting use cases such as live code generation in the browser, natural language to SQL conversion, generating visualizations from text descriptions, and offline or privacy-focused scenarios.

Key features:

Bring your own model: Works seamlessly with Transformers.js, web-llm, and REST endpoints for llama.cpp and llamafile, allowing you to use your preferred LLM.
JSON and SQL support with syntactic validity guaranteed by GBNF grammars.
Provide schemas (database schema or JSON schema) to further restrict possible output and ensure semantic correctness.
(Coming soon) Plugins enable additional functionality, such as RAG, on-the-fly error correction, and more.

Installation

npm install autogrammer

Quickstart

import { pipeline } from '@xenova/transformers'
import { Autogrammer } from 'autogrammer'

// Load your preferred model
const model = pipeline('text-generation', 'Xenova/gpt2')

// Create an Autogrammer instance for JSON output
const autogrammer = new Autogrammer({
  language: 'json',
})

// Provide a prompt to the model telling it to generate JSON
const prompt = 'Write me JSON that captures the following address: 1600 Pennsylvania Avenue NW, Washington, DC 20500'

// Execute the prompt with Autogrammer
const response = await autogrammer.execute(prompt)

// Output the generated JSON
console.log(response) // { ... json object }

Packages

Autogrammer is made up of a number of subpackages that each offer a sliver of functionality:

gbnf - parses a GBNF grammar into a graph of rules, which can be used to determine the validity of a next token
json2gbnf - generates a GBNF grammar for JSON, with optional JSON schema
sql2gbnf - generates a GBNF grammar for SQL, with optional database schema
contortionist - implements a Logits post-processor that restricts LLM output to only include valid next tokens
autogrammer - provides top-level support for SQL and JSON grammar generation with arbitrary LLM models.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 450 Commits
.github/workflows		.github/workflows
.vscode		.vscode
packages		packages
.deepsource.toml		.deepsource.toml
.gitignore		.gitignore
.gitmodules		.gitmodules
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

.vscode

.vscode

packages

packages

.deepsource.toml

.deepsource.toml

.gitignore

.gitignore

.gitmodules

.gitmodules

.pre-commit-config.yaml

.pre-commit-config.yaml

LICENSE

LICENSE

Makefile

Makefile

README.md

README.md

pnpm-lock.yaml

pnpm-lock.yaml

pnpm-workspace.yaml

pnpm-workspace.yaml

tsconfig.json

tsconfig.json

Repository files navigation

Autogrammer

Installation

Quickstart

Packages

License

About

Releases 23

Contributors 2

Languages

License

thekevinscott/Autogrammer

Folders and files

Latest commit

History

Repository files navigation

Autogrammer

Installation

Quickstart

Packages

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages