Skip to content

DavidSouther/ailly

Repository files navigation

Ailly: AI Writing Ally

Load your writing. Guide Ailly to your voice. Write your outline. Prompt Ailly to continue to keep writing. Edit its output, and get even more text like that.

Rhymes with daily.

Ailly's best feature is rapid prompt engineering iteration. By keeping your prompts in snippets on the file system, you can make very fine-grained changes to your prompt and immediately see the difference in the output. You can also use all of your normal source control tooling to track changes over time: both your changes, and those from the LLM.

CLI Quickstart

To get started on the command line, follow these steps:

  1. Ask for a joke - npx @ailly/cli --prompt 'Tell me a joke'
  2. Create a folder named jokes and change directory into it.
  3. Create a file named 10_chickens.md with "Tell me a joke about chickens" as the content.
  4. Run Ailly using NodeJS: npx @ailly/cli
    • See the joke in 10_chickens.md.ailly.md
  5. Create a file named .aillyrc with "You are a farmer writing jokes for your other barnyard animals."
    • Include other system prompts, level setting expectations. etc.
    • Run Ailly with the same command, and see how the joke changes.
  6. Create more numbered files, such as 20_knock_knock.md with "Turn the chicken joke into a knock knock joke."
  7. Run Ailly using NodeJS: npx @ailly/cli 20_knock_knock.md
    • 20_knock_knock.md.ailly.md will have the new knock knock joke based on the chicken joke it first wrote!

System Context

System prompts provide grounding and background information for LLMs. There are a number of techniques and "best practices" for developing LLM system prompts. In Ailly, these are in files with the name .aillyrc, and apply to all files in the current folder when using Ailly. These files can also specify properties that control how the LLM prompts are constructed, again for every file.

Properties

Ailly generates LLM responses for one file at a time. It prepares the conversational history and context by using the file system and folders the file is in. There are variations in how Ailly composes the prompt, which can be controlled with several properties.

You can set these properties in a combination of locations, including the command line, .aillyrc files, and greymatter in each file. Later settings override earlier settings.

  • combined: boolean If true, the file's body is the response and the prompt is in the greymatter key prompt. If false, the file's body is the prompt and the response is written to {file_name}.ailly.md. The default is false.
  • skip: boolean If true, the prompt is not sent through the LLM (but is part of the context).
  • isolated: boolean If true, the LLM inference only includes the system prompt, and not the prior context in this folder.
  • context conversation | folder | none
    • conversation (default, unless editing) loads files from the root folder and includes them alphabetically, chatbot history style, before the current file.
    • folder includes all files in the folder at the same level as the current file. (Default when editing.)
    • none includes no additional content (including no system context).
  • parent root | always | never
    • root (default) start the chain of system prompts from the loaded .aillyrc file.
    • always include the .aillyrc file in the parent directory as part of this system prompt.
    • never don't include any other system prompts.
    • Note: always goes up one level, and then parent gets reapplied. To include several ancestors, have parent: always in each, with root as the base .aillyrc of the project.
  • template-view [path] loads variables to use with mustache templates in your prompts.

PLAN

PLAN to use Ailly effectively. Iterate often. Provide context. Put words in Ailly's mouth.

  • Prepare a precise prompt (by writing an aillyrc system prompt, providing supporting documents, and specifying individual prompt steps).
  • Leverage LLM models (by running Ailly on some or all parts of the context chain).
  • Assess the generated content (as Ailly and the LLM write output, make sure they're on the right track).
  • Narrow your context (by editing Ailly's generated content to keep the conversation going where you want it to go).

Engines

To choose an engine, export AILLY_ENGINE=[bedrock|openai] or provide ailly --engine on the command line.

VSCode extension

The VSCode extension provides two commands, Generate and Edit.

Generate runs Ailly on a file or folder. You can activate it with either the right-click context menu or the command palette.

  • When you run Ailly from the context menu, it runs on the file with its parent folder as the root, or on an entire folder using the folder as the root.

  • When you run Ailly from the command prompt, it runs on the current active file (or does nothing if there's no active editor).

Edit runs from the command prompt on the current active file (or does nothing if there's no active editor).

  • Edit uses the current file and its root, in the folder context, and includes the current highlighted selection as the edit range.

  • With multiple selections, it only uses the first selection.

Installing the Ailly extension

Update May 8, 2024: The extension is available as an artifact in the most recent workflow run.

To install the Ailly extension:

  • Clone the repo and install dependencies.
    • git clone https://github.com/davidsouther/ailly.git ; cd ailly ; npm install
  • Package the extension with npm run package.
  • In VSCode extensions, install ./extension/ailly-0.*.*.vsix from vsix.
  • Right-click a file in the content explorer and select Ailly: Generate.
  • Make a selection and edit it with Ailly: Edit in the command palette.

The Ailly extension and Bedrock

When using the Bedrock engine, the Ailly extension uses your current AWS configuration. It also exposes two settings, Ailly: Aws Profile and Ailly: Aws Region, to choose which profile and (optionally) which Region to use for Bedrock requests. Use aws configure or the equivalent command to configure ~/.aws/config and ~/.aws/credentials. If Ailly shows a credentials error, check this setting first.

Developing

See DEVELOPING.md for details on how to run and debug various Ailly components.

Ailly plugins

Ailly can use plugins to provide additional data when calling the LLM models using retrieval augmented generation (RAG).

The default plugin rag (AILLY_PLUGIN=rag, --plugin=rag) uses a vectra database in ./vectors/index.json.

You can provide custom plugins with --plugin=file:/absolute/path/to/plugin.mjs. This plugin must export a default factory function matching the PluginBuilder interface in core/src/plugin/index.ts.

Conversational history

In LLM chat interfaces like ChatGPT or chains like Langchain, the history of the conversation remains in the sequence of interactions between the human and the assistant.

This history is typically in a format that is inaccessible to the user. The user can only regenerate sequences, or add their next prompt at the end.

Ailly removes this limitation by using your file system as the conversational history.

You maintain full control over the sequence of prompts, including editing the LLM's response before (re)generating the next prompt so you can decide how the conversation should evolve.

By editing an LLM prompt, you can keep the best of what the LLM produced and modify the rest. Using this filesystem-based conversational history, Ailly stores each piece of the session in source control. With version tracking, you can see how your prompts and responses have changed over time, and unlock long-term process improvements that are difficult or even impossible with chat interfaces.

In one session, a developer was working on a long sequence of prompts to build a software project. While reviewing an LLM-generated draft of the README, the developer wanted the list of API calls to be links to the reference documentation. With a chat conversational history, the developer would have needed to modify the instructions for the entire prompt to encourage creating the list, rerun the generation, and hoped the rest of the README came out similarly.

Instead, with Ailly, the developer created a new file with only the list and an instruction on how to create URLs from the list items, saved it as list.md (with isolated: true in the combined head), and ran ailly list.md.

The LLM followed the instructions, generated just the updated list, and the developer copied that list into the original (generated) README.md. In later prompts, the context window included the entire list of URLs, and the agent model could intelligently request to download their contents.

To the author's knowledge, no other LLM interface provides this level of interaction with LLMs.