
[Misc] Logits processor plugins #4769

Open
wants to merge 4 commits into main
Conversation

NadavShmayo (Contributor) commented May 11, 2024

This pull request adds support for logits processor plugins, which makes implementing custom logits processors easy and removes the need to modify vLLM directly.

For example, with this pull request all of the guided decoding features could be implemented in a standalone Python package installed in the same virtualenv as vLLM, without changing vLLM's source code.

Example code for a logits processor plugin that, given a token ID, multiplies its logit by 100:

from pydantic import BaseModel


class MyParameters(BaseModel):
    # Per-request parameters, validated and parsed from the request body.
    token_id: int


class MyLogitsProcessor:
    def __init__(self, tokenizer, parameters: MyParameters):
        self.tokenizer = tokenizer
        self.parameters = parameters

    def __call__(self, token_ids, logits):
        # Clone so the original logits tensor is left untouched.
        new_logits = logits.clone()
        new_logits[self.parameters.token_id] *= 100
        return new_logits


# The entry-point object that vLLM looks up when loading the plugin.
LOGITS_PROCESSOR_PLUGIN = {
    'logits_processor_class': MyLogitsProcessor,
    'parameters_model': MyParameters,
}
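
For illustration, here is a quick smoke test of the processor above (a minimal sketch, not part of this pull request; it assumes logits is a 1-D torch tensor over the vocabulary, as in vLLM's per-sequence logits processor interface):

import torch

# Hypothetical sanity check for the example plugin; the class names come
# from the plugin code above, the tensor values are made up.
params = MyParameters(token_id=2)
processor = MyLogitsProcessor(tokenizer=None, parameters=params)

logits = torch.tensor([0.1, 0.2, 0.3, 0.4])
boosted = processor(token_ids=[], logits=logits)
assert boosted[2] == logits[2] * 100  # only token 2's logit is scaled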

And the setup.py file for the package should look something like this:

from setuptools import setup

setup(
    name='example_logits_processor',
    version='0.1',
    install_requires=[
        'pydantic>=1.8.2',
    ],
    entry_points={
        'vllm.logits_processors': [
            'example_plugin=example_plugin.main:LOGITS_PROCESSOR_PLUGIN',
        ],
    },
)
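
For context, plugins registered this way can be found through the standard entry-point machinery. A sketch of roughly how that lookup works (using importlib.metadata from Python 3.10+; this is illustrative, not necessarily the exact loading code in this pull request):

from importlib.metadata import entry_points

# Collect every installed plugin registered under the
# vllm.logits_processors group; ep.load() imports example_plugin.main
# and returns the LOGITS_PROCESSOR_PLUGIN dictionary defined there.
plugins = {}
for ep in entry_points(group='vllm.logits_processors'):
    plugins[ep.name] = ep.load()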

With this pull request, vLLM loads all installed plugins at startup, and each inference request can opt into custom logits processors via the logits_processors field in the request body.
The parameters_model in the plugin dictionary is used to validate and parse the matching parameters in the request body.
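
As an illustration, a request opting into the example plugin might look like this (a sketch only: the endpoint and the exact shape of the logits_processors payload are assumptions, not confirmed by this pull request beyond the field name):

import requests

# Hypothetical request against a locally running vLLM OpenAI-compatible
# server; the structure of "logits_processors" is assumed for illustration.
response = requests.post(
    'http://localhost:8000/v1/completions',
    json={
        'model': 'my-model',
        'prompt': 'Hello',
        'logits_processors': {
            'example_plugin': {'token_id': 42},
        },
    },
)
print(response.json())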

I will soon add a documentation page to this pull request explaining how to implement custom logits processors.

@rkooo567 self-assigned this May 13, 2024
NadavShmayo (Contributor, Author) commented:

I added some documentation about this feature :)

simon-mo (Collaborator) commented:

@mmoskal @noamgat @br3no curious about your feedback on this!

mmoskal (Contributor) commented May 15, 2024

This looks cool: a distribution mechanism for logits processors. When #4775 gets merged, this PR will need to be updated to support the more generic interface.

noamgat (Contributor) commented May 15, 2024

I am very much in favor of this approach. A few months ago I tried to get a similar concept into huggingface-tgi (huggingface/text-generation-inference#1274), but I have since switched to vLLM :)

br3no (Contributor) commented May 15, 2024

I like this idea. And I agree with @mmoskal that it would be important to support the more involved API being worked on in #4775.

I wonder, though, how one would implement OpenAI API tool use if guided decoding were provided by such a plugin. The code in the OpenAI server depends on the guided decoding backend and needs to know how to transform the OpenAI-API-conformant parameters into valid guided decoding parameters (c.f. #4656).

Supporting the OpenAI API as thoroughly as possible is very valuable and should not be sacrificed for software-architectural reasons.

So we can either define guided decoding as a core vLLM feature that is out of scope for logits processor plugins, or we can think about also making the frontend part necessary to "correctly" use the plugins pluggable. The latter would be a challenging endeavor.

NadavShmayo (Contributor, Author) commented:

Thank you for the feedback, everyone.

Regarding @br3no's response: it's a good point. As a first step, I believe it makes sense to keep the guided decoding code as core vLLM logic, especially since it is already implemented that way.

I will think about how it could be implemented as plugins while still allowing tool calling, but I believe this pull request is valuable either way :)
