Modifiy post processing in chat #698

srdas · 2024-03-23T20:14:56Z

Fixes #686

Removed spurious fragments from the suggestion, like backticks

    While most models (especially instruct and infill models do not require
    any pre-processing), some models such as gpt-4 which only have chat APIs
    may require removing spurious fragments (such as backticks).
    This function uses heuristics and request data to remove such fragments.
    It comments out all lines that are not code, irrespective of programming language.

Changes made to packages/jupyter-ai/jupyter_ai/tests/completions/test_handlers.py

All lines with backticks have been removed.
All text outside code blocks is commented out.

Result looks like the following:

Remove spurious fragments from the suggestion. While most models (especially instruct and infill models do not require any pre-processing), some models such as gpt-4 which only have chat APIs may require removing spurious fragments (such as backticks). This function uses heuristics and request data to remove such fragments. It comments out all lines that are not code, irrespective of programming language.

for more information, see https://pre-commit.ci

krassowski

There are many languages where comments are written differently. This would generate invalid code for these languages.

Why not just modify the previous logic to strip the triple backtick suffix? Why do you want to ignore language? There is a very different use case of generating suggestions for markdown where having triple backtick is desired and of generating Python (or other) code.

krassowski · 2024-03-24T16:41:44Z

In general, if an LLM wants to generate comments, it should generate comments as valid language-specific syntax. It is not responsibility of jupyter-ai to transform any spurious text to comments. If jupyter-ai were to do that, one would need to write (and maintain) a very extensive multi-language logic to handle it. Instead, a LLM should be prompted differently (to only return non-code as comments with syntax matching given language). If a given LLM cannot do that, user should be advised that this is not a good LLM for code completions (ideally we would have two lists, one for chat and one for code completion; some models would be on both, but many are fine tuned for one or the other task).

srdas · 2024-04-12T16:47:32Z

Closing this PR as discussed here: #726 (review)

srdas and others added 2 commits March 23, 2024 13:05

[pre-commit.ci] auto fixes from pre-commit.com hooks

e1e42fe

for more information, see https://pre-commit.ci

srdas added enhancement New feature or request good first issue Good for newcomers bug Something isn't working and removed enhancement New feature or request labels Mar 23, 2024

krassowski removed the good first issue Good for newcomers label Mar 24, 2024

krassowski requested changes Mar 24, 2024

View reviewed changes

srdas mentioned this pull request Apr 12, 2024

Remove trailing Markdown code tags in completion suggestions #726

Merged

srdas closed this Apr 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modifiy post processing in chat #698

Modifiy post processing in chat #698

srdas commented Mar 23, 2024

krassowski left a comment •

edited

krassowski commented Mar 24, 2024

srdas commented Apr 12, 2024

Modifiy post processing in chat #698

Modifiy post processing in chat #698

Conversation

srdas commented Mar 23, 2024

krassowski left a comment • edited

Choose a reason for hiding this comment

krassowski commented Mar 24, 2024

srdas commented Apr 12, 2024

krassowski left a comment •

edited