
Inline completer displays ``` at the end of suggested code when using provider #686

Open
bensonlee5 opened this issue Mar 10, 2024 · 5 comments
Labels: bug (Something isn't working), good first issue (Good for newcomers)

Comments

@bensonlee5

bensonlee5 commented Mar 10, 2024

As flagged in this JupyterLab forum post, the trailing ``` should be removed in post-processing.
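For illustration, this is the kind of raw completion a chat model can return for an inline completion request (hypothetical example; the exact text varies by model). The fenced Markdown wrapper, including the closing ```, is what leaks into the editor:

```python
# Hypothetical raw model output for the prefix "def add_two_numbers(a, b):".
# The Markdown fences are part of the model's reply, not of the code itself:
raw_completion = "```python\ndef add_two_numbers(a, b):\n    return a + b\n```"
```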

bensonlee5 added the bug label Mar 10, 2024

welcome bot commented Mar 10, 2024

Thank you for opening your first issue in this project! Engagement like this is essential for open source projects! 🤗

If you haven't done so already, check out Jupyter's Code of Conduct. Also, please try to follow the issue template as it helps other community members to contribute more effectively.
You can meet the other Jovyans by joining our Discourse forum. There is also an intro thread there where you can stop by and say Hi! 👋

Welcome to the Jupyter community! 🎉

@dlqqq
Collaborator

dlqqq commented Mar 11, 2024

@krassowski Do you want to own this issue? 👀

@krassowski
Member

It looks like the problem is two-fold:

  1. when streaming, post-processing is only used when a problematic prefix is detected:

     ```python
     async for fragment in self.llm_chain.astream(input=model_arguments):
         suggestion += fragment
         if suggestion.startswith("```"):
             if "\n" not in suggestion:
                 # we are not ready to apply post-processing
                 continue
             else:
                 suggestion = self._post_process_suggestion(suggestion, request)
         self.write_message(
             InlineCompletionStreamChunk(
                 type="stream",
                 response={"insertText": suggestion, "token": token},
                 reply_to=request.number,
                 done=False,
             )
         )
     ```
  2. post-processing does not look for suffixes at all (a minimal repro follows this list):

     ```python
     def _post_process_suggestion(
         self, suggestion: str, request: InlineCompletionRequest
     ) -> str:
         """Remove spurious fragments from the suggestion.

         While most models (especially instruct and infill models) do not
         require any pre-processing, some models such as gpt-4 which only
         have chat APIs may require removing spurious fragments. This
         function uses heuristics and request data to remove such fragments.
         """
         # gpt-4 tends to add "```python" or similar
         language = request.language or "python"
         markdown_identifiers = {"ipython": ["ipython", "python", "py"]}
         bad_openings = [
             f"```{identifier}"
             for identifier in markdown_identifiers.get(language, [language])
         ] + ["```"]
         for opening in bad_openings:
             if suggestion.startswith(opening):
                 suggestion = suggestion[len(opening) :].lstrip()
                 # check for the prefix inclusion (only if there was a bad opening)
                 if suggestion.startswith(request.prefix):
                     suggestion = suggestion[len(request.prefix) :]
                 break
         return suggestion
     ```
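A minimal repro of (2), using a standalone copy of the opening-fence logic above (a hypothetical harness for illustration, not jupyter-ai code):

```python
from types import SimpleNamespace

def post_process(suggestion: str, request) -> str:
    # standalone copy of the opening-fence logic from _post_process_suggestion
    language = request.language or "python"
    markdown_identifiers = {"ipython": ["ipython", "python", "py"]}
    bad_openings = [
        f"```{identifier}"
        for identifier in markdown_identifiers.get(language, [language])
    ] + ["```"]
    for opening in bad_openings:
        if suggestion.startswith(opening):
            suggestion = suggestion[len(opening):].lstrip()
            if suggestion.startswith(request.prefix):
                suggestion = suggestion[len(request.prefix):]
            break
    return suggestion

request = SimpleNamespace(language="python", prefix="def add(a, b):\n")
print(repr(post_process("```python\nreturn a + b\n```", request)))
# prints 'return a + b\n```' (opening fence stripped, closing fence survives)
```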

I think (2) is easy to solve; I am happy to review a PR from new contributors here if anyone is interested. Ideally the PR would come with a new test case in packages/jupyter-ai/jupyter_ai/tests/completions/test_handlers.py.
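For anyone picking this up, here is a sketch of what the suffix handling and a matching test could look like (illustrative only; the helper name, its placement, and the test structure are assumptions, not the merged fix):

```python
def strip_closing_fence(suggestion: str) -> str:
    """Hypothetical helper: drop a trailing Markdown fence, mirroring the
    existing handling of bad openings in _post_process_suggestion."""
    stripped = suggestion.rstrip()
    if stripped.endswith("```"):
        return stripped[: -len("```")].rstrip()
    return suggestion

# A test case in the spirit of what could be added to
# packages/jupyter-ai/jupyter_ai/tests/completions/test_handlers.py:
def test_closing_fence_is_stripped():
    assert strip_closing_fence("return a + b\n```") == "return a + b"
    # completions without a fence must be left untouched
    assert strip_closing_fence("return a + b") == "return a + b"
```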

krassowski added the good first issue label Mar 12, 2024
@srdas
Collaborator

srdas commented Mar 20, 2024

@dlqqq @krassowski Thanks for bringing up this issue. I did a detailed analysis and provide my thoughts below.
The issue is that, for the example shown, the JupyterLab completer inserts ``` at the end of the generated code snippet. Here is a visual:
[screenshot: completion with a trailing ``` inserted at the end of the snippet]

Installed jupyterlab-transformers-completer based on https://github.com/krassowski/jupyterlab-transformers-completer.

Replicated this error on macOS for the example above using both openai-chat:gpt-3.5-turbo and bedrock-chat:anthropic.claude-instant-v1. As shown below for the Claude case:
[screenshot: the same trailing ``` with Claude Instant]

But it does not always give the same result (see the two examples below):
[screenshot: two examples with differing results]

Here we mimic the add_two_numbers case but get different results:
[screenshot: a similar prompt where the completion keeps the backticks]
Here we actually want the backticks. I replicated the same results on Linux as well, so at least we know this is neither LLM-specific nor OS-specific.

Clearly, adding special code to remove the trailing ``` as suggested in https://github.com/jupyterlab/jupyter-ai/issues/686#issuecomment-1991202710 will only handle one edge case, which may be the outcome of some LLM completions but not all. It may not be desirable to build special-case code for a single edge case into jupyter-ai. Maybe this needs to be handled in JupyterLab, if at all.

Further, the outcome of the transformer completion also depends on the Settings in JupyterLab. In the Settings dropdown, select Settings Editor.
[screenshot: Settings menu with Settings Editor highlighted]
Then select Inline Completer.
[screenshot: Settings Editor with the Inline Completer section]
If you scroll down the Inline Completer panel, you get three boxes where transformer completion can be “Enabled”, as shown below. The issue can be reproduced when only the third box is checked:
[screenshot: Inline Completer settings with only the third “Enabled” checkbox ticked]

Checking any other configuration of the checkboxes generates other behavior; for example, checking the second and third boxes gives the following examples:
[screenshot: completions with the second and third boxes checked]

Checking the first and third boxes gives different behavior as well:
[screenshot: completions with the first and third boxes checked]

Checking all three boxes again alters behavior:
[screenshot: completions with all three boxes checked]

A slightly different experience may arise depending on machine, environment, and code completion use cases.

We may not want to change jupyter-ai code to handle varying responses from the transformer used for JupyterLab completions. Also, since completions alter the UX, it may be better to have all three “Enabled” checkboxes off by default, as some users will want completions and others will not.

We need to think through and discuss the UX and the generality of the code changes before going ahead.

@krassowski
Member

> Maybe this needs to be handled in JupyterLab, if at all

No, we do not want to special-case anything in JupyterLab.

I don't think the LLMs should add examples like the one in your second snapshot.

Here is my point of view: there are two issues:

  1. specific LLMs need a different template to coerce them into producing only code, and the template is configurable
  2. the default post-processing should handle the case of returned code being wrapped in extra backticks; nothing more, nothing less

> Also, since completions alter the UX, it may be better to have all three “Enabled” checkboxes off by default, as some users will want completions and others will not.

I do not understand how this relates to this issue.

bartleusink added a commit to bartleusink/jupyter-ai that referenced this issue Apr 11, 2024
bartleusink added a commit to bartleusink/jupyter-ai that referenced this issue Apr 11, 2024
dlqqq pushed a commit to bartleusink/jupyter-ai that referenced this issue Apr 15, 2024
dlqqq pushed a commit to bartleusink/jupyter-ai that referenced this issue Apr 22, 2024
dlqqq pushed a commit to bartleusink/jupyter-ai that referenced this issue Apr 22, 2024
dlqqq pushed a commit that referenced this issue Apr 22, 2024
* Remove closing markdown identifiers (#686)

* Remove whitespace after closing markdown identifier

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>