
Investigate/document how to use with OpenRouter #1099

Closed
ErikBjare opened this issue Apr 2, 2024 · 6 comments

@ErikBjare
Collaborator

As I mentioned in #1082 (review), OpenRouter is an easy way to run lots of different LLMs through an OpenAI-compatible API (notably recommended by Aider).

We should consider investigating this as a user-friendly way to use open models without needing your own hardware, and document it accordingly.
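For context on what "OpenAI-compatible" buys us here: a plain HTTP POST to OpenRouter's chat completions endpoint follows the standard OpenAI request shape. A minimal sketch, assuming the documented `https://openrouter.ai/api/v1` base URL; the model slug and `OPENROUTER_API_KEY` variable name are illustrative, not from this thread:

```python
import json
import os
import urllib.request

# Sketch of a chat completion request against OpenRouter's
# OpenAI-compatible endpoint. The model slug used below is an
# example; check openrouter.ai for current model names.
OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"

def build_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an OpenAI-style chat completion request."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        OPENROUTER_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {os.environ.get('OPENROUTER_API_KEY', api_key)}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_request("meta-llama/llama-3-70b-instruct", "Say hello", "sk-example")
# urllib.request.urlopen(req) would send it; skipped here to stay offline.
```

Since the request shape is identical to OpenAI's, any OpenAI client (the `openai` package, LangChain, etc.) can be pointed at this base URL instead.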

@zigabrencic did some investigation on this. Opening this issue to track our investigation and potentially document for users how to use it.

One notable downside of it, and of open models generally, is that less powerful models may lead to all kinds of bugs. We might want to mention this in the docs so that users are aware and don't open spurious bug reports.

@ErikBjare ErikBjare added the documentation Improvements or additions to documentation label Apr 2, 2024
@zigabrencic
Collaborator

@viborc you can assign me to this one. I'll start looking into it towards the end of the week.

@viborc
Collaborator

viborc commented Apr 3, 2024

Done, thanks, @zigabrencic and @ErikBjare!

@TheoMcCabe
Collaborator

@zigabrencic
Collaborator

zigabrencic commented Apr 15, 2024

@TheoMcCabe that's great. Thanks for sharing the post.

The issue I came across with gpte was that specific open model {sizes, types} result in rather poor gpte performance. Running on a local machine has the same issue as OpenRouter does.

In other words: making the API calls to OpenRouter via LangChain is the easy part; making gpte work with open models is, well, harder. Hopefully running new gpte benchmarks with open models gives us some clarity here.
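To illustrate why the API side is the easy part: LangChain-style OpenAI clients generally honour the standard OpenAI environment variables, so redirecting them to OpenRouter can be done without code changes. A sketch under that assumption (the env-var names are the standard OpenAI ones; whether the gpte version you run reads them should be verified):

```python
import os

def use_openrouter(api_key: str) -> dict:
    """Point an OpenAI-compatible client (e.g. LangChain's ChatOpenAI,
    which gpte builds on) at OpenRouter via environment variables.
    Assumption: the tool reads OPENAI_API_BASE / OPENAI_API_KEY."""
    env = {
        "OPENAI_API_BASE": "https://openrouter.ai/api/v1",
        "OPENAI_API_KEY": api_key,
    }
    os.environ.update(env)
    return env

use_openrouter("sk-or-example")  # then launch gpte with an OpenRouter model name
```

The hard part the comment refers to, getting open models to follow gpte's prompting conventions well enough to be useful, is untouched by this plumbing.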

I'm waiting for llama3 to come out this week to see if it's any better. If it is, I suspect it will also be available on OpenRouter.

I'm not against adding a few lines from your post to the docs right away. We just need to warn users, as Erik mentioned above, about the experimental nature of the feature.

@TheoMcCabe
Collaborator

TheoMcCabe commented Apr 16, 2024

Yeah, I saw the same thing with mynt. The only models that could handle the logic required to construct the requests properly were Anthropic or OpenAI models. Even Google Gemini sucked!

There are other benefits to using OpenRouter beyond getting access to additional models that work well with gpte, like being visible in the OpenRouter app rankings, and having early access to new models (which may in future work well with gpte).

I'd say the models not working very well shouldn't be a blocker on us completing this integration.

@TheoMcCabe
Collaborator

I think this can be closed now as your PR is merged, @zigabrencic?

@viborc viborc closed this as completed May 16, 2024