
GEMINI-1.5-PRO Main Day-1 support🧵 #2881

Open
jpshack-at-palomar opened this issue Apr 7, 2024 · 39 comments
Labels
bug Something isn't working

Comments

@jpshack-at-palomar commented Apr 7, 2024

What happened?

This is a placeholder for others who have this issue. There is likely no bug that needs to be fixed in LiteLLM, but we won't know until more people have access to the gemini-1.5-pro API. There is some evidence that the model will actually be provided as gemini-1.5-pro-latest. Note that this issue DOES NOT relate to the Vertex API, which is a different API and a separate LiteLLM provider.

Source code:

from litellm import completion
import os

os.environ['GEMINI_API_KEY'] = "<<REDACTED>>"
response = completion(
    model="gemini/gemini-1.5-pro", 
    messages=[{"role": "user", "content": "write code for saying hi from LiteLLM"}]
)

See https://docs.litellm.ai/docs/providers/gemini#pre-requisites

Versions

google-generativeai                      0.4.1
litellm                                  1.34.28

Exception:

Give Feedback / Get Help: https://github.com/BerriAI/litellm/issues/new
LiteLLM.Info: If you need to debug this error, use `litellm.set_verbose=True'.

Traceback (most recent call last):
  File ".env/lib/python3.12/site-packages/litellm/llms/gemini.py", line 216, in completion
    response = _model.generate_content(
               ^^^^^^^^^^^^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/google/generativeai/generative_models.py", line 232, in generate_content
    response = self._client.generate_content(
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/google/ai/generativelanguage_v1beta/services/generative_service/client.py", line 566, in generate_content
    response = rpc(
               ^^^^
  File ".env/lib/python3.12/site-packages/google/api_core/gapic_v1/method.py", line 131, in __call__
    return wrapped_func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/google/api_core/retry/retry_unary.py", line 293, in retry_wrapped_func
    return retry_target(
           ^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/google/api_core/retry/retry_unary.py", line 153, in retry_target
    _retry_error_helper(
  File ".env/lib/python3.12/site-packages/google/api_core/retry/retry_base.py", line 212, in _retry_error_helper
    raise final_exc from source_exc
  File ".env/lib/python3.12/site-packages/google/api_core/retry/retry_unary.py", line 144, in retry_target
    result = target()
             ^^^^^^^^
  File "/Volumes/boxy-01/code-archive/viewing/OpenDevin/.env/lib/python3.12/site-packages/google/api_core/timeout.py", line 120, in func_with_timeout
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/google/api_core/grpc_helpers.py", line 78, in error_remapped_callable
    raise exceptions.from_grpc_error(exc) from exc
google.api_core.exceptions.NotFound: 404 models/gemini-1.5-pro is not found for API version v1beta, or is not supported for GenerateContent. Call ListModels to see the list of available models and their supported methods.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File ".env/lib/python3.12/site-packages/litellm/main.py", line 1637, in completion
    model_response = gemini.completion(
                     ^^^^^^^^^^^^^^^^^^
  File ".env/lib/python3.12/site-packages/litellm/llms/gemini.py", line 222, in completion
    raise GeminiError(
litellm.llms.gemini.GeminiError: 404 models/gemini-1.5-pro is not found for API version v1beta, or is not supported for GenerateContent. Call ListModels to see the list of available models and their supported methods.

See the upstream issue here: google-gemini/generative-ai-python#227
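The 404 above suggests calling ListModels. A quick way to see which models your key can actually reach, using the same google-generativeai client that LiteLLM wraps (a minimal sketch; assumes GEMINI_API_KEY is set in your environment):

import os
import google.generativeai as genai

# Configure the client with the same key LiteLLM would use
genai.configure(api_key=os.environ["GEMINI_API_KEY"])

# Print every model visible to this key and the methods it supports
for m in genai.list_models():
    print(m.name, m.supported_generation_methods)

If models/gemini-1.5-pro (or models/gemini-1.5-pro-latest) does not appear in that list, the 404 is a key/rollout limitation rather than a LiteLLM bug.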


@jpshack-at-palomar jpshack-at-palomar added the bug Something isn't working label Apr 7, 2024
@jpshack-at-palomar (Author)

See also: https://ai.google.dev/models/gemini

@jpshack-at-palomar (Author) commented Apr 7, 2024

To determine whether you are seeing a limitation related to your key, use this test:

If you have a key that is good for Gemini Pro 1.0 but not 1.5, you will get a response for this request:

gemini-1.0-pro-latest

curl \
  -H 'Content-Type: application/json' \
  -d '{"contents":[{"parts":[{"text":"Write a story about a magic backpack"}]}]}' \
  -X POST 'https://generativelanguage.googleapis.com/v1beta/models/gemini-1.0-pro-latest:generateContent?key=YOUR_API_KEY'

Response:

{
  "candidates": [
      {
          "content": {
              "parts": [
                  {
                      "text": "In the quaint town of Willow Creek, amidst the rolling hills and whispering willow trees, there lived an ordinary boy named Ethan. Unbeknownst to him, an extraordinary adventure awaited him, hidden within the dusty attic of his grandmother's ancient home.\n\nAs Ethan rummaged through forgotten artifacts, his hands stumbled upon a worn-out leather backpack. Curiosity sparked within him as he unzipped the faded exterior, revealing a chaotic jumble of old schoolbooks, maps, and trinkets.\n\nUnbeknownst to Ethan, this was no ordinary backpack. It whispered ancient secrets and held a dormant power that was about to be awakened. As he delved deeper into the backpack's contents, his fingers brushed against a smooth, glowing orb.\n\nSuddenly, the backpack trembled, and a faint blue aura enveloped it. Ethan gasped in astonishment as the orb emitted a surge of energy that coursed through his body. In that instant, he felt an overwhelming connection to the world around him.\n\nThe backpack's newfound magic granted Ethan the ability to understand and interact with nature in an extraordinary way. He could hear the whispers of the wind, read the patterns of the stars, and communicate with animals.\n\nEmbarking on a thrilling odyssey, Ethan used the magic backpack to explore the hidden wonders of Willow Creek. He befriended the wise old owl that perched atop the town clock, learned the secret language of the squirrels, and discovered the enchanted waterfall tucked away in the deepest part of the forest.\n\nHowever, with great power came great responsibility. Ethan soon realized that the backpack's magic could be both a blessing and a curse. When the backpack's power flared out of control, it attracted the attention of malicious forces who sought to harness its energy for their own evil schemes.\n\nFacing danger at every turn, Ethan learned the true meaning of courage and perseverance. Guided by the wisdom of the backpack and his newfound friends, he outsmarted cunning sorcerers, outmaneuvered sly thieves, and ultimately defeated the forces of darkness.\n\nIn the end, Ethan emerged from his adventure as a wiser and more compassionate boy. The magic backpack, once a forgotten relic, became a lifelong companion and a symbol of the boundless possibilities that lie within us. And so, in the annals of Willow Creek, the tale of the boy with the magic backpack was passed down for generations to come, inspiring dreams and igniting the flames of imagination in every child who dared to believe in the extraordinary."
                  }
              ],
              "role": "model"
          },
          "finishReason": "STOP",
          "index": 0,
          "safetyRatings": [
              {
                  "category": "HARM_CATEGORY_SEXUALLY_EXPLICIT",
                  "probability": "NEGLIGIBLE"
              },
              {
                  "category": "HARM_CATEGORY_HATE_SPEECH",
                  "probability": "NEGLIGIBLE"
              },
              {
                  "category": "HARM_CATEGORY_HARASSMENT",
                  "probability": "NEGLIGIBLE"
              },
              {
                  "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
                  "probability": "NEGLIGIBLE"
              }
          ]
      }
  ],
  "promptFeedback": {
      "safetyRatings": [
          {
              "category": "HARM_CATEGORY_SEXUALLY_EXPLICIT",
              "probability": "NEGLIGIBLE"
          },
          {
              "category": "HARM_CATEGORY_HATE_SPEECH",
              "probability": "NEGLIGIBLE"
          },
          {
              "category": "HARM_CATEGORY_HARASSMENT",
              "probability": "NEGLIGIBLE"
          },
          {
              "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
              "probability": "NEGLIGIBLE"
          }
      ]
  }
}

gemini-1.5-pro-latest

curl \
  -H 'Content-Type: application/json' \
  -d '{"contents":[{"parts":[{"text":"Write a story about a magic backpack"}]}]}' \
  -X POST 'https://generativelanguage.googleapis.com/v1beta/models/gemini-1.5-pro-latest:generateContent?key=YOUR_API_KEY'

Response:

{
  "error": {
      "code": 404,
      "message": "models/gemini-1.5-pro-latest is not found for API version v1beta, or is not supported for GenerateContent. Call ListModels to see the list of available models and their supported methods.",
      "status": "NOT_FOUND"
  }
}
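The same two-model probe can be run from Python instead of curl (a sketch using the requests library; substitute your own key via the environment):

import os
import requests

API_KEY = os.environ["GEMINI_API_KEY"]
BODY = {"contents": [{"parts": [{"text": "Write a story about a magic backpack"}]}]}

for model in ("gemini-1.0-pro-latest", "gemini-1.5-pro-latest"):
    url = (
        "https://generativelanguage.googleapis.com/v1beta/models/"
        f"{model}:generateContent?key={API_KEY}"
    )
    # 200 means the key can reach the model; 404 mirrors the error above
    r = requests.post(url, json=BODY)
    print(model, r.status_code)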

@krrishdholakia (Contributor)

cc: @Manouchehri you might have some context on this - #2841

@krrishdholakia (Contributor)

@jpshack-at-palomar Will also update our exception here to point to this ticket:

litellm.llms.gemini.GeminiError: 404 models/gemini-1.5-pro is not found for API version v1beta, or is not supported for GenerateContent. Call ListModels to see the list of available models and their supported methods.

Thanks for this!

@krrishdholakia krrishdholakia changed the title [Bug]: litellm.llms.gemini.GeminiError: 404 models/gemini-1.5-pro is not found for API [No-op] litellm.llms.gemini.GeminiError: 404 models/gemini-1.5-pro is not found for API Apr 7, 2024
@Manouchehri (Collaborator)

I think you're right too.

That said, Gemini's naming does seem to barely follow any convention at times, so I wouldn't be shocked if they change the naming when it goes into public preview. 😅

@jameshiggie

Got it working if you force an upgrade of google-generativeai to 0.4.1, since it is pinned to 0.3.2 in the LiteLLM requirements.
code:

from litellm import completion
import os

os.environ['GEMINI_API_KEY'] = "<<REDACTED>>"
response = completion(
    model="gemini/gemini-1.5-pro-latest", 
    messages=[{"role": "user", "content": "write code for saying hi from LiteLLM"}]
)

LLM response formatted:

Saying Hi from LiteLLM: Code Options

Here are a few ways to write code for saying hi from LiteLLM, depending on your desired output and programming language:

Python:

print("Hi from LiteLLM!")

JavaScript:

console.log("Hi from LiteLLM!");

C++:

#include <iostream>

int main() {
  std::cout << "Hi from LiteLLM!" << std::endl;
  return 0;
}

Java:

public class Main {
  public static void main(String[] args) {
    System.out.println("Hi from LiteLLM!");
  }
}

Using a Function:

def say_hi():
  print("Hi from LiteLLM!")

say_hi()

This code defines a function called say_hi that prints the message and then calls the function to execute it.

Adding User Input:

name = input("What is your name? ")
print(f"Hi {name}, from LiteLLM!")

This code asks the user for their name and then includes it in the greeting.

Choosing the Right Code:

  • Simplicity: If you just want a basic output, the first examples in each language are the most straightforward.
  • Reusability: If you plan to use the greeting multiple times, consider using a function.
  • Interactivity: If you want to personalize the greeting, adding user input is a good option.

Remember to choose the code that best suits your specific needs and programming language.
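If you are unsure which client version your environment actually resolved after forcing the upgrade, a quick check (a minimal sketch using only the standard library):

from importlib.metadata import version

# gemini-1.5-pro-latest needs a newer google-generativeai than the old 0.3.2 pin
print("google-generativeai:", version("google-generativeai"))
print("litellm:", version("litellm"))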

@jpshack-at-palomar (Author) commented Apr 9, 2024

I just received access to gemini/gemini-1.5-pro-latest this evening and can confirm @jameshiggie's result with the following versions:

google-generativeai          0.5.0
litellm                      1.34.38

I will open a PR for a correction to https://docs.litellm.ai/docs/providers/gemini to show the model name as gemini/gemini-1.5-pro-latest.

@krrishdholakia (Contributor)

Testing on my end as well - I believe the new genai module also supports system instructions.

@krrishdholakia krrishdholakia pinned this issue Apr 10, 2024
@krrishdholakia krrishdholakia changed the title [No-op] litellm.llms.gemini.GeminiError: 404 models/gemini-1.5-pro is not found for API [No-op] GEMINI-1.5-PRO Main Day-1 support🧵 Apr 10, 2024
@krrishdholakia krrishdholakia changed the title [No-op] GEMINI-1.5-PRO Main Day-1 support🧵 GEMINI-1.5-PRO Main Day-1 support🧵 Apr 10, 2024
@krrishdholakia (Contributor)

Pinning this thread for anyone else filing issues on the new Gemini updates. It will be easier to consolidate discussion here.

@jameshiggie

Is there a plan to get async streaming working for "google AI Studio - gemini" as well? In testing it's great, but for our use case we need streaming :)

@krrishdholakia (Contributor)

Hey @jameshiggie this should already be working. Do you see an error?
(screenshot attached)

@jameshiggie commented Apr 10, 2024

Oh nice! Let me test now :) - I didn't bother trying since I saw in the README that it wasn't supported :S
(screenshot of the README attached)

@jameshiggie

running the example case:

from litellm import acompletion
import asyncio, os, traceback

async def completion_call():
    try:
        print("test acompletion + streaming")
        messages=[{"role": "user","content": "write code for saying hi from LiteLLM"}]
        response = await acompletion(
            model="gemini/gemini-1.5-pro-latest", 
            messages=messages, 
            stream=True
        )
        print(f"response: {response}")
        async for chunk in response:
            print(chunk)
    except:
        print(f"error occurred: {traceback.format_exc()}")
        pass

await completion_call()

fails :(

output:

test acompletion + streaming

Give Feedback / Get Help: https://github.com/BerriAI/litellm/issues/new
LiteLLM.Info: If you need to debug this error, use `litellm.set_verbose=True'.

error occurred: Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/litellm/llms/gemini.py", line 175, in async_streaming
    response = await _model.generate_content_async(
  File "/usr/local/lib/python3.10/dist-packages/google/generativeai/generative_models.py", line 263, in generate_content_async
    iterator = await self._async_client.stream_generate_content(
  File "/usr/local/lib/python3.10/dist-packages/google/api_core/retry_async.py", line 223, in retry_wrapped_func
    return await retry_target(
  File "/usr/local/lib/python3.10/dist-packages/google/api_core/retry_async.py", line 121, in retry_target
    return await asyncio.wait_for(
  File "/usr/lib/python3.10/asyncio/tasks.py", line 445, in wait_for
    return fut.result()
  File "/usr/local/lib/python3.10/dist-packages/google/api_core/grpc_helpers_async.py", line 177, in error_remapped_callable
    raise TypeError("Unexpected type of call %s" % type(call))
TypeError: Unexpected type of call <class 'google.api_core.rest_streaming.ResponseIterator'>

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/litellm/main.py", line 317, in acompletion
    response = await init_response
  File "/usr/local/lib/python3.10/dist-packages/litellm/llms/gemini.py", line 192, in async_streaming
    raise GeminiError(status_code=500, message=str(e))
litellm.llms.gemini.GeminiError: Unexpected type of call <class 'google.api_core.rest_streaming.ResponseIterator'>

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "<ipython-input-5-f5bdc6ae1aa1>", line 8, in completion_call
    response = await acompletion(
  File "/usr/local/lib/python3.10/dist-packages/litellm/utils.py", line 3418, in wrapper_async
    raise e
  File "/usr/local/lib/python3.10/dist-packages/litellm/utils.py", line 3250, in wrapper_async
    result = await original_function(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/litellm/main.py", line 330, in acompletion
    raise exception_type(
  File "/usr/local/lib/python3.10/dist-packages/litellm/utils.py", line 8533, in exception_type
    raise e
  File "/usr/local/lib/python3.10/dist-packages/litellm/utils.py", line 8501, in exception_type
    raise APIConnectionError(
litellm.exceptions.APIConnectionError: Unexpected type of call <class 'google.api_core.rest_streaming.ResponseIterator'>

@krrishdholakia (Contributor)

OK - I'll work on repro'ing + push a fix @jameshiggie

@jameshiggie

thanks! 🔥

@jameshiggie

After leaving the example case, I tried it out in a dev version of our product and it works! :D Using the same versions for both, though... :S

google-generativeai          0.4.1
litellm                      1.34.38

A little disappointed that the streaming from Google is very chunky - I'm guessing due to safety screening. But that's an easy fix; we can deal with it with some stream buffering on our side. Thanks for helping with all of this again :)

@krrishdholakia (Contributor)

@jameshiggie so this is a no-op?

@jameshiggie

The example case still fails in a fresh venv with the same error as above; not sure why.

from litellm import acompletion
import asyncio, os, traceback

os.environ['GEMINI_API_KEY'] = "XYZ"

async def completion_call():
    try:
        print("test acompletion + streaming")
        messages=[{"role": "user","content": "write code for saying hi from LiteLLM"}]
        response = await acompletion(
            model="gemini/gemini-1.5-pro-latest", 
            messages=messages, 
            stream=True
        )
        print(f"response: {response}")
        async for chunk in response:
            print(chunk)
    except:
        print(f"error occurred: {traceback.format_exc()}")
        pass

await completion_call()

The chunking and yielding are slightly different between the two. It would be good to have some example code like this in the LiteLLM docs that people can run quickly to test.

@krrishdholakia (Contributor)

I'm seeing a similar issue - it works on CI/CD but throws the ResponseIterator issue locally. Not sure why.

I see a similar hang when I use the raw Google SDK, so I wonder if it's something in my environment.

I'm able to get it working via curl though. Will investigate and aim to have a better solution here by tomorrow @jameshiggie
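One way to rule LiteLLM in or out is to hit the raw google-generativeai async streaming path directly (a sketch; assumes GEMINI_API_KEY is set). If this also raises the ResponseIterator error, the problem is in the environment/transport rather than in LiteLLM:

import asyncio
import os

import google.generativeai as genai

genai.configure(api_key=os.environ["GEMINI_API_KEY"])

async def main():
    model = genai.GenerativeModel("gemini-1.5-pro-latest")
    # Roughly the call LiteLLM makes under the hood for async streaming
    response = await model.generate_content_async(
        "write code for saying hi from LiteLLM", stream=True
    )
    async for chunk in response:
        print(chunk.text)

asyncio.run(main())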

@jameshiggie

ok thanks! 🏋️

@CXwudi commented Apr 11, 2024

Sorry that I made a new issue just for supporting the system message for Gemini 1.5 Pro: #2963.

Also, I can confirm that the system message is supported by the playground UI in Google AI Studio, but unfortunately I can't find API documentation on how to pass the system message.
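For what it's worth, newer versions of the google-generativeai client appear to expose this through a system_instruction parameter on GenerativeModel (a sketch; whether the parameter is available and honored depends on the installed client version, so treat this as an assumption):

import os
import google.generativeai as genai

genai.configure(api_key=os.environ["GEMINI_API_KEY"])

# system_instruction is only present in sufficiently new client versions
model = genai.GenerativeModel(
    "gemini-1.5-pro-latest",
    system_instruction="You are a helpful assistant that answers briefly.",
)
print(model.generate_content("write code for saying hi").text)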

@aleclarson

Is gemini/gemini-1.5-pro-latest still the recommended model name? I don't see it in the registry: https://raw.githubusercontent.com/BerriAI/litellm/main/model_prices_and_context_window.json

@Manouchehri (Collaborator)

I'm using vertex_ai/gemini-1.5-pro-preview-0409.

@krrishdholakia (Contributor)

@CXwudi The system message is already supported for gemini / Google AI Studio -

system_prompt, messages = get_system_prompt(messages=messages)

@aleclarson gemini/... is for calling via Google AI Studio (the one where Google gives you an API key), and yep, the way you're calling it looks fine.

vertex_ai/... is for Google's Vertex AI implementation.
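For example, both call forms go through the same LiteLLM interface (a sketch; the Vertex AI call also needs project/location configured, shown here as illustrative keyword arguments):

from litellm import completion

messages = [{"role": "user", "content": "say hi"}]

# Google AI Studio: API-key based, uses GEMINI_API_KEY
completion(model="gemini/gemini-1.5-pro-latest", messages=messages)

# Vertex AI: GCP project based
completion(
    model="vertex_ai/gemini-1.5-pro-preview-0409",
    messages=messages,
    vertex_project="my-gcp-project",   # illustrative project id
    vertex_location="us-central1",     # illustrative region
)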

@aleclarson Curious - how do you use our model_prices json?

@krrishdholakia (Contributor) commented Apr 12, 2024

Pending items:

@aleclarson

@krrishdholakia My PR which adds LiteLLM support to Aider needs it for various metadata (most importantly, which models are supported). See here: https://github.com/paul-gauthier/aider/pull/549/files#diff-da3f6418cba825fc2eac007d80f318784be5cf8f0f9a27433e2693338ca4c8b9R114

@Manouchehri (Collaborator) commented Apr 12, 2024

@krrishdholakia You may want to merge #2964 before starting on Vertex AI system message.

@krrishdholakia (Contributor)

@aleclarson got it.

The right way to check is by provider. For example, any model on Together AI can be called via LiteLLM as together_ai/<model-name>.

That model might not be in the map (which is used for tracking price and context window for popular models).

You can check which providers we support via:

provider_list: List = [
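Both are importable at runtime, so a quick local check looks something like this (a sketch; the exact keys in the map are whatever ships with your installed version):

import litellm

# Supported provider prefixes, e.g. "gemini", "vertex_ai", "together_ai"
print(litellm.provider_list)

# Price/context-window metadata, only present for models that are in the map
print(litellm.model_cost.get("gemini/gemini-1.5-pro-latest"))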

@aleclarson

@krrishdholakia Good to know. Looks like the model_cost dict is more what I'm looking for. Though I don't like how main is hard-coded. Shouldn't the version tag of the current LiteLLM installation be used instead? Then you could avoid hitting the network every time, since the local cache of it would be version-specific (and so the cache would be invalidated when LiteLLM is updated).

@krrishdholakia (Contributor) commented Apr 12, 2024

The purpose is to avoid needing to upgrade LiteLLM to get new models. Would welcome any improvement here.

Related issue: #411 (comment)

@CXwudi commented Apr 12, 2024

@CXwudi system message is already supported for gemini - google ai studio -

system_prompt, messages = get_system_prompt(messages=messages)

The code is correct, but it looks like we need to upgrade the google-generativeai dependency to 0.5.0 or so to make the system message actually pass through. Not sure if 0.4.1 is OK.
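For reference, the calling convention on the LiteLLM side is just an OpenAI-style system message (a sketch; whether it is forwarded as a true system instruction depends on the installed google-generativeai version, per the discussion above):

from litellm import completion

response = completion(
    model="gemini/gemini-1.5-pro-latest",
    messages=[
        {"role": "system", "content": "You are a terse assistant."},
        {"role": "user", "content": "write code for saying hi from LiteLLM"},
    ],
)
print(response.choices[0].message.content)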

@demux79 commented Apr 15, 2024

I am using vertex_ai/gemini-1.5-pro-preview-0409 which should support function calls. However, using it with the proxy never returns a valid function call and crashes. It works well with other models. Any idea what I am doing wrong?

litellm | 17:11:27 - LiteLLM Router:DEBUG: router.py:1184 - TracebackTraceback (most recent call last):
litellm | File "/usr/local/lib/python3.11/site-packages/litellm/llms/vertex_ai.py", line 819, in async_completion
litellm | for k, v in function_call.args.items():
litellm | ^^^^^^^^^^^^^^^^^^^^^^^^
litellm | AttributeError: 'NoneType' object has no attribute 'items'

@CXwudi commented Apr 17, 2024

For some reason, I am getting a message saying the gemini-1.5-pro-latest model is not in model_prices_and_context_window.json:
(screenshot attached)

where I have the following in the YAML config:

- model_name: gemini-1.5-pro-latest
  litellm_params:
    model: gemini/gemini-1.5-pro-latest
    api_key: os.environ/GEMINI_API_KEY

@krrishdholakia (Contributor)

Just added - should be fixed now @CXwudi.

This was caused because we run a check on whether the gemini model supports vision.

@krrishdholakia (Contributor)

@demux79 tested on my end with our function calling test -

def test_gemini_pro_function_calling():

Works fine. I suspect this is an issue with the endpoint returning a weird response (maybe None?). If you're able to repro with litellm.set_verbose = True and share the raw response, that would help.
(screenshot attached)
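For anyone trying to reproduce, a minimal tool-calling request in the OpenAI tools format looks roughly like this (a sketch; the tool schema is illustrative and not taken from the LiteLLM test suite):

from litellm import completion

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_weather",
            "description": "Get the current weather in a given location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {"type": "string", "description": "City name"},
                },
                "required": ["location"],
            },
        },
    }
]

response = completion(
    model="vertex_ai/gemini-1.5-pro-preview-0409",
    messages=[{"role": "user", "content": "What's the weather in Boston?"}],
    tools=tools,
)
print(response.choices[0].message.tool_calls)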

@demux79 commented Apr 17, 2024

@krrishdholakia Thanks. Indeed, Gemini returns None for my function call. With a simple function call it works.

router.py:517 - litellm.acompletion(model=vertex_ai/gemini-1.5-pro-preview-0409) Exception VertexAIException - 'NoneType' object has no attribute 'items'

GPT-4 and Opus handle my more complicated function call quite well. It seems Gemini just isn't quite up to the same standard then ;)

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/site-packages/litellm/llms/vertex_ai.py", line 819, in async_completion
    for k, v in function_call.args.items():
                ^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'items'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/site-packages/litellm/main.py", line 317, in acompletion
    response = await init_response
               ^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/llms/vertex_ai.py", line 971, in async_completion
    raise VertexAIError(status_code=500, message=str(e))
litellm.llms.vertex_ai.VertexAIError: 'NoneType' object has no attribute 'items'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 1276, in async_function_with_retries
    response = await original_function(*args, **kwargs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 522, in _acompletion
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 506, in _acompletion
    response = await _response
               ^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/utils.py", line 3433, in wrapper_async
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/utils.py", line 3265, in wrapper_async
    result = await original_function(*args, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/main.py", line 338, in acompletion
    raise exception_type(
          ^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/utils.py", line 8573, in exception_type
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/utils.py", line 7817, in exception_type
    raise APIError(
litellm.exceptions.APIError: VertexAIException - 'NoneType' object has no attribute 'items'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.11/site-packages/litellm/proxy/proxy_server.py", line 3589, in chat_completion
    responses = await asyncio.gather(
                ^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 412, in acompletion
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 408, in acompletion
    response = await self.async_function_with_fallbacks(**kwargs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 1259, in async_function_with_fallbacks
    raise original_exception
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 1180, in async_function_with_fallbacks
    response = await self.async_function_with_retries(*args, **kwargs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 1370, in async_function_with_retries
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 1332, in async_function_with_retries
    response = await original_function(*args, **kwargs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 522, in _acompletion
    raise e
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 427, in _acompletion
    deployment = await self.async_get_available_deployment(
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 2566, in async_get_available_deployment
    return self.get_available_deployment(
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/litellm/router.py", line 2678, in get_available_deployment
    rpm = healthy_deployments[0].get("litellm_params").get("rpm", None)
          ~~~~~~~~~~~~~~~~~~~^^^
IndexError: list index out of range

@CXwudi commented Apr 17, 2024

Just added. should be fixed now @CXwudi

This was caused b/c we run a check if the gemini model supports vision

For some reason, it is still not fixed for me. The last working version I tried is v1.35.5. Also, it looks like multiple Gemini models are affected; so far I know gemini-1.0-pro-latest is also affected.

@hi019 commented Apr 17, 2024

@demux79 The error is due to a faulty if statement in LiteLLM, see #3097 and my PR, #3101

@CXwudi commented Apr 22, 2024

Just added. should be fixed now @CXwudi
This was caused b/c we run a check if the gemini model supports vision

For some reason, it is still not fixed for me. The last working version I tried is v1.35.5. Also, it looks like the multiple gemini models are affected, so far I knew gemini-1.0-pro-latest is also affected

The problem is now solved with #3186, which is working in v1.35.17

@krrishdholakia krrishdholakia unpinned this issue Apr 23, 2024