Owlv2 model keeps crashing #30874

Open
preethiseshadri518 opened this issue May 17, 2024 · 2 comments
Labels: Examples, Vision

preethiseshadri518 commented May 17, 2024

I am trying to run OWLv2 (google/owlv2-base-patch16-ensemble) to perform object detection.

I am following the example code for inference, using a Colab notebook with a T4 GPU and transformers version 4.40.2. When I try to run inference, the cell keeps running and eventually crashes with the message: "Your session crashed after using all available RAM." This is surprising because the model is not that large (relatively speaking), and inference on a single image with OWL-ViT (google/owlvit-base-patch32) takes < 0.001 seconds, so I am not sure where the difference is coming from. Here is the code I am running:

import requests
from PIL import Image
import torch
from transformers import Owlv2Processor, Owlv2ForObjectDetection

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
texts = [["a photo of a cat", "a photo of a dog"]]

processor = Owlv2Processor.from_pretrained("google/owlv2-base-patch16-ensemble")
model = Owlv2ForObjectDetection.from_pretrained("google/owlv2-base-patch16-ensemble")

with torch.no_grad(): # tried with and without this line
    inputs = processor(text=texts, images=image, return_tensors="pt")
    outputs = model(**inputs)

target_sizes = torch.Tensor([image.size[::-1]])
# Convert outputs (bounding boxes and class logits) to Pascal VOC Format (xmin, ymin, xmax, ymax)
results = processor.post_process_object_detection(outputs=outputs, target_sizes=target_sizes, threshold=0.1)
i = 0  # Retrieve predictions for the first image for the corresponding text queries
text = texts[i]
boxes, scores, labels = results[i]["boxes"], results[i]["scores"], results[i]["labels"]

Has anyone run into a similar issue and resolved it? I imagine there is some issue with actually utilizing the GPU, but the same problem does not occur with OWL-ViT using nearly identical code.
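A quick way to confirm where the computation is actually running (a minimal check using standard PyTorch attributes; model and inputs refer to the objects in the snippet above):

# Sanity check: where do the model weights and input tensors live?
# If these print "cpu", inference is running on the CPU; OWLv2 resizes
# inputs to a much larger resolution than OWL-ViT, so CPU inference is
# far slower and more memory-hungry.
print(next(model.parameters()).device)           # e.g. cpu or cuda:0
print({k: v.device for k, v in inputs.items()})  # device of each input tensor

Thanks!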

amyeroberts (Collaborator) commented:

cc @qubvel if you have time :)

amyeroberts added the Examples and Vision labels on May 17, 2024
qubvel (Member) commented May 17, 2024

Hi @preethiseshadri518, thanks for the issue!

I found that your reproduction code is not using the GPU. I have updated it as follows:

import requests
from PIL import Image
import torch
from transformers import Owlv2Processor, Owlv2ForObjectDetection

device = "cuda"

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
texts = [["a photo of a cat", "a photo of a dog"]]

processor = Owlv2Processor.from_pretrained("google/owlv2-base-patch16-ensemble")
model = Owlv2ForObjectDetection.from_pretrained("google/owlv2-base-patch16-ensemble", device_map=device)
#                                                                                      ^^^^^^^^^^^^^

inputs = processor(text=texts, images=image, return_tensors="pt").to(device)
#                                                                ^^^^^^^^^^^

with torch.no_grad(): # tried with and without this line
    outputs = model(**inputs)

target_sizes = torch.Tensor([image.size[::-1]])
# Convert outputs (bounding boxes and class logits) to Pascal VOC Format (xmin, ymin, xmax, ymax)
results = processor.post_process_object_detection(outputs=outputs, target_sizes=target_sizes, threshold=0.1)
i = 0  # Retrieve predictions for the first image for the corresponding text queries
text = texts[i]
boxes, scores, labels = results[i]["boxes"], results[i]["scores"], results[i]["labels"]

print(boxes, scores, labels)

It works fine both locally and in Colab. I used the following setup:

!pip install -U transformers==4.40.2 accelerate
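Note that passing device_map to from_pretrained relies on the accelerate package, which is why it is installed above. A quick environment check before running (standard torch/transformers calls):

import torch
import transformers

# Confirm the library version and that a GPU is visible to PyTorch.
print(transformers.__version__)       # expect 4.40.2
print(torch.cuda.is_available())      # should be True on a T4 runtime
print(torch.cuda.get_device_name(0))  # e.g. "Tesla T4"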

Here are the results; inference takes only ~3-4 GB of GPU RAM:
[Screenshot: detected boxes, scores, and labels printed in Colab, 2024-05-17]

Are you running exactly this script, or is there anything else that could be causing the problem?
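If GPU memory is still a constraint, loading the weights in half precision usually helps. A minimal sketch under that assumption (torch_dtype is a standard from_pretrained argument; half-precision outputs may differ slightly from full precision):

# Lower-memory variant: load the weights in float16 (roughly halves
# the GPU memory needed for the weights).
model = Owlv2ForObjectDetection.from_pretrained(
    "google/owlv2-base-patch16-ensemble",
    device_map="cuda",
    torch_dtype=torch.float16,
)
inputs = processor(text=texts, images=image, return_tensors="pt").to("cuda")
inputs["pixel_values"] = inputs["pixel_values"].half()  # match the fp16 weights

with torch.no_grad():
    outputs = model(**inputs)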
