Bug: TypeError in SAM2ImagePredictor.predict() method #1431

dongxiaolong · 2024-12-18T08:10:56Z

@cpuhrsch Hi! I need your help with a bug in SAM2ImagePredictor.predict() method

# Bug: TypeError in SAM2ImagePredictor.predict() method

## Description
When using `SAM2ImagePredictor.predict()`, two errors occur:
1. When `return_logits=False`: RuntimeError: "Boolean value of Tensor with more than one value is ambiguous"
2. When `return_logits=True`: AssertionError at `postprocess_masks_1_channel()` due to incorrect tensor dimension (expecting channel dimension to be 1)

## Reproduction Steps
```python
from torchao._models.sam2.build_sam import build_sam2
from torchao._models.sam2.sam2_image_predictor import SAM2ImagePredictor

# Initialize model
sam2_checkpoint = "sam2.1_hiera_large.pt"
model_cfg = "sam2.1_hiera_l.yaml"
sam2_model = build_sam2(model_cfg, sam2_checkpoint, device="cuda")
predictor = SAM2ImagePredictor(sam2_model)

# Set image and input points
predictor.set_image(image)
input_point = np.array([[500, 375]])
input_label = np.array([1])

# This call raises the error
masks, scores, logits = predictor.predict(
    point_coords=input_point,
    point_labels=input_label,
    multimask_output=True,
    return_logits=False  # or True
)

Error Messages

With return_logits=False:

RuntimeError: Boolean value of Tensor with more than one value is ambiguous

With return_logits=True:

AssertionError at transforms.py:128: assert masks.size(1) == 1

Additional Context

The input tensor has shape torch.Size([1, 256, 64, 64]). The error seems to occur in the parameter passing between _predict() and _predict_masks_postprocess() methods, specifically around the handling of return_logits parameter.

Environment

Python version: 3.11
CUDA: enabled

Could you please help me understand what's going wrong here? Thank you in advance!

The text was updated successfully, but these errors were encountered:

cpuhrsch · 2024-12-19T05:58:40Z

Hey @dongxiaolong - the copy of SAM2 in torchao isn't intended to be general purpose just yet, but specialized towards the example in https://github.com/pytorch/ao/tree/main/examples/sam2_amg_server . The assert comes up because some assumption that was made along the development is being invalidated.

dongxiaolong · 2024-12-20T03:06:40Z

Thank you for your reply.

cpuhrsch · 2024-12-20T03:12:51Z

I'll keep this issue open so I can revisit it later on when this example works.

supriyar assigned cpuhrsch Dec 19, 2024

supriyar added the bug Something isn't working label Dec 19, 2024

dongxiaolong closed this as completed Dec 20, 2024

cpuhrsch reopened this Dec 20, 2024

jerryzh168 added the triaged label Dec 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: TypeError in SAM2ImagePredictor.predict() method #1431

Bug: TypeError in SAM2ImagePredictor.predict() method #1431

dongxiaolong commented Dec 18, 2024 •

edited

Loading

cpuhrsch commented Dec 19, 2024

dongxiaolong commented Dec 20, 2024

cpuhrsch commented Dec 20, 2024

Bug: TypeError in SAM2ImagePredictor.predict() method #1431

Bug: TypeError in SAM2ImagePredictor.predict() method #1431

Comments

dongxiaolong commented Dec 18, 2024 • edited Loading

Error Messages

Additional Context

Environment

cpuhrsch commented Dec 19, 2024

dongxiaolong commented Dec 20, 2024

cpuhrsch commented Dec 20, 2024

dongxiaolong commented Dec 18, 2024 •

edited

Loading