Issue Description
On Windows, I cannot use multimodal models like llava-v1.6-vicuna-7b:q4_0 or llava-llama-3-8b-v1.1; they fail with the error: "Error running ggml inference: Failed to load clip model: C:\Users\xxx\.cache\nexa\hub\official\llava-v1.6-vicuna-7b\projector-q4_0.gguf. Please refer to our docs to install the nexaai package: https://docs.nexaai.com/getting-started/installation." However, the omniVLM:fp16 model works fine, so I am not sure where the issue is occurring.
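In case it helps triage, here is a minimal Python sketch (not part of the SDK; it only assumes the cache path shown in the error message, with "xxx" being my redacted user name) to check whether the projector GGUF was downloaded completely. A truncated or corrupt file could also surface as a clip-model load failure:

```python
from pathlib import Path

# Path taken from the error message; Path.home() should resolve to
# the same C:\Users\xxx directory.
proj = (Path.home() / ".cache" / "nexa" / "hub" / "official"
        / "llava-v1.6-vicuna-7b" / "projector-q4_0.gguf")

print("exists:", proj.exists())
if proj.exists():
    print("size (bytes):", proj.stat().st_size)
    # Every valid GGUF file starts with the 4-byte ASCII magic b"GGUF".
    with proj.open("rb") as f:
        print("GGUF magic ok:", f.read(4) == b"GGUF")
```

If the file is missing, zero-length, or fails the magic check, the download step would be the likely culprit rather than the Windows clip loader itself.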
Steps to Reproduce
1. $env:CMAKE_ARGS="-DGGML_CUDA=ON -DSD_CUBLAS=ON"; pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cu124 --extra-index-url https://pypi.org/simple --no-cache-dir
2. nexa run llava-v1.6-vicuna-7b:q4_0 (see the diagnostic sketch below)
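Since step 2 fails while omniVLM:fp16 works, comparing what was actually pulled for each model might narrow this down. A hedged sketch (assuming the hub layout inferred from the path in the error message) that lists every cached GGUF and its size:

```python
from pathlib import Path

# Assumed hub layout, inferred from the path in the error message.
hub = Path.home() / ".cache" / "nexa" / "hub" / "official"
if not hub.exists():
    raise SystemExit(f"hub directory not found: {hub}")

# Print every cached GGUF and its size; compare the failing llava
# directories against the working omniVLM one.
for model_dir in sorted(hub.iterdir()):
    if model_dir.is_dir():
        print(model_dir.name)
        for f in sorted(model_dir.glob("*.gguf")):
            print(f"  {f.name}: {f.stat().st_size:,} bytes")
```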
OS
Windows 11
Python Version
3.12.7
Nexa SDK Version
0.0.9.6
GPU (if using one)
NVIDIA RTX 4090 Laptop, CUDA 12.3