How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("image-text-to-text", model="roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF")
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"},
            {"type": "text", "text": "What animal is on the candy?"}
        ]
    },
]
pipe(text=messages)
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF", dtype="auto")
Quick Links

roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF

Repo: roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF
Original Model: Omni-Reasoner-o1 Organization: prithivMLmods Quantized File: omni-reasoner-2b-q3_k_m.gguf Quantization: GGUF Quantization Method: Q3_K_M
Use Imatrix: False
Split Model: False

Overview

This is an GGUF Q3_K_M quantized version of Omni-Reasoner-o1.

Quantization By

I often have idle A100 GPUs while building/testing and training the RP app, so I put them to use quantizing models. I hope the community finds these quantizations useful.

Andrew Webby @ RolePlai

Downloads last month
20
GGUF
Model size
2B params
Architecture
qwen2vl
Hardware compatibility
Log In to add your hardware

3-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF

Base model

Qwen/Qwen2-VL-2B
Quantized
(53)
this model