-
-
-
-
-
-
Inference Providers
Active filters:
nvidia
unsloth/Nemotron-3-Nano-30B-A3B-GGUF
Text Generation
•
32B
•
Updated
•
90.1k
•
200
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
Text Generation
•
32B
•
Updated
•
263k
•
524
nvidia/Qwen2.5-CascadeRL-RM-72B
Text Generation
•
71B
•
Updated
•
20
•
8
Token Classification
•
Updated
•
3.04k
•
53
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16
Text Generation
•
32B
•
Updated
•
8.28k
•
86
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8
Text Generation
•
32B
•
Updated
•
552k
•
218
Image-Text-to-Text
•
9B
•
Updated
•
18.3k
•
20
nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4
56B
•
Updated
•
20.6k
•
17
nvidia/Cosmos-Predict2.5-2B
Updated
•
41k
•
43
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
•
8B
•
Updated
•
1.45k
•
30
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
227
•
4
Image-Text-to-Text
•
Updated
•
6.94k
•
11
Image-Text-to-Text
•
8B
•
Updated
•
73.4k
•
224
nvidia/OpenMath-Nemotron-14B
Text Generation
•
15B
•
Updated
•
190
•
16
nvidia/Qwen3-Nemotron-235B-A22B-GenRM
Text Generation
•
235B
•
Updated
•
270
•
16
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
1.78k
•
2
nvidia/Nemotron-Cascade-8B
Text Generation
•
8B
•
Updated
•
4.27k
•
45
Ex0bit/Elbaz-NVIDIA-Nemotron-3-Nano-30B-A3B-PRISM
Text Generation
•
32B
•
Updated
•
2.34k
•
5
nvidia/Mistral-NeMo-12B-Base
Updated
•
159
•
41
nvidia/Llama-3.1-Nemotron-70B-Reward
Updated
•
20
•
78
nvidia/Llama-3.1-Nemotron-70B-Instruct
Updated
•
49
•
568
nvidia/Cosmos-1.0-Diffusion-7B-Text2World
Text-to-Video
•
Updated
•
3.39k
•
230
roleplaiapp/Llama-3.1-Nemotron-70B-Instruct-HF-Q4_K_M-GGUF
Text Generation
•
71B
•
Updated
•
202
•
2
Updated
•
17.9k
•
14
nvidia/Nemotron-H-8B-Base-8K
Text Generation
•
8B
•
Updated
•
11.3k
•
53
nvidia/Cosmos-Predict2-2B-Text2Image
Text-to-Image
•
Updated
•
135
•
65
nvidia/Cosmos-Predict2-2B-Video2World
Image-to-Video
•
Updated
•
1.6k
•
34
nvidia/Cosmos-Predict2-14B-Video2World
Image-to-Video
•
Updated
•
116
•
28
unsloth/AceReason-Nemotron-14B-GGUF
Text Generation
•
15B
•
Updated
•
426
•
8
bartowski/nvidia_AceReason-Nemotron-14B-GGUF
Text Generation
•
15B
•
Updated
•
429
•
10