Models

57

Full-text search

Active filters: sglang

Alexzander85/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled-NVFP4-MLP-FP8KV

Text Generation • 8B • Updated 30 days ago • 1.16k • 8

AxionML/Qwen3.5-9B-NVFP4

Image-Text-to-Text • 7B • Updated Mar 3 • 49.8k • 11

bullpoint/Qwen3-Coder-Next-AWQ-4bit

Text Generation • 14B • Updated Feb 3 • 272k • 23

drbaph/s2-pro-fp8

Text-to-Speech • Updated 22 days ago • 1.82k • 16

QuantTrio/Qwen3-Coder-Next-E336

Text Generation • 53B • Updated Feb 6 • 9 • 2

AxionML/Qwen3.5-122B-A10B-NVFP4

Image-Text-to-Text • 62B • Updated Mar 3 • 2.72k • 5

AxionML/Qwen3.5-4B-NVFP4

Image-Text-to-Text • 3B • Updated Mar 3 • 2.11k • 2

AxionML/Qwen3.5-35B-A3B-NVFP4

Image-Text-to-Text • Updated Mar 3 • 89.5k • 3

AxionML/Qwen3.5-2B-NVFP4

Image-Text-to-Text • 2B • Updated Mar 3 • 3.09k • 1

MickJ/Wan2.2-S2V-14B-overlay

Updated 5 days ago • 297 • 1

SurfaceData/llava-v1.6-mistral-7b-sglang

Image-Text-to-Text • 8B • Updated Mar 7, 2024 • 14 • 9

SurfaceData/llava-v1.6-vicuna-7b-sglang

Image-Text-to-Text • 7B • Updated Mar 7, 2024 • 19 • 1

tclf90/qwen2.5-72b-instruct-gptq-int4

Text Generation • 73B • Updated May 12, 2025 • 106 • 2

tclf90/qwen2.5-72b-instruct-gptq-int3

Text Generation • 69B • Updated May 12, 2025 • 105

alvarobartt/grok-2-tokenizer

Updated Aug 27, 2025 • 2

unsloth/grok-2

Text Generation • Updated Sep 6, 2025 • 38 • 4

osmapi/MiniMax-M2-THRIFT

173B • Updated Nov 13, 2025 • 1.67k • 35

mradermacher/MiniMax-M2-THRIFT-GGUF

Updated Nov 7, 2025 • 2

JasmineBBB/Kimi-Linear-48B-A3B-Instruct-bnb-4bit

Text Generation • 49B • Updated Nov 5, 2025 • 3 • 1

mradermacher/MiniMax-M2-THRIFT-i1-GGUF

173B • Updated Dec 10, 2025 • 155 • 10

bartowski/VibeStudio_MiniMax-M2-THRIFT-GGUF

Text Generation • 173B • Updated Nov 20, 2025 • 3.12k • 8

osmapi/MiniMax-M2-THRIFT-55

106B • Updated Dec 3, 2025 • 155 • 5

JinnP/SGLang-EAGLE3-Qwen3-Coder-30B-A3B-Instruct

Text Generation • 0.2B • Updated Nov 25, 2025 • 56 • 1

mradermacher/MiniMax-M2-THRIFT-55-GGUF

106B • Updated Nov 26, 2025 • 57 • 2

mradermacher/MiniMax-M2-THRIFT-55-i1-GGUF

106B • Updated Dec 5, 2025 • 476 • 2

osmapi/MiniMax-M2-THRIFT-55-MLX-4bit

106B • Updated Dec 2, 2025 • 99 • 2

osmapi/MiniMax-M2-THRIFT-55-MLX-6bit

106B • Updated Dec 3, 2025 • 42

Doradus-AI/MiroThinker-v1.0-30B-FP8

Text Generation • 31B • Updated Dec 5, 2025 • 71 • 4

Doradus-AI/Hermes-4.3-36B-FP8

Text Generation • 36B • Updated Dec 7, 2025 • 67 • 2

Doradus-AI/RnJ-1-Instruct-FP8

Text Generation • 9B • Updated Dec 7, 2025 • 4 • 4