inference-optimization/DFlash-SWA-Causal-Qwen3-8B-Magpie-Ultrachat 2B • Updated about 8 hours ago • 38
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-ckpt6 0.6B • Updated about 12 hours ago
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-ckpt5 0.6B • Updated about 17 hours ago
inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 8 days ago • 305 • 1
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 14 days ago • 225
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 14 days ago • 221
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 14 days ago • 206
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 14 days ago • 132
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 14 days ago • 129
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 14 days ago • 158
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 14 days ago • 131
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 14 days ago • 117
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 14 days ago • 110
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 14 days ago • 115
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 14 days ago • 291
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 14 days ago • 120
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 14 days ago • 122
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 14 days ago • 126
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-heuristic Image-Text-to-Text • 26B • Updated 14 days ago • 116