Tiny models used for testing
Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch3
2B • Updated • 9
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch1
2B • Updated • 8
inference-optimization/Qwen3-Next-80B-A3B-Instruct-MTP-ultrachat-epoch2
2B • Updated • 3
inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned
81B • Updated • 3
models 160
inference-optimization/Qwen3-30B-A3B-speculator.dflash
0.7B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-speculator.dflash
0.7B • Updated • 12
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step63012
2B • Updated
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt5
0.6B • Updated
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step42008
2B • Updated • 43
inference-optimization/Qwen3-8B-speculators.peagle-qwen3arch-ckpt4
2B • Updated • 3
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step21004
2B • Updated • 64
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-500k-ckpt4
0.6B • Updated • 113
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step126024
2B • Updated • 301
inference-optimization/Qwen3-8B-speculator.dflash.swa.non-qwen3-step56712
2B • Updated • 410
datasets 26
inference-optimization/every-eval-ever-demo
Viewer • Updated • 1 • 48
inference-optimization/DeepSeek-V4-Flash-responses
Viewer • Updated • 508k • 13
inference-optimization/Qwen3.5-4B-responses
Viewer • Updated • 7.47k • 74
inference-optimization/Qwen3.5-0.8B-responses
Viewer • Updated • 7.47k • 101
inference-optimization/Qwen3.5-9B-responses
Viewer • Updated • 7.67k • 49
inference-optimization/Qwen3-8B-Regenerated-Collection
Preview • Updated • 196
inference-optimization/Qwen3-30B-A3B-responses
Preview • Updated • 65
inference-optimization/gpt-oss-120b-responses
Preview • Updated • 8
inference-optimization/Qwen3-32B-responses
Preview • Updated • 40
inference-optimization/ctest-Qwen3.6-27B-speculator-dataset
Viewer • Updated • 5.61k • 34