ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28, 2025 • 17.4k • 200
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B Reinforcement Learning • 2B • Updated Apr 6, 2025 • 63 • 1
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos Reinforcement Learning • 8B • Updated Apr 28, 2025 • 15