CEIA-RL/energy-gpt-regulatorio-v2-GRPO-step140-Safety Text Generation • 4B • Updated about 2 hours ago
CEIA-RL/qwen3-4b-dw-lr-dpo-offline-energy-GRPO Text Generation • 4B • Updated about 1 month ago • 143
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline-GRPO_v3 Updated 2 days ago • 60
CEIA-RL/energy-eval-filtered_responses_multichoice_cemig-nlp-releases_enregy-gpt-regulatorio-v2_v3 Updated 2 days ago • 54
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energyv2-dpo-offline_v3 Updated 2 days ago • 63
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy-GRPO_v3 Updated 2 days ago • 51
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_energy-exp1-dpo-offline_v3 Updated 2 days ago • 60
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-GRPO_v3 Updated 2 days ago • 55
CEIA-RL/energy-eval-filtered_responses_multichoice_CEIA-RL_qwen3-4b-dw-lr-dpo-offline-energy_v3 Updated 2 days ago • 59