Muhammad Khalifa

mkhalifa

·

https://mukhal.github.io/

AI & ML interests

natural language genration, reinforcement learning

Recent Activity

upvoted a paper 14 days ago

MET: Theory-Grounded and Culture-Aware Multilingual Moral Reasoning

upvoted a paper 3 months ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

liked a dataset 3 months ago

nvidia/Nemotron-Personas-Korea

View all activity

Organizations

Papers 9

arxiv:2504.16828

arxiv:2412.04144

arxiv:2410.02899

arxiv:2405.16337

models 21

mkhalifa/flan-t5-large-gsm8k

Text Generation • Updated Jan 7 • 12

mkhalifa/flan-t5-large-svamp

Text Generation • Updated Jan 7 • 11

mkhalifa/flan-t5-large-mathqa

Text Generation • Updated Jan 7 • 7

mkhalifa/ThinkPRM-gptoss-20B

Updated Aug 18, 2025 • 15

mkhalifa/r1_14b_discriminative_prm

Text Generation • 15B • Updated Mar 27, 2025 • 5

mkhalifa/r1_14b_longthought-1K

Text Generation • 15B • Updated Mar 25, 2025 • 10

mkhalifa/r1-1.5b-longthought-outcome-matching

Text Generation • 2B • Updated Mar 20, 2025 • 5

mkhalifa/r1-1.5b-longthought-1K

Text Generation • 2B • Updated Mar 10, 2025 • 3

mkhalifa/r1_14b_longthought-1K-outcome-only

Text Generation • 15B • Updated Mar 9, 2025 • 5

mkhalifa/r1-1.5b-longthought-v2

Text Generation • 2B • Updated Mar 9, 2025 • 5

datasets 18

mkhalifa/agent

Updated Nov 26, 2025 • 20

mkhalifa/gpqa-diamond-physics

Viewer • Updated Mar 15, 2025 • 86 • 176

mkhalifa/short-to-long-5K

Viewer • Updated Feb 26, 2025 • 5k • 21

mkhalifa/CoGEX

Viewer • Updated Feb 13, 2025 • 51.8k • 46

mkhalifa/llama-3.1-8b-instruct-math-trajectories-64-sample-per-problem

Viewer • Updated Jan 29, 2025 • 736k • 9

mkhalifa/llama-3.1-8b-instruct-math-trajectories-48-sample-per-problem

Viewer • Updated Jan 29, 2025 • 552k • 47

mkhalifa/llama-3.1-8b-instruct-math-trajectories-32-sample-per-problem

Viewer • Updated Jan 29, 2025 • 368k • 8

mkhalifa/llama-3.1-8b-instruct-math-trajectories-16-sample-per-problem

Viewer • Updated Jan 29, 2025 • 184k • 12

mkhalifa/llama-3.1-8b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 23

mkhalifa/llama-3.1-70b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 87

View 18 datasets