Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
updated a model 1 day ago
mnoukhov/nuevamol-135M-wsd-6Btok-wd2.0 updated a model 2 days ago
mnoukhov/nuevamol-135m-6B-wd3 published a model 2 days ago
mnoukhov/nuevamol-135m-6B-wd3Organizations
models 50
mnoukhov/nuevamol-135M-wsd-6Btok-wd2.0
Text Generation • 0.1B • Updated • 18
mnoukhov/nuevamol-135m-6B-wd3
Text Generation • 0.1B • Updated • 11
mnoukhov/nuevamol-80m-reinvent-sft
Text Generation • 78.1M • Updated • 298
mnoukhov/nuevamol-80m-base
Text Generation • 78.1M • Updated • 31
mnoukhov/nuevamol-220m-reinvent-sft
Text Generation • 0.2B • Updated • 301
mnoukhov/nuevamol-80m-init
Text Generation • 0.1B • Updated • 27
mnoukhov/nuevamol-135m-reinvent-sft
Text Generation • 0.1B • Updated • 549
mnoukhov/nuevamol-46m-reinvent-sft
Text Generation • 46.2M • Updated • 433
mnoukhov/nuevamol-220m-base
Text Generation • 0.2B • Updated • 34
mnoukhov/nuevamol-135m-base
Text Generation • 0.1B • Updated • 28
datasets 102
mnoukhov/chembl_filtered
Viewer • Updated • 1.18M • 39
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 21
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 54
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 111
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 55
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 25.3k • 20
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples
Viewer • Updated • 12.6k • 159
mnoukhov/gsm8k-train-harder-quartiles
Viewer • Updated • 11.2k • 160
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128
Viewer • Updated • 874 • 105
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128-completions
Viewer • Updated • 874 • 176