Claude/MTP?

#10
by Harland121574412 - opened

I'm still using qwen3.6-35b-a3b-uncensored-claude-wasserstein-mtp. Does the absence of Claude have a significant impact?

I'm still using qwen3.6-35b-a3b-uncensored-claude-wasserstein-mtp. Does the absence of Claude have a significant impact?

Claude distillation for 35B-A3B makes model dumb. Short thinking, constant loops, reduced overall intelligence. For 27B Claude distill is fine.

all right, I will try this new model, thanks

Sign up or log in to comment