CPU offload not working as expected on LM Studio

#2
by momocome - opened

I’m running the Genesis-MTP-Q8_K_P model on Windows 11 using LM Studio v0.4.14, but the "Number of layers forced into CPU" parameter (cpu_layers / n_gpu_layers) isn’t working as expected.

For now, this is a known issue in LM Studio. Try using llama-server instead.
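
A minimal sketch of what launching llama-server with an explicit GPU layer count might look like, written as a small Python helper that only builds and prints the command. The model filename, the layer count of 40, and the port are placeholders rather than values from this thread, and whether `--n-gpu-layers` corresponds exactly to the LM Studio setting described above is an assumption.

```python
import shlex


def build_llama_server_cmd(model_path, n_gpu_layers, port=8080):
    """Build a llama-server command line with an explicit GPU layer count.

    Layers that are not offloaded to the GPU stay on the CPU.
    """
    if n_gpu_layers < 0:
        raise ValueError("n_gpu_layers must be non-negative")
    return [
        "llama-server",
        "-m", model_path,                    # path to the GGUF file (placeholder below)
        "--n-gpu-layers", str(n_gpu_layers), # number of layers offloaded to the GPU
        "--port", str(port),
    ]


# Placeholder values: adjust the path and layer count for your own setup.
cmd = build_llama_server_cmd("Genesis-MTP-Q8_K_P.gguf", n_gpu_layers=40)
print(shlex.join(cmd))
```

Lowering the `n_gpu_layers` value keeps more of the model on the CPU; the printed command can be pasted into a terminal where llama-server is on the PATH.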
