Will upload the APEX version?

#1
by ziyins - opened

Will upload the APEX version? I want to try both the MTP and APEX models because my GPU has too little VRAM.

Will upload the APEX version? I want to try both the MTP and APEX models because my GPU has too little VRAM.

Yes. MTP version for Q8_K_P and APEX is still in development. APEX Compact will be first. APEX Balanced next.

thks very much

will be apex-i version?

will be apex-i version?

Don't have computing resources for this. I am on RTX 3060 12GB and Google Collab Free. Please ask mudler. May be he can help.

I am about to have 2 rtx 3060 12GB (currently 1) can I help? 😅

I am about to have 2 rtx 3060 12GB (currently 1) can I help? 😅

If you want to experiment check this repo: https://github.com/localai-org/apex-quant. But it's wrong way I think, because for importance matrix extraction we don't have BF16 weights. Hauhau not shared them.

Sign up or log in to comment