looking forward to the quantized versions for edge deployment
#3
by tomasmcm - opened
Pretty cool model you've got here. I hope you're considering releasing a quantized version as well, so we can try running it on SBCs, phones, and tablets 🙏
Hi Tomas, we are working on quantizing the model and will update here when we have more to share!
Hello, do you have any news about GGUF?
I've opened a PR upstream in llama.cpp to add support for Reka Edge: https://github.com/ggml-org/llama.cpp/pull/21616
We've also added a script for converting to GGUF (convert_reka_vlm_to_gguf.py) and basic quantization scripts (quantize_reka_q4_last8_q8.sh, quantize_reka_q4.sh).
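For anyone wanting to try this, here's a rough sketch of how those scripts might fit together. The paths, the `--outfile` flag, and the exact script arguments are assumptions on my part (check each script's source or help output for its actual interface); only the script names come from the post above.

```shell
# Sketch of a convert-then-quantize flow; arguments are assumptions,
# not the scripts' documented interface.
MODEL_DIR=./reka-edge            # local checkout of the HF model repo (assumed path)
F16_GGUF=reka-edge-f16.gguf      # full-precision intermediate GGUF

# 1. Convert the HF checkpoint to GGUF (hypothetical invocation)
python convert_reka_vlm_to_gguf.py "$MODEL_DIR" --outfile "$F16_GGUF"

# 2. Quantize: plain Q4, or Q4 with the last 8 layers kept at Q8
#    (judging by the script names; exact behavior may differ)
./quantize_reka_q4.sh "$F16_GGUF"
./quantize_reka_q4_last8_q8.sh "$F16_GGUF"
```

The resulting .gguf files should then be loadable with a llama.cpp build that includes the Reka Edge support from the PR linked above.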