đŽđŗ New in my Hindi LLM Series: Gemma-4 E4B, fine-tuned for Hindi â and it runs on your laptop's CPU. I fine-tuned Google's new Gemma-4 E4B on ~10k Hindi instruction pairs (AI4Bharat: anudesh + dolly) using Unsloth + LoRA, on a single L4 GPU. Then I ran an honest side-by-side eval: base Gemma-4 vs my fine-tune, across 25 Hindi prompts. The results were interesting đ â My fine-tune is more concise â ask for "3 tips" and it gives exactly 3. Base writes a 1,200-character essay.
â Pure native Hindi â base keeps slipping into English ("⤏ā¤ā¤¤āĨ⤞ā¤ŋ⤤ ā¤ā¤šā¤žā¤° (Eat a Balanced Diet)", "ā¤¤ā¤žā¤°ā¤ž (Star)"). My fine-tune stays in clean Hindi.
â Tighter instruction-following â ask for a "short message" and it gives one, not a menu of options. âī¸ And to be honest: base Gemma-4 is more detailed and comprehensive. I didn't build a "smarter" model â I built a focused, Hindi-native, edge-friendly one that runs as a 5GB GGUF (Q4) on CPU. đ Try it: