-
pankajpandey-dev/Carbon-3B-GGUF
Text Generation ⢠3B ⢠Updated ⢠639 ⢠3 -
pankajpandey-dev/MiniCPM5-1B-Hindi-Instruct-v1-GGUF
Text Generation ⢠1B ⢠Updated ⢠614 ⢠1 -
pankajpandey-dev/Qwen3-0.6B-Hindi-Instruct-v1-GGUF
Text Generation ⢠0.6B ⢠Updated ⢠604 ⢠1 -
pankajpandey-dev/Qwen3-4B-Hindi-Instruct-v2-GGUF
Text Generation ⢠4B ⢠Updated ⢠230 ⢠1
Pankaj Pandey
pankajpandey-dev
AI & ML interests
Natural Language Processing, Text Generation, Large Language Models, Quantization, Fine-Tuning, RLHF, Model Merging
Recent Activity
reacted to theirpost with š„ 7 days ago
š®š³ Gemma-3-1B Hindi Instruct ā a Hindi LLM that runs fully offline, anywhere.
Last week I shipped Qwen3-4B Hindi. This week I went the other direction: how tiny can a useful Hindi model get? So I fine-tuned Gemma-3-1B on quality-filtered Hindi instruction data and shipped the full GGUF ladder.
ā
Fine-tune (16-bit): https://huggingface.co/pankajpandey-dev/gemma-3-1b-hindi-instruct
ā
GGUF (Q4/Q5/Q8): https://huggingface.co/pankajpandey-dev/gemma-3-1b-hindi-instruct-GGUF
Runs in Ollama, llama.cpp, and LM Studio. The Q4_K_M is just 806 MB ā runs on CPU, a cheap laptop, even a Raspberry Pi.
What I tried this round: chrF-filtered the training data to drop weak translations, and used response-only loss so the model learns how to answer, not how to repeat prompts.
Honest note: at 1B, Hindi fluency is strong but coherence is bounded by size ā it's a lightweight/edge experiment, not a 4B replacement. Gemma-3-4B Hindi is next.
Part of my Hindi LLM Series ā openly-licensed Indic models for local & edge use. Feedback welcome š
#Hindi #IndicNLP #GGUF #LocalLLM #Gemma #EdgeAI
liked a Space 10 days ago
pankajpandey-dev/qwen3-4b-hindi-demo updated a Space 10 days ago
pankajpandey-dev/qwen3-4b-hindi-demo