gemma-4-E2B-sft-tran-tigre

GGUF builds of a Tigre translation fine-tune of google/gemma-4-E2B.

Pipeline

  1. Base โ€” google/gemma-4-E2B (Gemma 4 E2B)
  2. CPT โ€” Continued pre-training adapter for Tigre fluency, merged into base weights
  3. SFT โ€” Translation LoRA on bidirectional translation pairs (Xโ†”Tigre), merged into the CPT-merged base
  4. GGUF โ€” Converted with llama.cpp (convert_hf_to_gguf.py) and quantized with llama-quantize

Files

File Quantization Notes
gemma-4-E2B-sft-tran-tigre-BF16.gguf BF16 Full precision base โ€” use to derive other quants
gemma-4-E2B-sft-tran-tigre-Q8_0.gguf Q8_0 Near-lossless, recommended if size allows
gemma-4-E2B-sft-tran-tigre-Q5_K_M.gguf Q5_K_M Good quality, smaller than Q8
gemma-4-E2B-sft-tran-tigre-Q4_K_M.gguf Q4_K_M Best size/quality tradeoff for most users

Usage with llama.cpp

./llama-cli -hf google/gemma-4-E2B-sft-tran-tigre:Q4_K_M \
    --temp 0.3 --top-p 0.9 \
    -p "Translate the following English text to Tigre.\n\nHello, how are you?"

Chat template

Standard Gemma turn template:

<bos><start_of_turn>user
{instruction}

{input}<end_of_turn>
<start_of_turn>model
{output}<end_of_turn>

License

Inherits the Gemma Terms of Use from the base model.

Downloads last month
36
GGUF
Model size
5B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for BeitTigreAI/gemma-4-E2B-sft-tran-tigre

Quantized
(38)
this model