Dataset: ai4bharat/IndicCorpV2
How to use BERTCHEESIE/KannaBERT-xl with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("fill-mask", model="BERTCHEESIE/KannaBERT-xl")

    # Load model directly
    from transformers import AutoTokenizer, AutoModelForMaskedLM

    tokenizer = AutoTokenizer.from_pretrained("BERTCHEESIE/KannaBERT-xl")
    model = AutoModelForMaskedLM.from_pretrained("BERTCHEESIE/KannaBERT-xl")

This model was pretrained on the Kannada IndicCorp dataset. On the evaluation set it reaches a validation loss of 3.8378 after one epoch.
General pretrained TinyBERT model for Kannada (kn).
Pretraining used 4.9 GB of IndicCorp data. The model has not been fine-tuned.
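Because the model is a plain masked-language model, the quickest way to inspect it is to ask for the most likely fillers of a masked token. The sketch below is one possible way to do that, not part of the original card. The helper name `top_fill_mask`, the `demo` function, and the Kannada example sentence are illustrative assumptions. It relies only on the documented behaviour of the Transformers fill-mask pipeline, which returns a list of dictionaries with `token_str` and `score` keys for an input containing a single mask.

```python
from typing import Callable, List, Tuple


def top_fill_mask(pipe: Callable, text: str, k: int = 5) -> List[Tuple[str, float]]:
    """Return the k best (token, score) pairs for a text with exactly one mask token.

    `pipe` is expected to behave like a Transformers fill-mask pipeline.
    """
    results = pipe(text, top_k=k)
    return [(r["token_str"], float(r["score"])) for r in results]


def demo() -> None:
    """Download the model and print predictions (needs network access)."""
    from transformers import pipeline

    pipe = pipeline("fill-mask", model="BERTCHEESIE/KannaBERT-xl")
    # Read the mask token from the tokenizer instead of hard-coding "[MASK]".
    mask = pipe.tokenizer.mask_token
    # Hypothetical example sentence: "Bengaluru is Karnataka's <mask>."
    text = f"ಬೆಂಗಳೂರು ಕರ್ನಾಟಕದ {mask} ಆಗಿದೆ."
    for token, score in top_fill_mask(pipe, text, k=5):
        print(f"{token}\t{score:.4f}")
```

Calling `demo()` runs the example against the published checkpoint. Since the model has not been fine-tuned, the predictions reflect pretraining only and may be noisy.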
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 4.0098 | 1.0 | 590626 | 3.8378 |
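
The losses in the table are easier to compare with other language models when converted to perplexity. The sketch below assumes the reported values are mean per-token cross-entropy in nats, which is the usual convention for masked-language-model training with Transformers but is not stated in this card. Under that assumption, perplexity is simply the exponential of the loss.

```python
import math


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


# Values copied from the training results table (epoch 1.0).
train_ppl = perplexity(4.0098)
val_ppl = perplexity(3.8378)

print(f"training perplexity:   {train_ppl:.1f}")
print(f"validation perplexity: {val_ppl:.1f}")
```

For a masked-language model this is a pseudo-perplexity over the masked positions only, so it is not directly comparable to the perplexity of an autoregressive model.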