MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder
Paper
•
2409.14074
•
Published
•
3
Please refer to newer version which integrates ASR + MT models: https://huggingface.co/leduckhai/MultiMed-ST
Please press ⭐ button and/or cite papers if you feel helpful.
@article{le2024multimed,
title={MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder},
author={Le-Duc, Khai and Phan, Phuc and Pham, Tan-Hanh and Tat, Bach Phan and Ngo, Minh-Huong and Ngo, Chris and Nguyen-Tang, Thanh and Hy, Truong-Son},
journal={arXiv preprint arXiv:2409.14074},
year={2024}
}
Dataset: 🤗 HuggingFace dataset, Paperswithcodes dataset
Pre-trained models: 🤗 HuggingFace models
| Model Name | Description | Link |
|---|---|---|
Whisper-Small-Chinese |
Small model fine-tuned on medical Chinese set | Hugging Face models |
Whisper-Small-English |
Small model fine-tuned on medical English set | Hugging Face models |
Whisper-Small-French |
Small model fine-tuned on medical French set | Hugging Face models |
Whisper-Small-German |
Small model fine-tuned on medical German set | Hugging Face models |
Whisper-Small-Vietnamese |
Small model fine-tuned on medical Vietnamese set | Hugging Face models |
Whisper-Small-Multilingual |
Small model fine-tuned on medical Multilingual set (5 languages) | Hugging Face models |
If any links are broken, please contact me for fixing!
Le Duc Khai
University of Toronto, Canada
Email: [email protected]
GitHub: https://github.com/leduckhai
Base model
openai/whisper-small