VANPY: Voice Analysis Framework
Paper
•
2502.17579
•
Published
•
1
This model is a 8-class SVM classifier trained on the RAVDESS dataset using SpeechBrain ECAPA-TDNN embeddings as features.
You can install the package directly from GitHub:
pip install git+https://github.com/griko/voice-emotion-classification.git
from pipelines.emotion_classifier import EmotionClassificationPipeline
# Load the model
classifier = EmotionClassificationPipeline.from_pretrained("griko/emotion_8_cls_svm_ecapa_ravdess")
# Use it for prediction
result = classifier("path/to/audio.wav")
print(result) # ['angry'] or ['disgust'] or ['fearful'] or ['happy'] or ['neutral'] or ['calm'] or ['sad'] or ['surprised']
# Batch prediction
results = classifier(["audio1.wav", "audio2.wav"])
print(results) # ['angry', 'disgust']
If you use this model in your research, please cite:
@misc{koushnir2025vanpyvoiceanalysisframework,
title={VANPY: Voice Analysis Framework},
author={Gregory Koushnir and Michael Fire and Galit Fuhrmann Alpert and Dima Kagan},
year={2025},
eprint={2502.17579},
archivePrefix={arXiv},
primaryClass={cs.SD},
url={https://arxiv.org/abs/2502.17579},
}