Whisper Small Fine-tuned on Nepali (OpenSLR 54)

This model is a fine-tuned version of openai/whisper-small on the OpenSLR 54 (Nepali Speech Corpus) dataset.

Model Details

  • Model: Whisper Small (244M Parameters)
  • Dataset: ~154 Hours of Nepali Audio
  • Language: Nepali
  • Hardware: NVIDIA A100 80GB

Results

  • WER: 26.69%
  • Loss: 0.210

Usage

from transformers import pipeline

transcriber = pipeline(
    "automatic-speech-recognition", 
    model="Dragneel/whisper-small-nepali-openslr"
)

text = transcriber("path_to_audio.mp3")
print(text["text"])
Downloads last month
26
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train Dragneel/whisper-small-nepali-openslr

Evaluation results