kingabzpro
/

whisper-large-v3-turbo-urdu

Automatic Speech Recognition

mozilla-foundation/common_voice_17_0

hf-asr-leaderboard

Model card Files Files and versions

Metrics Training metrics Community

kingabzpro commited on Jul 5

Commit

f4d458d

·

1 Parent(s): c2d5c6b

Update README.md

Files changed (1) hide show

README.md +49 -13

README.md CHANGED Viewed

@@ -36,19 +36,55 @@ It achieves the following results on the evaluation set:
 - Loss: 0.3534
 - Wer: 25.7842
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 - Loss: 0.3534
 - Wer: 25.7842
+## Usage
+```python
+from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor, pipeline
+from datasets import load_dataset
+import torch, warnings, os
+device = "cuda:0"
+torch_dtype = torch.float16
+model_id = "kingabzpro/whisper-large-v3-turbo-urdu"
+model = AutoModelForSpeechSeq2Seq.from_pretrained(
+    model_id, torch_dtype=torch_dtype, use_safetensors=True
+).to(device)
+model.config.use_cache = False
+model.generation_config.language = "ur"
+model.generation_config.task = "transcribe"
+processor = AutoProcessor.from_pretrained(model_id)
+pipe = pipeline(
+    "automatic-speech-recognition",
+    model=model,
+    tokenizer=processor.tokenizer,
+    feature_extractor=processor.feature_extractor,
+    torch_dtype=torch_dtype,
+    device=device,
+)
+ds = load_dataset(
+    "mozilla-foundation/common_voice_17_0",
+    "ur",
+    split="test",
+    trust_remote_code=True,
+    cache_dir="./hf_cache",
+)
+audio = ds[100]["audio"]
+result = pipe(audio)
+print("Original  :", ds[100]["sentence"])
+print("Predicted :", result["text"])
+```
+```sh
+Original  : اگر عمران خان ٹھیک کر رہے ہیں۔
+Predicted : اگر عمران خان ٹھیک کر رہے ہیں۔
+```
 ### Training hyperparameters