Kunal Dhawan
commited on
Commit
·
e2ff9b4
1
Parent(s):
f3ccd7e
updated model description
Browse filesSigned-off-by: Kunal Dhawan <kunaldhawan97@gmail.com>
README.md
CHANGED
|
@@ -266,7 +266,8 @@ img {
|
|
| 266 |
</style>
|
| 267 |
|
| 268 |
## Description:
|
| 269 |
-
NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that
|
|
|
|
| 270 |
|
| 271 |
|
| 272 |
## Model Architecture:
|
|
@@ -576,7 +577,6 @@ F1-score on [Librispeech Test sets](https://www.openslr.org/12) at collar value
|
|
| 576 |
|:-----------:|:---------:|:----------:|:----------:|
|
| 577 |
| nemo-main | canary-1b-flash | 95.5 | 93.5 |
|
| 578 |
|
| 579 |
-
Note that this is an experimental feature currently and not recommended for production use cases.
|
| 580 |
|
| 581 |
### Hallucination Robustness
|
| 582 |
Number of characters per minute on [MUSAN](https://www.openslr.org/17) 48 hrs eval set
|
|
|
|
| 266 |
</style>
|
| 267 |
|
| 268 |
## Description:
|
| 269 |
+
NVIDIA NeMo Canary Flash [1] is a family of multilingual multi-tasking models based on Canary architecture [2] that achieve state-of-the-art performance on multiple speech benchmarks. With 883 million parameters and running at more than 900 RTFx (on open-asr-leaderboard datasets), canary-1b-flash supports automatic speech-to-text recognition (ASR) in four languages (English, German, French, Spanish) and translation from English to German/French/Spanish and from German/French/Spanish to English with or without punctuation and capitalization (PnC). Additionally, Canary-1B-Flash offers an experimental feature for word-level and segment-level timestamps in English, German, French, and Spanish.
|
| 270 |
+
This model is released under the permissive CC-BY-4.0 license and is available for commercial use.
|
| 271 |
|
| 272 |
|
| 273 |
## Model Architecture:
|
|
|
|
| 577 |
|:-----------:|:---------:|:----------:|:----------:|
|
| 578 |
| nemo-main | canary-1b-flash | 95.5 | 93.5 |
|
| 579 |
|
|
|
|
| 580 |
|
| 581 |
### Hallucination Robustness
|
| 582 |
Number of characters per minute on [MUSAN](https://www.openslr.org/17) 48 hrs eval set
|