just record your voice and send to the model
Mohamed Rashad PRO
AI & ML interests
Computer Vision, Robotics, Natural Language Processing
Recent Activity
new activity
26 days ago
MohamedRashad/Arabic-Chatbot-Arena: Demande de réactivation du modèle - projet PalmX-2025 (Request to reactivate the model, PalmX-2025 project)
replied to their post about 1 month ago
I made a demo for the latest PersonaPlex model from NVIDIA. Try it out here:
https://huggingface.co/spaces/MohamedRashad/PersonaPlex
new activity
about 1 month ago
MohamedRashad/PersonaPlex: add audio file upload instead of forcing users to use their mic
posted an update about 1 month ago
Post
1083
I made a demo for the latest PersonaPlex model from NVIDIA. Try it out here:
MohamedRashad/PersonaPlex
posted an update about 2 months ago
Post
3425
I have updated my https://huggingface.co/collections/MohamedRashad/arabic-speech-datasets collection
with new datasets, bringing the total audio data to more than 3000 hours of good Arabic speech.
Feel free to use it in your new innovations, and happy new year!
replied to their post 6 months ago
The output of the model is JSON. That's what is crazy about it, in my opinion.
posted an update 7 months ago
Post
3287
If anyone is interested in trying the new rednote-hilab/dots.ocr model, I made this space for you:
MohamedRashad/Dots-OCR
posted an update 7 months ago
Post
1934
For anyone who wants to try the new Voxtral models, you can do so here:
MohamedRashad/Voxtral
You can also find the transformers versions of them here:
MohamedRashad/Voxtral-Mini-3B-2507-transformers
MohamedRashad/Voxtral-Small-24B-2507-transformers
posted an update 9 months ago
Post
1903
I think we just got the best Image to Markdown VLM out there, and it's hosted here:
MohamedRashad/Nanonets-OCR
posted an update 9 months ago
Post
397
I just updated an old (non-working) space I had with an implementation of a cool research paper named UniRig.
The idea is that you upload any 3D model and it rigs it for you, generating the correct armature and skinning to give you a final model that is fully rigged and ready to use.
Check it out here:
MohamedRashad/UniRig
posted an update 10 months ago
Post
1100
I have processed and cleaned the famous SADA2022 dataset from SDAIA for Arabic ASR and other related tasks and uploaded it here:
MohamedRashad/SADA22
Edit:
I also added another dataset from SDAIA named SCC22:
MohamedRashad/SCC22
replied to their post 11 months ago
Speech data in audio and text format
replied to their post 11 months ago
Start by gathering high-quality data. This is by far the biggest hurdle for TTS systems out there.
posted an update 11 months ago
Post
2731
I collected recitations of the Holy Quran from 20 different reciters and uploaded the full dataset here:
MohamedRashad/Quran-Recitations
Check it out 🥷
posted an update 11 months ago
Post
2177
For those interested in trying the new canopylabs/orpheus-3b-0.1-ft model, I made a space for you:
MohamedRashad/Orpheus-TTS
Post
3543
I think we have released the best Arabic model under 25B, at least based on https://huggingface.co/spaces/inceptionai/AraGen-Leaderboard
Yehia = https://huggingface.co/ALLaM-AI/ALLaM-7B-Instruct-preview + GRPO
and it's ranked the number one model under the 25B parameter mark.
Now, I said "I think", not "I am sure", because this model used the same evaluation metric the AraGen developers use (3C3H) as a reward model to improve its responses, and this raises the question: is this something good for users, or is it another type of overfitting that we don't want?
I don't know if this is a good thing or a bad thing, but what I do know is that you can try it here:
Navid-AI/Yehia-7B-preview
or download it for your personal experiments here:
Navid-AI/Yehia-7B-preview
Ramadan Kareem