Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

He was talking about STT models, not TTS. Whisper is open source and a good solution in many cases (in particular finetuned ones).


regarding STT we got also today 2 new models from Nvidia:

https://huggingface.co/nvidia/canary-180m-flash

https://huggingface.co/nvidia/canary-1b-flash

second in Open ASR leaderboard https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

Sadly only supports 4 languages (english, german, spanish, french)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: