Some people (like me) are primarily verbal processors: - I am dictating this mes...

sumanyusharma · on Aug 15, 2024

I use Superwhisper (no affiliation, just a happy user), which runs a local Whisper model, to create most of my email drafts and post-meeting notes. I find Whisper more accurate than Mac’s built-in speech-to-text, plus I’m faster at speaking than typing.

Sometimes, I even ‘talk’ into Cursor’s chat window instead of typing. The only downside? It can get a bit annoying for others when you're talking to yourself all day.

smeej · on Aug 16, 2024

I'm looking for something like this that runs on Linux. Best thing I've found is LiveCaptions, but its output is janky. I can't just use it to type in any old text field, and its output requires substantial editing after the fact.

I guess I understand that a lot of things are being developed for Apple silicon specifically. It's just frustrating that despite hours of searching, I'm not finding anything decent.

Centigonal · on Aug 17, 2024

Talon Voice is good and runs on linux.

https://talonvoice.com/

smeej · on Aug 17, 2024

This looks really powerful for controlling the system with different scripts, but what if all I want it to do is let me narrate something and print out the sentences as close to real-time as possible? It's really just good STT that I'm looking for out of it.

Centigonal · on Aug 23, 2024

The Talon voice dev created his own STT model that's very performant. The transcription quality is... good, but not world-class. It's better than anything that came out before Whisper IMO, but the newest generator of models can do things like inferring punctuation and words outside of its vocabulary (although the downside of the new generation of VTT is that they can sometimes hallucinate words that are very different from what you said).

It's a bit overkill to use Talon for just voice dictation, but that is 90% of what I use it for, and it's pretty good at it.

Centigonal · on Aug 16, 2024

Interesting! I'll give Superwhisper a try.