Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The paid models are still too expensive for personal long-form use-cases. For example: if I want to generate an audiobook from a web novel, the price can go as high as thousands of dollars. If I'm just a regular reader (not the author), that's prohibitively expensive for someone who just wants to enjoy the story in a different medium.


I listen to web novels with Elevenlabs reader all the time (The 11 dollar a month unlimited plan). I love it.

When it's a foreign web novel with no English translation, I first translate the Web novel with Claude Sonnet.


Despite ElevenLabs API usage being expensive, ElevenReader is $11 a month for unlimited personal long-form content.

Even with a local model and hardware you already own, you're not beating that on electricity costs.


I dunno about the electricity claims for practical purposes— where I live, that’d be roughly 128 hours of 600W. I suppose the real question is, would it take 128 hours ($11) of power to generate 720 hours of TTS content assuming a good enough model was available?

On Android I’d imagine, big emphasis on imagine since I don’t use it, you could probably script something up and use a phone with an audio jack to record it. Theoretically hitting that maximum of 720 hours of content per month, but I’d imagine at some point they’d find it peculiar you’re listening to content 24/7.


Kokoro is available as a system TTS for Android via OSS project called "sherpa": https://k2-fsa.github.io/sherpa/onnx/android/index.html

I believe its power usage is negligible in comparison to, for example, screen or maybe even Bluetooth audio.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: