Show HN: New Audiobook Generator for Nvidia Using Chatterbox TTS

(github.com)

4 points · beboplifa · 10mo ago · 3 comments

I am an audiobook addict who coded this: https://github.com/cpttripzz/Chatterblez. I use it all the time and it works nicely. I have only bothered to get it working on Windows, but it should be cross-platform since it uses PyQt. I would be happy for contributors to help get it working on macOS and Linux, and also on ATI and other video cards.

If you are stuck without a video card, I recommend https://github.com/cpttripzz/audiblez, which can generate an audiobook in around 4 hours on a decent CPU.

drewbitt · 10mo ago

With Chatterbox this finally feels almost possible. I'm sensitive to pacing issues, though, and it often has them. Kokoro was just alright. I'm using a tool I hacked together that runs Minimax Speech-02-HD, which is still on a whole other level, IMO, but not that cheap. Inworld-TTS-1-max is cheaper, and I'm trialing it these days. async.ai seems promising too.

Thanks for the tool! I'm also quite interested in this space.

BinaryIgor · 10mo ago

Interesting; I was thinking about creating something like that a few years ago, since I love listening to information while doing chores or walking, but back then all available text-to-speech converters were unbearably robotic.

How much time does it take to convert a book/doc into audio using your approach? Also, as I understand it, it all runs locally, so you don't need to pay for any API access/usage?

beboplifa (OP) · 10mo ago

On an Nvidia RTX 2060 mobile, about half a day for a medium-sized novel. Chatterbox TTS is really emotive, sometimes too much so.
