EDIT: I could also definitely see Audapolis being useful if you could integrate it into a podcast's post processing flow (volume normalization, de-essing) by recognizing certain verbal tics and automatically removing them from the audio such as "ummmm...", etc.
Disclaimer: I work at Descript
I've always liked the idea of Descript and was considering building something similar before it came out. The problem is my use case is a couple of videos a year so doesn't fit with an expensive monthly subscription
Take a look at https://matcha.video
This functionality is some of my favorite when editing videos in Descript. It’s so much easier than chopping up waveforms in Audacity
[0] descript.com/
I wondered if this particular feature was really worth paying for so I was happy that I found Audapolis.
> Hindenburg’s manuscript feature gives you a complete overview of your audio. You can select the text just as you would in a text document and watch as your edits are made in real-time. If you need to export your text in a specific format, no problem. Hindenburg supports the most common text and transcription export formats.
I built something similar here: https://bigwav.app
What do you typically do with the text on export? E.g. Do you parse the times?
A number of comments turned me onto Descript -- made a similar comment on another audio thread recently: drives me absolutely insane how all audio tools with any AI are web based monthly saas instead of offline private gpu upfront purchase.
What does that have to do with non-Web-based applications?
Is 1 emoji for each commit title a new trend?
Can anyone clarify if this project is active?
I'm stuck in editing hell right now, and it would be very nice to just visually scroll past a few pages of pre-episode bullshitting and be able to wipe out whole minutes at a stretch, without having to listen to the whole thing. Even at increased speed, it's a bit of a slog.
And why the rude response?
It's not a good idea, but then tons of the LLM ideas we see here aren't either.
I think I just lack experience in this area. I've used Audacity to cut out parts of audio / splice together two clips and that's about it, so I clearly don't have enough background to understand what this tool does.
Can someone clarify what this tool does, please? :)
What it does not do is generate new words (ie you type a sentence and it adds that to your file as voice).