Show HN: Navigate by speaker in YouTube videos (opens in new tab)

(zanshin.sh)

2 pointshamza_q_6mo ago2 comments

2 comments

Wow! What an awesome interface with the visual representations of speakers that can be clicked. I got it immediately and wish it was part of all the media players everywhere now!

hamza_q_OP6mo ago

Thanks :) Agreed, the limiting factor has been diarization (generating the "who speaks when" data) speed. But the diarization backend of this app that I developed can now process 1 hour of audio in ~8 seconds on a M3 Mac. So that's more or less a solved problem now (at least on Mac), just UI work remains.

j / k navigate · click thread line to collapse

2 comments

leakycap6mo ago

Wow! What an awesome interface with the visual representations of speakers that can be clicked. I got it immediately and wish it was part of all the media players everywhere now!

hamza_q_OP6mo ago

j / k navigate · click thread line to collapse