OpenAI Whisper Analysis (opens in new tab)

(twitter.com)

10 pointslunixbochs3y ago2 comments

2 comments

After testing Whisper (small model) on real recordings, some of them just noice such as chairs moving or doors being shut, I have seen that it sometimes invent crazy things. Sometimes, it infers urls from what, to my ear, is just noise or things that may have emerged from movies (like "GUNSHOT" or "The storm becomes stronger"), also from just noise. Since it often creates strange things with correct grammar, it can be hard to identify which parts that are erroneous

kylixz3y ago

Great breakdown… with some interesting results and a ton of effort.

Are there any open benchmarks like this for all models that are actually runnable like the data exposed in https://github.com/syhw/wer_are_we but with some of your additional metrics?

j / k navigate · click thread line to collapse

2 comments

karencarits3y ago

kylixz3y ago

Great breakdown… with some interesting results and a ton of effort.

Are there any open benchmarks like this for all models that are actually runnable like the data exposed in https://github.com/syhw/wer_are_we but with some of your additional metrics?

j / k navigate · click thread line to collapse