PocketSphinx+PG+JavaScript Voice/Text Experiment (opens in new tab)

(vmorgulys.github.io)

3 pointsvmorgulis9y ago3 comments

3 comments

Original video: https://www.youtube.com/watch?v=0KR2MSFROLI

CMUSphinx: http://cmusphinx.sourceforge.net/

So, what am I looking at? It seems like you fed the audio in PocketSphinx to get time-tagged text and the site basically shows said text as subtitles to what was said, is that the gist of it?

vmorgulisOP9y ago

> ... is that the gist of it?

Yes, it is.

I'd like to improve the speech recognition and expected some advice about that.

Another possibility is to add a semantic level with NLP or use another library like Kaldi (http://kaldi-asr.org/).

Another particularity: the WAV file is serialized in JSON (as an array).

j / k navigate · click thread line to collapse

3 comments

vmorgulisOP9y ago

Original video: https://www.youtube.com/watch?v=0KR2MSFROLI

CMUSphinx: http://cmusphinx.sourceforge.net/

detaro9y ago

So, what am I looking at? It seems like you fed the audio in PocketSphinx to get time-tagged text and the site basically shows said text as subtitles to what was said, is that the gist of it?

vmorgulisOP9y ago

> ... is that the gist of it?

Yes, it is.

I'd like to improve the speech recognition and expected some advice about that.

Another possibility is to add a semantic level with NLP or use another library like Kaldi (http://kaldi-asr.org/).

Another particularity: the WAV file is serialized in JSON (as an array).

j / k navigate · click thread line to collapse