Alert icon
We're changing our privacy policy. This stuff matters.  Learn more  Dismiss

PyCon 2010 talk on Pocketsphinx

Loading...

Sign in or sign up now!
Alert icon
Upgrade to the latest Flash Player for improved playback performance. Upgrade now or more info.
8,207
Loading...
Alert icon
Sign in or sign up now!
Alert icon

Uploaded by on Jul 10, 2010

This is a talk by David Huggins-Daines on Pocketsphinx and Python on PyCon 2010 in Atlanta

It contains quick and nice introduction in Pocketsphinx API with all major issues covered. It will let you write your own speech recognition in 5 minutes.

Category:

Science & Technology

Tags:

License:

Standard YouTube License

  • likes, 1 dislikes

Link to this comment:

Share to:
see all

All Comments (4)

Sign In or Sign Up now to post a comment!
  • Convert mp3 to WAV-------------- on Ubuntu-------------

    apt-get -y install sox libsox-fmt-mp3

    # convert mp3 to WAV (16kHz)

    # sox -r 16000 -2 -s -t mp3 mp3filename wavfilename

    sox -r 16000 -2 -s -t mp3 rec0505-004232.mp3 goforward.wav

  • I used Google Search:

    +"asr.py" +sphinx

  • Installation Procedure for Ubuntu 11-04 (worked the same on 10-10 Ubuntu, just the same)

    # install packages (Acoustics Model, Language Model, and a decoder)

    sudo apt-get install python-pocketsphinx pocketsphinx-lm-wsj pocketsphinx-hmm-wsj1 ipython

    # sudo apt-get -y install vim ssh

    Created one Python file, and one audio file (16KHz); intelligibility (specifically consonants) is considerably improved with the band with afforded by a 16kHz sample rate.The path2audio is in asr.py

Loading...

Alert icon
0 / 00Unsaved Playlist Return to active list
    1. Your queue is empty. Add videos to your queue using this button:
      or sign in to load a different list.
    Loading...Loading...Saving...
    • Clear all videos from this list
    • Learn more