Voice commands / speech to and from robot? [closed]

asked 2011-02-21 14:48:22 -0500

evanmj
250 ●5 ●7 ●9

I have used the sound_play package with festival to synthesize voices to make my robot "talk", but I would also like to be able to command the robot by voice.

Basically, I feel like someone has used a tool like CMU Sphinx with ROS, but I am unable to find any examples.


Closed for the following reason: the question is answered, right answer was accepted by SL Remy
close date 2017-07-21 00:33:44.806765


11

answered 2011-02-21 14:55:31 -0500

Eric Perko

8406 ●73 ●90 ●144 http://ericperko.com/

updated 2011-04-06 06:20:37 -0500

It's quite experimental and definitely not documented, but we have been using PocketSphinx to do speech recognition with ROS. See the cwru_voice package for source.

If you run the voice.launch file (after changing some of the hardcoded model paths appropriately in whichever node it launches), you should be able to get certain keywords out on the "chatter" topic. As an example, voice.launch should recognize a command to "Open the Door" or "Go to the hallway" and output a keyword on the chatter topic. If you do try it out and have problems, let me know as you would be the first outside our lab to try it that I know of.
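For anyone wiring this up, here is a minimal sketch of a listener for those keywords. It assumes the "chatter" topic carries std_msgs/String messages, and the keyword strings and command names are invented for illustration; check what your recognizer node actually publishes before relying on any of it.

```python
# Sketch only: maps recognized keywords to robot commands.
# The keywords and command names below are hypothetical examples.
KEYWORD_TO_COMMAND = {
    "door": "open_door",
    "hallway": "goto_hallway",
}


def command_for(keyword_text):
    """Return the command for a recognized keyword, or None if unknown."""
    key = keyword_text.strip().lower()
    return KEYWORD_TO_COMMAND.get(key)


def run_node():
    """Subscribe to 'chatter' and log the matching command (needs ROS)."""
    # Imported here so the mapping above can be tested without ROS installed.
    import rospy
    from std_msgs.msg import String  # assumed message type for "chatter"

    def on_keyword(msg):
        command = command_for(msg.data)
        if command is None:
            rospy.logwarn("Unrecognized keyword: %r", msg.data)
        else:
            rospy.loginfo("Keyword %r -> command %s", msg.data, command)

    rospy.init_node("voice_command_listener")
    rospy.Subscriber("chatter", String, on_keyword)
    rospy.spin()
```

Call run_node() from a script inside a ROS package once the recognizer is running. Keeping the lookup in a plain function makes it easy to extend the table without touching the ROS code.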

Stanford also has a speech package in their repository. EDIT: Thanks to @fergs for finding the Stanford package.

UPDATE: Make sure to take a look at Scott's answer below for a nice tutorial and demo code for getting speech recognition up and running for your own uses.


Comments

Thanks. I'll give it a go when I get a chance. I found another implementation of some sort here: http://www.ros.org/wiki/ua_language

evanmj ( 2011-02-21 15:13:16 -0500 )

I hadn't seen that package before. Thanks for the info.

Eric Perko ( 2011-02-21 15:18:45 -0500 )

Here's the sail-ros-pkg version you were probably thinking about: https://sail-ros-pkg.svn.sourceforge.net/svnroot/sail-ros-pkg/trunk/semistable/audio/speech/

fergs ( 2011-02-22 03:38:31 -0500 )

7

answered 2011-04-05 17:20:55 -0500

Scott
693 ●33 ●45 ●53

It took me a bit longer to get this done and documented than I thought it would, but as promised, here is the tutorial and files for how to do basic robot speech recognition based on pocketsphinx. It also includes a handy wav file player based on sfml and derived from Garratt Gallagher's kinect piano playing code. The wav/audio player can also be used separately from the speech recognition code as a convenient and dependable way to play audio files. Instead of uploading the tutorial here, I have created a google.code page. You will find the code samples under the downloads page and the tutorial details on the wiki page. The link to the tutorial home is at...

http://code.google.com/p/ros-pocketsphinx-speech-recognition-tutorial/

If anyone creates some great new code derived from this and extends or improves this work, please create a google.code site of your own (very easy) and post the link to your work and code samples here so others can benefit from it as well. Best Regards, -Scott


Comments

Cool! One thing I see is that you might want to include a link to the actual cwru_voice package. The demo script still requires it (the roslib.load_manifest("cwru_voice") line), so you should either create your own package to depend on (and keep the script in) or add a note about getting cwru_voice.

Eric Perko ( 2011-04-05 17:30:12 -0500 )

I'm wondering if you tried using http://www.ros.org/wiki/sound_play to play your sound files and found something you didn't like that made you go with the wave player you wrote?

Eric Perko ( 2011-04-05 17:30:19 -0500 )

Hi, fergs wrote a program without the GUI that I found really nice when coupled with my script, which is really only the if/else statement that publishes to speak and chatter. I found I could let it run continuously; when it has a match, the robot will speak.

Peter Heim ( 2011-04-05 22:48:19 -0500 )

Here is the link to fergs's program (http://code.google.com/p/albany-ros-pkg/source/browse/#svn%2Ftrunk%2Frharmony%2Fpocketsphinx). I will post a link to my code later.

Peter Heim ( 2011-04-05 22:51:43 -0500 )

Thanks for the reply Eric. Yes, I tried sound_play and I could get it to do the "computer speech", which is the say.py code, but when I tried to use the play.py code, which is the section that should play a wav, it would just make a quick click sound and not play right. I worked on that for about two days researching why it was doing that, found a few suggestions that did not work, and then dug into Garratt's code for his piano demo to see how he did it and stumbled across the

Scott ( 2011-04-05 23:48:26 -0500 )

sfml code and found that to be more dependable for me.

Scott ( 2011-04-05 23:49:42 -0500 )

Reminder: For those that like the tutorial I created, please don't forget to vote for it! If people find this useful I will post tutorials on other topics as well. (Thanks, Scott)

Scott ( 2011-04-06 04:59:24 -0500 )

5

answered 2011-04-06 12:20:36 -0500

fergs

13902 ●59 ●157 ●196 http://www.robotandchisel.com

We released a pocketsphinx package at Albany several weeks ago:

http://www.ros.org/wiki/pocketsphinx

This is basically the same gstreamer demo, but we've removed the GUI (it now just does continuous diction, although we've also added ROS services to start/stop the recognizer), added parameters for setting language model and dictionary, and added rosdep configurations so that you can install pocketsphinx itself using the rosdep tool. Parameter names and topics are listed in the ROS wiki page.
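Since setting the model paths seems to trip people up, here is a rough sketch of building the launch command from Python. The parameter names lm and dict and the node script name recognizer.py are assumptions from memory, not confirmed by this thread, so check them against the wiki page before use. The _name:=value form is the standard rosrun syntax for private parameters.

```python
def recognizer_command(lm_path, dict_path, node="recognizer.py"):
    """Build a rosrun command that passes model paths as private parameters.

    The node name and the parameter names 'lm' and 'dict' are assumptions;
    verify them on the package wiki page.
    """
    return [
        "rosrun", "pocketsphinx", node,
        "_lm:=" + lm_path,       # language model file (assumed parameter name)
        "_dict:=" + dict_path,   # pronunciation dictionary (assumed name)
    ]


if __name__ == "__main__":
    # Placeholder paths; substitute the files from the Sphinx Knowledge Base Tool.
    cmd = recognizer_command("/path/to/sample.lm", "/path/to/sample.dic")
    print(" ".join(cmd))
```

The printed line can be pasted into a terminal, or the list can be handed to subprocess.call on a machine with ROS sourced.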


Comments

I am new to ROS and have been using the pocketsphinx package that you have mentioned above. I have managed to install it and also get a sample lm and dic file through the Sphinx Knowledge Base Tool. However, I am unable to set the path to it. Please help.

dexter05 ( 2016-08-25 22:40:35 -0500 )

3

answered 2017-03-02 10:58:15 -0500

gorinars
31 ●2 ●2

We recently provided a very simple example of using pocketsphinx-python to control turtlebot.

Compared to similar projects that I've found (if I missed some better ones, please let me know) so far the advantages are:

removed the GStreamer dependency
supports the latest CMU Sphinx decoder (pocketsphinx-5prealpha) with its latest features and models for several languages
key phrase spotting mode that allows continuous listening and filters out out-of-vocabulary words and noise
simple code (one Python script) and a tutorial for beginners
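For the key phrase spotting mode, pocketsphinx can read a keyword list file with one phrase per line followed by a detection threshold between slashes. Below is a small sketch that generates such a file body. The phrases and threshold values are placeholders chosen for illustration, not taken from the example project, and thresholds normally need tuning per phrase.

```python
def keyphrase_list(thresholds):
    """Return keyword-list text: one 'phrase /threshold/' entry per line.

    `thresholds` maps each phrase to its threshold, given as a string so the
    value is written exactly as typed (for example "1e-20").
    """
    lines = []
    for phrase, threshold in sorted(thresholds.items()):
        lines.append("{} /{}/".format(phrase, threshold))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical commands and thresholds, for illustration only.
    print(keyphrase_list({"stop": "1e-20", "go forward": "1e-30"}), end="")
```

Every word in a phrase still has to exist in the pronunciation dictionary that the decoder loads.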

I'll be happy for any suggestions on how to integrate this properly into the ROS community.

Comments

I tried this on my turtlebot and it worked for me! Thank you! The recognition is pretty poor, though I'm sure it can be improved.

danie11am ( 2017-04-28 19:16:55 -0500 )

2

answered 2011-02-27 04:37:57 -0500

Scott
693 ●33 ●45 ●53

Excellent. Thanks for the information. I will check out the link and post back with tips for others or problems I run into once I get a chance to dig in and try it. Much appreciated. Best Regards, -Scott

2

answered 2011-02-25 07:12:50 -0500

Peter Heim

215 ●6 ●8 ●17

here is a link to the online tool that will generate a dic file link text

peter


2

answered 2011-02-22 19:30:34 -0500

KoenBuys
2314 ●21 ●35 ●56 http://people.mech.kul...

TUM also has a speech recognition package.

2

answered 2016-06-03 01:32:31 -0500

Kei Okada

1186 ●29 ●52 ●57

For speech recognition, you can also use the following ROS nodes:

web based -> https://github.com/tork-a/visualizati...

and

android based -> https://play.google.com/store/apps/de...


2

answered 2011-02-24 08:27:14 -0500

Scott
693 ●33 ●45 ●53

Thanks for the post. Very cool. I was able to get this to work as well. I tried to update the dictionary with a few additional words. I want to be able to say TURN LEFT, TURN RIGHT, STOP, BACK UP, and some other basic directional commands. Any suggestions for how to update the code to allow for that would be appreciated.

Here is a cut and paste of what I added to the 1495.dic file.

TURN T ER N

LEFT L EH F T

RIGHT R IY T

STOP S T AH P

Thanks, -Scott
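A quick sanity check for hand-written dictionary entries like these can be sketched as below, assuming the dictionary uses the 39-phone CMU set. It only confirms that each symbol is a known phone; it cannot tell whether the pronunciation is right. For comparison, as far as I know the CMU Pronouncing Dictionary gives RIGHT as R AY T and STOP as S T AA P, so those two entries may be worth double-checking.

```python
# The 39 phones of the CMU Pronouncing Dictionary (stress markers omitted).
CMU_PHONES = set(
    "AA AE AH AO AW AY B CH D DH EH ER EY F G HH IH IY JH K L M N NG "
    "OW OY P R S SH T TH UH UW V W Y Z ZH".split()
)


def check_entry(line):
    """Return the phones in a 'WORD PHONE ...' entry that are not CMU phones."""
    parts = line.split()
    if len(parts) < 2:
        raise ValueError("expected 'WORD PHONE [PHONE ...]': %r" % line)
    return [phone for phone in parts[1:] if phone not in CMU_PHONES]


if __name__ == "__main__":
    entries = ["TURN T ER N", "LEFT L EH F T", "RIGHT R IY T", "STOP S T AH P"]
    for entry in entries:
        unknown = check_entry(entry)
        print(entry.split()[0], "unknown phones:", unknown)
```

An empty list means every symbol is valid, which is a necessary check before the decoder will load the file but says nothing about recognition quality.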


Comments

I'll have to check with my colleague on some of this. I believe there is a tool online somewhere that he uses to generate the models for speech recognition. When I find it, I'll update my answer with that info. You could take a look at the "motoric.launch" file for a "verbal joystick" interface.

Eric Perko ( 2011-02-24 09:00:30 -0500 )

1

answered 2011-02-22 18:54:06 -0500

Peter Heim

215 ●6 ●8 ●17

I just tried it and it works fine. The voice recognition works for both me and my 9 year old son. I previously used voice recognition with the leaf robot (MS sapi5 ?); this works just as well for my voice and much better for my son's voice.
