Send .wav files to other ROS module

asked 2016-03-05 06:23:20 -0600

steinaak
151 ●3 ●4 ●4

Hi!

I am TOTALLY new in ROS. I have a package consisting of two ROS modules where the first module takes input from a microphone and saves it as a ".wav" file. I want this module to publish this ".wav"-file to a topic so that the other ROS module is able to receive it(subscribe the topic). How do I do this? I have been looking at the ROS package called audio_common, but I cant figure out how to do it. Do I need to convert the ".wav"-file into bytes and stuff before sending? I have no idea!

Any advices would help here!

Comments

Are you trying to stream audio, or are you trying to capture short bursts? What are you trying to do with the audio samples in the receiving node?

ahendrix ( 2016-03-05 14:06:59 -0600 )edit

Hey! Now, the microphone module captures small bursts(.wav files with duration up to 10 seconds). The other module will receive this audio(if I am able to solve that problem) and make a call to an API to translate the speech into text. What message type should I use? And how to do it?

steinaak ( 2016-03-06 02:53:43 -0600 )edit

add a comment

answered 2016-03-06 03:20:39 -0600

ahendrix

47576 ●179 ●367 ●662 http://namniart.com/

There isn't a good audio transport convention in ROS. audio_common_msgs provides the AudioData message, but there's no documented convention about what data it's supposed to carry, and no way to indicate which format the data is in. audio_capture usually uses it for transporting chunks of MP3 streams, but you could use it for wav data too. If you need any kind of metadata with your audio, you should create your own message.

In practice, if you want to do speech recognition I'd recommend the ROS pocketsphinx package. It's what most ROS users use for speech recognition, and it's reasonably well maintained. The only place where it may not work is if you want to do the audio capture and speech processing on two different computers.

edit flag offensive delete link

Comments

Thank you very much for your reply! I have had a look at the pocketsphinx package already, but it seems like you have to build the vocabulary of words to recognize yourself? Or does the pocketsphinx include a vocabulary so it is able to recognize speech already?

steinaak ( 2016-03-06 05:50:11 -0600 )edit

There are two varieties of speech recognition - dictation and vocabulary-based recognizers. Vocabulary-based recognizers are usually good for commands. Dictation recognizers use a much larger vocabulary are are usually good for "speech typing".

ahendrix ( 2016-03-06 15:21:07 -0600 )edit

Thank you! Is there a dictation-recognizer in pocketsphinx that works well?

steinaak ( 2016-03-07 02:37:11 -0600 )edit

I'm not aware of anyone using pocketsphinx for dictation.

ahendrix ( 2016-03-07 12:10:37 -0600 )edit

add a comment

Send .wav files to other ROS module

Comments

1 Answer

Comments

Question Tools

Stats

Related questions

Send .wav files to other ROS module edit

Comments

1 Answer

Comments

Question Tools

Stats

Related questions

Send .wav files to other ROS module