How to capture audio from Pepper Robot with ROS

JayRob101/ROS_Mic_to_Wav

 
 

CAPTURE AUDIOS

  • This node is used to create the dataset for the rz_se package.
  • First, audio must be captured with the robot's microphone. The training set folder should contain one subfolder per emotion to be detected, so that each subfolder stores only recordings of that emotion. The test set folder must follow the same structure.
  • Turn on the robot. You can continue with the next step while it boots.
  • Open Terminal1:
    • Load Anaconda environment:
      • $ source activate environment
    • Load ROS environment:
      • $ source $HOME/catkin_ws/devel/setup.bash
      • $ export ROS_MASTER_URI=http://<ROBOT_IP>:11311
    • Once the robot is up and running:
      • $ cd $HOME/catkin_ws/src/mic2wav/launch/
      • $ gedit mic2wav_params.yaml
      • There are several things to look at:
        • The topic on which the robot publishes the raw audio it captures is already set (/pepper_robot/audio). If another device is used, change the topic through the raw_audio_topic parameter.
        • The destination folder can be changed with the sound_path parameter. In each recording session, repeat several sentences using the same voice emotion. Afterwards, it is up to the tagger to place each recording in the train or test subfolder that matches the emotion used during the session.
        • Other parameters can also be changed: dest_num_channels is the number of channels for the desired conversion, dest_rate is the rate for the desired conversion, and max_iter sets the number of iterations after which the sound is dumped into a WAV file.
        • Once we're done with the tweaking, let's capture some audio!
    • $ roslaunch mic2wav mic2wav.launch
    • At the end of the process there should be as many folders as emotions, all under the same parent folder.
    • Now it's time to split all the audio recordings (the original and the distorted ones) into TRAIN/TEST sets.
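
The TRAIN/TEST split described above can be sketched in Python. This is a minimal sketch, not part of the mic2wav package: the function name `split_dataset`, the `train` and `test` folder names, the `.wav` extension filter and the 20% test fraction are all assumptions made for illustration. It expects one subfolder per emotion under the source folder and copies each recording into the matching emotion subfolder of the train or test set.

```python
import random
import shutil
from pathlib import Path


def split_dataset(source_root, dest_root, test_fraction=0.2, seed=0):
    """Copy <source_root>/<emotion>/*.wav into <dest_root>/train|test/<emotion>/.

    Hypothetical helper: names, layout and defaults are illustrative assumptions.
    Returns a dict mapping each emotion to (train_count, test_count).
    """
    source_root, dest_root = Path(source_root), Path(dest_root)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    counts = {}
    # One subfolder per emotion, as described in the folder structure above.
    emotion_dirs = sorted(p for p in source_root.iterdir() if p.is_dir())
    for emotion_dir in emotion_dirs:
        files = sorted(emotion_dir.glob("*.wav"))
        rng.shuffle(files)
        n_test = int(round(len(files) * test_fraction))
        for index, wav_file in enumerate(files):
            subset = "test" if index < n_test else "train"
            target_dir = dest_root / subset / emotion_dir.name
            target_dir.mkdir(parents=True, exist_ok=True)
            shutil.copy2(wav_file, target_dir / wav_file.name)
        counts[emotion_dir.name] = (len(files) - n_test, n_test)
    return counts
```

For example, `split_dataset("recordings", "dataset")` would read `recordings/<emotion>/*.wav` and fill `dataset/train/<emotion>/` and `dataset/test/<emotion>/`. Files are copied rather than moved, so the original recordings stay untouched.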

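To illustrate how a max_iter-style parameter can control when audio is written to disk, here is a rough sketch using only Python's standard `wave` module. It is not the mic2wav node's actual code: the `WavDumper` class, its method names, the 16-bit sample width and the default values are assumptions. It buffers raw PCM chunks and writes them to a WAV file once `max_iter` chunks have arrived. It does no channel or rate conversion; the chunks are assumed to already match the channel count and rate passed in.

```python
import wave


class WavDumper:
    """Buffers raw PCM chunks and dumps them to a WAV file after max_iter chunks.

    Hypothetical illustration; not the implementation used by mic2wav.
    """

    def __init__(self, path, num_channels=1, rate=16000, max_iter=100,
                 sample_width=2):
        self.path = path
        self.num_channels = num_channels
        self.rate = rate
        self.max_iter = max_iter
        self.sample_width = sample_width  # bytes per sample (assumed 16-bit PCM)
        self.chunks = []

    def add_chunk(self, pcm_bytes):
        """Store one chunk; return True if this chunk triggered a dump."""
        self.chunks.append(pcm_bytes)
        if len(self.chunks) >= self.max_iter:
            self.dump()
            return True
        return False

    def dump(self):
        """Write all buffered chunks to self.path and clear the buffer."""
        with wave.open(self.path, "wb") as wav_file:
            wav_file.setnchannels(self.num_channels)
            wav_file.setsampwidth(self.sample_width)
            wav_file.setframerate(self.rate)
            wav_file.writeframes(b"".join(self.chunks))
        self.chunks = []
```

In a ROS node, `add_chunk` would typically be called from the subscriber callback of the raw audio topic, with the message payload converted to bytes first. That conversion depends on the message type and is left out here.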