RAI: Human-Robot Interaction

RAI provides a Human-Robot Interaction (HRI) package that enables communication with your robots. This package allows you to chat with your robot, give it tasks, and receive feedback and reports. You have the following options for interaction:

Voice communication using Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) models
Text communication using Streamlit

Voice communication might be challenging in noisy environments. In such cases, it's recommended to use the text channel.

How it works

General Architecture

The general architecture follows the diagram above. Text is captured from the input source, passed to the Human-Machine Interface (HMI), processed according to the available tools and the robot's rules, and then sent to the output source.
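The flow described above (input source, HMI, output source) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not RAI's actual implementation: the `HMI` class, its `tools` and `rules` fields, and the `run_once` helper are hypothetical names, and matching tools by keyword is a stand-in for whatever logic the real HMI uses.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class HMI:
    """Hypothetical stand-in for the Human-Machine Interface stage."""

    # Tool name -> callable that takes the user's text and returns a reply.
    tools: Dict[str, Callable[[str], str]]
    # Phrases the robot must refuse to act on (a toy model of "rules").
    rules: List[str] = field(default_factory=list)

    def process(self, text: str) -> str:
        lowered = text.lower()
        # Rules are checked first, so a forbidden request never reaches a tool.
        for banned in self.rules:
            if banned in lowered:
                return f"Refused: request conflicts with rule '{banned}'."
        # Dispatch to the first tool whose name appears in the text.
        for name, tool in self.tools.items():
            if name in lowered:
                return tool(text)
        return "No matching tool; please rephrase."


def run_once(
    read_input: Callable[[], str],
    hmi: HMI,
    write_output: Callable[[str], None],
) -> str:
    """Run one turn: input source -> HMI -> output source."""
    text = read_input()
    reply = hmi.process(text)
    write_output(reply)
    return reply


if __name__ == "__main__":
    demo_hmi = HMI(
        tools={"status": lambda _text: "All systems nominal."},
        rules=["disable safety"],
    )
    # Here the input source is a fixed string and the output source is print().
    run_once(lambda: "Please report status", demo_hmi, print)
```

Because the input and output sources are passed in as plain callables, the same loop can serve a microphone and speaker pair or a text box and chat window without changing the processing step.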

Voice Interface

In the voice interface, the input source is a microphone and the output source is a speaker. Speech input is transcribed with either the OpenAI Whisper model (cloud-based, paid) or a local model. Speech output is produced with either OpenTTS (Apache-2.0, depending on the model used) or ElevenLabs (cloud-based, paid).
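Since both the recognition and the synthesis stage have interchangeable backends, one way to picture a voice turn is as two small interfaces around a text-processing step. The sketch below is an assumption-laden illustration: `SpeechToText`, `TextToSpeech`, `EchoASR`, `BytesTTS`, and `voice_turn` are hypothetical names, and the placeholder classes do not call Whisper, OpenTTS, or ElevenLabs; a real backend would wrap the corresponding client behind the same method.

```python
from typing import Callable, Protocol


class SpeechToText(Protocol):
    """Anything that turns recorded audio into text (cloud or local ASR)."""

    def transcribe(self, audio: bytes) -> str: ...


class TextToSpeech(Protocol):
    """Anything that turns reply text into audio (local or cloud TTS)."""

    def synthesize(self, text: str) -> bytes: ...


class EchoASR:
    """Placeholder ASR for the sketch: treats the audio bytes as UTF-8 text."""

    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class BytesTTS:
    """Placeholder TTS for the sketch: encodes the reply text as UTF-8 bytes."""

    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


def voice_turn(
    audio: bytes,
    asr: SpeechToText,
    respond: Callable[[str], str],
    tts: TextToSpeech,
) -> bytes:
    """Run one voice turn: microphone audio -> text -> reply -> speaker audio."""
    text = asr.transcribe(audio)
    reply = respond(text)
    return tts.synthesize(reply)


if __name__ == "__main__":
    # The respond callable stands in for the HMI processing step.
    spoken = voice_turn(b"hello robot", EchoASR(), lambda t: f"Heard: {t}", BytesTTS())
    print(spoken)
```

Keeping the backends behind narrow interfaces means that switching between a paid cloud service and a local model is a configuration choice rather than a change to the conversation loop.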

Text Interface

The text interface is implemented directly in RAI_HMI using Streamlit. The GUI closely follows standard chat-like conversations, with built-in support for tool integration.
