[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

andzejsp · 2024-08-10T09:54:36Z

Is your feature request related to a problem? Please describe.
Im looking for a way to use the pc to control my home assistant using voice controls STT and receive back responses TTS and jingle on the pc (media player)

Describe the solution you'd like
Similar to what ESPhome but with pc power. ESP devices are weak, unreliable. PC is much stable and in my case always running. I would like for it to record mic while its on to check for openwakeword, if wake word is triggered then it sends a jingle to my speakers that are connected to pc and starts listening for my commands. After the command is done its sent to assist and then it returns approval or something to the speakers on the pc. I would use this to for example lower volume on the pc when something is playing or interact with home assistant when im there.

Describe alternatives you've considered
No alternatives have been found so far.

Additional context
This would use the existing devices and would give more quality of life to people with disabilities without buying more unstable devices like ESP, raspberry etc..

joshuar · 2024-08-12T03:55:52Z

Hey @andzejsp that is a super interesting idea.

So from what I am understanding:

You want to "talk" to the device running Go Hass Agent. Something like "hey agent pay the moody blues on the lounge speaker".
Go Hass Agent converts your speech to a command.
Go Hass Agent sends the command text to Home Assistant.
Home Assistant does its magic to start playing the moody blues on the lounge speaker.
Home Assistant sends the results back to Go Hass Agent.
Go Hass Agent gives you some kind of acknowledgement.

From what is available, I think I can do the above, but I'd need to look into some kind of text to speech engine (which shouldn't be a big hurdle).

The wake word functionality might be trickier. I'll have to look into that.

I think this all is possibly doable. It's a very cool idea!

andzejsp · 2024-08-12T04:12:44Z

Hey, dont go spending your time inventing new stuff. There are already things figured out for this.

As for wake word - Wyoming satellite
Home assistant has Wyoming protocol integration meaning STT AND TTS capabilities.

As for media player i found that you can use vlc with telnet to send TTS responses.

Im still Jerry rigging these together with my limited coding skills.

https://github.com/rhasspy/wyoming-satellite

https://www.home-assistant.io/integrations/wyoming/

https://alanedwardes.com/blog/posts/vlc-remote-control-home-assistant/

Im struggling with Wyoming satellite because of python intricacies..

andzejsp · 2024-08-17T14:32:45Z

Actually vlc is not reliable media player. MPD (media player demon) server is much more reliabe (on arch) but i havent had luck setting it up on popos. It set up alright but no sound is playing when i send audio - tts - from HA to MPD on the popOS device.

And i have set up that wyoming satellite on arc too - works great in testing but again.. have issues on popOS :D...

andzejsp added the enhancement New feature or request label Aug 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

andzejsp commented Aug 10, 2024

joshuar commented Aug 12, 2024

andzejsp commented Aug 12, 2024

andzejsp commented Aug 17, 2024

[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

Comments

andzejsp commented Aug 10, 2024

joshuar commented Aug 12, 2024

andzejsp commented Aug 12, 2024

andzejsp commented Aug 17, 2024