[FEATURE REQUEST] Media player capabilities and voice controls using mic #234

Open
andzejsp opened this issue Aug 10, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@andzejsp

Is your feature request related to a problem? Please describe.
I'm looking for a way to use my PC to control Home Assistant by voice: speech-to-text (STT) for commands, with text-to-speech (TTS) responses and a jingle played back on the PC (acting as a media player).

Describe the solution you'd like
Similar to what ESPHome does, but with the power of a PC. ESP devices are weak and unreliable; a PC is much more stable and, in my case, always running. I would like it to record from the mic while it is on and check for a wake word using openWakeWord. If the wake word is triggered, it plays a jingle on the speakers connected to the PC and starts listening for my command. Once the command is finished, it is sent to Assist, which then returns a confirmation (or similar) to the speakers on the PC. I would use this, for example, to lower the volume on the PC when something is playing, or to interact with Home Assistant when I'm there.
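To make the requested flow concrete, here is a minimal sketch of it as a small state machine. The state names, event names (`wake_word`, `command_done`, `assist_reply`) and action names are hypothetical labels chosen for illustration; they are not part of openWakeWord, Assist, or Go Hass Agent.

```python
from enum import Enum, auto


class State(Enum):
    IDLE = auto()        # mic is open, waiting for the wake word
    LISTENING = auto()   # wake word heard, recording the command
    RESPONDING = auto()  # command sent to Assist, waiting for its reply


# (current state, event) -> (next state, action to perform).
# All event and action names are made up for this sketch.
TRANSITIONS = {
    (State.IDLE, "wake_word"): (State.LISTENING, "play_jingle"),
    (State.LISTENING, "command_done"): (State.RESPONDING, "send_to_assist"),
    (State.RESPONDING, "assist_reply"): (State.IDLE, "speak_reply"),
}


def step(state, event):
    """Return (next_state, action); unknown events change nothing."""
    return TRANSITIONS.get((state, event), (state, None))


def run(events, state=State.IDLE):
    """Feed a sequence of events through the machine and collect the actions."""
    actions = []
    for event in events:
        state, action = step(state, event)
        if action is not None:
            actions.append(action)
    return state, actions
```

Calling `run(["wake_word", "command_done", "assist_reply"])` walks through one full interaction and ends back in `State.IDLE`, ready for the next wake word.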

Describe alternatives you've considered
No alternatives have been found so far.

Additional context
This would use existing devices and improve quality of life for people with disabilities, without buying more unstable devices like ESP boards, Raspberry Pis, etc.

@andzejsp andzejsp added the enhancement New feature or request label Aug 10, 2024
@joshuar
Owner

joshuar commented Aug 12, 2024

Hey @andzejsp that is a super interesting idea.

So from what I am understanding:

  • You want to "talk" to the device running Go Hass Agent. Something like "hey agent, play the Moody Blues on the lounge speaker".
  • Go Hass Agent converts your speech to a command.
  • Go Hass Agent sends the command text to Home Assistant.
  • Home Assistant does its magic to start playing the moody blues on the lounge speaker.
  • Home Assistant sends the results back to Go Hass Agent.
  • Go Hass Agent gives you some kind of acknowledgement.
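As a rough illustration of the "send the command text, get the result back" steps above, here is a sketch that uses Home Assistant's REST conversation endpoint (`/api/conversation/process`). This is only one possible route and not necessarily how Go Hass Agent would do it. The base URL and token are placeholders, the helper names are hypothetical, and the response layout read by `extract_speech` is an assumption that should be checked against your Home Assistant version.

```python
import json
import urllib.request


def build_assist_request(base_url, token, text, language="en"):
    """Build (but do not send) a POST request carrying the command text."""
    url = base_url.rstrip("/") + "/api/conversation/process"
    body = json.dumps({"text": text, "language": language}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            # A long-lived access token created in the Home Assistant profile page.
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_speech(reply):
    """Pull the spoken reply out of the decoded JSON, or None if absent.

    Assumed layout: reply["response"]["speech"]["plain"]["speech"].
    """
    return (
        reply.get("response", {})
        .get("speech", {})
        .get("plain", {})
        .get("speech")
    )


def ask_assist(base_url, token, text):
    """Send the command and return the text that should be spoken back."""
    request = build_assist_request(base_url, token, text)
    with urllib.request.urlopen(request, timeout=10) as response:
        return extract_speech(json.load(response))
```

The returned text would then go to whatever text-to-speech engine produces the acknowledgement.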

From what is available, I think I can do the above, but I'd need to look into some kind of text to speech engine (which shouldn't be a big hurdle).

The wake word functionality might be trickier. I'll have to look into that.

I think this all is possibly doable. It's a very cool idea!

@andzejsp
Author

Hey, don't spend your time inventing new stuff. These things have already been figured out.

As for the wake word: Wyoming Satellite.
Home Assistant has a Wyoming protocol integration, which means STT and TTS capabilities.

As for the media player, I found that you can use VLC with telnet to send TTS responses.
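For reference, a minimal sketch of driving VLC over its telnet interface from a script. It assumes VLC was started with the telnet interface enabled and a password set, and that the server accepts the password followed by the `volume`, `add` and `logout` commands sent in one go without waiting for each prompt. The helper names, host, port and URL are placeholders.

```python
import socket


def build_vlc_session(password, media_url, volume=None):
    """Return the bytes for one session: password, commands, then logout."""
    lines = [password]
    if volume is not None:
        # The volume scale is VLC's own; check what your build expects.
        lines.append(f"volume {int(volume)}")
    lines.append(f"add {media_url}")  # add to the playlist and start playing
    lines.append("logout")            # close the telnet session, leave VLC running
    return ("\n".join(lines) + "\n").encode("utf-8")


def send_to_vlc(host, port, payload, timeout=5.0):
    """Open a TCP connection, send the session bytes, return the first reply chunk."""
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(payload)
        try:
            return conn.recv(4096).decode("utf-8", errors="replace")
        except socket.timeout:
            return ""
```

Something like `send_to_vlc("127.0.0.1", 4212, build_vlc_session("secret", tts_url))` would play a TTS file, where `tts_url` is the audio URL handed over by Home Assistant and 4212 is assumed to be the configured telnet port.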

I'm still jury-rigging these together with my limited coding skills.

https://github.com/rhasspy/wyoming-satellite

https://www.home-assistant.io/integrations/wyoming/

https://alanedwardes.com/blog/posts/vlc-remote-control-home-assistant/

I'm struggling with Wyoming Satellite because of Python intricacies.

@andzejsp
Author

Actually, VLC is not a reliable media player. An MPD (Music Player Daemon) server is much more reliable (on Arch), but I haven't had any luck setting it up on Pop!_OS. It set up alright, but no sound plays when I send audio (TTS) from HA to MPD on the Pop!_OS device.
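For anyone wanting to test MPD outside of Home Assistant, here is a minimal sketch of the plain-text commands that would queue and play a single URI, wrapped in an MPD command list. The helper names are hypothetical, and the bytes would be written to MPD's TCP socket (commonly port 6600, but that depends on your configuration).

```python
def mpd_quote(argument):
    """Quote an argument for the MPD protocol, escaping backslashes and quotes."""
    escaped = argument.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def build_mpd_play(uri):
    """Return the bytes that clear the queue, add one URI, and start playback."""
    commands = [
        "command_list_begin",
        "clear",
        f"add {mpd_quote(uri)}",
        "play",
        "command_list_end",
    ]
    return ("\n".join(commands) + "\n").encode("utf-8")
```

When playback starts but nothing is audible, sending the `outputs` command over the same connection lists the configured audio outputs and whether each one is enabled, which may help narrow down a silent setup.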

And I have set up Wyoming Satellite on Arch too. It works great in testing, but again, I have issues on Pop!_OS :D
