Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve voice control #411

Open
pnhofmann opened this issue Feb 13, 2019 · 17 comments
Open

Improve voice control #411

pnhofmann opened this issue Feb 13, 2019 · 17 comments

Comments

@pnhofmann
Copy link
Collaborator

Since basic Voice control (#395) is merged (#408), I'm opening this as a discussion thread for any kind of improvements for voice control.

Any help or suggestions welcome!

@pnhofmann
Copy link
Collaborator Author

List of stuff, I would appreciate:

  • Make speech recognition backend select-able (SpeechRecognition supports multiple backends
  • Having a Speech recognition backend, which works offline
  • Voice recorder without native requirements (e. g. using SoundCard, disadvantage: sometimes crashes)

@pnhofmann pnhofmann pinned this issue Jun 5, 2019
@TobiOkanlawon
Copy link

I would like to work on this voice control feature. Please guide me as to where to begin, and also is it a good first issue. Kinda new to all this

@pnhofmann
Copy link
Collaborator Author

pnhofmann commented Jun 27, 2019

Hi!

and also is it a good first issue

Wouldn't know why not.

Please guide me as to where to begin

So obviously we don't write own Speech Recognition - but just use a library. In our case:

https://pypi.org/project/SpeechRecognition/

Voice control is implemented in jarvicli/plugins/voice_control.py. Guess you should just take a look and try to understand the code. Then you could just try bit experimenting, add some lines and check if everything behaves like you expected ;).

@TobiOkanlawon
Copy link

How about us implementing the offline voice control with pocketsphinx. If you are still particular about keeping the SpeechRecognition module then maybe you can implement something to check if there's internet connection and switch accordingly.

What do you think

@pnhofmann
Copy link
Collaborator Author

CMU Sphinx - which contains pocketsphinx - is in fact supported by SpeechRecognition. SpeechRecognition is a meta-module which unites multiple Speech Recognition services.

If you have a good reason to call pocketsphinx directly, that would perfectly ok. But yes: Having offline SpeechRecognition would be great and CMUSphinx was in fact what I had in mind.

@codecakes
Copy link

codecakes commented Oct 21, 2019

Was skimming through issues. Would like to know for which of the following backends, support is built?:

  • CMU Sphinx (works offline)
  • Google Speech Recognition
  • Google Cloud Speech API
  • Wit.ai
  • Microsoft Bing Voice Recognition
  • Houndify API
  • IBM Speech to Text
  • Snowboy Hotword Detection (works offline)

@pnhofmann
Copy link
Collaborator Author

Currently only Google Speech Recognition - Speech Recognition is in a really, really early stage...

@MikeyBeez
Copy link

Use vosk for STT. It's the best and all local.

@MikeyBeez
Copy link

@pnhofmann
Copy link
Collaborator Author

Thanks for the tip! Will definitely take a close look at STT and Juliet

@gordodev
Copy link

gordodev commented Aug 5, 2020

Any update on this?

@pnhofmann
Copy link
Collaborator Author

Any update on this?

Let's hope soon ;).

@44-Unknown
Copy link

I working on jarvis project but I get error of Pyaudio how can I fix it

@pnhofmann
Copy link
Collaborator Author

@44-Unknown
What's does the error message say?

@MikeyBeez
Copy link

pip can't install pyaudio correctly. Use conda instead.

@Akul2010
Copy link

Akul2010 commented Nov 5, 2022

or install pyaudio wheel and pip install the wheel file name

@Veebaa
Copy link

Veebaa commented Jan 7, 2025

I would like to contribute to the voice control feature. What is the latest update on this and where can I help?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants