@rpino it’s perfectly something I was wondering!
I was aware about Rasa, but only for the text messaging, not for the audio, the implementation of the bridge with the speech to text capability is smart.
What is the main “init” command you are using? I’d like to have a init keyword as the “Hey Siri” or “Ok Google” for example (mine would be something similar to “Hey Jarvis” - in honour of Edwin Jarvis the Tony Stark’s AI).
Would be possible to setup this kind of specific command or is the mic in always-on active listening? Or maybe it’s a mix of both and when the right pattern is identified, then it starts?
Another thing I was wondering is the vocal inflection: the Apple assistant uses a pattern recognition on one specific voice (I guess it would be the same for Google) avoiding the reception of the command of non-authorised people.
About the security, in my mind, due the fact that I want also to add some actions (as close the windows for eg) I want to add also another security layer, probably with a biometric check (fingerprint), so that in case of specific command as ", please " it may say something as “Ok, place your finger in the reader to authorise”.
For me the music & lights and other simple actuations as “make a phone call to xxx” are ok, and maybe available from almost everyone (with the basic inflection auth sys), but the most complex ones must require a second level of check.