Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> Why Pi-card? > Raspberry Pi - Camera Audio Recognition Device.

Missed opportunity for LCARS - LLM Camera Audio Recognition Service, responding to the keyword "computer," naturally. I guess if this ran elsewhere from a Pi, it could be LCARS.



Pi-C.A.R.D is perfect. Read it 100% as Picard, and more recognizable that LCARS.


Just configure it to respond to "Computer" and you're good to go.


The wake word detection is an interesting problem here. As you can see in the repo, I have a lot of mis-heard versions of the wake word in place, in this case being "Raspberry". Since the system heats up fast you need a fan, and with the microphone directly on a USB port next to the fan, I needed something distinct, and computer wasn't cutting it for this.

Changing the transcription model to something a bit better or moving the mic away from the fan could help this happen.


Have a look at the openWakeWord model which is especially built for detecting wakewords in a stream of speech.


"Number One" would be my code word...


And finally saying "make it so" to make the command happen.


"Engage" to order some travel solution?


Captain might be funnier.

Or just use the mouse.


It's just not very realistic - if you think you can give orders to your captain, you'll be out of Starfleet in no time!


How quaint.


As a professional technology person I say “computer” about 1megatoken per day


Yeah, but how often do you say "computer" in a querying/interrogatory tone?

That's a perfect opportunity to get better at cosplaying a Starfleet officer.

(Seriously though, a Federation-grade system would just recognize from context whether or not you meant to invoke the voice interface. Federation is full of near-AGI in simple appliances. Like the sliding doors that just know whether you want to go through them, or are just passing by.)


While totally true it’s not a good reason to use it as a wake word in 2024 with my raspberry pi voice assistant ;-)


This is why we can't have nice LCARS things: https://en.wikipedia.org/wiki/LCARS#Legal


Or LLM Offline Camera, User Trained Understanding Speech

LOCUTUS


s/Offline/Online/ and make sure it has all the cloud features enabled, so you and your friends and loved ones can become one with the FAANG collective.


It should be really something like Beneficial Audio Realtime Recognition Electronic Transformer.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: