Home Assistant Voice PE speakers

Padook@feddit.nl · 3 days ago

Home Assistant Voice PE speakers

ikidd@lemmy.world · 2 days ago

I do OK with faster-whisper for transcribing, and I built a wyoming container for pocket-tts that does pretty good local TTS but I am running it as a docker on a ryzen machine (no gpu) so YMMV. Pocket-TTS seems better than Piper IMO, it’s certainly faster for local TTS if all you’re using is CPU.

I’ve been looking for a MCU that has enough oomph to do some noise cancellation onboard and I ordered up a couple of these in the hopes that the onboard NPU would be useful for that. It also has a speaker output and onboard mic. Price was right for 8GB of EMMC and 256MB of ram.

hendrik@palaver.p3x.de · 2 days ago

Nice, thanks for the link! I wasn’t aware of that. Sadly as with all shiny new things it doesn’t fit all my requirements… I’d really like to speak to my house in my native language. But I figure English will do. I’m gonna try that.

Not sure if an ESP32-S3 is fast enough for more advanced DSP plus the rest of an voice assistant. At least I found some ESP32 libraries with noise reduction, echo cancellation… There is the ESP-ADF and a project called ESP32-SpeexDSP. But I didn’t try that yet. The Rockckip / Luckfox development board looks nice as well. A Cortex-A7 and a few hundred megabytes of memory might come in handy. And whatever the NPU does. But I don’t have a clue what kind of software and libraries we got for embedded Linux or custom processing units.

Anyway. I think the production-grade stuff mostly uses multiple microphones and a combination of beamforming and echo cancellation. I got 4 inmp441 microphones here. But I lack the software/libraries to tinker with that kind of signal processing.