What are people hooking up to their HAVpe’s for good sounding speakers? Obviously Big brands like Bose kill it, but any cheap alternatives with good sound for whole house audio?

  • ikidd@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 days ago

    I do OK with faster-whisper for transcribing, and I built a wyoming container for pocket-tts that does pretty good local TTS but I am running it as a docker on a ryzen machine (no gpu) so YMMV. Pocket-TTS seems better than Piper IMO, it’s certainly faster for local TTS if all you’re using is CPU.

    I’ve been looking for a MCU that has enough oomph to do some noise cancellation onboard and I ordered up a couple of these in the hopes that the onboard NPU would be useful for that. It also has a speaker output and onboard mic. Price was right for 8GB of EMMC and 256MB of ram.

    • hendrik@palaver.p3x.de
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      Nice, thanks for the link! I wasn’t aware of that. Sadly as with all shiny new things it doesn’t fit all my requirements… I’d really like to speak to my house in my native language. But I figure English will do. I’m gonna try that.

      Not sure if an ESP32-S3 is fast enough for more advanced DSP plus the rest of an voice assistant. At least I found some ESP32 libraries with noise reduction, echo cancellation… There is the ESP-ADF and a project called ESP32-SpeexDSP. But I didn’t try that yet. The Rockckip / Luckfox development board looks nice as well. A Cortex-A7 and a few hundred megabytes of memory might come in handy. And whatever the NPU does. But I don’t have a clue what kind of software and libraries we got for embedded Linux or custom processing units.

      Anyway. I think the production-grade stuff mostly uses multiple microphones and a combination of beamforming and echo cancellation. I got 4 inmp441 microphones here. But I lack the software/libraries to tinker with that kind of signal processing.