Knowledgeable privacy aficionados of Lemmy, perhaps you can help.
I’m searching for a U.S. English speech-to-text program for note taking, dictation, and internet searching that runs locally on Windows and doesn’t collect information or send it to the software company or third parties. I’m looking for an easy out-of-the-box option first. If needed, I can explore writing scripts and using an LLM to craft a UI, but I’m not looking for something that would require significant extra building or coding. Ideally it’d be FLOSS and light on compute, but I’m not averse to paying for a solid product that meets the privacy requirement, and moderate compute use is okay as long as it isn’t ludicrously heavy.
Vosk seems like a good option, though in my brief exploration I haven’t found a UI or scripts that make it easy to use.
OpenAI’s Whisper, while very accurate, doesn’t natively support real-time speech to text, though there are some mods that try to address that.
Anything I’m completely missing?
Godspeed, friend.