A downloadable tool for Windows

Download Now (name your own price)

Speech-to-text... kinda. Your audio, semi-understood.

📄 Project Description:

Tired of transcribers that try too hard? Enter Vosk Transcriber: Maybe Edition—your new best drunk buddy in the realm of speech-to-text. Powered by the wonderfully unpredictable Vosk engine, this tool takes your audio files and attempts to transcribe them. The results? Questionable. The experience? Amazing.

Whether you need:

  • A broken-telephone version of your podcast,

  • A lo-fi transcript for artistic reasons,

  • A character who sounds weird and speaks gibberish that still sort of makes sense,

  • Or just want to wonder at what your voice “sounds like” to a confused algorithm.

Vosk Transcriber: Maybe Edition is here for it. 

It's an automatic batch tool, so it'll handle many files in bulk. Toss in your .mp3s or .wavs, pick a pre-included Vosk model (40 MB "Small" = faster chaos, 128 MB "Large" = slightly more coherent chaos), and let the transcription do its thing, fully automated.

The good news: it technically works.
The better news: you might even get something useful out of it.
The best news: it will do its absolute best to do a half-assed job!

(In all fairness I'm using the weakest models on purpose for comedy, Alpha Cephei pls don't kill me, my PC sucks and I wasn't able to run the bigger models that are actually good.)

🛠️ How to Use:

  1. Drop your audio files into the "put your audio files here" folder.

  2. Run 'Vosk Transcriber, Maybe Edition.exe'.

  3. Choose your preferred Vosk model when prompted.

  4. Wait for the "magic" to unfold.

  5. Find your half-baked transcriptions in the "transcriptions" folder.
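
For the curious, the steps above can be roughly sketched in Python with the open-source `vosk` package. This is not the tool's actual source, just a minimal guess at what a batch loop like this could look like. The folder names are borrowed from the steps above, the helper names (`find_wav_files`, `transcript_path`, `transcribe_wav`, `transcribe_folder`) are made up for illustration, and the sketch only handles 16-bit mono .wav files (an .mp3 would need converting first).

```python
import json
import wave
from pathlib import Path

# Folder names borrowed from the steps above; treat them as assumptions.
INPUT_DIR = Path("put your audio files here")
OUTPUT_DIR = Path("transcriptions")


def find_wav_files(folder: Path) -> list[Path]:
    """Return the .wav files in `folder` (suffix matched case-insensitively), sorted."""
    return sorted(
        p for p in folder.iterdir() if p.is_file() and p.suffix.lower() == ".wav"
    )


def transcript_path(audio_file: Path, out_dir: Path) -> Path:
    """Map 'clip.wav' to '<out_dir>/clip.txt'."""
    return out_dir / (audio_file.stem + ".txt")


def transcribe_wav(audio_file: Path, model) -> str:
    """Feed one 16-bit mono PCM WAV file through a Vosk recognizer."""
    from vosk import KaldiRecognizer  # lazy import; needs `pip install vosk`

    with wave.open(str(audio_file), "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError(f"{audio_file.name}: expected 16-bit mono PCM")
        rec = KaldiRecognizer(model, wf.getframerate())
        pieces = []
        while True:
            data = wf.readframes(4000)
            if not data:
                break
            # AcceptWaveform returns True when an utterance has been finalized.
            if rec.AcceptWaveform(data):
                pieces.append(json.loads(rec.Result()).get("text", ""))
        # Flush whatever the recognizer is still holding on to.
        pieces.append(json.loads(rec.FinalResult()).get("text", ""))
    return " ".join(p for p in pieces if p)


def transcribe_folder(model_dir: str) -> None:
    """Transcribe every .wav in INPUT_DIR and write one .txt per file."""
    from vosk import Model  # lazy import; needs `pip install vosk`

    model = Model(model_dir)  # path to an unpacked Vosk model folder
    OUTPUT_DIR.mkdir(exist_ok=True)
    for audio_file in find_wav_files(INPUT_DIR):
        text = transcribe_wav(audio_file, model)
        transcript_path(audio_file, OUTPUT_DIR).write_text(text, encoding="utf-8")
```

Calling `transcribe_folder("path/to/your/vosk-model")` with an unpacked model folder would then leave one .txt next to each input name in the "transcriptions" folder. Loading the model once and reusing it for every file is what keeps a bulk run reasonably quick.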

Bonus: If you want more chaos, use noisy or mumbly audio—it gets... positively incomprehensible. 💩👌

It can run on fairly lightweight hardware (less than 1GB of VRAM for the pre-included Vosk models), it's relatively fast, and it doesn't take as much hard drive space as something like OpenAI's Whisper.

Works on Windows 7 and above.

If you want good transcriptions, use this instead:

In all seriousness, if you want good quality transcriptions, I DO actually have a better option that isn't a meme. It works really well and gives great results. 

Get it here: Whisper Batch Transcriber (FREE)
https://reactorcore.itch.io/whisper-batch-transcriber

(However, it does need at least 2GB of GPU VRAM for the base version, or even 6GB of GPU VRAM if using the 'whisper large v3 turbo' model, so make sure your computer meets the system requirements before downloading it!)

(Alternatively, you can get best-in-class Speech-to-Text at https://elevenlabs.io for about $5/mo per 12 hours of audio (as of 2025 Q2). That way you don't need a super powerful PC to transcribe your audio.)

Support My Work:

If you enjoyed this release, please buy me an orange to fuel me: https://buymeacoffee.com/reactorcoregames

Or join my Patreon for games, assets, design knowledge and tool recommendations: https://www.patreon.com/ReactorcoreGames

Check out my Itch.io page and follow me there to know when I release cool new stuff! https://reactorcore.itch.io/

All my links - I make games, software, assets, lego mechs, AI art and more: http://www.reactorcoregames.com

Join my Discord server to discuss my projects and get sneak peeks: https://discord.gg/UdRavGhj47 

Another way to help me is to share my things with your friends, school, work, family or on social media. Every bit of visibility helps a lot! 

Enjoy! 

- Reactorcore


Updated: 13 days ago
Published: 16 days ago
Status: Released
Category: Tool
Platforms: Windows
Author: Reactorcore
Tags: Audio, broken, chaotic, Experimental, Funny, glitch, maybe, speech, text, transcriber

Download

Click download now to get access to the following files:

Vosk Transcriber, Maybe Edition.zip 240 MB
