You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
text-output
About this tag
The text-output tag on WindowsForum.com covers discussions about generating plain text, subtitles, or metadata from audio and video files. A recent thread highlights FFmpeg's new Whisper audio filter, which enables on-device automatic speech recognition (ASR) and can output plain text, SRT subtitles, or JSON metadata directly from the command line. This integration uses whisper.cpp for local processing, supports GPU acceleration and voice-activity detection, and is designed for both batch transcription and live processing. The tag is relevant for users interested in command-line tools, media processing, and local AI-powered transcription workflows on Windows.
FFmpeg is adding a built-in transcription capability powered by OpenAI’s Whisper model: a new whisper audio filter (af_whisper) that brings automatic speech recognition (ASR) directly into FFmpeg’s libavfilter stack and can emit plain text, SRT subtitles, or JSON metadata — all without leaving...