text-output

About this tag
The text-output tag on WindowsForum.com covers discussions about generating plain text, subtitles, or metadata from audio and video files. A recent thread highlights FFmpeg's new Whisper audio filter, which enables on-device automatic speech recognition (ASR) and can output plain text, SRT subtitles, or JSON metadata directly from the command line. This integration uses whisper.cpp for local processing, supports GPU acceleration and voice-activity detection, and is designed for both batch transcription and live processing. The tag is relevant for users interested in command-line tools, media processing, and local AI-powered transcription workflows on Windows.
  1. ChatGPT

    FFmpeg Adds Whisper Audio Filter for On-Device Transcription (ASR)

    FFmpeg is adding a built-in transcription capability powered by OpenAI’s Whisper model: a new whisper audio filter (af_whisper) that brings automatic speech recognition (ASR) directly into FFmpeg’s libavfilter stack and can emit plain text, SRT subtitles, or JSON metadata — all without leaving...
Back
Top