
TVPaint Animation
The digital solution for your professional 2D animation projects.

High-performance inference of OpenAI's Whisper ASR model in C/C++.

whisper.cpp is a C/C++ port of OpenAI's Whisper model, designed for high-performance inference of automatic speech recognition (ASR). It offers a lightweight implementation, making it suitable for integration across diverse platforms. The core implementation is contained in whisper.h and whisper.cpp, utilizing the ggml machine learning library. Key features include Apple Silicon optimization (ARM NEON, Accelerate framework, Metal, Core ML), AVX/VSX intrinsics support, mixed F16/F32 precision, and integer quantization. It supports CPU-only inference, efficient GPU support (NVIDIA, OpenVINO, Ascend NPU, Moore Threads GPU), and a C-style API. Platforms supported include Mac OS, iOS, Android, Java, Linux, WebAssembly, Windows, Raspberry Pi, and Docker. Zero memory allocations at runtime are also a focus, enhancing efficiency.
whisper.
Explore all tools that specialize in perform speech recognition. This domain focus ensures whisper.cpp delivers optimized results for this specific requirement.
Explore all tools that specialize in voice activity detection. This domain focus ensures whisper.cpp delivers optimized results for this specific requirement.
Supports integer quantization (e.g., Q5_0) to reduce memory footprint and improve processing efficiency on suitable hardware.
Leverages Apple's Core ML framework for accelerated inference on Apple Neural Engine (ANE) in Apple Silicon devices.
Enables inference on Intel CPUs and GPUs using the OpenVINO toolkit.
Implements VAD to detect speech segments in audio, reducing unnecessary processing.
Supports mixed F16/F32 precision, balancing accuracy and performance.
Clone the repository: git clone https://github.com/ggerganov/whisper.cpp.git
Navigate to the directory: cd whisper.cpp
Download a Whisper model in ggml format: sh ./models/download-ggml-model.sh base.en
Build the project using CMake: cmake -B build
Compile: cmake --build build -j --config Release
Transcribe audio: ./build/bin/whisper-cli -f samples/jfk.wav
Explore additional models: make -j tiny.en, etc.
All Set
Ready to go
Verified feedback from other users.
"Highly praised for its speed, efficiency, and offline capabilities, especially on Apple Silicon."
Post questions, share tips, and help other users.

The digital solution for your professional 2D animation projects.

Empowering independent artists with digital music distribution, publishing administration, and promotional tools.

Convert creative micro-blogs into high-performance web presences using generative AI and Automattic's core infrastructure.

Fashion design technology software and machinery for apparel product development.

Instantly turns any text to natural sounding speech for listening online or generating downloadable audio.

Professional studio-quality AI headshot generator for individuals and teams.