/

Transcribe Audio Files with Open-Source Whisper Models

Last updated

4/22/2025

Share

Audio File

Transcript

Made by

Partner

By

EleutherAI

Transcribe Audio Files with Open-Source Whisper Models

This Blueprint shows you how to transcribe audio files using open-source Whisper models via Speaches, a self-hosted server that mimics the OpenAI Whisper API. This local-friendly setup offers a privacy-preserving alternative to commercial APIs. Great for transcribing sensitive or private audio data. This Blueprint includes setup instructions to help you get started quickly on your own data. Made in collaboration with EleutherAI.

Time

15 min

Complexity

Low

Medium

High

Status

Stable

Contributors

Tags

Speech-to-Text

Data Extraction

License

Preview this Blueprint in action

Hosted demo

Hosted Demo

Drag the corner to resize

Step by step walkthrough

Tools used to create

Trusted open source tools used for this Blueprint

Speaches is used to transcribe audio files via self-hosted server that mimics the OpenAI Whisper API.

Docker image is used as an option to run Speaches.

Gradio used to build a simple user interface that lets you upload audio files and see their transcriptions.

Choices

Insights into our motivations and key technical decisions throughout the development process.

No items found.

Ready? Try it yourself!

System Requirements

OS: Linux, macOS, Windows (WSL), Python 3.10 or higher, Docker, Minimum RAM: 16GB, Disk space: 40GB

Help Documentation

Detailed guidance on GitHub walking you through this project installation.

Discussion Points

Get involved in improving the Blueprint by visiting the GitHub Blueprint issues.

Explore Blueprints Extensions

See examples of extended blueprints unlocking new capabilities and adjusted configurations enabling tailored solutions—or try it yourself.

Want to build your own Blueprints?

See our guidelines for building a top-notch Blueprint.

Must-haves

Open-source models and tools usage

README, pyproject.toml, and organized folder structure

Demo app (Streamlit or Gradio) or jupyter notebook

Config file for easy customization

CLI support

Nice-to-haves

CPU compatibility for most local setups

Google Colab notebook option

PyPI package availability

Dockerfile for the demo app

Diagram of the Blueprint in the README

Setup and guidance docs using mkdocs

Github Template Repo