Last updated: 4/22/2025
Transcribe Audio Files with Open-Source Whisper Models
This Blueprint shows you how to transcribe audio files with open-source Whisper models via Speaches, a self-hosted server that mimics the OpenAI Whisper API. Because the setup runs locally, it offers a privacy-preserving alternative to commercial APIs and is well suited to transcribing sensitive or private audio. The Blueprint includes setup instructions to help you get started quickly on your own data. Made in collaboration with EleutherAI.
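Since Speaches mimics the OpenAI Whisper API, a transcription request can be sketched with nothing but the Python standard library. The example below is a minimal illustration, not the Blueprint's own code: the base URL `http://localhost:8000`, the model id `Systran/faster-whisper-small`, and the helper names `build_multipart` and `transcribe` are all assumptions, and the endpoint path follows the OpenAI convention (`/v1/audio/transcriptions`). Adjust them to match your deployment.

```python
import json
import urllib.request
import uuid
from pathlib import Path

# Assumed values: change these to match your own Speaches deployment.
BASE_URL = "http://localhost:8000"        # assumed address of the local server
MODEL = "Systran/faster-whisper-small"    # assumed model id; use one your server provides


def build_multipart(fields, file_name, file_bytes):
    """Encode text fields plus one file as a multipart/form-data body."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    file_header = (
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        'Content-Type: application/octet-stream\r\n\r\n'
    ).encode()
    parts.append(file_header + file_bytes + b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode())
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def transcribe(audio_path, base_url=BASE_URL, model=MODEL):
    """POST an audio file to an OpenAI-style transcription endpoint and return the text."""
    path = Path(audio_path)
    body, content_type = build_multipart({"model": model}, path.name, path.read_bytes())
    request = urllib.request.Request(
        f"{base_url}/v1/audio/transcriptions",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # OpenAI-style JSON responses carry the transcript in a "text" field.
        return json.load(response)["text"]
```

With the server running, `print(transcribe("meeting.wav"))` would print the transcript of a hypothetical local file named `meeting.wav`. The audio never leaves your machine, which is the point of the self-hosted setup.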
Preview this Blueprint in action
Hosted demo
Step-by-step walkthrough
Tools used to create
Trusted open source tools used for this Blueprint
Choices
Insights into our motivations and key technical decisions throughout the development process.
Ready? Try it yourself!
System Requirements
OS: Linux, macOS, Windows (WSL)
Python: 3.10 or higher
Docker
Minimum RAM: 16GB
Disk space: 40GB
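A quick way to check some of these requirements before installing is a short standard-library script. This is only a sketch: the function name `check_requirements` and its parameters are hypothetical, it checks free disk space at the given path rather than total disk size, and it does not check RAM because the standard library has no portable way to do so.

```python
import shutil
import sys


def check_requirements(min_python=(3, 10), min_free_gb=40, path="."):
    """Return a dict of requirement name -> whether it appears to be satisfied."""
    free_gb = shutil.disk_usage(path).free / 1024**3
    return {
        # Interpreter running this script is at least the minimum version.
        "python": sys.version_info[:2] >= min_python,
        # A `docker` executable is discoverable on PATH (does not verify the daemon is running).
        "docker": shutil.which("docker") is not None,
        # Free space on the filesystem that contains `path`.
        "disk": free_gb >= min_free_gb,
    }


if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print(f"{name}: {'ok' if ok else 'missing or insufficient'}")
```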
Help Documentation
Detailed guidance on GitHub that walks you through installing this project.
Discussion Points
Get involved in improving the Blueprint by visiting the GitHub Blueprint issues.
Explore Blueprint Extensions
See examples of extended Blueprints that unlock new capabilities and adjusted configurations that enable tailored solutions, or try it yourself.