/

Align text transcriptions in speech-to-text applications

Last updated

6/23/2025

Share

Audio File

Transcript

Made by

Mozilla.ai

By

Mozilla.ai

Align text transcriptions in speech-to-text applications

This Blueprint enables you to align OpenAI’s Whisper speech‑to‑text models toward user-defined text. By supplying a custom list of phrases (e.g., brand names, technical terms, rare phrases), the model adjusts its transcriptions, improving accuracy for domain‑specific vocabulary, especially when you need reliable recognition of words that aren’t common in everyday language.

In the audio example below, you can compare the transcriptions before and after biasing the model with the text "Dileesh Pothan", which is the correct spelling of a name that does not appear often in the training data of the original model.

Without model alignment: "The rich potent as an Indian film director from Kerala who works in the Malayalam film industry."

With model alignment: "Dileesh Pothan is an Indian film director from Kerala who works in the Malayalam film industry."

Time

10 min

Complexity

Low

Medium

High

Status

Stable

Contributors

Tags

Speech-to-Text

License

Preview this Blueprint in action

Hosted demo

Hosted Demo

Drag the corner to resize

Step by step walkthrough

Tools used to create

Trusted open source tools used for this Blueprint

Whisper BiDec enables to adjust transcriptions and recognize unusual names or phrases with smaller Whisper models.

Gradio used to build a simple user interface that lets you upload audio files and see their transcriptions.

Choices

Insights into our motivations and key technical decisions throughout the development process.

No items found.

Ready? Try it yourself!

System Requirements

OS: Linux, macOS, Windows (WSL), Python 3.10 or higher

Help Documentation

Detailed guidance on GitHub walking you through this project installation.

Discussion Points

Get involved in improving the Blueprint by visiting the GitHub Blueprint issues.

Explore Blueprints Extensions

See examples of extended blueprints unlocking new capabilities and adjusted configurations enabling tailored solutions—or try it yourself.

Want to build your own Blueprints?

See our guidelines for building a top-notch Blueprint.

Must-haves

Open-source models and tools usage

README, pyproject.toml, and organized folder structure

Demo app (Streamlit or Gradio) or jupyter notebook

Config file for easy customization

CLI support

Nice-to-haves

CPU compatibility for most local setups

Google Colab notebook option

PyPI package availability

Dockerfile for the demo app

Diagram of the Blueprint in the README

Setup and guidance docs using mkdocs

Github Template Repo