Stop Searching,
Start Building.

The Developer-First Hub for Open-Source AI Workflows

Text Link
Blueprint of the Week
Convert documents to markdown format

This blueprint guides you to convert unstructured documents to Markdown format using the Docling command-line interface.

Document
Markdown
EleutherAI
Preview this Blueprint in action

Highlighted Building Blocks

Explore the open-source resources behind our Blueprints.

Tools
Speaches

Speaches is an OpenAI API-compatible server supporting streaming transcription, translation, and speech generation.

Docling

Docling simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

Ultralytics

Ultralytics provides cutting-edge computer vision models, including YOLO11, enabling developers to integrate real-time object detection, segmentation, and classification into AI applications with minimal effort.

Datasets
Common Voice

A multilingual, crowdsourced collection of voice recordings from Mozilla.

Alpaca-gpt4

This dataset contains English Instruction-Following generated by GPT-4 using Alpaca prompts for fine-tuning LLMs.

Models
OuteTTS-0.1-350M

OuteTTS-0.1-350M is a novel text-to-speech synthesis model.

Qwen2.5-3B-Instruct-GGUF

Qwen2.5-3B-Instruct-GGUF is an instruction-tuned model that generates long-form content, and is optimized for efficient deployment via the GGUF format.

Kokoro-82M

Kokoro is an open-weight TTS model with 82 million parameters.

Text Link
Model Training
Text Link
Synthetic Data Detection
Text Link
Agents
Text Link
Finetune STT with your voice
Text Link
Map Features in OSM with CV
Text Link
Finetune LLM using Federated AI
Text Link
Embedding
Text Link
Federated AI
Text Link
Image Segmentation
Text Link
Object Detection
Text Link
Automatic Speech Recognition
Text Link
Speech-to-Text
Text Link
Query structured documents Q&A
Text Link
Emails
Text Link
Newsletter
Text Link
Podcast
Text Link
Community
Text Link
Events
Text Link
Discord
Text Link
Data Extraction
Text Link
User-Interface
Text Link
Performance Optimization
Text Link
LLM Inference
Text Link
Language Modelling
Text Link
Text-to-Text
Text Link
Text-to-Speech
Text Link
Podcast personalities
Text Link
Document-to-podcast
Text Link
Blueprints
Text Link
Use Cases
Text Link
English
Text Link
General Language
Text Link
Multilingual
Text Link
Finetuning
Text Link
Local AI
Text Link
Federated Learning
Text Link
LLM Integration
Text Link
Model Training
Text Link
Synthetic Data Detection
Text Link
Agents
Text Link
Finetune STT with your voice
Text Link
Map Features in OSM with CV
Text Link
Finetune LLM using Federated AI
Text Link
Embedding
Text Link
Federated AI
Text Link
Image Segmentation
Text Link
Object Detection
Text Link
Automatic Speech Recognition
Text Link
Speech-to-Text
Text Link
Query structured documents Q&A
Text Link
Emails
Text Link
Newsletter
Text Link
Podcast
Text Link
Community
Text Link
Events
Text Link
Discord
Text Link
Data Extraction
Text Link
User-Interface
Text Link
Performance Optimization
Text Link
LLM Inference
Text Link
Language Modelling
Text Link
Text-to-Text
Text Link
Text-to-Speech
Text Link
Podcast personalities
Text Link
Document-to-podcast
Text Link
Blueprints
Text Link
Use Cases
Text Link
English
Text Link
General Language
Text Link
Multilingual
Text Link
Finetuning
Text Link
Local AI
Text Link
Federated Learning
Text Link
LLM Integration
Text Link
Train Model
Text Link
Detect synthetic audio
Text Link
Markdown
Text Link
Audio File
Text Link
Transcript
Text Link
Fetch Posts & Generate Embeddings with LLM
Text Link
Map Features
Text Link
CompVis
Text Link
Federated AI
Text Link
Document
Text Link
Speech
Text Link
Personalized Timeline
Text Link
Algorithm
Text Link
Podcast
Text Link
Document