Quick Start
Deploy your first model in minutes with our step-by-step guide.
Migrate in 5 Minutes
Already have a Docker server? Migrate it to fal with minimal changes.
Examples
Step-by-step tutorials for deploying text-to-image, video, speech, and more.
CLI Reference
Complete reference for fal deploy, fal apps, fal runners, and more.
Key Features
- Instant scaling: Scale from zero to thousands of GPUs on demand
- Pay-per-use: Pay only for the compute you use, with auto-scaling and high availability built in
- Unified framework: Complete solution for running, deploying, and productionizing your AI apps
- GPU access: Access to thousands of H100s, H200s, and other high-performance GPUs
- Full observability: Complete visibility into requests, responses, and latencies (including custom metrics)
- Native clients: HTTP and WebSocket clients that work with both fal-provided models and your own apps