This tutorial shows how to run multi-GPU training on fal with the `fal.distributed` module. We’ll build a production-ready Flux LoRA training service that uses Distributed Data Parallel (DDP) for efficient multi-GPU training with real-time progress streaming.
For a comprehensive overview of multi-GPU parallelism strategies including DDP and when to use them, see the Multi-GPU Workloads Overview.
Two-App Architecture
This tutorial demonstrates a microservices architecture with two separate apps communicating with each other:
- Preprocessor App (`flux-preprocessor`): Handles image preprocessing, captioning, and VAE/text encoding across multiple GPUs
- Training App (`flux-training`): Handles DDP training with the preprocessed data
Architectural Note: While this workflow could be implemented as a single app, we’re using two separate apps to demonstrate how to orchestrate multiple ML models across microservices. This pattern showcases:
- Independent scaling: Scale preprocessing and training separately based on demand
- Service isolation: Each model stays warm independently (no cold starts)
- Flexible infrastructure: Different GPU types/counts for different workloads
- Inter-service communication: Apps communicate via `fal_client` API calls
Important: For this tutorial, you need to run the preprocessor app first to get its app name (username/uuid format from the `fal run` output), then pass that app name to the training app.
🚀 Try this Example
View the complete source code on GitHub, or clone the repository, then:
Step 1: Run the preprocessor app (in terminal 1).
Step 2: Run the training app (in terminal 2).
Step 3: Submit a training request.
Before you run, make sure you have:
- Authenticated with fal: `fal auth login`
- Activated your virtual environment (recommended): `python -m venv venv && source venv/bin/activate`
- A ZIP file containing training images (10-30 images recommended)
Key Features
Microservices Architecture:
- Two independent apps communicate via API calls (demonstrates orchestrating multiple ML models)
- Preprocessor app runs separately for image preprocessing, captioning, and encoding
- Training app calls the preprocessor via `fal_client.submit()` for clean service separation
DDP Training:
- Each GPU has a full copy of the model wrapped in DDP
- Each GPU processes different batches with automatic gradient synchronization
- Only Rank 0 saves the final LoRA checkpoint
Architecture Overview
This training system demonstrates a microservices architecture with two separate apps communicating via API calls.
What is DDP (Distributed Data Parallel)?
DDP is a data parallelism strategy where:
- Each GPU has a full model copy: All workers have identical model parameters
- Each GPU processes different data: Training data is split across GPUs
- Gradients are synchronized: After backward pass, gradients are averaged across all GPUs
- Parameters stay in sync: All GPUs update with the same averaged gradients
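To make these points concrete, here is a minimal, generic PyTorch DDP sketch (plain `torch.distributed`, not the fal worker classes used later). It assumes the usual process-group environment variables are set and that the dataset yields `(input, target)` pairs; the model and loss are placeholders.

```python
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP
from torch.utils.data import DataLoader, DistributedSampler

def train_ddp(rank: int, world_size: int, model: torch.nn.Module, dataset):
    # Each process (one per GPU) joins the same process group.
    dist.init_process_group("nccl", rank=rank, world_size=world_size)

    # Full model copy on every GPU, wrapped in DDP.
    model = model.to(f"cuda:{rank}")
    ddp_model = DDP(model, device_ids=[rank])
    optimizer = torch.optim.AdamW(ddp_model.parameters(), lr=1e-4)

    # Each GPU sees a different shard of the data.
    sampler = DistributedSampler(dataset, num_replicas=world_size, rank=rank)
    loader = DataLoader(dataset, batch_size=4, sampler=sampler)

    for inputs, targets in loader:
        optimizer.zero_grad()
        outputs = ddp_model(inputs.to(rank))
        loss = torch.nn.functional.mse_loss(outputs, targets.to(rank))  # placeholder loss
        loss.backward()   # gradients are averaged across all GPUs here
        optimizer.step()  # every rank applies the same averaged update

    dist.destroy_process_group()
```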
Code Walkthrough
We’ll walk through the code in the order you’d build it: first the preprocessor (which you can test independently), then the training app.
Part 1: Preprocessor App
The preprocessor runs on separate GPUs and handles image preparation. Let’s start with the app definition (sketched after the attribute list below).
Preprocessor App Configuration
- `machine_type = "GPU-H100"`: Specifies the hardware your app runs on. Here we’re using H100 GPUs for fast preprocessing.
- `num_gpus = 2`: Requests 2 GPUs per runner. Each GPU will process different images in parallel.
- `keep_alive = 300`: Keeps the runner warm for 5 minutes after the last request. Avoids cold starts for subsequent requests.
- `min_concurrency = 0`: Allows runners to scale down to zero when idle (saves costs).
- `max_concurrency = 2`: Allows up to 2 concurrent requests. Additional requests will queue.
- `requirements`: Python packages to install on the runner. Always pin versions for reproducibility.
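Putting the attributes above together, the app class looks roughly like this. The class name and the package version pins are illustrative; follow the repository code for the exact values and for whether each option is set as a class attribute or a class keyword argument.

```python
import fal

class FluxPreprocessorApp(fal.App):
    machine_type = "GPU-H100"   # H100s for fast preprocessing
    num_gpus = 2                # two GPU workers per runner
    keep_alive = 300            # stay warm for 5 minutes after the last request
    min_concurrency = 0         # scale to zero when idle
    max_concurrency = 2         # queue anything beyond 2 concurrent requests
    requirements = [
        # Illustrative pins; always pin versions for reproducibility.
        "torch==2.4.0",
        "diffusers==0.30.0",
        "transformers==4.44.0",
        "huggingface_hub==0.24.0",
    ]
```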
The Setup Function
The `setup()` function runs once when each runner starts. It’s where you load models, download weights, and initialize resources:
- `/data/` directory: This is a persistent, shared volume attached to your runner. Files stored here persist across requests and are shared between the main process and all GPU workers. Perfect for model weights that you don’t want to re-download on every request.
- `snapshot_download(..., local_dir="/data/flux_weights")`: Downloads the Flux model from Hugging Face into the persistent `/data/` volume. The first runner downloads it once, then subsequent runners (and requests) reuse the cached files.
- `DistributedRunner(worker_cls, world_size)`: Creates a runner that orchestrates multiple GPU workers for parallel processing. See API Reference →
  - `worker_cls=FluxPreprocessorWorker`: Your custom worker class that inherits from `DistributedWorker`
  - `world_size=self.num_gpus`: Creates one worker process per GPU (2 workers for 2 GPUs)
- `await self.runner.start(model_path=model_path)`: Starts all GPU worker processes and initializes them. See API Reference →
  - Each worker runs its own `setup()` method to load models onto its assigned GPU
  - The `model_path` keyword argument is passed to each worker’s `setup()` method
  - The call waits for all workers to signal “READY” before returning

Once `setup()` completes, your app is ready to handle requests. The runner stays warm (based on `keep_alive`), so subsequent requests skip this expensive setup.
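A sketch of what this looks like in code, continuing the app class from above. The Hugging Face model id is an assumption, and `FluxPreprocessorWorker` is the worker class implemented in the worker section below.

```python
import fal
from fal.distributed import DistributedRunner
from huggingface_hub import snapshot_download

class FluxPreprocessorApp(fal.App):
    # ... configuration attributes from the previous snippet ...

    async def setup(self):
        # Download Flux weights into the persistent /data/ volume once;
        # later runners and requests reuse the cached files.
        model_path = snapshot_download(
            "black-forest-labs/FLUX.1-dev",      # assumed model id
            local_dir="/data/flux_weights",
        )

        # One worker process per GPU, each running FluxPreprocessorWorker.
        self.runner = DistributedRunner(
            worker_cls=FluxPreprocessorWorker,   # defined in the worker section below
            world_size=self.num_gpus,
        )
        # Starts every worker, forwards model_path to each worker's setup(),
        # and returns once all workers have signaled READY.
        await self.runner.start(model_path=model_path)
```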
Preprocessor Endpoint
The main endpoint handles the full preprocessing pipeline. Because the preprocessor is started on its own with `fal run flux-preprocessor`, you can test it independently:
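For example, from Python (the app id comes from the `fal run` output; the argument names are illustrative and should match the preprocessor’s request model in the repository):

```python
import fal_client

result = fal_client.subscribe(
    "username/preprocessor-app-id",   # username/uuid printed by `fal run flux-preprocessor`
    arguments={
        "images_data_url": "https://example.com/training_images.zip",  # illustrative field
        "trigger_word": "TOK",                                          # illustrative field
    },
)
print(result)
```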
Preprocessor Worker Implementation
The worker runs on each GPU and handles the actual preprocessing. Here’s the implementation:
- `setup(model_path, **kwargs)`: Called once during `runner.start()` to load models. The `model_path` passed to `runner.start()` is received here. API Reference →
- `self.device`: Each worker loads models onto its assigned GPU using `self.device`. Worker 0 uses `cuda:0`, worker 1 uses `cuda:1`, etc.
- `self.rank` and `self.world_size`: Used to split work. Each worker processes every Nth image where N = `world_size`, starting at `rank`.
  - Worker 0 (rank=0): processes images [0, 2, 4, 6, …]
  - Worker 1 (rank=1): processes images [1, 3, 5, 7, …]
- `__call__(**kwargs)`: Receives the payload from `runner.invoke()` as keyword arguments. Processes data and returns results. API Reference →
- `self.rank_print()`: Prints with a rank prefix for debugging. API Reference →
- Only rank 0 returns data: The final result is assembled on rank 0 and returned. Other workers return an empty dict.
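Here is a condensed sketch of such a worker. It only shows VAE encoding with the round-robin split and the rank 0 gather; the real worker also handles captioning and text encoding, and the payload format is an assumption.

```python
import torch
import torch.distributed as dist
from diffusers import AutoencoderKL
from fal.distributed import DistributedWorker

class FluxPreprocessorWorker(DistributedWorker):
    def setup(self, model_path: str, **kwargs):
        # model_path is the value forwarded from runner.start(model_path=...).
        # Each worker loads onto its own GPU: worker 0 -> cuda:0, worker 1 -> cuda:1, ...
        self.vae = AutoencoderKL.from_pretrained(model_path, subfolder="vae").to(self.device)
        self.rank_print("VAE loaded")

    @torch.no_grad()
    def __call__(self, **kwargs):
        # kwargs is the payload dict passed to runner.invoke().
        images = kwargs["images"]   # assumed: list of (C, H, W) image tensors

        # Round-robin split: every world_size-th image, starting at this worker's rank.
        local = {}
        for idx in range(self.rank, len(images), self.world_size):
            pixels = images[idx].unsqueeze(0).to(self.device)
            local[idx] = self.vae.encode(pixels).latent_dist.sample().cpu()
        self.rank_print(f"encoded {len(local)} images")

        # Collect every worker's partial results on rank 0.
        gathered = [None] * self.world_size if self.rank == 0 else None
        dist.gather_object(local, gathered, dst=0)

        # Only rank 0 returns data; the other workers return an empty dict.
        if self.rank != 0:
            return {}
        latents = {k: v for part in gathered for k, v in part.items()}
        return {"latents": latents}
```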
Part 2: Training App
Now that preprocessing works, let’s build the training app.
Training App Setup
The training app setup follows the same pattern as the preprocessor: create a `DistributedRunner`, then call `start()` to initialize all workers. See the DistributedRunner API Reference for details.
Training Endpoint with Preprocessing
The main endpoint calls the preprocessor, then runs training:
- Calls the preprocessor app via `fal_client.subscribe()`
- Downloads the preprocessed `.pt` file
- Passes it to the training worker via `self.runner.invoke()`
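A sketch of that orchestration is shown below (using the async variant of `fal_client.subscribe()`). The request fields, the `preprocessed_url` result key, and the exact awaiting of `runner.invoke()` are assumptions; the repository code is the reference.

```python
import fal
import fal_client
import requests
from pydantic import BaseModel

class TrainRequest(BaseModel):         # illustrative request model
    preprocessor_app: str              # "username/uuid" printed by `fal run flux-preprocessor`
    images_data_url: str
    steps: int = 1000

class FluxTrainingApp(fal.App):
    # ... configuration attributes and setup() omitted ...

    @fal.endpoint("/")
    async def train(self, request: TrainRequest) -> dict:
        # 1. Call the preprocessor app supplied by the caller.
        pre = await fal_client.subscribe_async(
            request.preprocessor_app,
            arguments={"images_data_url": request.images_data_url},
        )

        # 2. Download the preprocessed .pt file it produced.
        local_path = "/tmp/preprocessed.pt"
        with open(local_path, "wb") as f:
            f.write(requests.get(pre["preprocessed_url"]).content)

        # 3. Hand the file to the DDP training workers.
        return await self.runner.invoke({"data_path": local_path, "steps": request.steps})
```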
Part 3: Training Worker Implementation
The worker implements the actual DDP training logic. Here’s the key setup method:
- `setup(**kwargs)`: Called once per worker during `runner.start()`. This is where you load models, download weights, and initialize resources. See API Reference →
- `self.device`: The CUDA device for this worker (`cuda:0`, `cuda:1`, etc.). Always load your model with `.to(self.device)`.
- `self.rank`: Worker ID (0 to world_size-1). Useful for rank-specific operations like saving checkpoints only on rank 0.
- `self.rank_print()`: Prints messages with the rank prefix for easy debugging. See API Reference →
In this app, `setup()` does the following (a sketch follows this list):
- Load the Flux transformer on each GPU using `self.device`
- Add LoRA adapters to specific layers
- Freeze the base model, only train the LoRA parameters
- Wrap with DDP for gradient synchronization
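A condensed sketch of that setup, using `diffusers` and `peft`; the LoRA hyperparameters and target modules are illustrative.

```python
import torch
from diffusers import FluxTransformer2DModel
from peft import LoraConfig
from torch.nn.parallel import DistributedDataParallel as DDP
from fal.distributed import DistributedWorker

class FluxTrainingWorker(DistributedWorker):
    def setup(self, model_path: str, **kwargs):
        # Load the Flux transformer onto this worker's GPU.
        transformer = FluxTransformer2DModel.from_pretrained(
            model_path, subfolder="transformer", torch_dtype=torch.bfloat16
        ).to(self.device)

        # Freeze the base model; only the LoRA adapters will train.
        transformer.requires_grad_(False)

        # Add LoRA adapters to the attention projections (illustrative config).
        lora_config = LoraConfig(
            r=16,
            lora_alpha=16,
            target_modules=["to_q", "to_k", "to_v", "to_out.0"],
        )
        transformer.add_adapter(lora_config)

        # Wrap with DDP so gradients are synchronized across GPUs.
        self.model = DDP(transformer, device_ids=[self.rank])
        trainable = [p for p in self.model.parameters() if p.requires_grad]
        self.optimizer = torch.optim.AdamW(trainable, lr=1e-4)
        self.rank_print("training worker ready")
```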
Training Loop with Data Distribution
The `__call__()` method is called for each training request and implements the actual training loop. It receives the payload dict from `runner.invoke()` as keyword arguments. See API Reference →
- Data loading: Only rank 0 loads, then broadcasts to all workers
- Different batches per GPU: Each GPU gets different `local_indices`
- Automatic gradient sync: DDP handles this during `loss.backward()`
- Rank 0 saves: Only one worker saves the checkpoint to avoid conflicts
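A sketch of the loop, continuing the worker above. `self.training_step()` stands in for the actual Flux flow-matching loss, and the payload format and checkpoint path are assumptions.

```python
import torch
import torch.distributed as dist
from fal.distributed import DistributedWorker

class FluxTrainingWorker(DistributedWorker):   # continues the sketch above
    def __call__(self, **kwargs):
        steps = kwargs.get("steps", 1000)

        # Only rank 0 reads the preprocessed file, then the data is broadcast
        # so every worker starts from an identical copy.
        payload = [torch.load(kwargs["data_path"]) if self.rank == 0 else None]
        dist.broadcast_object_list(payload, src=0)
        latents = payload[0]["latents"]        # assumed: list of latent tensors

        for step in range(steps):
            # All ranks build the same shuffled order (same seed), then each rank
            # keeps only its own slice, so every GPU gets different local_indices.
            gen = torch.Generator().manual_seed(step)
            perm = torch.randperm(len(latents), generator=gen)
            local_indices = perm[self.rank :: self.world_size][:4]
            batch = torch.stack([latents[int(i)] for i in local_indices]).to(self.device)

            self.optimizer.zero_grad()
            loss = self.training_step(batch)   # placeholder: forward pass through self.model
            loss.backward()                    # DDP averages gradients across all GPUs here
            self.optimizer.step()

            if step % 50 == 0:
                self.rank_print(f"step {step}: loss={loss.item():.4f}")

        # Only rank 0 writes the LoRA checkpoint, so workers never race on the file.
        if self.rank == 0:
            ckpt_path = "/data/flux_lora.pt"   # illustrative path
            torch.save(self.model.module.state_dict(), ckpt_path)
            return {"checkpoint_path": ckpt_path}
        return {}
```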
Using the Application
After running `fal run` or `fal deploy` for each app, you’ll see URLs like `https://fal.ai/dashboard/sdk/username/app-id/`. You can:
- Test in the Playground: Click the URL or visit it in your browser to open the interactive playground and test your app
- View on Dashboard: Visit fal.ai/dashboard to see all your apps, monitor usage, and manage deployments
Important: You must provide the `preprocessor_app` parameter with the app name (username/uuid format)! Get it from the `fal run flux-preprocessor` output.
Test in the Playground
After deploying both apps, you can test them directly in the browser:
- Preprocessor App: Open `https://fal.ai/dashboard/sdk/username/preprocessor-app-id/` to test image preprocessing
- Training App: Open `https://fal.ai/dashboard/sdk/username/training-app-id/` to submit training jobs with a UI
Call from Code
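From Python, you can submit a training job with `fal_client` and stream progress as it runs. The app ids and every argument name except `preprocessor_app` are illustrative; match the training app’s request model from the repository.

```python
import fal_client

handle = fal_client.submit(
    "username/training-app-id",          # from the `fal run` / `fal deploy` output
    arguments={
        "preprocessor_app": "username/preprocessor-app-id",            # required
        "images_data_url": "https://example.com/training_images.zip",  # illustrative field
        "steps": 1000,                                                  # illustrative field
    },
)

# Stream queue and log events while training runs, then fetch the final result.
for event in handle.iter_events(with_logs=True):
    print(event)
print(handle.get())
```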
For other languages (JavaScript, TypeScript, etc.) and advanced client usage, see the Client Libraries documentation.
DDP Best Practices
1. Synchronization Barriers
Use barriers when all GPUs need to wait.
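For example, inside a worker (the class name is illustrative):

```python
import torch.distributed as dist
from fal.distributed import DistributedWorker

class BarrierExample(DistributedWorker):
    def __call__(self, **kwargs):
        # ... per-rank work that may finish at different times ...
        dist.barrier()   # no rank continues until every rank has reached this line
        # ... work that assumes the previous phase is finished on every GPU ...
        return {}
```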
2. Rank-specific Operations
Only perform I/O on rank 0 to avoid conflicts.
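For example (again a sketch; the helper name is illustrative):

```python
import torch
import torch.distributed as dist
from fal.distributed import DistributedWorker

class RankZeroSaveExample(DistributedWorker):
    def save_checkpoint(self, state_dict: dict, path: str):
        # Only rank 0 touches the filesystem; otherwise every worker would race
        # to write the same file.
        if self.rank == 0:
            torch.save(state_dict, path)
        dist.barrier()   # make sure the file exists before any rank tries to read it
```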
Next Steps
- Multi-GPU Workloads Overview: Learn about other training strategies (Pipeline Parallelism, FSDP, etc.)
- Streaming with Multi-GPU: Deep dive into streaming progress updates
- Deploy Multi-GPU Inference: Learn about data parallelism for inference