Skip to main content
Model APIs gives you instant access to state-of-the-art AI models for image, video, audio, and multimodal generation. Every model is already optimized and production-ready, so you can authenticate and start generating immediately. Each model runs on fal’s infrastructure with automatic scaling, queue-based reliability, and pay-per-use billing. You call them the same way whether you use the Python or JavaScript client or raw HTTP. If you need to deploy your own model instead, see Serverless.

Quick Example

Generate an image in three lines of code. Install the client, set your API key, and call a model.
import fal_client

result = fal_client.subscribe("fal-ai/nano-banana-2", arguments={
    "prompt": "a futuristic cityscape at sunset"
})
print(result["images"][0]["url"])
The response includes a CDN URL for the generated image, along with metadata like dimensions and seed. Every model follows the same pattern: send inputs as JSON, receive outputs as JSON with media URLs.

How It Works

Every model on fal is exposed as an HTTP endpoint. You can call it directly, or go through the queue for automatic retries, status tracking, and scaling. There are several calling patterns depending on your use case. Direct (run) sends a synchronous HTTP request to fal.run and returns the result directly. This is the simplest approach for quick scripts and prototyping. Subscribe uses the queue under the hood but handles polling automatically, so it feels synchronous. This is what the Quick Example above uses. Asynchronous (submit) gives you full control over the queue. Submit a request and return immediately, then poll for status or receive results via webhook. This is the recommended approach for production workloads with parallel processing. Streaming delivers output progressively as the model generates it. This is useful for LLMs that produce tokens incrementally, or for showing generation progress in a UI. realtime() uses WebSockets for persistent connections, bypassing the queue entirely for sub-100ms latency. Only available for models with an explicit real-time endpoint.

What You Can Generate

The model gallery has 1,000+ models spanning several categories. Here are some popular starting points.

Image Generation and Editing

Video Generation

Audio and Speech

Explore All Models

Browse 1,000+ models across image, video, audio, 3D, and more
Every model page on fal.ai includes a Playground for testing, full API documentation with input/output schemas, pricing, and ready-to-copy code examples.

Next Steps