Deployment Types
Ephemeral Deployments
For development and testing, use ephemeral deployments withfal run.
fal run process in your terminal, the ephemeral deployment will be destroyed.
Persistent Deployments
To permanently deploy your application or update/redeploy existing one, you can use thefal deploy command.
Machine Types
You can specify the machine type of your app using themachine_type parameter in your fal.App class.
For GPU machines, you can also specify the number of GPUs you want to use with the num_gpus option.
fal deploy command.
Machine Type Options
| Value | Description | 
|---|---|
| XS | 0.50 CPU cores, 512MB RAM | 
| S | 1 CPU core, 1GB RAM (default) | 
| M | 2 CPU cores, 2GB RAM | 
| L | 4 CPU cores, 15GB RAM | 
| GPU-A6000 | 10 CPU cores, 18GB RAM, 1 GPU core (48GB VRAM) | 
| GPU-A100 | 12 CPU cores, 60GB RAM, 1 GPU core (40GB VRAM) | 
| GPU-H100 | 12 CPU cores, 112GB RAM, 1 GPU core (80GB VRAM) | 
| GPU-H200 | 12 CPU cores, 112GB RAM, 1 GPU core (141GB VRAM) | 
| GPU-B200 | 24 CPU cores, 112GB RAM, 1 GPU core (192GB VRAM) | 
Multiple Machine Types
Allow your app to use multiple machine types for a larger pool of available machines:Rollout Strategies
Your app could be deployed using one of two strategies:- recreate: default, instantly switch the app to the new revision.
- rolling: doesn’t switch the app to the new one until there is at least 1 runner in the new revision.
--strategy flag, e.g.
Authentication Modes
Your app could be deployed in one of three authentication modes:- private: default, your app is visible only to you and/or your team.
- shared: everyone can see and use your app, the caller pays for usage. This is how all of the apps in our Model Gallery work.
- public: everyone can see and use your app, the app owner (you) is paying for it.
fal deploy’s --auth flag or fal.App’s app_auth to specify your app’s authentication mode, e.g.
Best Practices
- Choose rolling strategy for production deployments to ensure zero downtime
- Use appropriate authentication modes based on your use case and cost considerations
- Test thoroughly with ephemeral deployments before permanent deployment
- Monitor your deployments using the fal dashboard and performance monitoring tools