Introduction to Compute

What is fal Compute?

fal Compute provides dedicated GPU clusters for heavy workloads that need consistent, high-performance infrastructure. The platform combines custom-optimized infrastructure, InfiniBand networking, high-speed SSD storage, and guaranteed resource availability for research and production workloads.

Key Features

Dedicated GPU Resources

Full GPU control: Direct control over your GPU resources for fine-tuning, training, and specialized workflows
Dedicated clusters: Plan and manage your own capacity with guaranteed resource availability
Enterprise-grade: Designed for production workloads with enterprise security and compliance features

High-Performance Infrastructure

Custom-optimized infrastructure: Infrastructure tailored to your specific workload requirements
High-speed SSD storage: Fast data access for training datasets, efficient model loading, and low-latency I/O operations
InfiniBand interconnect: Ultra-low latency and high bandwidth communication for distributed computing

Scalability & Performance

Heavy workload support: Built specifically for compute-intensive tasks that require sustained performance
Multi-node clusters: Deploy multiple instances within the same sector for distributed workloads
Scalable architecture: Scale your workloads across multiple interconnected instances

Available Instance Types

Instance Type	CPU Cores	RAM	GPU VRAM	Storage
1xH100-SXM	16	200GB	80GB (1x H100)	1TB SSD
8xH100-SXM	128	1600GB	640GB (8x H100)	8TB SSD
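
The table can also be read per GPU. The sketch below encodes the two rows as data and checks that the 8-GPU instance provides the same CPU, RAM, VRAM, and storage per GPU as the single-GPU instance. The `InstanceType` class and the helper names are illustrative only and are not part of any fal API; the numbers are copied from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InstanceType:
    """One row of the instance table (illustrative type, not a fal API)."""
    name: str
    gpus: int
    cpu_cores: int
    ram_gb: int
    vram_gb: int
    storage_tb: int


# Values copied from the instance table above.
INSTANCE_TYPES = [
    InstanceType("1xH100-SXM", gpus=1, cpu_cores=16, ram_gb=200, vram_gb=80, storage_tb=1),
    InstanceType("8xH100-SXM", gpus=8, cpu_cores=128, ram_gb=1600, vram_gb=640, storage_tb=8),
]


def per_gpu(instance: InstanceType) -> dict:
    """Divide each resource by the GPU count to get the per-GPU share."""
    return {
        "cpu_cores": instance.cpu_cores / instance.gpus,
        "ram_gb": instance.ram_gb / instance.gpus,
        "vram_gb": instance.vram_gb / instance.gpus,
        "storage_tb": instance.storage_tb / instance.gpus,
    }


def smallest_fitting(required_vram_gb: float):
    """Return the smallest instance whose total VRAM covers the requirement, or None."""
    for instance in sorted(INSTANCE_TYPES, key=lambda i: i.vram_gb):
        if instance.vram_gb >= required_vram_gb:
            return instance
    return None


if __name__ == "__main__":
    for instance in INSTANCE_TYPES:
        print(instance.name, per_gpu(instance))
    print(smallest_fitting(300))
```

Both rows work out to 16 CPU cores, 200GB RAM, 80GB VRAM, and 1TB SSD per GPU, so moving from the single-GPU to the 8-GPU instance scales every resource linearly.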

When to Use fal Compute

fal Compute is well suited to the following kinds of workloads:

Machine Learning & AI

Large language model training: Train models with billions of parameters across multiple GPUs
Custom model fine-tuning: Dedicated resources for specialized model adaptation
Distributed training: Leverage InfiniBand connectivity for faster multi-node training
Batch inference: Large-scale inference jobs with predictable resource needs
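
To get a rough sense of why training models with billions of parameters calls for multiple GPUs, the sketch below estimates the memory needed for model state alone. It assumes the common rule of thumb of about 16 bytes per parameter for mixed-precision training with Adam (2 for half-precision weights, 2 for gradients, 12 for FP32 master weights and optimizer moments), assumes that state is sharded evenly across GPUs, and ignores activations entirely. The `usable_fraction` headroom value and all function names are assumptions for illustration, not fal guidance; only the 80GB per H100 and the 8 GPUs per instance come from the instance table.

```python
import math

# Assumption: mixed-precision Adam keeps about 16 bytes of state per parameter.
BYTES_PER_PARAM = 16
H100_VRAM_GB = 80        # per-GPU VRAM, from the instance table
GPUS_PER_NODE = 8        # GPUs in an 8xH100-SXM instance


def training_state_gb(num_params: float, bytes_per_param: int = BYTES_PER_PARAM) -> float:
    """Memory for weights, gradients, and optimizer state, in decimal GB."""
    return num_params * bytes_per_param / 1e9


def min_gpus(num_params: float, vram_gb: float = H100_VRAM_GB,
             usable_fraction: float = 0.8) -> int:
    """Lower bound on GPU count if model state is sharded evenly.

    usable_fraction is a hypothetical headroom factor for activations,
    buffers, and fragmentation; tune it for a real workload.
    """
    return math.ceil(training_state_gb(num_params) / (vram_gb * usable_fraction))


def min_nodes(num_params: float) -> int:
    """Number of 8-GPU instances needed to host that many GPUs."""
    return math.ceil(min_gpus(num_params) / GPUS_PER_NODE)


if __name__ == "__main__":
    for params in (7e9, 70e9):
        print(f"{params:.0e} params: {training_state_gb(params):.0f} GB state, "
              f"at least {min_gpus(params)} GPUs, {min_nodes(params)} node(s)")
```

Under these assumptions a 7-billion-parameter model already exceeds a single 80GB GPU, and a 70-billion-parameter model spans more than one 8-GPU instance, which is where multi-node clusters become relevant. Treat the result as a lower bound, since real jobs also need memory for activations.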

Research & Development

Academic research: Sustained compute access for research projects
Computer vision: Process large image and video datasets
Scientific computing: Computational fluid dynamics, molecular dynamics, simulations

Architecture Benefits

InfiniBand Connectivity

For distributed computing scenarios, fal Compute offers InfiniBand interconnect that provides:

Ultra-low latency communication between nodes
High bandwidth for efficient data transfer
Optimized performance for distributed AI training
Seamless scaling across multiple instances
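
Interconnect bandwidth matters for distributed training because gradients are typically synchronized between nodes on every step. As a minimal sketch, the code below uses the standard ring all-reduce traffic formula, in which each node sends 2 * (N - 1) / N times the payload size, to estimate the bandwidth-bound synchronization time. This is a simplified model that ignores latency and overlap with computation. The `link_gbit_per_s` parameter is a hypothetical input, since this page does not state a bandwidth figure, and nothing here implies a particular collective algorithm is used on fal Compute.

```python
def ring_allreduce_bytes_per_node(payload_bytes: float, num_nodes: int) -> float:
    """Bytes each node sends during one ring all-reduce of the payload."""
    if num_nodes < 2:
        return 0.0  # a single node has nothing to synchronize
    return 2 * (num_nodes - 1) / num_nodes * payload_bytes


def transfer_seconds(payload_bytes: float, num_nodes: int, link_gbit_per_s: float) -> float:
    """Bandwidth-only time estimate; link_gbit_per_s is a hypothetical link speed."""
    bytes_per_second = link_gbit_per_s * 1e9 / 8
    return ring_allreduce_bytes_per_node(payload_bytes, num_nodes) / bytes_per_second


if __name__ == "__main__":
    gradient_bytes = 7e9 * 2  # e.g. 7 billion half-precision gradients
    for gbit in (100, 400):   # hypothetical link speeds, for comparison only
        t = transfer_seconds(gradient_bytes, num_nodes=4, link_gbit_per_s=gbit)
        print(f"{gbit} Gbit/s link: {t:.2f} s per all-reduce")
```

Because the per-node traffic approaches twice the payload as the node count grows, the estimate scales almost directly with link bandwidth, which is why a high-bandwidth interconnect shortens each synchronization step.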

Enterprise Features

Guaranteed resource availability with dedicated clusters
Enterprise security and compliance capabilities
Custom infrastructure optimization for specific workloads
Professional support for production deployments

Ready to get started? Check out our Quickstart Guide to provision your first instance.
