Skip to content
Dashboard

Introduction to Compute

fal Compute is a dedicated GPU cloud computing platform designed for high-performance computing workloads, machine learning training, and AI inference at scale. Unlike serverless options, Compute gives you full control over your GPU resources with enterprise-grade infrastructure designed for demanding AI and research workloads.

What is fal Compute?

fal Compute provides dedicated GPU clusters built for heavy workloads that require consistent, high-performance infrastructure. Our platform offers custom-optimized infrastructure with advanced networking capabilities, high-speed storage, and guaranteed resource availability for research applications.

Key Features

Dedicated GPU Resources

  • Full GPU control: Complete control over your GPU resources for fine-tuning, training, and specialized workflows
  • Dedicated clusters: Plan and manage your own capacity with guaranteed resource availability
  • Enterprise-grade: Designed for production workloads with enterprise security and compliance features

High-Performance Infrastructure

  • Custom-optimized infrastructure: Infrastructure tailored to your specific workload requirements
  • High-speed SSD storage: Fast data access for training datasets, efficient model loading, and low-latency I/O operations
  • InfiniBand interconnect: Ultra-low latency and high bandwidth communication for distributed computing

Scalability & Performance

  • Heavy workload support: Built specifically for compute-intensive tasks that require sustained performance
  • Multi-node clusters: Deploy multiple instances within the same sector for distributed workloads
  • Scalable architecture: Scale your workloads across multiple interconnected instances

Available Instance Types

Instance TypeCPU CoresRAMGPU VRAMStorage
1xH100-SXM16200GB80GB (1x H100)1TB SSD
8xH100-SXM1281600GB640GB (8x H100)8TB SSD

When to Use fal Compute

fal Compute is ideal for workloads that require:

Machine Learning & AI

  • Large language model training: Train models with billions of parameters across multiple GPUs
  • Custom model fine-tuning: Dedicated resources for specialized model adaptation
  • Distributed training: Leverage InfiniBand connectivity for faster multi-node training
  • Batch inference: Large-scale inference jobs with predictable resource needs

Research & Development

  • Academic research: Sustained compute access for research projects
  • Computer vision: Process large image and video datasets
  • Scientific computing: Computational fluid dynamics, molecular dynamics, simulations

Architecture Benefits

InfiniBand Connectivity

For distributed computing scenarios, fal Compute offers InfiniBand interconnect that provides:

  • Ultra-low latency communication between nodes
  • High bandwidth for efficient data transfer
  • Optimized performance for distributed AI training
  • Seamless scaling across multiple instances

Enterprise Features

  • Guaranteed resource availability with dedicated clusters
  • Enterprise security and compliance capabilities
  • Custom infrastructure optimization for specific workloads
  • Professional support for production deployments

Ready to get started? Check out our Quickstart Guide to provision your first instance.