Analytics

curl --request GET \
  --url 'https://api.fal.ai/v1/models/analytics?endpoint_id=fal-ai/flux/dev' \
  --header 'Authorization: Key <api-key>'

{
  "time_series": [
    {
      "bucket": "2025-01-15T12:00:00-05:00",
      "results": [
        {
          "endpoint_id": "fal-ai/flux/dev",
          "request_count": 1500,
          "success_count": 1450,
          "p50_duration": 2.5,
          "p90_duration": 4.8
        }
      ]
    }
  ],
  "next_cursor": null,
  "has_more": false
}

Time-bucketed metrics per model endpoint, including request counts, success/error rates, and latency percentiles. prepare_duration reflects queue/prepare time before execution; duration is request execution time. Use with the Queue/Webhooks flow to monitor SLAs.

Metric Selection: Use the expand query parameter to choose which metrics to include. Only requested metrics are populated in the response, which keeps queries fast and responses small. If expand is omitted, it defaults to time_series and request_count.

Available Metrics:

request_count: Total number of requests in the time bucket
success_count: Number of successful requests (2xx responses)
user_error_count: Number of user errors (4xx responses)
error_count: Number of server errors (5xx responses)
p50_prepare_duration: 50th percentile queue/prepare time
p75_prepare_duration: 75th percentile queue/prepare time
p90_prepare_duration: 90th percentile queue/prepare time
p50_duration: 50th percentile request execution duration
p75_duration: 75th percentile request execution duration
p90_duration: 90th percentile request execution duration
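
The counts above can be combined into rates on the client. The following is a minimal sketch, assuming a results record shaped like the sample response; the helper name success_rate is illustrative and not part of the API. Because only requested metrics are populated, it returns None when either count is missing.

```python
def success_rate(record):
    """Return success_count / request_count for one record, or None if unavailable."""
    total = record.get("request_count")
    ok = record.get("success_count")
    if not total or ok is None:
        return None
    return ok / total


# Values taken from the sample response.
record = {"endpoint_id": "fal-ai/flux/dev", "request_count": 1500, "success_count": 1450}
print(round(success_rate(record), 4))  # 0.9667
```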

Key Features:

Selective metric inclusion via expand parameter
Performance metrics (latency percentiles, duration stats)
Reliability metrics (success/error rates, request counts)
Time-bucketed data for trend analysis
Single or multi-model analytics
Flexible date range and timeframe options

Common Use Cases:

Monitor model performance and reliability
Generate performance dashboards
Analyze latency trends and patterns
Track error rates and success metrics

See Queue API docs for more details.

GET /v1/models/analytics

Authorizations

Authorization

string

header

required

API key must be prefixed with "Key ", e.g. Authorization: Key YOUR_API_KEY
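
As a sketch of the header format described above, the snippet below builds (but does not send) a request with Python's standard library. The function name build_request is an assumption made for illustration; only the "Key " prefix and the URL come from this page.

```python
import urllib.request


def build_request(api_key, url):
    """Build a GET request whose Authorization header uses the "Key " prefix."""
    request = urllib.request.Request(url, method="GET")
    request.add_header("Authorization", f"Key {api_key}")
    return request


request = build_request(
    "YOUR_API_KEY",
    "https://api.fal.ai/v1/models/analytics?endpoint_id=fal-ai/flux/dev",
)
print(request.get_header("Authorization"))  # Key YOUR_API_KEY
```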

Query Parameters

limit

integer

Maximum number of items to return. Actual maximum depends on query type and expansion parameters.

Required range: x >= 1

Example:

50

cursor

string

Pagination cursor from previous response. Encodes the page number.

Example:

"Mg=="

start

Start date in ISO8601 format (e.g., '2025-01-01T00:00:00Z' or '2025-01-01'). Defaults to 24 hours ago.

Example:

"2025-01-01T00:00:00Z"

end

End date in ISO8601 format (e.g., '2025-01-31T23:59:59Z' or '2025-01-31'). Defaults to current time.

Example:

"2025-01-31T23:59:59Z"

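To make the start and end defaults concrete, here is a small client-side sketch that resolves the effective range the way the descriptions above suggest. The helper names parse_iso and resolve_range are hypothetical, and treating date-only or offset-free values as UTC is an assumption; the server may interpret them using the timezone parameter instead.

```python
from datetime import datetime, timedelta, timezone


def parse_iso(value):
    """Parse a date or datetime string; naive values are treated as UTC (an assumption)."""
    parsed = datetime.fromisoformat(value.replace("Z", "+00:00"))
    return parsed if parsed.tzinfo else parsed.replace(tzinfo=timezone.utc)


def resolve_range(start=None, end=None, now=None):
    """Apply the documented defaults: start = 24 hours ago, end = current time."""
    now = now or datetime.now(timezone.utc)
    start_dt = parse_iso(start) if start else now - timedelta(hours=24)
    end_dt = parse_iso(end) if end else now
    return start_dt, end_dt


start_dt, end_dt = resolve_range("2025-01-01", "2025-01-31T23:59:59Z")
print(start_dt.isoformat())  # 2025-01-01T00:00:00+00:00
print(end_dt.isoformat())    # 2025-01-31T23:59:59+00:00
```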
timezone

string

default:UTC

Timezone for date aggregation and boundaries. All timestamps in responses are in UTC, but this controls how dates are bucketed.

Example:

"UTC"

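The effect of the timezone parameter on bucketing can be illustrated locally: the same instant can land in different day buckets depending on the zone. This is only a sketch of the idea, not the server's implementation; day_bucket is a hypothetical helper, and the fixed -05:00 offset mirrors the offset in the sample response.

```python
from datetime import datetime, timedelta, timezone


def day_bucket(instant, tz):
    """Return the start of the local day containing instant, as an ISO8601 string."""
    local = instant.astimezone(tz)
    return local.replace(hour=0, minute=0, second=0, microsecond=0).isoformat()


instant = datetime(2025, 1, 16, 3, 0, tzinfo=timezone.utc)
minus_five = timezone(timedelta(hours=-5))

print(day_bucket(instant, timezone.utc))  # 2025-01-16T00:00:00+00:00
print(day_bucket(instant, minus_five))    # 2025-01-15T00:00:00-05:00
```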
timeframe

enum<string>

Aggregation timeframe for timeseries data (auto-detected from date range if not specified). Auto-detection uses: minute (<2h), hour (<2d), day (<64d), week (<183d), month (>=183d).

Available options:

minute,

hour,

day,

week,

month
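
The auto-detection thresholds listed in the description can be written out as a small function. This is a sketch that mirrors the stated rules on the client side; the name detect_timeframe is an assumption, and the server's exact boundary handling is not confirmed here.

```python
from datetime import datetime, timedelta, timezone


def detect_timeframe(start, end):
    """Pick a timeframe from the range length: minute (<2h), hour (<2d), day (<64d), week (<183d), else month."""
    span = end - start
    if span < timedelta(hours=2):
        return "minute"
    if span < timedelta(days=2):
        return "hour"
    if span < timedelta(days=64):
        return "day"
    if span < timedelta(days=183):
        return "week"
    return "month"


start = datetime(2025, 1, 1, tzinfo=timezone.utc)
print(detect_timeframe(start, start + timedelta(hours=24)))  # hour
print(detect_timeframe(start, start + timedelta(days=30)))   # day
```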

Example:

"day"

bound_to_timeframe

enum<string>

default:true

Whether to align start/end to timeframe boundaries and treat end as exclusive. When true (the default), start is aligned to the beginning of its timeframe period (e.g., start of day) and end is made exclusive (e.g., start of the next day). When false, the exact dates provided are used.

Available options:

true,

false
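
For the day timeframe, the alignment described above might look like the sketch below. It is an illustration under assumptions: align_to_day is a hypothetical helper, other timeframes are not covered, and how the server treats an end value that already falls exactly on a boundary is not stated on this page.

```python
from datetime import datetime, timedelta, timezone


def floor_to_day(moment):
    """Drop the time-of-day part, keeping the timezone."""
    return moment.replace(hour=0, minute=0, second=0, microsecond=0)


def align_to_day(start, end):
    """Align start to the start of its day and make end the start of the following day."""
    return floor_to_day(start), floor_to_day(end) + timedelta(days=1)


aligned_start, exclusive_end = align_to_day(
    datetime(2025, 1, 1, 10, 30, tzinfo=timezone.utc),
    datetime(2025, 1, 31, 23, 59, 59, tzinfo=timezone.utc),
)
print(aligned_start.isoformat())  # 2025-01-01T00:00:00+00:00
print(exclusive_end.isoformat())  # 2025-02-01T00:00:00+00:00
```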

Example:

"true"

endpoint_id

required

Filter by specific endpoint ID(s). Accepts 1-50 endpoint IDs. Supports comma-separated values: ?endpoint_id=model1,model2 or array syntax: ?endpoint_id=model1&endpoint_id=model2

Example:

["fal-ai/flux/dev"]
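
Both accepted forms of endpoint_id can be produced with urllib.parse.urlencode. The sketch below is illustrative: endpoint_query is a hypothetical helper, model1 and model2 are the placeholder IDs from the description, and the 1-50 check simply mirrors the documented limit on the client.

```python
from urllib.parse import urlencode


def endpoint_query(endpoint_ids, style="comma"):
    """Encode 1-50 endpoint IDs as a comma-separated value or as repeated parameters."""
    if not 1 <= len(endpoint_ids) <= 50:
        raise ValueError("endpoint_id accepts between 1 and 50 IDs")
    if style == "comma":
        # Keep "/" and "," unescaped so the query matches the documented form.
        return urlencode({"endpoint_id": ",".join(endpoint_ids)}, safe="/,")
    return urlencode([("endpoint_id", eid) for eid in endpoint_ids], safe="/")


print(endpoint_query(["model1", "model2"]))           # endpoint_id=model1,model2
print(endpoint_query(["model1", "model2"], "array"))  # endpoint_id=model1&endpoint_id=model2
```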

expand

default:["time_series","request_count"]

Data and metrics to include in the response. Use 'time_series' for time-bucketed data, metric names for specific metrics in time series, and 'summary' for aggregate statistics. At least one of 'time_series' or 'summary' and at least one metric are required.

Example:

["request_count", "success_count"]
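
One way to satisfy the rule above (at least one of time_series or summary, plus at least one metric) is to assemble the expand list with a helper. This is a client-side sketch; build_expand is a hypothetical name, and how the server responds to a list without a section is not described on this page.

```python
METRICS = {
    "request_count", "success_count", "user_error_count", "error_count",
    "p50_prepare_duration", "p75_prepare_duration", "p90_prepare_duration",
    "p50_duration", "p75_duration", "p90_duration",
}


def build_expand(metrics, time_series=True, summary=False):
    """Return an expand list with at least one section and one known metric."""
    sections = [name for name, wanted in (("time_series", time_series), ("summary", summary)) if wanted]
    if not sections:
        raise ValueError("include time_series, summary, or both")
    if not metrics:
        raise ValueError("include at least one metric")
    unknown = [metric for metric in metrics if metric not in METRICS]
    if unknown:
        raise ValueError(f"unknown metrics: {unknown}")
    return sections + list(metrics)


print(build_expand(["request_count"]))  # ['time_series', 'request_count']
```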

Response

Analytics data retrieved successfully

Response containing performance analytics with pagination support

next_cursor

string | null

required

Cursor for the next page of results, null if no more pages

has_more

boolean

required

Boolean indicating if more results are available (convenience field derived from next_cursor)
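
The two pagination fields suggest a simple loop: keep requesting with the returned cursor until next_cursor is null. The sketch below shows that loop with the HTTP call abstracted away; iter_pages and the canned FAKE_PAGES data are assumptions for illustration, not part of the API.

```python
def iter_pages(fetch_page):
    """Yield pages until next_cursor is null. fetch_page(cursor) must return a parsed response."""
    cursor = None
    while True:
        page = fetch_page(cursor)
        yield page
        cursor = page.get("next_cursor")
        if cursor is None:
            break


# Stand-in for a real HTTP call: two canned pages keyed by cursor.
FAKE_PAGES = {
    None: {"time_series": [], "next_cursor": "Mg==", "has_more": True},
    "Mg==": {"time_series": [], "next_cursor": None, "has_more": False},
}

pages = list(iter_pages(FAKE_PAGES.get))
print(len(pages))  # 2
```

In real use, fetch_page would send the cursor query parameter and return the decoded JSON body.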

time_series

object[]

Time series analytics data grouped by time bucket (when expand includes 'time_series'). Each bucket contains all analytics records for that time period.

time_series.bucket

string

required

Time bucket timestamp in user's timezone with offset (ISO8601 datetime)

time_series.results

object[]

required

Analytics records for this time bucket

time_series.results.endpoint_id

string

required

Endpoint identifier for these statistics

time_series.results.request_count

integer

Total number of requests

Required range: x >= 0

time_series.results.success_count

integer

Number of successful requests (2xx responses)

Required range: x >= 0

time_series.results.user_error_count

integer

Number of user errors (4xx responses)

Required range: x >= 0

time_series.results.error_count

integer

Number of server errors (5xx responses)

Required range: x >= 0

time_series.results.p50_prepare_duration

number

50th percentile queue/prepare time before execution in seconds

Required range: x >= 0

time_series.results.p75_prepare_duration

number

75th percentile queue/prepare time before execution in seconds

Required range: x >= 0

time_series.results.p90_prepare_duration

number

90th percentile queue/prepare time before execution in seconds

Required range: x >= 0

time_series.results.p50_duration

number

50th percentile request execution duration in seconds

Required range: x >= 0

time_series.results.p75_duration

number

75th percentile request execution duration in seconds

Required range: x >= 0

time_series.results.p90_duration

number

90th percentile request execution duration in seconds

Required range: x >= 0
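
Since each bucket nests its records under results, it is often convenient to flatten time_series into one row per bucket and endpoint before charting or exporting. A minimal sketch, assuming a response shaped like the sample on this page; flatten_time_series is an illustrative name.

```python
def flatten_time_series(response):
    """Return one dict per (bucket, endpoint) pair, with the bucket timestamp copied onto each row."""
    rows = []
    for bucket in response.get("time_series", []):
        for record in bucket["results"]:
            rows.append({"bucket": bucket["bucket"], **record})
    return rows


# Same shape and values as the sample response.
response = {
    "time_series": [
        {
            "bucket": "2025-01-15T12:00:00-05:00",
            "results": [
                {"endpoint_id": "fal-ai/flux/dev", "request_count": 1500, "success_count": 1450},
            ],
        }
    ],
    "next_cursor": None,
    "has_more": False,
}

rows = flatten_time_series(response)
print(rows[0]["bucket"], rows[0]["endpoint_id"], rows[0]["request_count"])
```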

summary

object[]

Aggregate statistics (when expand includes 'summary')

summary.endpoint_id

string

required

Endpoint identifier for these statistics

summary.request_count

integer

Total number of requests

Required range: x >= 0

summary.success_count

integer

Number of successful requests (2xx responses)

Required range: x >= 0

summary.user_error_count

integer

Number of user errors (4xx responses)

Required range: x >= 0

summary.error_count

integer

Number of server errors (5xx responses)

Required range: x >= 0

summary.p50_prepare_duration

number

50th percentile queue/prepare time before execution in seconds

Required range: x >= 0

summary.p75_prepare_duration

number

75th percentile queue/prepare time before execution in seconds

Required range: x >= 0

summary.p90_prepare_duration

number

90th percentile queue/prepare time before execution in seconds

Required range: x >= 0

summary.p50_duration

number

50th percentile request execution duration in seconds

Required range: x >= 0

summary.p75_duration

number

75th percentile request execution duration in seconds

Required range: x >= 0

summary.p90_duration

number

90th percentile request execution duration in seconds

Required range: x >= 0
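
For SLA monitoring, the summary records can be checked against a latency budget. The sketch below is illustrative only: over_budget is a hypothetical helper, the 3.0 second budget is arbitrary, and the summary data is made up for the example. Records without p90_duration (because the metric was not requested) are skipped.

```python
def over_budget(summary, p90_budget_seconds):
    """Return endpoint IDs whose p90_duration exceeds the budget; skip records without the metric."""
    return [
        record["endpoint_id"]
        for record in summary
        if record.get("p90_duration") is not None
        and record["p90_duration"] > p90_budget_seconds
    ]


# Made-up summary records for illustration only.
summary = [
    {"endpoint_id": "model1", "p90_duration": 4.8},
    {"endpoint_id": "model2", "p90_duration": 1.2},
    {"endpoint_id": "model3"},
]
print(over_budget(summary, 3.0))  # ['model1']
```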
