Time-bucketed metrics per model endpoint, including request counts, success/error
rates, and latency percentiles. prepare_duration reflects queue/prepare
time before execution; duration is request execution time. Use with the
Queue/Webhooks flow to monitor SLAs.
Metric Selection:
You must specify which metrics to include using the expand query
parameter. Only requested metrics will be populated in the response,
allowing you to optimize query performance and data transfer.
Available Metrics:
request_count: Total number of requests in the time bucketsuccess_count: Number of successful requests (2xx responses)user_error_count: Number of user errors (4xx responses)error_count: Number of server errors (5xx responses)p50_prepare_duration: 50th percentile queue/prepare timep75_prepare_duration: 75th percentile queue/prepare timep90_prepare_duration: 90th percentile queue/prepare timep50_duration: 50th percentile request execution durationp75_duration: 75th percentile request execution durationp90_duration: 90th percentile request execution durationKey Features:
Common Use Cases:
See Queue API docs for more details.
API key must be prefixed with "Key ", e.g. Authorization: Key YOUR_API_KEY
Maximum number of items to return. Actual maximum depends on query type and expansion parameters.
x >= 150
Pagination cursor from previous response. Encodes the page number.
"Mg=="
Start date in ISO8601 format (e.g., '2025-01-01T00:00:00Z' or '2025-01-01'). Defaults to 24 hours ago.
"2025-01-01T00:00:00Z"
End date in ISO8601 format (e.g., '2025-01-31T23:59:59Z' or '2025-01-31'). Defaults to current time.
"2025-01-31T23:59:59Z"
Timezone for date aggregation and boundaries. All timestamps in responses are in UTC, but this controls how dates are bucketed.
"UTC"
Aggregation timeframe for timeseries data (auto-detected from date range if not specified). Auto-detection uses: minute (<2h), hour (<2d), day (<64d), week (<183d), month (>=183d).
minute, hour, day, week, month "day"
Whether to adjust start/end dates to align with timeframe boundaries and use exclusive end. Defaults to true. When true, dates are aligned to the start of the timeframe period (e.g., start of day) and end is made exclusive (e.g., start of next day). When false, uses exact dates provided.
true, false "true"
Filter by specific endpoint ID(s). Accepts 1-50 endpoint IDs. Supports comma-separated values: ?endpoint_id=model1,model2 or array syntax: ?endpoint_id=model1&endpoint_id=model2
["fal-ai/flux/dev"]Data and metrics to include in the response. Use 'time_series' for time-bucketed data, metric names for specific metrics in time series, and 'summary' for aggregate statistics. At least one of 'time_series' or 'summary' and at least one metric are required.
["request_count", "success_count"]Analytics data retrieved successfully
Response containing performance analytics with pagination support
Cursor for the next page of results, null if no more pages
Boolean indicating if more results are available (convenience field derived from next_cursor)
Time series analytics data grouped by time bucket (when expand includes 'time_series'). Each bucket contains all analytics records for that time period.
Aggregate statistics (when expand includes 'summary')