Inference Platform
Production operator view for routing, usage, cache efficiency, and rapid prompt validation.
P95 Latency
218ms
-11% vs baseline
Cache Hit Ratio
67.4%
+9% vs baseline
Requests (24h)
1.4M
+23% vs baseline
Cost / 1K req
$4.82
-6% vs baseline
Live Playground
POST /api/v1/generate
Demo mode is enabled. Responses and logs are simulated for preview.
Usage Trend
Operational Activity
Request Logs
No request logs yet for this user.