Our docs got a refresh! Check out the new content and improved navigation. For detailed API reference see our Python SDK docs and TypeScript SDK.
Description
EvaluatorsAdvanced concepts

API Performance

We test our API endpoints regularly to ensure they are returning responses reliably. We are actively working to increase bandwidth and reduce latency. Note that your mileage may vary as compared to our testing setup. These numbers are from March 2025.

Evaluator NameLatency (in s)QPSMax Input TokensAverage Input Tokens (for benchmarking)
answer-relevance-small-2024-07-232.18TBD124k~30
answer-relevance-large-2024-07-233.83TBD124k~30
context-relevance-small-2024-07-232.55TBD124k~180
context-relevance-large-2024-07-233.93TBD124k~180
context-sufficiency-small-2024-07-233.38TBD124k~180
context-sufficiency-large-2024-07-236.87TBD124k~180
hallucination-small-2024-07-232.58TBD124k~180
hallucination-large-2024-07-233.80TBD124k~180
lynx-2024-07-231.32TBD124k~180
judge-small-2024-08-081.68TBD124k~200
judge-large-2024-08-083.16TBD124k~200
glider-2024-12-112.44TBD8K~200
toxicity-2024-10-272.52TBD512~20
pii-2024-05-310.26TBD16k~20
phi-2024-05-310.32TBD16k~20
nlp-2024-05-160.25TBD32k~50
exact-match-2024-05-310.33TBD32k~50

On this page

No Headings