Our Python SDK got smarter. We developed a Typscript SDK too. We are updating our SDK code blocks. Python SDKhere.Typscript SDKhere.
Description
Advanced

API Performance

We test our API endpoints regularly to ensure they are returning responses reliably. We are actively working to increase bandwidth and reduce latency. Note that your mileage may vary as compared to our testing setup. These numbers are from March 2025.

Evaluator NameLatency (in s)QPSMax Input TokensAverage Input Tokens (for benchmarking)
answer-relevance-small-2024-07-232.18TBD124k~30
answer-relevance-large-2024-07-233.83TBD124k~30
context-relevance-small-2024-07-232.55TBD124k~180
context-relevance-large-2024-07-233.93TBD124k~180
context-sufficiency-small-2024-07-233.38TBD124k~180
context-sufficiency-large-2024-07-236.87TBD124k~180
hallucination-small-2024-07-232.58TBD124k~180
hallucination-large-2024-07-233.80TBD124k~180
lynx-2024-07-231.32TBD124k~180
judge-small-2024-08-081.68TBD124k~200
judge-large-2024-08-083.16TBD124k~200
glider-2024-12-112.44TBD8K~200
toxicity-2024-10-272.52TBD512~20
pii-2024-05-310.26TBD16k~20
phi-2024-05-310.32TBD16k~20
nlp-2024-05-160.25TBD32k~50
exact-match-2024-05-310.33TBD32k~50

On this page

No Headings