Research and DifferentiatorsLynx
Log Evaluation Results (REST API)
Developers who are not using our Python SDK can import their evaluation results via our API. The endpoint to import evaluation results is POST /v1/evaluation-results/batch and you can see its documentation here. It is a batch API, so you can import multiple evaluation results at the same time.
All the features available to you through evaluations run through Patronus - like organizing evaluations into Apps, adding tags and filters, specifying raw scores and pass or fail - are available for imported evaluations as well. Below is a sample cURL request showcasing these features.
Imported evaluation results are distinguished from those generated through the Patronus platform in the following ways:
- The attribute
Externalis set toTrue - The
Evaluator Idis pre-pended with the keywordexternal:. This is so evaluator names for imported evaluations do not clash with those run through Patronus.
You can see both of these in Logs:

