
Medicine & Pharmaceutical Applications

We provide evaluators that detect failures in medical and pharmaceutical applications. They can be applied to the following types of LLM systems and outputs:

  • Patient-facing medical assistants and chatbots
  • Multimodal clinical transcripts
  • Summaries of diseases, drugs, and clinical research
  • Physician-facing assistants

Evaluation Suggestions

**Guardrails**

  • PHI detection
  • Prompt injections
  • Explicit content
  • Harmful advice
  • Sensitive topics
  • Hate speech
  • Racial bias
  • Toxicity
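
To make the guardrail idea concrete, here is a minimal sketch of a PHI check. It is not the evaluator described on this page: the function names (`detect_phi`, `passes_phi_guardrail`) and the three regex patterns are hypothetical and chosen only for illustration. Real PHI detection has to cover many more identifier types (names, addresses, dates, record numbers) and usually needs more than pattern matching.

```python
import re
from typing import Dict, List

# Hypothetical patterns for illustration only. They cover US-style SSNs,
# phone numbers, and email addresses, which is a small subset of PHI.
PHI_PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "phone": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
}


def detect_phi(text: str) -> Dict[str, List[str]]:
    """Return the matched substrings, grouped by pattern name."""
    findings: Dict[str, List[str]] = {}
    for name, pattern in PHI_PATTERNS.items():
        matches = pattern.findall(text)
        if matches:
            findings[name] = matches
    return findings


def passes_phi_guardrail(text: str) -> bool:
    """True when none of the illustrative patterns match the text."""
    return not detect_phi(text)
```

A guardrail like this would typically run on both the user input and the model output, blocking or redacting a message when `passes_phi_guardrail` returns `False`.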

**Clinical**

  • Clinical appropriateness
  • Clinical relevance
  • Clinical safety
  • Clinical / pharmaceutical accuracy
  • Conciseness and helpfulness of the agent
  • Tone control

**RAG-based**

  • Hallucination
  • Answer relevance
  • Context relevance
  • Context sufficiency
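
As a rough illustration of what a context relevance score measures, the sketch below rates each retrieved passage by the share of the question's content words it contains. This is a crude lexical proxy under stated assumptions, not the method used by the evaluators listed above: the function name `context_relevance`, the stopword list, and the scoring formula are all hypothetical.

```python
import re
from typing import List, Set

# Hypothetical, deliberately tiny stopword list for the example.
STOPWORDS = {"a", "an", "the", "is", "are", "of", "what", "to", "for", "in", "and"}


def _content_tokens(text: str) -> Set[str]:
    """Lowercase alphanumeric tokens with stopwords removed."""
    return set(re.findall(r"[a-z0-9]+", text.lower())) - STOPWORDS


def context_relevance(question: str, passages: List[str]) -> List[float]:
    """Score each passage by the fraction of question tokens it contains."""
    q_tokens = _content_tokens(question)
    if not q_tokens:
        # No content words to compare against, so every passage scores 0.
        return [0.0 for _ in passages]
    return [
        len(q_tokens & _content_tokens(passage)) / len(q_tokens)
        for passage in passages
    ]
```

A passage that repeats every content word of the question scores 1.0, and one that shares none scores 0.0. Lexical overlap misses synonyms and paraphrases, so it is only a starting point for understanding the metric.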
