PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments
PhysicianBench is a benchmark for evaluating LLM agents on physician tasks grounded in real clinical workflows. It comprises 100…