Search: agent safety focus

Paper page - MedSkillAudit: A Domain-Specific Audit Framework for Medical Research Agent Skills

… Concern-Level Diagnostics for AI Peer Review (2026); Evaluating Patient Safety Risks in Generative AI: Development and Validation of a FMECA Framework for Generated Clinical Content (2026); PhysicianBench: Evaluating LLM Agents in Real-World EHR Environments (2026); An Empirical Study of Agent Skills for He… …

May 7, 2026

Paper page - Audio-Visual Intelligence in Large Foundation Models

… Audio-Visual Intelligence in Large Foundation Models: A Comprehensive Survey. arXiv: 2605.04045. We are excited to release what we believe is the first comprehensive survey on Audio-Visual Intelligence (AVI) in the era of … …

May 8, 2026

We Got Claude to Fine-Tune an Open Source LLM

… I found the explanation of Hugging Face’s “Skills Training” initiative particularly eye-opening: it lets you use a coding agent such as Claude Code, or other supported agents, to fine-tune large language models, submit GPU jobs, monitor progress, and push trained models to the Hub. …

Oct 14, 2025 · ben burtenshaw
