Paper page - Agent Skills Should Go Beyond Text: The Case for Visual Skills
…enabling multimodal agents to act with stronger visual grounding, better workflow continuity, and more personalized task execution. This is an automated message from the Librarian Bot . I found the following papers similar…