Search: agent cost control

Paper page - On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

…improving agent safety comes at the cost of degraded task performance . Such sparse and single-objective rewards severely limit real-world usability. To bridge this gap, we propose FATE, an on-policy…

Paper page - WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction

…a Long-Horizon Memory Environment for LLM Agents (2026) MementoGUI: Learning Agentic Multimodal Memory Control for Long-Horizon GUI Agents (2026) When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent…

May 29, 2026

Paper page - Learning to Act and Cooperate for Distributed Black-Box Consensus Optimization

…Work focuses on improving the efficiency and robustness of distributed black box optimization in multi-agent systems. Potential applications include cooperative sensing, resource allocation, and distributed control, which may contribute to more…

May 4, 2026

Paper page - MEME: Multi-entity & Evolving Memory Evaluation

…Only a file-based agent paired with Claude Opus 4.7 as its internal LLM partially closes the gap, but at ~70x the baseline cost, indicating closure currently depends on configurations that…

May 13, 2026

Paper page - AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents

…cost, two strong general AI-scientist baselines ( ARIS , DeepScientist ) execute partial CFD workflows but lack the domain-specific validity gates needed to convert runs into defensible scientific claims; and a controlled planted…

May 15, 2026

Paper page - Large Language Models over Networks: Collaborative Intelligence under Resource Constraints

…UAVs hit connectivity gaps, closed-loop control can't tolerate round-trips, and per-token pricing caps sustained agentic deployments. On-device LLMs hit the opposite wall: compute, memory, capability. This survey…

May 13, 2026

Paper page - REPOT: Recoverable Program-of-Thought via Checkpoint Repair

…The following papers were recommended by the Semantic Scholar API Push Your Agent: Measuring and Enforcing Quantitative Goal Persistence in Long-Horizon LLM Agents (2026) RubricRefine: Improving Tool-Use Agent Reliability with…

May 29, 2026

Paper page - Debiased Model-based Representations for Sample-efficient Continuous Control

Papers arxiv:2605.11711 Debiased Model-based Representations for Sample-efficient Continuous Control Published on May 12 Submitted by Jiafei Lyu on May 13 Tencent Hunyuan Authors: , , , Kai Yang , , , , Abstract DR.Q…

May 13, 2026

Paper page - Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

…control branches, or attention and positional-encoding modifications, which often require post-training on large-scale camera-annotated videos. Training-free alternatives avoid such post-training, but often shift the cost to…

May 15, 2026

Paper page - Stream-T1: Test-Time Scaling for Streaming Video Generation

…AI-generated summary While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models…

May 7, 2026

Followed topics

Paper page - On-Policy Self-Evolution via Failure Trajectories for Agentic Safety Alignment

Paper page - WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction

Paper page - Learning to Act and Cooperate for Distributed Black-Box Consensus Optimization

Paper page - MEME: Multi-entity & Evolving Memory Evaluation

Paper page - AI CFD Scientist: Toward Open-Ended Computational Fluid Dynamics Discovery with Physics-Aware AI Agents

Paper page - Large Language Models over Networks: Collaborative Intelligence under Resource Constraints

Paper page - REPOT: Recoverable Program-of-Thought via Checkpoint Repair

Paper page - Debiased Model-based Representations for Sample-efficient Continuous Control

Paper page - Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper page - Stream-T1: Test-Time Scaling for Streaming Video Generation