Search: Performance & optimization

Paper page - RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

…horizon optimization . In parallel, RubricEM trains a shared-backbone reflection meta-policy that distills judged trajectories into reusable rubric-grounded guidance for future attempts. The resulting RubricEM-8B achieves strong performance across…

May 13, 2026

Paper page - LiVeAction: a Lightweight, Versatile, and Asymmetric Neural Codec Design for Real-time Operation

…Dan Jacobellis , Abstract LiVeAction is a lightweight neural codec architecture that improves rate-distortion performance for resource-constrained devices by using an FFT-like structure and variance-based rate penalty instead of…

May 11, 2026

Paper page - Rethinking Memory as Continuously Evolving Connectivity

…Across three fundamentally distinct benchmarks including LoCoMo, Mind2Web, and GAIA, FluxMem achieves consistent state-of-the-art performance, demonstrating strong adaptation and generalization in complex agentic environments. The code will be open…

May 28, 2026

Paper page - Solve the Loop: Attractor Models for Language and Reasoning

…Yet recurrent architectures remain unstable to train, costly to optimize and deploy, and constrained to small, fixed recurrence depths. We introduce Attractor Models , in which a backbone module first proposes output embeddings…

May 13, 2026

Paper page - PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

…Yangyi Huang , , , , , , Weiyang Liu Abstract Parameter-efficient fine-tuning methods exhibit varying stability-plasticity trade-offs in preserving pretrained capabilities, with orthogonal fine-tuning showing optimal performance under similar parameter constraints. AI…

May 28, 2026

Paper page - Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

…Jingjie Ning , , , , Abstract Auto research operates as an empirical loop where agents iteratively refine code based on evaluation feedback, achieving improved performance across multiple tasks without human intervention. AI-generated summary We…

May 8, 2026

Paper page - Flow-OPD: On-Policy Distillation for Flow Matching Models

…the reward sparsity induced by scalar-valued rewards, and the gradient interference arising from jointly optimizing heterogeneous objectives, which together give rise to a 'seesaw effect' of competing metrics and pervasive reward…

May 11, 2026

Paper page - MEME: Multi-entity & Evolving Memory Evaluation

…1% in average accuracy) despite adequate static retrieval performance. Prompt optimization, deeper retrieval, reduced filler noise, and most stronger LLMs fail to close this gap. Only a file-based agent paired with…

May 13, 2026

Paper page - Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

…The same trained reverse process performs generation and continuous super-resolution by varying only the starting timestep: no task-specific architecture, no conditioning branch, no classifier-free guidance, no retraining per scale…

May 28, 2026

SOTA OCR with Core ML and dots.ocr

…I was able to perform a rough comparison between Dots.OCR.Runner and other VLMs such as Magistral-Small-2509 and qwen3-vl-30b , using their top quantized versions that can run…

Oct 3, 2025 · Christopher Fleetwood

Followed topics

Paper page - RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards

Paper page - LiVeAction: a Lightweight, Versatile, and Asymmetric Neural Codec Design for Real-time Operation

Paper page - Rethinking Memory as Continuously Evolving Connectivity

Paper page - Solve the Loop: Attractor Models for Language and Reasoning

Paper page - PEFT-Arena: Understanding Parameter-Efficient Finetuning from a Stability-Plasticity Perspective

Paper page - Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

Paper page - Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper page - MEME: Multi-entity & Evolving Memory Evaluation

Paper page - Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution

SOTA OCR with Core ML and dots.ocr