Showing top 123 results for "In the Weights"

…While Low-Rank Adaptation (LoRA) introduces additional weights between the LLM layers, Soft Prompting introduces additional fine-tuning-specific raw tokens to an LLM input. However, both require modification to the computational…

Jun 11, 2026

Paper page - ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both

…To further address the sparsity of functional tokens during RL, we introduce Latent-Anchored GRPO (LA-GRPO), which stabilizes the training by anchoring functional tokens with a statically weighted auxiliary objective, providing…

May 15, 2026

Paper page - SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating

…In the SFT stage, SlimSearcher employs Pareto-efficient filtration to distill trajectories that are both successful and economical, guiding the model toward inherently efficiency-aware search behaviors. During RL, we introduce Adaptive…

Jun 9, 2026

Paper page - How can embedding models bind concepts?

…Unlocking Vision-Language Alignment via Weight Recycling (2026); Learning Relative Representations for Fine-Grained Multimodal Alignment with Limited Data (2026); Unlocking Compositional Generalization in Continual Few-Shot Learning (2026); CrossFlowDG: Bridging the…

Jun 1, 2026

Paper page - Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

…https://github.com/A-EVO-Lab/a-evolve/tree/release/adaptive-auto-harness. Get this paper in your agent: hf papers read 2606.01770. Don't have the latest CLI? curl -LsSf…

Jun 3, 2026

Paper page - Clark Hash: Stateless Sparse Johnson-Lindenstrauss Quantization for Neural Embeddings

…Queries stay in floating point and are scored against the stored sketches. In the default 384-dimensional sentence-embedding setting, Clark Hash stores a cosine-search vector in 48 bytes instead of…

May 28, 2026

Paper page - Compositional Text-to-Image Generation Via Region-aware Bimodal Direct Preference Optimization

…It delivers a huge boost in compositional fidelity. 🔥 We’ve already open-sourced the dataset and model weights right here on Hugging Face. Check them out! 🎉 Super excited to share that this…

Jun 2, 2026

Paper page - RepWAM: World Action Modeling with Representation Visual-Action Tokenizers

…hf papers read 2606.13674. Don't have the latest CLI? curl -LsSf https://hf.co/cli/install.sh | bash. No model linking this paper. Cite arxiv.org/abs/2606.13674 in…

Jun 12, 2026

Paper page - RewardHarness: Self-Evolving Agentic Post-Training

…By comparing predicted judgments with ground-truth preferences and analyzing successes and failures in the reasoning process, the Orchestrator automatically refines its library of tools and skills without additional human annotation. Using…

May 14, 2026

Paper page - Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

…hf papers read 2605.30159. Don't have the latest CLI? curl -LsSf https://hf.co/cli/install.sh | bash. No model linking this paper. Cite arxiv.org/abs/2605.30159 in…

Jun 5, 2026
