Showing top 96 results for "AI training practices"

National Robotics Week — Latest Physical AI Research, Breakthroughs and Resources

…https://blogs.nvidia.com/wp-content/uploads/2026/04/nvidia_cosmos_accelerates_ai_training_for_robotics.mp4 Mimic robotics takes a different angle with mimic-video, a video-action model that pairs…

Apr 10, 2026 · NVIDIA Writers

Paper page - MARBLE: Multi-Aspect Reward Balance for Diffusion RL

…Existing practice deals with multiple rewards by training one specialist model per reward, optimizing a weighted-sum reward R(x)=sum_k w_k R_k(x), or sequentially fine-tuning with…

May 8, 2026

Paper page - SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

…These results offer practical guidance for efficient MoE compression at scale.…

May 12, 2026

Paper page - MolmoAct2: Action Reasoning Models for Real-world Deployment

…We present MolmoAct2, a fully open action reasoning model built for practical deployment, advancing its predecessor along five axes. We introduce MolmoER, a VLM backbone specialized for spatial and embodied reasoning, trained…

May 5, 2026

Discussions and forums

Hacker News · u/socratizeio · 2w ago

Ask HN: Does anyone believe role-play AI is effective for training?

We built Socratize, an AI-based training tool where employees practice real workplace scenarios instead of watching videos or taking quizzes. Most corporate training is passive. People watch content, click through slides,…

Hacker News · u/emmanol · 3w ago

There's a $50B company hiding inside Salesforce

Hi, I'm Taylor, founder of revkit.ai. I've been quiet on the YC blog because we've been heads-down with our first wave of customers. But I want to share the thesis we're building on, because I think it matters for any fo…

r/sysadmin · u/Relaxation_Time · May 4, 2026

Reality check from the Microsoft AI Tour: "Agents" hype, the enterprise disconnect, and peak AI Fatigue

Just got back from the Microsoft AI Tour in Zurich. Honestly? Nothing has globally changed since my last visit to these events two years ago. They just scrubbed "LLM" and "GenAI" from all the slides and replaced them wit…

r/nvidia · u/apoppin · May 5, 2026

DLSS Launches in Dead As Disco early access, plus new looks at STAR WARS: Galactic Racer and 007 First Light

First the article link: https://www.nvidia.com/en-us/geforce/news/star-wars-galactic-racer-launches-october-6-with-dlss-4-5/ From GeForce PR: This week, Dead As Disco enters Early Access, and there are new looks at two h…

Paper page - Large Language Models over Networks: Collaborative Intelligence under Resource Constraints

…prompt-driven coordination, cooperative policy optimization (co-training agents in authentic scenarios), and inter-agent network optimization. These compose into hybrid topologies in practice. 🎓 Learning to collaborate: Two training threads we trace…

May 13, 2026

Google Translate will help you improve your rubbish pronunciation

…Looking through version 10.10.37.885563132.3-release of Google Translate for Android, we’ve uncovered a new AI-powered “Practice” mode for perfecting your pronunciation. This isn’t yet visible…

Mar 20, 2026 · Stephen Schenck

Taming the Wild West of ML: Practical Model Signing with Sigstore

…Applications that use advanced AI models are typically developed in at least three different stages. First, a large foundation model is trained on large datasets. Next, a separate ML team finetunes the…

Apr 4, 2025 · Mihai Maruseac

Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy | NVIDIA Technical Blog

Agentic AI / Generative AI · By Aditya Vavre, Nima Tajbakhsh, Wenwen Gao, Selvaraj Anandaraj and Amit Bleiweiss…

Feb 23, 2026 · Aditya Vavre

Paper page - Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

…a single KV cache across reasoning loops and using chunk-wise training with interpolated transition and attention-aligned distillation. AI-generated summary Recurrent LLM architectures have emerged as a promising approach for…

May 12, 2026

Paper page - Does Synthetic Layered Design Data Benefit Layered Design Decomposition?

…layered image data improves graphic design decomposition by enabling scalable training and better layer distribution control compared to traditional methods. AI-generated summary Recent advances in image generation have made it easy…

May 15, 2026
