Showing top 118 results for "AI reasoning math"

Paper page - KL for a KL: On-Policy Distillation with Control Variate Baseline

…AI-generated summary: On-Policy Distillation (OPD) has emerged as a dominant post-training paradigm for large language models, especially for reasoning domains. However, OPD remains unstable in practice due to the…

May 15, 2026

Paper page - EDU-CIRCUIT-HW: Evaluating Multimodal Large Language Models on Real-World University-Level STEM Student Handwritten Solutions

…However, accurately interpreting unconstrained STEM student handwritten solutions with intertwined mathematical formulas, diagrams, and textual reasoning poses a significant challenge due to the lack of authentic and domain-specific benchmarks. Additionally, current…

May 8, 2026

Kimi (chatbot) - Wikipedia

…Moonshot AI claimed it matched the performance of OpenAI o1 in mathematics, coding, and multimodal reasoning capabilities. [12] In April 2025, Kimi-VL, an open source 16 billion parameter mixture of experts…

Sep 9, 2025 · Contributors to Wikimedia projects

Announcing Gemma 4 in the AICore Developer Preview

…Otherwise, return {false, reason_for_flag}.” Math: With better math skills, the model can now more accurately answer questions. For example: “If I get 26 paychecks per year, how much should I…

Discussions and forums

Hacker News · u/amenn · 1w ago

Yon – a topos-oriented language with a content-addressed lattice heap

Hello everyone. In the last two years I spent, as a dev, part of my free time stretching the limits of my knowledge. Not being a mathematician myself, I discovered that formalizing concepts in mathematical language could…

Hacker News · u/dabockster · Mar 24, 2026

Tell HN: Llamacpp now supports unified system RAM offloading on Linux

I'm a big fan of on-device AI inference for a million reasons, especially its potential to significantly reduce or even potentially eliminate the need for massive AI data center projects in the United States. But so far,…

r/LocalLLaMA · u/OttoRenner · 2w ago

Stop traumatizing AI into loops and turn hallucinations into an honest "I don't know!" by being NICE to them (Proof of Concept, Research, I don't want to sell anything)

!UPDATE! (20.05.2026) WE HAVE NEW NUMBERS FROM 1.500+ TESTS, IT'S WORKING! Check my update post https://www.reddit.com/r/LocalLLaMA/s/AyNOehjkYT or go straight to my GitHub https://github.com/OttoRenner/Gentle-Codi…

Hacker News · u/aaronestrada · 6d ago

Show HN: I created a RAW to HDRI stacker in (mostly) Common Lisp

This is an upgrade of a tool I created 15 years ago in Python to learn OOP and solve some inadequacies in the HDR stacking tools I could find at the time. The problem was, none of them were really "batch friendly". None …

Hacker News · u/zambelli · 3w ago

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments. I built Forge, an open-source reliability layer for self-hosted LLM tool-calling. What it does: - Adds domain-and-tool-agnostic guardrails (retry nudges, step e…

Accelerate Clean, Modular, Nuclear Reactor Design with AI Physics | NVIDIA Technical Blog

…focusing on AI for physical simulation, surrogate modeling and design optimization with PhysicsNeMo. Before joining NVIDIA, he completed a PhD at the University of Cambridge Department of Applied Mathematics and Theoretical Physics…

Apr 17, 2026 · Mark Hobbs

Paper page - CurveBench: A Benchmark for Exact Topological Reasoning over Nested Jordan Curves

…Amirreza Mohseni. Abstract: CurveBench presents a benchmark for hierarchical topological reasoning using visual inputs, demonstrating significant challenges in exact topology-aware visual reasoning even with advanced models. AI-generated summary: We introduce…

May 15, 2026

Top stories

MAI-Thinking-1: Microsoft enters the advanced-reasoning AI race with its own from-scratch model

An OpenAI model solved a famous math problem that stumped humans for 80 years