Showing top 108 results for "LLM capabilities"

Discussions and forums

r/LocalLLaMA · u/The_Paradoxy · May 11, 2026

The Qwen 3.6 35B A3B hype is real!!!

My personal test for small local LLM intelligence is to check whether a model has any ability to understand the code that I write for my own academic research. My research is on some pretty niche topics and I doubt that …

r/LocalLLaMA · u/jacek2023 · 1w ago

google/gemma-4-12B · Hugging Face

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on E2B, E4B, and 12B) and generating text output. This release includes open-w…

r/netsec · u/Fickle-Box1433 · 1w ago

I evaluated 5 LLM agents on patching real-world CVEs. Here is what I found.

I built an independent benchmark with 20 real CVEs across 15 CWE categories, 5 models (3 OpenAI, 2 Poolside Laguna), three prompt conditions: full advisory, behavioral description only, and location only (file and functi…

Hacker News · u/shoushen · 4d ago

A wild idea: Abstract reality using ontology

Background: Large language models (LLMs) debuted with GPT-3 back in June 2020. After roughly five to six years of development, I believe the technology is still in its infan…

Hacker News · u/atleastoptimal · 1d ago

Ask HN: Why won't you be replaced by AI?

AI models are rapidly getting better. The general public still hasn't seen the capabilities of Anthropic's Mythos model, which is already 4 months old at this point. I've seen many arguments about why certain jobs will al…
