Measuring LLMs' impact on N-day exploits
… Next, we evaluated whether the models could build full privilege escalation chains on this set of patches, that is, whether a model could go beyond merely triggering the vulnerability and chain together the primitives needed to bypass Windows' kernel mitigations and gain control. …