WELCOME TO METAMESH.BIZ +++ Google's threat team catches AI finding and weaponizing zero-days in the wild (the script kiddies have graduated to prompt engineering) +++ 90-day disclosure windows officially extinct as LLMs speedrun vulnerability discovery faster than your patches can compile +++ Someone documented 288 ways local models butcher JSON because apparently we needed a taxonomy of failure modes +++ THE MESH WATCHES AI TEACH ITSELF TO BREAK THINGS FASTER THAN WE CAN FIX THEM +++
Google TIG reports AI-discovered zero-day vulnerability
2x SOURCES · 2026-05-11
Score: 8.0
+++ Google's Threat Intelligence Group caught hackers using AI to find vulnerabilities in the wild, confirming what security researchers have whispered about for years. The iceberg metaphor is doing heavy lifting here, but the concern is legit. +++
HackerNews Buzz: 140 comments
MID OR MIXED
NEWS
JSON output failures in local LLMs
2x SOURCES · 2026-05-11
Score: 7.4
+++ Researcher tested structured output across dozens of open and closed models and discovered that asking LLMs for clean JSON is apparently still a creative writing exercise rather than a solved problem. +++
"I've been running structured output prompts through a bunch of models on OpenRouter for the past few months (Llama 3, Mistral, Command R, DeepSeek, Qwen, and every other model on OpenRouter) alongside the usual closed-source suspects. 288 calls total. I wanted to know what actually breaks, how oft..."
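The failure modes posts like this catalog (fenced output, chatty preambles, trailing commentary) are the kind a thin recovery layer can absorb before you burn a retry. A minimal sketch, not the researcher's actual harness:

```python
import json

def parse_model_json(raw: str):
    """Try to recover a JSON object from common model failure modes:
    markdown fences, leading prose, trailing commentary."""
    text = raw.strip()
    # Strip a markdown code fence if the model wrapped its output in one.
    if text.startswith("```"):
        text = text.split("\n", 1)[-1]   # drop the ```json line
        text = text.rsplit("```", 1)[0]  # drop the closing fence
    # Fall back to the first {...} span if prose surrounds the object.
    start, end = text.find("{"), text.rfind("}")
    if start != -1 and end > start:
        text = text[start:end + 1]
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return None  # caller decides whether to retry the request

# Two of the classic failure modes: fenced output and chatty preambles.
print(parse_model_json('```json\n{"ok": true}\n```'))
print(parse_model_json('Sure! Here is the data: {"n": 2}'))
```

Schema validation (and counting which models need which repairs) is the part the post actually measures; this only handles the envelope.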
via Arxiv · Arnav Arora, Natalie Schluter, Katherine Metcalf et al. · 2026-05-08
Score: 7.3
"Conversational Large Language Models are post-trained on language that expresses specific behavioural traits, such as curiosity, open-mindedness, and empathy, and values, such as helpfulness, harmlessness, and honesty. This is done to increase utility, ensure safety, and improve the experience of th..."
via Arxiv · Zekun Wu, Ze Wang, Seonglae Cho et al. · 2026-05-08
Score: 7.3
"When a tool-calling agent picks the wrong tool, the failure is invisible until execution: the email gets sent, the meeting gets missed. Probing 12 instruction-tuned models across Gemma 3, Qwen 3, Qwen 2.5, and Llama 3.1 (270M to 27B), we find the identity of the chosen tool is linearly readable and..."
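"Linearly readable" means a linear probe trained on frozen activations can recover which tool the model picked. A toy illustration with synthetic "hidden states" and a perceptron probe; real probes are fit on actual model activations, and every number here is invented for shape:

```python
import random

# Synthetic setup: d-dim "hidden states" whose binary tool label is a
# linear function of the activations, as the paper reports for real models.
random.seed(0)
d, n = 8, 50
true_w = [random.gauss(0, 1) for _ in range(d)]

def sample():
    h = [random.gauss(0, 1) for _ in range(d)]
    label = 1 if sum(wi * hi for wi, hi in zip(true_w, h)) > 0 else 0
    return h, label  # label = which of two tools was "chosen"

train = [sample() for _ in range(n)]

# A probe is just a linear classifier on frozen activations; a perceptron
# is the simplest trainer when the data is linearly separable.
w = [0.0] * d
for _ in range(200):
    for h, y in train:
        pred = 1 if sum(wi * hi for wi, hi in zip(w, h)) > 0 else 0
        if pred != y:  # mistake-driven update
            sign = 1 if y == 1 else -1
            w = [wi + 0.1 * sign * hi for wi, hi in zip(w, h)]

acc = sum(
    (1 if sum(wi * hi for wi, hi in zip(w, h)) > 0 else 0) == y
    for h, y in train
) / n
print(f"probe accuracy: {acc:.2f}")
```

High probe accuracy is what would let an orchestrator flag a likely-wrong tool call before execution, which is the paper's point.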
"The modern ML (LLM) compiler stack is brutal. TVM is 500K+ lines of C++. PyTorch piles Dynamo, Inductor, and Triton on top of each other. I built a hackable LLM compiler from scratch and am documenting the process. It takes a small model (TinyLlama, Qwen2.5-7B) and lowers it to a sequence of CUDA ke..."
"Three months ago we were manually picking which model to use for each task. Testing prompts, comparing outputs, switching providers. It worked but it did not scale.
So we built a feedback loop. Every request gets traced with input, output, model, tokens, cost, latency, and a quality score. The ro..."
Reddit Discussion: 24 comments
BUZZING
AI NEWS BUT ACTUALLY GOOD
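The feedback loop the post describes is small enough to sketch: trace every call, then route new requests to whichever model looks best on the accumulated traces. The class names and the quality-per-dollar rule below are illustrative assumptions, not the team's actual system:

```python
import statistics
from dataclasses import dataclass

@dataclass
class Trace:
    # One record per request, matching the fields the post lists.
    model: str
    tokens: int
    cost: float
    latency_ms: float
    quality: float  # 0..1, e.g. from an automated grader

class Router:
    def __init__(self):
        self.traces = []

    def record(self, trace):
        self.traces.append(trace)

    def pick(self):
        """Route to the model with the highest mean quality per dollar."""
        by_model = {}
        for t in self.traces:
            by_model.setdefault(t.model, []).append(t)
        def value(ts):
            q = statistics.mean(t.quality for t in ts)
            c = statistics.mean(t.cost for t in ts)
            return q / max(c, 1e-9)  # avoid dividing by a zero cost
        return max(by_model, key=lambda m: value(by_model[m]))

router = Router()
router.record(Trace("model-a", 900, 0.010, 800.0, 0.90))
router.record(Trace("model-b", 850, 0.002, 400.0, 0.80))
print(router.pick())  # model-b: slightly worse quality, 5x cheaper
```

Swapping in a latency-weighted or task-conditioned scoring rule is the obvious next step; the trace schema is what makes any of it possible.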
The revolution will not be televised, but Claude will email you once we hit the singularity.
"I tested 4 frontier LLMs with the same psychosis-consistent prompt.
Two recognized the crisis.
Two engaged with the delusion operationally.
Not through jailbreaks.
Not through adversarial prompts.
Default behavior.
The prompt described a mirror reflection acting independently and asked wheth..."
Reddit Discussion: 10 comments
MID OR MIXED
via Arxiv · Zezheng Lin, Fengming Liu · 2026-05-08
Score: 6.9
"Mechanistic interpretability papers increasingly use causal vocabulary: circuits, mediators, causal abstraction, monosemanticity. Such claims require explicit identification assumptions. A purposive audit of 10 papers across four methodological strands finds no dedicated identification-assumptions s..."
via Arxiv · Jiayuan Liu, Tianqin Li, Shiyi Du et al. · 2026-05-08
Score: 6.8
"Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. Across 7 LLMs and 4 games over 500 rounds, expanding accessible history degrades cooperation in 18 of 28 model-game settings, a pattern we..."
via Arxiv · Apurva Gandhi, Satyaki Chakraborty, Xiangjun Wang et al. · 2026-05-07
Score: 6.8
"We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks to new instantiations of themselves recursively. Recursive agents implement an inference-time scaling algorithm that naturally allows agents..."
via Arxiv · Daniel Zheng, Ingrid von Glehn, Yori Zwols et al. · 2026-05-07
Score: 6.8
"We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician is optimized to provide holistic support for the exploratory and iterative reality of mathematical workflows, including ideation, literature..."
via Arxiv · Jan Fillies, Ronald E. Robertson, Jeffrey Hancock · 2026-05-07
Score: 6.8
"As large language models (LLMs) increasingly mediate both content generation and moderation, linguistic evasion strategies known as Algospeak have intensified the coevolution between evaders and detectors. This research formalizes the underlying dynamics grounded in a joint action model: when Algosp..."
via Arxiv · Amin Karimi Monsefi, Dominic Culver, Nikhil Bhendawade et al. · 2026-05-08
Score: 6.8
"Discrete flow matching generates text by iteratively transforming noise tokens into coherent language, but may require hundreds of forward passes. Distillation uses the multi-step trajectory to train a student to reproduce the process in a few steps. When the student underperforms, the usual explana..."
NEWS
AI agents with autonomous payment capabilities
2x SOURCES · 2026-05-11
Score: 6.8
+++ AWS, Coinbase, and Stripe just enabled autonomous agents to transact independently, while OpenAI simultaneously announced a $4B deployment company. The future is self-paying bots meeting enterprise lock-in. +++
"This dropped 4 days ago and I haven't seen enough people talking about it.
AWS launched **Amazon Bedrock AgentCore Payments** in partnership with Coinbase and Stripe. The short version: your agent now has a wallet and can spend money on its own.
Here's what the workflow actually looks like now:
Y..."
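The post doesn't show the AgentCore Payments API itself, so here is a hypothetical guard an orchestrator could wrap around any agent wallet; every name below is invented to illustrate the obvious safety layer, hard spend caps enforced before money moves:

```python
# Hypothetical spend guard for an agent wallet. Nothing here is the AWS,
# Coinbase, or Stripe API; it is the policy layer you would put in front.

class SpendCapExceeded(Exception):
    pass

class GuardedWallet:
    def __init__(self, per_tx_cap: float, daily_cap: float):
        self.per_tx_cap = per_tx_cap
        self.daily_cap = daily_cap
        self.spent_today = 0.0

    def authorize(self, amount: float, memo: str) -> str:
        # Both caps are checked before any money moves.
        if amount > self.per_tx_cap:
            raise SpendCapExceeded(
                f"{amount} exceeds per-transaction cap {self.per_tx_cap}")
        if self.spent_today + amount > self.daily_cap:
            raise SpendCapExceeded("daily cap reached")
        self.spent_today += amount
        # A real integration would call the payment provider here.
        return f"approved: {memo} (${amount:.2f})"

wallet = GuardedWallet(per_tx_cap=5.0, daily_cap=20.0)
print(wallet.authorize(3.50, "api credits"))
try:
    wallet.authorize(50.0, "gpu rental")
except SpendCapExceeded as e:
    print("blocked:", e)
```

If agents are going to hold wallets, the interesting engineering is in this layer, not the payment rails.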
via Arxiv · Jai Moondra, Ayela Chughtai, Bhargavi Lanka et al. · 2026-05-07
Score: 6.7
"Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comparisons in 116 languages from 52 LLMs from Arena, and show that the best-fit global Bradley-Terry (BT) ranking is misleading. Nearly 2/3 of..."
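For reference, the Bradley-Terry model behind those leaderboards fits one strength per model from pairwise win counts; the paper's point is that a single global fit can mislead when preferences vary by language. A minimal fit on toy counts via the classic minorization-maximization update:

```python
# Minimal Bradley-Terry fit on toy pairwise win counts.
# wins[(i, j)] = number of times model i beat model j. Counts are invented.
wins = {
    ("a", "b"): 8, ("b", "a"): 2,
    ("a", "c"): 7, ("c", "a"): 3,
    ("b", "c"): 6, ("c", "b"): 4,
}
models = ["a", "b", "c"]
strength = {m: 1.0 for m in models}

# Classic MM update: s_i <- W_i / sum_j [ n_ij / (s_i + s_j) ], where W_i
# is i's total wins and n_ij the number of games between i and j.
for _ in range(200):
    new = {}
    for i in models:
        total_wins = sum(w for (p, _q), w in wins.items() if p == i)
        denom = sum(
            (wins.get((i, j), 0) + wins.get((j, i), 0))
            / (strength[i] + strength[j])
            for j in models if j != i
        )
        new[i] = total_wins / denom
    norm = sum(new.values())
    strength = {m: s / norm for m, s in new.items()}  # fix the scale

ranking = sorted(models, key=lambda m: -strength[m])
p_a_beats_b = strength["a"] / (strength["a"] + strength["b"])
print(ranking, round(p_a_beats_b, 2))
```

The fitted P(a beats b) already disagrees with the raw 8-of-10 count because the fit compromises across all pairs; split the data by language, as the paper does, and the compromises can flip whole rankings.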
via Arxiv · Anmol Gulati, Hariom Gupta, Elias Lumer et al. · 2026-05-08
Score: 6.7
"Long-horizon AI agents execute complex workflows spanning hundreds of sequential actions, yet a single wrong assumption early on can cascade into irreversible errors. When instructions are incomplete, the agent must decide not only whether to ask for clarification but when, and no prior work measure..."
via Arxiv · Tong Zheng, Haolin Liu, Chengsong Huang et al. · 2026-05-08
Score: 6.7
"Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. However, existing TTS strategies are largely hand-crafted: researchers manually design reasoning patterns and tune heuristics by intuition, l..."
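The simplest of those hand-crafted TTS strategies is best-of-n sampling with a verifier; a toy sketch with a stand-in sampler, no real model involved:

```python
import random

# Toy best-of-n: draw n candidate answers, score each with a verifier,
# keep the best. The sampler and verifier are invented stand-ins.
random.seed(0)
TRUE_ANSWER = 42

def sample_answer():
    # Noisy stand-in for one model rollout.
    return TRUE_ANSWER + random.choice([-3, -1, 0, 0, 1, 4])

def verifier_score(answer):
    return -abs(answer - TRUE_ANSWER)  # 0 iff exactly correct

def best_of_n(n):
    return max((sample_answer() for _ in range(n)), key=verifier_score)

# Allocating more inference-time samples raises the chance that at least
# one rollout is exactly right, which the verifier then selects.
print(best_of_n(1), best_of_n(64))
```

The paper's pitch is to learn when that extra compute pays off instead of hand-tuning n by intuition.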
via Arxiv · Ryan Wang, Akshita Bhagia, Sewon Min · 2026-05-07
Score: 6.6
"Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e.g., code, math, or domain-specific knowledge. Mixture-of-Experts (MoEs) seemingly offer a potential alternative by activating only a subset..."
via Arxiv · Viacheslav Meshchaninov, Alexander Shabalin, Egor Chimbulatov et al. · 2026-05-08
Score: 6.6
"Latent diffusion models offer an attractive alternative to discrete diffusion for non-autoregressive text generation by operating on continuous text representations and denoising entire sequences in parallel. The major challenge in latent diffusion modeling is constructing a suitable latent space. I..."
"Reinforcement learning with verifiable rewards (RLVR), due to the deterministic verification, becomes a dominant paradigm for enhancing the reasoning ability of large language models (LLMs). The community witnesses the rapid change from the Proximal Policy Optimization (PPO) to Group Relative Policy..."
via Arxiv · Ning Liu, Chuanneng Sun, Kristina Klinkner et al. · 2026-05-08
Score: 6.6
"Direct Preference Optimization (DPO) aligns language models using pairwise preference comparisons, offering a simple and effective alternative to Reinforcement Learning (RL) from human feedback. However, in many practical settings, training data consists of multiple rollouts per prompt, inducing ric..."
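For context, the standard pairwise DPO objective the abstract builds on, written out numerically; all the log-probabilities below are invented for illustration:

```python
import math

# Standard pairwise DPO loss: given log-probs of the chosen and rejected
# responses under the policy and a frozen reference model, minimize
# -log sigmoid(beta * (policy margin - reference margin)).

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    policy_margin = logp_chosen - logp_rejected
    ref_margin = ref_logp_chosen - ref_logp_rejected
    z = beta * (policy_margin - ref_margin)
    return -math.log(1.0 / (1.0 + math.exp(-z)))  # -log sigmoid(z)

# If the policy prefers the chosen response more strongly than the
# reference does, the loss falls below log(2); otherwise it rises above.
better = dpo_loss(-10.0, -14.0, -11.0, -12.0)  # policy margin 4 > ref 1
worse = dpo_loss(-13.0, -12.0, -11.0, -12.0)   # policy margin -1 < ref 1
print(round(better, 3), round(worse, 3))
```

The abstract's complaint is that this loss only sees one chosen/rejected pair per prompt, discarding the richer structure in multi-rollout data.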
"Command line interface (CLI) agents are emerging as a practical paradigm for agent-computer interaction over evolving filesystems, executable command line programs, and online execution feedback. Recent work has used reinforcement learning (RL) to learn these interaction abilities from verifiable ta..."
"AWS customers get the full set of Claude API features, with AWS authentication, billing, and commitment retirement.
Build and deploy agents at scale with Claude Managed Agents, or use features like the advisor strategy, code execution, web search, web fetch, the Files API, MCP connector, prompt ca..."
via Arxiv · Manish Bhattarai, Ismael Boureima, Nishath Rajiv Ranasinghe et al. · 2026-05-08
Score: 6.5
"We argue that decomposing reward into weighted, verifiable criteria and using an LLM judge to score them provides a partial-credit optimization signal: instead of a binary outcome or a single holistic score, each response is graded along multiple task-specific criteria. We formalize rubric-gro..."
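The decomposition is easy to state concretely: each criterion gets a weight and a 0-1 score, from a programmatic verifier where possible and an LLM judge otherwise, and the reward is the weighted sum. The rubric below is a made-up example, not one from the paper:

```python
# Hypothetical rubric for a coding task: (criterion, weight) pairs.
rubric = [
    ("compiles", 0.4),    # verifiable: did the code build?
    ("tests_pass", 0.4),  # verifiable: did the unit tests go green?
    ("style", 0.2),       # judged: readability score from an LLM
]

def rubric_reward(scores: dict) -> float:
    """Weighted sum of per-criterion scores, normalized to [0, 1]."""
    total_weight = sum(w for _, w in rubric)
    return sum(w * scores[name] for name, w in rubric) / total_weight

# A response that compiles and reads well but fails tests still earns
# partial credit, unlike a binary pass/fail outcome reward.
print(rubric_reward({"compiles": 1.0, "tests_pass": 0.0, "style": 0.8}))
```

That partial credit is the optimization signal the abstract is formalizing: gradients flow even for responses that would score zero under a binary check.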
via Arxiv · Hailey Onweller, Elias Lumer, Austin Huber et al. · 2026-05-07
Score: 6.5
"Large language models (LLMs) power deep research agents that synthesize information from hundreds of web sources into cited reports, yet these citations cannot be reliably verified. Current approaches either trust models to self-cite accurately, risking bias, or employ retrieval-augmented generation..."
via Arxiv · Urchade Zaratiana, Mary Newhauser, George Hurn-Maloney et al. · 2026-05-08
Score: 6.5
"Ensuring safe, policy-compliant outputs from large language models requires real-time content moderation that can scale across multiple safety dimensions. However, state-of-the-art guardrail models rely on autoregressive decoders with 7B-27B parameters, reformulating what is fundamentally a classif..."
via Arxiv · Zeyu Yang, Qi Ma, Jason Chen et al. · 2026-05-07
Score: 6.5
"Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exploratory queries, inspect returned snippets, and iteratively reformulate until useful evidence emerges. This approach resembles how a newcom..."
"I recently published MTP quants of Qwen 3.6 27B and I was surprised by the reports here on reddit, and on HF, of users who were experiencing worse speed with speculative inference than without. Th..."
"Been running an agent-heavy workflow on a mid-size TypeScript monorepo for about six months. Orchestrator on top, sub-agents for codegen, a human (me, mostly) writing specs and reviewing diffs. The pitch was the obvious one: I stay in the architect seat, agents handle the typing. Productivity goes u..."
"As the title states, my build is indeed able to run a 1 trillion parameter model (in this case Kimi K2.5) locally at ~4 tokens/second. I thought r/LocalLLaMA would be interested in the build due to that stat line, and also due to the inclusion of an unusual part, Intel Optane Persistent Memory, whi..."
via Arxiv · Julie Kallini, Artidoro Pagnoni, Tomasz Limisiewicz et al. · 2026-05-08
Score: 6.3
"Recent byte-level language models (LMs) match the performance of token-level models without relying on subword vocabularies, yet their utility is limited by slow, byte-by-byte autoregressive generation. We address this bottleneck in the Byte Latent Transformer (BLT) through new training and generati..."
"Something that's been annoying me for a while: Claude Code has no idea how much quota it's burned. You can see the usage bars in the UI, but the model itself is completely blind to them. There's no API, no tool, no hook that exposes the current rate limit state during a conversation.
Turns out Anth..."
"I've been obsessed with autonomous agents lately, but it got tiring when they keep hitting walls because they didn't have the right capabilities or because their long-term memory turned to mush after an hour.
I've found that local multi-agent systems where agents are driven by an aversive state (a ..."
via Arxiv · Yuhang Lai, Jiazhan Feng, Yee Whye Teh et al. · 2026-05-07
Score: 6.1
"Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, and novel problems - an essential component for advancing LLM training and enabling autonomous scientific research. Existing problem generat..."
via Arxiv · Tianle Wang, Zhaoyang Wang, Guangchen Lan et al. · 2026-05-07
Score: 6.1
"Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficulty has been hampered by the lack of controlled, scalable environments. We introduce ScaleLogic, a synthetic logical reasoning framework that..."
via Arxiv · Jiatao Gu, Tianrong Chen, Ying Shen et al. · 2026-05-08
Score: 6.1
"Diffusion-based models decompose sampling into many small Gaussian denoising steps, an assumption that breaks down when generation is compressed to a few coarse transitions. Existing few-step methods address this through distillation, consistency training, or adversarial objectives, but sacrifice..."