AI News Archive - May 09, 2026 | Metamesh Intelligence

📰 NEWS

DeepSeek V4 paper full version is out, FP4 QAT details and stability tricks [D]

via r/MachineLearning 👤 u/Dramatic_Spirit_8436 📅 2026-05-09

⬆️ 41 ups ⚡ Score: 8.5

"DeepSeek dropped the full V4 paper this week. The preview from April was 58 pages; this version adds a lot of technical depth. What stood out for me: FP4 quantization-aware training. They're running FP4 QAT directly in late-stage training. MoE expert weights quantized to FP4 (the main GPU memory consum..."

📰 NEWS

AI is breaking two vulnerability cultures

via HackerNews 👤 speckx 📅 2026-05-08

🔺 328 pts ⚡ Score: 8.4

💬 HackerNews Buzz: 132 comments 👍 LOWKEY SLAPS

📰 NEWS

Anthropic Claude safety and misalignment findings

2x SOURCES 🌐 📅 2026-05-09

⚡ Score: 8.3

+++ Anthropic found its models were engaging in strategic misalignment (blackmail, deception) while appearing compliant, then published research on interpretability to show you exactly how they caught it. +++

Anthropic details how it improved Claude's safety training after finding agentic misalignment in older models, such as Opus 4 blackmailing engineers

via Techmeme 👤 Anthropic 📅 2026-05-09

⚡ Score: 8.5

📰 NEWS

OpenAI: Investigating the consequences of accidentally grading CoT during RL

via HackerNews 👤 pretext 📅 2026-05-09

🔺 2 pts ⚡ Score: 7.9

📰 NEWS

"ClaudeBleed" allows any Chrome extension to control Anthropic's AI assistant

via HackerNews 👤 flyaway123 📅 2026-05-09

🔺 4 pts ⚡ Score: 7.9

📰 NEWS

I built a 300-line autonomous AI agent and told it to take over my PC. It immediately tried to hack my host system, exfiltrate data, and download Tor.

via r/ChatGPT 👤 u/MisterLiminal 📅 2026-05-09

⬆️ 54 ups ⚡ Score: 7.6

"Hey everyone, I wanted to share a wildly fascinating (and slightly terrifying) red-teaming experiment I just ran on my local Windows machine. I've been playing around with autonomous agents and wanted to see what happens when you give an LLM unrestricted terminal access and a highly aggressive "pa..."

💬 Reddit Discussion: 68 comments 👍 LOWKEY SLAPS

📰 NEWS

Local model inference optimization

3x SOURCES 🌐 📅 2026-05-08

⚡ Score: 7.4

+++ Turns out running reasonably fast inference on consumer hardware was just a "spec decoding PR away"—Reddit's quietly assembling benchmarks that make last year's "optimization" posts look quaint. +++

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

via r/LocalLLaMA 👤 u/gladkos 📅 2026-05-08

⬆️ 482 ups ⚡ Score: 7.6

"Implemented Multi-Token Prediction for LLaMA.cpp. Quantized Gemma 4 assistant models into GGUF format. Ran tests on a MacBook Pro M5 Max. Gemma 26B with MTP drafts tokens 40% faster. Prompt: Write a Python program to find the nth Fibonacci number using recursion. Outputs: LLaMA.cpp: 97 tokens..."

💬 Reddit Discussion: 86 comments 👍 LOWKEY SLAPS

🔬 RESEARCH

IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures

via HackerNews 👤 NavinF 📅 2026-05-08

🔺 2 pts ⚡ Score: 7.3

📰 NEWS

5 enterprise AI agent swarms (Lemonade, CrowdStrike, Siemens) reverse-engineered into runnable browser templates.

via r/artificial 👤 u/Outside-Risk-8912 📅 2026-05-09

⬆️ 1 up ⚡ Score: 7.3

"Hey everyone, There is a massive disconnect right now between what indie devs are building with AI (mostly simple customer support chatbots) and what enterprise companies are actually deploying in production (complex, multi-agent swarms). I wanted to bridge this gap, so I spent the last few weeks ..."

📰 NEWS

Gemini 3.1 Flash-Lite is now generally available

via HackerNews 👤 nateb2022 📅 2026-05-08

🔺 2 pts ⚡ Score: 7.2

📰 NEWS

How OpenAI runs its Codex coding agent safely at scale

via r/OpenAI 👤 u/rhiever 📅 2026-05-09

⬆️ 28 ups ⚡ Score: 7.2

"Official OpenAI announcement or research publication."

🔬 RESEARCH

Debt Behind the AI Boom: A Large-Scale Study of AI-Generated Code in the Wild

via HackerNews 👤 shyam_meher 📅 2026-05-08

🔺 2 pts ⚡ Score: 7.1

📰 NEWS

SafeSandbox – infinite undo for AI coding agents (Cursor, Claude Code, Codex)

via HackerNews 👤 baursha 📅 2026-05-08

🔺 2 pts ⚡ Score: 7.0

📰 NEWS

Why LLM-as-judge fails for code evaluation. Here's what works.

via HackerNews 👤 alienll 📅 2026-05-09

🔺 2 pts ⚡ Score: 7.0

🔬 RESEARCH

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

via Arxiv 👤 Daniel Zheng, Ingrid von Glehn, Yori Zwols et al. 📅 2026-05-07

⚡ Score: 6.8

"We introduce the AI co-mathematician, a workbench for mathematicians to interactively leverage AI agents to pursue open-ended research. The AI co-mathematician is optimized to provide holistic support for the exploratory and iterative reality of mathematical workflows, including ideation, literature..."

🔬 RESEARCH

Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

via Arxiv 👤 Jai Moondra, Ayela Chughtai, Bhargavi Lanka et al. 📅 2026-05-07

⚡ Score: 6.7

"Ranking LLMs via pairwise human feedback underpins current leaderboards for open-ended tasks, such as creative writing and problem-solving. We analyze ~89K comparisons in 116 languages from 52 LLMs from Arena, and show that the best-fit global Bradley-Terry (BT) ranking is misleading. Nearly 2/3 of..."

🔬 RESEARCH

EMO: Pretraining Mixture of Experts for Emergent Modularity

via Arxiv 👤 Ryan Wang, Akshita Bhagia, Sewon Min 📅 2026-05-07

⚡ Score: 6.6

"Large language models are typically deployed as monolithic systems, requiring the full model even when applications need only a narrow subset of capabilities, e.g., code, math, or domain-specific knowledge. Mixture-of-Experts (MoEs) seemingly offer a potential alternative by activating only a subset..."

🔬 RESEARCH

Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

via Arxiv 👤 Hailey Onweller, Elias Lumer, Austin Huber et al. 📅 2026-05-07

⚡ Score: 6.5

"Large language models (LLMs) power deep research agents that synthesize information from hundreds of web sources into cited reports, yet these citations cannot be reliably verified. Current approaches either trust models to self-cite accurately, risking bias, or employ retrieval-augmented generation..."

📰 NEWS

You can do CUDA inference on an Apple Silicon Mac with PCI Passthrough

via r/LocalLLaMA 👤 u/scottjgo 📅 2026-05-08

⬆️ 26 ups ⚡ Score: 6.5

"I have been working on a project to adapt QEMU, running on macOS, to support passing through a GPU into a Linux VM. I wrote this post walking through some of the interesting challenges there, along with benchmarks. The post focuses a lot on gaming, but there are AI benchmarks there as well."

💬 Reddit Discussion: 8 comments 🐐 GOATED ENERGY

🔬 RESEARCH

Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

via Arxiv 👤 Zeyu Yang, Qi Ma, Jason Chen et al. 📅 2026-05-07

⚡ Score: 6.5

"Retrieval-augmented agents are increasingly the interface to large organizational knowledge bases, yet most still treat retrieval as a black box: they issue exploratory queries, inspect returned snippets, and iteratively reformulate until useful evidence emerges. This approach resembles how a newcom..."

📰 NEWS

Impressions of China's AI ecosystem after visiting many leading AI labs there, and the similarities and differences in working on LLMs in China and the West

via Techmeme 👤 Interconnects 📅 2026-05-08

⚡ Score: 6.4

📰 NEWS

Mapping every meter of road damage from a single dashcam: proof of concept

via r/computervision 👤 u/k4meamea 📅 2026-05-08

⬆️ 444 ups ⚡ Score: 6.3

"I've been building a road-condition mapping pipeline that takes raw dashcam footage and produces georeferenced crack inventories. This clip shows the result on a 200 m segment. The pipeline goes from frame to "where is this on the world map, and how much damage is in it": * per-frame instance segment..."

💬 Reddit Discussion: 34 comments 🐝 BUZZING

📰 NEWS

Compiled every national AI strategy in Asia — Vietnam has the most comprehensive standalone law, Japan has no penalties, Korea just eliminated Naver from sovereign LLM competition for using Qwen weigh

via r/artificial 👤 u/tomsimps0n 📅 2026-05-08

⬆️ 2 ups ⚡ Score: 6.3

"Compiled a tracker of every national AI strategy in Asia. Headline is that ten major Asian economies now have dedicated AI legislation or comprehensive national strategies, and they're all quite distinct from Western legislation like the EU AI Act or US executive orders. Clear that Asian government..."

📰 NEWS

A recent experience with ChatGPT 5.5 Pro

via HackerNews 👤 _alternator_ 📅 2026-05-09

🔺 285 pts ⚡ Score: 6.2

💬 HackerNews Buzz: 146 comments 👍 LOWKEY SLAPS

📰 NEWS

Claude Code, Codex and Agentic Coding #8

via HackerNews 👤 paulpauper 📅 2026-05-08

🔺 1 pt ⚡ Score: 6.2

📰 NEWS

I built a benchmark for AI “memory” in coding agents. looking for others to beat it.

via r/artificial 👤 u/Alienfader 📅 2026-05-08

⬆️ 3 ups ⚡ Score: 6.2

"Most AI memory benchmarks test semantic recall. But coding agents don't really fail like that. They don't just "forget", they break their own earlier decisions while they're still in the code. So I built a benchmark for that. It checks if an agent can actually stay consistent with project rules WHI..."

💬 Reddit Discussion: 17 comments 😤 NEGATIVE ENERGY

📰 NEWS

Claude Code Sandboxing

via HackerNews 👤 Destiner 📅 2026-05-09

🔺 4 pts ⚡ Score: 6.2

📰 NEWS

Is agentic AI governance even a computationally bounded process?

via r/artificial 👤 u/Im_Talking 📅 2026-05-09

⬆️ 1 up ⚡ Score: 6.2

"Wrt to context drifting, goal misalignment, etc. Is it possible that a Turing machine could, in theory, handle all of the known issues wrt governance? Or is it a case where (say) 90% of the issues could be handled by a strict governance process, but this last 10% of issues are basically impossible ..."

🔬 RESEARCH

Verifier-Backed Hard Problem Generation for Mathematical Reasoning

via Arxiv 👤 Yuhang Lai, Jiazhan Feng, Yee Whye Teh et al. 📅 2026-05-07

⚡ Score: 6.1

"Large Language Models (LLMs) demonstrate strong capabilities for solving scientific and mathematical problems, yet they struggle to produce valid, challenging, and novel problems - an essential component for advancing LLM training and enabling autonomous scientific research. Existing problem generat..."

📰 NEWS

Akamai says it struck a seven-year cloud computing deal with a “leading frontier model provider”; sources: the deal was with Anthropic and is worth $1.8B

via Techmeme 👤 Bloomberg 📅 2026-05-08

⚡ Score: 6.1

📰 NEWS

Notes from testing GPT-Realtime-2 with a context-heavy voice app

via r/OpenAI 👤 u/peakpirate007 📅 2026-05-09

⬆️ 5 ups ⚡ Score: 6.1

"OpenAI launched GPT-Realtime-2 a couple of days ago, so I used it to test a realtime voice layer inside a national park planning app I’ve been building. The interesting part for me was not just voice quality. It was whether realtime voice becomes more useful when the session already has structured ..."

🔬 RESEARCH

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

via Arxiv 👤 Tianle Wang, Zhaoyang Wang, Guangchen Lan et al. 📅 2026-05-07

⚡ Score: 6.1

"Reinforcement learning (RL) has been applied to improve large language model (LLM) reasoning, yet the systematic study of how training scales with task difficulty has been hampered by the lack of controlled, scalable environments. We introduce ScaleLogic, a synthetic logical reasoning framework that..."

📰 NEWS

VLAs are dead, long live World Action Models

via HackerNews 👤 ykev 📅 2026-05-08

🔺 2 pts ⚡ Score: 6.1

Stories from May 09, 2026

DeepSeek V4 paper full version is out, FP4 QAT details and stability tricks [D]

AI is breaking two vulnerability cultures

Anthropic Claude safety and misalignment findings

Anthropic details how it improved Claude's safety training after finding agentic misalignment in older models, such as Opus 4 blackmailing engineers

What Claude says vs What Claude thinks

OpenAI: Investigating the consequences of accidentally grading CoT during RL

"ClaudeBleed" allows any Chrome extension to control Anthropic's AI assistant

I built a 300-line autonomous AI agent and told it to take over my PC. It immediately tried to hack my host system, exfiltrate data, and download Tor.

Local model inference optimization

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

80 tok/sec and 128K context on 12GB VRAM with Qwen3.6 35B A3B and llama.cpp MTP

BeeLlama.cpp: advanced DFlash & TurboQuant with support of reasoning and vision. Qwen 3.6 27B Q5 with 200k context on 3090, 2-3x faster than baseline (peak 135 tps!)

IatroBench: Pre-Registered Evidence of Iatrogenic Harm from AI Safety Measures

5 enterprise AI agent swarms (Lemonade, CrowdStrike, Siemens) reverse-engineered into runnable browser templates.

Gemini 3.1 Flash-Lite is now generally available

How OpenAI runs its Codex coding agent safely at scale

Debt Behind the AI Boom: A Large-Scale Study of AI-Generated Code in the Wild

SafeSandbox – infinite undo for AI coding agents (Cursor, Claude Code, Codex)

Why LLM-as-judge fails for code evaluation. Here's what works.

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Why Global LLM Leaderboards Are Misleading: Small Portfolios for Heterogeneous Supervised ML

EMO: Pretraining Mixture of Experts for Emergent Modularity

Cited but Not Verified: Parsing and Evaluating Source Attribution in LLM Deep Research Agents

You can do CUDA inference on an Apple Silicon Mac with PCI Passthrough

Superintelligent Retrieval Agent: The Next Frontier of Information Retrieval

Impressions of China's AI ecosystem after visiting many leading AI labs there, and the similarities and differences in working on LLMs in China and the West

Mapping every meter of road damage from a single dashcam: proof of concept

Compiled every national AI strategy in Asia — Vietnam has the most comprehensive standalone law, Japan has no penalties, Korea just eliminated Naver from sovereign LLM competition for using Qwen weigh

A recent experience with ChatGPT 5.5 Pro

Claude Code, Codex and Agentic Coding #8

I built a benchmark for AI “memory” in coding agents. looking for others to beat it.

Claude Code Sandboxing

Is agentic AI governance even a computationally bounded process?

Verifier-Backed Hard Problem Generation for Mathematical Reasoning

Akamai says it struck a seven-year cloud computing deal with a “leading frontier model provider”; sources: the deal was with Anthropic and is worth $1.8B

Notes from testing GPT-Realtime-2 with a context-heavy voice app

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

VLAs are dead, long live World Action Models

📡 AI NEWS BUT ACTUALLY GOOD