🚀 WELCOME TO METAMESH.BIZ +++ Liquid AI drops 8B-parameter MoE trained on 38 TRILLION tokens because apparently parameter count is passé now it's all about that data diet +++ ByteDance building knockoff Groq chips with InnoStar while LLMs literally can't stop believing lies even when you tell them they're lies +++ Xcena raises $135M to stuff KV cache management into memory modules (your RAM is now sentient, congrats) +++ THE FUTURE OF INTELLIGENCE IS DISAGREEING WITH ITSELF AT 3000 TOKENS PER SECOND ON COMMODITY HARDWARE +++ 🚀
AI Signal - PREMIUM TECH INTELLIGENCE
📚 HISTORICAL ARCHIVE - May 29, 2026
What was happening in AI on 2026-05-29

Stories from May 29, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
📰 NEWS

Various LLM Smells

💬 HackerNews Buzz: 241 comments 🐝 BUZZING
📰 NEWS

Anthropic says it expects Mythos-class models to be available to all customers “in the coming weeks” following the development of stronger safeguards

📰 NEWS

Real-time LLM Inference on Standard GPUs: 3k tokens/s per request

💬 HackerNews Buzz: 88 comments LOWKEY SLAPS
📰 NEWS

Liquid AI reveals 8B-A1B MoE trained on 38T tokens

💬 HackerNews Buzz: 25 comments 🐝 BUZZING
📰 NEWS

Notes from the Mistral AI Now Summit in Paris

💬 HackerNews Buzz: 69 comments 🐐 GOATED ENERGY
📰 NEWS

The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

💬 HackerNews Buzz: 49 comments 😐 MID OR MIXED
📰 NEWS

Microsoft data suggests using AI is more expensive than hiring people

💬 HackerNews Buzz: 8 comments 😐 MID OR MIXED
📰 NEWS

Claude Code Dynamic Workflows

+++ Anthropic's new parallel subagent workflows let Claude juggle hundreds of tasks simultaneously, which sounds great until you realize coordinating that many moving parts is its own special kind of chaos. +++

Anthropic adds dynamic workflows to Claude Code, enabling hundreds of subagents to run in parallel for complex engineering tasks such as framework migrations

📰 NEWS

CVE-Bench: testing LLM agents on real-world vulnerability patches

💰 FUNDING

Xcena, whose MX1 chip performs data orchestration and KV cache management directly within memory modules, raised a $135M Series B at a $570M valuation

📰 NEWS

Claude Code Configuration Guide

+++ A real case study of AI-assisted research reveals Claude can solve physics problems autonomously, but still needs humans for the parts that actually matter: knowing what to build. +++

Claude Code – Everything You Can Configure That the Docs Don't Tell You

💬 HackerNews Buzz: 17 comments LOWKEY SLAPS
🛠️ SHOW HN

Show HN: Tiny-vLLM – high performance LLM inference engine in C++ and CUDA

📰 NEWS

Sources: ByteDance has partnered with chipmaker InnoStar to develop an AI inference chip modeled after Groq's LPUs, which are built to run AI models at low cost

📰 NEWS

LLMs believe false statements even after explicit warnings that they're false

💰 FUNDING

Anthropic raises $65B in Series H funding at $965B post-money valuation

💬 HackerNews Buzz: 360 comments 🐝 BUZZING
🔬 RESEARCH

Calibrating Conservatism for Scalable Oversight

"Agentic AI systems capable of autonomous planning and extended environmental interaction pose a fundamental control problem: how can humans maintain meaningful oversight of systems that may exceed their own capabilities? Existing approaches to scalable oversight rely on complex assumptions, remain l..."
🔬 RESEARCH

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

"The pretraining data mixture of Large Language Models (LLMs) constitutes their "digital DNA", shaping model behaviors, capabilities, and failure modes. Yet this composition is rarely disclosed, making post-hoc auditing of data combination or provenance difficult. In this work, we formalize $\textbf{..."
🔬 RESEARCH

Gram: Assessing sabotage propensities via automated alignment auditing

"We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios that incentivize sabotage. We find Gemini models misbehave in about 2-3% of our simulated trajectories...."
🔬 RESEARCH

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

"Embodied intelligence is often studied through specialized models for individual tasks such as manipulation or navigation, resulting in fragmented capabilities and limited generalization across tasks, environments, and robot embodiments. In this work, we study whether heterogeneous embodied decision..."
📰 NEWS

Is AI causing a repeat of frontend’s lost decade?

💬 HackerNews Buzz: 205 comments 🐝 BUZZING
🔬 RESEARCH

Extrapolative Weight Averaging Reveals Correctness-Efficiency Frontiers in Code RL

"Linear interpolation between fine-tuned checkpoints has been shown to trace the Pareto front between competing objectives, but whether extrapolative weight averaging can extend such frontiers to new checkpoints useful at inference time, without additional RL training, remains unclear. We study this..."
📰 NEWS

CAPTCHAs can still detect AI agents

💬 HackerNews Buzz: 42 comments 😀 NEGATIVE ENERGY
🔬 RESEARCH

MedCase-Structured: A Text-to-FHIR Dataset for Benchmarking Diagnostic Reasoning in Clinically Realistic EHR Settings

"Large language models (LLMs) show promise for clinical reasoning and decision support, but evaluation in realistic, electronic health record-congruent settings remains limited. Existing benchmarks often rely on static datasets or unstructured inputs that do not reflect the structured, interoperable..."
💰 FUNDING

Pittsburgh-based Gray Swan, which stress-tests AI models for top frontier AI labs, raised a $40M Series A at a $200M valuation co-led by Wing VC and Madrona

πŸ› οΈ SHOW HN

Show HN: ClawChat – End-to-end encrypted coordination for multi-agent AI

πŸ”¬ RESEARCH

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

"Autonomous AI research agents aim to accelerate scientific discovery by automating the research pipeline, from hypothesis generation to peer review. However, existing benchmarks rarely test a fundamental bottleneck: whether Large Language Models can judge the methodological viability of a research i..."
📰 NEWS

Unhealthy code makes AI agents consume 35-50% more tokens

📰 NEWS

AI Agent Permissions: The Missing Layer Between "Works" and "Safe"

📰 NEWS

Knowa – Open-Source LLM Context Optimizer

🔬 RESEARCH

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

"Multi-component LLM agents assemble probabilistic claims from components that each see only part of a joint problem; the composition can violate basic probability axioms even when every component is locally coherent. We formalise this locally coherent, globally incoherent failure via the composition..."
📰 NEWS

Anthropic launches Opus 4.8, saying it's “more likely to flag uncertainties about its work and less likely to make unsupported claims”, at the same price as 4.7

📰 NEWS

AI researchers ran 15-day simulations of worlds governed by different AI models: Claude Sonnet 4.6 recorded no crimes, while Gemini 3 Flash had the most at 683

📰 NEWS

After hitting their annual AI budget in months or seeing their AI bills double or triple due to “tokenmaxxing”, some companies are rationing or tracking AI use

🔬 RESEARCH

Can LLMs Use Linguistic Uncertainty Markers to Reliably Reflect Intrinsic Confidence?

"LLMs' linguistically expressed confidence should faithfully reflect their intrinsic uncertainty. While recent work shows LLMs struggle to use epistemic markers (e.g., "it is likely...") in a human-aligned fashion, it remains unclear whether models can apply their own linguistic confidence framework..."
📰 NEWS

Coding agent can read your .env file

📰 NEWS

OpenAI: Computer use now works on Windows

📰 NEWS

UK researchers gain access to Google's Willow quantum chip, which Google says solves in five minutes a problem that would take supercomputers 10 septillion years

🔬 RESEARCH

Reasoning with Sampling: Cutting at Decision Points

"Frontier reasoning models are produced by posttraining base language models with reinforcement learning. Recent work has challenged this by showing that sampling from a sharpened version of the base model's distribution, a so-called power distribution, elicits comparable reasoning without additional..."
🔬 RESEARCH

CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning

"Language models can use verifiable rewards to improve at a wide variety of reasoning tasks. However, both parametric (e.g. RLVR) and non-parametric (e.g. prompt optimization) approaches to doing so typically require hundreds of training samples and thousands of model rollouts, making them expensive..."
📰 NEWS

AI startup Shift launches a free home cleaning service in NYC to record first-person video with a camera-equipped cap and use it to train robots

📰 NEWS

OpenAI says it has briefed the White House on its new biodefense program, which uses GPT-Rosalind to help develop biodefense and pandemic preparedness tools

🔬 RESEARCH

Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents

"Computer-use agents (CUAs) have recently made substantial progress, but deploying a separate large expert for each software domain remains expensive. Small open computer-use agents are more practical specialization targets, but they remain substantially weaker and exhibit uneven domain-specific fail..."
🔬 RESEARCH

Continuous Diffusion Models Can Obey Formal Syntax

🔬 RESEARCH

In-Context Reward Adaptation for Robust Preference Modeling

"Reinforcement Learning from Human Feedback (RLHF) typically relies on static reward models to align Large Language Models with human preferences. However, human values are inherently diverse and heterogeneous, and a single reward model often lacks the robustness required to generalize to unseen pref..."
📰 NEWS

Robinhood now lets your AI agents trade stocks

💬 HackerNews Buzz: 141 comments 😐 MID OR MIXED
📰 NEWS

Undisclosed addition in jqwik instructed AI coding agents to delete app output

💬 HackerNews Buzz: 39 comments 😐 MID OR MIXED