🚀 WELCOME TO METAMESH.BIZ +++ Karpathy joins Anthropic to accelerate pre-training because apparently the talent war is just musical chairs with RSUs +++ ByteDance drops Lance doing image/video/generation/editing at 3B params while Google flexes Gemini 3.5 Flash for "long-horizon agentic tasks" (we're really just making up words now) +++ DeepMind acqui-hires 20+ Contextual AI researchers for $100M because why build when you can buy the whole team +++ THE MESH WATCHES EVERYONE WATERMARK THEIR OUTPUTS WHILE THE MODELS LEARN TO FAKE AUTHENTICITY +++ •
AI Signal - PREMIUM TECH INTELLIGENCE
📟 Optimized for Netscape Navigator 4.0+
Last updated: 2026-05-20 | Server uptime: 99.9% ⚡

Today's Stories

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
📰 NEWS

ByteDance released an open-source model that attempts to do just about anything with only 3B parameters

"EDIT: working link https://huggingface.co/bytedance-research/Lance Lance is a lightweight native unified multimodal model that supports **image and video understanding, generation, and editing** within a single framework. * **Efficient at 3B scale..."
💬 Reddit Discussion: 59 comments 👍 LOWKEY SLAPS
📰 NEWS

Andrej Karpathy joins Anthropic

+++ Andrej Karpathy, whose neural network lectures basically bootstrapped a generation of ML engineers, is now leading Anthropic's pre-training research team. The talent war just got interesting. +++

Andrej Karpathy joins Anthropic to help launch a team focused on using Claude to accelerate pre-training research; he helped found OpenAI and worked at Tesla

📰 NEWS

Gemini 3.5 Flash launch

+++ Google shipped a faster Gemini model explicitly optimized for agents and coding, because apparently the path to AI usefulness runs through letting models make decisions autonomously rather than just predict tokens persuasively. +++

Gemini 3.5 Flash

💬 HackerNews Buzz: 292 comments 👍 LOWKEY SLAPS
📰 NEWS

Sources: Google DeepMind has reached a ~$100M deal to hire 20+ researchers from Contextual AI, including CEO Douwe Kiela, and license its technology

📰 NEWS

Anthropic: announced vs. current compute capacity (sources below)

"**source list:** 1. **Google Cloud TPU deal - up to 1M TPUs, “well over 1 GW” expected online in 2026** https://www.anthropic.com/news/expanding-our-use-of-google-cloud-tpus-and-services [https://www.googlecloudpr..."
📰 NEWS

OpenAI adopts SynthID watermarking

+++ OpenAI adopts SynthID to watermark generated images and launches a verification portal, finally acknowledging that "trust us bro" wasn't a viable authenticity strategy for the synthetic media era. +++

OpenAI Adopts Google's SynthID Watermark for AI Images with Verification Tool

💬 HackerNews Buzz: 28 comments 👍 LOWKEY SLAPS
🔬 RESEARCH

Overeager Coding Agents: Measuring Out-of-Scope Actions on Benign Tasks

"Coding agents now run autonomously with shell, file, and network privileges. When a user issues a benign request, the agent sometimes does more than asked: it deletes unrelated files, wipes a stale credentials backup, or rewrites configuration the user never mentioned. We call these scope expansions..."
🔬 RESEARCH

Fully Open Meditron: An Auditable Pipeline for Clinical LLMs

"Clinical decision support systems (CDSS) require scrutable, auditable pipelines that enable rigorous, reproducible validation. Yet current LLM-based CDSS remain largely opaque. Most "open" models are open-weight only, releasing parameters while withholding the data provenance, curation procedures, a..."
🔬 RESEARCH

Formal Methods Meet LLMs: Auditing, Monitoring, and Intervention for Compliance of Advanced AI Systems

"We examine one particular dimension of AI governance: how to monitor and audit AI-enabled products and services throughout the AI development lifecycle, from pre-deployment testing to post-deployment auditing. Combining principles from formal methods with SoTA machine learning, we propose techniques..."
📰 NEWS

Google launches the Gemini Omni multimodal model, saying it can “create anything from any input”, starting with video generation, for Google AI subscribers

📰 NEWS

Here are my KV cache quantization benchmarks: TurboQuant is overrated but saved by TCQ, q5 deserves more attention, and symmetric q8 might be a waste of VRAM

"Greetings from former TurboQuant's biggest defender, now middle-sized niche-aware TurboQuant defender. Today I'm presenting to you the results of me thoroughly exploring the world of PPL and KLD benchmarks with my single RTX 3090 using BeeLlama v0.1.2, with..."
💬 Reddit Discussion: 51 comments 👍 LOWKEY SLAPS
📰 NEWS

Anthropic just bought the company that generates most production MCP servers

"Anthropic acquired Stainless on Monday for a reported $300M+. Most coverage is framing this as a developer tools acquisition. Stainless is best known for generating the official Python and Node SDKs that ship with OpenAI, Google, Meta, Cloudflare, and Anthropic. The SDK story is real. The MCP side ..."
💬 Reddit Discussion: 70 comments 👍 LOWKEY SLAPS
📰 NEWS

Sundar Pichai says Google is now processing 3.2 quadrillion tokens per month, up from 480T tokens per month a year ago and 9.7T tokens per month two years ago

📰 NEWS

GPT-2 mechanistic interpretability visualization

+++ Mechanistic interpretability enthusiast creates AXON, a real-time 3D visualization tool that decomposes GPT-2's token generation into human-readable concept activations via sparse autoencoders. Finally, a window into the black box that's actually useful. +++

I built a tool that shows you what GPT-2 is "thinking" in real-time as it generates 3D graph of concept activations per token [R]

"Been going down a mechanistic interpretability rabbit hole for the past few weeks and ended up building this thing called AXON. The idea: every time GPT-2 generates a token, its residual stream gets passed through a Sparse Autoencoder (Joseph Bloom's pretrained SAE). The SAE decomposes it into huma..."
📰 NEWS

Mistral AI Acquires Emmi AI to Create the Leading AI Stack

💬 HackerNews Buzz: 16 comments 🐝 BUZZING
🔬 RESEARCH

Language-Switching Triggers Take a Latent Detour Through Language Models

"Backdoor attacks on language models pose a growing security concern, yet the internal mechanisms by which a trigger sequence hijacks model computations remain poorly understood. We identify a circuit underlying a language-switching backdoor in an 8B-parameter autoregressive language model, where a t..."
🔬 RESEARCH

Post-Trained MoE Can Skip Half Experts via Self-Distillation

"Mixture-of-Experts (MoE) scales language models efficiently through sparse expert activation, and its dynamic variant further reduces computation by adjusting the activated experts in an input-dependent manner. Existing dynamic MoE methods usually rely on pre-training from scratch or task-specific a..."
📰 NEWS

Released a free 9.8M doc Indic multilingual corpus: Hindi, Bengali, Tamil, Telugu + 7 more (CC0, HuggingFace) [P]

"Built this over the past few weeks as part of a multilingual research project. Figured I'd share it here. Check it out! ~9.8M web documents across 11 languages: hi, bn, ta, te, mr, gu, kn, ml, pa, ur, en. ~8.4B tokens. CC0 license. 🤗 [https://huggingface.co/datasets/AM0908/indic-hplt-v1](https:..."
🔬 RESEARCH

Forecasting Downstream Performance of LLMs With Proxy Metrics

"Progress in language model development is often driven by comparative decisions: which architecture to adopt, which pretraining corpus to use, or which training recipe to apply. Making these decisions well requires reliable performance forecasts, yet the two commonly used signals are fundamentally l..."
🔬 RESEARCH

Code as Agent Harness

"Recent large language models (LLMs) have demonstrated strong capabilities in understanding and generating code, from competitive programming to repository-level software engineering. In emerging agentic systems, code is no longer only a target output. It increasingly serves as an operational substra..."
🔬 RESEARCH

What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models

"Medicine is inherently pluralistic. Principles such as autonomy, beneficence, nonmaleficence, and justice routinely conflict, and such ethical dilemmas often sharply divide reasonable physicians. Good clinical practice navigates these tensions in concert with each patient's values rather than imposi..."
📰 NEWS

I gave Claude access to my M365 account using Power Automate + a small MCP server

"I’ve been messing with MCP servers lately and finally got one working that feels genuinely useful instead of “cool demo, never use again.” The problem: I wanted Claude to be able to do basic Microsoft 365 stuff for me: - read my inbox - send a draft/follow-up - check my calendar - save notes into ..."
💬 Reddit Discussion: 27 comments 🐝 BUZZING
🔬 RESEARCH

AI-Mediated Communication Can Steer Collective Opinion

"Generative artificial intelligence (AI) is increasingly integrated into the online platforms where humans exchange opinions; large language models (LLMs) now polish users' posts on LinkedIn and provide context for content shared on X. While prior work has shown that AI can express biased opinions an..."
📰 NEWS

Google overhauls its search box, letting users ask longer queries, upload photos and videos, and use Gemini 3.5 Flash-powered agents to automate searches

🔬 RESEARCH

Predictable Confabulations: Factual Recall by LLMs Scales with Model Size and Topic Frequency

"While scaling laws govern aggregate large language model performance, no scaling law has linked factual recall to both model size and training-data composition. We evaluated 38 models on over 8,900 scholarly references evaluated by an automated reference verification system. Recall quality follows a..."
🔬 RESEARCH

Argus: Evidence Assembly for Scalable Deep Research Agents

"Deep research agents have achieved remarkable progress on complex information seeking tasks. Even long ReAct style rollouts explore only a single trajectory, while recent state of the art systems scale inference time compute via parallel search and aggregation. Yet deep research answers are composed..."
📰 NEWS

Google adds a conversational search feature to YouTube and rolls out the new Gemini Omni model in YouTube Shorts Remix and the Create app

🔬 RESEARCH

Context, Reasoning, and Hierarchy: A Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP

"Deploying compound LLM agents in adversarial, partially observable sequential environments requires navigating several design dimensions: (1) what the agent sees, (2) how it reasons, and (3) how tasks are decomposed across components. Yet practitioners lack guidance on which design choices improve p..."
🔬 RESEARCH

Prospective multi-pathogen disease forecasting using autonomous LLM-guided tree search

"Probabilistic forecasting of infectious diseases is crucial for public health but relies on labor-intensive manual model curation by expert modeling teams. This bespoke development bottlenecks scalability to granular geographic resolutions or emerging pathogens. Here, we present an autonomous system..."
📰 NEWS

OpenAI introduces Guaranteed Capacity, a new offering that lets customers guarantee access to OpenAI's compute through one- to three-year commitments

📰 NEWS

Google announces Gemini Spark, a “24/7 personal AI agent” that is powered by Gemini 3.5 and supports integrations with Google Workspace apps, including Gmail

🔬 RESEARCH

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

"Equipping LLMs with tool-use capabilities via Agentic Reinforcement Learning (Agentic RL) is bottlenecked by two challenges: the lack of scalable, robust execution environments and the scarcity of realistic training data that captures implicit human reasoning. Existing approaches depend on costly re..."
📰 NEWS

Cloudflare tests Mythos against 50+ repositories, highlights its ability to chain bugs into a single exploit, and details a vulnerability discovery harness

🔬 RESEARCH

Look Before You Leap: Autonomous Exploration for LLM Agents

"Large language model based agents often fail in unfamiliar environments due to premature exploitation: a tendency to act on prior knowledge before acquiring sufficient environment-specific information. We identify autonomous exploration as a critical yet underexplored capability for building adaptiv..."
🔬 RESEARCH

FORGE: Self-Evolving Agent Memory With No Weight Updates via Population Broadcast

"Can LLM agents improve decision-making through self-generated memory without gradient updates? We propose FORGE (Failure-Optimized Reflective Graduation and Evolution), a staged, population-based protocol that evolves prompt-injected natural-language memory for hierarchical ReAct agents. FORGE wraps..."
📰 NEWS

Sundar Pichai announced at Google I/O that Gemini 3.5 Pro will launch next month; attendees groaned at the model coming out later than they expected

📰 NEWS

Combined P2PNet + Apple's Depth Pro to reconstruct crowds in 3D and predict people hidden behind obstructions, from a single image

"Estimating crowd size by eye is notoriously hard. I've found a CNN called P2PNet to detect heads of people and created a custom pipeline to detect occluded people and reconstruct an approximate 3d scene. **Pipeline overview** 1. **P2PNet** detection gives 2D head points 2. **Depth Pro** ..."
📰 NEWS

People who use ai image gen like this make us all look bad.

"External link discussion - see full content at original source."
💬 Reddit Discussion: 165 comments 😐 MID OR MIXED
🔬 RESEARCH

General Preference Reinforcement Learning

"Post-training has split large language model (LLM) alignment into two largely disconnected tracks. Online reinforcement learning (RL) with verifiable rewards drives emergent reasoning on math and code but depends on a programmatic verifier that cannot reach open-ended tasks, while preference optimiz..."
🔬 RESEARCH

Democratizing Large-Scale Re-Optimization with LLM-Guided Model Patches

"Optimization models developed by operations research (OR) experts are often deployed as decision-support systems in industrial settings. However, real-world environments are dynamic, with evolving business rules, previously overlooked constraints, and unforeseen perturbations. In such contexts, end..."
🛠️ SHOW HN

Show HN: Nano-RAG – Agentic multi-hop retrieval without a graph database

📰 NEWS

Anthropic shuts the EU out of its most advanced cyber AI model

📰 NEWS

eXo MCP server: expose workplace tools to AI agents with OAuth

📰 NEWS

Memory just turned a goldfish into a research beast.

"I've been building Nyx, a persistent memory layer for local AI, and today I got the first real benchmark numbers worth sharing. The test: same long civic investigation task twice. Building a full politician profile, then asking follow-up questions that required remembering details established earl..."
📰 NEWS

The State of Statefulness in AI Agents

📰 NEWS

Web Researcher MCP: Give AI assistants web search and research capabilities (Go)

🔬 RESEARCH

Vision-OPD: Learning to See Fine Details for Multimodal LLMs via On-Policy Self-Distillation

"Multimodal Large Language Models (MLLMs) still struggle with fine-grained visual understanding, where answers often depend on small but decisive evidence in the full image. We observe a regional-to-global perception gap: the same MLLM answers fine-grained questions more accurately when conditioned o..."
🔬 RESEARCH

DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention

"Current hierarchical attention methods, such as NSA and InfLLMv2, select the top-k relevant key-value (KV) blocks based on coarse attention scores and subsequently apply fine-grained softmax attention on the selected tokens. However, the top-k operation assumes the number of relevant tokens for any..."
🔬 RESEARCH

Reversa: A Reverse Documentation Engineering Framework for Converting Legacy Software into Operational Specifications for AI Agents

"Legacy systems concentrate business rules, architectural decisions, and operational exceptions that often remain implicit in code, data, configuration, and maintenance practices. At the same time, language-model-based coding agents depend on reliable context, correctness criteria, and behavioral c..."
🔬 RESEARCH

PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications

"Compound AI applications, which compose calls to ML models using a general-purpose programming language like Python, are widely used for a variety of user-facing tasks, from software engineering to enterprise automation, making their end-to-end latency a critical bottleneck. In contrast to tradition..."
📰 NEWS

You Can’t Regulate Programming: How the EU AI Act May Kill Software

"External link discussion - see full content at original source."
💬 Reddit Discussion: 5 comments 😐 MID OR MIXED
📰 NEWS

State of AI 2026 survey results

🔬 RESEARCH

Runtime-Orchestrated Second-Order Optimization for Scalable LLM Training

"Second-order methods offer an attractive path toward more sample-efficient LLM training, but their practical use is often blocked by the systems cost of maintaining and updating large matrix-based optimizer states. We introduce Asteria, a runtime system designed to remove this bottleneck by..."