🌐 WELCOME TO METAMESH.BIZ +++ OpenAI promises autonomous research interns by September while actual interns still debugging their coffee orders +++ White House drops AI framework demanding Congress override states (federalism meets foundation models) +++ Medical AI performs 66% worse on real data but benchmarks keep vibing like everything's fine +++ Anthropic tells Pentagon "no thanks" while OpenAI slides into those defense contracts +++ THE FUTURE IS AUTOMATED RESEARCHERS DISCOVERING WE'VE BEEN TRAINING ON GARBAGE ALL ALONG +++ 🌐
+++ Internal AI system at Meta leaked sensitive employee data without authorization, offering a timely reminder that the real security vulnerability in AI deployments remains instruction-following without guardrails. +++
🎯 Home security workflows • Model selection for specific tasks • Compliance and legal requirements
💬 "You get better results by picking specific models for specific tasks"
• "the compliance/legal hurdles are still real, slow, and human"
🏛 POLICY
White House AI legislative framework
3x SOURCES 📅 2026-03-19
⚡ Score: 8.1
+++ The Biden administration and a Trump-backed senator have both pushed federal AI legislation to preempt state rules, suggesting the fragmentation problem is now urgent enough to unite Congress across party lines and ideological fault lines. +++
🎯 Intellectual Property Rights • Protecting Children • Free Speech
💬 "The Administration is proposing an approach that achieves both of these objectives"
• "The Administration is calling on Congress to give parents tools to effectively do that"
+++ A practitioner-friendly digest tackles arXiv's growing pile of compound AI vulnerabilities, because apparently researchers assumed "cross-stack rowhammer attacks" was conversational enough for security teams. +++
"I have been building a bi-weekly digest that takes AI security papers from arXiv and translates them into practitioner-oriented intelligence. Each paper gets rated on four dimensions: Threat Realism, Defensive Urgency, Novelty, and Research Maturity (1-5 scale), then classified as Act Now / Watc..."
"There is a lot of AI security research being published on arXiv that has real-world implications, but most of it is written for other researchers. We started a bi-weekly digest that translates these papers into something practitioners and anyone interested in AI safety can actually use.
..."
🤖 AI MODELS
OpenAI autonomous AI researcher plans
2x SOURCES 📅 2026-03-20
⚡ Score: 7.4
+++ OpenAI is betting the farm on fully autonomous AI researchers by 2028, because apparently the real bottleneck in science was always the lack of tireless agents willing to work for compute cycles. +++
"OpenAI is refocusing its research efforts and throwing its resources into a new grand challenge. The San Francisco firm has set its sights on building what it calls an AI researcher, a fully automated agent-based system that will be able to go off and tackle large, complex problems by itself. OpenAI..."
💬 Reddit Discussion: 11 comments
📊 MID OR MIXED
🎯 Business focus • Autonomous AI capabilities • Concerns about AI dependence
💬 "Didn't they just say they want to focus on business and coding?"
• "The hard parts are: reliable long-horizon execution, knowing when results are invalid"
via Arxiv 👤 Zhuolin Yang, Zihan Liu, Yang Chen et al. 📅 2026-03-19
⚡ Score: 7.3
"We introduce Nemotron-Cascade 2, an open 30B MoE model with 3B activated parameters that delivers best-in-class reasoning and strong agentic capabilities. Despite its compact size, its mathematical and coding reasoning performance approaches that of frontier open models. It is the second open-weight..."
via Arxiv 👤 Edward Lin, Sahil Modi, Siva Kumar Sastry Hari et al. 📅 2026-03-19
⚡ Score: 7.3
"As agentic AI systems become increasingly capable of generating and optimizing GPU kernels, progress is constrained by benchmarks that reward speedup over software baselines rather than proximity to hardware-efficient execution. We present SOL-ExecBench, a benchmark of 235 CUDA kernel optimization p..."
"A recent work on fairness in medical segmentation for breast cancer tumors found that segmentation models work way worse for younger patients.
Common explanation: higher breast density = harder cases. But this is not it. The bias is qualitative -- younger patients have tumors that are larger, more ..."
via Arxiv 👤 Jianrui Zhang, Yue Yang, Rohun Tripathi et al. 📅 2026-03-18
⚡ Score: 7.0
"Token pruning is essential for enhancing the computational efficiency of vision-language models (VLMs), particularly for video-based tasks where temporal redundancy is prevalent. Prior approaches typically prune tokens either (1) within the vision transformer (ViT) exclusively for unimodal perceptio..."
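The pruning idea in the abstract reduces to a simple primitive: score each visual token, keep the top-k, drop the rest. A minimal sketch — the tokens and scores below are toy stand-ins, not the paper's actual signals:

```python
# Illustrative sketch of score-based token pruning for a VLM: rank visual
# tokens by an importance score, keep the top-k, and preserve temporal order.
# Scores would come from attention maps or cross-modal similarity in practice.

def prune_tokens(tokens, scores, keep):
    """Return the `keep` highest-scoring tokens in their original order."""
    ranked = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:keep])  # re-sort indices to preserve sequence order
    return [tokens[i] for i in kept]

tokens = ["t0", "t1", "t2", "t3", "t4", "t5"]
scores = [0.1, 0.9, 0.2, 0.8, 0.05, 0.7]
print(prune_tokens(tokens, scores, keep=3))  # ['t1', 't3', 't5']
```

The interesting design question the abstract raises is not the primitive itself but where in the stack it runs — inside the ViT (unimodal) versus inside the language model (cross-modal).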
"AI coding agents can resolve real-world software issues, yet they frequently introduce regressions, breaking tests that previously passed. Current benchmarks focus almost exclusively on resolution rate, leaving regression behavior under-studied. This paper presents TDAD (Test-Driven Agentic Developm..."
via Arxiv 👤 Arpit Singh Gautam, Saurabh Jha 📅 2026-03-18
⚡ Score: 7.0
"Post training quantization is essential for deploying large language models (LLMs) on resource constrained hardware, yet state of the art methods enforce uniform bit widths across layers, yielding suboptimal accuracy efficiency trade offs. We present RAMP (Reinforcement Adaptive Mixed Precision), an..."
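The abstract's core claim — that one uniform bit-width across layers is suboptimal — can be illustrated with a toy mixed-precision assignment. This sketch swaps RAMP's reinforcement-learning policy for a naive greedy rule; the layer names, weights, and error proxy are all illustrative:

```python
# Hypothetical sketch of per-layer mixed-precision post-training quantization.
# Bit-widths are assigned per layer by a greedy error proxy, standing in for
# the RL policy RAMP actually uses to search the assignment space.

def quantize(weights, bits):
    """Uniform symmetric quantization of a list of floats to `bits` bits."""
    levels = 2 ** (bits - 1) - 1
    scale = (max(abs(w) for w in weights) / levels) or 1.0
    return [round(w / scale) * scale for w in weights]

def quant_error(weights, bits):
    """Sum of squared quantization errors for one layer."""
    return sum((w - x) ** 2 for w, x in zip(weights, quantize(weights, bits)))

def assign_bits(layers, budget_bits):
    """Greedy stand-in: start every layer at 8 bits, then repeatedly shave a
    bit off the layer whose error grows least, until the average bit-width
    meets the budget."""
    bits = {name: 8 for name in layers}
    while sum(bits.values()) / len(bits) > budget_bits:
        best = min(
            (quant_error(w, bits[n] - 1) - quant_error(w, bits[n]), n)
            for n, w in layers.items() if bits[n] > 2
        )
        bits[best[1]] -= 1
    return bits

layers = {
    "attn": [0.5, -0.3, 0.8, 0.01],        # large-magnitude, sensitive layer
    "mlp": [0.02, 0.01, -0.015, 0.005],    # small-magnitude, tolerant layer
}
print(assign_bits(layers, budget_bits=6))
```

Even this crude rule tends to protect the sensitive layer at high precision while pushing the tolerant one lower — the trade-off RAMP optimizes with a learned policy instead of a heuristic.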
"This is cool paper! Creating loras from docs on the fly using a hypernetwork.
"Long input sequences are central to in-context learning, document understanding, and multi-step reasoning of Large Language Models (LLMs). However, the quadratic attention cost of Transformers makes inference memory-i..."
via Arxiv 👤 Zhongzhu Zhou, Fengxiang Bie, Ziyan Chen et al. 📅 2026-03-18
⚡ Score: 7.0
"Converting pretrained attention modules such as grouped-query attention (GQA) into multi-head latent attention (MLA) can improve expressivity without increasing KV-cache cost, making it attractive for efficient inference. However, many practical conversion baselines rely on weight-only low-rank appr..."
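The "weight-only low-rank approximation" baseline the abstract mentions is essentially a truncated SVD of a projection matrix — factor it into a thin down-projection and up-projection, which is the basic move behind GQA-to-MLA conversion. A minimal sketch with illustrative shapes and rank:

```python
# Sketch of the weight-only low-rank baseline: factor a projection W into
# two thin matrices via truncated SVD. The toy W is built to be exactly
# rank 2 so the factorization is lossless here; real attention weights are
# only approximately low-rank, which is where such baselines fall short.

import numpy as np

def low_rank_factor(W, rank):
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # d_out x r  ("up" projection)
    B = Vt[:rank]                # r x d_in   (latent "down" projection)
    return A, B

rng = np.random.default_rng(1)
W = rng.standard_normal((8, 2)) @ rng.standard_normal((2, 8))  # rank-2 by construction
A, B = low_rank_factor(W, rank=2)
err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
```

The latent dimension `r` plays the role of the compressed KV-cache width in MLA; the paper's point is that matching weights alone, without accounting for activations, leaves accuracy on the table.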
"You can now capture per-layer activation vectors from llama-server during inference, train sparse autoencoders on them, discover which internal features correspond to specific behaviors (sycophancy, hedging, creativity, etc.), and extract those features as GGUF control vectors for real-time steering..."
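For readers unfamiliar with control vectors: the sketch below uses the classic contrastive shortcut — a mean activation difference between examples with and without a behavior — rather than a trained sparse autoencoder. It is a much-simplified stand-in for the llama-server pipeline described above (a single discovered SAE feature direction reduces to this kind of vector), and all data is toy:

```python
# Simplified stand-in for activation steering: derive a "control vector"
# as the mean activation difference between behavior-present and
# behavior-absent examples, then add it to hidden states at inference.

def mean(vectors):
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def control_vector(pos_acts, neg_acts):
    """Direction pointing from 'behavior absent' toward 'behavior present'."""
    p, q = mean(pos_acts), mean(neg_acts)
    return [a - b for a, b in zip(p, q)]

def steer(hidden, vec, strength=1.0):
    """Add the control vector to a hidden state during inference."""
    return [h + strength * v for h, v in zip(hidden, vec)]

pos = [[1.0, 0.2], [0.9, 0.1]]   # toy activations when the behavior is present
neg = [[0.1, 0.2], [0.0, 0.3]]   # toy activations when it is absent
vec = control_vector(pos, neg)   # roughly [0.9, -0.1]
```

The SAE route in the post improves on this by disentangling many overlapping features first, so the extracted direction steers one behavior instead of a mixture.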
via Arxiv 👤 Maksym Del, Markus Kängsepp, Marharyta Domnich et al. 📅 2026-03-19
⚡ Score: 6.8
"Uncertainty estimation is critical for deploying reasoning language models, yet remains poorly understood under extended chain-of-thought reasoning. We study parallel sampling as a fully black-box approach using verbalized confidence and self-consistency. Across three reasoning models and 17 tasks s..."
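The black-box estimate the abstract studies via self-consistency is simple to state in code: draw N parallel samples, take the majority answer, and use its vote share as the confidence score. The sampled answers below are toy stand-ins for actual model calls:

```python
# Sketch of parallel-sampling self-consistency: confidence is the fraction
# of independent reasoning chains that agree on the majority answer.

from collections import Counter

def self_consistency(answers):
    """Return (majority answer, vote share) across parallel samples."""
    counts = Counter(answers)
    answer, votes = counts.most_common(1)[0]
    return answer, votes / len(answers)

# Toy run: 8 parallel samples from a hypothetical reasoning model.
samples = ["42", "42", "41", "42", "42", "17", "42", "42"]
answer, confidence = self_consistency(samples)
print(answer, confidence)  # majority "42", confidence 0.75
```

The appeal is that this needs only the model's sampled outputs — no logits, no internals — which is exactly why the paper treats it as a fully black-box baseline for long chain-of-thought models.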
via Arxiv 👤 Borja Aizpurua, Sukhbinder Singh, Román Orús 📅 2026-03-18
⚡ Score: 6.8
"Large language models (LLMs) contain billions of parameters, yet many exact values are not essential. We show that what matters most is the relative rank of weights-whether one connection is stronger or weaker than another-rather than precise magnitudes. To reduce the number of unique weight values,..."
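The rank-over-magnitude idea can be sketched as rank-bucketed weight sharing: each weight is replaced by a value determined only by its rank order, collapsing many distinct magnitudes into a few shared values while preserving which connections are stronger than which. The bucket count and weights below are illustrative, not the paper's scheme:

```python
# Sketch of rank-based weight sharing: sort weights, split the rank order
# into equal buckets, and give every weight in a bucket the same shared
# value (the bucket mean). Assumes n_buckets <= len(weights).

def rank_quantize(weights, n_buckets):
    """Map each weight to the mean of its rank bucket."""
    order = sorted(range(len(weights)), key=lambda i: weights[i])
    size = len(weights) / n_buckets
    out = [0.0] * len(weights)
    for b in range(n_buckets):
        idx = order[int(b * size): int((b + 1) * size)]
        rep = sum(weights[i] for i in idx) / len(idx)  # shared bucket value
        for i in idx:
            out[i] = rep
    return out

w = [0.9, -0.5, 0.1, 0.4, -0.2, 0.7]
q = rank_quantize(w, n_buckets=3)   # 6 distinct values -> 3 shared values
```

Relative order survives (a weight in a higher bucket is never mapped below one in a lower bucket), which is the property the abstract argues carries most of the model's behavior.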
"Keep your tasks and context in one place, focused on one area of work. Files and instructions stay on your computer.
Import existing projects in one click, or start fresh.
Update or download the Claude desktop app to give it a try: https://claude.com/download..."
via Arxiv 👤 Xuyang Cao, Qianying Liu, Chuan Xiao et al. 📅 2026-03-18
⚡ Score: 6.7
"In multilingual pretraining, the test loss of a pretrained model is heavily influenced by the proportion of each language in the pretraining data, namely the \textit{language mixture ratios}. Multilingual scaling laws can predict the test loss under different language mixture ratios and can therefor..."
via Arxiv 👤 Mohamed Eltahir, Ali Habibullah, Yazan Alshoibi et al. 📅 2026-03-18
⚡ Score: 6.7
"Extending language models to video introduces two challenges: representation, where existing methods rely on lossy approximations, and long-context, where caption- or agent-based pipelines collapse video into text and lose visual fidelity. To overcome this, we introduce \textbf{VideoAtlas}, a task-a..."
via Arxiv 👤 Wenjie Jacky Mo, Qin Liu, Xiaofei Wen et al. 📅 2026-03-18
⚡ Score: 6.7
"Large language models (LLMs) are trained through multi-stage pipelines over heterogeneous data sources, yet developers lack a principled way to pinpoint the specific data responsible for an observed behavior. This lack of observability reduces debugging to reactive patching and makes failures prone..."
via Arxiv 👤 Ya-Ting Yang, Quanyan Zhu 📅 2026-03-18
⚡ Score: 6.7
"Large language models (LLMs) and AI agents are increasingly integrated into enterprise systems to access internal databases and generate context-aware responses. While such integration improves productivity and decision support, the model outputs may inadvertently reveal sensitive information. Altho..."
"Large language models (LLMs) demonstrate strong generative capabilities but remain vulnerable to hallucination and unreliable reasoning under adversarial prompting. Existing safety approaches -- such as reinforcement learning from human feedback (RLHF) and output filtering -- primarily operate at th..."
via Arxiv 👤 Shang-Jui Ray Kuo, Paola Cascante-Bonilla 📅 2026-03-19
⚡ Score: 6.6
"Large vision--language models (VLMs) often use a frozen vision backbone, whose image features are mapped into a large language model through a lightweight connector. While transformer-based encoders are the standard visual backbone, we ask whether state space model (SSM) vision backbones can be a st..."
via Arxiv 👤 Priyaranjan Pattnayak, Sanchari Chowdhuri 📅 2026-03-18
⚡ Score: 6.6
"As large language models (LLMs) are deployed in multilingual settings, their safety behavior in culturally diverse, low-resource languages remains poorly understood. We present the first systematic evaluation of LLM safety across 12 Indic languages, spoken by over 1.2 billion people but underreprese..."
"Anthropic recently shipped interactive artifacts in Claude β charts, diagrams, visualizations rendered right in the chat. Cool feature, locked to one provider. (source)
I wanted the same thing for whatever model I'm running. So I built it. It's c..."
💬 Reddit Discussion: 19 comments
📊 BUZZING
🎯 Local AI models • Interactive HTML • Community contributions
💬 "Qwen3.5 27b has been a standout"
• "I am using Q4 quant"
via Arxiv 👤 Zhang Zhang, Shuqi Lu, Hongjin Qian et al. 📅 2026-03-18
⚡ Score: 6.6
"Building LLM-based agents has become increasingly important. Recent works on LLM-based agent self-evolution primarily record successful experiences as textual prompts or reflections, which cannot reliably guarantee efficient task re-execution in complex scenarios. We propose AgentFactory, a new self..."
via Arxiv 👤 Lintang Sutawika, Aditya Bharat Soni, Bharath Sriraam R R et al. 📅 2026-03-18
⚡ Score: 6.6
"A prerequisite for coding agents to perform tasks on large repositories is code localization - the identification of relevant files, classes, and functions to work on. While repository-level code localization has been performed using embedding-based retrieval approaches such as vector search, recent..."
via Arxiv 👤 Ben S. Southworth, Stephen Thomas 📅 2026-03-18
⚡ Score: 6.6
"Orthogonalized-momentum optimizers such as Muon improve transformer training by approximately whitening/orthogonalizing matrix-valued momentum updates via a short polar-decomposition iteration. However, polar-factor approximations typically require multiple large matrix multiplications, and the resu..."
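The short polar-decomposition iteration behind Muon-style orthogonalized momentum can be sketched with the classic Newton-Schulz recurrence (not Muon's tuned coefficients). The test matrix below is chosen so its polar factor is a known rotation, making the result easy to check:

```python
# Sketch of Newton-Schulz polar iteration: X <- 1.5*X - 0.5*X*X^T*X drives
# every singular value of X toward 1, so X converges to the orthogonal
# polar factor of G. Muon uses a tuned variant of this same recurrence to
# "whiten" matrix-valued momentum with only a few matmuls per step.

import numpy as np

def newton_schulz_orthogonalize(G, steps=20):
    """Approximate the orthogonal polar factor of G (classic 3-term iteration)."""
    X = G / np.linalg.norm(G)  # Frobenius normalization keeps singular values in (0, 1]
    for _ in range(steps):
        X = 1.5 * X - 0.5 * (X @ X.T @ X)
    return X

# A matrix whose polar factor is a known 90-degree rotation R:
R = np.array([[0.0, -1.0], [1.0, 0.0]])
G = R @ np.diag([2.0, 1.0])            # rotation times an SPD stretch
Q = newton_schulz_orthogonalize(G)     # Q should recover R
```

Each iteration costs a few large matrix multiplications, which is exactly the overhead the abstract says the paper tries to reduce.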
via Arxiv 👤 Dharshan Kumaran, Arthur Conmy, Federico Barbero et al. 📅 2026-03-18
⚡ Score: 6.6
"Verbal confidence -- prompting LLMs to state their confidence as a number or category -- is widely used to extract uncertainty estimates from black-box models. However, how LLMs internally generate such scores remains unknown. We address two questions: first, when confidence is computed - just-in-ti..."
via Arxiv 👤 Carlos Hinojosa, Clemens Grange, Bernard Ghanem 📅 2026-03-19
⚡ Score: 6.6
"Vision-language models (VLMs) are increasingly deployed in real-world and embodied settings where safety decisions depend on visual context. However, it remains unclear which visual evidence drives these judgments. We study whether multimodal safety behavior in VLMs can be steered by simple semantic..."
"I haven't seen many people talking about NVIDIA's new Nemotron-3-Nano model, which was released just a couple of days ago... so, I decided to build a WebGPU demo for it! Everything runs locally in your browser (using Transformers.js). On my M4 Max, I get ~75 tokens per second - not bad!
It's a 4B ..."
💬 Reddit Discussion: 4 comments
📊 GOATED ENERGY
🎯 Accessibility • Performance • Hardware
💬 "Incredible for accessibility to do it this way!"
• "Interesting that your WebGPU demo hits ~75 tok/s on M4 Max"
via Arxiv 👤 Md. Asraful Haque, Aasar Mehdi, Maaz Mahboob et al. 📅 2026-03-18
⚡ Score: 6.5
"Large Language Models (LLMs) have achieved unprecedented fluency but remain susceptible to "hallucinations" - the generation of factually incorrect or ungrounded content. This limitation is particularly critical in high-stakes domains where reliability is paramount. We propose a domain-grounded tier..."
+++ Cursor launches Composer 2, a coding-focused AI agent positioned to undercut Anthropic and OpenAI, proving once again that the path to enterprise dominance apparently runs through aggressive pricing and narrow domain expertise. +++
🎯 AI integration channels • Headless API for Claude • Scaling AI workflows
💬 "Architecturally it's a little different, most *claws would call the Agent SDK from some orchestrator, but with claude channels the claude code binary starts the MCP server used to communicate with the channel."
• "Hopefully this is coming to Claude Cowork as well."
"Repo: https://github.com/Dominien/brunnfeld-agentic-world
Been building a multi agent simulation where 20 LLM agents live in a medieval village and run a real economy. No behavioral instructions, no trading strategies, no goals. Just a world wi..."
💬 Reddit Discussion: 24 comments
📊 BUZZING
🎯 Emergent capitalism • AI-driven simulations • Cloudflare-powered village networks
💬 "no prompts, just vibes"
• "Definitely would be the sort of game I spend my whole day on"
via Arxiv 👤 Sadık Bera Yüksel, Derya Aksaray 📅 2026-03-18
⚡ Score: 6.2
"Robotics foundation models have demonstrated strong capabilities in executing natural language instructions across diverse tasks and environments. However, they remain largely data-driven and lack formal guarantees on safety and satisfaction of time-dependent specifications during deployment. In pra..."
"Been building Noren mostly because this kept bothering me: every model has a default voice it falls back on.
Ask five different people to rewrite the same paragraph and you'll get five versions of the same sanitized, oddly formal output!
We're trying to fix that by learning how you actually writ..."
💬 Reddit Discussion: 33 comments
📊 BUZZING
🎯 AI homogenization • Relatable writing styles • Horror movie discussion
💬 "the homogenization thing is so real"
• "It's like they've been indoctrinated by the phrasing of an LLM"
"An interesting data point in the AI safety discussion: Anthropic's own Claude Code CLI tool had a security vulnerability, and it was not an AI-specific attack at all.
CVE-2026-33068 (CVSS 7.7 HIGH) is a workspace trust dialog bypass in Claude Code versions prior to 2.1.53. A malici..."
via Arxiv 👤 Donghang Wu, Tianyu Zhang, Yuxin Li et al. 📅 2026-03-18
⚡ Score: 6.1
"During conversational interactions, humans subconsciously engage in concurrent thinking while listening to a speaker. Although this internal cognitive processing may not always manifest as explicit linguistic structures, it is instrumental in formulating high-quality responses. Inspired by this cogn..."