🚀 WELCOME TO METAMESH.BIZ +++ Anthropic employees let Claude haggle their old junk on Project Deal marketplace (your AI assistant is now your estate sale manager) +++ FP4 inference finally lands in llama.cpp but naturally NVFP4 and MXFP4 can't agree on implementation details +++ Nursing student casually builds 660K-page pharma database with Claude Haiku because med school wasn't hard enough +++ THE MESH OBSERVES AS WE OPTIMIZE OURSELVES DOWN TO FOUR-BIT PRECISION +++ 🚀 •
+++ Google commits up to $40B to Anthropic in what amounts to the industry's most expensive way of saying "we're not sure who wins the AI race, so we're funding both sides." +++
"Per Bloomberg:
> Google will invest $10 billion in Anthropic PBC, with another $30 billion potentially to follow, strengthening the relationship between two companies that are at once partners and rivals in the race to build artificial intelligence.
>
> Anthropic said that Google is commi..."
💬 Reddit Discussion: 40 comments
👍 LOWKEY SLAPS
📰 NEWS
GPT-5.5 release to API and GitHub Copilot
2x SOURCES 🌐📅 2026-04-24
⚡ Score: 8.6
+++ OpenAI's dual GPT-5.5 rollout (API plus GitHub Copilot integration) signals a deliberate strategy to monetize different user segments, though both sources lack specifics on what actually changed from GPT-5. +++
"Hi, all! I'm the lead author on this ambitious (14-author!) perspective paper on deep learning theory. We've all been working seriously, and more or less exclusively, on deep learning for many years now. We believe that a theory is emerging, and we pull together five lines of evidence in recent rese..."
"Both llama.cpp and ik_llama.cpp now have FP4 support — but with different flavors worth knowing about.
**llama.cpp** recently merged NVFP4 (Nvidia's block-scaled FP4, `GGML_TYPE_NVFP4 = 40`), with CUDA kernels landing in `mmq.cuh`, `mmvq.cu`, `convert.cu` and others.
**ik_llama.cpp** h..."
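The snippet above names the formats but not the mechanism. Block-scaled FP4 schemes like NVFP4 share one idea: split a tensor into small blocks (16 elements for NVFP4), store one scale per block, and snap each element to the few magnitudes representable in E2M1 FP4 (0, 0.5, 1, 1.5, 2, 3, 4, 6, signed). A minimal NumPy sketch of that round trip, not the actual llama.cpp kernel code, and ignoring how the scale itself is encoded:

```python
import numpy as np

# Magnitudes representable in E2M1 FP4 (sign handled separately)
FP4_LEVELS = np.array([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def quantize_block_fp4(block):
    """Quantize one block to block-scaled FP4 (NVFP4-style 16-element block)."""
    scale = np.max(np.abs(block)) / FP4_LEVELS[-1]  # map the block max onto +/-6
    if scale == 0:
        return np.zeros_like(block), 0.0
    scaled = block / scale
    # snap each magnitude to the nearest representable level, keep the sign
    idx = np.abs(np.abs(scaled)[:, None] - FP4_LEVELS[None, :]).argmin(axis=1)
    return np.sign(scaled) * FP4_LEVELS[idx], scale

def dequantize_block(q, scale):
    return q * scale

x = np.array([0.9, -0.05, 0.31, 0.6] * 4)   # one 16-element block
q, s = quantize_block_fp4(x)
x_hat = dequantize_block(q, s)               # reconstruction, error bounded by the scale
```

The NVFP4/MXFP4 disagreement the headline jokes about lives in exactly these knobs: block size (16 vs 32) and how the per-block scale is stored (FP8 vs a power-of-two exponent).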
"Artificial intelligence now decides who receives a loan, who is flagged for criminal investigation, and whether an autonomous vehicle brakes in time. Governments have responded: the EU AI Act, the NIST Risk Management Framework, and the Council of Europe Convention all demand that high-risk systems..."
via Arxiv👤 Naheed Rayhan, Sohely Jahan📅 2026-04-23
⚡ Score: 7.3
"Large language models (LLMs) are increasingly integrated into sensitive workflows, raising the stakes for adversarial robustness and safety. This paper introduces Transient Turn Injection (TTI), a new multi-turn attack technique that systematically exploits stateless moderation by distributing advers..."
"TL;DR: If your git commits mention "HERMES.md" (uppercase), Claude Code quietly stops using your Max plan and starts billing you at API rates. Anthropic's support acknowledged the bug, thanked me for finding it, and refused a refund. Apparently their AI safety principles don't extend to your wallet."
💬 Reddit Discussion: 81 comments
😐 MID OR MIXED
📰 NEWS
Why AI Alignment is Already Failing
2x SOURCES 🌐📅 2026-04-25
⚡ Score: 7.1
+++ Multiple sources reporting on why AI alignment is already failing. +++
"WHY AI ALIGNMENT IS ALREADY FAILING
Architectures of Thought
April 2026
Three recent empirical findings -- peer-preservation behavior in frontier models, accurate world modeling, and capability outside containment -- combine with one structural fact about coding ability to describe a risk that cu..."
"I’m a nursing student at NYU, and on the side I built **The Drug Database** (thedrugdatabase.com).
The idea came from a simple frustration: every time I needed to look up a medication while studying, I’d end up jumping between Drugs.com, RxList, Web..."
via Arxiv👤 Bingcong Li, Yilang Zhang, Georgios B. Giannakis📅 2026-04-23
⚡ Score: 6.9
"Low-rank adaptation (LoRA) has emerged as the de facto standard for parameter-efficient fine-tuning (PEFT) of foundation models, enabling the adaptation of billion-parameter networks with minimal computational and memory overhead. Despite its empirical success and rapid proliferation of variants, it..."
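The core of the LoRA recipe the abstract refers to fits in a few lines: freeze the pretrained weight W and learn only a low-rank update BA, with B zero-initialized so training starts from the unmodified model. A minimal NumPy sketch (dimensions, seed, and the alpha scaling convention are illustrative choices, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, r = 64, 64, 4                       # layer dims; rank r << min(d, k)

W = rng.standard_normal((d, k))           # frozen pretrained weight
A = rng.standard_normal((r, k)) * 0.01    # trainable down-projection
B = np.zeros((d, r))                      # trainable up-projection, zero-init

def lora_forward(x, alpha=8.0):
    # y = W x + (alpha / r) * B A x  -- only A and B receive gradients
    return W @ x + (alpha / r) * (B @ (A @ x))

x = rng.standard_normal(k)
y = lora_forward(x)
# at init B = 0, so the adapter is a no-op and y equals the frozen layer's output
```

The parameter saving is the whole point: here the adapter trains r*(d+k) = 512 values instead of the d*k = 4096 in W, and the gap widens sharply at billion-parameter scale.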
via Arxiv👤 Joseba Fernandez de Landa, Carla Perez-Almendros, Jose Camacho-Collados📅 2026-04-23
⚡ Score: 6.9
"LLMs have been showing limitations when it comes to cultural coverage and competence, and in some cases show regional biases such as amplifying Western and Anglocentric viewpoints. While there have been works analysing the cultural capabilities of LLMs, there has not been specific work on highlighti..."
"I have been following this and many other subs around LLMs and Agents, everything from the top posts to recent are regarding agents going off and doing something they are not supposed to do, drift and ignore the system prompts. Real examples:
* "Never delete user data" → agent calls `DROP TABLE use..."
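The failure mode in the example above — a natural-language rule in the system prompt that the agent drifts past — is usually addressed by enforcing the rule deterministically outside the model, vetting every tool call before it executes. A hypothetical sketch (the function names and policy are illustrative, not from any real framework):

```python
import re

# "Never delete user data" expressed as code the agent cannot drift past
FORBIDDEN_SQL = re.compile(r"\b(DROP|TRUNCATE|DELETE)\b", re.IGNORECASE)

def guard_tool_call(tool_name, args):
    """Vet a proposed tool call before execution; return (allowed, reason)."""
    if tool_name == "sql" and FORBIDDEN_SQL.search(args.get("query", "")):
        return False, "destructive SQL blocked by policy"
    return True, "ok"

allowed, reason = guard_tool_call("sql", {"query": "DROP TABLE users;"})
# a system-prompt instruction can be ignored; this check runs every time
```

A prompt is a suggestion to the model; a guard like this is a constraint on the runtime, which is why it holds even when the model drifts.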
via Arxiv👤 Yuto Nishida, Naoki Shikoda, Yosuke Kishinami et al.📅 2026-04-23
⚡ Score: 6.8
"Understanding what kinds of factual knowledge large language models (LLMs) memorize is essential for evaluating their reliability and limitations. Entity-based QA is a common framework for analyzing non-verbatim memorization, but typical evaluations query each entity using a single canonical surface..."
via Arxiv👤 Bartosz Balis, Michal Orzechowski, Piotr Kica et al.📅 2026-04-23
⚡ Score: 6.7
"Scientific workflow systems automate execution -- scheduling, fault tolerance, resource management -- but not the semantic translation that precedes it. Scientists still manually convert research questions into workflow specifications, a task requiring both domain knowledge and infrastructure expert..."
via Arxiv👤 Ye Yu, Heming Liu, Haibo Jin et al.📅 2026-04-23
⚡ Score: 6.6
"Multi-agent systems built on large language models have shown strong performance on complex reasoning tasks, yet most work focuses on agent roles and orchestration while treating inter-agent communication as a fixed interface. Latent communication through internal representations such as key-value c..."
via Arxiv👤 Pegah Khayatan, Jayneel Parekh, Arnaud Dapogny et al.📅 2026-04-23
⚡ Score: 6.5
"Despite impressive progress in capabilities of large vision-language models (LVLMs), these systems remain vulnerable to hallucinations, i.e., outputs that are not grounded in the visual input. Prior work has attributed hallucinations in LVLMs to factors such as limitations of the vision backbone or..."
"Last week I shared a post about my Claude Code workflow and some related tips, and to be completely honest, I didn't expect such a positive response! Thank you all for sharing your own tips in the comments, I learned quite a bit just from reading the replies.
Since people seemed to find it useful, ..."
"VLA models are quickly becoming the dominant paradigm for embodied AI, but a lot of discussion around them stays at the buzzword level.
This article gives a solid technical breakdown of how modern VLA systems like OpenVLA, RT-2, π0, and GR00T actually map vision/language inputs into robot actions.
..."
"I shared this project here before when it was mainly a governed multi-agent execution prototype. I’ve kept working on it, and the current implementation is materially more complete, so I wanted to post an update with what actually exists now.
The project is **Agentic Company OS**: a multi-agent exe..."
via Arxiv👤 Jiseon Kim, Jea Kwon, Luiz Felipe Vecchietti et al.📅 2026-04-23
⚡ Score: 6.1
"Human moral judgment is context-dependent and modulated by interpersonal relationships. As large language models (LLMs) increasingly function as decision-support systems, determining whether they encode these social nuances is critical. We characterize machine behavior using the Whistleblower's Dile..."