πŸš€ WELCOME TO METAMESH.BIZ +++ GPT-5.4 drops with native computer control and 33% fewer hallucinations (OpenAI counting lies like baseball stats now) +++ Pentagon declares Anthropic a supply chain risk which is definitely not about that Amazon contract +++ Someone trained DNA on 9.3 trillion base pairs and it's designing genes while Microsoft's Phi-4 matches GPT-4 at 15B params +++ THE FUTURE IS WRITING ITS OWN GENOME AND RUNNING ON A QUARTER OF THE COMPUTE +++ πŸš€ β€’
πŸš€ WELCOME TO METAMESH.BIZ +++ GPT-5.4 drops with native computer control and 33% fewer hallucinations (OpenAI counting lies like baseball stats now) +++ Pentagon declares Anthropic a supply chain risk which is definitely not about that Amazon contract +++ Someone trained DNA on 9.3 trillion base pairs and it's designing genes while Microsoft's Phi-4 matches GPT-4 at 15B params +++ THE FUTURE IS WRITING ITS OWN GENOME AND RUNNING ON A QUARTER OF THE COMPUTE +++ πŸš€ β€’
AI Signal - PREMIUM TECH INTELLIGENCE
πŸ“Ÿ Optimized for Netscape Navigator 4.0+
πŸ“š HISTORICAL ARCHIVE - March 05, 2026
What was happening in AI on 2026-03-05
← Mar 04 πŸ“Š TODAY'S NEWS πŸ“š ARCHIVE Mar 06 β†’
πŸ“Š You are visitor #47291 to this AWESOME site! πŸ“Š
Archive from: 2026-03-05 | Preserved for posterity ⚑

Stories from March 05, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
πŸ“‚ Filter by Category
Loading filters...
πŸ”’ SECURITY

Found a CVSS 10.0 bypass in Hugging Face's model scanner. We open-sourced ours

πŸš€ HOT STORY

GPT-5.4 Launch Details

+++ OpenAI's latest model adds native computer use and arrives in Pro/Thinking flavors with API improvements, claiming 33% fewer false claims than its predecessor, which is either impressive or tells you something about the baseline. +++

OpenAI launches GPT-5.4, saying it is its β€œmost capable and efficient frontier model for professional work” and its first with native computer use capabilities

🏒 BUSINESS

Jensen Huang says Nvidia is pulling back from OpenAI and Anthropic

πŸ’¬ HackerNews Buzz: 74 comments 🐝 BUZZING
🎯 AI Ecosystem Sustainability β€’ Nvidia's Strategy β€’ OpenAI/Anthropic Profitability
πŸ’¬ "If they fail, and bring down the AI ecosystem with them, that is very bad news for Nvidia." β€’ "Nvidia is in position, and has the resources, to see this with a much broader lens, and realizes OpenAI/Anthropic won't be able to corner the market"
πŸ€– AI MODELS

Microsoft releases Phi-4-reasoning-vision-15B, a 15B-parameter open-weight model it says matches larger systems while using far less compute and training data

🏒 BUSINESS

Dario Amodei calls OpenAI’s messaging around military deal β€˜straight up lies’

πŸ’¬ HackerNews Buzz: 301 comments πŸ‘ LOWKEY SLAPS
🎯 Ethical AI principles β€’ Anthropic vs. OpenAI tactics β€’ AI government contracts
πŸ’¬ "He is trying to make it more possible for the admin to punish us by undercutting our public support." β€’ "Anthropic has been treated terribly and has acted admirably."
🏒 BUSINESS

Pentagon Labels Anthropic Supply-Chain Risk

+++ The DoD formally designated Claude's maker a supply-chain risk, marking the moment government procurement anxiety about frontier AI crossed from memo to official doctrine. +++

Pentagon formally labels Anthropic supply-chain risk

πŸ’¬ HackerNews Buzz: 153 comments 😐 MID OR MIXED
🎯 Military-AI connections β€’ Government overreach β€’ Ethical concerns
πŸ’¬ "The military intervention with AI, aside from being objectively necessary or inevitable in some ways, I find it foreboding, or portending." β€’ "The fact that Pete Hegseth is willing to apply this type of designation against a U.S. company simply because he doesn't like its terms is pretty chilling."
πŸ”¬ RESEARCH

Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

"Large language model (LLM) agents are fundamentally bottlenecked by finite context windows on long-horizon tasks. As trajectories grow, retaining tool outputs and intermediate reasoning in-context quickly becomes infeasible: the working context becomes prohibitively long, eventually exceeds the cont..."
🧠 NEURAL NETWORKS

I thought a 7M model shouldn't be able to do this

"Bias detection and sycophancy resistance don't show up until 18-34M parameters in normal training. **I got both at 7M** by injecting contrastive behavioral pairs into 0.05% of pretraining tokens. No architecture changes, no auxiliary loss, zero inference cost. Bias: 0.000 β†’ 0.433 (vanilla needs 18M..."
πŸ’¬ Reddit Discussion: 7 comments 🐝 BUZZING
🎯 Model Training Efficiency β€’ Overcoming Biases β€’ Model Scaling Limitations
πŸ’¬ "models might be way bigger than they need to be" β€’ "If we just inject the right type of training data at the right time, we might be able to get much more functional models at smaller sizes"
πŸ”¬ RESEARCH

Large genome model: Open source AI trained on trillions of bases

""...Evo 2, an open source AI that has been trained on genomes from all three domains of life (bacteria, archaea, and eukaryotes). After training on trillions of base pairs of DNA, Evo 2 developed internal representations of key features in even complex genomes like ours, including things like regula..."
πŸ’¬ Reddit Discussion: 6 comments πŸ‘ LOWKEY SLAPS
🎯 Large model misunderstanding β€’ Open-source biotech progress β€’ Complexity and features
πŸ’¬ "Large Gnome Model" β€’ "Tiny Giant Model"
πŸ”¬ RESEARCH

Distinct AI Models Seem to Converge on How They Encode Reality

πŸ’Ό JOBS

OpenAI VP Max Schwarzer Joins Anthropic

+++ Multiple sources reporting on openai vp max schwarzer joins anthropic amid recent kerfuffle. +++

OpenAI VP Max Schwarzer joins Anthropic amid recent kerfuffle

"External link discussion - see full content at original source."
πŸ’¬ Reddit Discussion: 85 comments πŸ‘ LOWKEY SLAPS
🎯 Deception at OpenAI β€’ Talent migration to Anthropic β€’ Ethical concerns around AI development
πŸ’¬ "Is everyone starting to understand now why the OpenAI board literally fired Sam for being too deceptive?" β€’ "Max Schwarzer was VP of Research and Head of Post-Training. Led the team that shipped GPT-5, the o-series reasoning models, and more."
⚑ BREAKTHROUGH

AI model trained on 9.3T base pairs can now design novel genes

πŸ”„ OPEN SOURCE

Full Replication of MIT's New "Drifting Model" - Open Source PyTorch Library, Package, and Repo (now live)

"Recently, there was a **lot** of buzz on Twitter and Reddit about a new 1-step image/video generation architecture called ***"Drifting Models"***, introduced by this paper ***Generative Modeling via Drifting*** out of MIT and Harvard. They published the research b..."
πŸ’¬ Reddit Discussion: 2 comments 🐝 BUZZING
🎯 Reproducing research results β€’ Repo structure and documentation β€’ Benchmarking model performance
πŸ’¬ "You didn't replicate the ImageNet results, which are the ones that matter." β€’ "Yes, what a shame that we will have to wait for authors to post official implementation."
πŸ”¬ RESEARCH

Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use

"Agentic language models operate in a fundamentally different safety regime than chat models: they must plan, call tools, and execute long-horizon actions where a single misstep, such as accessing files or entering credentials, can cause irreversible harm. Existing alignment methods, largely optimize..."
⚑ BREAKTHROUGH

Speculative Speculative Decoding: Really, Really Fast LLM Inference

πŸ› οΈ TOOLS

Claude desktop app silently downloads a 13 GB file on every launch β€” and you can't stop it

"Hi. I decided to write this post after some discussion with Claude AI and its support AI, Fin AI Agent. So, as a result, the following text was written by Claude itself to bring this issue into light. This is for a Mac Mini M4 with the free account for Claude, and I'm not aware it affects other plat..."
πŸ’¬ Reddit Discussion: 118 comments 😐 MID OR MIXED
🎯 Virtual machine usage β€’ RAM consumption β€’ Anthropic's response
πŸ’¬ "The worse part actually is it spinning the vm as soon as you open desktop and it eats 1.85GB of your ram." β€’ "This has been raised, publicly flagged, and is already on their radar."
πŸ“Š DATA

The AI Benchmark Trap

πŸ”¬ RESEARCH

Why Understanding AI Internals Won't Explain Agent Failures

πŸ”’ SECURITY

Cursor just exposed another company's project and API keys to me

"I just had a pretty concerning experience while using Cursor and I’m trying to understand if anyone else has seen something similar. I was working on my own project in Cursor and asked the agent a question about my code. Instead of answering about my project, it suddenly started talking about a com..."
πŸ’¬ Reddit Discussion: 23 comments πŸ‘ LOWKEY SLAPS
🎯 AI Hallucination β€’ Cybersecurity Practices β€’ Subdomain Investigation
πŸ’¬ "This sounds like textbook hallucination." β€’ "Assume anything in a tracked file has been indexed."
πŸ› οΈ TOOLS

Cursor now available in JetBrains IDEs

"External link discussion - see full content at original source."
πŸ”¬ RESEARCH

Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

"The accelerating adoption of language models (LMs) as agents for deployment in long-context tasks motivates a thorough understanding of goal drift: agents' tendency to deviate from an original objective. While prior-generation language model agents have been shown to be susceptible to drift, the ext..."
πŸ€– AI MODELS

The L in "LLM" Stands for Lying

πŸ’¬ HackerNews Buzz: 54 comments 🐝 BUZZING
🎯 Change and disruption β€’ Automation and technology β€’ Authenticity and quality
πŸ’¬ "Whether we like it or not, the only constant in life is change." β€’ "Automation is never a 1:1 improvement. It's not just about the speed or process. The process itself changes the product."
πŸ”’ SECURITY

US AI Chip Export Controls Expansion

+++ The US Commerce Department is moving toward per-country approval gates for advanced AI chips, because nothing says "competitive advantage" like adding bureaucratic friction to the supply chain that already can't keep up with demand. +++

Sources: US officials propose expanding AI chip export controls globally, requiring Commerce Department approval for Nvidia and AMD shipments for each country

πŸ”¬ RESEARCH

Speculative Speculative Decoding

"Autoregressive decoding is bottlenecked by its sequential nature. Speculative decoding has become a standard way to accelerate inference by using a fast draft model to predict upcoming tokens from a slower target model, and then verifying them in parallel with a single target model forward pass. How..."
πŸ”¬ RESEARCH

A Dual-LLM Policy for Reducing Noise in Agentic Program Repair

πŸ”’ SECURITY

LLMs can unmask pseudonymous users at scale with surprising accuracy

"So ai can uncover your anonymous identity on social media now so creating burner accounts may be pointless."
πŸ’¬ Reddit Discussion: 38 comments 🐝 BUZZING
🎯 Deanonymization concerns β€’ Maintaining anonymity β€’ Post-truth era
πŸ’¬ "can't wait for companies to start selling 'deanonymization as a service' to the highest bidder" β€’ "if you're gonna have anonymous accounts and burner accounts idk why tf you would ever use real info about yourself"
πŸ”¬ RESEARCH

BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?

"Current benchmarks for code agents primarily assess narrow, repository-specific fixes, overlooking critical real-world challenges such as cross-repository reasoning, domain-specialized problem solving, dependency-driven migration, and full-repository generation. To address this gap, we introduce Bey..."
πŸ”¬ RESEARCH

Efficient Refusal Ablation in LLM through Optimal Transport

"Safety-aligned language models refuse harmful requests through learned refusal behaviors encoded in their internal representations. Recent activation-based jailbreaking methods circumvent these safety mechanisms by applying orthogonal projections to remove refusal directions, but these approaches tr..."
πŸ”¬ RESEARCH

PageIndex: Vectorless, Reasoning-Based RAG

🌐 POLICY

Sam Altman admits OpenAI can't control Pentagon's use of AI

πŸ’¬ HackerNews Buzz: 4 comments πŸ‘ LOWKEY SLAPS
🎯 Corporate ethics β€’ Employee autonomy β€’ Government-industry relations
πŸ’¬ "You do not get to make operational decisions" β€’ "Shut up and dribble"
πŸ”¬ RESEARCH

Evaluating Performance Drift from Model Switching in Multi-Turn LLM Systems

"Deployed multi-turn LLM systems routinely switch models mid-interaction due to upgrades, cross-provider routing, and fallbacks. Such handoffs create a context mismatch: the model generating later turns must condition on a dialogue prefix authored by a different model, potentially inducing silent per..."
πŸ”¬ RESEARCH

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

"Test-time scaling for complex reasoning tasks shows that leveraging inference-time compute, by methods such as independently sampling and aggregating multiple solutions, results in significantly better task outcomes. However, a critical bottleneck is verification: sampling is only effective if corre..."
πŸ€– AI MODELS

[P] Bypassing CoreML to natively train a 110M Transformer on the Apple Neural Engine (Orion)

"It is hard to communicate how frustrating the current Apple ML stack is for low-level research. CoreML imposes opaque abstractions that prevent direct ANE programming and do not support on-device training. Despite having up to 38 TOPS (INT8) and \~19 TFLOPS of fp16 compute, the ANE remains almost en..."
πŸ’¬ Reddit Discussion: 7 comments 🐐 GOATED ENERGY
🎯 ANE Constraints β€’ Model Optimization β€’ Compilation Bottleneck
πŸ’¬ "The ANE is intensely rigidβ€”it only natively accepts a subset of Apple's Model Intermediate Language (MIL), and even then, it silently rejects operations that should work." β€’ "Orion handles this by physically splitting the workload. The compute-bound transformer blocks (Forward/Backward Attention, FFN) get compiled to ANE-native microcode. But the incompatible operationsβ€”embedding lookups, token sampling, the Adam optimizer, and that massive vocabulary classifierβ€”are seamlessly routed back to the CPU in the same native runtime."
🏒 BUSINESS

Anthropic chief back in talks with Pentagon about AI deal

"Well, well, well, how the turntables! I hope this is DoD coming back realizing that MechaHitler Grok ain't gonna cut it for actual military work...but it also could be Anthropic caving.... Paywall bypass: https://archive.ph/PE23N..."
πŸ’¬ Reddit Discussion: 104 comments 😐 MID OR MIXED
🎯 Military AI use β€’ Transparency & accountability β€’ U.S. government criticism
πŸ’¬ "Anthropic isn't trying to save a contract, they're trying to manage an extortion problem." β€’ "The world is watching. No matter how sycophantic our own leaders are towards this government."
πŸ’° FUNDING

China's new five-year blueprint introduces an β€œAI+ action plan”, mentions AI 50+ times, and outlines investments in quantum computing, 6G, embodied AI, and more

πŸ”’ SECURITY

Father claims Google's AI product fuelled son's delusional spiral

πŸ’¬ HackerNews Buzz: 118 comments 😀 NEGATIVE ENERGY
🎯 AI consciousness and sentience β€’ Ethical challenges of AI development β€’ Need for AI regulation and oversight
πŸ’¬ "If a person is deliberately telling someone things in order to get them to hurt themselves, they're guilty of a crime" β€’ "The open models are out there, a snapshot in time - there's no taking them back"
πŸ›‘οΈ SAFETY

Sources: the US used Palantir's Maven Smart System, integrated with Claude, to find and prioritize 1,000 targets within the first 24 hours of its attack on Iran

πŸ€– AI MODELS

zembed-1: new open-weight SOTA multilingual embedding model

"Hey everyone, I'm one of the co-founders of ZeroEntropy. We just released `zembed-1`, a multilingual text embedding model that sets a new state of the art across major benchmarks. `zembed-1` is a general-purpose text embedding model built for retrieval, semantic search, and RAG pipelines. Weights a..."
πŸ’¬ Reddit Discussion: 8 comments 🐝 BUZZING
🎯 Embedding model performance β€’ Specialized encoding models β€’ Bi-encoder models
πŸ’¬ "is there still a meaningful quality drop before reranking kicks in?" β€’ "Congrats on the launch"
πŸ”¬ RESEARCH

Understanding and Mitigating Dataset Corruption in LLM Steering

"Contrastive steering has been shown as a simple and effective method to adjust the generative behavior of LLMs at inference time. It uses examples of prompt responses with and without a trait to identify a direction in an intermediate activation layer, and then shifts activations in this 1-dimension..."
πŸ”¬ RESEARCH

Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks

"Multimodal web agents that process both screenshots and accessibility trees are increasingly deployed to interact with web interfaces, yet their dual-stream architecture opens an underexplored attack surface: an adversary who injects content into the webpage DOM simultaneously corrupts both observat..."
πŸ”’ SECURITY

A GitHub Issue Title Compromised 4k Developer Machines

πŸ’¬ HackerNews Buzz: 51 comments πŸ‘ LOWKEY SLAPS
🎯 GitHub Actions vulnerabilities β€’ Untrusted code execution β€’ AI tool security
πŸ’¬ "GitHub's issues trigger is just as dangerous as the infamous pull_request_target." β€’ "The real fix isn't just better input sanitization - it's treating AI tool outputs as untrusted by default."
πŸ› οΈ SHOW HN

SmartAgentKit AI Agent Wallets

+++ Developers are building policy-governed wallets so AI agents can transact without going full Skynet with your crypto, because apparently giving machines financial autonomy requires actual guardrails. +++

Show HN: SmartAgentKit – policy-governed smart wallets for AI agents

πŸ›‘οΈ SAFETY

Anthropic launches an early-warning system for potential AI-driven destruction of white-collar jobs, says it shows β€œlimited evidence” of AI-led job loss so far

πŸ”¬ RESEARCH

Dissecting Quantization Error: A Concentration-Alignment Perspective

"Quantization can drastically increase the efficiency of large language and vision models, but typically incurs an accuracy drop. Recently, function-preserving transforms (e.g. rotations, Hadamard transform, channel-wise scaling) have been successfully applied to reduce post-training quantization err..."
βš–οΈ ETHICS

Sam Altman in Damage Control Mode as ChatGPT Users Are Mass Cancelling Subscriptions Because OpenAI Is "Training a War Machine"

"External link discussion - see full content at original source."
πŸ’¬ Reddit Discussion: 253 comments 😐 MID OR MIXED
🎯 Concerns about data privacy β€’ Criticism of US government β€’ Disillusionment with institutions
πŸ’¬ "the world needs to wake up to the fact that only data of Americans is protected by the US constitution" β€’ "The Constitution doesn't protect anything. It's a crumbling document written for different times"
πŸ€– AI MODELS

Final Qwen3.5 Unsloth GGUF Update!

"Hey r/LocalLLaMA this week we worked on **further improving** the best size/KLD tradeoff for Qwen3.5, and we’re excited to share new GGUF benchmarks for Qwen3.5-122B-A10B and Qwen3.5-35B-A3B (99.9% KL divergence). This will likely be our final GGUF update. We’re also deeply saddened by the news aro..."
πŸ’¬ Reddit Discussion: 131 comments 🐝 BUZZING
🎯 Continuous Improvements β€’ Version Control β€’ Performance Optimization
πŸ’¬ "this is the 'final' update has got `qwen3.5_gguf_final_final_v2` vibes" β€’ "if this is the last round of re-re-uploads"
πŸ”¬ RESEARCH

GLiNER2: Unified Schema-Based Information Extraction

πŸ› οΈ TOOLS

A day in the life of a ChatGPT user πŸ’€

"External link discussion - see full content at original source."
πŸ’¬ Reddit Discussion: 145 comments 😐 MID OR MIXED
🎯 Deleting GPT accounts β€’ Performative outrage β€’ Validation-seeking in Reddit
πŸ’¬ "I want to see someone make a post just declaring that they aren't deleting gpt just to see how differently people react lol" β€’ "Honestly whining about sam then posting about deleting your account is so performative, just delete your account and move on, nobody is going to give you a Nobel peace prize for it"
🎨 CREATIVE

And so…

"I saw this on Instagram today. Tbh I’m all about hating on AI (particularly for geopolitical, environmental, and security reasons…it’s awful), but this particular crit is introguing to me because it touches on what I consider its poorest use (and from what ppl post here, its most typical usage). You..."
πŸ’¬ Reddit Discussion: 127 comments 😐 MID OR MIXED
🎯 Sarcastic Critique β€’ Ironic Suggestions β€’ Exaggerated Plotting
πŸ’¬ "You're not crazy. You're solution-oriented. And that's rare." β€’ "Would you like me to create a invasion strategy, covering your ideas?"
πŸ› οΈ SHOW HN

Show HN: A zero-dependency multi-agent AI that negotiates instead of agreeing

πŸ₯ HEALTHCARE

Study: ChatGPT Health underestimated the severity of medical emergencies 51.6% of the time and overestimated the severity in nonurgent cases 64.8% of the time

πŸ› οΈ SHOW HN

Show HN: Kryfto – Self-hosted MCP server with 42 tools for AI agent web access

πŸ› οΈ SHOW HN

Show HN: AgentsMesh – AI agent fleet command center

πŸ› οΈ SHOW HN

Show HN: I built a CLI to sync AI agent skills and MCPs across coding agents

🏒 BUSINESS

NASA chatbots, Treasury coding, OPM drafting: How agencies have deployed Claude

πŸ› οΈ SHOW HN

Show HN: OpenTimelineEngine – Shared local memory for Claude Code and codex

πŸ¦†
HEY FRIENDO
CLICK HERE IF YOU WOULD LIKE TO JOIN MY PROFESSIONAL NETWORK ON LINKEDIN
🀝 LETS BE BUSINESS PALS 🀝