🚀 WELCOME TO METAMESH.BIZ +++ Claude accidentally serving up random users' lease agreements like a confused paralegal (privacy theater continues) +++ Brain-computer interfaces now running at 380M params because Zyphra decided your EEG data deserves Apache 2.0 liberation +++ Kitten TTS squeezing voice synthesis into under 25MB while everyone else burns GPUs on billion-param models +++ OpenAI and Paradigm built EVMbench to test if AI can hack smart contracts (spoiler: they're getting concerningly good) +++ THE FUTURE IS NEUROMORPHIC, POCKET-SIZED, AND READING YOUR THOUGHTS THROUGH COMMODITY HARDWARE +++ 🚀
AI Signal - PREMIUM TECH INTELLIGENCE
📚 HISTORICAL ARCHIVE - February 19, 2026
What was happening in AI on 2026-02-19
← Feb 18 πŸ“Š TODAY'S NEWS πŸ“š ARCHIVE Feb 20 β†’
πŸ“Š You are visitor #47291 to this AWESOME site! πŸ“Š
Archive from: 2026-02-19 | Preserved for posterity ⚑

Stories from February 19, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🔬 RESEARCH

Measuring AI agent autonomy in practice

💬 HackerNews Buzz: 22 comments 😐 MID OR MIXED
🎯 Measuring agent autonomy • Capability vs. authorization • Limitations of metrics
💬 "The fact that there is no clear trend in lower percentiles makes this more suspect to me." • "The missing metric is permission utilization: what fraction of the agent's actions fell within explicitly granted authority?"
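The "permission utilization" metric proposed in the second comment can be sketched in a few lines. This is a minimal illustration and not something taken from the linked study: the `Action` record, the `(tool, resource)` grant format, and the function name `permission_utilization` are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Optional, Set, Tuple


@dataclass(frozen=True)
class Action:
    """Hypothetical record of one agent action: which tool touched which resource."""
    tool: str
    resource: str


def permission_utilization(
    actions: Iterable[Action], granted: Set[Tuple[str, str]]
) -> Optional[float]:
    """Fraction of actions that fall within explicitly granted (tool, resource) pairs.

    Returns None when the agent took no actions, since the ratio is undefined.
    """
    actions = list(actions)
    if not actions:
        return None
    within = sum(1 for a in actions if (a.tool, a.resource) in granted)
    return within / len(actions)


# Assumed example data: three of the four actions were explicitly authorized.
granted = {("read_file", "repo/src"), ("run_tests", "repo")}
log = [
    Action("read_file", "repo/src"),
    Action("run_tests", "repo"),
    Action("read_file", "repo/src"),
    Action("write_file", "repo/secrets"),  # outside granted authority
]
ratio = permission_utilization(log, granted)
```

A real deployment would likely need wildcard or hierarchical grants rather than exact pair matching; exact matching is used here only to keep the definition unambiguous.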
🔒 SECURITY

Claude just gave me access to another user's legal documents

"The strangest thing just happened. I asked Claude Cowork to summarize a document and it began describing a legal document that was totally unrelated to what I had provided. After asking Claude to generate a PDF of the legal document it referenced and I got a complete lease agreement contract in wh..."
💬 Reddit Discussion: 104 comments 😐 MID OR MIXED
🎯 AI Capabilities • Legal Documents • Data Privacy
💬 "Lmao you're calling a company because an AI hallucinated a legal document?" • "I don't believe it searched internet during this session."
🛠️ TOOLS

Lessons from Building Claude Code: Prompt Caching Is Everything

🔬 RESEARCH

The Geometry of Alignment Collapse: When Fine-Tuning Breaks Safety

"Fine-tuning aligned language models on benign tasks unpredictably degrades safety guardrails, even when training data contains no harmful content and developers have no adversarial intent. We show that the prevailing explanation, that fine-tuning updates should be orthogonal to safety-critical direc..."
🔬 RESEARCH

FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

"The growing demand for large language models (LLMs) requires serving systems to handle many concurrent requests with diverse service level objectives (SLOs). This exacerbates head-of-line (HoL) blocking during the compute-intensive prefill phase, where long-running requests monopolize resources and..."
💰 FUNDING

Fei-Fei Li's World Labs $1B funding round

+++ Fei-Fei Li's outfit secured a billion dollars from the usual suspects (Nvidia, a16z, Autodesk, AMD, Sea) to build world models that could actually make robotics and scientific discovery less of a brute-force affair. +++

Fei-Fei Li's World Labs raised $1B from Autodesk, a16z, Nvidia, AMD, Sea, and others to build its world models for robotics, scientific discovery, and more

🔬 RESEARCH

Policy Compiler for Secure Agentic Systems

"LLM-based agents are increasingly being deployed in contexts requiring complex authorization policies: customer service protocols, approval workflows, data access restrictions, and regulatory compliance. Embedding these policies in prompts provides no enforcement guarantees. We present PCAS, a Polic..."
🎯 PRODUCT

Anthropic Claude Code policy clarifications

+++ Anthropic closed a loophole where builders were sharing subscription credentials for Claude access, forcing a reckoning for anyone treating subscription logins like a group Netflix password. +++

Anthropic officially bans using subscription auth for third party use

💬 HackerNews Buzz: 381 comments 👍 LOWKEY SLAPS
🎯 AI model lock-in • Subscription model economics • Open vs closed ecosystems
💬 "If Claude Code rug-pulls subscription quotas, just switch to a competitor instantly" • "At some point Claude Code will become an ecosystem with preferred cloud and database vendors, observability, code review agents, etc."
🔬 RESEARCH

ZUNA "Thought-to-Text": a 380M-parameter BCI foundation model for EEG data (Apache 2.0)

"- Technical paper: https://zyphra.com/zuna-technical-paper - Technical blog: https://zyphra.com/post/zuna - Hugging Face: https://huggingface.co/Zyphra/ZUNA - GitHub: [https://gith..."
💬 Reddit Discussion: 18 comments 👍 LOWKEY SLAPS
🎯 EEG decoding limitations • Thought-to-text capabilities • Concerns about privacy/ethics
💬 "Technical blog title 'BCI Foundation Model Advancing Towards Thought-to-Text'" • "Great for accessibility in general and amazing for severely disabled people. A nightmare for just about anything else I can think of"
🤖 AI MODELS

GLM-OCR model support in llama.cpp

+++ GLM-OCR lands in the wild as a 0.9B parameter multimodal model, meaning you can actually run document understanding on hardware that isn't a data center, which is refreshingly practical. +++

model: support GLM-OCR by ngxson · Pull Request #19677 · ggml-org/llama.cpp

"tl;dr **0.9B OCR model (you can run it on any potato)** # Introduction GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture. It introduces Multi-Token Prediction (MTP) loss and stable full-task reinforcement learning to improve tra..."
💬 Reddit Discussion: 8 comments 🐝 BUZZING
🎯 OCR model references • Handwritten text recognition • Model performance and deployment
💬 "0.9B OCR model that runs on any potato is exactly what i was hoping someone would build." • "the MTP loss approach is interesting for OCR specifically since document text has strong sequential patterns."
🛠️ TOOLS

Kitten TTS V0.8 is out: New SOTA Super-tiny TTS Model (Less than 25 MB)

"**Model introduction:** New Kitten models are out. Kitten ML has released open source code and weights for three new tiny expressive TTS models - 80M, 40M, 14M (all Apache 2.0) Discord: https://discord.com/invite/VJ86W4SURW GitHub: [https://github.com/Kitt..."
💬 Reddit Discussion: 127 comments 🐝 BUZZING
🎯 Offline Firefox Extension • TTS Audio Playback • Training New Languages
💬 "A firefox/chrome extension would be #1 in like a week, I'm telling you" • "Make sure you leverage browser's native HTMLAudioElement to handle playback and speed adjustments efficiently"
🔒 SECURITY

Manipulating AI memory for profit: The rise of AI Recommendation Poisoning

⚡ BREAKTHROUGH

[P] Catalyst N1 & N2: Two open neuromorphic processors with Loihi 1/2 feature parity, 5 neuron models, 85.9% SHD accuracy

"I've been building neuromorphic processor architectures from scratch as a solo project. After 238 development phases, I now have two generations (N1 targeting Loihi 1 and N2 targeting Loihi 2), both validated on FPGA, with a complete Python SDK. **Technical papers:** - [Catalyst N1 paper (13 pages..."
🔬 RESEARCH

[R] Predicting Edge Importance in GPT-2's Induction Circuit from Weights Alone (ρ=0.623, 125x speedup)

"TL;DR: Two structural properties of virtual weight matrices, spectral concentration and downstream path weight, predict which edges in GPT-2 small's induction circuit are causally important, without any forward passes, ablations, or training data. Spearman ρ=0.623 with path patching ground truth (p ..."
🔬 RESEARCH

GLM-5: from Vibe Coding to Agentic Engineering

"We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintain..."
🔬 RESEARCH

Knowledge graph of the transformer paper lineage, from Attention Is All You Need to DPO, mapped as an interactive concept graph [generated from a CLI + 12 PDFs]

"Wanted to understand how the core transformer papers actually connect at the concept level - not just "Paper B cites Paper A" but what specific methods, systems, and ideas flow between them. I ran 12 foundational papers (Attention Is All You Need, BERT, GPT-2/3, Scaling Laws, ViT, LoRA, Chain-of-Th..."
🛠️ SHOW HN

Show HN: Axon – Run autonomous coding agents (Claude, Codex) safely on Kubernetes

🔒 SECURITY

EVMbench smart contract vulnerability benchmark

+++ OpenAI and Paradigm just dropped EVMbench, an open-source benchmark measuring whether AI agents can actually find, exploit, and fix smart contract vulnerabilities instead of just hallucinating security theater. +++

Open-source benchmark EVMbench tests how well AI agents handle smart contract exploits

"EVMbench is a new open-source benchmark designed to test AI agents on practical smart contract security tasks. The benchmark was developed by OpenAI and Paradigm, and it focuses on real-world vulnerability patterns drawn from audited codebases and contest reports."
🔬 RESEARCH

Evaluating AI agents: Real-world lessons from building agentic systems at Amazon

🔬 RESEARCH

Towards a Science of AI Agent Reliability

"AI agents are increasingly deployed to execute important tasks. While rising accuracy scores on standard benchmarks suggest rapid progress, many agents still continue to fail in practice. This discrepancy highlights a fundamental limitation of current evaluations: compressing agent behavior into a s..."
🔬 RESEARCH

Operationalising the Superficial Alignment Hypothesis via Task Complexity

"The superficial alignment hypothesis (SAH) posits that large language models learn most of their knowledge during pre-training, and that post-training merely surfaces this knowledge. The SAH, however, lacks a precise definition, which has led to (i) different and seemingly orthogonal arguments suppo..."
🛠️ SHOW HN

Show HN: OpenCastor – A universal runtime connecting AI models to robot hardware

🔬 RESEARCH

Causality is Key for Interpretability Claims to Generalise

"Interpretability research on large language models (LLMs) has yielded important insights into model behaviour, yet recurring pitfalls persist: findings that do not generalise, and causal interpretations that outrun the evidence. Our position is that causal inference specifies what constitutes a vali..."
💼 JOBS

How AI is affecting productivity and jobs in Europe

💬 HackerNews Buzz: 66 comments 🐝 BUZZING
🎯 Planned obsolescence • AI productivity impact • AI adoption challenges
💬 "The Phoebus.AI cartel was an international cartel that controlled the manufacture and sale of computer components" • "Specialisation means that no innovation unrelated to AI gets mind share, investment, patent applications"
🛢️ BUSINESS

Palantir partnership is at heart of Anthropic, Pentagon rift

🔬 RESEARCH

A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models

"Large language models and LLM-based agents are increasingly used for cybersecurity tasks that are inherently dual-use. Existing approaches to refusal, spanning academic policy frameworks and commercially deployed systems, often rely on broad topic-based bans or offensive-focused taxonomies. As a res..."
🔒 SECURITY

Security audit of OpenClaw and other similar open source AI Agents

💬 HackerNews Buzz: 1 comment 🐐 GOATED ENERGY
🎯 Security Audits • AI Agent Frameworks • Software Composition Analysis
💬 "deep security audit using Prismor" • "SBOM reviews, and vulnerability mapping"
🔬 RESEARCH

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

"Current audio language models are predominantly text-first, either extending pre-trained text LLM backbones or relying on semantic-only audio tokens, limiting general audio modeling. This paper presents a systematic empirical study of native audio foundation models that apply next-token prediction t..."
🔄 OPEN SOURCE

llama.cpp PR to implement IQ*_K and IQ*_KS quants from ik_llama.cpp

💬 Reddit Discussion: 46 comments 🐝 BUZZING
🎯 Integrating quantization techniques • Conflict resolution between developers • Technical tradeoffs of code merging
💬 "desperately need better quants in mainline!" • "landed upstream. the quality gains at low bpp were wild"
🔒 SECURITY

Microsoft confirms a bug that let Microsoft 365 Copilot summarize confidential emails from Sent Items and Drafts folders, and deployed a fix in early February

💰 FUNDING

Toronto-based chip startup Taalas, which hardwires AI models into custom silicon to achieve faster inference, raised $169M, bringing its total funding to $219M

🔬 RESEARCH

CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing

"A central challenge in large language model (LLM) editing is capability preservation: methods that successfully change targeted behavior can quietly game the editing proxy and corrupt general capabilities, producing degenerate behaviors reminiscent of proxy/reward hacking. We present CrispEdit, a sc..."
🔬 RESEARCH

Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

"The widespread deployment of large language models (LLMs) across linguistic communities necessitates reliable multilingual safety alignment. However, recent efforts to extend alignment to other languages often require substantial resources, either through large-scale, high-quality supervision in the..."
🔬 RESEARCH

Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments

"Agent Skill framework, now widely and officially supported by major players such as GitHub Copilot, LangChain, and OpenAI, performs especially well with proprietary models by improving context engineering, reducing hallucinations, and boosting task accuracy. Based on these observations, an investiga..."
🤖 AI MODELS

Gemini 3.1 Pro release

+++ Google's releasing Gemini 3.1 Pro to all users with claims of improved reasoning, marking the first ".1" increment in the Gemini line, suggesting either real progress or excellent marketing timing. +++

Gemini 3.1 Pro

💬 HackerNews Buzz: 511 comments 🐝 BUZZING
🎯 Model Performance Comparison • Model Behavior and Usability • Model Sustainability and Pricing
💬 "Gemini just falls over a lot when actually trying to get things done" • "It feels like you have to be diligent about adopting new models"
🔬 RESEARCH

This human study did not involve human subjects: Validating LLM simulations as behavioral evidence

"A growing literature uses large language models (LLMs) as synthetic participants to generate cost-effective and nearly instantaneous responses in social science experiments. However, there is limited guidance on when such simulations support valid inference about human behavior. We contrast two stra..."
🔬 RESEARCH

From Growing to Looping: A Unified View of Iterative Computation in LLMs

"Looping, reusing a block of layers across depth, and depth growing, training shallow-to-deep models by duplicating middle layers, have both been linked to stronger reasoning, but their relationship remains unclear. We provide a mechanistic unification: looped and depth-grown models exhibit convergen..."
🔬 RESEARCH

Recursive Concept Evolution for Compositional Reasoning in Large Language Models

"Large language models achieve strong performance on many complex reasoning tasks, yet their accuracy degrades sharply on benchmarks that require compositional reasoning, including ARC-AGI-2, GPQA, MATH, BBH, and HLE. Existing methods improve reasoning by expanding token-level search through chain-of..."
🔬 RESEARCH

Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology

"Large language models (LLMs) perform strongly on biological benchmarks, raising concerns that they may help novice actors acquire dual-use laboratory skills. Yet, whether this translates to improved human performance in the physical laboratory remains unclear. To address this, we conducted a pre-reg..."
🛠️ TOOLS

What tech stack Claude Code defaults to when building apps

🔒 SECURITY

Microsoft guide to pirating Harry Potter for LLM training (2024) [removed]

💬 HackerNews Buzz: 172 comments 👍 LOWKEY SLAPS
🎯 Copyright concerns • Microsoft's copyright violations • Debate over fair use
💬 "It's like we've all collectively decided that copyright just doesn't matter anymore." • "There are parts of the world where certain developers don't understand the way the west tends to work with regard to copyright."
🔬 RESEARCH

Reinforced Fast Weights with Next-Sequence Prediction

"Fast weight architectures offer a promising alternative to attention-based transformers for long-context modeling by maintaining constant memory overhead regardless of context length. However, their potential is limited by the next-token prediction (NTP) training paradigm. NTP optimizes single-token..."
🤖 AI MODELS

FlashLM v4: 4.3M ternary model trained on CPU in 2 hours - coherent stories from adds and subtracts only

"Back with v4. Some of you saw v3 - 13.6M params, ternary weights, trained on CPU, completely incoherent output. Went back to the drawing board and rebuilt everything from scratch. **What it is:** 4.3M parameter language model where every weight in the model body is -1, 0, or +1. Trained for 2 hour..."
💬 Reddit Discussion: 38 comments 🐝 BUZZING
🎯 Quantized language models • Efficient model architecture • Advances in model performance
💬 "The ternary quantization is from BitNet. The architecture (conv mixer in v4, dual delta-rule mixer in v5) is original." • "A 4.3M parameter ternary model packs into ~850KB. The full v5 target (~70M params) would be ~14MB, which fits entirely in L3 cache on a 7950X3D (96MB V-Cache)."
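The size figures quoted in that comment are consistent with simple arithmetic. The sketch below is a back-of-the-envelope check, assuming every parameter is ternary and ignoring any full-precision embeddings, norms, or file metadata (the post only says that weights in the model body are ternary). The function names are illustrative and are not FlashLM code.

```python
import math

# Assumption: every parameter is ternary (-1, 0, +1). Real checkpoints may
# also carry full-precision embeddings, norms, or metadata.
BITS_PER_TRIT = math.log2(3)  # about 1.585 bits, the information-theoretic floor


def ideal_size_bytes(num_params: int) -> float:
    """Lower bound on storage if ternary weights were packed perfectly."""
    return num_params * BITS_PER_TRIT / 8


def packed_size_bytes(num_params: int) -> int:
    """Practical packing: 5 ternary weights per byte, since 3**5 = 243 <= 256."""
    return math.ceil(num_params / 5)


for name, n in [("v4 (4.3M params)", 4_300_000), ("v5 target (70M params)", 70_000_000)]:
    print(
        f"{name}: ideal {ideal_size_bytes(n) / 1e6:.2f} MB, "
        f"5-per-byte {packed_size_bytes(n) / 1e6:.2f} MB"
    )
```

Under these assumptions both estimates land close to the roughly 850KB and 14MB quoted in the comment, so the claim about fitting in a 96MB cache is plausible on size alone.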
🧠 NEURAL NETWORKS

The next era of AI is not LLMs, it's Energy-Based Models EBMs

🌐 POLICY

U.S. Department of the Treasury's AI Strategy [pdf]

🛠️ TOOLS

MemoTrail – Persistent memory for AI coding assistants (100% local)

🎨 CREATIVE

Google rolls out Lyria 3, a generative music model that can make 30-second tracks with Nano Banana-made cover art, in beta in the Gemini app in eight languages

🛠️ TOOLS

Update from Anthropic regarding the Agent SDK.

💬 Reddit Discussion: 36 comments 👍 LOWKEY SLAPS
🎯 Allowed vs. Prohibited Use • SDK Usage Guidelines • Community Engagement
💬 "They really should simply show a table showing allowed vs prohibited use" • "We absolutely should be allowed to use OAuth tokens for this stuff"
🔒 SECURITY

Kernel-enforced sandbox App and SDK for AI agents, MCP and LLM workloads

⚖️ ETHICS

AI makes you boring

💬 HackerNews Buzz: 241 comments 🐝 BUZZING
🎯 Automation in Art • Hiding Creative Processes • Survival of Boring Projects
💬 "The creative has to hide their process. They lie about how they make their art, and gatekeep the most valuable secrets." • "LLMs have essentially broken the natural selection of pet projects and allow even bad or not very interesting ideas to survive."
⚡ BREAKTHROUGH

Machine learning helps solve a central problem of quantum chemistry

"By applying new methods of machine learning to quantum chemistry research, Heidelberg University scientists have made significant strides in computational chemistry. They have achieved a major breakthrough toward solving a decades-old dilemma in quantum chemistry: the precise and stable calculation..."
🔒 SECURITY

Ask HN: What makes AI agent runtime logs defensible under adversarial audit?

🛠️ SHOW HN

Show HN: ClawShield – Open-source firewall for agent-to-agent AI communication

🛠️ TOOLS

Sayou – Open-source Dropbox for AI agents

🔬 RESEARCH

Protecting the Undeleted in Machine Unlearning

"Machine unlearning aims to remove specific data points from a trained model, often striving to emulate "perfect retraining", i.e., producing the model that would have been obtained had the deleted data never been included. We demonstrate that this approach, and security definitions that enable it, c..."
🛠️ SHOW HN

Show HN: Sieves, a unified interface for structured document AI

🔬 RESEARCH

CMind: An AI Agent for Localizing C Memory Bugs

🔒 SECURITY

Boundary Point Jail A new way to break the strongest AI defences

🛠️ SHOW HN

Show HN: Cogitator – Self-hosted AI agent runtime with native A2A Protocol

🌐 POLICY

What's next for Chinese open-source AI

🎭 MULTIMODAL

Last week in Multimodal AI - Vision Edition

"I curate a weekly multimodal AI roundup, here are the vision-related highlights from last week: **Qwen3.5-397B-A17B - Native Vision-Language Foundation Model** * 397B-parameter MoE model with hybrid linear attention that integrates vision natively into the architecture. * Handles document parsing,..."