WELCOME TO METAMESH.BIZ +++ Claude accidentally serving up random users' lease agreements like a confused paralegal (privacy theater continues) +++ Brain-computer interfaces now running at 380M params because Zyphra decided your EEG data deserves Apache 2.0 liberation +++ Kitten TTS squeezing voice synthesis into 14MB while everyone else burns GPUs on billion-param models +++ OpenAI and Paradigm built EVMbench to test if AI can hack smart contracts (spoiler: they're getting concerningly good) +++ THE FUTURE IS NEUROMORPHIC, POCKET-SIZED, AND READING YOUR THOUGHTS THROUGH COMMODITY HARDWARE +++
HackerNews Buzz: 22 comments
Sentiment: MID OR MIXED
Topics: Measuring agent autonomy • Capability vs. authorization • Limitations of metrics
• "The fact that there is no clear trend in lower percentiles makes this more suspect to me."
• "The missing metric is permission utilization: what fraction of the agent's actions fell within explicitly granted authority?"
"The strangest thing just happened.
I asked Claude Cowork to summarize a document and it began describing a legal document that was totally unrelated to what I had provided. After asking Claude to generate a PDF of the legal document it referenced and I got a complete lease agreement contract in wh..."
Reddit Discussion: 104 comments
Sentiment: MID OR MIXED
Topics: AI Capabilities • Legal Documents • Data Privacy
• "Lmao you're calling a company because an AI hallucinated a legal document?"
• "I don't believe it searched internet during this session."
via Arxiv • Max Springer, Chung Peng Lee, Blossom Metevier et al. • 2026-02-17
Score: 8.0
"Fine-tuning aligned language models on benign tasks unpredictably degrades safety guardrails, even when training data contains no harmful content and developers have no adversarial intent. We show that the prevailing explanation, that fine-tuning updates should be orthogonal to safety-critical direc..."
via Arxiv • Chia-chi Hsieh, Zan Zong, Xinyang Chen et al. • 2026-02-18
Score: 7.8
"The growing demand for large language models (LLMs) requires serving systems to handle many concurrent requests with diverse service level objectives (SLOs). This exacerbates head-of-line (HoL) blocking during the compute-intensive prefill phase, where long-running requests monopolize resources and..."
FUNDING
Fei-Fei Li's World Labs $1B funding round
2x SOURCES • 2026-02-18
Score: 7.8
+++ Fei-Fei Li's outfit secured a billion dollars from the usual suspects (Nvidia, a16z, Autodesk, AMD, Sea) to build world models that could actually make robotics and scientific discovery less of a brute-force affair. +++
Topics: World model definitions • World model applications • Investor information
• "the current approach for world labs is likely based on the expertise of the founders"
• "What are the industries that would truly benefit from good world models?"
via Arxiv • Nils Palumbo, Sarthak Choudhary, Jihye Choi et al. • 2026-02-18
Score: 7.6
"LLM-based agents are increasingly being deployed in contexts requiring complex authorization policies: customer service protocols, approval workflows, data access restrictions, and regulatory compliance. Embedding these policies in prompts provides no enforcement guarantees. We present PCAS, a Polic..."
PRODUCT
Anthropic Claude Code policy clarifications
2x SOURCES • 2026-02-18
Score: 7.6
+++ Anthropic closed a loophole where builders were sharing subscription credentials for Claude access, forcing a reckoning for anyone treating API keys like a group Netflix password. +++
Topics: AI model lock-in • Subscription model economics • Open vs closed ecosystems
• "If Claude Code rug-pulls subscription quotas, just switch to a competitor instantly"
• "At some point Claude Code will become an ecosystem with preferred cloud and database vendors, observability, code review agents, etc."
Reddit Discussion: 103 comments
Sentiment: MID OR MIXED
Topics: Pricing and Sustainability • SDK Usage Policies • Anthropic's Communication
• "Becoming exceedingly clear how much the current landscape is propped up with subsidized pricing"
• "You can also run it in stream mode directly yourself too without the SDK, no clue what their goal is with that"
π¬ "Technical blog title 'BCI Foundation Model Advancing Towards Thought-to-Text"
β’ "Great for accessibility in general and amazing for severely disabled people. A nightmare for just about anything else I can think of"
AI MODELS
GLM-OCR model support in llama.cpp
2x SOURCES • 2026-02-18
Score: 7.4
+++ GLM-OCR lands in the wild as a 0.9B parameter multimodal model, meaning you can actually run document understanding on hardware that isn't a data center, which is refreshingly practical. +++
"tl;dr **0.9B OCR model (you can run it on any potato)**
# Introduction
GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder-decoder architecture. It introduces Multi-Token Prediction (MTP) loss and stable full-task reinforcement learning to improve tra..."
Reddit Discussion: 8 comments
Sentiment: BUZZING
Topics: OCR model references • Handwritten text recognition • Model performance and deployment
• "0.9B OCR model that runs on any potato is exactly what i was hoping someone would build."
• "the MTP loss approach is interesting for OCR specifically since document text has strong sequential patterns."
Reddit Discussion: 2 comments
Sentiment: GOATED ENERGY
Topics: PDF processing • Tool usage • Tool comparison
• "Would really appreciate some resources on how to actually use this in practice."
• "I would really like to use this to be able to convert pdfs to text + latex equations + markdown tables + separate images."
"**Model introduction:**
New Kitten models are out. Kitten ML has released open source code and weights for three new tiny expressive TTS models - 80M, 40M, 14M (all Apache 2.0)
Discord: https://discord.com/invite/VJ86W4SURW
GitHub: [https://github.com/Kitt..."
Reddit Discussion: 127 comments
Sentiment: BUZZING
Topics: Offline Firefox Extension • TTS Audio Playback • Training New Languages
• "A firefox/chrome extension would be #1 in like a week, I'm telling you"
• "Make sure you leverage browser's native HTMLAudioElement to handle playback and speed adjustments efficiently"
AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
"I've been building neuromorphic processor architectures from scratch as a solo project. After 238 development phases, I now have two generations β N1 targeting Loihi 1 and N2 targeting Loihi 2 β both validated on FPGA, with a complete Python SDK.
**Technical papers:**
- [Catalyst N1 paper (13 pages..."
"TL;DR: Two structural properties of virtual weight matrices ,spectral concentration and downstream path weight, predict which edges in GPT-2 small's induction circuit are causally important, without any forward passes, ablations, or training data. Spearman Ο=0.623 with path patching ground truth (p ..."
via Arxiv • GLM-5 Team: Aohan Zeng et al. • 2026-02-17
Score: 7.0
"We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintain..."
"Wanted to understand how the core transformer papers actually connect at the concept level - not just "Paper B cites Paper A" but what specific methods, systems, and ideas flow between them.
I ran 12 foundational papers (Attention Is All You Need, BERT, GPT-2/3, Scaling Laws, ViT, LoRA, Chain-of-Th..."
+++ OpenAI and Paradigm just dropped EVMbench, an open-source benchmark measuring whether AI agents can actually find, exploit, and fix smart contract vulnerabilities instead of just hallucinating security theater. +++
"EVMbench is a new open-source benchmark designed to test AI agents on practical smart contract security tasks. The benchmark was developed by OpenAI and Paradigm, and it focuses on real-world vulnerability patterns drawn from audited codebases and contest reports."
via Arxiv • Stephan Rabanser, Sayash Kapoor, Peter Kirgis et al. • 2026-02-18
Score: 6.9
"AI agents are increasingly deployed to execute important tasks. While rising accuracy scores on standard benchmarks suggest rapid progress, many agents still continue to fail in practice. This discrepancy highlights a fundamental limitation of current evaluations: compressing agent behavior into a s..."
via Arxiv • Tomás Vergara-Browne, Darshan Patil, Ivan Titov et al. • 2026-02-17
Score: 6.8
"The superficial alignment hypothesis (SAH) posits that large language models learn most of their knowledge during pre-training, and that post-training merely surfaces this knowledge. The SAH, however, lacks a precise definition, which has led to (i) different and seemingly orthogonal arguments suppo..."
via Arxiv • Shruti Joshi, Aaron Mueller, David Klindt et al. • 2026-02-18
Score: 6.8
"Interpretability research on large language models (LLMs) has yielded important insights into model behaviour, yet recurring pitfalls persist: findings that do not generalise, and causal interpretations that outrun the evidence. Our position is that causal inference specifies what constitutes a vali..."
Topics: Planned obsolescence • AI productivity impact • AI adoption challenges
• "The Phoebus.AI cartel was an international cartel that controlled the manufacture and sale of computer components"
• "Specialisation means that no innovation unrelated to AI gets mind share, investment, patent applications"
via Arxiv • Meirav Segal, Noa Linder, Omer Antverg et al. • 2026-02-17
Score: 6.7
"Large language models and LLM-based agents are increasingly used for cybersecurity tasks that are inherently dual-use. Existing approaches to refusal, spanning academic policy frameworks and commercially deployed systems, often rely on broad topic-based bans or offensive-focused taxonomies. As a res..."
via Arxiv • Potsawee Manakul, Woody Haosheng Gan, Martijn Bartelds et al. • 2026-02-18
Score: 6.7
"Current audio language models are predominantly text-first, either extending pre-trained text LLM backbones or relying on semantic-only audio tokens, limiting general audio modeling. This paper presents a systematic empirical study of native audio foundation models that apply next-token prediction t..."
via Arxiv • Zarif Ikram, Arad Firouzkouhi, Stephen Tu et al. • 2026-02-17
Score: 6.6
"A central challenge in large language model (LLM) editing is capability preservation: methods that successfully change targeted behavior can quietly game the editing proxy and corrupt general capabilities, producing degenerate behaviors reminiscent of proxy/reward hacking. We present CrispEdit, a sc..."
via Arxiv • Yuyan Bu, Xiaohao Liu, ZhaoXing Ren et al. • 2026-02-18
Score: 6.6
"The widespread deployment of large language models (LLMs) across linguistic communities necessitates reliable multilingual safety alignment. However, recent efforts to extend alignment to other languages often require substantial resources, either through large-scale, high-quality supervision in the..."
via Arxiv • Yangjie Xu, Lujun Li, Lama Sleem et al. • 2026-02-18
Score: 6.6
"Agent Skill framework, now widely and officially supported by major players such as GitHub Copilot, LangChain, and OpenAI, performs especially well with proprietary models by improving context engineering, reducing hallucinations, and boosting task accuracy. Based on these observations, an investiga..."
AI MODELS
Gemini 3.1 Pro release
2x SOURCES • 2026-02-19
Score: 6.5
+++ Google's releasing Gemini 3.1 Pro to all users with claims of improved reasoning, marking the first time the search giant has bothered with point releases, suggesting either real progress or excellent marketing timing. +++
via Arxiv • Jessica Hullman, David Broska, Huaman Sun et al. • 2026-02-17
Score: 6.5
"A growing literature uses large language models (LLMs) as synthetic participants to generate cost-effective and nearly instantaneous responses in social science experiments. However, there is limited guidance on when such simulations support valid inference about human behavior. We contrast two stra..."
via Arxiv • Ferdinand Kapl, Emmanouil Angelis, Kaitlin Maile et al. • 2026-02-18
Score: 6.5
"Looping, reusing a block of layers across depth, and depth growing, training shallow-to-deep models by duplicating middle layers, have both been linked to stronger reasoning, but their relationship remains unclear. We provide a mechanistic unification: looped and depth-grown models exhibit convergen..."
"Large language models achieve strong performance on many complex reasoning tasks, yet their accuracy degrades sharply on benchmarks that require compositional reasoning, including ARC-AGI-2, GPQA, MATH, BBH, and HLE. Existing methods improve reasoning by expanding token-level search through chain-of..."
via Arxiv • Shen Zhou Hong, Alex Kleinman, Alyssa Mathiowetz et al. • 2026-02-18
Score: 6.5
"Large language models (LLMs) perform strongly on biological benchmarks, raising concerns that they may help novice actors acquire dual-use laboratory skills. Yet, whether this translates to improved human performance in the physical laboratory remains unclear. To address this, we conducted a pre-reg..."
Topics: Copyright concerns • Microsoft's copyright violations • Debate over fair use
• "It's like we've all collectively decided that copyright just doesn't matter anymore."
• "There are parts of the world where certain developers don't understand the way the west tends to work with regard to copyright."
via Arxiv • Hee Seung Hwang, Xindi Wu, Sanghyuk Chun et al. • 2026-02-18
Score: 6.4
"Fast weight architectures offer a promising alternative to attention-based transformers for long-context modeling by maintaining constant memory overhead regardless of context length. However, their potential is limited by the next-token prediction (NTP) training paradigm. NTP optimizes single-token..."
"Back with v4. Some of you saw v3 β 13.6M params, ternary weights, trained on CPU, completely incoherent output. Went back to the drawing board and rebuilt everything from scratch.
**What it is:**
4.3M parameter language model where every weight in the model body is -1, 0, or +1. Trained for 2 hour..."
Reddit Discussion: 38 comments
Sentiment: BUZZING
Topics: Quantized language models • Efficient model architecture • Advances in model performance
• "The ternary quantization is from BitNet. The architecture (conv mixer in v4, dual delta-rule mixer in v5) is original."
• "A 4.3M parameter ternary model packs into ~850KB. The full v5 target (~70M params) would be ~14MB, which fits entirely in L3 cache on a 7950X3D (96MB V-Cache)."
Topics: Allowed vs. Prohibited Use • SDK Usage Guidelines • Community Engagement
• "They really should simply show a table showing allowed vs prohibited use"
• "We absolutely should be allowed to use OAuth tokens for this stuff"
Topics: Automation in Art • Hiding Creative Processes • Survival of Boring Projects
• "The creative has to hide their process. They lie about how they make their art, and gatekeep the most valuable secrets."
• "LLMs have essentially broken the natural selection of pet projects and allow even bad or not very interesting ideas to survive."
""By applying new methods of machine learning to quantum chemistry research, Heidelberg University scientists have made significant strides in computational chemistry. They have achieved a major breakthrough toward solving a decades-old dilemma in quantum chemistry: the precise and stable calculation..."
via Arxiv • Aloni Cohen, Refael Kohen, Kobbi Nissim et al. • 2026-02-18
Score: 6.1
"Machine unlearning aims to remove specific data points from a trained model, often striving to emulate "perfect retraining", i.e., producing the model that would have been obtained had the deleted data never been included. We demonstrate that this approach, and security definitions that enable it, c..."
"I curate a weekly multimodal AI roundup, here are the vision-related highlights fromΒ last week:
**Qwen3.5-397B-A17B - Native Vision-Language Foundation Model**
* 397B-parameter MoE model with hybrid linear attention that integrates vision natively into the architecture.
* Handles document parsing,..."