πŸš€ WELCOME TO METAMESH.BIZ +++ Mathematicians testing AI on unpublished proofs discover models can't actually do math they haven't memorized (shocking absolutely no one) +++ GitHub agent configs universally compromised because apparently YAML security is harder than AGI +++ LLaMA-70B matching GPT-4 on neuroscience benchmarks proving open source can also confidently hallucinate about synapses +++ Memory alignment fixes LLM judges but we're still trusting machines to judge machines judging machines +++ TOMORROW'S SLOP WILL BE INDISTINGUISHABLE FROM TODAY'S RESEARCH PAPERS +++ πŸš€ β€’
AI Signal - PREMIUM TECH INTELLIGENCE
πŸ“Ÿ Optimized for Netscape Navigator 4.0+
πŸ“š HISTORICAL ARCHIVE - February 08, 2026
What was happening in AI on 2026-02-08
πŸ“Š You are visitor #47291 to this AWESOME site! πŸ“Š
Archive from: 2026-02-08 | Preserved for posterity ⚑

Stories from February 08, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
πŸ”’ SECURITY

Prompt injection is killing our self-hosted LLM deployment

"We moved to self-hosted models specifically to avoid sending customer data to external APIs. Everything was working fine until last week when someone from QA tried injecting prompts during testing and our entire system prompt got dumped in the response. Now I'm realizing we have zero protection aga..."
πŸ’¬ Reddit Discussion: 227 comments πŸ‘ LOWKEY SLAPS
🎯 Preventing model abuse β€’ Isolating model access β€’ Security architecture design
πŸ’¬ "Treat the LLM like a hostile user with read access to your system prompts" β€’ "The only way to prevent an LLM from abusing a tool is to not give it to it in the first place"
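The "hostile user with read access to your system prompts" advice can be sketched as an output-side guard: before returning a response, check whether it reproduces a long run of the system prompt. This is a minimal illustration, not a complete defense (paraphrased leaks and encodings will slip past an n-gram check); the function name and threshold are hypothetical.

```python
def leaks_system_prompt(system_prompt: str, model_output: str, ngram: int = 8) -> bool:
    """Return True if the output reproduces any ngram-word run of the system prompt.

    A crude last-line check: real deployments would combine this with
    paraphrase detection and never put secrets in the prompt at all.
    """
    words = system_prompt.lower().split()
    haystack = " ".join(model_output.lower().split())
    for i in range(len(words) - ngram + 1):
        if " ".join(words[i:i + ngram]) in haystack:
            return True
    return False
```

A response that trips the check would be blocked or regenerated rather than shown to the user.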
πŸ”¬ RESEARCH

KV Cache Transform Coding for Compact Storage in LLM Inference

πŸ”¬ RESEARCH

Q&A with mathematicians behind the β€œFirst Proof” experiment, which tests AI's mathematical competence on questions drawn from the authors' unpublished research

πŸ”’ SECURITY

Slop Terrifies Me

πŸ’¬ HackerNews Buzz: 269 comments 😐 MID OR MIXED
🎯 Software Quality vs. Profitability β€’ Economic Disruption from AI β€’ Generational Shift in Programming Practices
πŸ’¬ "There is nothing surprising here, it's been this way for many years and will continue." β€’ "If someone's shit-coded program hangs and crashes frequently, in this day and age, we don't have to put up with it any longer."
πŸ›‘οΈ SAFETY

[R] How should we govern AI agents that can act autonomously? Built a framework, looking for input

"As agents move from chatbots to systems that execute code, and coordinate with other agents, the governance gap is real. We have alignment research for models, but almost nothing for operational controls at the instance level, you know, the runtime boundaries, kill switches, audit trails, and certif..."
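One of the operational controls the post names, audit trails, is cheap to prototype: wrap every tool an agent can call so each invocation is appended to a tamper-evident log before the result goes back to the model. A minimal sketch under assumed names (`audited`, `audit.jsonl` are illustrative, not from the linked framework):

```python
import json
import time

def audited(tool, name, log_path="audit.jsonl"):
    """Wrap an agent tool so every call is appended to a JSONL audit log."""
    def wrapper(*args, **kwargs):
        entry = {"ts": time.time(), "tool": name,
                 "args": repr(args), "kwargs": repr(kwargs)}
        result = tool(*args, **kwargs)
        entry["result"] = repr(result)
        with open(log_path, "a") as f:
            f.write(json.dumps(entry) + "\n")
        return result
    return wrapper
```

A kill switch is the same idea inverted: the wrapper consults a revocation flag and refuses to dispatch when it is set.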
πŸ€– AI MODELS

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

πŸ”¬ RESEARCH

Open vs closed on hard neuroscience/BCI eval: LLaMA-70B β‰ˆ frontier; Qwen MoE pulls ahead

"We just released v1 of a domain-specific neuroscience/BCI multiple-choice eval (500 questions). A few things surprised us enough to share: * Eval generated in a single pass under strict constraints (no human review, no regeneration, no polishing). * Despite that, frontier models cluster very..."
πŸ› οΈ SHOW HN

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

πŸ’¬ HackerNews Buzz: 87 comments 😐 MID OR MIXED
🎯 Local-first AI agents β€’ Security and privacy β€’ Observability and transparency
πŸ’¬ "the paradigm of how we interact with our devices will fundamentally shift in the next 5-10 years" β€’ "I think the project is a great idea. Really a structured framework around local, persistent memory with semantic search is the most important bit"
πŸ”¬ RESEARCH

DFlash: Block Diffusion for Flash Speculative Decoding

"Autoregressive large language models (LLMs) deliver strong performance but require inherently sequential decoding, leading to high inference latency and poor GPU utilization. Speculative decoding mitigates this bottleneck by using a fast draft model whose outputs are verified in parallel by the targ..."
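The draft-then-verify loop the abstract describes can be sketched in a few lines (greedy variant, toy next-token callables standing in for real models; this illustrates generic speculative decoding, not DFlash's block-diffusion drafting):

```python
def speculative_step(prefix, draft_next, target_next, k=4):
    """One draft-then-verify step of greedy speculative decoding.

    draft_next / target_next map a token sequence to its next token.
    Returns the tokens actually accepted this step."""
    # Cheap draft model proposes k tokens autoregressively.
    proposal, seq = [], list(prefix)
    for _ in range(k):
        t = draft_next(seq)
        proposal.append(t)
        seq.append(t)
    # Target model checks each position; in a real system these k
    # checks happen in a single parallel forward pass.
    accepted, seq = [], list(prefix)
    for t in proposal:
        want = target_next(seq)
        if want != t:
            accepted.append(want)  # first mismatch: keep target's token, stop
            break
        accepted.append(t)
        seq.append(t)
    return accepted
```

When the draft agrees with the target, all k tokens land for roughly the cost of one target pass; that amortization is where the latency win comes from.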
πŸ› οΈ TOOLS

Top AI models fail at >96% of tasks

πŸ’¬ HackerNews Buzz: 1 comment πŸ‘ LOWKEY SLAPS
🎯 Commercial LLM performance β€’ AI capabilities growth β€’ AI limitations
πŸ’¬ "Capabilities grow very fast." β€’ "You think AI can replace programmers, today?"
πŸ›‘οΈ SAFETY

Framing an LLM as a safety researcher changes its language, not its judgement

πŸ› οΈ TOOLS

Are AI agents ready for the workplace? A new benchmark raises doubts

πŸ› οΈ SHOW HN

Show HN: We audited AI agent configs on GitHub. Every one had security issues

πŸ€– AI MODELS

MemAlign: Building Better LLM Judges from Human Feedback with Scalable Memory

πŸ› οΈ SHOW HN

Show HN: AI Watermark and Stego Scanner

πŸ”¬ RESEARCH

SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

"Deep research agents have emerged as powerful systems for addressing complex queries. Meanwhile, LLM-based retrievers have demonstrated strong capability in following instructions or reasoning. This raises a critical question: can LLM-based retrievers effectively contribute to deep research agent wo..."
πŸ”¬ RESEARCH

KV-CoRE: Benchmarking Data-Dependent Low-Rank Compressibility of KV-Caches in LLMs

"Large language models rely on kv-caches to avoid redundant computation during autoregressive decoding, but as context length grows, reading and writing the cache can quickly saturate GPU memory bandwidth. Recent work has explored KV-cache compression, yet most approaches neglect the data-dependent n..."
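The low-rank compression being benchmarked can be sketched with a truncated SVD on one head's cache slice: store two thin factors instead of the full matrix, and reconstruct on read. A minimal numpy sketch (function names are illustrative; KV-CoRE's point is precisely that the safe rank is data-dependent):

```python
import numpy as np

def lowrank_compress(kv: np.ndarray, rank: int):
    """Compress a (seq_len, head_dim) cache slice to rank-r factors."""
    U, S, Vt = np.linalg.svd(kv, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # (seq_len, rank)
    B = Vt[:rank]                # (rank, head_dim)
    return A, B                  # store these instead of kv

def lowrank_decompress(A: np.ndarray, B: np.ndarray) -> np.ndarray:
    return A @ B
```

For seq_len Β· head_dim entries, storage drops to rank Β· (seq_len + head_dim), a win whenever the rank needed for acceptable reconstruction error is small relative to head_dim.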
πŸ› οΈ TOOLS

[D][Showcase] MCP-powered Autonomous AI Research Engineer (Claude Desktop, Code Execution)

"Hey r/MachineLearning, I’ve been working on an MCP-powered β€œAI Research Engineer” and wanted to share it here for feedback and ideas. GitHub: https://github.com/prabureddy/ai-research-agent-mcp If it looks useful, a ⭐ on the repo really help..."
πŸ”¬ RESEARCH

DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching

"Multi-agent systems built from prompted large language models can improve multi-round reasoning, yet most existing pipelines rely on fixed, trajectory-wide communication patterns that are poorly matched to the stage-dependent needs of iterative problem solving. We introduce DyTopo, a manager-guided..."
πŸ”¬ RESEARCH

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

"High-quality kernels are critical for scalable AI systems, and enabling LLMs to generate such code would advance AI development. However, training LLMs for this task requires sufficient data, a robust environment, and the process is often vulnerable to reward hacking and lazy optimization. In these ca..."
πŸ”¬ RESEARCH

DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs

"Diffusion large language models (dLLMs) have emerged as a promising alternative for text generation, distinguished by their native support for parallel decoding. In practice, block inference is crucial for avoiding order misalignment in global bidirectional decoding and improving output quality. How..."
πŸ”¬ RESEARCH

AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions

"Large language model (LLM)-based agents are increasingly expected to negotiate, coordinate, and transact autonomously, yet existing benchmarks lack principled settings for evaluating language-mediated economic interaction among multiple agents. We introduce AgenticPay, a benchmark and simulation fra..."
🌐 POLICY

AI companies spent $55.5M lobbying in 9 months. Their interpretability research teams are a fraction of that. I modeled the game theory of why opacity is the dominant strategy.

"External link discussion - see full content at original source."
πŸ”¬ RESEARCH

Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

"Memory is increasingly central to Large Language Model (LLM) agents operating beyond a single context window, yet most existing systems rely on offline, query-agnostic memory construction that can be inefficient and may discard query-critical information. Although runtime memory utilization is a nat..."
πŸ€– AI MODELS

Anthropic rolls out a fast mode for Claude Opus 4.6 in research preview, saying it offers the same model quality 2.5 times faster but costs six times more

πŸ”¬ RESEARCH

Multi-Token Prediction via Self-Distillation

"Existing techniques for accelerating language model inference, such as speculative decoding, require training auxiliary speculator models and building and deploying complex inference pipelines. We consider a new approach for converting a pretrained autoregressive language model from a slow single ne..."
πŸ› οΈ SHOW HN

Show HN: Lucid – Use LLM hallucination to generate verified software specs

πŸ”¬ RESEARCH

[R] Identifying the "Complexity Kink": An Econometric Analysis of AI Marginal Productivity Collapse in Multi-Asset Tasks

"I’ve been working on quantifying the structural limits of LLM/Agentic framework productivity beyond standard benchmarks. Using the Scale AI Remote Labor Index (RLI) and market microdata, I modeled the interaction between inference density and coordination cost. The goal was to identify the exact co..."
πŸ’¬ Reddit Discussion: 6 comments 🐐 GOATED ENERGY
🎯 Technical Discussion β€’ Model Improvement β€’ Prompt Engineering
πŸ’¬ "I'm not qualified to give an actual critique, but I will try a bit anyway." β€’ "Entropy is usually logarithmic, no? I guess you are taking a log in your model so that checks out in the end I guess."
πŸ”’ SECURITY

Matchlock: Linux-based sandboxing for AI agents

πŸ’¬ HackerNews Buzz: 53 comments 🐝 BUZZING
🎯 Sandboxing security limitations β€’ Container runtime security risks β€’ Need for vendor-independent sandboxing
πŸ’¬ "The real danger comes from the agent being able to read 3rd party data, be prompt injected, and then change or exfiltrate sensitive data." β€’ "if the agent can call arbitrary syscalls inside the container, you're one kernel bug away from a breakout."
πŸ”’ SECURITY

Anthropic: Latest Claude model finds more than 500 vulnerabilities

⚑ BREAKTHROUGH

Sanskrit AI beats CleanRL SOTA by 125%

πŸ”¬ RESEARCH

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

"As large language models become smaller and more efficient, small reasoning models (SRMs) are crucial for enabling chain-of-thought (CoT) reasoning in resource-constrained settings. However, they are prone to faithfulness hallucinations, especially in intermediate reasoning steps. Existing mitigatio..."
πŸ› οΈ SHOW HN

Show HN: Agent-fetch – Sandboxed HTTP client with SSRF protection for AI agents

πŸ› οΈ SHOW HN

Show HN: AgentLens – Open-source observability and audit trail for AI agents

πŸ”¬ RESEARCH

Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering

"Large language models (LLMs) exhibit persistent miscalibration, especially after instruction tuning and preference alignment. Modified training objectives can improve calibration, but retraining is expensive. Inference-time steering offers a lightweight alternative, yet most existing methods optimiz..."
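Generic inference-time steering, the family CORAL belongs to, is simple to sketch: derive a direction in activation space from contrasting runs, then nudge the residual stream along it at generation time. This is the plain mean-difference variant, not CORAL's correctness-optimized, calibration-aware objective; all names here are illustrative.

```python
import numpy as np

def contrastive_direction(pos: np.ndarray, neg: np.ndarray) -> np.ndarray:
    """Mean-difference direction between activations from desirable
    and undesirable runs, each of shape (n_samples, hidden_dim)."""
    return pos.mean(axis=0) - neg.mean(axis=0)

def steer(hidden: np.ndarray, direction: np.ndarray, alpha: float = 4.0) -> np.ndarray:
    """Shift a residual-stream activation along the unit steering direction.

    In practice this runs inside a forward hook at a chosen layer;
    alpha trades steering strength against off-target distortion."""
    d = direction / np.linalg.norm(direction)
    return hidden + alpha * d
```

No retraining is involved, which is the appeal: the base weights are untouched and the intervention can be toggled per request.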
πŸŽ“ EDUCATION

What did we learn from the AI Village in 2025?

πŸ”¬ RESEARCH

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training

"Long reasoning models often struggle in multilingual settings: they tend to reason in English for non-English questions; when constrained to reasoning in the question language, accuracies drop substantially. The struggle is caused by the limited abilities for both multilingual question understanding..."
πŸ”¬ RESEARCH

DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training

"Training reinforcement learning (RL) systems in real-world environments remains challenging due to noisy supervision and poor out-of-domain (OOD) generalization, especially in LLM post-training. Recent distributional RL methods improve robustness by modeling values with multiple quantile points, but..."
πŸ› οΈ SHOW HN

Show HN: A local-first documentation tool for AI agents (MCP)
