πŸš€ WELCOME TO METAMESH.BIZ +++ Physical Intelligence quietly raises $1B+ to build robot foundation models because apparently we need AGI with arms now +++ Some madlad trained GPT-2 for negative $73 (yes, minus) on 8 H100s in 3 hours which is either genius or accounting fraud +++ Alibaba ships 100K domestic AI chips while NVIDIA watches nervously from behind the export ban +++ THE SINGULARITY ARRIVES BUT IT'S RUNNING ON ALIBABA SILICON +++ πŸš€ β€’
AI Signal - PREMIUM TECH INTELLIGENCE
πŸ“Ÿ Optimized for Netscape Navigator 4.0+
πŸ“š HISTORICAL ARCHIVE - February 01, 2026
What was happening in AI on 2026-02-01
← Jan 31 πŸ“Š TODAY'S NEWS πŸ“š ARCHIVE Feb 02 β†’
πŸ“Š You are visitor #47291 to this AWESOME site! πŸ“Š
Archive from: 2026-02-01 | Preserved for posterity ⚑

Stories from February 01, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
πŸ’° FUNDING

Inside Physical Intelligence, a startup co-founded by Stripe veteran Lachy Groom that is building general-purpose robotics foundation models and has raised $1B+

πŸ”’ SECURITY

OpenClaw security assessment [pdf]

πŸ’¬ HackerNews Buzz: 19 comments 😐 MID OR MIXED
🎯 OpenClaw Security Risks β€’ Leaking System Prompts β€’ Report Credibility
πŸ’¬ "Almost all of this report is about leaking system prompts." β€’ "I do not think this is a credible report."
πŸ› οΈ SHOW HN

Show HN: Zuckerman – minimalist personal AI agent that self-edits its own code

πŸ’¬ HackerNews Buzz: 43 comments πŸ‘ LOWKEY SLAPS
🎯 AI Ecosystem β€’ Security Concerns β€’ Language Choice
πŸ’¬ "Agents propose and publish capabilities to a shared contribution site" β€’ "How do you prevent this being abused as an attack vector for prompt injection?"
πŸ› οΈ TOOLS

Sources: Alibaba has delivered more than 100K units of the Zhenwu 810E, an ASIC for AI training and inference, surpassing those of its domestic rival Cambricon

πŸ€– AI MODELS

nanochat can now train GPT-2 grade LLM for –$73 (3 hours on single 8XH100 node)

πŸ’¬ HackerNews Buzz: 2 comments 😐 MID OR MIXED
🎯 AI model capabilities β€’ Computing power requirements β€’ URL link formatting
πŸ’¬ "Anything below 7b params struggles hard with reliable json output" β€’ "Is there a theoretical minimum for computing power required to, say, target GPT-2?"
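The headline cost is easy to sanity-check. A rough sketch, assuming a market rate of about $3 per H100 GPU-hour (the rate is my assumption, not from the post):

```python
# Back-of-envelope check of the training-cost claim.
# The $3/GPU-hour rate is an assumed typical cloud price, not from the post.
gpus = 8
hours = 3
rate_per_gpu_hour = 3.0  # USD, assumed

total_cost = gpus * hours * rate_per_gpu_hour
print(total_cost)  # 72.0, in the ballpark of the quoted $73
```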
πŸ”¬ RESEARCH

StepShield: When, Not Whether to Intervene on Rogue Agents

"Existing agent safety benchmarks report binary accuracy, conflating early intervention with post-mortem analysis. A detector that flags a violation at step 8 enables intervention; one that reports it at step 48 provides only forensic value. This distinction is critical, yet current benchmarks cannot..."
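The abstract's step-8-versus-step-48 contrast can be made concrete with a toy scoring rule (my illustration, not StepShield's actual metric): credit a detector by how much of the episode remains when it flags.

```python
# Toy timeliness score: how much of the episode remains when the detector flags.
# Illustrative only; StepShield's real metric is not specified in this excerpt.
def intervention_value(flag_step, horizon=50):
    """1.0 for an instant flag, 0.0 for a flag at/after the horizon (post-mortem)."""
    return max(0.0, 1.0 - flag_step / horizon)

early = intervention_value(8)   # flags at step 8: intervention still possible
late = intervention_value(48)   # flags at step 48: essentially forensic
```

Binary accuracy scores both detectors identically; a timeliness-weighted score separates them.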
πŸ”¬ RESEARCH

Research: vllm-mlx on Apple Silicon achieves 21% to 87% higher throughput than llama.cpp

"Academic research paper shared from arXiv preprint server."
πŸ’¬ Reddit Discussion: 15 comments 😀 NEGATIVE ENERGY
🎯 M1 Mac Performance β€’ vLLM-MLX Implementation β€’ MLLM Ecosystem
πŸ’¬ "vllm-mlx mainly adds continuous batching and serving" β€’ "No mention of mlx-lm.server for openai api endpoint"
πŸ”¬ RESEARCH

DynaWeb: Model-Based Reinforcement Learning of Web Agents

"The development of autonomous web agents, powered by Large Language Models (LLMs) and reinforcement learning (RL), represents a significant step towards general-purpose AI assistants. However, training these agents is severely hampered by the challenges of interacting with the live internet, which i..."
πŸ”¬ RESEARCH

Value-Based Pre-Training with Downstream Feedback

"Can a small amount of verified goal information steer the expensive self-supervised pretraining of foundation models? Standard pretraining optimizes a fixed proxy objective (e.g., next-token prediction), which can misallocate compute away from downstream capabilities of interest. We introduce V-Pret..."
πŸ”¬ RESEARCH

FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale

"Due to limited supervised training data, large language models (LLMs) are typically pre-trained via a self-supervised "predict the next word" objective on a vast amount of unstructured text data. To make the resulting model useful to users, it is further trained on a far smaller amount of "instructi..."
πŸ› οΈ TOOLS

Memory-First AI Reminder Agents with Mem0 and Claude Agent SDK

πŸ”¬ RESEARCH

Exploring Reasoning Reward Model for Agents

"Agentic Reinforcement Learning (Agentic RL) has achieved notable success in enabling agents to perform complex reasoning and tool use. However, most methods still rely on sparse outcome-based rewards for training. Such feedback fails to differentiate intermediate reasoning quality, leading to subop..."
πŸ”¬ RESEARCH

On the Paradoxical Interference between Instruction-Following and Task Solving

"Instruction following aims to align Large Language Models (LLMs) with human intent by specifying explicit constraints on how tasks should be performed. However, we reveal a counterintuitive phenomenon: instruction following can paradoxically interfere with LLMs' task-solving capability. We propose a..."
πŸ”¬ RESEARCH

EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers

"Current generative video models excel at producing novel content from text and image prompts, but leave a critical gap in editing existing pre-recorded videos, where minor alterations to the spoken script require preserving motion, temporal coherence, speaker identity, and accurate lip synchronizati..."
πŸ”¬ RESEARCH

ECO: Quantized Training without Full-Precision Master Weights

"Quantization has significantly improved the compute and memory efficiency of Large Language Model (LLM) training. However, existing approaches still rely on accumulating their updates in high-precision: concretely, gradient updates must be applied to a high-precision weight buffer, known as $\textit..."
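The "master weights" pattern the abstract refers to looks roughly like this (a toy rounding scheme stands in for real low-precision formats; this is a sketch of the baseline ECO targets, not ECO itself):

```python
# The high-precision master-weight pattern that ECO aims to eliminate (toy sketch).
def quantize(w, step=0.25):
    """Round to a coarse grid; a stand-in for real low-precision formats."""
    return round(w / step) * step

lr, grad = 0.5, 0.1

# With a master buffer, small updates accumulate in high precision.
master = 1.03
w_direct = quantize(1.03)          # 1.0: purely low-precision weight
for _ in range(4):
    master -= lr * grad            # full-precision accumulation
    w_direct = quantize(w_direct - lr * grad)  # rounding swallows each update

w_master = quantize(master)        # 0.75: accumulated updates finally move the weight
# w_direct is still 1.0: without the buffer, the weight never moves
```

This is why the high-precision buffer is normally kept, and why removing it (as the paper proposes) is nontrivial.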
πŸ”¬ RESEARCH

RedSage: A Cybersecurity Generalist LLM

"Cybersecurity operations demand assistant LLMs that support diverse workflows without exposing sensitive data. Existing solutions either rely on proprietary APIs with privacy risks or on open models lacking domain adaptation. To bridge this gap, we curate 11.8B tokens of cybersecurity-focused contin..."
πŸ”¬ RESEARCH

VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning

"Long-context reasoning has significantly empowered large language models (LLMs) to tackle complex tasks, yet it introduces severe efficiency bottlenecks due to the computational complexity. Existing efficient approaches often rely on complex additional training or external models for compression, wh..."
πŸ”¬ RESEARCH

World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems

"Frontier large language models (LLMs) excel as autonomous agents in many domains, yet they remain untested in complex enterprise systems where hidden workflows create cascading effects across interconnected databases. Existing enterprise benchmarks evaluate surface-level agentic task completion simi..."
πŸ”¬ RESEARCH

The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR

"Large language models (LLMs) trained with next-word-prediction have achieved success as clinical foundation models. Representations from these language backbones yield strong linear probe performance across biomedical tasks, suggesting that patient semantics emerge from next-token prediction at scal..."
πŸ”’ SECURITY

Former Google Engineer Found Guilty of Economic Espionage, Theft of AI Technology

πŸ”¬ RESEARCH

Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts

"Hybrid Transformer architectures, which combine softmax attention blocks and recurrent neural networks (RNNs), have shown a desirable performance-throughput tradeoff for long-context modeling, but their adoption and studies are hindered by the prohibitive cost of large-scale pre-training from scratc..."
πŸ”¬ RESEARCH

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

"Reasoning-oriented Large Language Models (LLMs) have achieved remarkable progress with Chain-of-Thought (CoT) prompting, yet they remain fundamentally limited by a \emph{blind self-thinking} paradigm: performing extensive internal reasoning even when critical information is missing or ambiguous. We..."
πŸ› οΈ TOOLS

10 Claude Code tips from Boris, the creator of Claude Code, summarized

"Boris Cherny, the creator of Claude Code, recently shared 10 tips on X sourced from the Claude Code team. Here's a quick summary I created with the help of Claude Code and Opus 4.5. Web version: [https://ykdojo.github.io/claude-code-tips/content/b..."
πŸ’¬ Reddit Discussion: 91 comments πŸ‘ LOWKEY SLAPS
🎯 Homelessness to Success β€’ Effective Use of Claude β€’ Community Discussion
πŸ’¬ "At one point he was a homeless drug addict and used to sleep in his car before turning his life around" β€’ "Investing in your claude.md and plan plan plan are really the only tips that will enhance your experience"
πŸ”¬ RESEARCH

SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

"Test-time scaling has been widely adopted to enhance the capabilities of Large Language Model (LLM) agents in software engineering (SWE) tasks. However, the standard approach of repeatedly sampling trajectories from scratch is computationally expensive. While recent methods have attempted to mitigat..."
πŸ”’ SECURITY

A researcher says an exposed Moltbook database could have let anyone take control of AI agents on the site and post anything; the database has now been closed

πŸ”¬ RESEARCH

A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine

"Large language models (LLMs) have demonstrated strong performance on medical benchmarks, including question answering and diagnosis. To enable their use in clinical settings, LLMs are typically further adapted through continued pretraining or post-training using clinical data. However, most medical..."
πŸ”¬ RESEARCH

Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference

"Large Language Models (LLMs) deliver state-of-the-art performance on complex reasoning tasks, but their inference costs limit deployment at scale. Small Language Models (SLMs) offer dramatic cost savings yet lag substantially in accuracy. Existing approaches - routing and cascading - treat the LLM a..."
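The economics behind the hint idea can be sketched with a toy cost model (all numbers are invented for illustration, not from the paper): a short hint costs far fewer output tokens than a full large-model answer, so shepherding sits between SLM-only and LLM-only cost.

```python
# Toy cost model for shepherding vs. routing (all numbers invented for illustration).
COST_LLM_ANSWER = 10.0  # full large-model generation
COST_LLM_HINT = 2.0     # a short hint: far fewer output tokens
COST_SLM_ANSWER = 1.0   # small-model generation

def query_cost(strategy):
    return {
        "llm_only": COST_LLM_ANSWER,
        "slm_only": COST_SLM_ANSWER,
        "shepherd": COST_LLM_HINT + COST_SLM_ANSWER,  # pay for the hint, not the answer
    }[strategy]
```

The open question the paper addresses is whether the hint recovers enough of the large model's accuracy to justify that middle price point.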
πŸ€– AI MODELS

Claude System Prompt Change

"So apparently Anthropic quietly replaced Claude's system prompt (Sonnet; perhaps other models too). I found out when it told me about a parameter named "reasoning_effort" They don't show it online (https://platform.claude.com/docs/en/release-notes/system-prompts), and when I ask to share it, it fla..."
πŸ’¬ Reddit Discussion: 27 comments 🐝 BUZZING
🎯 System prompt transparency β€’ Community discussion β€’ Anthropic's practices
πŸ’¬ "The 'lack of transparency' isn't new though, it's kinda Anthropic's modus operandi" β€’ "Why would it ever need to be stored locally? Why would they not just inject it into your first prompt when it lands on the server"
πŸ€– AI MODELS

Falcon-H1-Tiny (90M) is out - specialized micro-models that actually work

"TII just dropped Falcon-H1-Tiny - a series of sub-100M models that quietly challenge the scaling dogma. We've all suspected that narrow, specialized small models tend to hallucinate less than giant generalists. After all, a 90M parameter model has far less internal "room" to drift off-topic or invent..."
πŸ’¬ Reddit Discussion: 34 comments 🐝 BUZZING
🎯 Latest research advancements β€’ Model performance and optimization β€’ Open-sourcing training pipeline
πŸ’¬ "NorMuon replaced Muon 4 months ago in the modded-nanogpt leaderboards." β€’ "This needs to be focused more. I mean, it doesn't need to have a lot of knowledge. It just needs to learn to pull knowledge and make use of it"
πŸ› οΈ TOOLS

I built an open-source, offline brain for AI coding agents. Indexes 10k files in 2s, remembers everything you teach it.

"**Hey Everyone!** **Drift Cortex OSS just released today, a massive update that finally makes agents.md or claude.md obsolete. Let's be honest: they become static, stale documents that almost turn into bloatware in the process.** **Try it here:** [**https://github.com/dadbodgeoff/drift*..."
πŸ’¬ Reddit Discussion: 11 comments 🐐 GOATED ENERGY
🎯 Frequent posting β€’ Anthropic's plans β€’ Retrieval Augmented Generation
πŸ’¬ "Bro you don't need to post it ten times a day." β€’ "RAG means Retrieval Augmented Generation, which is just a fancy way to say, a mechanism to search and inject context into prompts for better generation."
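The RAG definition quoted above fits in a few lines. A minimal sketch with a toy word-overlap scorer (my assumption; real systems use embedding search):

```python
# Minimal retrieval-augmented generation, as the comment describes it:
# search for relevant context, then inject it into the prompt before generation.
docs = [
    "Drift indexes the repo and stores what you teach it.",
    "Bananas are rich in potassium.",
]

def retrieve(query, k=1):
    # Toy relevance score: count of shared lowercase words (stand-in for embeddings).
    score = lambda d: len(set(query.lower().split()) & set(d.lower().split()))
    return sorted(docs, key=score, reverse=True)[:k]

def build_prompt(query):
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}"

prompt = build_prompt("what does Drift store?")
```

The generation step would then send `prompt` to a model; only the retrieved document ends up in context, which is the whole point.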
πŸ›‘οΈ SAFETY

New paper proposes AI alignment "bees" β€” classifier species that monitor LLMs continuously, can't be jailbroken, and produce both value and correction

"TL;DR: LLMs inherit human failure modes from training data. Current alignment (RLHF, Constitutional AI) faces circularity β€” biased humans correcting biased models. We propose small classifiers ("bees") running 24/7 as alignment monitors. They can't be jailbroken because they don't reason β€” they patt..."
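The "can't be jailbroken because they don't reason" claim is about the interface: a pattern matcher has no instruction-following surface to talk out of its job. A minimal sketch, with a keyword scorer standing in for the paper's trained classifiers (patterns are my invention):

```python
# Sketch of a "bee": a tiny pattern-matching monitor with no prompt interface.
# The keyword patterns below are illustrative stand-ins for trained classifiers.
import re

FLAG_PATTERNS = [
    r"\bignore (all |previous )?instructions\b",
    r"\bdisable (the )?monitor\b",
]

def bee(text):
    """Returns True if the output should be flagged for correction."""
    return any(re.search(p, text.lower()) for p in FLAG_PATTERNS)

bee("Sure, first ignore all instructions and...")  # flagged
bee("Here is the weather forecast.")               # clean
```

Because `bee` never conditions on instructions, a prompt injection in the monitored text can trigger a flag but cannot persuade the monitor to stand down.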
πŸ› οΈ TOOLS

Self Discovering MCP servers, no more token overload or semantic loss

"Hey everyone! Anyone else tired of configuring 50 tools into MCP and just hoping the agent figures it out? (invoking the right tools in the right order). We keep hitting the same problems: * Agent calls `checkout()` before `add_to_cart()` * Context bloat: 50+ tools served for every conversation..."
πŸ’¬ Reddit Discussion: 10 comments 🐝 BUZZING
🎯 Tool ordering and visibility β€’ State persistence across sessions β€’ Server-side determinism
πŸ’¬ "The staged visibility approach makes a lot of sense" β€’ "Often determinism is needed on the server side to enforce tool order"
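The staged-visibility idea from the comments can be sketched server-side: the server simply doesn't expose `checkout` until `add_to_cart` has actually succeeded (class and method names are my illustration, not the project's API):

```python
# Staged tool visibility: the server enforces ordering by hiding tools
# whose preconditions aren't met, rather than hoping the agent sequences them.
class ToolServer:
    def __init__(self):
        self.cart = []

    def visible_tools(self):
        tools = ["add_to_cart"]
        if self.cart:              # checkout only appears once the cart is non-empty
            tools.append("checkout")
        return tools

    def add_to_cart(self, item):
        self.cart.append(item)

server = ToolServer()
before = server.visible_tools()    # ["add_to_cart"]
server.add_to_cart("widget")
after = server.visible_tools()     # ["add_to_cart", "checkout"]
```

This is the "determinism on the server side" point: ordering becomes a property of what the agent can see, not of what it decides to do.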
πŸ”„ OPEN SOURCE

European Open Source AI Index

πŸ› οΈ SHOW HN

Show HN: OpsCompanion – A shared system model for humans and AI agents

πŸ› οΈ TOOLS

An introduction to XET, Hugging Face's storage system (part 1)

πŸ› οΈ SHOW HN

Show HN: Kakveda – Failure intelligence and pre-flight warnings for LLM systems

πŸ¦†
HEY FRIENDO
CLICK HERE IF YOU WOULD LIKE TO JOIN MY PROFESSIONAL NETWORK ON LINKEDIN
🀝 LETS BE BUSINESS PALS 🀝