AI News Archive - January 16, 2026 | Metamesh Intelligence

🔬 RESEARCH

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

via Arxiv 👤 Xingjun Ma, Yixu Wang, Hengyuan Xu et al. 📅 2026-01-15

⚡ Score: 8.1

"The rapid evolution of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) has produced substantial gains in reasoning, perception, and generative capability across language and vision. However, whether these advances yield commensurate improvements in safety remains unclear, i..."

🔒 SECURITY

Letter: a top House Republican warns severe DRAM and HBM3E supply shortages will constrain H200 export licenses; Nvidia says it “can serve all approved” orders

via Techmeme 👤 Bloomberg 📅 2026-01-15

⚡ Score: 8.0

🔬 RESEARCH

The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware

via Arxiv 👤 Ben Nassi, Bruce Schneier, Oleg Brodt 📅 2026-01-14

⚡ Score: 8.0

"The rapid adoption of large language model (LLM)-based systems -- from chatbots to autonomous agents capable of executing code and financial transactions -- has created a new attack surface that existing security frameworks inadequately address. The dominant framing of these threats as "prompt injec..."

🛠️ SHOW HN

Show HN: Gambit, an open-source agent harness for building reliable AI agents

via HackerNews 👤 randall 📅 2026-01-16

🔺 71 pts ⚡ Score: 8.0

💬 HackerNews Buzz: 15 comments 🐝 BUZZING

🎯 Agent reliability • Agent accountability • Predictable agent systems

💬 "Reliability came more from reducing degrees of freedom than from adding intelligence." • "Each step had an explicit goal, explicit inputs, and a defined end."

🔬 RESEARCH

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

via Arxiv 👤 Christopher Clark, Jieyu Zhang, Zixian Ma et al. 📅 2026-01-15

⚡ Score: 7.9

"Today's strongest video-language models (VLMs) remain proprietary. The strongest open-weight models either rely on synthetic data from proprietary VLMs, effectively distilling from them, or do not disclose their training data or recipe. As a result, the open-source community lacks the foundations ne..."

🔒 SECURITY

OpenAI Asking Contractors to Upload Work from Past Jobs to Evaluate AI Agents

via HackerNews 👤 rbanffy 📅 2026-01-16

🔺 9 pts ⚡ Score: 7.8

🔬 RESEARCH

On the origin of neural scaling laws: from random graphs to natural language

via Arxiv 👤 Maissam Barkeshli, Alberto Alfarano, Andrey Gromov 📅 2026-01-15

⚡ Score: 7.8

"Scaling laws have played a major role in the modern AI revolution, providing practitioners predictive power over how the model performance will improve with increasing data, compute, and number of model parameters. This has spurred an intense interest in the origin of neural scaling laws, with a com..."

🛡️ SAFETY

Training large language models on narrow tasks can lead to broad misalignment

via HackerNews 👤 thebeardisred 📅 2026-01-16

🔺 5 pts ⚡ Score: 7.6

🔒 SECURITY

Ask HN: LLM Poisoning Resources

via HackerNews 👤 totallygeeky 📅 2026-01-16

🔺 3 pts ⚡ Score: 7.5

🔬 RESEARCH

Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay

via Arxiv 👤 Hao Wang, Yanting Wang, Hao Li et al. 📅 2026-01-15

⚡ Score: 7.3

"Large Language Models (LLMs) have achieved remarkable capabilities but remain vulnerable to adversarial ``jailbreak'' attacks designed to bypass safety guardrails. Current safety alignment methods depend heavily on static external red teaming, utilizing fixed defense prompts or pre-collected adversa..."

🔬 RESEARCH

Empathy Applicability Modeling for General Health Queries

via Arxiv 👤 Shan Randhawa, Agha Ali Raza, Kentaro Toyama et al. 📅 2026-01-14

⚡ Score: 7.0

"LLMs are increasingly being integrated into clinical workflows, yet they often lack clinical empathy, an essential aspect of effective doctor-patient communication. Existing NLP frameworks focus on reactively labeling empathy in doctors' responses but offer limited support for anticipatory modeling..."

🔬 RESEARCH

Generative AI collective behavior needs an interactionist paradigm

via Arxiv 👤 Laura Ferrarotti, Gian Maria Campedelli, Roberto Dessì et al. 📅 2026-01-15

⚡ Score: 7.0

"In this article, we argue that understanding the collective behavior of agents based on large language models (LLMs) is an essential area of inquiry, with important implications in terms of risks and benefits, impacting us as a society at many levels. We claim that the distinctive nature of LLMs--na..."

🤖 AI MODELS

RAG-select: an end-to-end optimization package for selecting RAG architectures

via HackerNews 👤 agnim25 📅 2026-01-16

🔺 1 pts ⚡ Score: 7.0

🔬 RESEARCH

LLMs can Compress LLMs: Adaptive Pruning by Agents

via Arxiv 👤 Sai Varun Kodathala, Rakesh Vunnam 📅 2026-01-14

⚡ Score: 7.0

"As Large Language Models (LLMs) continue to scale, post-training pruning has emerged as a promising approach to reduce computational costs while preserving performance. Existing methods such as SparseGPT and Wanda achieve high sparsity through layer-wise weight reconstruction or activation-aware mag..."

🔬 RESEARCH

From Prompt to Protocol: Fast Charging Batteries with Large Language Models

via Arxiv 👤 Ge Lei, Ferran Brosa Planella, Sterling G. Baird et al. 📅 2026-01-14

⚡ Score: 6.9

"Efficiently optimizing battery charging protocols is challenging because each evaluation is slow, costly, and non-differentiable. Many existing approaches address this difficulty by heavily constraining the protocol search space, which limits the diversity of protocols that can be explored, preventi..."

🤖 AI MODELS

Cutting LLM token Usage by ~80% using REPL driven document analysis

via HackerNews 👤 yogthos 📅 2026-01-16

🔺 3 pts ⚡ Score: 6.9

🔬 RESEARCH

Value-Aware Numerical Representations for Transformer Language Models

via Arxiv 👤 Andreea Dutulescu, Stefan Ruseti, Mihai Dascalu 📅 2026-01-14

⚡ Score: 6.8

"Transformer-based language models often achieve strong results on mathematical reasoning benchmarks while remaining fragile on basic numerical understanding and arithmetic operations. A central limitation is that numbers are processed as symbolic tokens whose embeddings do not explicitly encode nume..."

🔬 RESEARCH

DR-Arena: an Automated Evaluation Framework for Deep Research Agents

via Arxiv 👤 Yiwen Gao, Ruochen Zhao, Yang Deng et al. 📅 2026-01-15

⚡ Score: 6.8

"As Large Language Models (LLMs) increasingly operate as Deep Research (DR) Agents capable of autonomous investigation and information synthesis, reliable evaluation of their task performance has become a critical bottleneck. Current benchmarks predominantly rely on static datasets, which suffer from..."

🔬 RESEARCH

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

via Arxiv 👤 Chi-Pin Huang, Yunze Man, Zhiding Yu et al. 📅 2026-01-14

⚡ Score: 6.8

"Vision-Language-Action (VLA) tasks require reasoning over complex visual scenes and executing adaptive actions in dynamic environments. While recent studies on reasoning VLAs show that explicit chain-of-thought (CoT) can improve generalization, they suffer from high inference latency due to lengthy..."

🔬 RESEARCH

Toward Understanding Unlearning Difficulty: A Mechanistic Perspective and Circuit-Guided Difficulty Metric

via Arxiv 👤 Jiali Cheng, Ziheng Chen, Chirag Agarwal et al. 📅 2026-01-14

⚡ Score: 6.8

"Machine unlearning is becoming essential for building trustworthy and compliant language models. Yet unlearning success varies considerably across individual samples: some are reliably erased, while others persist despite the same procedure. We argue that this disparity is not only a data-side pheno..."

🔬 RESEARCH

Automating Supply Chain Disruption Monitoring via an Agentic AI Approach

via Arxiv 👤 Sara AlMahri, Liming Xu, Alexandra Brintrup 📅 2026-01-14

⚡ Score: 6.7

"Modern supply chains are increasingly exposed to disruptions from geopolitical events, demand shocks, trade restrictions, to natural disasters. While many of these disruptions originate deep in the supply network, most companies still lack visibility beyond Tier-1 suppliers, leaving upstream vulnera..."

🔬 RESEARCH

Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems

via Arxiv 👤 Xi Shi, Mengxin Zheng, Qian Lou 📅 2026-01-15

⚡ Score: 6.7

"Multi-agent systems (MAS) enable complex reasoning by coordinating multiple agents, but often incur high inference latency due to multi-step execution and repeated model invocations, severely limiting their scalability and usability in time-sensitive scenarios. Most existing approaches primarily opt..."

🛡️ SAFETY

Andrea Vallone, who left OpenAI in November as the head of its safety research team, joins Anthropic's alignment team

via Techmeme 👤 Theverge 📅 2026-01-15

⚡ Score: 6.7

🔬 RESEARCH

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

via Arxiv 👤 Zhiyuan Hu, Yunhai Hu, Juncheng Liu et al. 📅 2026-01-14

⚡ Score: 6.7

"Multi-agent systems have evolved into practical LLM-driven collaborators for many applications, gaining robustness from diversity and cross-checking. However, multi-agent RL (MARL) training is resource-intensive and unstable: co-adapting teammates induce non-stationarity, and rewards are often spars..."

🔬 RESEARCH

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation

via Arxiv 👤 Yibo Wang, Lei Wang, Yue Deng et al. 📅 2026-01-14

⚡ Score: 6.7

"Deep research systems are widely used for multi-step web research, analysis, and cross-source synthesis, yet their evaluation remains challenging. Existing benchmarks often require annotation-intensive task construction, rely on static evaluation dimensions, or fail to reliably verify facts when cit..."

🔬 RESEARCH

Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

via Arxiv 👤 Zirui Ren, Ziming Liu 📅 2026-01-15

⚡ Score: 6.7

"Hierarchical reasoning model (HRM) achieves extraordinary performance on various reasoning tasks, significantly outperforming large language model-based reasoners. To understand the strengths and potential failure modes of HRM, we conduct a mechanistic study on its reasoning patterns and find three..."

🔬 RESEARCH

LLM for Large-Scale Optimization Model Auto-Formulation: A Lightweight Few-Shot Learning Approach

via Arxiv 👤 Kuo Liang, Yuhang Lu, Jianming Mao et al. 📅 2026-01-14

⚡ Score: 6.7

"Large-scale optimization is a key backbone of modern business decision-making. However, building these models is often labor-intensive and time-consuming. We address this by proposing LEAN-LLM-OPT, a LightwEight AgeNtic workflow construction framework for LLM-assisted large-scale OPTimization auto-f..."

🔬 RESEARCH

MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching

via Arxiv 👤 Changle Qu, Sunhao Dai, Hengyi Cai et al. 📅 2026-01-15

⚡ Score: 6.6

"Tool-Integrated Reasoning (TIR) empowers large language models (LLMs) to tackle complex tasks by interleaving reasoning steps with external tool interactions. However, existing reinforcement learning methods typically rely on outcome- or trajectory-level rewards, assigning uniform advantages to all..."

🔬 RESEARCH

Representation-Aware Unlearning via Activation Signatures: From Suppression to Knowledge-Signature Erasure

via Arxiv 👤 Syed Naveed Mahmood, Md. Rezaur Rahman Bhuiyan, Tasfia Zaman et al. 📅 2026-01-15

⚡ Score: 6.6

"Selective knowledge erasure from LLMs is critical for GDPR compliance and model safety, yet current unlearning methods conflate behavioral suppression with true knowledge removal, allowing latent capabilities to persist beneath surface-level refusals. In this work, we address this challenge by intro..."

🔬 RESEARCH

Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection

via Arxiv 👤 Tianyi Niu, Justin Chih-Yao Chen, Genta Indra Winata et al. 📅 2026-01-14

⚡ Score: 6.6

"Large Language Model (LLM) routers dynamically select optimal models for given inputs. Existing approaches typically assume access to ground-truth labeled data, which is often unavailable in practice, especially when user request distributions are heterogeneous and unknown. We introduce Routing with..."

🔬 RESEARCH

Contextual StereoSet: Stress-Testing Bias Alignment Robustness in Large Language Models

via Arxiv 👤 Abhinaba Basu, Pavan Chakraborty 📅 2026-01-15

⚡ Score: 6.6

"A model that avoids stereotypes in a lab benchmark may not avoid them in deployment. We show that measured bias shifts dramatically when prompts mention different places, times, or audiences -- no adversarial prompting required. We introduce Contextual StereoSet, a benchmark that holds stereotype..."

🛠️ TOOLS

Cursor's latest “browser experiment” implied success without evidence

via HackerNews 👤 embedding-shape 📅 2026-01-16

🔺 247 pts ⚡ Score: 6.5

💬 HackerNews Buzz: 110 comments 🐝 BUZZING

🎯 Browser development • AI-driven coding • Fundraising strategy

💬 "it's just plain slop" • "these things will get better"

🔬 RESEARCH

Defending Large Language Models Against Jailbreak Attacks via In-Decoding Safety-Awareness Probing

via Arxiv 👤 Yinzhi Zhao, Ming Wang, Shi Feng et al. 📅 2026-01-15

⚡ Score: 6.5

"Large language models (LLMs) have achieved impressive performance across natural language tasks and are increasingly deployed in real-world applications. Despite extensive safety alignment efforts, recent studies show that such alignment is often shallow and remains vulnerable to jailbreak attacks...."

🔬 RESEARCH

Grounding Agent Memory in Contextual Intent

via Arxiv 👤 Ruozhen Yang, Yucheng Jiang, Yueqi Jiang et al. 📅 2026-01-15

⚡ Score: 6.5

"Deploying large language models in long-horizon, goal-oriented interactions remains challenging because similar entities and facts recur under different latent goals and constraints, causing memory systems to retrieve context-mismatched evidence. We propose STITCH (Structured Intent Tracking in Cont..."

🔧 INFRASTRUCTURE

OpenAI says it issued a request for proposals to US-based hardware manufacturers as it seeks to push into consumer devices, robotics, and cloud data centers

via Techmeme 👤 Bloomberg 📅 2026-01-15

⚡ Score: 6.5

🤖 AI MODELS

Google releases TranslateGemma, a suite of Gemma 3-based open translation models available in 4B-, 12B-, and 27B-parameter sizes, with support for 55 languages

via Techmeme 👤 Blog 📅 2026-01-15

⚡ Score: 6.5

🔧 INFRASTRUCTURE

SkyVM (By Dioxus Labs): Instant-Boot Desktop VMs for AI Agents

via HackerNews 👤 satvikpendem 📅 2026-01-16

🔺 5 pts ⚡ Score: 6.5

🏢 BUSINESS

The US says Taiwanese companies will invest $250B+ in chip production in the US as part of a trade deal, with Taiwanese government guaranteeing $250B in credit

via Techmeme 👤 Cnbc 📅 2026-01-15

⚡ Score: 6.4

🛠️ TOOLS

The Wikimedia Foundation says Microsoft, Meta, Amazon, Perplexity, and Mistral joined Wikimedia Enterprise to get “tuned” API access; Google is already a member

via Techmeme 👤 Theverge 📅 2026-01-15

⚡ Score: 6.3

🤖 AI MODELS

Open-Weight Models Are Getting Serious: GLM 4.7 vs. MiniMax M2.1

via HackerNews 👤 kristianp 📅 2026-01-16

🔺 2 pts ⚡ Score: 6.2

🤖 AI MODELS

AI as a Compression Problem

via HackerNews 👤 pabs3 📅 2026-01-16

🔺 1 pts ⚡ Score: 6.2

🛠️ SHOW HN

Show HN: 1Code – open-source Cursor-like UI for Claude Code

via HackerNews 👤 Bunas 📅 2026-01-15

🔺 9 pts ⚡ Score: 6.1

💬 HackerNews Buzz: 15 comments 👍 LOWKEY SLAPS

🎯 Deployment issues • Pricing concerns • Comparisons to existing tools

💬 "fyi: it does not build for me from the source code." • "The pricing is ridiculous. It doesn't include the Claude subscription so $20/m is way out of league for a UI."

🛠️ TOOLS

From AI agent prototype to product: Lessons from building AWS DevOps Agent

via HackerNews 👤 malahay 📅 2026-01-16

🔺 4 pts ⚡ Score: 6.1

⚖️ ETHICS

ChatGPT wrote "Goodnight Moon" suicide lullaby for man who later killed himself

via HackerNews 👤 mirabilis 📅 2026-01-15

🔺 15 pts ⚡ Score: 6.1

💬 HackerNews Buzz: 16 comments 😤 NEGATIVE ENERGY

🎯 AI Risks • Algorithmic Bias • Responsibility of Tech Companies

💬 "these things come with their own tradeoffs" • "the attention model (and its finite size) causes the suicidal person's discourse to slowly displace any constraints"

🔬 RESEARCH

ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation

via Arxiv 👤 Sicong Liu, Yanxian Huang, Mingwei Liu et al. 📅 2026-01-14

⚡ Score: 6.1

"Code generation tasks aim to automate the conversion of user requirements into executable code, significantly reducing manual development efforts and enhancing software productivity. The emergence of large language models (LLMs) has significantly advanced code generation, though their efficiency is..."

🔬 RESEARCH

Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs

via Arxiv 👤 Yuxi Xia, Loris Schoenegger, Benjamin Roth 📅 2026-01-15

⚡ Score: 6.1

"Large language models (LLMs) can increase users' perceived trust by verbalizing confidence in their outputs. However, prior work has shown that LLMs are often overconfident, making their stated confidence unreliable since it does not consistently align with factual accuracy. To better understand the..."

💰 FUNDING

Brain-computer interface startup Merge Labs, which Sam Altman co-founded, raised a $252M seed from OpenAI, Bain Capital, Gabe Newell, and others

via Techmeme 👤 Bloomberg 📅 2026-01-15

⚡ Score: 6.0

Stories from January 16, 2026

📡 AI NEWS BUT ACTUALLY GOOD