๐Ÿš€ WELCOME TO METAMESH.BIZ +++ DeepMind says video models are the new LLMs except for physics and hands and everything that matters +++ Quantum computing proof literally outsourced to Claude because even Scott Aaronson can't be bothered anymore +++ DeepSeek quietly drops v3.2 while everyone's still arguing about whether v3 was fake benchmarks or just RLHF'd differently +++ US wants 50% of global chip production because depending on TSMC during geopolitical chaos is working great +++ THE FUTURE IS JUST WORLD MODELS HALLUCINATING BETTER PHYSICS +++ ๐Ÿš€ โ€ข
๐Ÿš€ WELCOME TO METAMESH.BIZ +++ DeepMind says video models are the new LLMs except for physics and hands and everything that matters +++ Quantum computing proof literally outsourced to Claude because even Scott Aaronson can't be bothered anymore +++ DeepSeek quietly drops v3.2 while everyone's still arguing about whether v3 was fake benchmarks or just RLHF'd differently +++ US wants 50% of global chip production because depending on TSMC during geopolitical chaos is working great +++ THE FUTURE IS JUST WORLD MODELS HALLUCINATING BETTER PHYSICS +++ ๐Ÿš€ โ€ข
AI Signal - PREMIUM TECH INTELLIGENCE
๐Ÿ“Ÿ Optimized for Netscape Navigator 4.0+
๐Ÿ“š HISTORICAL ARCHIVE - September 29, 2025
What was happening in AI on 2025-09-29
โ† Sep 28 ๐Ÿ“Š TODAY'S NEWS ๐Ÿ“š ARCHIVE Sep 30 โ†’
๐Ÿ“Š You are visitor #47291 to this AWESOME site! ๐Ÿ“Š
Archive from: 2025-09-29 | Preserved for posterity โšก

Stories from September 29, 2025

โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”โ”
๐Ÿ“‚ Filter by Category
Loading filters...
๐Ÿค– AI MODELS

Google DeepMind, Meta, Nvidia, and others are racing to release world models, aiming to navigate the physical world by learning from videos and robotic data

๐Ÿ”ฌ RESEARCH

Quantum computer scientist: "This is the first paper Iโ€™ve ever put out for which a key technical step in the proof came from AI ... 'There's not the slightest doubt that, if a student had given it to

"https://scottaaronson.blog/?p=9183..."
๐Ÿค– AI MODELS

DeepSeek v3.2 model release

+++ Chinese AI lab releases updated model with typical fanfare of a HackerNews post and Reddit plea for feedback, proving substance over hype still exists. +++

DeepSeek-v3.2

๐Ÿ”ฌ RESEARCH

Quantile Advantage Estimation for Entropy-Safe Reasoning

"Reinforcement Learning with Verifiable Rewards (RLVR) strengthens LLM reasoning, but training often oscillates between {entropy collapse} and {entropy explosion}. We trace both hazards to the mean baseline used in value-free RL (e.g., GRPO and DAPO), which improperly penalizes negative-advantage sam..."
๐Ÿ”ฌ RESEARCH

See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation

"We present See, Point, Fly (SPF), a training-free aerial vision-and-language navigation (AVLN) framework built atop vision-language models (VLMs). SPF is capable of navigating to any goal based on any type of free-form instructions in any kind of environment. In contrast to existing VLM-based approa..."
๐Ÿ”ฌ RESEARCH

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines

"We present a scientific reasoning foundation model that aligns natural language with heterogeneous scientific representations. The model is pretrained on a 206B-token corpus spanning scientific text, pure sequences, and sequence-text pairs, then aligned via SFT on 40M instructions, annealed cold-sta..."
๐Ÿ”ฌ RESEARCH

It's Not You, It's Clipping: A Soft Trust-Region via Probability Smoothing for LLM RL

"Training large language models (LLMs) with reinforcement learning (RL) methods such as PPO and GRPO commonly relies on ratio clipping to stabilise updates. While effective at preventing instability, clipping discards information and introduces gradient discontinuities. We propose Probability Smoothi..."
๐Ÿ”ฌ RESEARCH

Tree Search for LLM Agent Reinforcement Learning

"Recent advances in reinforcement learning (RL) have significantly enhanced the agentic capabilities of large language models (LLMs). In long-term and multi-turn agent tasks, existing approaches driven solely by outcome rewards often suffer from the problem of sparse supervision. To address the chall..."
๐Ÿ”ฌ RESEARCH

DisCoCLIP: A Distributional Compositional Tensor Network Encoder for Vision-Language Understanding

"Recent vision-language models excel at large-scale image-text alignment but often neglect the compositional structure of language, leading to failures on tasks that hinge on word order and predicate-argument structure. We introduce DisCoCLIP, a multimodal encoder that combines a frozen CLIP vision t..."
๐Ÿ”ฌ RESEARCH

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

"LLMs are often trained with RL from human or AI feedback, yet such methods typically compress nuanced feedback into scalar rewards, discarding much of their richness and inducing scale imbalance. We propose treating verbal feedback as a conditioning signal. Inspired by language priors in text-to-ima..."
๐Ÿ”ฌ RESEARCH

SD3.5-Flash: Distribution-Guided Distillation of Generative Flows

"We present SD3.5-Flash, an efficient few-step distillation framework that brings high-quality image generation to accessible consumer devices. Our approach distills computationally prohibitive rectified flow models through a reformulated distribution matching objective tailored specifically for few-..."
๐Ÿค– AI MODELS

DeepMind: video models like Veo 3 could become general purpose foundation models for vision, like LLMs for text, using zero-shot โ€œchain-of-framesโ€ reasoning

๐Ÿ’ฐ FUNDING

Experts warn that Nvidia's large investments in data centers and startups, almost like a stimulus program, could be artificially inflating demand for its GPUs

๐Ÿ”ฌ RESEARCH

Variational Reasoning for Language Models

"We introduce a variational reasoning framework for language models that treats thinking traces as latent variables and optimizes them through variational inference. Starting from the evidence lower bound (ELBO), we extend it to a multi-trace objective for tighter bounds and propose a forward-KL form..."
๐Ÿ”ฌ RESEARCH

Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback

"Reinforcement learning with human feedback (RLHF), which learns a reward model from human preference data and then optimizes a policy to favor preferred responses, has emerged as a central paradigm for aligning large language models (LLMs) with human preferences. In this paper, we investigate explor..."
๐Ÿ”ฌ RESEARCH

Data-Centric Elastic Pipeline Parallelism for Efficient Long-Context LLM Training

"Long context training is crucial for LLM's context extension. Existing schemes, such as sequence parallelism, incur substantial communication overhead. Pipeline parallelism (PP) reduces this cost, but its effectiveness hinges on partitioning granularity. Batch-level PP dividing input samples exhibit..."
๐Ÿ› ๏ธ TOOLS

Holy moly what did those madlads at llama cpp do?!!

"I just ran gpt oss 20b on my mi50 32gb and im getting 90tkps !?!?!? before it was around 40 . ./llama-bench -m /home/server/.lmstudio/models/lmstudio-community/gpt-oss-20b-GGUF/gpt-oss-20b-MXFP4.gguf -ngl 999 -fa on -mg 1 -dev Vulkan1 load\_backend: loaded RPC backend from /home/server/Desktop/L..."
๐Ÿ’ฌ Reddit Discussion: 43 comments ๐Ÿ BUZZING
๐ŸŽฏ GPU Performance โ€ข Hardware Costs โ€ข Efficient Model Development
๐Ÿ’ฌ "Insane boost... feels like llama cpp devs treat gpu drivers like lego blocks" โ€ข "So a 50x price increase for a 20~% performance increase"
๐Ÿ”ง INFRASTRUCTURE

Sources: Huawei aims to produce ~600K of its marquee 910C Ascend chips in 2026, roughly double 2025's level, and raise its Ascend lineup output to 1.6M dies

๐Ÿ› ๏ธ TOOLS

Lessons from building an intelligent LLM router

๐Ÿ› ๏ธ TOOLS

Cursor, Copilot, and Windsurf Handle the Same Coding Task

๐Ÿค– AI MODELS

iRobot co-founder Rodney Brooks details why humanoid robots won't learn human-level dexterity from current methods, how to make them safe for humans, and more

๐Ÿ› ๏ธ TOOLS

The AI coding trap

๐Ÿ’ฌ HackerNews Buzz: 336 comments ๐Ÿ BUZZING
๐ŸŽฏ AI-assisted coding โ€ข Evolving coding practices โ€ข Merging AI and traditional coding
๐Ÿ’ฌ "AI is a good sparring partner and encyclopaedia" โ€ข "the way we work can and will fundamentally change"
๐Ÿ›ก๏ธ SAFETY

DeepMind AI safety report explores the perils of โ€œmisalignedโ€ AI

"External link discussion - see full content at original source."
๐Ÿ’ฐ FUNDING

Turning compute into a tradable commodity could fuel the next stage of the AI boom, just like oil futures and spectrum auctions unlocked waves of investment

๐Ÿ’ฐ FUNDING

VCs say some AI startups, under pressure to show rapid ARR growth, are using questionable accounting practices like counting one-time deals as recurring revenue

๐ŸŒ POLICY

At the UN, the US rejected calls for collaborative efforts around AI governance, even as many leaders endorsed a need for urgent international collaboration

๐Ÿ”ฌ RESEARCH

Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework

"The widespread adoption of Low-Rank Adaptation (LoRA) has enabled large language models (LLMs) to acquire domain-specific knowledge with remarkable efficiency. However, understanding how such a fine-tuning mechanism alters a model's structural reasoning and semantic behavior remains an open challeng..."
๐Ÿ› ๏ธ TOOLS

DSPy: AI Prompting Tool You've Never Heard of [video]

๐Ÿ› ๏ธ TOOLS

The AI Engineer's Guide to LLM Observability with OpenTelemetry

๐Ÿ”ฌ RESEARCH

LLM Output Homogenization is Task Dependent

"A large language model can be less helpful if it exhibits output response homogenization. But whether two responses are considered homogeneous, and whether such homogenization is problematic, both depend on the task category. For instance, in objective math tasks, we often expect no variation in the..."
๐Ÿ”ฌ RESEARCH

Sycophancy Is Not One Thing: Causal Separation of Sycophantic Behaviors in LLMs

"Large language models (LLMs) often exhibit sycophantic behaviors -- such as excessive agreement with or flattery of the user -- but it is unclear whether these behaviors arise from a single mechanism or multiple distinct processes. We decompose sycophancy into sycophantic agreement and sycophantic p..."
๐Ÿ”ฌ RESEARCH

[D] Machine learning research no longer feels possible for any ordinary individual. It is amazing that this field hasn't collapsed yet.

"Imagine you're someone who is attempting to dip a toe into ML research in 2025. Say, a new graduate student. You say to yourself "I want to do some research today". Very quickly you realize the following: **Who's my competition?** Just a handful of billion-dollar tech giants, backed by some of th..."
๐Ÿ’ฌ Reddit Discussion: 33 comments ๐Ÿ BUZZING
๐ŸŽฏ Challenges in ML Research โ€ข Specialization and Focus โ€ข Motivation and Purpose
๐Ÿ’ฌ "As a research field matures, you have to be very specialized to do something new and push the boundary further." โ€ข "The barrier to entry is much much higher and there isn't room for a broad focus."
๐Ÿ”ฌ RESEARCH

Sigma: Semantically Informative Pre-training for Skeleton-based Sign Language Understanding

"Pre-training has proven effective for learning transferable features in sign language understanding (SLU) tasks. Recently, skeleton-based methods have gained increasing attention because they can robustly handle variations in subjects and backgrounds without being affected by appearance or environme..."
๐Ÿ› ๏ธ SHOW HN

Show HN: "Code Mode" for Vercel AI SDK

๐ŸŒ POLICY

Several US states have passed bills to ban or restrict AI mental health treatment, as experts say state laws lag behind the fast-moving AI therapy landscape

๐ŸŒ POLICY

US Military struggling to deploy AI weapons

๐Ÿ’ฌ HackerNews Buzz: 28 comments ๐Ÿ˜ค NEGATIVE ENERGY
๐ŸŽฏ Drone warfare expertise โ€ข US military contracts โ€ข Weapon technology advancement
๐Ÿ’ฌ "Ukraine's homegrown drones have become increasingly lethal" โ€ข "The US needs Ukraine to exist and not be annexed by Russia"
๐Ÿ› ๏ธ TOOLS

[P] Built a differentiable parametric curves library for PyTorch

"Iโ€™ve released a small library for parametric curves for PyTorch that are differentiable: you can backprop to the curveโ€™s inputs and to its parameters. At this stage, I have B-Spline curves (efficiently, exploiting sparsity!) and Legendre Polynomials. Everything is vectorized - over the mini-batch, a..."
๐Ÿ”ฌ RESEARCH

Query-Centric Graph Retrieval Augmented Generation

"Graph-based retrieval-augmented generation (RAG) enriches large language models (LLMs) with external knowledge for long-context understanding and multi-hop reasoning, but existing methods face a granularity dilemma: fine-grained entity-level graphs incur high token costs and lose context, while coar..."
๐Ÿ”ฌ RESEARCH

Chain-of-Thought Snippets โ€“ Anti-Scheming

๐Ÿ”ฌ RESEARCH

Learn the Ropes, Then Trust the Wins: Self-imitation with Progressive Exploration for Agentic Reinforcement Learning

"Reinforcement learning (RL) is the dominant paradigm for sharpening strategic tool use capabilities of LLMs on long-horizon, sparsely-rewarded agent tasks, yet it faces a fundamental challenge of exploration-exploitation trade-off. Existing studies stimulate exploration through the lens of policy en..."
๐Ÿ”ฌ RESEARCH

Vision-Language Alignment from Compressed Image Representations using 2D Gaussian Splatting

"Modern vision language pipelines are driven by RGB vision encoders trained on massive image text corpora. While these pipelines have enabled impressive zero shot capabilities and strong transfer across tasks, they still inherit two structural inefficiencies from the pixel domain: (i) transmitting de..."
๐Ÿ›ก๏ธ SAFETY

If Anyone Builds it, Everyone Dies review โ€“ how AI could kill us all

๐Ÿ› ๏ธ TOOLS

Kooder: Autonomous AI agents that build, test, and deploy complete web apps

๐Ÿ”ฌ RESEARCH

Reasoning LLM Errors Arise from Hallucinating Critical Problem Features

๐Ÿ”ฌ RESEARCH

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

"The growing capabilities of large language models and multimodal systems have spurred interest in voice-first AI assistants, yet existing benchmarks are inadequate for evaluating the full range of these systems' capabilities. We introduce VoiceAssistant-Eval, a comprehensive benchmark designed to as..."
๐Ÿ”ฌ RESEARCH

Training-Free Synthetic Data Generation with Dual IP-Adapter Guidance

"Few-shot image classification remains challenging due to the limited availability of labeled examples. Recent approaches have explored generating synthetic training data using text-to-image diffusion models, but often require extensive model fine-tuning or external information sources. We present a..."
๐Ÿ”ฌ RESEARCH

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

"Recent reinforcement learning (RL) methods have substantially enhanced the planning capabilities of Large Language Models (LLMs), yet the theoretical basis for their effectiveness remains elusive. In this work, we investigate RL's benefits and limitations through a tractable graph-based abstraction,..."
๐Ÿ”ฌ RESEARCH

LABELING COPILOT: A Deep Research Agent for Automated Data Curation in Computer Vision

"Curating high-quality, domain-specific datasets is a major bottleneck for deploying robust vision systems, requiring complex trade-offs between data quality, diversity, and cost when researching vast, unlabeled data lakes. We introduce Labeling Copilot, the first data curation deep research agent fo..."
๐Ÿ’ฐ FUNDING

AI boom is unsustainable unless tech spending goes 'parabolic'

๐Ÿ”ฌ RESEARCH

StateX: Enhancing RNN Recall via Post-training State Expansion

"While Transformer-based models have demonstrated remarkable language modeling performance, their high complexities result in high costs when processing long contexts. In contrast, recurrent neural networks (RNNs) such as linear attention and state space models have gained popularity due to their con..."
๐Ÿ“Š DATA

Retrieval Embedding Benchmark (RTEB)

๐Ÿ› ๏ธ TOOLS

Ask HN: What does your machine learning pipeline look like?

๐ŸŒ ENVIRONMENT

Most coal-fired power plants will delay retirement to feed AI boom

๐Ÿ› ๏ธ TOOLS

Llama.cpp MoE models find best --n-cpu-moe value

"Being able to run larger LLM on consumer equipment keeps getting better. Running MoE models is a big step and now with CPU offloading it's an even bigger step. Here is what is working for me on my RX 7900 GRE 16GB GPU running the Llama4 Scout 108B parameter beast. I use *--n-cpu-moe 30,40,50,60* t..."
๐Ÿ’ฌ Reddit Discussion: 10 comments ๐Ÿ‘ LOWKEY SLAPS
๐ŸŽฏ Model Performance โ€ข Model Optimization โ€ข Multimodal Capabilities
๐Ÿ’ฌ "no gguf support means its DoA for me and half the sub" โ€ข "Even if it's not 'optimal', having a model with that many parameters that can run at human reading speed is desirable"
๐Ÿ”ฎ FUTURE

The big world hypothesis and its ramifications for AI

๐Ÿ”ฌ RESEARCH

SPARK: Synergistic Policy And Reward Co-Evolving Framework

"Recent Large Language Models (LLMs) and Large Vision-Language Models (LVLMs) increasingly use Reinforcement Learning (RL) for post-pretraining, such as RL with Verifiable Rewards (RLVR) for objective tasks and RL from Human Feedback (RLHF) for subjective tasks. However, RLHF incurs high costs and po..."
๐Ÿ”ฌ RESEARCH

No Prior, No Leakage: Revisiting Reconstruction Attacks in Trained Neural Networks

"The memorization of training data by neural networks raises pressing concerns for privacy and security. Recent work has shown that, under certain conditions, portions of the training set can be reconstructed directly from model parameters. Some of these methods exploit implicit bias toward margin ma..."
โš–๏ธ ETHICS

"AI-Powered" Is a Red Flag. Here's a Dev's Guide to Calling Bullshit

๐Ÿ’ฌ HackerNews Buzz: 7 comments ๐Ÿ‘ LOWKEY SLAPS
๐ŸŽฏ Marketing Buzzwords โ€ข Calling Out Hype โ€ข Implementation Details vs Features
๐Ÿ’ฌ "AI Powered is - an implementation detail" โ€ข "Cloud-based isn't a meaningless marketing term"
๐Ÿ’ฐ FUNDING

Manas AI, the AI drug discovery startup founded by Reid Hoffman and researcher Siddhartha Mukherjee, raised a $26M seed extension after a $24.6M seed in January

๐Ÿ’ฐ FUNDING

London-based Paid, which helps AI agent providers monetize and track costs, raised a $21.6M seed led by Lightspeed, a source says at a $100M+ valuation

๐Ÿ”ฌ RESEARCH

SuperOffload: Unleashing the Power of Large-Scale LLM Training on Superchips

"The emergence of Superchips represents a significant advancement in next-generation AI hardware. These Superchips employ a tightly coupled heterogeneous architecture that integrates GPU and CPU on the same package, which offers unprecedented computational power. However, there has been scant researc..."
๐Ÿ›ก๏ธ SAFETY

If you believe advanced AI will be able to cure cancer, you also have to believe it will be able to synthesize pandemics. To believe otherwise is just wishful thinking.

"When someone says a global AGI ban would be impossible to enforce, they sometimes seem to be imagining that states: 1. Won't believe theoretical arguments about extreme, unprecedentedย *risks* 2. Butย *will*ย believe theoretical arguments about extreme, unprecedentedย *benefits* Intelligence is dual u..."
๐Ÿ’ฌ Reddit Discussion: 3 comments ๐Ÿ˜ค NEGATIVE ENERGY
๐ŸŽฏ Pandemic Creation Capabilities โ€ข Technological Duality โ€ข Responsibility of Technology
๐Ÿ’ฌ "if it could cure cancer it could create pandemics" โ€ข "We can definitely create pandemics"
๐Ÿฆ†
HEY FRIENDO
CLICK HERE IF YOU WOULD LIKE TO JOIN MY PROFESSIONAL NETWORK ON LINKEDIN
๐Ÿค LETS BE BUSINESS PALS ๐Ÿค