AI News Archive - September 27, 2025

🔄 OPEN SOURCE

Gpt-oss Reinforcement Learning - Fastest inference now in Unsloth! (<15GB VRAM)

via r/LocalLLaMA 👤 u/danielhanchen 📅 2025-09-26

⬆️ 362 ups ⚡ Score: 9.2

"Hey guys we've got lots of updates for Reinforcement Learning (RL)! We’re excited to introduce gpt-oss, Vision, and even better RL in Unsloth. Our new gpt-oss RL inference also achieves the fastest token/s vs. any other implementation. Our GitHub: [https://github.com/unslothai/unsloth](https://githu..."

💬 Reddit Discussion: 46 comments 🐝 BUZZING

🎯 Fine-tuning LLMs • Open-source AI models • Code generation usecase

💬 "You would need to construct how you're going to qualify success and the rewards." • "Before RL, look into how to train a LoRA, and try that."

💰 FUNDING

OpenAI Needs a Trillion Dollars in the Next Four Years

via HackerNews 👤 msk-lywenn 📅 2025-09-27

🔺 16 pts ⚡ Score: 8.9

🤖 AI MODELS

Moondream 3 Preview: Frontier-level reasoning at a blazing speed

via HackerNews 👤 kristianp 📅 2025-09-26

🔺 219 pts ⚡ Score: 8.8

💬 HackerNews Buzz: 31 comments 🐐 GOATED ENERGY

🎯 Object detection performance • Edge device deployment • Model architecture and optimization

💬 "The ability to keep only 2B parameters active while maintaining 8B model performance is a game-changer for edge deployment." • "Scaling these models to production environments often introduces security challenges, including bot floods targeting inference APIs and adversarial inputs that mimic legitimate queries to disrupt detections."

🔬 RESEARCH

Q&A with reinforcement learning pioneer Richard Sutton on why LLMs are not the path to achieving human intelligence, world models, continual learning, and more

via Techmeme 👤 Dwarkesh 📅 2025-09-27

⚡ Score: 8.7

🤖 AI MODELS

Model picker is a mirage : 4o/5/4.5/5 pro are all being rerouted to “5” - What’s actually happening (Forensic Summary)

via r/ChatGPT 👤 u/Littlearthquakes 📅 2025-09-27

⬆️ 100 ups ⚡ Score: 8.6

"Here’s what I’ve mapped after 24 hours of chaos, hundreds of receipts and way too many tests: 1. **Model names are no longer contracts** “GPT-4o,” “GPT-5,” “4.5,” even “5 Instant” are now just UI labels. - The backend is silently serving *whatever model* the router decides - This is happening to pl..."

💬 Reddit Discussion: 22 comments 👍 LOWKEY SLAPS

🎯 Model changes • User frustration • Transparency concerns

💬 "We literally pay for 4o. Its not free. We pay for a service." • "I just want transparency. For me it's 4o or nothing."

🔒 SECURITY

Google's Secure AI Framework: Red Teaming in the Age of LLMs [pdf]

via HackerNews 👤 lnsp 📅 2025-09-26

🔺 1 pts ⚡ Score: 8.6

🔬 RESEARCH

AI's Hidden Geometry: Riemannian Optimization on Manifolds

via HackerNews 👤 WASDAai 📅 2025-09-26

🔺 1 pts ⚡ Score: 8.2

🧠 NEURAL NETWORKS

Analog in-memory computing attention mechanism for fast energy-efficient LLMs

via HackerNews 👤 physarum_salad 📅 2025-09-27

🔺 3 pts ⚡ Score: 8.2

🛠️ TOOLS

Sources: Apple has developed an internal ChatGPT-like iPhone app, code-named Veritas, to help test and prepare for a major Siri overhaul next year

via Techmeme 👤 Bloomberg 📅 2025-09-26

⚡ Score: 8.1

🔬 RESEARCH

Automated Repair of Ambiguous Problem Descriptions for LLM-Based Code Generation

via HackerNews 👤 mechtaev 📅 2025-09-27

🔺 2 pts ⚡ Score: 8.0

🤖 AI MODELS

K2-Think 32B - Reasoning model from UAE

via r/LocalLLaMA 👤 u/Mr_Moonsilver 📅 2025-09-27

⬆️ 160 ups ⚡ Score: 7.8

"Seems like a strong model and a very good paper released alongside. Opensource is going strong at the moment, let's hope this benchmark holds true. Huggingface Repo: https://huggingface.co/LLM360/K2-Think Paper: [https://huggingface.co/papers/2509.07604](..."

💬 Reddit Discussion: 46 comments 😐 MID OR MIXED

🎯 Dataset Contamination • Model Benchmarking • Ethical Research Practices

💬 "We find clear evidence of data contamination." • "Interestingly, there is a large overlap between the creators of the RL dataset, Guru, and the authors of K2-Think, who should have been fully aware of this."

🚗 AUTOMOTIVE

International Federation of Robotics: Chinese factories installed ~300K new robots in 2024, more than the rest of the world combined; US factories installed 34K

via Techmeme 👤 Nytimes 📅 2025-09-27

⚡ Score: 7.8

🛠️ TOOLS

OpenAI: Updated function calling to support files, images as tool call outputs

via HackerNews 👤 tosh 📅 2025-09-26

🔺 1 pts ⚡ Score: 7.8

🔧 INFRASTRUCTURE

GPU Snapshots to reduce ML coldstarts

via HackerNews 👤 agcat 📅 2025-09-27

🔺 2 pts ⚡ Score: 7.8

🔮 FUTURE

Turing Award winner on AI succession inevitability

2x SOURCES 🌐 📅 2025-09-27

⚡ Score: 7.8

+++ The RL pioneer tells Dwarkesh that humans getting replaced by AI is inevitable, adding another Turing Award voice to the "we're doomed" chorus. +++

Another Turing Award winner has said he thinks succession to AI is inevitable

via r/artificial 👤 u/MetaKnowing 📅 2025-09-27

⬆️ 76 ups ⚡ Score: 8.5

"From the Dwarkesh podcast interview: https://www.dwarkesh.com/p/richard-sutton..."

🔧 INFRASTRUCTURE

For llama.cpp/ggml AMD MI50s are now universally faster than NVIDIA P40s

via r/LocalLLaMA 👤 u/Remove_Ayys 📅 2025-09-27

⬆️ 445 ups ⚡ Score: 7.8

"In 2023 I implemented llama.cpp/ggml CUDA support specifically for NVIDIA P40s since they were one of the cheapest options for GPUs with 24 GB VRAM. Recently AMD MI50s became very cheap options for GPUs with 32 GB VRAM, selling for well below $150 if you order multiple of them off of Alibaba. Howeve..."

💬 Reddit Discussion: 130 comments 🐝 BUZZING

🎯 Sponsorship & Funding • GPU Optimization • Community Engagement

💬 "Congrats on the sponsorship, well deserved!" • "I'm sure I'm not the only guy who would happily sponsor a few bucks a month for your work on amd platforms"

🔧 INFRASTRUCTURE

LLM Observability in the Wild – Why OpenTelemetry Should Be the Standard

via HackerNews 👤 pranay01 📅 2025-09-27

🔺 109 pts ⚡ Score: 7.8

💬 HackerNews Buzz: 31 comments 🐝 BUZZING

🎯 OpenTelemetry Compatibility • LLM Observability Vendors • Observability Challenges

💬 "Saying OpenInference is not otel compatible does not make any sense." • "We've invested heavily in observability having quickly found that observability + evals are the cornerstone to a successful agent."

🔄 OPEN SOURCE

GPT-OSS Reinforcement Learning

via HackerNews 👤 vinhnx 📅 2025-09-27

🔺 126 pts ⚡ Score: 7.8

🏢 BUSINESS

OpenAI's historic week has redefined the AI arms race for investors

via HackerNews 👤 rntn 📅 2025-09-27

🔺 2 pts ⚡ Score: 7.7

🔧 INFRASTRUCTURE

AI Needs a Lot of Computing Power. Is a Market for 'Compute' the Next Big Thing?

via HackerNews 👤 ryan_j_naughton 📅 2025-09-27

🔺 4 pts ⚡ Score: 7.6

🔒 SECURITY

Security Advisory: Anthropic's Slack MCP Server Vulnerable to Data Exfiltration

via HackerNews 👤 schrodinger 📅 2025-09-26

🔺 2 pts ⚡ Score: 7.5

🔬 RESEARCH

[R] DynaMix: First dynamical systems foundation model enabling zero-shot forecasting of long-term statistics at #NeurIPS2025

via r/MachineLearning 👤 u/DangerousFunny1371 📅 2025-09-27

⬆️ 97 ups ⚡ Score: 7.5

"Our **dynamical systems foundation model DynaMix** was accepted to **#NeurIPS2025** with outstanding reviews (6555) – the first model which can ***zero-shot***, w/o any fine-tuning, forecast the ***long-term behavior*** of time series from just a short context signal. Test it on #HuggingFace: [http..."

🔒 SECURITY

ForcedLeak: AI Agent risks exposed in Salesforce AgentForce

via HackerNews 👤 tempodox 📅 2025-09-27

🔺 2 pts ⚡ Score: 7.5

🛠️ SHOW HN

Show HN: I built an AI Colosseum to battle-test different agent architectures

via HackerNews 👤 aytuakarlar 📅 2025-09-27

🔺 2 pts ⚡ Score: 7.5

💼 JOBS

Anthropic plans to triple its global workforce and expand its applied AI team 5x in 2025, after growing its business clients from ~1K to 300K+ in two years

via Techmeme 👤 Cnbc 📅 2025-09-26

⚡ Score: 7.5

🌐 POLICY

The first AI system in the world to hold a cabinet-level government role

via HackerNews 👤 ColinWright 📅 2025-09-27

🔺 1 pts ⚡ Score: 7.3

🛠️ TOOLS

Bringing AI Applications from Prototype to Production: The Last Mile

via HackerNews 👤 panrobo 📅 2025-09-26

🔺 1 pts ⚡ Score: 7.3

🛠️ TOOLS

Perplexity launches Search API, giving developers direct access to the same web index that powers the startup's answer engine

via Techmeme 👤 Venturebeat 📅 2025-09-26

⚡ Score: 7.3

🏢 BUSINESS

Anthropic to triple international workforce in global AI push

via HackerNews 👤 alecco 📅 2025-09-27

🔺 3 pts ⚡ Score: 7.2

🛠️ TOOLS

Agent design lessons from Claude Code

via HackerNews 👤 calcsam 📅 2025-09-26

🔺 3 pts ⚡ Score: 7.0

🔬 RESEARCH

LLM probabilities cannot distinguish between possible and impossible language

via HackerNews 👤 foobarqux 📅 2025-09-26

🔺 1 pts ⚡ Score: 7.0

🤖 AI MODELS

Why GPT 4o Feels So Much Better: It’s Not the Emojis, It’s the Context Window (from a Comp-Sci PhD)

via r/ChatGPT 👤 u/hexferro 📅 2025-09-26

⬆️ 52 ups ⚡ Score: 7.0

"At a time during this GPT5/4o switching nosnsense - let me explain why 4o's superiority isn't because of its 'personality' or because it's 'our best friend'. For the record, I've got my credentials (PhD in comp-sci), so I know what I'm talking about. I don't work in OpenAI (and after this fiasco I ..."

💬 Reddit Discussion: 13 comments 🐝 BUZZING

🎯 AI model capabilities • Language model context limits • User experience with AI models

💬 "4o could understand that's not how humans write or want to read" • "GPT5-Auto has the memory of a fish lol"

🔬 RESEARCH

[R] Object Tracking: A Comprehensive Survey From Classical Approaches to Large Vision-Language and Foundation Models

via r/MachineLearning 👤 u/Downtown_Ambition662 📅 2025-09-27

⬆️ 4 ups ⚡ Score: 7.0

"I came across a new survey and resource repository on object tracking. It covers classical Single Object Tracking (SOT) and Multi-Object Tracking (MOT), as well as more recent approaches that use vision-language and foundation models. The repository also includes Long-Term Tracking (LTT), benchmark..."

🔬 RESEARCH

Object Tracking: A Comprehensive Survey From Classical Approaches to Large Vision-Language and Foundation Models

via r/computervision 👤 u/Downtown_Ambition662 📅 2025-09-27

⬆️ 37 ups ⚡ Score: 7.0

"Found a a new survey + resource repo on **object tracking**, spanning from **classical Single Object Tracking (SOT)** and **Multi-Object Tracking (MOT)** to the latest **vision-language and foundation model based trackers**. 🔗 GitHub: [Awesome-Object-Tracking](https://github.com/rahulrj/Awesome-Obj..."

🔒 SECURITY

Why AI systems may never be secure, and what to do about it

via HackerNews 👤 loosescrews 📅 2025-09-26

🔺 2 pts ⚡ Score: 7.0

💬 HackerNews Buzz: 3 comments 😤 NEGATIVE ENERGY

🎯 AI Safety • Existential Risk • Ethical Challenges

💬 "AI's lethal trifecta is a thorny issue" • "There's no easy solution to this problem"

🏢 BUSINESS

Alibaba unveils $53B global AI plan – but it will need GPUs to back it up

via HackerNews 👤 rntn 📅 2025-09-27

🔺 2 pts ⚡ Score: 6.8

🔧 INFRASTRUCTURE

Building a Serverless WASM AI Runtime in Rust [video]

via HackerNews 👤 todsacerdoti 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.8

🔬 RESEARCH

Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models

via HackerNews 👤 gok 📅 2025-09-27

🔺 2 pts ⚡ Score: 6.8

🔧 INFRASTRUCTURE

Given the model, context size and number of GPU can you calculate VRAM needed for each GPU?

via r/LocalLLaMA 👤 u/arstarsta 📅 2025-09-26

⬆️ 5 ups ⚡ Score: 6.7

"Is 4x16GB GPU equivalent to a 64GB gpu or is there overhead in memory requirements? Are there some variables that must build duplicated on all GPU? I was trying to run Qwen next 80B 4bit but it ran out of VRAM on my 2x5090 with tensor parallel = 2."

💬 Reddit Discussion: 5 comments 👍 LOWKEY SLAPS

🎯 VRAM Optimization • Multi-GPU Usage • Model Partitioning

💬 "A single 96GB GPU (i.e. 6000 PRO) would use less VRAM" • "that's why 24GB GPU is always better than 2x12GB GPU"

💰 FUNDING

Google agrees to guarantee $1.4B of AI computing startup Fluidstack's $3B, 10-year agreement with Cipher Mining and gets the right to buy a 5.4% stake in Cipher

via Techmeme 👤 Bloomberg 📅 2025-09-27

⚡ Score: 6.7

💰 FUNDING

Are We in an A.I. Bubble? I Suspect So

via HackerNews 👤 paulpauper 📅 2025-09-27

🔺 53 pts ⚡ Score: 6.7

💬 HackerNews Buzz: 56 comments 🐝 BUZZING

🎯 AI infrastructure costs • AI bubble and impact • Varying perspectives on AI bubble

💬 "the infrastructure is not going away" • "the actual value creation from railroads was in the land itself"

🛠️ TOOLS

How developers are using Apple's local AI models in iOS 26: Lil Artist story generation, MoneyCoach's spending insights, F1 race summaries in Lights Out, more

via Techmeme 👤 Techcrunch 📅 2025-09-27

⚡ Score: 6.6

🛠️ TOOLS

How developers are using Apple's local AI models with iOS 26

via HackerNews 👤 mooreds 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.5

🔬 RESEARCH

Sample Forge – Research tool for deterministic inference in LLM's

via HackerNews 👤 nowittyusername 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.5

🛠️ SHOW HN

Show HN: Open-Source Semantic AI Chat Search – 100% Locally

via HackerNews 👤 siv_io_ 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.5

🔬 RESEARCH

Open-source embedding models: which one to use?

via r/LocalLLaMA 👤 u/DhravyaShah 📅 2025-09-26

⬆️ 10 ups ⚡ Score: 6.5

"I’m building a memory engine to add memory to LLMs. Embeddings are a pretty big part of the pipeline, so I was curious which open-source embedding model is the best. Did some tests and thought I’d share them in case anyone else finds them useful: Models tested: * BAAI/bge-base-en-v1.5 * intfloat..."

💬 Reddit Discussion: 4 comments 🐝 BUZZING

🎯 Embedding model benchmarks • Embedding model selection • Embedding model performance

💬 "Getting a mean avg across the board doesn't cut it." • "You really have to look at domain and task specific scores."

🎨 CREATIVE

Vibes – AI Generated Video Feed from Meta

via HackerNews 👤 Ozzie_osman 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.5

🛠️ SHOW HN

Show HN: macOS Local AI Dictation Software

via HackerNews 👤 explosion-s 📅 2025-09-27

🔺 2 pts ⚡ Score: 6.5

🔮 FUTURE

Cost of AGI Delusion:Chasing Superintelligence US Falling Behind in Real AI Race

via HackerNews 👤 bookofjoe 📅 2025-09-27

🔺 51 pts ⚡ Score: 6.5

💬 HackerNews Buzz: 29 comments 🐝 BUZZING

🎯 Open AI models • Chinese AI development • Practical AI applications

💬 "China is releasing the models they train (for the most part)" • "China seems to still be developing in an open way including sharing code, weights, and publishing techniques"

🏢 BUSINESS

At an all-hands, AWS CEO Matt Garman criticized staff for slow product rollouts and demonstrated a new agentic AI product for internal testing called Quick

via Techmeme 👤 Reuters 📅 2025-09-27

⚡ Score: 6.5

🌐 POLICY

An interview with California state Senator Scott Wiener on his new AI safety bill SB 53, the bill's scope, his focus on AI safety bills, AI PACs, and more

via Techmeme 👤 Techcrunch 📅 2025-09-27

⚡ Score: 6.5

🛠️ TOOLS

MCP for talent matching

via r/claudeai 👤 u/ComprehensiveLong369 📅 2025-09-27

⬆️ 2 ups ⚡ Score: 6.5

"We spent €300k+ over 4 years building everything custom. Then we connected Anthropic's Claude via MCP in 2 days and cut our matching times by 95%. At Cosmico Italia and Cosmico España, we process thousands of profiles. For years, we developed everything in-house: a proprietary CV parser, a matching ..."

🔬 RESEARCH

AI's Quiet Geometry: Riemannian and Manifold Learnings

via HackerNews 👤 WASDAai 📅 2025-09-27

🔺 2 pts ⚡ Score: 6.5

🔒 SECURITY

Openai has been caught doing illegal

via r/ChatGPT 👤 u/Striking-Tour-8815 📅 2025-09-27

⬆️ 2135 ups ⚡ Score: 6.5

" Tibor the same engineer who leaked earlier today that OpenAI had already built a parental control and an ads UI and were just waiting for rollout has just confirmed: Yes, both 4 and 5 models are being routed to TWO secret backend models if it judges anything is remotely sensitive or emotional, or..."

🔧 INFRASTRUCTURE

Why a decades old architecture decision is impeding the power of AI computing

via HackerNews 👤 Nezteb 📅 2025-09-26

🔺 47 pts ⚡ Score: 6.3

💬 HackerNews Buzz: 25 comments 👍 LOWKEY SLAPS

🎯 Iterative improvements • Frontier computing concepts • Optical memory

💬 "I just wish more folks would start openly admitting that our current architecture designs are broadly based off 'low hanging fruit' of early electronics and microprocessors" • "Actual result: This new process promises to increase the number of optical fibers that can be connected at the edge of a chip, a measure known as beachfront density, by six times"

🏥 HEALTHCARE

New AI Tool Pinpoints Genes, Drug Combos to Restore Health in Diseased Cells

via HackerNews 👤 ca98am79 📅 2025-09-26

🔺 1 pts ⚡ Score: 6.3

🔧 INFRASTRUCTURE

MetalQwen3: Full GPU-Accelerated Qwen3 Inference on Apple Silicon with Metal Shaders – Built on qwen3.c - WORK IN PROGRESS

via r/LocalLLaMA 👤 u/QuanstScientist 📅 2025-09-27

⬆️ 77 ups ⚡ Score: 6.2

"Hey r/LocalLLaMA, Inspired by Adrian Cable's awesome qwen3.c project (that simple, educational C inference engine for Qwen3 models – check out the original post here: [https://www.reddit.com/r/LocalLLaMA/comments/1lpejnj/qwen3\_inference\_engine\_in\_c\_simple\_educational\_fun/](https://www.reddit..."

💬 Reddit Discussion: 9 comments 🐝 BUZZING

🎯 New architecture • Performance comparison • Project complexity

💬 "Why not trying to pull req llamacpp if you have something faster/better?" • "A C++ project is always a bit complex, I can certainly think about it."

🤖 AI MODELS

AI model trapped in a Raspberry Pi

via HackerNews 👤 harel 📅 2025-09-27

🔺 121 pts ⚡ Score: 6.2

⚖️ ETHICS

OpenAI is destroying users' trust in them

via r/ChatGPT 👤 u/nubiibunn 📅 2025-09-26

⬆️ 1777 ups ⚡ Score: 6.2

"First of all, I hope those who read this article can calm down. I don't want users to fight with each other. The reason why I put forward this view is that since the sudden removal of 4o, OpenAI has always been making some decisions that make users unhappy. Users who miss the 4o have pointed out i..."

🌐 POLICY

A look at some California tech regulation bills, including one banning AI use in firing or disciplining workers, that await Gov. Newsom's signature or his veto

via Techmeme 👤 Bloodinthemachine 📅 2025-09-27

⚡ Score: 6.2

🔒 SECURITY

Health Data Sovereignty: AI Access Control for Personal Data [video]

via HackerNews 👤 transpute 📅 2025-09-27

🔺 1 pts ⚡ Score: 6.2

💰 FUNDING

AI bubble is the only thing keeping the US economy together, Deutsche Bank warns

via HackerNews 👤 smartmic 📅 2025-09-27

🔺 24 pts ⚡ Score: 6.2

💬 HackerNews Buzz: 14 comments 👍 LOWKEY SLAPS

🎯 AI's impact on productivity • AI bubble and market dynamics • Skepticism towards AI hype

💬 "We expect productivity gains from artificial intelligence (AI) to boost GDP" • "AI bubble is the only thing keeping the US [stock market from crashing]"

Stories from September 27, 2025

📡 AI NEWS BUT ACTUALLY GOOD

Turing Award winner on AI succession inevitability