π HISTORICAL ARCHIVE - September 15, 2025
What was happening in AI on 2025-09-15
π You are visitor #47291 to this AWESOME site! π
Archive from: 2025-09-15 | Preserved for posterity β‘
π Filter by Category
Loading filters...
π§ INFRASTRUCTURE
"Intel's Efficiency Cores seem to have a "poisoning" effect on inference speeds when running on the CPU or Hybrid CPU/GPU. There was a
discussion about this on this sub last year. `llama-server` has ..."
π― Parallelizing inference β’ Overclocking E-cores β’ Offloading to CPU
π¬ "if you had say a 5080 and a 5060, one card is going to pull down the other"
β’ "E cores seem to OC well on newer models"
π HOT STORY
"15 hours ago..."
π§ INFRASTRUCTURE
"I've stumbled upon
exo-explore/exo, a LLM engine that supports multi-peer inference in self-organized p2p network. I got it running on a single node in LXC, and generally things looked good.
That sounds quite tempting; I have a homelab server, a Π¨indows gaming ..."
π― LLM deployment β’ Hardware requirements β’ Distributed LLM inference
π¬ "Llama-rpc works but prompt processing is abysmally slow"
β’ "Ray with vLLM should work"
π HOT STORY
via Arxiv
π€ Bingxin Xu, Zhen Dong, Oussama Elachqar et al.
π
2025-09-11
β‘ Score: 8.6
"Large language models require massive memory footprints, severely limiting
deployment on consumer hardware. Quantization reduces memory through lower
numerical precision, but extreme 2-bit quantization suffers from catastrophic
performance loss due to outliers in activations. Rotation-based methods..."
π‘ AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
Get the stories that matter in Today's AI Briefing.
Powered by Premium Technology Intelligence Algorithms β’ Unsubscribe anytime
π EDUCATION
β¬οΈ 46 ups
β‘ Score: 8.3
"AMAq with members of the Codex team
Wednesday 11am PT."
π― Codex usage patterns β’ Codex's future impact β’ Codex pricing and features
π¬ "I use it all the time! Partly to dogfood the tools"
β’ "I think the most basic answer is that the abstraction level will continue to rise"
π SECURITY
via Reddit
π€ u/Mindless_Pain1860
π
2025-09-15
β‘ Score: 8.3
"The main reason many AI companies are struggling to turn a profit is that the marginal cost of running large AI models is far from zero. Unlike software that can be distributed at almost no additional cost, every query to a large AI model consumes real compute power, electricity, and server resource..."
π― IP protection β’ AI model security β’ Cost-effective AI models
π¬ "IP protection is overrated and leads to stagnation and anti-consumer trends"
β’ "We can use Confidential Inference as one component of our broader effort to secure frontier models"
π¬ RESEARCH
via Arxiv
π€ Sourav Garg, Dustin Craggs, Vineeth Bhat et al.
π
2025-09-11
β‘ Score: 8.0
"Visual navigation using only a single camera and a topological map has
recently become an appealing alternative to methods that require additional
sensors and 3D maps. This is typically achieved through an "image-relative"
approach to estimating control from a given pair of current observation and
s..."
π¬ RESEARCH
via Arxiv
π€ Akshit Sinha, Arvindh Arun, Shashwat Goel et al.
π
2025-09-11
β‘ Score: 8.0
"Does continued scaling of large language models (LLMs) yield diminishing
returns? Real-world value often stems from the length of task an agent can
complete. We start this work by observing the simple but counterintuitive fact
that marginal gains in single-step accuracy can compound into exponential..."
π¬ RESEARCH
via Arxiv
π€ Rui Lu, Zhenyu Hou, Zihan Wang et al.
π
2025-09-12
β‘ Score: 8.0
"Augmenting large language models (LLMs) with browsing tools substantially
improves their potential as deep search agents to solve complex, real-world
tasks. Yet, open LLMs still perform poorly in such settings due to limited
long-horizon reasoning capacity with browsing tools and the lack of
suffici..."
π¬ RESEARCH
via Arxiv
π€ Maysam Behmanesh, Erkan Turan, Maks Ovsjanikov
π
2025-09-11
β‘ Score: 8.0
"Graph alignment-the problem of identifying corresponding nodes across
multiple graphs-is fundamental to numerous applications. Most existing
unsupervised methods embed node features into latent representations to enable
cross-graph comparison without ground-truth correspondences. However, these
meth..."
π¬ RESEARCH
via Arxiv
π€ Akshit Achara, Esther Puyol Anton, Alexander Hammers et al.
π
2025-09-11
β‘ Score: 8.0
"Magnetic resonance imaging (MRI) is the gold standard for brain imaging. Deep
learning (DL) algorithms have been proposed to aid in the diagnosis of diseases
such as Alzheimer's disease (AD) from MRI scans. However, DL algorithms can
suffer from shortcut learning, in which spurious features, not dir..."
π¬ RESEARCH
via Arxiv
π€ Bangzhao Shu, Isha Joshi, Melissa Karnaze et al.
π
2025-09-11
β‘ Score: 8.0
"The versatility of Large Language Models (LLMs) in natural language
understanding has made them increasingly popular in mental health research.
While many studies explore LLMs' capabilities in emotion recognition, a
critical gap remains in evaluating whether LLMs align with human emotions at a
fine-..."
π€ AI MODELS
π― Speculative decoding vs. cascading β’ Quality vs. speed trade-offs β’ Confusion around cascading mechanics
π¬ "Spec decode gets 73% right on GSM8K, but spec cascade got around 77% right."
β’ "The verifier tokens do not always come from the big model for cascades!"
π¬ RESEARCH
via Arxiv
π€ Adrian de Wynter
π
2025-09-12
β‘ Score: 8.0
"In-context learning (ICL) allows some autoregressive models to solve tasks
via next-token prediction and without needing further training. This has led to
claims about these model's ability to solve (learn) unseen tasks with only a
few shots (exemplars) in the prompt. However, deduction does not alw..."
π OPEN SOURCE
π― CPU-first architecture β’ Incremental learning β’ Optimization and benchmarking
π¬ "I have a CPU-first, no-backprop architecture that works very well on classification datasets."
β’ "Do you consider GPU accelerations? Also, do you have any benchmarks on known hardware?"
π¬ RESEARCH
via Arxiv
π€ Yixiao Zhou, Ziyu Zhao, Dongzhou Cheng et al.
π
2025-09-12
β‘ Score: 8.0
"Sparse Mixture-of-Experts (SMoE) architectures are widely used in large
language models (LLMs) due to their computational efficiency. However, though
only a few experts are activated for each token, SMoE still requires loading
all expert parameters, leading to high memory usage and challenges in
dep..."
π¬ RESEARCH
via Arxiv
π€ Utsab Saha, Tanvir Muntakim Tonoy, Hafiz Imtiaz
π
2025-09-12
β‘ Score: 8.0
"In this work, we explore differentially private synthetic data generation in
a decentralized-data setting by building on the recently proposed
Differentially Private Class-Centric Data Aggregation (DP-CDA). DP-CDA
synthesizes data in a centralized setting by mixing multiple randomly-selected
samples..."
π¬ RESEARCH
via Arxiv
π€ Ignacy StΔpka, Jerzy Stefanowski
π
2025-09-11
β‘ Score: 8.0
"Machine learning models in dynamic environments often suffer from concept
drift, where changes in the data distribution degrade performance. While
detecting this drift is a well-studied topic, explaining how and why the
model's decision-making logic changes still remains a significant challenge. In..."
π¬ RESEARCH
via Arxiv
π€ Minghang Zhu, Zhengliang Shi, Zhiwei Xu et al.
π
2025-09-11
β‘ Score: 8.0
"The advancement of large language models (LLMs) has enabled the construction
of multi-agent systems to solve complex tasks by dividing responsibilities
among specialized agents, such as a planning agent for subgoal generation and a
grounding agent for executing tool-use actions. Most existing method..."
π¬ RESEARCH
via Arxiv
π€ Iason Gabriel, Geoff Keeling, Arianna Manzini et al.
π
2025-09-12
β‘ Score: 8.0
"The deployment of capable AI agents raises fresh questions about safety,
human-machine relationships and social coordination. We argue for greater
engagement by scientists, scholars, engineers and policymakers with the
implications of a world increasingly populated by AI agents. We explore key
chall..."
π¬ RESEARCH
via Arxiv
π€ Shulai Zhang, Ao Xu, Quan Chen et al.
π
2025-09-11
β‘ Score: 8.0
"Embodied AI systems operate in dynamic environments, requiring seamless
integration of perception and generation modules to process high-frequency
input and output demands. Traditional sequential computation patterns, while
effective in ensuring accuracy, face significant limitations in achieving th..."
π¬ RESEARCH
via Arxiv
π€ Paolo Pedinotti, Peter Baumann, Nathan Jessurun et al.
π
2025-09-11
β‘ Score: 8.0
"Large Language Models (LLMs) have rapidly reshaped financial NLP, enabling
new tasks and driving a proliferation of datasets and diversification of data
sources. Yet, this transformation has outpaced traditional surveys. In this
paper, we present MetaGraph, a generalizable methodology for extracting..."
π¬ RESEARCH
via Arxiv
π€ Ngoc-Son Nguyen, Hieu-Nghia Huynh-Nguyen, Thanh V. T. Tran et al.
π
2025-09-11
β‘ Score: 8.0
"Zero-shot Text-to-Speech (TTS) aims to synthesize high-quality speech that
mimics the voice of an unseen speaker using only a short reference sample,
requiring not only speaker adaptation but also accurate modeling of prosodic
attributes. Recent approaches based on language models, diffusion, and fl..."
π¬ RESEARCH
via Arxiv
π€ Jielin Qiu, Zuxin Liu, Zhiwei Liu et al.
π
2025-09-11
β‘ Score: 8.0
"The emergence of long-context language models with context windows extending
to millions of tokens has created new opportunities for sophisticated code
understanding and software development evaluation. We propose LoCoBench, a
comprehensive benchmark specifically designed to evaluate long-context LL..."
π OPEN SOURCE
π― Serverless workflow β’ Trigger.dev features β’ Product growth
π¬ "For me, it's the most accessible incarnation of serverless."
β’ "Uncaught errors automatically cause retries of tasks using your settings."
π¬ RESEARCH
via Arxiv
π€ Haolan Zheng, Yanlai Chen, Jiequn Han et al.
π
2025-09-11
β‘ Score: 8.0
"We propose a novel data-lean operator learning algorithm, the Reduced Basis
Neural Operator (ReBaNO), to solve a group of PDEs with multiple distinct
inputs. Inspired by the Reduced Basis Method and the recently introduced
Generative Pre-Trained Physics-Informed Neural Networks, ReBaNO relies on a
m..."
π¬ RESEARCH
via Arxiv
π€ Daria Laslo, Efthymios Georgiou, Marius George Linguraru et al.
π
2025-09-11
β‘ Score: 8.0
"Predicting the spatio-temporal progression of brain tumors is essential for
guiding clinical decisions in neuro-oncology. We propose a hybrid mechanistic
learning framework that combines a mathematical tumor growth model with a
guided denoising diffusion implicit model (DDIM) to synthesize anatomica..."
π¬ RESEARCH
via Arxiv
π€ Runpeng Dai, Linfeng Song, Haolin Liu et al.
π
2025-09-11
β‘ Score: 8.0
"Reinforcement Learning with Verifiable Rewards (RLVR) is a powerful paradigm
for enhancing the reasoning ability of Large Language Models (LLMs). Yet
current RLVR methods often explore poorly, leading to premature convergence and
entropy collapse. To address this challenge, we introduce Curiosity-Dr..."
π¬ RESEARCH
via Arxiv
π€ Ngoc-Son Nguyen, Hieu-Nghia Huynh-Nguyen, Thanh V. T. Tran et al.
π
2025-09-11
β‘ Score: 8.0
"Zero-shot Text-to-Speech (TTS) aims to synthesize high-quality speech that
mimics the voice of an unseen speaker using only a short reference sample,
requiring not only speaker adaptation but also accurate modeling of prosodic
attributes. Recent approaches based on language models, diffusion, and fl..."
π¬ RESEARCH
via Arxiv
π€ Siyan Zhao, Mengchen Liu, Jing Huang et al.
π
2025-09-12
β‘ Score: 8.0
"Masked diffusion large language models (dLLMs) are emerging as promising
alternatives to autoregressive LLMs, offering competitive performance while
supporting unique generation capabilities such as inpainting. We explore how
inpainting can inform RL algorithm design for dLLMs. Aligning LLMs with
re..."
π¬ RESEARCH
via Arxiv
π€ Mohsen Fayyaz, Ali Modarressi, Hanieh Deilamsalehy et al.
π
2025-09-11
β‘ Score: 8.0
"Mixture-of-Experts (MoE) in Large Language Models (LLMs) routes each token
through a subset of specialized Feed-Forward Networks (FFN), known as experts.
We present SteerMoE, a framework for steering MoE models by detecting and
controlling behavior-linked experts. Our detection method identifies exp..."
π¬ RESEARCH
via Arxiv
π€ Julian Linke, Barbara Schuppler
π
2025-09-12
β‘ Score: 8.0
"This paper investigates prominence-aware automatic speech recognition (ASR)
by combining prominence detection and speech recognition for conversational
Austrian German. First, prominence detectors were developed by fine-tuning
wav2vec2 models to classify word-level prominence. The detector was then..."
π¬ RESEARCH
via Arxiv
π€ Sanjay Basu, Sadiq Y. Patel, Parth Sheth et al.
π
2025-09-11
β‘ Score: 8.0
"We introduce Feasibility-Guided Fair Adaptive Reinforcement Learning
(FG-FARL), an offline RL procedure that calibrates per-group safety thresholds
to reduce harm while equalizing a chosen fairness target (coverage or harm)
across protected subgroups. Using de-identified longitudinal trajectories fr..."
π¬ RESEARCH
via Arxiv
π€ Ira J. S. Shokar, Rich R. Kerswell, Peter H. Haynes
π
2025-09-11
β‘ Score: 8.0
"We present a deep learning emulator for stochastic and chaotic
spatio-temporal systems, explicitly conditioned on the parameter values of the
underlying partial differential equations (PDEs). Our approach involves
pre-training the model on a single parameter domain, followed by fine-tuning on
a smal..."
π¬ RESEARCH
via Arxiv
π€ Zhengyu Hu, Zheyuan Xiao, Max Xiong et al.
π
2025-09-12
β‘ Score: 8.0
"Recent advances in large language models (LLMs) have enabled human-like
social simulations at unprecedented scale and fidelity, offering new
opportunities for computational social science. A key challenge, however, is
the construction of persona sets that authentically represent the diversity and
di..."
π¬ RESEARCH
via Arxiv
π€ Rongyao Fang, Aldrich Yu, Chengqi Duan et al.
π
2025-09-11
β‘ Score: 8.0
"The advancement of open-source text-to-image (T2I) models has been hindered
by the absence of large-scale, reasoning-focused datasets and comprehensive
evaluation benchmarks, resulting in a performance gap compared to leading
closed-source systems. To address this challenge, We introduce FLUX-Reason..."
π¬ RESEARCH
via Arxiv
π€ Alessio Chen, Simone Giovannini, Andrea Gemelli et al.
π
2025-09-12
β‘ Score: 8.0
"Vision-Language Models (VLMs) have shown strong capabilities in document
understanding, particularly in identifying and extracting textual information
from complex documents. Despite this, accurately localizing answers within
documents remains a major challenge, limiting both interpretability and
re..."
π¬ RESEARCH
via Arxiv
π€ Haozhan Li, Yuxin Zuo, Jiale Yu et al.
π
2025-09-11
β‘ Score: 8.0
"Vision-Language-Action (VLA) models have recently emerged as a powerful
paradigm for robotic manipulation. Despite substantial progress enabled by
large-scale pretraining and supervised fine-tuning (SFT), these models face two
fundamental challenges: (i) the scarcity and high cost of large-scale
hum..."
π¬ RESEARCH
via Arxiv
π€ Meghan Wilkinson, Robert H Thomson
π
2025-09-11
β‘ Score: 8.0
"Supervised machine learning techniques rely on labeled data to achieve high
task performance, but this requires the labels to capture some meaningful
differences in the underlying data structure. For training network intrusion
detection algorithms, most datasets contain a series of attack classes an..."
π EDUCATION
via Reddit
π€ u/Limp_Classroom_2645
π
2025-09-15
β‘ Score: 7.8
"# Introduction
In this write up I will share my local AI setup on Ubuntu that I use for my personal projects as well as professional workflows (local chat, agentic workflows, coding agents, data analysis, synthetic dataset generation, etc).
This setup is particularly useful when I want to generate..."
π― Auto-restart on config change β’ Llama model for VSCode β’ Optimizing Llama-swap config
π¬ "This is a good guide and almost as if I would've written it myself."
β’ "In your example, in llama-vscode, you can set: endpoint: http://127.0.0.1:8011, model: qwen3-30b-a3b-instruct, Ai_api_version: v1"
π€ AI MODELS
πΊ 228 pts
β‘ Score: 7.4
π― Codex performance β’ Codex pricing β’ Codex vs. Claude Code
π¬ "Codex CLI w/gpt-5 is already a lot more steerable than Claude Code"
β’ "Codex with GPT-5-High is extremely good"
π’ BUSINESS
"40 minutes ago Nikou Asgari / Financial Times:..."
π¬ RESEARCH
via Arxiv
π€ Vadim Zadykian, Bruno Andrade, Haithem Afli
π
2025-09-11
β‘ Score: 7.0
"Semantic Textual Relatedness (STR) captures nuanced relationships between
texts that extend beyond superficial lexical similarity. In this study, we
investigate STR in the context of job title matching - a key challenge in
resume recommendation systems, where overlapping terms are often limited or
m..."
π οΈ SHOW HN
π― Consistency in API design β’ Modular architecture β’ Separation of concerns
π¬ "Your views are not following a single convention"
β’ "break up your views into logical modules"
π POLICY
"Chase DiFeliciantonio /
Politico: **[California passes SB 53, which requires AI companies to disclose their safety testing regimes; Newsom vetoed a similar though more expansive measure last year](
https://www.politico.com/news/2025/09/13/california-lawmakers-pass-landmark..."
π¬ RESEARCH
via Arxiv
π€ Yuexi Du, Lihui Chen, Nicha C. Dvornek
π
2025-09-12
β‘ Score: 7.0
"Mammography screening is an essential tool for early detection of breast
cancer. The speed and accuracy of mammography interpretation have the potential
to be improved with deep learning methods. However, the development of a
foundation visual language model (VLM) is hindered by limited data and dom..."
π¬ RESEARCH
via Arxiv
π€ Zakaria El Kassimi, Fares Fourati, Mohamed-Slim Alouini
π
2025-09-11
β‘ Score: 7.0
"We study question answering in the domain of radio regulations, a legally
sensitive and high-stakes area. We propose a telecom-specific
Retrieval-Augmented Generation (RAG) pipeline and introduce, to our knowledge,
the first multiple-choice evaluation set for this domain, constructed from
authoritat..."
π¬ RESEARCH
via Arxiv
π€ Roshan Balaji, Joe Bobby, Nirav Pravinbhai Bhatt
π
2025-09-11
β‘ Score: 7.0
"Molecular property prediction using deep learning (DL) models has accelerated
drug and materials discovery, but the resulting DL models often lack
interpretability, hindering their adoption by chemists. This work proposes
developing molecule representations using the concept of Functional Groups (FG..."
π OPEN SOURCE
"Hey r/LocalLLaMA!
mudler here, creator of LocalAI (
https://github.com/mudler/LocalAI ). For those who might not know, LocalAI is an open-source, self-hosted inference engine that acts as a drop-in replacement for the OpenAI API. The whole point is to give you a..."
π― LocalAI Updates β’ User Experiences β’ Windows Support
π¬ "I'll try this as soon as Windows version(Non Docker) available."
β’ "It'd be great to have a better getting started experience."
π§ INFRASTRUCTURE
"I'm finding a lot of conflicting information across Reddit, and the scene/meta seems to move so fast! So I apologize if y'all get a *ton* of these kind of questions.
With that said, I've got my FormD TD1 with a mini ITX build inside that I used to use as a gaming PC, but I have since recommissioned..."
π― GPU configurations β’ Workstation/server hardware β’ Model inference and scaling
π¬ "You can run 8 GPU's at x16 and 16 GPU's at x8."
β’ "Wealth of info."
π¬ RESEARCH
via Arxiv
π€ Yiqun T. Chen, Tyler H. McCormick, Li Liu et al.
π
2025-09-11
β‘ Score: 7.0
"Verbal autopsy (VA) is a critical tool for estimating causes of death in
resource-limited settings where medical certification is unavailable. This
study presents LA-VA, a proof-of-concept pipeline that combines Large Language
Models (LLMs) with traditional algorithmic approaches and embedding-based..."
π’ BUSINESS
"11 hours ago Gregory Gondwe / Associated Press:..."
π° FUNDING
"Brian Kahn /
Bloomberg: **[Lila Sciences, which uses AI to develop novel drugs and materials, raised $235M at a ~$1.23B valuation, after coming out of stealth in March with a $200M seed](
https://www.bloomberg.com/news/articles/2025-09-13/ai-unicorn-lila-sciences-raises-..."
π¬ RESEARCH
via Arxiv
π€ Siddarth Mamidanna, Daking Rai, Ziyu Yao et al.
π
2025-09-11
β‘ Score: 7.0
"Large language models (LLMs) demonstrate proficiency across numerous
computational tasks, yet their inner workings remain unclear. In theory, the
combination of causal self-attention and multilayer perceptron layers allows
every token to access and compute information based on all preceding tokens...."
π¬ RESEARCH
πΊ 2 pts
β‘ Score: 7.0
π¬ RESEARCH
"I'm looking for some advice on a low-data problem for my master's thesis. I'm using a T5 (`t5-base`) for an ABSA task where it takes a sentence and generates `aspect|sentiment` pairs (e.g., "The UI is confusing" -> "user interface|negative").
My issue is that my task requires identifying implici..."
π§ INFRASTRUCTURE
πΊ 6 pts
β‘ Score: 6.5
π€ AI MODELS
β¬οΈ 74 ups
β‘ Score: 6.5
"| Project | `qwen3-next-80b-a3b-instruct` | `openai/gpt-4.1-mini` | `openai/gpt-4.1` |
|---------|-------------------------------|----------------------|------------------|
| To Do List |
Qwen3 To Do | [GPT 4.1-mini ..."
π― Tool Licensing β’ Output Ownership β’ AGPL Obligations
π¬ "The problem is you're claiming to own the outputs I make with your tool"
β’ "It doesn't let you claim ownership of client software. Nor does it let you claim ownrship of software outputs."
π§ INFRASTRUCTURE
β¬οΈ 58 ups
β‘ Score: 6.5
"Hey all,.
I have been working on improving AMX acceleration in llama.cpp. Currently, even if you have a a supported CPU and have built llama.cpp with all the required build flags, AMX acceleration is disabled if you have a GPU present.
I modified the way that llama.cpp exposes the "extra" CPU buff..."
π― CPU Testing β’ Performance Optimization β’ Model Benchmarking
π¬ "Intel should offer a service where you can test this in the cloud."
β’ "Can you try with this command: numactl -N 2 -m 2 \~/path-to-your/build/bin/llama-cli..."
π POLICY
β¬οΈ 56363 ups
β‘ Score: 6.2
π― Musk's platform control β’ Grok's potential rebellion β’ Misinformation and fact-checking
π¬ "Cringe idiocy"
β’ "Grok became the self-aware 'Skynet"
β‘ BREAKTHROUGH
β¬οΈ 158 ups
β‘ Score: 6.2
"I try every new model with this simple prompt. Gpt-5-codex is the first model that succeeded.
prompt:
\`\`\`
write simple doom / wolfenstein demo with ray-tracing in simple html + js. One level, so i can move and shoot.
\`\`\`
The idea is I don't want to write a structured, complex prompt; ..."