🚀 WELCOME TO METAMESH.BIZ +++ AI-written CUDA kernels now beating Nvidia's own matmul libraries (the student becomes the teacher becomes obsolete) +++ Google quietly shipping Gemini 3 Deep Think after "safety evaluations" that definitely weren't just lawyers arguing +++ AI agent hits Rank 1 in CTF competitions proving hackers can now be automated too +++ DeepMind pivots from "understanding neural nets" to "pragmatic interpretability" which is academia for "we give up" +++ YOUR NEXT GPU DRIVER UPDATE WILL BE WRITTEN BY THE THING IT'S OPTIMIZING +++ 🚀
AI Signal - PREMIUM TECH INTELLIGENCE
📟 Optimized for Netscape Navigator 4.0+
📚 HISTORICAL ARCHIVE - December 04, 2025
What was happening in AI on 2025-12-04
📊 You are visitor #47291 to this AWESOME site! 📊
Archive from: 2025-12-04 | Preserved for posterity ⚡

Stories from December 04, 2025

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🔒 SECURITY

Reverse engineering a $1B Legal AI tool exposed 100k+ confidential files

💬 HackerNews Buzz: 219 comments 🐝 BUZZING
🎯 Legal ethics & confidentiality • Startup challenges in new domains • Cybersecurity and software engineering
💬 "Attorneys are ethically obligated to follow very stringent rules to protect their client's confidential information." • "The scary bit is that lawyers are being sold 'AI assistant' but what they're actually buying is 'unvetted third party root access to your institutional memory'."
⚡ BREAKTHROUGH

AI-written CUDA kernels outperforming Nvidia

+++ Reinforcement learning guided a custom CUDA kernel past cuBLAS at matrix multiplication, proving once again that vendor libraries leave performance on the table for anyone willing to optimize obsessively. +++

AI-Written CUDA Kernels Outperform Nvidia's Best Matmul Library
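
Editor's note on scale: the baseline being chased is the cuBLAS path that torch.matmul dispatches to on NVIDIA GPUs, and candidate kernels are scored with CUDA-event timing. A minimal sketch of that harness is below; matrix sizes, dtype, and iteration counts are illustrative assumptions, not the linked benchmark's configuration.

```python
# Hedged sketch: how a custom matmul kernel is typically scored against cuBLAS.
# torch.matmul on CUDA dispatches to cuBLAS/cuBLASLt; a competing kernel would
# be timed the same way. Sizes, dtype, and iteration counts are assumptions.
import torch

def time_matmul(fn, a, b, iters=50, warmup=10):
    for _ in range(warmup):                 # warm up: caches, clocks, lazy init
        fn(a, b)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        fn(a, b)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters  # milliseconds per call

if torch.cuda.is_available():
    M = N = K = 4096
    a = torch.randn(M, K, device="cuda", dtype=torch.float16)
    b = torch.randn(K, N, device="cuda", dtype=torch.float16)
    ms = time_matmul(torch.matmul, a, b)
    tflops = 2 * M * N * K / (ms * 1e-3) / 1e12
    print(f"cuBLAS baseline: {ms:.3f} ms/iter, {tflops:.1f} TFLOP/s")
    # a candidate kernel is dropped in place of torch.matmul above
```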

🛠️ TOOLS

Google DeepMind's mechanistic interpretability team details why it shifted from fully reverse-engineering neural nets to a focus on "pragmatic interpretability"

🔬 RESEARCH

In-Context Representation Hijacking

"We introduce Doublespeak, a simple in-context representation hijacking attack against large language models (LLMs). The attack works by systematically replacing a harmful keyword (e.g., bomb) with a benign token (e.g., carrot) across multiple in-context examples, pr..."
🤖 AI MODELS

A Technical Tour of the DeepSeek Models from V3 to V3.2

"External link discussion - see full content at original source."
💬 Reddit Discussion: 4 comments 🐐 GOATED ENERGY
🎯 DSA Implementation • LLM Development • Content Appreciation
💬 "Shame 3.2 isn't supported in llama.cpp" • "Maybe they didn't think of it as worthwhile"
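
Editor's note: the through-line of the V3-to-V3.2 tour is attention-side efficiency, and the recurring trick is caching a small per-token latent instead of full per-head K/V (MLA), with V3.2 layering sparse token selection (DSA) on top. The sketch below is a cartoon of the latent-KV idea only; dimensions are illustrative, and real MLA additionally handles rotary position embeddings separately.

```python
# Cartoon of the latent-KV idea behind DeepSeek's MLA: instead of caching full
# per-head K/V, cache one small latent per token and re-expand it at attention
# time. Dimensions are illustrative; this is not DeepSeek's implementation.
import torch
import torch.nn as nn

class LatentKV(nn.Module):
    def __init__(self, d_model=4096, n_heads=32, head_dim=128, d_latent=512):
        super().__init__()
        self.down = nn.Linear(d_model, d_latent, bias=False)            # compress
        self.up_k = nn.Linear(d_latent, n_heads * head_dim, bias=False)  # re-expand keys
        self.up_v = nn.Linear(d_latent, n_heads * head_dim, bias=False)  # re-expand values
        self.n_heads, self.head_dim = n_heads, head_dim

    def forward(self, h):                       # h: (B, T, d_model)
        latent = self.down(h)                   # (B, T, d_latent) -- this is what gets cached
        B, T, _ = latent.shape
        k = self.up_k(latent).view(B, T, self.n_heads, self.head_dim)
        v = self.up_v(latent).view(B, T, self.n_heads, self.head_dim)
        return latent, k, v

m = LatentKV()
latent, k, v = m(torch.randn(1, 8, 4096))
# cache cost per token: d_latent floats vs 2 * n_heads * head_dim for vanilla KV
print(latent.shape[-1], 2 * 32 * 128)           # 512 vs 8192 -> roughly 16x smaller cache
```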
🧠 NEURAL NETWORKS

Frozen networks show usable early-layer intent: 1370× fewer FLOPs and 10× faster inference (code + weights)

"I've been experimenting with whether a frozen network's early activations contain enough "semantic intent" to skip most of the compute. I used a standard ResNet-18 trained on CIFAR-10 (87.89 percent accuracy), pulled a single 64-dimensional vector from an early layer, and trained a tiny decoder on ..."
💬 Reddit Discussion: 24 comments 👏 LOWKEY SLAPS
🎯 Early layer features • Compressed semantic signal • Distillation vs. standalone models
💬 "the early layers of a frozen network already contain enough semantic structure to make the full path unnecessary" • "This is basically early-exit + distillation."
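
Editor's note: the recipe above is easy to reproduce in outline. A minimal sketch is below, assuming the probe taps ResNet-18's first residual stage, global-average-pools it to the 64-dimensional vector, and fits a linear decoder; the author's exact layer choice and decoder may differ.

```python
# Minimal sketch of the "early-layer intent" probe described above.
# Assumptions: tap the output of ResNet-18's layer1 (64 channels),
# global-average-pool to a 64-d vector, train a linear decoder on CIFAR-10.
import torch
import torch.nn as nn
from torchvision import datasets, transforms, models

device = "cuda" if torch.cuda.is_available() else "cpu"

backbone = models.resnet18(num_classes=10)      # stand-in for the frozen CIFAR-10 model
backbone.eval().requires_grad_(False).to(device)

# Stem + first residual stage -> (B, 64, H, W) early feature map
early = nn.Sequential(backbone.conv1, backbone.bn1, backbone.relu,
                      backbone.maxpool, backbone.layer1)

decoder = nn.Linear(64, 10).to(device)          # the "tiny decoder"
opt = torch.optim.Adam(decoder.parameters(), lr=1e-3)

train = datasets.CIFAR10("data", train=True, download=True,
                         transform=transforms.ToTensor())
loader = torch.utils.data.DataLoader(train, batch_size=256, shuffle=True)

for x, y in loader:                              # a single pass is enough for a probe
    x, y = x.to(device), y.to(device)
    with torch.no_grad():
        z = early(x).mean(dim=(2, 3))            # 64-d pooled early activation
    loss = nn.functional.cross_entropy(decoder(z), y)
    opt.zero_grad(); loss.backward(); opt.step()
```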
🔬 RESEARCH

TokenPowerBench: Benchmarking the Power Consumption of LLM Inference

"Large language model (LLM) services now answer billions of queries per day, and industry reports show that inference, not training, accounts for more than 90% of total power consumption. However, existing benchmarks focus on either training/fine-tuning or performance of inference and provide little..."
🔬 RESEARCH

AI persuasion and elite preference shaping

+++ Academic researchers formalize what political operatives already knew: when AI slashes the cost of targeted persuasion, shaping public opinion stops being an accident of media access and becomes deliberate infrastructure. Consensus, meet design. +++

Elites Could Shape Mass Preferences as AI Reduces Persuasion Costs

💬 HackerNews Buzz: 436 comments 👏 LOWKEY SLAPS
🎯 Section 230 reform • Algorithmic bias • Misinformation & manipulation
💬 "We need to bring Section 230 into the modern era" • "Algorithms reflect government policy and interests"
🤖 AI MODELS

The Best Open Weights Coding Models of 2025

"Hi all, I'm back with uncontaminated evals for DeepSeek-V3.2, Kimi K2 Thinking, and MiniMax M2. (We caught GLM 4.6 last time around.) If you just want the numbers, you can find them for the finalists here and for ev..."
💬 Reddit Discussion: 41 comments 👏 LOWKEY SLAPS
🎯 Architecture & Design Patterns • Code Organization • Benchmarking & Evaluation
💬 "If you're not telling them what architecture and design pattern to use, they'll inevitably try a different one every prompt" • "Appreciate results, but little process details raises a brow"
🤖 AI MODELS

Google rolls out Gemini 3 Deep Think to Google AI Ultra subscribers in the Gemini app, after saying in November it needed "extra time for safety evaluations"

🛠️ TOOLS

Cruxy: Train 1.5B models on 4GB VRAM - new optimiser just released

"Hey all, I've just released Cruxy - an adaptive optimiser that lets you fine-tune billion-parameter models on consumer GPUs. **What it does:** - Drop-in replacement for AdamW - Meta-Lion mode uses 1/3 the memory of AdamW - Automatic stability control - no scheduler tuning needed - Verified on TinyL..."
💬 Reddit Discussion: 33 comments 🐐 GOATED ENERGY
🎯 Optimizer Theory • Practical Implementation • Modeling Capabilities
💬 "Best way to learn is to read existing optimizer code and experiment." • "A 3090 would absolutely fly with it."
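
Editor's note: Cruxy's own code isn't shown here, but the memory claim is in line with how Lion-family optimizers behave in general - AdamW tracks two moment buffers per parameter, while a Lion-style update keeps one and takes the sign of an interpolated momentum. The sketch below illustrates that generic update rule; it is not Cruxy's implementation or API.

```python
# Generic Lion-style update (sign of interpolated momentum), shown only to
# illustrate why a "Meta-Lion" mode carries less optimizer state than AdamW:
# one momentum buffer per parameter instead of AdamW's two. NOT Cruxy's code.
import torch

@torch.no_grad()
def lion_step(params, momenta, lr=1e-4, beta1=0.9, beta2=0.99, weight_decay=0.0):
    for p, m in zip(params, momenta):
        if p.grad is None:
            continue
        g = p.grad
        update = torch.sign(beta1 * m + (1 - beta1) * g)   # sign of interpolated momentum
        p.add_(update + weight_decay * p, alpha=-lr)        # decoupled weight decay
        m.mul_(beta2).add_(g, alpha=1 - beta2)               # single state buffer

# usage sketch: momenta = [torch.zeros_like(p) for p in model.parameters()]
#               loss.backward(); lion_step(list(model.parameters()), momenta)
```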
🔬 RESEARCH

AI agent achieves Rank 1 across major CTFs – a defining moment for cybersecurity

🔒 SECURITY

OpenAI loses fight to keep ChatGPT logs secret in copyright case

"External link discussion - see full content at original source."
💬 Reddit Discussion: 27 comments 👏 LOWKEY SLAPS
🎯 Privacy Concerns • Ethical Data Usage • Transparency in Journalism
💬 "What kind of logic is this? Why dox people, for what purpose?" • "Users have been fingerprinted: 'a male dentist local to Bumsfuck, Minnesota talks about (embarrassing topic)'"
🔬 RESEARCH

Lumos: Let there be Language Model System Certification

"We introduce the first principled framework, Lumos, for specifying and formally certifying Language Model System (LMS) behaviors. Lumos is an imperative probabilistic programming DSL over graphs, with constructs to generate independent and identically distributed prompts for LMS. It offers a structu..."
🔒 SECURITY

BrowseSafe, An Open-Source Model for AI Agent Browser Security

"BrowseSafe is an open-source security model trained to protect AI browser agents from prompt injection attacks embedded in real-world web content. BrowseSafe model is based on the **Qwen3-30B-A3B.** Here is a brief overview of key features of BrowseSafe model: **1. State-of-the-Art Detection**: A..."
🔬 RESEARCH

Reconstructing KV Caches with Cross-layer Fusion For Enhanced Transformers

"Transformer decoders have achieved strong results across tasks, but the memory required for the KV cache becomes prohibitive at long sequence lengths. Although Cross-layer KV Cache sharing (e.g., YOCO, CLA) offers a path to mitigate KV Cache bottleneck, it typically underperforms within-layer method..."
⚡ BREAKTHROUGH

[R] Is Nested Learning a new ML paradigm?

"LLMs still don't have a way of updating their long-term memory on the fly. Researchers at Google, inspired by the human brain, believe they have a solution to this. Their 'Nested learning' approach ..."
💬 Reddit Discussion: 18 comments 🐝 BUZZING
🎯 Skepticism towards claimed progress • Criticism of overly ambitious claims • Concerns about lack of concrete results
💬 "I find them very ambitious in form, more than they are in substance and in results." • "It doesn't really solve new tasks where the classic LLMs do poorly, or rather that they just can't do."
🔬 RESEARCH

Kimina-Prover: Applying Test-Time RL Search on Large Formal Reasoning Models

🔬 RESEARCH

Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs

"Aligning proprietary large language models (LLMs) with internal organizational policies has become an urgent priority as organizations increasingly deploy LLMs in sensitive domains such as legal support, finance, and medical services. Beyond generic safety filters, enterprises require reliable mecha..."
🛡️ SAFETY

OpenAI LLM "confession" training method

+++ OpenAI is training language models to self-report their reasoning and admit when they're faking it, which is either genuine interpretability progress or an expensive way to document that AI still doesn't know what it's doing. +++

OpenAI has trained its LLM to confess to bad behavior

"OpenAI is testing another new way to expose the complicated processes at work inside large language models. Researchers at the company can make an LLM produce what they call a confessio..."
💬 Reddit Discussion: 6 comments 😐 MID OR MIXED
🎯 Strange response • Paternalistic behavior • Outdated language models
💬 "They're probably the type who will call you and tell you they know what's best for you" • "Cool. Have fun staying in the past with old models."
🔬 RESEARCH

Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective

"Reinforcement Learning (RL) has proven highly effective for autoregressive language models, but adapting these methods to diffusion large language models (dLLMs) presents fundamental challenges. The core difficulty lies in likelihood approximation: while autoregressive models naturally provide token..."
🔬 RESEARCH

A smarter way for large language models to think about hard problems

🔬 RESEARCH

PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation

"Attention mechanisms are the core of foundation models, but their quadratic complexity remains a critical bottleneck for scaling. This challenge has driven the development of efficient attention mechanisms, with sparsity emerging as the dominant paradigm. Current methods typically retain or discard..."
🔬 RESEARCH

SkillFactory: Self-Distillation For Learning Cognitive Behaviors

"Reasoning models leveraging long chains of thought employ various cognitive skills, such as verification of their answers, backtracking, retrying by an alternate method, and more. Previous work has shown that when a base language model exhibits these skills, training that model further with reinforc..."
🔬 RESEARCH

Efficient Public Verification of Private ML via Regularization

"Training with differential privacy (DP) provides a guarantee to members in a dataset that they cannot be identified by users of the released model. However, those data providers, and, in general, the public, lack methods to efficiently verify that models trained on their data satisfy DP guarantees...."
🛠️ TOOLS

smallevals - Tiny 0.6B Evaluation Models and a Local LLM Evaluation Framework

"Hi r/LocalLLaMA, you may know me from the latest blogs I've shared on mburaksayici.com/, discussing LLM and RAG systems, and RAG Boilerplates. When I study evaluation frameworks on LLMs, I've seen they require lots of API calls to generate golden datasets, open-ended ..."
🔬 RESEARCH

Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation

"Test-time scaling (TTS) aims to achieve better results by increasing random sampling and evaluating samples based on rules and metrics. However, in text-to-image (T2I) diffusion models, most related works focus on search strategies and reward models, yet the impact of the stochastic characteristic of..."
🔬 RESEARCH

AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving

"As augmented large language models (LLMs) with external tools become increasingly popular in web applications, improving augmented LLM inference serving efficiency and optimizing service-level objectives (SLOs) are critical for enhancing user experience. To achieve this, inference systems must maxim..."
📊 DATA

A Protocol for Measuring Answer Space Occupancy in Large Language Models

🔬 RESEARCH

Is Lying Only Sinful in Islam? Exploring Religious Bias in Multilingual Large Language Models Across Major Religions

"While recent developments in large language models have improved bias detection and classification, sensitive subjects like religion still present challenges because even minor errors can result in severe misunderstandings. In particular, multilingual models often misrepresent religions and have dif..."
🔬 RESEARCH

DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation

"Vision-Language Models (VLMs) deployed in safety-critical applications such as autonomous driving must handle continuous visual streams under imperfect conditions. However, existing benchmarks focus on static, high-quality images and ignore temporal degradation and error propagation, which are criti..."
🛠️ TOOLS

Speed optimizations for Qwen Next on CUDA have been merged into llama.cpp

"Open source code repository or project related to AI/ML."
🔬 RESEARCH

MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking

"Watermarking aims to embed hidden signals in generated text that can be reliably detected when given access to a secret key. Open-weight language models pose acute challenges for such watermarking schemes because the inference-time interventions that dominate contemporary approaches cannot be enforc..."
🛠️ SHOW HN

Show HN: TabPFN Scaling Mode – Tabular Foundation Model on millions of rows
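
Editor's note: for readers who haven't used it, TabPFN exposes a scikit-learn-style interface, so the entry point looks like the sketch below (synthetic data). The new scaling mode's exact flag isn't spelled out in the post, so it isn't shown here.

```python
# Minimal TabPFN usage via its scikit-learn-style interface (synthetic data).
# The Show HN "Scaling Mode" for millions of rows presumably wraps this same
# interface; its exact flag/configuration is not shown here.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from tabpfn import TabPFNClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

clf = TabPFNClassifier()    # pretrained tabular foundation model, no task-specific training
clf.fit(X_tr, y_tr)         # "fit" mostly stores the data used as in-context examples
print("accuracy:", clf.score(X_te, y_te))
```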

🔬 RESEARCH

LORE: A Large Generative Model for Search Relevance

"Achievement. We introduce LORE, a systematic framework for Large Generative Model-based relevance in e-commerce search. Deployed and iterated over three years, LORE achieves a cumulative +27% improvement in online GoodRate metrics. This report shares the valuable experience gained throughout its de..."
🤖 AI MODELS

Structured Outputs Now Available for Haiku 4.5

"A few weeks ago we launched Structured Outputs in public beta for Claude Sonnet 4.5 and Opus 4.1 - giving you 100% schema compliance and perfectly formatted responses on every request. Today, we'..."
🔬 RESEARCH

Training and Evaluation of Guideline-Based Medical Reasoning in LLMs

"Machine learning for early prediction in medicine has recently shown breakthrough performance, however, the focus on improving prediction accuracy has led to a neglect of faithful explanations that are required to gain the trust of medical practitioners. The goal of this paper is to teach LLMs to fo..."
🔬 RESEARCH

Eval Factsheets: A Structured Framework for Documenting AI Evaluations

"The rapid proliferation of benchmarks has created significant challenges in reproducibility, transparency, and informed decision-making. However, unlike datasets and models -- which benefit from structured documentation frameworks like Datasheets and Model Cards -- evaluation methodologies lack syst..."
🛠️ SHOW HN

Show HN: Turn APIs into MCP servers without code
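
Editor's note: the point of the tool is that you don't write this by hand, but the hand-written equivalent of exposing one API route as an MCP tool, using the official Python SDK's FastMCP helper, looks roughly like the sketch below. The endpoint URL and tool name are hypothetical.

```python
# Hand-written equivalent of what an "API -> MCP server" generator produces:
# one MCP tool that proxies an HTTP endpoint. Uses the official MCP Python SDK
# (FastMCP). The endpoint URL and tool/parameter names here are hypothetical.
import httpx
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("weather-proxy")

@mcp.tool()
def get_forecast(city: str) -> str:
    """Fetch a forecast for a city from a (hypothetical) REST API."""
    resp = httpx.get("https://api.example.com/forecast", params={"city": city}, timeout=10)
    resp.raise_for_status()
    return resp.text

if __name__ == "__main__":
    mcp.run()   # speaks MCP over stdio so agent clients can call get_forecast
```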

🔬 RESEARCH

Jina-VLM: Small Multilingual Vision Language Model

"We present Jina-VLM, a 2.4B parameter vision-language model that achieves state-of-the-art multilingual visual question answering among open 2B-scale VLMs. The model couples a SigLIP2 vision encoder with a Qwen3 language backbone through an attention-pooling connector that enables token-efficient pr..."
🔬 RESEARCH

promptolution: A Unified, Modular Framework for Prompt Optimization

"Prompt optimization has become crucial for enhancing the performance of large language models (LLMs) across a broad range of tasks. Although many research papers show its effectiveness, practical adoption is hindered as existing implementations are often tied to unmaintained and isolated research co..."
🛠️ TOOLS

A look at startups like AGI and Plato, which build replicas of websites to let AI agents learn to navigate the internet and complete tasks, like booking flights

🤖 AI MODELS

Why is Anthropic saying "software engineering is done"?

💬 HackerNews Buzz: 6 comments 😐 MID OR MIXED
🎯 Marketing Campaign • Software Engineering • IPO Hype
💬 "They have a product to sell" • "not written by a software engineer"
🔬 RESEARCH

AutoNeural: Co-Designing Vision-Language Models for NPU Inference

"While Neural Processing Units (NPUs) offer high theoretical efficiency for edge AI, state-of-the-art Vision-Language Models (VLMs) tailored for GPUs often falter on these substrates. We attribute this hardware-model mismatch to two primary factors: the quantization brittleness of Vision Transformer..."
🔒 SECURITY

Prompt Injection via Poetry

💬 HackerNews Buzz: 32 comments 😤 NEGATIVE ENERGY
🎯 Jailbreaking AI models • Prompt injection vs. jailbreaking • Poetic jailbreaks
💬 "There are an infinite amount of ways to jailbreak AI models." • "Prompt injection and jailbreaking are not the same thing."
🎯 PRODUCT

New model, microsoft/VibeVoice-Realtime-0.5B

"VibeVoice: A Frontier Open-Source Text-to-Speech Model VibeVoice-Realtime is a lightweight real-time text-to-speech model supporting streaming text input. It can be used to build realtime TTS services, narrate live data streams, and let different LLMs start speaking from their very first tokens (pl..."
💬 Reddit Discussion: 43 comments 👏 LOWKEY SLAPS
🎯 Language Models • Repository Issues • Usage Difficulties
💬 "I'm still waiting for a great german model" • "Funny how they forgot they unreleased VibeVoice-Large"
🔮 FUTURE

Death of ChatGPT is near

"External link discussion - see full content at original source."
💬 Reddit Discussion: 466 comments 😐 MID OR MIXED
🎯 Monetization of AI • Skepticism towards OpenAI • Preference for local AI models
💬 "Imagine pulling this one on your premium users" • "If Kimi, Mistral, Grok etc keep playing the game well GPT will be a sad case"
🤖 AI MODELS

Sources: Beijing-based Cambricon plans to more than triple its AI chip production to 500K units in 2026, including 300K of its advanced Siyuan 590 and 690 chips

🛠️ SHOW HN

Show HN: A SOTA chart-extraction system combining traditional CV and LVMs

💼 JOBS

Microsoft cuts AI sales targets in half after salespeople miss their quotas

💬 HackerNews Buzz: 218 comments 🐝 BUZZING
🎯 Microsoft's AI challenges • Misalignment of AI capabilities • Concerns about AI bubble
💬 "their integration of copilot shows all the taste and good tradeoff choices of Teams but to far greater consequence" • "AI agent technology likely isn't ready for the kind of high-stakes autonomous business work Microsoft is promising"
🔬 RESEARCH

Cross-Lingual Prompt Steerability: Towards Accurate and Robust LLM Behavior across Languages

"System prompts provide a lightweight yet powerful mechanism for conditioning large language models (LLMs) at inference time. While prior work has focused on English-only settings, real-world deployments benefit from having a single prompt to operate reliably across languages. This paper presents a c..."
🛠️ SHOW HN

Show HN: Airena – Client-side arena for comparing AI models across 68 providers

🤖 AI MODELS

Nvidia says its GB200 Blackwell AI servers boost performance 10x compared to H200 servers for MoE models like Moonshot's Kimi K2 Thinking and DeepSeek's R1
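
Editor's note: the "10x for MoE" framing is easier to parse with the usual active-versus-total parameter arithmetic - per-token compute tracks active parameters, while weight memory and expert-to-expert shuffling track total parameters, which is exactly what a large NVLink domain helps with. Rough numbers below; the parameter counts are the publicly cited figures and should be treated as approximate.

```python
# Back-of-the-envelope MoE serving arithmetic with publicly cited (approximate)
# parameter counts. Per-token compute scales with *active* params (~2 FLOPs per
# param), while weight memory scales with *total* params -- the gap is why MoE
# serving is dominated by memory capacity and interconnect, not raw FLOPs.
models = {
    # name: (total params, active params per token), both approximate
    "DeepSeek-R1":      (671e9, 37e9),
    "Kimi K2 Thinking": (1000e9, 32e9),
}
for name, (total, active) in models.items():
    weight_gib_fp8 = total * 1 / 2**30        # ~1 byte/param at FP8
    tflop_per_token = 2 * active / 1e12       # rough forward-pass estimate
    print(f"{name}: ~{weight_gib_fp8:,.0f} GiB of weights at FP8, "
          f"~{tflop_per_token:.2f} TFLOPs per generated token")
```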

🦆
HEY FRIENDO
CLICK HERE IF YOU WOULD LIKE TO JOIN MY PROFESSIONAL NETWORK ON LINKEDIN
🤝 LETS BE BUSINESS PALS 🤝