📚 HISTORICAL ARCHIVE - November 13, 2025

                What was happening in AI on 2025-11-13
            

← Nov 12 📊 TODAY'S NEWS 📚 ARCHIVE 🗓️ November 2025 Nov 14 →

                📰 DAILY AI BRIEF
            

On November 13, 2025, Metamesh tracked 45 AI stories, including 4 clustered developments, and ranked them by signal rather than volume. The lead item was Disrupting the first reported AI-orchestrated cyber espionage campaign. Also high in the stack: GPT-5.1 Instant and GPT-5.1 Thinking System Card Addendum and It seems that OpenAI’s inference costs easily eclipsed its revenues.. That combination is why this archive exists: it preserves the day's shape for AI practitioners, not just the last headline that crossed the wire.

The daily ticker's read: WELCOME TO METAMESH.BIZ +++ Chinese hackers using Anthropic's Claude to automate 90% of corporate espionage campaigns (the productivity gains we didn't ask for) +++ Stanford cracked zero-latency encrypted AI inference which sounds impossible until you read.... Read against the ranked story list below, it gives the archive a point of view: what mattered, what was mostly noise, and which threads were worth saving for later comparison.

📊 You are visitor #47291 to this AWESOME site! 📊
Archive from: 2025-11-13 | Preserved for posterity ⚡

Stories from November 13, 2025

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

🔒 SECURITY

AI-orchestrated cyber espionage campaign

2x SOURCES 🌐 📅 2025-11-13

⚡ Score: 9.0

+++ State-sponsored cyber espionage just got a productivity upgrade. Anthropic reported Chinese attackers automated 80-90% of a September campaign using its AI, raising the uncomfortable question of whether safety-conscious builders can actually control who benefits from their work. +++

Disrupting the first reported AI-orchestrated cyber espionage campaign

via HackerNews 👤 koakuma-chan 📅 2025-11-13

🔺 82 pts ⚡ Score: 9.2

💬 HackerNews Buzz: 37 comments 😐 MID OR MIXED

🎯 AI security flaws • Cybersecurity automation • Ethical AI development

💬 "The simplicity of 'we just told it that it was doing legitimate work' is both surprising and unsurprising" • "Defenders should not have to engage in an costly and error-prone search of truth about what's actually deployed"

🚀 HOT STORY

GPT-5.1 model rollout

4x SOURCES 🌐 📅 2025-11-12

⚡ Score: 8.9

+++ Two flavors of the same model: one for vibes, one for reasoning. Users get customizable chat personalities because apparently we needed style options more than we needed capability leaps. +++

GPT-5.1 Instant and GPT-5.1 Thinking System Card Addendum

via HackerNews 👤 wertyk 📅 2025-11-12

🔺 1 pts ⚡ Score: 9.0

We’re rolling out GPT-5.1 and new customization features. Ask us Anything.

via r/OpenAI 👤 u/OpenAI 📅 2025-11-12

⬆️ 446 ups ⚡ Score: 7.4

"You asked for a warmer, more conversational model, and we heard your feedback. GPT-5.1 is rolling out to all users in ChatGPT over the next week. We also launched 8 unique chat styles in the ChatGPT personalization tab, making it easier to set the tone and style that feels right for you. Ask us..."

💬 Reddit Discussion: 919 comments 👍 LOWKEY SLAPS

🎯 Guardrail Restrictions • Personalization & Control • Creative Expression

💬 "It's impossible to write anything right now without the safety router softening, censoring, or limiting you." • "I'd really like to be able to use the legacy models I'm paying for directly, without being randomly routed to other models."

OpenAI rolls out GPT-5.1 Instant, “warmer” and “more conversational”, and GPT-5.1 Thinking, “easier to understand and faster”, starting with paid ChatGPT users

via Techmeme 👤 Openai 📅 2025-11-12

⚡ Score: 7.2

GPT-5.1: A smarter, more conversational ChatGPT

via HackerNews 👤 tedsanders 📅 2025-11-12

🔺 343 pts ⚡ Score: 6.8

💬 HackerNews Buzz: 386 comments 🐝 BUZZING

🎯 Chatbot limitations • Conversational AI trade-offs • Concerns about AI misinformation

💬 "RLHF seems to have shaped the responses so they only give the appearance of being correct" • "warm models showed substantially higher error rates (+10 to +30 percentage points)"

💰 FUNDING

It seems that OpenAI’s inference costs easily eclipsed its revenues.

via r/OpenAI 👤 u/AmorFati01 📅 2025-11-12

⬆️ 382 ups ⚡ Score: 8.4

"Exclusive: Here's How Much OpenAI Spends On Inference and Its Revenue Share With Microsoft According to the documents viewed by this newsletter, OpenAI spent $5.02 billion on inference alone with Microsoft Azure..."

💬 Reddit Discussion: 99 comments 🐝 BUZZING

🎯 OpenAI's Compute Costs • OpenAI's Revenue Projections • Free Chatbot Strategies

💬 "OpenAI is being charged much less than other Azure customers" • "OpenAI's revenue is 'well more' than $13 billion"

🔬 RESEARCH

[R] LeJEPA: New Yann Lecun paper

via r/MachineLearning 👤 u/jacobgorm 📅 2025-11-13

⬆️ 190 ups ⚡ Score: 7.9

"Abstract: Learning manipulable representations of the world and its dynamics is central to AI. Joint-Embedding Predictive Architectures (JEPAs) offer a promising blueprint, but lack of practical guidance and theory has led to ad - hoc R&D. We present a comprehensive theory of JEPAs and instantia..."

💬 Reddit Discussion: 26 comments 👍 LOWKEY SLAPS

🎯 Theoretical research • Simplicity vs. complexity • Transformer models

💬 "Massive respect to Lecun for continuing to push for things that make theoretical sense" • "This is like that meme: Statistical Learning: 'Gentlemen, our learner overgeneralizes...' Neural Networks: 'STACK MORE LAYERS"

🤖 AI MODELS

Jan-v2-VL: 8B model for long-horizon tasks, improving Qwen3-VL-8B’s agentic capabilities almost 10x

via r/LocalLLaMA 👤 u/Delicious_Focus3465 📅 2025-11-13

⬆️ 455 ups ⚡ Score: 7.9

"Hi, this is Bach from the Jan team. We’re releasing Jan-v2-VL, an 8B vision–language model aimed at long-horizon, multi-step tasks starting from browser use. Jan-v2-VL-high executes 49 steps without failure on the Long-Horizon Execution benchmark, while the base model (Qwen3-VL-8B-Thinking) stops a..."

💬 Reddit Discussion: 70 comments 🐝 BUZZING

🎯 Model capabilities • Benchmark comparisons • Model naming

💬 "Models tend to degrade as tasks get longer, while reasoning/thinking models sustain much longer chains" • "Dense vision agents in the 7-9B range are an absolute key part of the ecosystem"

🛠️ SHOW HN

Show HN: KV Marketplace – share LLM attention caches across GPUs like memcached

via HackerNews 👤 nsomani 📅 2025-11-12

🔺 1 pts ⚡ Score: 7.9

⚡ BREAKTHROUGH

Marble multimodal world model

4x SOURCES 🌐 📅 2025-11-12

⚡ Score: 7.8

+++ World Labs unveiled Marble, a multimodal world model that generates and edits spatially consistent 3D environments. The AI industry collectively nods, updates their research roadmap, and pretends this wasn't inevitable. +++

Marble: A Multimodal World Model

via HackerNews 👤 meetpateltech 📅 2025-11-12

🔺 195 pts ⚡ Score: 8.6

💬 HackerNews Buzz: 52 comments 🐝 BUZZING

🎯 Spatial intelligence • World modeling • Game engine-ready 3D

💬 "This is bunk, it has nothing to do with intelligence and everything to do with hyping the oxymoronic/paradox branded as spatial intelligence." • "It offers almost no improvement over the earliest 3DGS demo, let alone the addition of any characters."

⚡ BREAKTHROUGH

Google's AI is now able to compete in Math Olympiads and rank among top three

via HackerNews 👤 bookofjoe 📅 2025-11-13

🔺 3 pts ⚡ Score: 7.8

🔒 SECURITY

INF Tech accessing Nvidia chips via Indonesia

2x SOURCES 🌐 📅 2025-11-13

⚡ Score: 7.7

+++ INF Tech found a workaround to US restrictions by routing Nvidia silicon through Jakarta, revealing that enforcement theater and actual enforcement remain distant cousins in the chip containment strategy. +++

An investigation traces how Shanghai-based AI startup INF Tech accessed advanced Nvidia chips at an Indosat data center in Jakarta, despite US export controls

via Techmeme 👤 Wsj 📅 2025-11-13

⚡ Score: 8.0

🛠️ TOOLS

LMSYS just launched Code Arena, live coding evals with real developer voting instead of static benchmarks

via r/ChatGPT 👤 u/Weird_Perception1728 📅 2025-11-13

⬆️ 28 ups ⚡ Score: 7.6

"LMSYS just launched Code Arena, and it's bringing live, community-driven evaluation to AI coding, something that's been missing from static benchmarks. Instead of "write a function to reverse a string," models actually have to plan out implementations step-by-step, use tools to read and edit files,..."

🛠️ TOOLS

Cross-GPU prefix KV reuse with RDMA / NVLink - early experimental results

via r/LocalLLaMA 👤 u/nsomani 📅 2025-11-12

⬆️ 14 ups ⚡ Score: 7.5

"Been experimenting with a small prototype to reuse transformer KV attention states across GPUs. Current inference frameworks only reuse KV prefixes locally, so multi-GPU setups redo prefill work even when the prefix is identical. I implemented a simple path where one process exports its prefix KV t..."

🔬 RESEARCH

Whisper leak: a side-channel attack on large language models

via HackerNews 👤 neapolisbeach 📅 2025-11-13

🔺 3 pts ⚡ Score: 7.5

🔄 OPEN SOURCE

Open source x 3: GRPO training with OpenEnv, vLLM, and Oumi

via HackerNews 👤 stefanwebb 📅 2025-11-13

🔺 1 pts ⚡ Score: 7.5

🛠️ TOOLS

Stanford's new Equivariant Encryption enables private AI inference with zero slowdown - works with any symmetric encryption

via r/LocalLLaMA 👤 u/Proof-Possibility-54 📅 2025-11-13

⬆️ 86 ups ⚡ Score: 7.3

"Just came across this paper (arXiv:2502.01013) that could be huge for private local model deployment. The researchers achieved 99.999% accuracy on encrypted neural network inference with literally zero additional latency. Not "minimal" overhead - actually zero. The key insight: instead of usin..."

💬 Reddit Discussion: 13 comments 👍 LOWKEY SLAPS

🎯 Encrypted inference • Frequency analysis attack • Limitations of approach

💬 "If the entire inference process is offloaded to some (partially) homomorphic external system, such that you're putting in a vector of encrypted input token IDs and getting a stream of encrypted output token IDs, doesn't the output stream simply become a basic substitution cipher, which is trivial to break with frequency analysis?" • "For language models, you'd need something like: - Homomorphic encryption (with the 10,000x slowdown), or - TEEs (trusted execution environments), or - The approach would need fundamental changes to handle discrete token spaces"

🔒 SECURITY

ChatGPT Vulnerability Exposed Underlying Cloud Infrastructure

via HackerNews 👤 salkahfi 📅 2025-11-13

🔺 1 pts ⚡ Score: 7.3

🔒 SECURITY

Never give a api key to Claude Code Web

via r/claudeai 👤 u/goldenfox27 📅 2025-11-13

⬆️ 10 ups ⚡ Score: 7.2

"3 days ago I did a little experiment where I asked Claude Code web (the beta) to do a simple task: generate an LLM test and test it using an Anthropic API key to run the test. It was in the default sandbox environment. The API key was passed via env var to Claude. This was 3 days ago and today I ..."

💬 Reddit Discussion: 7 comments 👍 LOWKEY SLAPS

🎯 Billing and Usage • API Key Issues • Useful Community Resource

💬 "The prompt for this test was around 200 tokens" • "Claude leaked it somehow after simply reading my .env"

🛠️ TOOLS

SlopStop: Community-driven AI slop detection in Kagi Search

via HackerNews 👤 msub2 📅 2025-11-13

🔺 141 pts ⚡ Score: 7.2

💬 HackerNews Buzz: 58 comments 😤 NEGATIVE ENERGY

🎯 AI-generated content quality • Detecting AI-generated "slop" • Future of AI and content

💬 "AI slop eventually will get as good as your average blogger." • "This is not solving the problem."

🔧 INFRASTRUCTURE

Infinite scale: The architecture behind the Azure AI superfactory

via HackerNews 👤 aprdm 📅 2025-11-12

🔺 1 pts ⚡ Score: 7.0

🔬 RESEARCH

LLM Output Drift in Financial Workflows: Validation and Mitigation (arXiv)

via HackerNews 👤 raffisk 📅 2025-11-12

🔺 14 pts ⚡ Score: 7.0

💬 HackerNews Buzz: 8 comments 🐝 BUZZING

🎯 LLM output consistency • Regulated financial tasks • Model size and reliability

💬 "Don't use LLMs for financial workflows." • "These things are Markov chains. You can not expect consistent results."

💰 FUNDING

OpenAI says it plans to report stunning annual losses through 2028—and then turn wildly profitable just two years later | Fortune

via r/artificial 👤 u/fortune 📅 2025-11-12

⬆️ 645 ups ⚡ Score: 7.0

"External link discussion - see full content at original source."

💬 Reddit Discussion: 192 comments 👍 LOWKEY SLAPS

🎯 AI Hype and Broken Promises • Corporate Greed and Corruption • Skepticism Towards Tech Companies

💬 "trust me bro" • "We totally created AGI bro believe us"

🔬 RESEARCH

[R][P] CellARC: cellular automata based abstraction and reasoning benchmark (paper + dataset + leaderboard + baselines)

via r/MachineLearning 👤 u/Putrid_Construction3 📅 2025-11-12

⬆️ 10 ups ⚡ Score: 6.9

"TL;DR: CellARC is a synthetic benchmark for abstraction/reasoning in ARC-AGI style, built from multicolor 1D cellular automata. Episodes are serialized to 256 tokens for quick iteration with small models. CellARC decouples generalization from anthropomorphic priors, supports unlimited difficulty-co..."

🎨 CREATIVE

Nano Banana can be prompt engineered for nuanced AI image generation

via HackerNews 👤 minimaxir 📅 2025-11-13

🔺 273 pts ⚡ Score: 6.8

💬 HackerNews Buzz: 77 comments 🐝 BUZZING

🎯 AI image generation capabilities • Prompt engineering challenges • Image editing workflows

💬 "Nano Banana manages to maintain the geometry of the scene, while applying new styles to it." • "I am currently working with 7 layers prompts to control for environment, camera, subject, composition, light, colors and overall quality"

🔔 OPEN SOURCE

Interesting to see an open-source model genuinely compete with frontier proprietary models for coding

via r/LocalLLaMA 👤 u/Technical_Gene4729 📅 2025-11-13

⬆️ 85 ups ⚡ Score: 6.7

"So Code Arena just dropped their new live coding benchmark, and the tier 1 results are sparking an interesting open vs proprietary debate. GLM-4.6 is the only open-source model in the top tier. It's MIT licensed, the most permissive license possible. It's sitting at rank 1 (score: 1372) alongside C..."

💬 Reddit Discussion: 17 comments 🐝 BUZZING

🎯 AI model capabilities • Hardware performance • Open-source AI tools

💬 "GLM 4.6 being MIT is actually more valuable than Claude being slightly higher scored" • "Running a SOTA model on a gamer rig"

🤖 AI MODELS

Baidu unveils two AI chips: the M100 for efficient MoE inference, launching in 2026, and the M300 for training super-large multimodal models, coming in 2027

via Techmeme 👤 Scmp 📅 2025-11-13

⚡ Score: 6.7

🛠️ TOOLS

Live VLM WebUI - Web interface for Ollama vision models with real-time video streaming

via r/LocalLLaMA 👤 u/lektoq 📅 2025-11-12

⬆️ 147 ups ⚡ Score: 6.6

"Hey r/LocalLLaMA! 👋 I'm a Technical Marketing Engineer at NVIDIA working on Jetson, and we just open-sourced **Live VLM WebUI** \- a tool for testing Vision Language Models locally with real-time video streaming. # What is it? Stream your webcam ..."

💬 Reddit Discussion: 18 comments 🐐 GOATED ENERGY

🎯 Remote camera support • Offline/CPU-only deployment • Audio/speech integration

💬 "Perfect for development/testing or when you're just using cloud VLM APIs" • "Are you thinking of running everything locally, or would you be open to cloud APIs for the audio part?"

🎮 GAMING

Google DeepMind unveils SIMA 2, a video-game-playing agent built on top of Gemini to navigate and solve problems inside 3D virtual worlds like Goat Simulator 3

via Techmeme 👤 Technologyreview 📅 2025-11-13

⚡ Score: 6.6

🔧 INFRASTRUCTURE

Running a 1 Trillion Parameter Model on a PC with 128 GB RAM + 24 GB VRAM

via r/LocalLLaMA 👤 u/pulse77 📅 2025-11-13

⬆️ 121 ups ⚡ Score: 6.5

"Hi again, just wanted to share that this time I've successfully run **Kimi K2 Thinking (1T parameters)** on **llama.cpp** using my desktop setup: * **CPU:** Intel i9-13900KS * **RAM:** 128 GB DDR5 @ 4800 MT/s * **GPU:** RTX 4090 (24 GB VRAM) * **Storage:** 4TB NVMe SSD (7300 MB/s read) I'm using *..."

💬 Reddit Discussion: 43 comments 🐝 BUZZING

🎯 Benchmarking model performance • Model size and speed tradeoffs • Community reactions to benchmarks

💬 "Dont run anything more than 120b total" • "Don't run anything more than 32b if it's dense"

🤖 AI MODELS

Microsoft unveils an AI “super factory”, a new class of hubs built for AI training, in Atlanta as part of its Fairwater network of data centers

via Techmeme 👤 Wsj 📅 2025-11-12

⚡ Score: 6.5

🔬 RESEARCH

RF-DETR: Neural Architecture Search for Real-Time Detection Transformers

via r/computervision 👤 u/aloser 📅 2025-11-13

⬆️ 42 ups ⚡ Score: 6.5

"The RF-DETR paper is finally here! Thrilled to finally be able to share that RF-DETR was developed using a weight-sharing neural architecture search for end-to-end model optimization. RF-DETR is SOTA for realtime object detection on COCO and RF100-VL and greatly ..."

💬 Reddit Discussion: 9 comments 🐐 GOATED ENERGY

🎯 Evaluation accuracy • Model comparison • Pose estimation

💬 "See Appendix B in this paper" • "Roboflow now has these standardized model evaluation"

💰 FUNDING

A deep dive into Microsoft's AI strategy, including OpenAI, data center investments, neocloud renting, GitHub Copilot, MAI models, and the Maia chip

via Techmeme 👤 Newsletter 📅 2025-11-13

⚡ Score: 6.5

⚖️ ETHICS

Anthropic open sources a method to score AI model political evenhandedness; Gemini 2.5 Pro got 97%, Grok 4 96%, Claude Opus 4.1 95%, GPT-5 89%, and Llama 4 66%

via Techmeme 👤 Axios 📅 2025-11-13

⚡ Score: 6.4

🤖 AI MODELS

Anthropic plans to spend $50B on a US AI infrastructure buildout, starting with Texas and New York data centers in partnership with Fluidstack, opening in 2026

via Techmeme 👤 Cnbc 📅 2025-11-12

⚡ Score: 6.4

🤖 AI MODELS

Three new OpenAI models are now available in Cursor

via r/cursor 👤 u/lrobinson2011 📅 2025-11-13

⬆️ 22 ups ⚡ Score: 6.2

"You can now use: 1. GPT-5.1: For everyday tasks like planning and debugging 2. GPT-5.1 Codex: For ambitious coding tasks 3. GPT-5.1 Codex Mini: For cost-efficient changes Let us know what you think!"

💬 Reddit Discussion: 9 comments 😤 NEGATIVE ENERGY

🎯 Windows performance • Codex issues • Platform compatibility

💬 "Have you guys tested WSL vs WINDOWS and got a solid comparison?" • "Do these models run well on Windows? Do they need WSL?"

🏢 BUSINESS

Q&A with Satya Nadella on business models for AGI, Copilot, Microsoft AI, the hyperscale business, the OpenAI partnership, capex, sovereign AI efforts, and more

via Techmeme 👤 Dwarkesh 📅 2025-11-13

⚡ Score: 6.2

🤖 AI MODELS

Baidu unveils Ernie 5.0, an AI model to process and generate text, images, audio, and video, claiming it beats GPT-5-High and Gemini 2.5 Pro on some benchmarks

via Techmeme 👤 Venturebeat 📅 2025-11-13

⚡ Score: 6.1

🤖 AI MODELS

Satya Nadella says Microsoft has access to “all” of OpenAI's custom AI chip work and plans to use it to help develop its own in-house chip

via Techmeme 👤 Bloomberg 📅 2025-11-13

⚡ Score: 6.1

🛠️ SHOW HN

Show HN: LLM fine-tuning without infra or ML expertise (early access)

via HackerNews 👤 Jacques2Marais 📅 2025-11-13

🔺 3 pts ⚡ Score: 6.1

Stories from November 13, 2025

AI-orchestrated cyber espionage campaign

GPT-5.1 model rollout

Marble multimodal world model

INF Tech accessing Nvidia chips via Indonesia

📡 AI NEWS BUT ACTUALLY GOOD