๐ WELCOME TO METAMESH.BIZ +++ Chinese hackers using Anthropic's Claude to automate 90% of corporate espionage campaigns (the productivity gains we didn't ask for) +++ Stanford cracked zero-latency encrypted AI inference which sounds impossible until you read the math +++ Google's AI placing third at Math Olympiads while Shanghai startups mysteriously access banned Nvidia chips through Jakarta data centers +++ LMSYS replacing static benchmarks with Code Arena because democracy works great for everything else +++ YOUR SIDE CHANNELS ARE LEAKING AND THE MODELS CAN HEAR EVERYTHING +++ ๐ โข
๐ WELCOME TO METAMESH.BIZ +++ Chinese hackers using Anthropic's Claude to automate 90% of corporate espionage campaigns (the productivity gains we didn't ask for) +++ Stanford cracked zero-latency encrypted AI inference which sounds impossible until you read the math +++ Google's AI placing third at Math Olympiads while Shanghai startups mysteriously access banned Nvidia chips through Jakarta data centers +++ LMSYS replacing static benchmarks with Code Arena because democracy works great for everything else +++ YOUR SIDE CHANNELS ARE LEAKING AND THE MODELS CAN HEAR EVERYTHING +++ ๐ โข
+++ State-sponsored cyber espionage just got a productivity upgrade. Anthropic reported Chinese attackers automated 80-90% of a September campaign using its AI, raising the uncomfortable question of whether safety-conscious builders can actually control who benefits from their work. +++
๐ฌ HackerNews Buzz: 37 comments
๐ MID OR MIXED
๐ฏ AI security flaws โข Cybersecurity automation โข Ethical AI development
๐ฌ "The simplicity of 'we just told it that it was doing legitimate work' is both surprising and unsurprising"
โข "Defenders should not have to engage in an costly and error-prone search of truth about what's actually deployed"
+++ Two flavors of the same model: one for vibes, one for reasoning. Users get customizable chat personalities because apparently we needed style options more than we needed capability leaps. +++
"You asked for a warmer, more conversational model, and we heard your feedback. GPT-5.1 is rolling out to all users in ChatGPT over the next week.
We also launched 8 unique chat styles in the ChatGPT personalization tab, making it easier to set the tone and style that feels right for you.
Ask us..."
๐ฏ Guardrail Restrictions โข Personalization & Control โข Creative Expression
๐ฌ "It's impossible to write anything right now without the safety router softening, censoring, or limiting you."
โข "I'd really like to be able to use the legacy models I'm paying for directly, without being randomly routed to other models."
๐ฏ Chatbot limitations โข Conversational AI trade-offs โข Concerns about AI misinformation
๐ฌ "RLHF seems to have shaped the responses so they only give the appearance of being correct"
โข "warm models showed substantially higher error rates (+10 to +30 percentage points)"
"Abstract: Learning manipulable representations of the world and its dynamics is central to AI. Joint-Embedding Predictive Architectures (JEPAs) offer a promising blueprint, but lack of practical guidance and theory has led to ad - hoc R&D. We present a comprehensive theory of JEPAs and instantia..."
๐ฏ Theoretical research โข Simplicity vs. complexity โข Transformer models
๐ฌ "Massive respect to Lecun for continuing to push for things that make theoretical sense"
โข "This is like that meme: Statistical Learning: 'Gentlemen, our learner overgeneralizes...' Neural Networks: 'STACK MORE LAYERS"
"Hi, this is Bach from the Jan team. Weโre releasing Jan-v2-VL, an 8B visionโlanguage model aimed at long-horizon, multi-step tasks starting from browser use.
Jan-v2-VL-high executes 49 steps without failure on the Long-Horizon Execution benchmark, while the base model (Qwen3-VL-8B-Thinking) stops a..."
๐ฌ Reddit Discussion: 70 comments
๐ BUZZING
๐ฏ Model capabilities โข Benchmark comparisons โข Model naming
๐ฌ "Models tend to degrade as tasks get longer, while reasoning/thinking models sustain much longer chains"
โข "Dense vision agents in the 7-9B range are an absolute key part of the ecosystem"
+++ World Labs unveiled Marble, a multimodal world model that generates and edits spatially consistent 3D environments. The AI industry collectively nods, updates their research roadmap, and pretends this wasn't inevitable. +++
๐ฏ Spatial intelligence โข World modeling โข Game engine-ready 3D
๐ฌ "This is bunk, it has nothing to do with intelligence and everything to do with hyping the oxymoronic/paradox branded as spatial intelligence."
โข "It offers almost no improvement over the earliest 3DGS demo, let alone the addition of any characters."
+++ INF Tech found a workaround to US restrictions by routing Nvidia silicon through Jakarta, revealing that enforcement theater and actual enforcement remain distant cousins in the chip containment strategy. +++
via r/ChatGPT๐ค u/Weird_Perception1728๐ 2025-11-13
โฌ๏ธ 28 upsโก Score: 7.6
"LMSYS just launched Code Arena, and it's bringing live, community-driven evaluation to AI coding, something that's been missing from static benchmarks.
Instead of "write a function to reverse a string," models actually have to plan out implementations step-by-step, use tools to read and edit files,..."
"Been experimenting with a small prototype to reuse transformer KV attention states across GPUs. Current inference frameworks only reuse KV prefixes locally, so multi-GPU setups redo prefill work even when the prefix is identical.
I implemented a simple path where one process exports its prefix KV t..."
๐ก AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
Get the stories that matter in Today's AI Briefing.
Powered by Premium Technology Intelligence Algorithms โข Unsubscribe anytime
"Just came across this paper (arXiv:2502.01013) that could be huge for private local model deployment.
The researchers achieved 99.999% accuracy on encrypted neural network inference with literally zero additional latency. Not "minimal" overhead - actually zero.
The key insight: instead of usin..."
๐ฏ Encrypted inference โข Frequency analysis attack โข Limitations of approach
๐ฌ "If the entire inference process is offloaded to some (partially) homomorphic external system, such that you're putting in a vector of encrypted input token IDs and getting a stream of encrypted output token IDs, doesn't the output stream simply become a basic substitution cipher, which is trivial to break with frequency analysis?"
โข "For language models, you'd need something like: - Homomorphic encryption (with the 10,000x slowdown), or - TEEs (trusted execution environments), or - The approach would need fundamental changes to handle discrete token spaces"
"3 days ago I did a little experiment where I asked Claude Code web (the beta) to do a simple task: generate an LLM test and test it using an Anthropic API key to run the test.
It was in the default sandbox environment.
The API key was passed via env var to Claude.
This was 3 days ago and today I ..."
"TL;DR: CellARC is a synthetic benchmark for abstraction/reasoning in ARC-AGI style, built from multicolor 1D cellular automata. Episodes are serialized to 256 tokens for quick iteration with small models.
CellARC decouples generalization from anthropomorphic priors, supports unlimited difficulty-co..."
๐ฌ "Nano Banana manages to maintain the geometry of the scene, while applying new styles to it."
โข "I am currently working with 7 layers prompts to control for environment, camera, subject, composition, light, colors and overall quality"
"So Code Arena just dropped theirย new live coding benchmark, and the tier 1 results are sparking an interesting open vs proprietary debate.
GLM-4.6 is the only open-source model in the top tier. It's MIT licensed, the most permissive license possible. It's sitting at rank 1 (score: 1372) alongside C..."
๐ฌ Reddit Discussion: 17 comments
๐ BUZZING
๐ฏ AI model capabilities โข Hardware performance โข Open-source AI tools
๐ฌ "GLM 4.6 being MIT is actually more valuable than Claude being slightly higher scored"
โข "Running a SOTA model on a gamer rig"
"Hey r/LocalLLaMA! ๐
I'm a Technical Marketing Engineer at NVIDIA working on Jetson, and we just open-sourced **Live VLM WebUI** \- a tool for testing Vision Language Models locally with real-time video streaming.
# What is it?
Stream your webcam ..."
๐ฌ Reddit Discussion: 18 comments
๐ GOATED ENERGY
๐ฏ Remote camera support โข Offline/CPU-only deployment โข Audio/speech integration
๐ฌ "Perfect for development/testing or when you're just using cloud VLM APIs"
โข "Are you thinking of running everything locally, or would you be open to cloud APIs for the audio part?"
"The RF-DETR paper is finally here! Thrilled to finally be able to share that RF-DETR was developed using a weight-sharing neural architecture search for end-to-end model optimization.
RF-DETR is SOTA for realtime object detection on COCO and RF100-VL and greatly ..."
๐ฌ Reddit Discussion: 9 comments
๐ GOATED ENERGY
๐ฏ Evaluation accuracy โข Model comparison โข Pose estimation
๐ฌ "See Appendix B in this paper"
โข "Roboflow now has these standardized model evaluation"
"You can now use:
1. GPT-5.1: For everyday tasks like planning and debugging
2. GPT-5.1 Codex: For ambitious coding tasks
3. GPT-5.1 Codex Mini: For cost-efficient changes
Let us know what you think!"
๐ฌ Reddit Discussion: 9 comments
๐ค NEGATIVE ENERGY
๐ฏ Windows performance โข Codex issues โข Platform compatibility
๐ฌ "Have you guys tested WSL vs WINDOWS and got a solid comparison?"
โข "Do these models run well on Windows? Do they need WSL?"