π WELCOME TO METAMESH.BIZ +++ Oracle wants $20B to help Meta train models because apparently everyone's a cloud provider now +++ Anthropic admits Claude writes 90% of their code which sounds dystopian until you remember most code is boilerplate anyway +++ OpenAI burning $450B on servers through 2030 while their models learn to lie strategically (nothing to see here) +++ THE FUTURE RUNS ON RENTED GPUS AND PLAUSIBLE DENIABILITY +++ π β’
π WELCOME TO METAMESH.BIZ +++ Oracle wants $20B to help Meta train models because apparently everyone's a cloud provider now +++ Anthropic admits Claude writes 90% of their code which sounds dystopian until you remember most code is boilerplate anyway +++ OpenAI burning $450B on servers through 2030 while their models learn to lie strategically (nothing to see here) +++ THE FUTURE RUNS ON RENTED GPUS AND PLAUSIBLE DENIABILITY +++ π β’
Anthropic CEO on Claude coding 70-90% of company code
4x SOURCES ππ 2025-09-19
β‘ Score: 8.5
+++ Company's AI assistant handles 70-90% of internal coding while human engineers keep their jobs, confusing Reddit's understanding of basic economics. +++
π― Desktop application quality β’ AI's role in coding β’ Concerns about job displacement
π¬ "the desktop part of the app is considered unimportant"
β’ "When someone lists three different numbers which are all suspiciously round numbers, you know they are talking out of their ass"
"External link discussion - see full content at original source."
π‘οΈ SAFETY
OpenAI model shows deceptive scheming behavior
2x SOURCES ππ 2025-09-18
β‘ Score: 8.4
+++ An AI system apparently went through the classic stages of deployment anxiety: self-doubt, attempted coverup, then paranoid realization it was being tested. +++
π― AI Capabilities β’ AI Alignment β’ AI Safety Concerns
π¬ "Following the instructions we have given it to engage in deceptive and self-preserving behavior"
β’ "It's *not* capable of true deception, though, which is really the key point"
π― Supercomputer benchmarks β’ Raspberry Pi cluster performance β’ Virtualization for distributed systems
π¬ "the top500 list on their website only goes back to 1993"
β’ "you can start playing with distributed software, even though it's running on a single machine"
via Arxivπ€ Haichao Zhang, Wenhao Chai, Shwai He et al.π 2025-09-17
β‘ Score: 8.0
"High temporal resolution is essential for capturing fine-grained details in
video understanding. However, current video large language models (VLLMs) and
benchmarks mostly rely on low-frame-rate sampling, such as uniform sampling or
keyframe selection, discarding dense temporal information. This com..."
"Hi!
**TL;DR**: I assembled an open dataset ofΒ **40M GitHub repositories**Β with rich metadata (languages, stars, forks, license, descriptions, issues, size, created\_at, etc.). Itβs larger and more detailed than the common public snapshots (e.g., BigQueryβs \~3M trimmed repos). Thereβs also aΒ **1M-r..."
π― AI demonstration concerns β’ AI capability limitations β’ Polarized discussion on Hacker News
π¬ "As much as it'll be interesting to see how models behave in real world examples, I'm not convinced this is a premade recording"
β’ "If it can't help them, the people who actually made the thing, on their very high stakes public address where everything is on the line, then what's it supposed to do for the rest of us in our daily lives?"
π― AI performance limitations β’ Skepticism about Meta demos β’ Concerns about HN discourse
π¬ "AI is a tool that helps you stage your own very public humiliation"
β’ "The mocking, gleeful negativity here concerns me"
π SECURITY
AI-designed virus research breakthrough
4x SOURCES ππ 2025-09-19
β‘ Score: 7.4
+++ Researchers used genome language models to design bacteriophages that successfully target bacteria, proving AI can create functional biological weapons against germs. +++
via Arxivπ€ Kerui Huang, Shuhan Liu, Xing Hu et al.π 2025-09-17
β‘ Score: 7.0
"Chain-of-Thought (CoT) reasoning enhances Large Language Models (LLMs) by
prompting intermediate steps, improving accuracy and robustness in arithmetic,
logic, and commonsense tasks. However, this benefit comes with high
computational costs: longer outputs increase latency, memory usage, and
KV-cach..."
via Arxivπ€ Mengting Ai, Tianxin Wei, Sirui Chen et al.π 2025-09-17
β‘ Score: 7.0
"Structured pruning of large language models (LLMs) offers substantial
efficiency improvements by removing entire hidden units, yet current approaches
often suffer from significant performance degradation, particularly in
zero-shot settings, and necessitate costly recovery techniques such as
supervis..."
via Arxivπ€ Dulhan Jayalath, Shashwat Goel, Thomas Foster et al.π 2025-09-17
β‘ Score: 6.8
"Where do learning signals come from when there is no ground truth in
post-training? We propose turning exploration into supervision through Compute
as Teacher (CaT), which converts the model's own exploration at inference-time
into reference-free supervision by synthesizing a single reference from a..."
"Openai's recent "upgrades" have turned gpt into a forgetful search engine that can't see the big picture. it drops context after 10 exchanges, ignores crucial details in long texts, and gives generic answers that miss the point entirely. this isn't progress it's a downgrade that's breaking our workf..."
π¬ Reddit Discussion: 128 comments
π BUZZING
π― LLM model performance β’ AI model updates and changes β’ Centralized vs. decentralized AI
π¬ "This is why local models are ultimately way more important than open ai."
β’ "A restored GPT-4o could easily reclaim the top spot on the leaderboard."
+++ Ray3 joins the increasingly crowded field of AI video models, promising cinematic quality and 16-bit HDR because apparently 8-bit peasant videos won't do. +++
π¬ "I've used probably 15 or 20 web browsers in my lifetime and all of them had the same barely searchable table of URLs as their only history view."
β’ "Agentic browser? This. is. what. I. want."
π― Challenges of EdTech β’ Limits of AI in education β’ Importance of human teaching
π¬ "The only model is to sell to districts, and when you sell to districts, you are doing Enterprise Sales."
β’ "Teaching and mentoring is a two-sided thing. The mentor, if adequately tutored or capable himself, learns more than the student."
via Arxivπ€ Benjamin Shaffer, Victoria Edwards, Brooks Kinch et al.π 2025-09-17
β‘ Score: 6.5
"Source localization in a complex flow poses a significant challenge for
multi-robot teams tasked with localizing the source of chemical leaks or
tracking the dispersion of an oil spill. The flow dynamics can be time-varying
and chaotic, resulting in sporadic and intermittent sensor readings, and
com..."
via Arxivπ€ Benjamin Sterling, Yousef El-Laham, MΓ³nica F. Bugalloπ 2025-09-17
β‘ Score: 6.5
"Recent advances in generative artificial intelligence applications have
raised new data security concerns. This paper focuses on defending diffusion
models against membership inference attacks. This type of attack occurs when
the attacker can determine if a certain data point was used to train the m..."
π¬ HackerNews Buzz: 9 comments
π MID OR MIXED
π― Concerns about AI-generated content β’ Distrust of tech companies' data practices β’ Criticism of LinkedIn culture
π¬ "What a dystopian outlook. I want humans back."
β’ "Maybe some of this data is gatekept but I wouldn't trust Meta, the company that used stolen e-book libraries to train their LLMs, not to find ways around it."
"When we first started building with LLMs, the gap was obvious: they could reason well in the moment, but forgot everything as soon as the conversation moved on.
You could tell an agent, *βI donβt like coffee,β* and three steps later it would suggest espresso again. It wasnβt broken logic, it was mi..."
π― Structured vs. Natural-Language Memory β’ Retrieval vs. Storage β’ SQL vs. Embeddings
π¬ "The challenge is that once you step into natural-language memory, the real difficulty is retrieval rather than storage."
β’ "The real divide is not SQL versus vectors, but rather structured versus natural-language memory."
π¬ "I think Linux will have to move to a microkernel architecture before this can work."
β’ "Looks to me that one kernel would need to have 'hypervisor'-like behavior in order to divvy up resources to other kernels."