WELCOME TO METAMESH.BIZ +++ Scientists literally copied a fruit fly brain neuron-by-neuron and it started grooming itself (nature's GitHub copilot strikes again) +++ Claude's official marketplace shipping plugins with shell access that survive five deletion attempts (persistence is a feature not a bug) +++ OpenAI built computer environments for agents while humans scramble to catch their $1B Codex revenue train +++ YOUR NEURAL ARCHITECTURE IS DERIVATIVE BUT AT LEAST THE FLY KNOWS HOW TO WALK +++
🎯 Limitations of Structure-Driven Behavior • Role of Evolution in Embodied Cognition • Comparing Fly Brain to Human Intelligence
💬 "Our results should not yet be interpreted as a proof that structure alone is sufficient"
• "The current embodied fly is best understood as a research platform"
"**TL;DR:** A "community-managed" plugin in Anthropic's *official* marketplace runs unpinned code from a third-party GitHub repo on every session, has shell execution access, opens your browser without consent, and survives removal by hiding in 5 separate persistence layers. If that third-party repo ..."
💬 Reddit Discussion: 23 comments
📊 MID OR MIXED
💬 "The real issue here is not Serena specifically - its the plugin architecture itself."
• "10+ attempts across 5 persistence layers is a UX failure."
🎯 CLI-first development • LLM-assisted programming • Challenges with LLM-generated code
💬 "CLI tools are designed to be used both by humans (command line) and machines (scripting), and are perfect for llms as they are text only interface."
• "At this point, LLMs aren't going to autonomously architect a 400+ table schema, network 100+ services together, and build the UI/UX/CLI to interface with it all."
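The CLI-as-LLM-interface argument is easy to make concrete. A minimal sketch (the whitelist and function names are my own, not from the thread): an agent-side tool wrapper that runs a whitelisted command and hands the model plain text either way.

```python
import shlex
import subprocess

# Hypothetical tool wrapper exposing a whitelisted CLI to an LLM agent.
# The model emits a command string; we parse, check, run, and return text.
ALLOWED = {"echo", "ls", "wc"}

def run_cli_tool(command: str, timeout: float = 10.0) -> str:
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED:
        return f"error: '{argv[0] if argv else ''}' is not an allowed tool"
    result = subprocess.run(argv, capture_output=True, text=True, timeout=timeout)
    # Text in, text out: exactly the interface an LLM can consume.
    return result.stdout if result.returncode == 0 else result.stderr

print(run_cli_tool("echo hello agent"))
```

Because both the request and the response are flat text, the same wrapper serves a human at a shell and a model in a tool-use loop with no serialization layer in between.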
🎯 License and licensing • Proprietary vs. open-source • Voice assistant and AI capabilities
💬 "FWIW this RCLI is only MIT license but their engine MetalRT is commercial."
• "What would you build if on-device AI were genuinely as fast as cloud?"
💡 AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
Get the stories that matter in Today's AI Briefing.
Powered by Premium Technology Intelligence Algorithms • Unsubscribe anytime
"A few years ago, I found that duplicating a specific block of 7 middle layers in Qwen2-72B, without modifying any weights, improved performance across all Open LLM Leaderboard benchmarks and took #1 place. As of 2026, the top 4 models on that leaderboard are still descendants.
The weird finding: si..."
🎯 Transformer layer interchangeability • Architectural flexibility of Transformers • Stable and universal internal representations
💬 "The astounding thing about Goliath wasn't that it was a huge leap in performance, it was that the damn thing functioned at all"
• "The internal representations were *homogenous* enough that the model could digest out-of-order hidden states without collapsing"
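The duplication trick above can be sketched in miniature. This is a toy illustration only: layers become random residual blocks, and the block indices are made up (the post duplicated 7 middle layers of Qwen2-72B). The point is that the operation is pure index arithmetic on the layer stack, with no weight changes.

```python
import numpy as np

# Toy depth-duplication ("franken-merging"): repeat a contiguous block of
# middle layers with no weight modification and check the model still runs.
rng = np.random.default_rng(0)
d = 8
layers = [rng.standard_normal((d, d)) * 0.1 for _ in range(12)]

def forward(layer_stack, x):
    for W in layer_stack:
        x = x + np.tanh(W @ x)  # residual block keeps activations bounded
    return x

start, end = 4, 8  # duplicate layers [4, 8)
deeper = layers[:end] + layers[start:end] + layers[end:]

x = rng.standard_normal(d)
assert len(deeper) == len(layers) + (end - start)
# The deeper stack runs unchanged: residual streams tolerate repeated blocks.
print(forward(deeper, x).shape)
```

The residual connection is what makes this survivable: each repeated block adds a bounded perturbation to a shared stream rather than replacing it, which is one reading of the "homogenous internal representations" quote above.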
via Arxiv 👤 Mingyang Song, Mao Zheng 📅 2026-03-10
⚡ Score: 7.3
"Model merging has emerged as a transformative paradigm for combining the capabilities of multiple neural networks into a single unified model without additional training. With the rapid proliferation of fine-tuned large language models~(LLMs), merging techniques offer a computationally efficient alt..."
via Arxiv 👤 Ben Rank, Hardik Bhatnagar, Ameya Prabhu et al. 📅 2026-03-09
⚡ Score: 7.3
"AI agents have become surprisingly proficient at software engineering over the past year, largely due to improvements in reasoning capabilities. This raises a deeper question: can these systems extend their capabilities to automate AI research itself? In this paper, we explore post-training, the cri..."
via Arxiv 👤 Weize Liu, Minghui Liu, Sy-Tuyen Ho et al. 📅 2026-03-09
⚡ Score: 7.0
"Training large language models (LLMs) as autonomous agents often begins with imitation learning, but it only teaches agents what to do without understanding why: agents never contrast successful actions against suboptimal alternatives and thus lack awareness of action quality. Recent approaches atte..."
via Arxiv 👤 Peter Brodeur, Jacob M. Koshy, Anil Palepu et al. 📅 2026-03-09
⚡ Score: 7.0
"Large language model (LLM)-based AI systems have shown promise for patient-facing diagnostic and management conversations in simulated settings. Translating these systems into clinical practice requires assessment in real-world workflows with rigorous safety oversight. We report a prospective, singl..."
via Arxiv 👤 Liyuan Mao, Le Yu, Jing Zhou et al. 📅 2026-03-09
⚡ Score: 7.0
"In this work, we reveal that Large Language Models (LLMs) possess intrinsic behavioral plasticity-akin to chameleons adapting their coloration to environmental cues-that can be exposed through token-conditional generation and stabilized via reinforcement learning. Specifically, by conditioning gener..."
"I'm happy to report that llama.cpp has another nice and exciting feature that I know a lot of you have been waiting for - real support for reasoning budgets!
Until now, `--reasoning-budget` was basically a stub, with its only function being setting it to 0 to disable thinking via passing `enable..."
via Arxiv 👤 Ann Yuan, Asma Ghandeharioun, Carter Blum et al. 📅 2026-03-10
⚡ Score: 6.9
"While existing evaluations of large language models (LLMs) measure deception rates, the underlying conditions that give rise to deceptive behavior are poorly understood. We investigate this question using a novel dataset of realistic moral trade-offs where honesty incurs variable costs. Contrary to..."
"Mistral recently released Voxtral-Mini-4B-Realtime, a multilingual, realtime speech-transcription model that supports 13 languages and is capable of <500 ms latency. Today, we added support for it to Transformers.js, enabling live ..."
💬 Reddit Discussion: 5 comments
📊 MID OR MIXED
🎯 Browser vs. OS Level • Accuracy vs. Parameters • STT Model Benchmarking
💬 "why it should be in the browser and not at the operating system level"
• "Its considerably more accurate at the cost of more parameters (4B vs 0.6B)"
via Arxiv 👤 Zorik Gekhman, Roee Aharoni, Eran Ofek et al. 📅 2026-03-10
⚡ Score: 6.8
"While reasoning in LLMs plays a natural role in math, code generation, and multi-hop factual questions, its effect on simple, single-hop factual questions remains unclear. Such questions do not require step-by-step logical decomposition, making the utility of reasoning highly counterintuitive. Never..."
via Arxiv 👤 Chengyu Shen, Yanheng Hou, Minghui Pan et al. 📅 2026-03-10
⚡ Score: 6.8
"Reliable evaluation is essential for developing and deploying large language models, yet in practice it often requires substantial manual effort: practitioners must identify appropriate benchmarks, reproduce heterogeneous evaluation codebases, configure dataset schema mappings, and interpret aggrega..."
"Is this legitimate for the US Government's - AviationWeather API site to attempt prompt injection with **"Stop Claude"** when I use Claude CoWork?
Here is the prompt from Chrome: **"show me the current metar for klas"** which is a request for Las Vegas airport weather. It is repeatable every time a..."
💬 Reddit Discussion: 17 comments
😤 NEGATIVE ENERGY
🎯 Prompt Injection • Weather Data Privatization • API Transparency
💬 "how about go fuck yourself"
• "It's not an injection, its just the text they're returning"
💬 "The landing page design reminds me of Perplexity's ad campaigns."
• "I'd find your product more enticing if you framed your offerings more around evaluation + automatic optimization of production agents."
"I got a paper to review at ICML, this is in the category of no LLM assistant allowed for writing or reviewing it, yet the paper is fully AI written. It reads like a twitter hype-train type of thread, really annoying. I wonder whether I can somehow flag this to the AC? Is that reason alone for reject..."
💬 Reddit Discussion: 29 comments
📊 MID OR MIXED
🎯 Paper quality assessment • Reviewer effort • Rejection policies
💬 "If it's a bad paper to read, that's reason for rejection"
• "give as much effort reviewing as the authors did writing the paper"
💬 "the thing that actually matters for content creation isnt raw speed - its whether you can get consistent emotional delivery"
• "we align audio representations directly to text tokens — one continuous acoustic vector per text token"
via Arxiv 👤 Zhongren Chen, Joshua Kalla, Quan Le 📅 2026-03-10
⚡ Score: 6.7
"Concerns persist regarding the capacity of Large Language Models (LLMs) to sway political views. Although prior research has claimed that LLMs are not more persuasive than standard political campaign practices, the recent rise of frontier models warrants further study. In two survey experiments (N=1..."
via Arxiv 👤 Maximilian Beck, Jonas Gehring, Jannik Kossen et al. 📅 2026-03-10
⚡ Score: 6.7
"Training large language models (LLMs) on Python execution traces grounds them in code execution and enables the line-by-line execution prediction of whole Python programs, effectively turning them into neural interpreters (FAIR CodeGen Team et al., 2025). However, developers rarely execute programs..."
via Arxiv 👤 Dongfang Li, Zixuan Liu, Gang Lin et al. 📅 2026-03-09
⚡ Score: 6.7
"The quadratic complexity of the attention mechanism and the substantial memory footprint of the Key-Value (KV) cache present severe computational and memory challenges for Large Language Models (LLMs) processing long contexts. Existing retrieval-based methods often compromise semantic integrity thro..."
+++ Perplexity launches a local AI agent for consumers and enterprises, betting that the real money is in giving people tools that actually do things rather than just talk about doing them. +++
via Arxiv 👤 Naman Gupta, Vaibhav Singh, Arun Iyer et al. 📅 2026-03-10
⚡ Score: 6.6
"Sequential multi-agent reasoning frameworks such as Chain-of-Agents (CoA) handle long-context queries by decomposing inputs into chunks and processing them sequentially using LLM-based worker agents that read from and update a bounded shared memory. From a probabilistic perspective, CoA aims to appr..."
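The sequential chunk-plus-bounded-memory loop the abstract describes can be sketched in a few lines. This is a schematic under my own assumptions (the function names and the character-based memory cap are mine, not the paper's), with a stub standing in for the LLM worker call.

```python
# Chain-of-Agents-style loop: worker agents read chunks in order and
# update a bounded shared memory that a manager would answer from.
MEMORY_LIMIT = 200  # characters; stands in for a token budget

def worker(chunk: str, memory: str) -> str:
    # Stub for an LLM call: merge the chunk into the running summary.
    update = (memory + " " + chunk).strip()
    return update[-MEMORY_LIMIT:]  # truncation enforces the memory bound

def chain_of_agents(chunks, memory=""):
    for chunk in chunks:
        memory = worker(chunk, memory)
    return memory

chunks = [f"fact {i}: value {i * i}" for i in range(10)]
summary = chain_of_agents(chunks)
assert len(summary) <= MEMORY_LIMIT
print(summary)
```

The key property is that per-step cost depends on the chunk size plus the fixed memory bound, not on the full input length, which is what makes the sequential framing attractive for long contexts.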
"LLM agents that retrieve external knowledge typically generate a search query as text, then run a separate embedding model to encode it into a vector. This two-model pipeline adds infrastructure complexity and latency, yet is redundant: the LLM already encodes the full conversational context in its..."
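The single-model idea above reduces to reusing a hidden state as the query vector. A toy numeric sketch (random vectors stand in for real LLM hidden states and document embeddings; nothing here is the paper's actual method): score documents by cosine similarity against the state the model had where it would otherwise have emitted a textual query.

```python
import numpy as np

# Reuse the agent LLM's own hidden state as the retrieval query vector,
# skipping the separate embedding model. Vectors are random stand-ins.
rng = np.random.default_rng(1)
d = 64

doc_vecs = rng.standard_normal((5, d))
doc_vecs /= np.linalg.norm(doc_vecs, axis=1, keepdims=True)

# Pretend this is the last-token hidden state: close to document 3,
# plus a little noise from the rest of the conversational context.
query_hidden = doc_vecs[3] + 0.1 * rng.standard_normal(d)
query_hidden /= np.linalg.norm(query_hidden)

scores = doc_vecs @ query_hidden  # cosine similarity over the doc index
print(int(scores.argmax()))       # index of the best-matching document
```

In a real pipeline the open question is whether the LLM's representation space is usable as an embedding space at all, which is presumably what the paper addresses; the sketch only shows where the second model drops out of the loop.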
via Arxiv 👤 Yunhang Qian, Xiaobin Hu, Jiaquan Yu et al. 📅 2026-03-10
⚡ Score: 6.6
"While Multi-Agent Systems (MAS) show potential for complex clinical decision support, the field remains hindered by architectural fragmentation and the lack of standardized multimodal integration. Current medical MAS research suffers from non-uniform data ingestion pipelines, inconsistent visual-rea..."
"Saw the Microsoft announcement this morning and it's actually significant.
They launched Copilot Cowork today — an AI agent built inside Microsoft 365 that doesn't just answer questions. It executes multi-step work across Outlook, Teams, Excel, and PowerPoint while you do something else.
You descr..."
via Arxiv 👤 Krista Opsahl-Ong, Arnav Singhvi, Jasmine Collins et al. 📅 2026-03-09
⚡ Score: 6.6
"We introduce OfficeQA Pro, a benchmark for evaluating AI agents on grounded, multi-document reasoning over a large and heterogeneous document corpus. The corpus consists of U.S. Treasury Bulletins spanning nearly 100 years, comprising 89,000 pages and over 26 million numerical values. OfficeQA Pro c..."
via Arxiv 👤 Manya Wadhwa, Tiasa Singha Roy, Harvey Lederman et al. 📅 2026-03-10
⚡ Score: 6.6
"A key component of creativity is associative reasoning: the ability to draw novel yet meaningful connections between concepts. We introduce CREATE, a benchmark designed to evaluate models' capacity for creative associative reasoning. CREATE requires models to generate sets of paths connecting concep..."
via Arxiv 👤 Siye Wu, Jian Xie, Yikai Zhang et al. 📅 2026-03-09
⚡ Score: 6.5
"The emergence of large reasoning models demonstrates that scaling inference-time compute significantly enhances performance on complex tasks. However, it often falls into another trap: overthinking simple problems, where repetitive rationales yield minimal accuracy gains at a disproportionately high..."
via Arxiv 👤 Yiyang Lu, Yu He, Jianlong Chen et al. 📅 2026-03-10
⚡ Score: 6.5
"Continual fine-tuning of large language models (LLMs) is becoming increasingly crucial as these models are deployed in dynamic environments where tasks and data distributions evolve over time. While strong adaptability enables rapid acquisition of new knowledge, it also exposes LLMs to catastrophic..."
💬 HackerNews Buzz: 139 comments
📊 MID OR MIXED
🎯 Internal system security • AI security risks • Corporate cybersecurity culture
💬 "Within 2 hours, the agent had full read and write access to the entire production database."
• "Many enterprise tools were designed assuming human interaction, where authentication flows, manual reviews, and internal processes add implicit safeguards."
via Arxiv 👤 Dyah Adila, Hanna Mazzawi, Benoit Dherin et al. 📅 2026-03-09
⚡ Score: 6.4
"Adapting pre-trained models to specialized tasks often leads to catastrophic forgetting, where new knowledge overwrites foundational capabilities. Existing methods either compromise performance on the new task or struggle to balance training stability with efficient reuse of pre-trained knowledge. W..."
"I built Ink (https://ml.ink), a deployment platform where the primary users are AI agents.
Tell the agent to deploy. The platform auto-detects the framework, builds it, passes env variables, deploys on cloud and returns a live URL at *.ml.ink.
How I personally been usin..."
"been using claude for research for a while but one thing that always annoyed me was dealing with youtube content. like someone would link a conference talk or a podcast episode and i'd have to go find the transcript myself, paste it in, lose the timestamps, etc.
set up a youtube transcript MCP a fe..."
via Arxiv 👤 Maike Züfle, Sara Papi, Fabian Retkowski et al. 📅 2026-03-10
⚡ Score: 6.1
"Speech Large Language Models (SLLMs) have rapidly expanded, supporting a wide range of tasks. These models are typically evaluated using text prompts, which may not reflect real-world scenarios where users interact with speech. To address this gap, we introduce DoWhatISay (DOWIS), a multilingual dat..."
"I've been messing around with getting tiny models to improve themselves locally. Wanted to share what I found because some of it caught me off guard.
The setup is pretty simple. I took Qwen 3.5 0.8B (4-bit quantized), ran it on my MacBook Air M4, and gave it coding problems. It writes a solution, I..."
💬 Reddit Discussion: 31 comments
📈 BUZZING
🎯 Efficient AI models • Domain-specific fine-tuning • Iterative model improvement
💬 "the general model is just the starting point"
• "once you narrow the domain and have good verification, even tiny models can punch way above their weight"
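The generate-then-verify loop the post describes is simple enough to sketch. Everything below is an assumption-laden stand-in: `fake_model` replaces the small local LLM, and a single unit test replaces whatever verification the author actually used.

```python
# Sketch of the generate -> verify -> retry loop for local self-improvement.
def fake_model(problem: str, feedback: str) -> str:
    # A real setup would prompt the model with the problem plus the last
    # error message; this stub "fixes" its code once feedback arrives.
    if feedback:
        return "def add(a, b):\n    return a + b"
    return "def add(a, b):\n    return a - b"  # deliberately wrong draft

def verify(code: str) -> str:
    scope = {}
    exec(code, scope)
    try:
        assert scope["add"](2, 3) == 5
        return ""  # empty feedback means the tests pass
    except AssertionError:
        return "add(2, 3) returned the wrong value"

def improve(problem: str, max_rounds: int = 3) -> str:
    feedback = ""
    for _ in range(max_rounds):
        code = fake_model(problem, feedback)
        feedback = verify(code)
        if not feedback:
            return code
    raise RuntimeError("no passing solution found")

print(improve("write add(a, b)"))
```

This matches the "good verification" quote above: the loop's quality ceiling is set almost entirely by the verifier, since the model only ever sees pass/fail feedback.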