π HISTORICAL ARCHIVE - March 15, 2026
What was happening in AI on 2026-03-15
π You are visitor #47291 to this AWESOME site! π
Archive from: 2026-03-15 | Preserved for posterity β‘
π Filter by Category
Loading filters...
π¬ RESEARCH
πΊ 57 pts
β‘ Score: 8.5
π― Inference Cost β’ MCTS Applications β’ Compute Budgets
π¬ "MCTS uses more inference compute on a per-sample basis than GRPO"
β’ "I wonder why MCTS is not more popular as a test time compute harness"
π₯ HEALTHCARE
β¬οΈ 66 ups
β‘ Score: 7.8
"Saw a few posts about Paul Conyngham designing an mRNA cancer vaccine for his dog using ChatGPT and AlphaFold. A lot of people are curious on how he actually did it - including me! Sox I dug into the detailsβ¦
Here is an exact 7-step pipeline to replicate his work, or sequence and analyze your own D..."
π― Vaccine synthesis complexity β’ Cost and feasibility β’ Regulatory barriers in healthcare
π¬ "This was a specific vaccine for a specific dog for a specific cancer."
β’ "Regulatory barriers in healthcare are there for a reason."
π€ AI MODELS
πΊ 96 pts
β‘ Score: 7.6
π― Progression analysis β’ Influence visualization β’ LLM architecture insights
π¬ "understand the threads of evolutions and revolution"
β’ "teach you something about LLM Architecture"
π¬ RESEARCH
via Arxiv
π€ Ninghui Li, Kaiyuan Zhang, Kyle Polley et al.
π
2026-03-12
β‘ Score: 7.3
"This article, a lightly adapted version of Perplexity's response to NIST/CAISI Request for Information 2025-0035, details our observations and recommendations concerning the security of frontier AI agents. These insights are informed by Perplexity's experience operating general-purpose agentic syste..."
π’ BUSINESS
β¬οΈ 334 ups
β‘ Score: 7.3
"Dragging the controllers of the 3 parameters left or right automatically adjusts the chart in a real time. And you get that from a six word prompt."
π― AI hype in edtech β’ Dismissal of education sector β’ Skepticism towards "AI" startups
π¬ "The chart generating startup industry is doomed!"
β’ "How much were you paid to make this post OP"
π¬ RESEARCH
via Arxiv
π€ Yushi Bai, Qian Dong, Ting Jiang et al.
π
2026-03-12
β‘ Score: 7.3
"Long-context agentic workflows have emerged as a defining use case for large language models, making attention efficiency critical for both inference speed and serving cost. Sparse attention addresses this challenge effectively, and DeepSeek Sparse Attention (DSA) is a representative production-grad..."
π¬ RESEARCH
via Arxiv
π€ Krishnakumar Balasubramanian, Shiva Prasad Kasiviswanathan
π
2026-03-12
β‘ Score: 7.2
"Continual post-training of generative models is widely used, yet a principled understanding of when and why forgetting occurs remains limited. We develop theoretical results under a two-mode mixture abstraction (representing old and new tasks), proposed by Chen et al. (2025) (arXiv:2510.18874), and..."
π οΈ TOOLS
β¬οΈ 3005 ups
β‘ Score: 7.2
"I want to share something I built with Claude Code this past week because I think it shows what AI-assisted development can actually do when pointed at a genuinely hard problem.
Disney Infinity 1.0 (2013) is a game where you place physical figures on a base to play as characters. Each character is ..."
π― Reverse engineering β’ Disassembly analysis β’ Gameplay restrictions
π¬ "This is the kind of use case that actually demonstrates what these tools are capable of"
β’ "The fact that you had to trace 13 separate validation call sites through a stripped binary is the part most people will gloss over"
π οΈ TOOLS
β¬οΈ 511 ups
β‘ Score: 7.1
"Iβve been enjoying the Claude Code CLI for a while now, but managing multiple agents became kinda messy. I tried PixelHQ and the VS Code plugin, but they didn't quite get it right for me.
I ended up buildingΒ **Recon**, a tmux-native dashboard to track them all. I might have spent a bit too much tim..."
π― Monitoring multiple AI agents β’ Leveraging tmux for agent management β’ Metrics for agent performance
π¬ "The hardest part of running multiple agents isn't monitoring them, it's keeping them from stepping on each other."
β’ "Having a visual state that isn't just a wall of scrolling logs makes the whole thing feel way more manageable."
π¬ RESEARCH
via Arxiv
π€ Alexandre Le Mercier, Thomas Demeester, Chris Develder
π
2026-03-12
β‘ Score: 7.1
"State space models (SSMs) like Mamba have gained significant traction as efficient alternatives to Transformers, achieving linear complexity while maintaining competitive performance. However, Hidden State Poisoning Attacks (HiSPAs), a recently discovered vulnerability that corrupts SSM memory throu..."
π€ AI MODELS
πΊ 2 pts
β‘ Score: 7.0
π‘ AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
Get the stories that matter in Today's AI Briefing.
Powered by Premium Technology Intelligence Algorithms β’ Unsubscribe anytime
π οΈ SHOW HN
πΊ 7 pts
β‘ Score: 7.0
π― Email Automation for Agents β’ Scalable Email Infrastructure β’ Anti-Abuse Measures
π¬ "every AI agent that needs to sign up for a website needs a real email address, and there's no good free way to get one programmatically"
β’ "When a domain degrades, it rotates out. No per-mailbox cost."
π¬ RESEARCH
"Large language models struggle to catch errors in their own outputs when the review happens in the same session that produced them. This paper introduces Cross-Context Review (CCR), a straightforward method where the review is conducted in a fresh session with no access to the production conversatio..."
π οΈ TOOLS
β¬οΈ 25 ups
β‘ Score: 7.0
"A few weeks ago I was working on a training run that produced garbage results.
No errors, no crashes, just a model that learned nothing. Three days later I found it. Label leakage between train and val. The model had been cheating the whole time.
So I built preflight. It's a CLI tool you run befo..."
π οΈ TOOLS
πΊ 2 pts
β‘ Score: 7.0
π οΈ TOOLS
β¬οΈ 22 ups
β‘ Score: 7.0
"
https://github.com/ggml-org/llama.cpp/releases/tag/b8338
Lots of work done by the Intel team, I'm looking forward to trying this out on the 255H with the Arc 140T iGPU..."
π― GPU performance β’ NPU support β’ Linux development
π¬ "Maybe my three intel GPUs will get some more to shine"
β’ "The latest lemonade from 3 days ago really adds support for NPUs"
π§ INFRASTRUCTURE
πΊ 1 pts
β‘ Score: 7.0
π οΈ TOOLS
β¬οΈ 225 ups
β‘ Score: 7.0
"If you train Graph Neural Networks on large datasets (like Papers100M), you already know the pain: trying to load the edge list and feature matrix usually results in an instant 24GB+ OOM allocation crash before the GPU even gets to do any work.
I just open-sourced **GraphZero v0.2**, a custom C++ d..."
π― GNN neighbor sampling β’ Edge-to-node pooling β’ Systems-first approach
π¬ "GraphZero pushes all the heavy, multi-threaded sampling down to C++ to guarantee true zero-copy execution before the data ever reaches PyTorch."
β’ "A custom CUDA kernel for that would be a huge throughput win for future version."
πΌ JOBS
πΊ 110 pts
β‘ Score: 7.0
π― AI-assisted development β’ Code quality and maintainability β’ Impact on programmer skills
π¬ "For larger projects that need to plugin to the legacy code base, which I'll need to maintain for years, I still prefer to do things myself"
β’ "Ultimately that's what this is all about- writing code is a big part of my career but the thing that has kept me employed is being able to figure out what to do when some code that I assembled is not behaving the way I had hoped"
π¬ RESEARCH
πΊ 2 pts
β‘ Score: 6.9
π¬ RESEARCH
via Arxiv
π€ Samy Jelassi, Mujin Kwun, Rosie Zhao et al.
π
2026-03-12
β‘ Score: 6.8
"Cross-entropy (CE) training provides dense and scalable supervision for language models, but it optimizes next-token prediction under teacher forcing rather than sequence-level behavior under model rollouts. We introduce a feature-matching objective for language-model fine-tuning that targets sequen..."
π¬ RESEARCH
via Arxiv
π€ Yixin Liu, Yue Yu, DiJia Su et al.
π
2026-03-12
β‘ Score: 6.7
"Reasoning LLMs-as-Judges, which can benefit from inference-time scaling, provide a promising path for extending the success of reasoning models to non-verifiable domains where the output correctness/quality cannot be directly checked. However, while reasoning judges have shown better performance on..."
π SECURITY
πΊ 1 pts
β‘ Score: 6.7
π οΈ TOOLS
πΊ 1 pts
β‘ Score: 6.5
π DATA
πΊ 1 pts
β‘ Score: 6.5
π¬ RESEARCH
β¬οΈ 1 ups
β‘ Score: 6.5
"We're sharing ZeroProofML, a small framework for scientific ML problems where the target can be genuinely undefined or non-identifiable: poles, assay censoring boundaries, kinematic locks, etc. The underlying issue is division by zero. Not as a numerical bug, but as a semantic event that shows up wh..."
π§ INFRASTRUCTURE
πΊ 1 pts
β‘ Score: 6.5
π¬ RESEARCH
πΊ 2 pts
β‘ Score: 6.5
π οΈ SHOW HN
πΊ 2 pts
β‘ Score: 6.5
π’ BUSINESS
πΊ 1 pts
β‘ Score: 6.5
π οΈ TOOLS
β¬οΈ 131 ups
β‘ Score: 6.4
"There's been a lot of discussion about using AI for writing papers and documents. But most tools either require you to upload everything to the cloud, or force you to deal with clunky local setups that have zero quality-of-life features.
I've been a researcher writing papers for years. My setup was..."
π― Local-first AI model β’ Academic writing tools β’ Trademark concerns
π¬ "The local-first angle is actually the key differentiator here."
β’ "This approach sounds interesting, though it's somewhat similar to what I've been using in my workflow."
π¬ RESEARCH
via Arxiv
π€ Yuetian Du, Yucheng Wang, Rongyu Zhang et al.
π
2026-03-12
β‘ Score: 6.3
"Recent advances in Multi-modal Large Language Models (MLLMs) have predominantly focused on enhancing visual perception to improve accuracy. However, a critical question remains unexplored: Do models know when they do not know? Through a probing experiment, we reveal a severe confidence miscalibratio..."
π¬ RESEARCH
via Arxiv
π€ Yulu Gan, Phillip Isola
π
2026-03-12
β‘ Score: 6.3
"Pretraining produces a learned parameter vector that is typically treated as a starting point for further iterative adaptation. In this work, we instead view the outcome of pretraining as a distribution over parameter vectors, whose support already contains task-specific experts. We show that in sma..."
π€ AI MODELS
β¬οΈ 22 ups
β‘ Score: 6.3
"Hey folks, I ran a series of benchmarks comparing `ik_llama.cpp` against the official `llama.cpp` across multiple Qwen3 and Qwen3.5 variants (including MoE architectures). The results showed some interesting performance flips depending on the model architecture and backend provider.
**Hardware:**
..."
π― Performance Optimization β’ Model Comparison β’ Quantization Techniques
π¬ "when using ik, make sure to add `--merge-qkv -muge` for fused ops"
β’ "if you have 2 or more GPUs make sure to use `-sm layer` for tensor parallel support"
π EDUCATION
πΊ 289 pts
β‘ Score: 6.2
π― Interactive Learning Resources β’ Visualization in ML Explanations β’ Recommendation of Visual Learning Resources
π¬ "I am thinking of creating a bookmark manager that uses my criteria above and runs across every damn blog link ever posted on HN to categorize them as S-TIER, A-TIER, opinion and so on"
β’ "Stunningly good also in the sense that it advances the story so people don't just drool at the pretty animation and stop engaging."
π SECURITY
β¬οΈ 3 ups
β‘ Score: 6.2
"External link discussion - see full content at original source."
π οΈ TOOLS
πΊ 1 pts
β‘ Score: 6.2
π οΈ SHOW HN
πΊ 1 pts
β‘ Score: 6.2
π οΈ SHOW HN
πΊ 1 pts
β‘ Score: 6.1