📚 HISTORICAL ARCHIVE - April 22, 2026

                What was happening in AI on 2026-04-22
            

← Apr 21 📊 TODAY'S NEWS 📚 ARCHIVE 🗓️ April 2026 Apr 23 →

                📰 DAILY AI BRIEF
            

On April 22, 2026, Metamesh tracked 70 AI stories, including 6 clustered developments, and ranked them by signal rather than volume. The lead item was I built a /graphify skill for Claude Code that maps your entire codebase into a knowledge graph, 71x fewer tokens.... Also high in the stack: Introducing ChatGPT Images 2.0 and Our eighth generation TPUs: two chips for the agentic era. That combination is why this archive exists: it preserves the day's shape for AI practitioners, not just the last headline that crossed the wire.

The daily ticker's read: WELCOME TO METAMESH.BIZ +++ Google drops TPU 8t and 8i for the "agentic era" because apparently seven generations wasn't enough silicon +++ Opus 4.7 was burning 80% of its context window on nothing (Claude Code v2.1.117 emergency patch incoming) +++ OpenAI.... Read against the ranked story list below, it gives the archive a point of view: what mattered, what was mostly noise, and which threads were worth saving for later comparison.

📊 You are visitor #47291 to this AWESOME site! 📊
Archive from: 2026-04-22 | Preserved for posterity ⚡

Stories from April 22, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

📰 NEWS

I built a /graphify skill for Claude Code that maps your entire codebase into a knowledge graph, 71x fewer tokens, way less hallucination (32k stars, 250k downloads)

via r/claudeai 👤 u/captainkink07 📅 2026-04-21

⬆️ 357 ups ⚡ Score: 9.0

"Every time I joined a new codebase I’d spend the first week asking Claude to “explain how X works”, watching it hallucinate, then reading 40 files to correct it. The problem isn’t the LLM — it’s that raw files are an awful context format. So I built graphify. Install it once in Claude Code and it b..."

💬 Reddit Discussion: 44 comments 👍 LOWKEY SLAPS

📰 NEWS

ChatGPT Images 2.0 Launch

6x SOURCES 🌐 📅 2026-04-21

⚡ Score: 8.7

+++ ChatGPT Images 2.0 arrives with dual variants offering web search integration and up to 2K resolution, because apparently static prompts weren't iterative enough for the image generation crowd. +++

Introducing ChatGPT Images 2.0

via r/OpenAI 👤 u/py-net 📅 2026-04-21

⬆️ 582 ups ⚡ Score: 8.7

"Official OpenAI announcement or research publication."

💬 Reddit Discussion: 169 comments 🐝 BUZZING

📰 NEWS

Google TPU 8 Announcement

2x SOURCES 🌐 📅 2026-04-22

⚡ Score: 8.6

+++ Google splits its eighth generation TPUs into training and inference variants, because apparently one chip doing both things efficiently remains science fiction. Availability later this year, which in tech time means "when it's ready." +++

Our eighth generation TPUs: two chips for the agentic era

via HackerNews 👤 xnx 📅 2026-04-22

🔺 349 pts ⚡ Score: 8.8

💬 HackerNews Buzz: 173 comments 🐝 BUZZING

📰 NEWS

Claude Code was wasting 80% of Opus 4.7's context window. Upgrade to v2.1.117 now.

via r/claudeai 👤 u/oh-keh 📅 2026-04-22

⬆️ 303 ups ⚡ Score: 8.6

"Morning Everyone! All pretty standard changes - except a **huge** bug was fixed for Opus 4.7 which hopefully should result in some pretty big improvements. I normally just link the full notes but I think this one note I have to include: `Opus 4.7's 1M context window was being wasted. Since Opus..."

💬 Reddit Discussion: 54 comments 👍 LOWKEY SLAPS

📰 NEWS

OpenAI Workspace Agents

4x SOURCES 🌐 📅 2026-04-22

⚡ Score: 8.5

+++ Teams can now deploy custom ChatGPT bots that handle tasks autonomously, which OpenAI carefully frames as "an evolution" rather than admitting GPTs needed actual agency all along. +++

Workspace Agents in ChatGPT

via HackerNews 👤 mfiguiere 📅 2026-04-22

🔺 63 pts ⚡ Score: 8.8

💬 HackerNews Buzz: 22 comments 🐝 BUZZING

🔬 RESEARCH

Latent Phase-Shift Rollback: Inference-Time Error Correction via Residual Stream Monitoring and KV-Cache Steering

via Arxiv 👤 Manan Gupta, Dhruv Kumar 📅 2026-04-20

⚡ Score: 8.0

"Large language models frequently commit unrecoverable reasoning errors mid-generation: once a wrong step is taken, subsequent tokens compound the mistake rather than correct it. We introduce $\textbf{Latent Phase-Shift Rollback}$ (LPSR): at each generation step, we monitor the residual stream at a c..."

📰 NEWS

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

via HackerNews 👤 mfiguiere 📅 2026-04-22

🔺 539 pts ⚡ Score: 8.0

💬 HackerNews Buzz: 263 comments 🐝 BUZZING

🔬 RESEARCH

Different Paths to Harmful Compliance: Behavioral Side Effects and Mechanistic Divergence Across LLM Jailbreaks

via Arxiv 👤 Md Rysul Kabir, Zoran Tiganj 📅 2026-04-20

⚡ Score: 7.8

"Open-weight language models can be rendered unsafe through several distinct interventions, but the resulting models may differ substantially in capabilities, behavioral profile, and internal failure mode. We study behavioral and mechanistic properties of jailbroken models across three unsafe routes:..."

📰 NEWS

An interview with Sam Altman and Greg Brockman on OpenAI's restructuring, cutting Sora, “personal AGI”, Anthropic's “fear-based marketing” for Mythos, and more

via Techmeme 👤 Corememory 📅 2026-04-21

⚡ Score: 7.8

📰 NEWS

Meta Employee Tracking for AI

3x SOURCES 🌐 📅 2026-04-21

⚡ Score: 7.7

+++ Meta is instrumenting employee workstations to capture interaction patterns for AI model training, transforming the distinction between "work tool" and "data collection apparatus" into something genuinely ambiguous. +++

Meta capturing employee mouse movements, keystrokes for AI training data

via HackerNews 👤 dlx 📅 2026-04-21

🔺 141 pts ⚡ Score: 7.2

💬 HackerNews Buzz: 87 comments 🐝 BUZZING

📰 NEWS

Google announces the Gemini Enterprise Agent Platform, a revamped developer tool built on Vertex AI that manages the full lifecycle of AI agent fleets

via Techmeme 👤 Zdnet 📅 2026-04-22

⚡ Score: 7.5

📰 NEWS

We open-sourced Chaperone-Thinking-LQ-1.0 — a 4-bit GPTQ + QLoRA fine-tuned DeepSeek-R1-32B that hits 84% on MedQA in ~20GB[N]

via r/MachineLearning 👤 u/AltruisticCouple3491 📅 2026-04-21

⬆️ 15 ups ⚡ Score: 7.4

"Hey everyone, We just open-sourced our reasoning model, Chaperone-Thinking-LQ-1.0, on Hugging Face. It's built on DeepSeek-R1-Distill-Qwen-32B but goes well beyond a simple quantization — here's what we actually did: The pipeline: 1. 4-bit GPTQ quantization — compressed the model from \~60GB down..."

📰 NEWS

We ran 52 controlled benchmarks on Claude Code. Agent Teams cost 73-124% more than sequential with zero quality gain.

via r/claudeai 👤 u/UpGPT 📅 2026-04-22

⬆️ 52 ups ⚡ Score: 7.4

"Three weeks of controlled experiments on a real production Next.js/TypeScript/Supabase codebase, Sonnet 4.6 worker, Opus 4.7 grader. Full data public, tool is MIT. A few findings that overturned the assumptions I started with: \- \*\*CONTRACT.md before code cut cost 54% and raised quality from 5/1..."

💬 Reddit Discussion: 24 comments 🐝 BUZZING

🔬 RESEARCH

Adversarial Humanities Benchmark: Results on Stylistic Robustness in Frontier Model Safety

via Arxiv 👤 Marcello Galisai, Susanna Cifani, Francesco Giarrusso et al. 📅 2026-04-20

⚡ Score: 7.3

"The Adversarial Humanities Benchmark (AHB) evaluates whether model safety refusals survive a shift away from familiar harmful prompt forms. Starting from harmful tasks drawn from MLCommons AILuminate, the benchmark rewrites the same objectives through humanities-style transformations while preservin..."

🛠️ SHOW HN

Show HN: We benchmarked 18 LLMs on OCR (7K+ calls) – cheaper models win

via HackerNews 👤 TimoKerr 📅 2026-04-22

🔺 5 pts ⚡ Score: 7.3

📰 NEWS

Recent Open models from last 6 Months - Nov 2025 - Apr 2026

via r/LocalLLaMA 👤 u/pmttyji 📅 2026-04-22

⬆️ 124 ups ⚡ Score: 7.3

"I created this chart with recent open models from last 6 months. Few might be older than that possibly. Included only latest versions(Ex: Only Kimi-K2.6, no Kimi-K2.5 & Kimi-K2. Also only GLM-5.1 & GLM-4.7, no GLM-4.6 & GLM-4.5). I couldn't add some models like Ling-2.5-1T, Ring-2.5-1T,..."

💬 Reddit Discussion: 28 comments 🐝 BUZZING

📰 NEWS

Claude Code Removed from Pro Plan

3x SOURCES 🌐 📅 2026-04-21

⚡ Score: 7.2

+++ Claude Code exits the Pro plan feature list, leaving subscribers to wonder if this was a stealth downgrade or just honest pricing realignment for a capability that apparently couldn't justify premium positioning. +++

PSA: Claude Pro no longer lists Claude Code as an included feature

via r/claudeai 👤 u/randomswifter 📅 2026-04-21

⬆️ 2758 ups ⚡ Score: 7.2

"Just noticed while checking the pricing page. Claude Code is no longer listed as a feature of the Pro plan. Source: https://claude.com/pricing Did I miss an announcement? EDIT: the support article at [https://support.claude.com/en/articles/11145838-using-claude-code-..."

💬 Reddit Discussion: 722 comments 👍 LOWKEY SLAPS

📰 NEWS

Qwen3.6-35B becomes competitive with cloud models when paired with the right agent

via r/LocalLLaMA 👤 u/Creative-Regular6799 📅 2026-04-22

⬆️ 506 ups ⚡ Score: 7.1

"A short follow-up to my previous post, where I showed that changing the scaffold around the same 9B Qwen model moved benchmark performance from 19.11% to 45.56%: https://www.reddit.com/r/LocalLLaMA/s/JMHuAGj1LV After feedback from people here, I ..."

💬 Reddit Discussion: 131 comments 🐝 BUZZING

🔬 RESEARCH

When Can LLMs Learn to Reason with Weak Supervision?

via Arxiv 👤 Salman Rahman, Jingyan Shen, Anna Mordvina et al. 📅 2026-04-20

⚡ Score: 7.0

"Large language models have achieved significant reasoning improvements through reinforcement learning with verifiable rewards (RLVR). Yet as model capabilities grow, constructing high-quality reward signals becomes increasingly difficult, making it essential to understand when RLVR can succeed under..."

🔬 RESEARCH

MASS-RAG: Multi-Agent Synthesis Retrieval-Augmented Generation

via Arxiv 👤 Xingchen Xiao, Heyan Huang, Runheng Liu et al. 📅 2026-04-20

⚡ Score: 7.0

"Large language models (LLMs) are widely used in retrieval-augmented generation (RAG) to incorporate external knowledge at inference time. However, when retrieved contexts are noisy, incomplete, or heterogeneous, a single generation process often struggles to reconcile evidence effectively. We propos..."

📰 NEWS

Gemma 4 is not your standard transformer

via HackerNews 👤 smaddrellmander 📅 2026-04-22

🔺 2 pts ⚡ Score: 7.0

🔬 RESEARCH

An AI Agent Execution Environment to Safeguard User Data

via Arxiv 👤 Robert Stanley, Avi Verma, Lillian Tsai et al. 📅 2026-04-21

⚡ Score: 7.0

"AI agents promise to serve as general-purpose personal assistants for their users, which requires them to have access to private user data (e.g., personal and financial information). This poses a serious risk to security and privacy. Adversaries may attack the AI model (e.g., via prompt injection) t..."

📰 NEWS

mm – Unix tools (find/cat/grep) rebuilt for the multimodal era

via r/computervision 👤 u/fuzzysingularity 📅 2026-04-22

⬆️ 14 ups ⚡ Score: 6.9

"Excited to share one of our weekend builds that turned into something we now use daily with our coding agents. mm – fast, multimodal context for agents. Coding agents read text fine, but the moment a directory has images, videos, or PDFs with rich visual content, they fail at extracting meaningful..."

🔬 RESEARCH

A multimodal and temporal foundation model for virtual patient representations at healthcare system scale

via Arxiv 👤 Andrew Zhang, Tong Ding, Sophia J. Wagner et al. 📅 2026-04-20

⚡ Score: 6.9

"Modern medicine generates vast multimodal data across siloed systems, yet no existing model integrates the full breadth and temporal depth of the clinical record into a unified patient representation. We introduce Apollo, a multimodal temporal foundation model trained and evaluated on over three dec..."

📰 NEWS

Dark Factories: Retooling for LLM Velocity

via HackerNews 👤 sitapati 📅 2026-04-21

🔺 2 pts ⚡ Score: 6.9

🔬 RESEARCH

Back into Plato's Cave: Examining Cross-modal Representational Convergence at Scale

via Arxiv 👤 A. Sophia Koepke, Daniil Zverev, Shiry Ginosar et al. 📅 2026-04-20

⚡ Score: 6.9

"The Platonic Representation Hypothesis suggests that neural networks trained on different modalities (e.g., text and images) align and eventually converge toward the same representation of reality. If true, this has significant implications for whether modality choice matters at all. We show that th..."

🔬 RESEARCH

Document-as-Image Representations Fall Short for Scientific Retrieval

via Arxiv 👤 Ghazal Khalighinejad, Raghuveer Thirukovalluru, Alexander H. Oh et al. 📅 2026-04-20

⚡ Score: 6.8

"Many recent document embedding models are trained on document-as-image representations, embedding rendered pages as images rather than the underlying source. Meanwhile, existing benchmarks for scientific document retrieval, such as ArXivQA and ViDoRe, treat documents as images of pages, implicitly f..."

🔬 RESEARCH

SafetyALFRED: Evaluating Safety-Conscious Planning of Multimodal Large Language Models

via Arxiv 👤 Josue Torres-Fonseca, Naihao Deng, Yinpei Dai et al. 📅 2026-04-21

⚡ Score: 6.8

"Multimodal Large Language Models are increasingly adopted as autonomous agents in interactive environments, yet their ability to proactively address safety hazards remains insufficient. We introduce SafetyALFRED, built upon the embodied agent benchmark ALFRED, augmented with six categories of real-w..."

🔬 RESEARCH

VLA Foundry: A Unified Framework for Training Vision-Language-Action Models

via Arxiv 👤 Jean Mercat, Sedrick Keh, Kushal Arora et al. 📅 2026-04-21

⚡ Score: 6.8

"We present VLA Foundry, an open-source framework that unifies LLM, VLM, and VLA training in a single codebase. Most open-source VLA efforts specialize on the action training stage, often stitching together incompatible pretraining pipelines. VLA Foundry instead provides a shared training stack with..."

📰 NEWS

Symbiont – Typestate-enforced policy gates for AI agents (Rust)

via HackerNews 👤 smugglereal 📅 2026-04-22

🔺 1 pts ⚡ Score: 6.8

🔬 RESEARCH

LLM Safety From Within: Detecting Harmful Content with Internal Representations

via Arxiv 👤 Difan Jiao, Yilun Liu, Ye Yuan et al. 📅 2026-04-20

⚡ Score: 6.8

"Guard models are widely used to detect harmful content in user prompts and LLM responses. However, state-of-the-art guard models rely solely on terminal-layer representations and overlook the rich safety-relevant features distributed across internal layers. We present SIREN, a lightweight guard mode..."

🔬 RESEARCH

Pause or Fabricate? Training Language Models for Grounded Reasoning

via Arxiv 👤 Yiwen Qiu, Linjuan Wu, Yizhou Liu et al. 📅 2026-04-21

⚡ Score: 6.7

"Large language models have achieved remarkable progress on complex reasoning tasks. However, they often implicitly fabricate information when inputs are incomplete, producing confident but unreliable conclusions -- a failure mode we term ungrounded reasoning. We argue that this issue arises not from..."

🔬 RESEARCH

Micro Language Models Enable Instant Responses

via Arxiv 👤 Wen Cheng, Tuochao Chen, Karim Helwani et al. 📅 2026-04-21

⚡ Score: 6.7

"Edge devices such as smartwatches and smart glasses cannot continuously run even the smallest 100M-1B parameter language models due to power and compute constraints, yet cloud inference introduces multi-second latencies that break the illusion of a responsive assistant. We introduce micro language m..."

🛠️ SHOW HN

Show HN: Daemons – we pivoted from building agents to cleaning up after them

via HackerNews 👤 rileyt 📅 2026-04-21

🔺 44 pts ⚡ Score: 6.7

💬 HackerNews Buzz: 26 comments 🐝 BUZZING

🔬 RESEARCH

FUSE: Ensembling Verifiers with Zero Labeled Data

via Arxiv 👤 Joonhyuk Lee, Virginia Ma, Sarah Zhao et al. 📅 2026-04-20

⚡ Score: 6.7

"Verification of model outputs is rapidly emerging as a key primitive for both training and real-world deployment of large language models (LLMs). In practice, this often involves using imperfect LLM judges and reward models since ground truth acquisition can be time-consuming and expensive. We intro..."

📰 NEWS

I tested 9 local models on the same flight sim prompt, all Q8, different Q providers, MLX

via r/LocalLLaMA 👤 u/StudentDifficult8240 📅 2026-04-21

⬆️ 24 ups ⚡ Score: 6.7

"**I gave 9 local models the same flight combat sim prompt. The results broke a few of my assumptions about quant providers and parameter count.** *All 8-bit MLX, M3 Max 128GB, served via omlx, prompted through Claude Code. Same prompt every time — single-file HTML, three selectable planes (jet, pro..."

💬 Reddit Discussion: 9 comments 🐐 GOATED ENERGY

🔬 RESEARCH

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

via Arxiv 👤 Jinghui Lu, Jiayi Guan, Zhijian Huang et al. 📅 2026-04-20

⚡ Score: 6.7

"Chain-of-Thought (CoT) reasoning has become a powerful driver of trajectory prediction in VLA-based autonomous driving, yet its autoregressive nature imposes a latency cost that is prohibitive for real-time deployment. Latent CoT methods attempt to close this gap by compressing reasoning into contin..."

📰 NEWS

Llama.cpp's auto fit works much better than I expected

via r/LocalLLaMA 👤 u/a9udn9u 📅 2026-04-21

⬆️ 133 ups ⚡ Score: 6.6

"I always thought with 32GB of VRAM, the biggest models I could run were around 20GB, like Qwen3.5 27B Q4 or Q6. I had an impression that everything had to fit in VRAM or I'd get 2 t/s. Man was I wrong. I just tested Qwen3.6 Q8 with 256k context on llama.cpp, with \`--fit\` on, the weights alone are..."

💬 Reddit Discussion: 54 comments 🐝 BUZZING

📰 NEWS

PSA: Anthropic bans organizations without warning

via r/claudeai 👤 u/ur_frnd_the_footnote 📅 2026-04-22

⬆️ 934 ups ⚡ Score: 6.6

"I work at at an agricultural technology company. On Monday, everyone in our org woke up to emails saying that their Claude accounts had been suspended (\~110 users). At first -- since the email was to me, with a link to a Google Form if I personally wanted to appeal -- I thought it must be an indiv..."

💬 Reddit Discussion: 145 comments 👍 LOWKEY SLAPS

🔬 RESEARCH

HardNet++: Nonlinear Constraint Enforcement in Neural Networks

via Arxiv 👤 Andrea Goertzen, Kaveh Alim, Navid Azizan 📅 2026-04-21

⚡ Score: 6.6

"Enforcing constraint satisfaction in neural network outputs is critical for safety, reliability, and physical fidelity in many control and decision-making applications. While soft-constrained methods penalize constraint violations during training, they do not guarantee constraint adherence during in..."

📰 NEWS

OpenAI releases Privacy Filter, an open-weight model for masking personally identifiable information in text, with 1.5B total and 50M active parameters

via Techmeme 👤 Openai 📅 2026-04-22

⚡ Score: 6.6

📰 NEWS

Google says 75% of new code created inside the company is now generated by AI and reviewed by human engineers, up from 50% last fall

via Techmeme 👤 Businessinsider 📅 2026-04-22

⚡ Score: 6.5

📰 NEWS

Zindex – Diagram Infrastructure for Agents

via HackerNews 👤 _ben_ 📅 2026-04-21

🔺 50 pts ⚡ Score: 6.5

💬 HackerNews Buzz: 17 comments 👍 LOWKEY SLAPS

🔬 RESEARCH

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

via Arxiv 👤 Alireza Dadgarnia, Soroush Tabesh, Mahdi Nikdan et al. 📅 2026-04-20

⚡ Score: 6.5

"Weight quantization has become a standard tool for efficient LLM deployment, especially for local inference, where models are now routinely served at 2-3 bits per parameter. The state of the art is currently split into two sets of methods: simple scalar quantization techniques, such as GPTQ or AWQ,..."

🔬 RESEARCH

Safety-Critical Contextual Control via Online Riemannian Optimization with World Models

via Arxiv 👤 Tongxin Li 📅 2026-04-21

⚡ Score: 6.5

"Modern world models are becoming too complex to admit explicit dynamical descriptions. We study safety-critical contextual control, where a Planner must optimize a task objective using only feasibility samples from a black-box Simulator, conditioned on a context signal $ξ_t$. We develop a sample-bas..."

🔬 RESEARCH

ConforNets: Latents-Based Conformational Control in OpenFold3

via Arxiv 👤 Minji Lee, Colin Kalicki, Minkyu Jeon et al. 📅 2026-04-20

⚡ Score: 6.4

"Models from the AlphaFold (AF) family reliably predict one dominant conformation for most well-ordered proteins but struggle to capture biologically relevant alternate states. Several efforts have focused on eliciting greater conformational variability through ad hoc inference-time perturbations of..."

📰 NEWS

A Comparison of Agentic AI Systems and Human Economists

via HackerNews 👤 paulpauper 📅 2026-04-21

🔺 1 pts ⚡ Score: 6.4

📰 NEWS

OpenAI Privacy Filter Model

via r/LocalLLaMA 👤 u/ai_hedge_fund 📅 2026-04-22

⬆️ 29 ups ⚡ Score: 6.4

"Just saw this posted by Bloomberg in a different sub: https://huggingface.co/openai/privacy-filter Open weights, Apache 2.0, etc I like the contribution to the space between local models for protecting privacy and some level of quality conferred by ..."

💬 Reddit Discussion: 6 comments 🐐 GOATED ENERGY

📰 NEWS

Mozilla Firefox Mythos Vulnerability Fixes

2x SOURCES 🌐 📅 2026-04-21

⚡ Score: 6.3

+++ Mozilla patched 271 vulnerabilities using early access to Anthropic's Mythos, proving that AI code review works better when you're not competing with everyone else for the same tool. +++

Mozilla Used Anthropic's Mythos to Find and Fix 271 Bugs in Firefox

via HackerNews 👤 cpeterso 📅 2026-04-21

🔺 12 pts ⚡ Score: 6.2

💬 HackerNews Buzz: 2 comments 👍 LOWKEY SLAPS

📰 NEWS

AI Has No Moat

via HackerNews 👤 thoughtpeddler 📅 2026-04-22

🔺 3 pts ⚡ Score: 6.3

📰 NEWS

Odyssey-2 Max: Scaled World Simulation

via HackerNews 👤 olivercameron 📅 2026-04-21

🔺 1 pts ⚡ Score: 6.3

📰 NEWS

Ultimate List: Best Open Models for Coding, Chat, Vision, Audio & More

via r/LocalLLaMA 👤 u/techlatest_net 📅 2026-04-22

⬆️ 186 ups ⚡ Score: 6.2

"Open-source AI is evolving insanely fast, but it’s hard to know which model is actually best for each use case. So I put together a list of the best open-source models across different categories Best Audio Generation Open Source Models # Text-to-Speech (TTS) * [Qwen3-TTS](https://github.com/Qwen..."

💬 Reddit Discussion: 42 comments 👍 LOWKEY SLAPS

🛠️ SHOW HN

Show HN: FieldOps-Bench an open eval for physical-world AI agents

via HackerNews 👤 Aeroi 📅 2026-04-21

🔺 1 pts ⚡ Score: 6.2

🛠️ SHOW HN

Scoring Show HN submissions for AI design patterns

via HackerNews 👤 hubraumhugo 📅 2026-04-22

🔺 245 pts ⚡ Score: 6.2

💬 HackerNews Buzz: 190 comments 🐝 BUZZING

🔬 RESEARCH

MathNet: a Global Multimodal Benchmark for Mathematical Reasoning and Retrieval

via Arxiv 👤 Shaden Alshammari, Kevin Wen, Abrar Zainal et al. 📅 2026-04-20

⚡ Score: 6.1

"Mathematical problem solving remains a challenging test of reasoning for large language and multimodal models, yet existing benchmarks are limited in size, language coverage, and task diversity. We introduce MathNet, a high-quality, large-scale, multimodal, and multilingual dataset of Olympiad-level..."

🔬 RESEARCH

FASTER: Value-Guided Sampling for Fast RL

via Arxiv 👤 Perry Dong, Alexander Swerdlow, Dorsa Sadigh et al. 📅 2026-04-21

⚡ Score: 6.1

"Some of the most performant reinforcement learning algorithms today can be prohibitively expensive as they use test-time scaling methods such as sampling multiple action candidates and selecting the best one. In this work, we propose FASTER, a method for getting the benefits of sampling-based test-t..."

Stories from April 22, 2026

ChatGPT Images 2.0 Launch

Google TPU 8 Announcement

OpenAI Workspace Agents

Meta Employee Tracking for AI

📡 AI NEWS BUT ACTUALLY GOOD

Claude Code Removed from Pro Plan

Mozilla Firefox Mythos Vulnerability Fixes