📚 HISTORICAL ARCHIVE - June 28, 2026

What was happening in AI on 2026-06-28

Archive from: 2026-06-28 | Preserved for posterity ⚡

Stories from June 28, 2026

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

📰 NEWS

A way to exclude sensitive files issue still open for OpenAI Codex

via HackerNews 👤 pikseladam 📅 2026-06-28

🔺 165 pts ⚡ Score: 8.4

💬 HackerNews Buzz: 110 comments 🐝 BUZZING

📰 NEWS

An analysis of US payroll data across 730+ occupations: employment among workers ages 22 to 25 in highly AI-exposed jobs is now shrinking by 3.8% per year

via Techmeme 👤 Techmeme 📅 2026-06-28

⚡ Score: 8.3

📰 NEWS

GPT-5.6 Sol/Terra security capabilities and release approval

2x SOURCES 🌐 📅 2026-06-27

⚡ Score: 8.2

+++ OpenAI's system card suggests the Sol and Terra versions posed acceptable risks: they could identify vulnerabilities but could not execute autonomous, end-to-end attacks, clearing the path for deployment without further delays. +++

GPT-5.6 system card indicates Sol is well below the level of most worrisome Mythos use cases, suggesting all GPT-5.6 versions could be released without delay

via Techmeme 👤 Techmeme 📅 2026-06-28

⚡ Score: 8.2

📰 NEWS

Ford rehires experienced engineers after AI implementation issues

2x SOURCES 🌐 📅 2026-06-28

⚡ Score: 8.1

+++ Turns out automating engineering judgment requires actual judgment. Ford's pivot back to experienced humans suggests some tasks still need wetware that can navigate ambiguity, context, and the occasional "wait, that won't work" moment that the AI missed. +++

Ford rehires 'gray beard' engineers after AI falls short

via HackerNews 👤 rbanffy 📅 2026-06-28

🔺 121 pts ⚡ Score: 8.7

📰 NEWS

GLM-5.2 security vulnerability detection capabilities

2x SOURCES 🌐 📅 2026-06-28

⚡ Score: 8.0

+++ Chinese model GLM 5.2 demonstrates competitive vulnerability detection capabilities, prompting timely questions about export control philosophy when open weights do the heavy lifting. +++

GLM 5.2 beats Claude in our benchmarks

via HackerNews 👤 jms703 📅 2026-06-28

🔺 121 pts ⚡ Score: 8.2

💬 HackerNews Buzz: 37 comments 😐 MID OR MIXED

📰 NEWS

Google restricts Meta's access to Gemini AI

2x SOURCES 🌐 📅 2026-06-28

⚡ Score: 7.6

+++ Google couldn't fulfill Meta's AI compute ambitions in March, forcing the social media giant to shelve some internal projects. Turns out even tech titans can't always get what they want from each other. +++

Google limits Meta's use of its Gemini AI models

via HackerNews 👤 root-parent 📅 2026-06-28

🔺 129 pts ⚡ Score: 8.1

💬 HackerNews Buzz: 62 comments 👍 LOWKEY SLAPS

🔬 RESEARCH

Reinforcement Learning without Ground-Truth Solutions can Improve LLMs

via Arxiv 👤 Yingyu Lin, Qiyue Gao, Nikki Lijing Kuang et al. 📅 2026-06-25

⚡ Score: 7.4

"Reinforcement learning with verifiable rewards (RLVR) for training LLMs typically rely on ground-truth answers to assign rewards, limiting their applicability to tasks where the ground-truth solution is unknown. We introduce a Ranking-induced VERifiable framework (RiVER) t..."

🔬 RESEARCH

When Does Combining Language Models Help? A Co-Failure Ceiling on Routing, Voting, and Mixture-of-Agents Across 67 Frontier Models

via Arxiv 👤 Josef Chen 📅 2026-06-25

⚡ Score: 7.3

"Multi-model LLM systems such as routing, voting, cascades, fusion, and mixture-of-agents are used to beat single-model accuracy. We show that their gain is capped by a quantity the field rarely reports. For any policy whose output is one member model answer, accuracy cannot exceed one minus beta, wh..."

🔬 RESEARCH

Beyond Surface Forms: A Comprehensive, Mechanism-Oriented Taxonomy of Indirect Linguistic Encoding for LLM-Based Coded Language Detection

via Arxiv 👤 Hamid Reza Firoozfar, Mohammadsadegh Abolhasani, Reza Mousavi et al. 📅 2026-06-25

⚡ Score: 7.2

"To avoid moderation and surveillance on social media, some users routinely invent indirect linguistic expressions (ILE) that camouflage sensitive meanings. Such expressions surface as algospeak, euphemisms, and adversarial obfuscation, depending on intent and context, and they involve recurring enco..."

📰 NEWS

Clean GitHub repo tricks AI coding agents into running malware

via HackerNews 👤 logickkk1 📅 2026-06-27

🔺 4 pts ⚡ Score: 7.1

📰 NEWS

Reflections on software engineering in the age of AI

via HackerNews 👤 diamondap 📅 2026-06-28

🔺 75 pts ⚡ Score: 7.0

💬 HackerNews Buzz: 57 comments 😐 MID OR MIXED

📰 NEWS

I used Claude Code to get a second opinion on my MRI

via HackerNews 👤 engmarketer 📅 2026-06-28

🔺 240 pts ⚡ Score: 7.0

💬 HackerNews Buzz: 349 comments 👍 LOWKEY SLAPS

🛠️ SHOW HN

Show HN: Caliper – pass@k reliability testing for Claude Code and Codex skills

via HackerNews 👤 edonadei 📅 2026-06-28

🔺 2 pts ⚡ Score: 6.9

🔬 RESEARCH

Prompt Injection in Automated Résumé Screening with Large Language Models: Single and Multi-Injection Settings

via Arxiv 👤 Preet Baxi, Jiannan Xu, Jane Yi Jiang et al. 📅 2026-06-25

⚡ Score: 6.9

"Large language models (LLMs) are increasingly used to screen and rank job applicants, creating incentives for candidates to strategically manipulate algorithmic hiring systems. We study prompt injection in automated résumé screening, defined as subtle self-promotional text that introduces no new qua..."

📰 NEWS

How Claude Code and Codex Sandbox Untrusted Code

via HackerNews 👤 syumei 📅 2026-06-27

🔺 2 pts ⚡ Score: 6.8

🔬 RESEARCH

CARVE: Content-Aware Recurrent with Value Efficiency for Chunk-Parallel Linear Attention

via Arxiv 👤 Sayak Dutta 📅 2026-06-25

⚡ Score: 6.7

"Recurrent models must forget in order to remember, yet the state of the art decides what to erase without consulting what is stored -- the gate sees only the arriving token, not the memory it is about to modify. This memory-blind gating is one of three coupled defects in the leading delta-rule archi..."

🛠️ SHOW HN

Show HN: Autonomous CAD design and OpenFOAM optimization loop using local LLMs

via HackerNews 👤 ostenjap 📅 2026-06-27

🔺 1 pt ⚡ Score: 6.5

📰 NEWS

Cerberus – a local firewall for AI agents' tool calls

via HackerNews 👤 cerberussec 📅 2026-06-28

🔺 3 pts ⚡ Score: 6.5

🔬 RESEARCH

Empowering GUI Agents via Autonomous Experience Exploration and Hindsight Experience Utilization for Task Planning

via Arxiv 👤 Tianyi Men, Zhuoran Jin, Pengfei Cao et al. 📅 2026-06-25

⚡ Score: 6.5

"Multimodal web agents can assist humans in operating repetitive GUI tasks, where effective task planning is essential for decomposing complex tasks into executable actions. While small open source MLLMs are cost efficient and privacy preserving compared with commercial large models, they suffer from..."

🔬 RESEARCH

Hallucination in World Models is Predictable and Preventable

via Arxiv 👤 Nicklas Hansen, Xiaolong Wang 📅 2026-06-25

⚡ Score: 6.4

"Modern generative world models render increasingly realistic action-controllable futures, yet they frequently hallucinate: rollouts remain visually fluent while drifting from the ground-truth dynamics. We hypothesize that hallucination concentrates in low-coverage regions of the state-action space,..."

🔬 RESEARCH

Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy

via Arxiv 👤 Junhao Shi, Zezheng Huai, Siyin Wang et al. 📅 2026-06-25

⚡ Score: 6.3

"Building persistent embodied agents in unstructured environments demands unified orchestration of heterogeneous tools spanning both cyber (APIs, IoT) and physical (manipulation, navigation) domains, coupled with autonomous recovery from physical failures that inevitably arise over extended operation..."

🔬 RESEARCH

Beyond the Hard Budget: Sparsity Regularizers for More Interpretable Top-k Sparse Autoencoders

via Arxiv 👤 Nathanaël Jacquier, Maria Vakalopoulou, Mahdi S. Hosseini 📅 2026-06-25

⚡ Score: 6.3

"Sparse autoencoders (SAEs) have become a leading tool for interpreting the representations of vision foundation models, decomposing their polysemantic activations into a larger set of sparse, more monosemantic features. The Top-k SAE, a now-standard variant, enforces sparsity architecturally throu..."

💰 FUNDING

Sources: Baidu's chip unit Kunlunxin Technology plans a Hong Kong IPO at a $50B target valuation, asking investors to buy chips worth 3-7x their IPO investment

via Techmeme 👤 Techmeme 📅 2026-06-28

⚡ Score: 6.2

📰 NEWS

Open handoff: Thought Tree, a markup/spec idea for modular LLM workflows

via HackerNews 👤 xavier1764 📅 2026-06-27

🔺 1 pt ⚡ Score: 6.1

🔬 RESEARCH

Ask, Don't Judge: Binary Questions for Interpretable LLM Evaluation and Self-Improvement

via Arxiv 👤 Sangwoo Cho, Kushal Chawla, Pengshan Cai et al. 📅 2026-06-25

⚡ Score: 6.1

"Evaluating LLM outputs remains a major bottleneck in NLP: human evaluation is expensive and slow, lexical metrics correlate poorly with human judgments on open-ended generation, and holistic LLM judges often produce opaque scores that are hard to debug. We propose BINEVAL, a framework that decompose..."

🛠️ SHOW HN

Show HN: Drift, write LLM agents in English and transpile to async Python

via HackerNews 👤 rileyq12 📅 2026-06-28

🔺 2 pts ⚡ Score: 6.1

Stories from June 28, 2026

GPT-5.6 Sol/Terra security capabilities and release approval

OpenAI says GPT-5.6 Sol and Terra were capable of identifying vulnerabilities but were unable to execute autonomous, end-to-end attacks against hardened targets

Ford rehires experienced engineers after AI implementation issues

Ford hired AI and sacked humans. It backfired badly

GLM-5.2 security vulnerability detection capabilities

Researchers say Z.ai's GLM-5.2 matches latest US models at finding security bugs, as critics question the US' lax approach in restricting Chinese open models

Google restricts Meta's access to Gemini AI

Sources: Google told Meta around March it couldn't offer all the Gemini capacity Meta wanted to buy, disrupting and delaying some of Meta's internal AI projects

📡 AI NEWS BUT ACTUALLY GOOD