๐ HISTORICAL ARCHIVE - June 12, 2026
What was happening in AI on 2026-06-12
๐ You are visitor #47291 to this AWESOME site! ๐
Archive from: 2026-06-12 | Preserved for posterity โก
๐ Filter by Category
Loading filters...
๐ฐ NEWS
๐บ 143 pts
โก Score: 7.5
๐ฌ RESEARCH
via Arxiv
๐ค Elias Lumer, Sahil Sen, Kevin Paul et al.
๐
2026-06-11
โก Score: 7.3
"Recursive language models (RLMs) showed that recursion over model calls is an effective strategy for long-context reasoning, and production coding agents have begun to write code that spawns subagents at scale, most recently in Anthropic's dynamic workflows. We name and study the pattern between the..."
๐ฐ NEWS
๐บ 4 pts
โก Score: 7.3
๐ฌ RESEARCH
via Arxiv
๐ค Sanjay Adhikesaven, Haoxiang Sun, Sewon Min
๐
2026-06-10
โก Score: 7.3
"Modern LLM training pipelines increasingly rely on other models to generate data, filter corpora, judge outputs, and guide development decisions. These dependencies are recursive: a model may depend on an upstream artifact whose own dependencies are documented only in separate releases and artifacts..."
๐ฐ NEWS
๐บ 2 pts
โก Score: 7.3
๐ฌ RESEARCH
via Arxiv
๐ค Leon Bergen, Usha Bhalla, Sidharth Baskaran et al.
๐
2026-06-10
โก Score: 7.1
"Language-model post-training is the main stage at which model behavior is shaped, yet it still largely involves optimization of scalar rewards that summarize diverse desiderata. This abstraction gives practitioners little visibility into what their data actually teaches models, allowing spurious cor..."
๐ฐ NEWS
๐บ 2 pts
โก Score: 7.1
๐ฌ RESEARCH
via Arxiv
๐ค Jundong Xu, Qingchuan Li, Jiaying Wu et al.
๐
2026-06-11
โก Score: 7.1
"Large language model (LLM) agents have achieved strong performance on a wide range of benchmarks, yet most evaluations assume static environments. In contrast, real-world deployment is inherently dynamic, requiring agents to continually align their knowledge, skills, and behavior with changing envir..."
๐ฌ RESEARCH
via Arxiv
๐ค Cheng-Yu Yang, Shao-Yuan Lo, Yu-Lun Liu
๐
2026-06-10
โก Score: 7.0
"Vision-language models (VLMs) project images into hundreds to thousands of visual tokens, making decoder inference expensive in both attention computation and KV-cache memory. Existing visual-token reduction methods largely follow a rank-and-remove paradigm: they score visual tokens, keep a compact..."
๐ฐ NEWS
๐บ 5 pts
โก Score: 7.0
๐ฌ RESEARCH
via Arxiv
๐ค Amy Xin, Jiening Siow, Junjie Wang et al.
๐
2026-06-11
โก Score: 7.0
"LLM-based agents have shown increasing potential in automating scientific discovery. Given an optimizable metric and an execution environment, they can propose, validate, and iterate scientific solutions, and have produced results that outperform human-designed approaches. As model capabilities cont..."
๐ก AI NEWS BUT ACTUALLY GOOD
The revolution will not be televised, but Claude will email you once we hit the singularity.
Get the stories that matter in Today's AI Briefing.
Powered by Premium Technology Intelligence Algorithms โข Unsubscribe anytime
๐ฌ RESEARCH
via Arxiv
๐ค Noรฉmi รltetล, Nathaniel D. Daw, Kimberly L. Stachenfeld et al.
๐
2026-06-10
โก Score: 7.0
"Advancing scientific understanding through mechanistic modeling requires posing the right experimental questions to yield maximally informative data. To automate this pursuit within cognitive science, we introduce ATLAS (Active Theory Learning for Automated Science), an active learning framework for..."
๐ฐ NEWS
๐บ 3 pts
โก Score: 7.0
๐ฐ NEWS
๐บ 1 pts
โก Score: 7.0
๐ฌ RESEARCH
via Arxiv
๐ค Zhiyi Chen, Jie Song, Peng Li
๐
2026-06-10
โก Score: 7.0
"Large Language Models (LLMs) have democratized database access through Text-to-SQL, but moving from prototypes to production remains difficult. Real deployments must handle strict SQL dialects, massive schemas, and evolving user preferences, while supervised fine-tuning is costly and rigid and agent..."
๐ฐ NEWS
๐บ 2 pts
โก Score: 6.9
๐ฌ RESEARCH
via Arxiv
๐ค Xingjian Diao, Wenbo Li, Yashas Malur Saidutta et al.
๐
2026-06-10
โก Score: 6.9
"Long input sequences are central to document understanding and multi-step reasoning in Large Language Models, yet the quadratic cost of attention makes inference both memory-intensive and slow. Context distillation mitigates this by compressing contextual information into model parameters, and recen..."
๐ฌ RESEARCH
๐บ 1 pts
โก Score: 6.9
๐ ๏ธ SHOW HN
๐บ 1 pts
โก Score: 6.9
๐ฌ RESEARCH
via Arxiv
๐ค Minghao Luo, Liang Chen
๐
2026-06-11
โก Score: 6.9
"Search-augmented LLMs increasingly mediate everyday consumer recommendations by retrieving live web content. This creates a new risk: generative recommenders may consume polluted web content, such as fake reviews and promotional pages crafted to mislead recommendations. We ask: to what extent do sea..."
๐ฐ NEWS
๐บ 190 pts
โก Score: 6.8
๐ฌ RESEARCH
via Arxiv
๐ค Xiaoyuan Liu, Jianhong Tu, Yuqi Chen et al.
๐
2026-06-11
โก Score: 6.8
"Agent systems are advancing quickly across domains, but their evaluation remains fragmented. Most benchmarks rely on fixed, LLM-centric harnesses that require heavy integration, create test-production mismatch, and limit fair comparison across diverse agent designs. The root problem is the lack of a..."
๐ฌ RESEARCH
via Arxiv
๐ค Yaxin Du, Yifan Zhou, Yujie Ge et al.
๐
2026-06-11
โก Score: 6.8
"Tool-augmented LLM agents commonly rely on step-wise atomic tool calls, where each invocation, observation, and value transfer is exposed in the main reasoning trace. This creates an \emph{execution-granularity mismatch}: locally deterministic tool workflows are unfolded into repeated model-visible..."
๐ฌ RESEARCH
via Arxiv
๐ค Anamaria-Roberta Hartl, Levente Zรณlyomi, David Stap et al.
๐
2026-06-10
โก Score: 6.8
"Transformers dominate modern sequence modeling, but their quadratic attention incurs substantial computational cost. Subquadratic architectures offer a scalable alternative. However, it remains unclear which designs yield the most effective sequence models. We compare three leading approaches: xLSTM..."
๐ฌ RESEARCH
via Arxiv
๐ค Zongsheng Cao, Bihao Zhan, Jinxin Shi et al.
๐
2026-06-11
โก Score: 6.8
"Current LLM-based research agents have advanced through agent orchestration, yet largely overlook scientific knowledge orchestration. Existing works often reduce papers to abstracts, surface mentions, and flat \texttt{cites} edges, omitting key entities, claims, evidence, mechanisms, and method line..."
๐ ๏ธ SHOW HN
๐บ 1 pts
โก Score: 6.8
๐ฐ NEWS
๐บ 2 pts
โก Score: 6.8
๐ฌ RESEARCH
via Arxiv
๐ค Hongjian Zhou, Xinyu Zou, Jinge Wu et al.
๐
2026-06-10
โก Score: 6.8
"Large language models (LLMs) now reach expert-level scores on medical licensing exams, encouraging the assumption that high scores imply safe medical judgment while patients increasingly use them for health advice. We show this assumption is fragile: when misleading context is injected into question..."
๐ฐ NEWS
๐บ 431 pts
โก Score: 6.8
๐ฌ RESEARCH
via Arxiv
๐ค Zilin Xiao, Qi Ma, Chun-cheng Jason Chen et al.
๐
2026-06-11
โก Score: 6.7
"Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowledge, yet conventional retrieval based on lexical or semantic similarity is poorly suited for complex reasoning tasks: a semantically similar problem may demand an entirely different s..."
๐ฌ RESEARCH
via Arxiv
๐ค Mengyu Zheng, Kai Han, Boxun Li et al.
๐
2026-06-10
โก Score: 6.7
"General-purpose agents such as OpenClaw are increasingly used as autonomous tool users, but their coding ability is difficult to measure under SWE-bench: a generic agent does not by itself satisfy the clean Docker workspace, patch, and prediction contract required for scoring. We introduce Claw-SWE-..."
๐ฌ RESEARCH
via Arxiv
๐ค Chirag Chawla, Pratinav Seth, Vinay Kumar Sankarapu
๐
2026-06-10
โก Score: 6.7
"Domain fine-tuning degrades the safety of large language models: fine-tuned specialists readily comply with harmful prompts framed in domain language. Existing inference-time defenses that mix logits from a safe anchor model require both models to share a vocabulary, which rules them out for the cro..."
๐ฌ RESEARCH
via Arxiv
๐ค Xucong Wang, Ziyu Ma, Yong Wang et al.
๐
2026-06-10
โก Score: 6.7
"Recent advances in agentic Reinforcement Learning (RL) have substantially improved the multi-turn tool-use capabilities of large language model agents. However, most existing methods assign credit over coarse heuristic units, such as tool-call boundaries or fixed workflows, making it difficult to id..."
๐ฌ RESEARCH
via Arxiv
๐ค King Yeung Tsang, Zihao Zhao, Vishal Venkataramani et al.
๐
2026-06-11
โก Score: 6.6
"Multi-Agent Systems (MAS) built on Large Language Models (LLMs) require effective orchestration to coordinate specialized agents, yet training such orchestrators is hindered by limited supervision and high computational cost. We propose Orchestration Reward Modeling (OrchRM), a self-supervised frame..."
๐ฌ RESEARCH
via Arxiv
๐ค Yucheng Li, Huiqiang Jiang, Yang Xu et al.
๐
2026-06-10
โก Score: 6.6
"Reinforcement learning (RL) has become a key component in modern large language models, yet the rollout stage remains the key bottleneck in RL training pipelines. Although Multi-Token Prediction (MTP) offers a natural solution to accelerate rollouts through speculative decoding, many studies have ob..."
๐ฐ NEWS
๐บ 7 pts
โก Score: 6.4
๐ฌ RESEARCH
๐บ 2 pts
โก Score: 6.4
๐ฐ NEWS
๐บ 1 pts
โก Score: 6.2
๐ ๏ธ SHOW HN
๐บ 2 pts
โก Score: 6.1
๐ฐ NEWS
๐บ 2 pts
โก Score: 6.1