AI News Archive: June 1, 2026 — Part 21

Sourced from 500+ daily AI sources, scored by relevance.

Ablating Archetypes: The Stability of Archetypal SAEs is an Artifact of Initialization and Metric Design
Dictionary learning with sparse autoencoders (SAEs) produces overcomplete bases from neural network activations that are often interpretable and reduces polysemanticity. However, features from SAEs vary substantially across random seeds -- a problem known as instability. Archetypal SAEs (Fel et al.,...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02061v1
Realistic noise synthesis reduces bias and improves tissue microstructure estimation with supervised machine learning
Diffusion MRI enables non-invasive probing of tissue microstructure, but accurate parameter estimation is challenged by noise-related effects. In supervised machine learning frameworks trained on simulated data, discrepancies between the noise characteristics of simulated and acquired signals introd...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02044v1
Uncertainty-Aware Graph Neural Reconstruction of Urban Temperature Fields from Sparse Sensors under Deployment Constraints
Reconstructing spatially continuous daily temperature fields from sparse observations is important for urban climate monitoring and heat-risk analysis, but practical deployments are limited by sensor budgets and spacing constraints. This study proposes an uncertainty-aware graph neural network (GNN)...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02038v1
World-Task Factorization for Robot Learning
Robot learning must produce policies that generalize to new combinations of constraints, teammates, and environments. To achieve this, we must structurally factor the policy, which is a choice that dictates what generalizes, what requires retraining, and what remains entangled. Existing methods span...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02027v1
Provable Data Scaling Law for Meta Learning via Complexity Minimization
Pre-training has become a fundamental paradigm in modern machine learning, with one of its key empirical benefits being reduced downstream sample complexity as the scale of pre-training data increases. However, existing theoretical frameworks for pre-training do not fully explain this phenomenon. In...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02008v1
Randomized Least Squares Value Iteration itself is Joint Differentially Private
As reinforcement learning (RL) increasingly applies to sensitive domains, such as health care and recommendation systems, privacy-preserving techniques have become essential to protect users' sensitive information. We investigate privacy-preserving RL under an episodic setting, focusing on algorithm...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01952v1
G2LoRA: Gradient Orthogonal Low-Rank Adaptation Framework for Graph Continual Learning on Text-Attributed Graphs
LLM-as-Aligner has emerged as a prevalent pre-training paradigm for Text-Attributed Graphs(TAGS), aligning graph and text modalities into a shared embedding space via CLIP-style contrastive learning. While effective on individual downstream tasks, we observe severe catastrophic forgetting when such ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01873v1
Task-Induced Representational Invariances Depend on Learning Objective in Deep RL
Reinforcement Learning (RL) has long served as a model for goal-directed animal behavior in neuroscience. Modern deep RL has shown remarkable success across many domains, further strengthening this connection. The ability to learn abstract representations of high-dimensional state spaces underlies m...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01868v1
A Theoretical Framework for Self-Play Theorem Proving Algorithms
Self-play, a type of training algorithm that enables a model to self-improve, has recently shown promising empirical results in the context of formal theorem proving using Large Language Models (LLMs). (Dong & Ma, 2025) instantiate self-play with two cooperating agents: a prover, which proves theore...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01861v1
Observation, Not Prediction: Conversation-Level Disaggregated Scheduling for Agentic Serving
LLM-based agents resolve a user task through many turns of dependent inference and tool calls, producing a workload whose total cost is unknown when the task arrives. Existing multi-turn systems keep the turn as the scheduling unit and decide, turn by turn, whether to disaggregate prefill from decod...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01839v1
Adaptive Sharpness-Aware Minimization with a Polyak-type Step size: A Theory-Grounded Scheduler
Sharpness-Aware Minimization (SAM) has established itself as a powerful and widely adopted optimizer for training machine learning models. By explicitly minimizing the sharpness of the loss landscape, SAM often improves generalization while delivering strong empirical performance. However, SAM and i...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01827v1
FLARE: Diffusion for Hybrid Language Model
Autoregressive (AR) large language models (LLMs) have achieved broad practical success, but sequential decoding remains a key bottleneck for low-latency deployment. Recent efficient-inference work has progressed along two axes: reducing the cost of each model invocation through efficient architectur...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01774v1
Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams
Auto-harness systems such as A-Evolve, GEPA, and Meta-Harness improve LLM agents by optimizing prompts, skills, tools, memories, and supporting infrastructure from execution feedback, but they are typically evaluated on fixed offline benchmarks. Real deployments instead present open-ended task strea...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01770v1
TypeTab
On-device autocomplete that learns how you write
🧰 ToolsJun 1, 2026https://www.producthunt.com/products/typetab-2?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Evaluating Real-World Generalizability of Algorithm Selection Models
Algorithm Selection (AS) aims to automatically identify the most suitable optimization algorithm for a given problem instance by leveraging measurable problem characteristics and historical performance data. In this study, we investigate the generalization ability of AS models across both synthetic ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02016v1
A Closer Look at In-Distribution vs. Out-of-Distribution Accuracy for Open-Set Test-time Adaptation
Open-set test-time adaptation (TTA) updates models on new data in the presence of input shifts and unknown output classes. While recent methods have made progress on improving in-distribution (InD) accuracy for known classes, their ability to accurately detect out-of-distribution (OOD) unknown class...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01973v1
Flow-Transformed Implicit Processes for Function-Space Variational Inference
Implicit-process priors define distributions over functions through flexible generative mechanisms, making them attractive for Bayesian function-space modelling. However, performing posterior inference with such priors is challenging because their induced function-space distributions are typically n...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01954v1
Learning Action-Conditional and Object-Centric Gaussian Splatting World Models for Rigid Objects
World models enable intelligent agents to predict the consequences of their actions on the environment. In this paper, we propose Multi Rigid Object Gaussian World Model (MRO-GWM), a novel model that learns action-conditional dynamics of rigid objects in 3D. By representing the scene by object-centr...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01950v1
Private and Stable Test-Time Adaptation with Differential Privacy
Test-time adaptation (TTA) can reduce error on new and different data by updating the model on these inputs during inference. However, these updates raise the issue of privacy w.r.t. the testing data, because the model parameters now depend on all past inputs. To control this privacy risk, we cast m...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01908v1
Segment-driven Structural Induction and Semantic Alignment for Heterogeneous Tabular Representation
Real-world domains often contain heterogeneous tables whose headers vary while their underlying attribute semantics are shared, making it difficult to induce domain-specialized semantics from table-local evidence alone. Existing encoders model parts of this problem, but often underuse column-level v...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01890v1
Beyond the Simplex: Balanced Prototype Geometry for Scorer-Agnostic Open-Set Recognition
Open-set recognition (OSR) requires a classifier to reject inputs from unseen classes which is essential in safety-critical settings such as medical imaging. Simplex based methods, which fix class prototypes at the vertices of a regular simplex and then reject via a distance-ratio score, perform wel...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01883v1
Continual Learning as a Multiphase Moving-Boundary Problem
Continual learning struggles to balance retaining past knowledge with absorbing new tasks. Stefan-CL elegantly resolves this stability-plasticity dilemma through the physics of melting. It frames consolidated knowledge as a protected "solid" and unused capacity as an adaptable "liquid." As the netwo...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01863v1
The Lie We Tell: Correcting the Euclidean Fallacy in Vision Language Action Policies via Score Matching on Tangent Space
Diffusion-based Vision-Language-Action policies achieve remarkable success in robotic manipulation, yet commit a fundamental geometric error we term the $\textbf{Euclidean Fallacy}$: representing SE(3) poses as flat $\mathbb{R}^{12}$ vectors. This approximation induces (1) manifold drift violating S...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01847v1
Mos-Gen: A Generative Molecular Framework for Mosquito Insecticide Design
Mosquito-borne infectious diseases cause more than 700000 deaths worldwide each year. The long-term use of conventional chemical insecticides has induced serious resistance problems, creating an urgent need to develop novel, highly effective, and ecologically sustainable alternatives. While existing...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01846v1
Site4Drug: Predicting Drug-Binding Target Sites with an AI Agent
Selecting where to intervene on a protein (i.e., choosing a targetable site) is often a more ambiguous and failure-prone bottleneck than selecting what binds, especially for membrane proteins where accessibility, topology, and post-translational modifications (PTMs) constrain actionable regions. We ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01816v1
Tree-Guided Identify-Then-Exploit: A Unified Framework of Best Arm Identification and Regret Minimization for Dueling Bandits
We study $N$-armed stochastic dueling bandits under the Condorcet-winner assumption, where three widely adopted objectives are considered: best-arm identification (BAI), weak regret, and strong regret. We propose Tree-Guided Identify-Then-Exploit (TG-ITE), the first unified framework to tackle all t...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01799v1
WALL-WM: Carving World Action Modeling at the Event Joints
WALL-WM is a World Action Model that shifts video-action learning from chunk-centric optimization to event-grounded Vision-Language-Action pretraining, using semantically coherent action events as the atomic unit of learning. Existing WAMs commonly initialize from multimodal or video foundation mode...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01955v1
FlatVPR: Plug-and-play Geo-linear Residual Adapter for Geometric Rectification of Foundation Model Feature Manifolds
This paper proposes ``FlatVPR,'' a novel geometric rectification paradigm that effectively bridges the trade-off between map lightweightness and localization accuracy in visual place recognition (VPR) by enforcing a feature manifold structure where any descriptor between two adjacent anchors $\mathb...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01734v1
TIDES: Time-Derivative Event Simulation via Deformable Reconstruction
Event cameras emit asynchronous events in response to environmental appearance changes. The scarcity of real-world event datasets makes simulation essential. However, most simulators infer event timestamps from frame sequences, forcing many threshold crossings to share a small set of discrete times;...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02058v1
OpenCairn
Open-source AI knowledge OS for notes, docs, and agents
🧰 ToolsJun 1, 2026https://www.producthunt.com/products/opencairn?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Co-training with Ego-centric Video and Demonstration for Robot Navigation Task
Vision-language-action (VLA) models are promising for diverse robotic tasks, but their performance heavily depends on large-scale high-quality training data, whose collection on real robots is costly and time-consuming. While prior work has explored augmenting manipulation datasets with egocentric h...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01951v1
Set-Supervised Diffusion Policy: Learning Action-Chunking Diffusion through Corrections
Diffusion policies have recently emerged as a powerful framework for robotic manipulation. However, like other behavior cloning methods, they remain vulnerable to distributional shift, often requiring human-in-the-loop interventions to correct failures during deployment. These interactions naturally...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01865v1
PHASOR: Phase-Anchored Universal Action Representations for Humanoid Embodiments
Learning a good action embedding space is fundamental to scalable robot policy learning, yet existing methods treat action latents as task-specific intermediates rather than first-class representations. The resulting latents are unstructured, embodiment-specific, and weakly tied to motion semantics,...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01851v1
Trans2Occ: Voxel Occupancy Estimation and Grasp for Transparent Objects from Simulation to Reality
Transparent objects remain challenging for robotic perception due to unreliable depth sensing caused by refraction and reflection. While prior approaches rely on multi-view reconstruction or depth completion, they are often difficult to scale or deploy in real-world robotic systems. In this paper, w...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01777v1
FlipItRight: Stable Pose-Targeted Throw-Flip Across Diverse Objects
We propose FlipItRight, a framework for stable planar pose-targeted throw-flip with a high-DoF manipulator. The task is decomposed into an object-level planner, which generates candidate release states satisfying the desired landing pose, and a robot-level planner, which evaluates executability and ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01713v1
Goal2Pixel: Grounding Goals to Pixels for Vision-Language Navigation
Vision-language models (VLMs) have become a common foundation for vision-and-language navigation in continuous environments (VLN-CE). Yet most VLM-based methods cast navigation as low-level action prediction, an interface that is ambiguous, tied to short-horizon motion primitives, and inefficient du...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01621v1
Embedding Semantic Risk into Distance Fields and CBFs for Online Monocular Safe Control
We propose an online monocular perception-to-control framework that embeds semantic risk into the distance field used by Control Barrier Function (CBF)-based safe navigation and teleoperation. Many perception-based safety filters assign the same distance-based safety margin to all mapped obstacles o...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01605v1
Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation
Vision-Language Navigation in Continuous Environments (VLN-CE) poses a formidable challenge for autonomous agents, requiring seamless integration of natural language instructions and visual observations to navigate complex 3D indoor spaces. Existing approaches often falter in long-horizon tasks due ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01565v1
Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics
Hierarchical 3D Scene Graphs (3DSG) have emerged as an actionable and scalable representation for long-term autonomy incorporating metric, semantic, and topological information in the scene. However, the question of geometric representation of objects in 3DSG has been overlooked as most methods use ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01545v1
Spatio-Temporal Reconnection for Multi-Robot Networks using Adaptive Prescribed-Time CBFs
In multi-robot systems, maintaining persistent communication graph connectivity is often overly restrictive, especially when robots have limited communication ranges but operate in large environments. Instead, allowing robots to temporarily disconnect and later reconnect is often more desirable for ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01526v1
ReSkill: Reconciling Skill Creation with Policy Optimization in Agentic RL
Agentic reinforcement learning (RL) enables LLM agents to improve continuously from environment rewards, yet the resulting policies do not systematically accumulate reusable strategies that generalize across tasks. Modular skills can provide such reusable strategies, yet existing skill-augmented RL ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01619v1
Fast Generalization after Interpolation via Critically Damped Momentum Optimization
A central problem in machine learning is that models can achieve near-perfect training performance while generalizing substantially less well to unseen examples. This gap is especially acute in high-dimensional, low-sample regimes, where many interpolating solutions exist and optimization must impli...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01521v1
Decision-calibrated prediction sets for robust power system operations
Robust optimization offers a tractable approach to balance operating costs and reliability in power systems dominated by weather-dependent renewable uncertainty, but its performance depends critically on the uncertainty set. Standard data-driven approaches often calibrate uncertainty sets to attain ...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.02081v1
Data-Automated Policy Learning for Nonlinear Welfare
This paper explores policy learning from observational data, focusing on a nonlinear welfare criterion in a binary treatment setting. The nonlinear criterion is inspired by scenarios where policymakers prioritize specific population segments. We model this criterion using a utility function that enc...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01659v1
MINTS: Minimalist Thompson Sampling
The Bayesian paradigm offers principled tools for sequential decision-making under uncertainty, but its reliance on a probabilistic model for all parameters can hinder the incorporation of complex structural constraints. We introduce a minimalist Bayesian framework that places a prior only on the lo...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01655v1
Self-Regulating Annealing in Heavy-Tailed Diffusion Models
Diffusion models have emerged as a leading framework for deep generative modeling. While the standard Gaussian formulation is theoretically convenient, its suitability for heavy-tailed datasets remains unclear. To address this, heavy-tailed diffusion models (HTDMs) extend the standard formulation by...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01645v1
Semi-Supervised Hyperbolic Hierarchical Clustering with Set-Level Structural Priors
Semi-supervised hierarchical clustering aims to learn a tree structure consistent with data patterns and user-provided supervision. Supervision is usually given as leaf-level relations, such as pairwise must-link/cannot-link constraints or triplet-wise must-link-before constraints. Although useful f...
📄 ResearchJun 1, 2026http://arxiv.org/abs/2606.01525v1
Mina Meeting Assistant
Your AI Teammate now responds and executes during your calls
🧰 ToolsJun 1, 2026https://www.producthunt.com/products/mina-meeting-assistant?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
SocialEcho 2.0
AI social media copilot for teams and agents
🧰 ToolsJun 1, 2026https://www.producthunt.com/products/socialecho?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Dune Keypad
Context-aware Mac keypad, w/ Claude + community extensions
🧰 ToolsJun 1, 2026https://www.producthunt.com/products/dune-4?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29