AI News Archive: June 30, 2026 — Part 24

Sourced from 500+ daily AI sources, scored by relevance.

Self-Supervised Temporal Regularization for Landmark-Based Cardiac Segmentation with Automatic AHA Regional Mapping
Graph-based cardiac segmentation with implicit anatomical correspondences provides topological guarantees and population-level analysis capabilities, but models trained on independent frames of image sequences exhibit temporal discontinuities that affect reliable clinical measurements, particularly ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31785v1
Mesh BDF: Barycentric Dominance Field for 3D Native Mesh Generation
Autoregressive (AR) modeling has recently achieved remarkable progress in native 3D mesh generation, largely due to its natural ability to handle variable-length, discrete data structures. However, the inherent constraints of the AR paradigm severely restrict the generated meshes, leading to limited...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31777v1
NURBS Splatting: A Unified Differentiable Rendering Framework for Vector Graphics
Differentiable rendering of planar rational splines remains largely underexplored, despite their widespread use in vector graphics and design. Existing differentiable vector renderers primarily focus on Bézier curves and rely on analytic rasterization, which can suffer from gradient instability and ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31764v1
Estimating Velocity of Spheres from Rolling-Shutter Image(s)
Rolling-shutter cameras introduce characteristic distortions when imaging fast moving objects, and these effects are typically treated as artifacts to be corrected. In this work, we instead leverage rolling-shutter distortions as a valuable source of temporal information to estimate the 3D translati...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31760v1
Rhythm-Structured Predictive Learning for Remote Photoplethysmography
Remote photoplethysmography (rPPG) estimates physiological signals from facial videos by analyzing subtle pulse induced skin color variations. Despite recent progress, existing self-supervised rPPG methods mainly reconstruct masked pixels or low-level visual representations, which can bias the model...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31736v1
MemLearner: Learning to Query Context memory for Video World Models
Video World Models are interactive video generation models that predict future world states based on user actions and history video frames. A critical challenge in video world models is the lack of memory, causing inconsistent generated scenes over extended durations. Previous methods explored rule-...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31734v1
EatSense 2.0
Gut tracking with a mascot that mirrors your day
🧰 ToolsJun 30, 2026https://www.producthunt.com/products/eatsense-ai-food-diary?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
WIDER-FAIR: An Annotated Version of the WIDER-FACE Dataset for Fairness Evaluation
The deployment of face detection models in real-world applications raises important fairness concerns, as these systems may showcase performance disparities across demographic groups. A key obstacle to studying and mitigating such biases is the lack of face detection datasets with sensitive feature ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31704v1
Phantom: A Unified Face-Swap Deepfake Protection Framework with Latent and Spatial Constraints
Face-swapping deepfakes pose an escalating threat to personal privacy by enabling unauthorized identity manipulation. While adversarial approaches have demonstrated success against black-box face recognition (FR) models, their applicability to face-swapping scenarios remains underexplored. In partic...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31703v1
ShellMaker: Language-Guided Exterior Completion under Structural Constraints
Despite advances in indoor scene generation, synthesizing coherent building exteriors consistent with generated interiors remains largely unexplored. Existing methods can generate floor plans and wall layouts but typically stop at a structural shell, lacking stylistically consistent facades and roof...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31680v1
FedLAB: Traceable Semantic Codebooks for Federated Multimodal Graph Foundation Learning
Multimodal graph foundation models aim to learn reusable knowledge from graphs enriched with text, images, attributes, and relational topology, thereby supporting diverse graph-centric and modality-centric tasks. In practice, however, such multimodal graphs are often distributed across decentralized...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.32016v1
Surrogate Fidelity: When Can Open LLMs Explain Closed Ones?
Mechanistic interpretability (MI) requires full access to model internals, yet the APIs for most widely deployed language models at best expose log-probabilities over output tokens. This creates a surrogate problem: when do measurements made on open models allow us to make claims about a closed mode...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.32008v1
Making Sense of Touch from the Child's View for Contrastive Learning
Is the sense of touch a mechanism for human babies' learning of visual concepts? If so, can we quantify its importance, and to what extent do babies rely on their sense of touch for visual learning? To approach these questions in a principled way, we propose a structured coding system for baby-centr...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31943v1
Low-dimensional topology of deep neural networks
We study layered models, including feedforward networks, ResNets, and transformers, by limiting each layer to a width of $d = 3$, i.e., $\mathbb{R}^3$ as representation space. This allows us to track how a neural network changes low-dimensional topological invariants through its layers. Just about a...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31856v1
Relational and Sequential Conformal Inference for Energy Time Series over Graphs via Foundation Models
Accurate energy demand forecasting is essential for the reliable operation and planning of modern sustainable energy systems. Spatial-temporal graph neural networks (STGNNs) have recently achieved strong performance in point forecasting by jointly modeling temporal dynamics and relational dependenci...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31804v1
Addressing Over-Refusal in LLMs with Competing Rewards
Safety training on language models often induces over-refusal: improved safety on harmful prompts at the cost of increased refusal on harmless ones. Though this trade-off can be mitigated by training models with reinforcement learning (RL) to reason before answering, it does not remove the underlyin...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31748v1
Nonlinearity-Aware LoRA: Structured Gate Adaptation under Low-Rank Constraints
Low-rank adaptation (LoRA) is commonly viewed as an update-space approximation to full fine-tuning, yet this view is incomplete for self-gated Transformer feed-forward networks. In gated FFNs, a low-rank residual can change not only projected features but also the nonlinear selection weights that de...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31717v1
Improving Certified Robustness via Adversarial Distillation
Certified training aims to produce models whose predictions can be formally verified against adversarial perturbations, typically by optimising upper bounds on the worst-case loss over an allowed perturbation set. For neural networks, certified training methods based purely on tight relaxation bound...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31653v1
Evil Spectra: How Optimisers can Amplify or Suppress Emergent Misalignment
Emergent misalignment (EM) is a recently discovered phenomenon in LLMs where fine-tuning on a narrow misaligned task, such as writing insecure code, leads to broadly misaligned behaviour on unrelated prompts. Previous work has noted that the severity of EM is highly sensitive to training choices; ho...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31591v1
From Failure to Alignment: A Requirements Engineering Framework for Machine Learning Systems
Organisations designing, developing, and deploying machine learning systems (MLS) need to be able to check that these systems are trustworthy, and communicate this clearly to their stakeholders, be they different categories of users, engineers, or wider society. By focusing on stakeholders, Requirem...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31589v1
Robustness of neural networks to random noise perturbations of their inputs
We investigate the problem of the robustness of a trained neural network to the perturbation of its input values. More specifically, we examine the interplay between the accuracy of the network, as measured by the mean squared error, and robustness. Accordingly, we present a robustness measure, whic...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31581v1
Localized Conformal Prediction for Image Classification with Vision-Language Models
Conformal predictions have attracted significant attention in the field of uncertainty quantification, mainly because of their strong marginal coverage guarantees. Full conditional guarantee is not an attainable goal, a well known fact in conformal predictions literature. As a result, several approa...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31577v1
Random Reshuffling Dominates Stochastic Gradient Descent
Stochastic Gradient Descent ($\textsf{SGD}$) is one of the most classical optimization algorithms with favorable theoretical guarantees, yet the practical implementation of $\textsf{SGD}$ differs subtly from its well-known form and is often referred to as Shuffling Stochastic Gradient Descent ($\tex...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.32005v1
Evaluation of Population Initialization Methods for Genetic Programming-based Symbolic Regression
We analyze the effect of optimizing the initial population of genetic programming (GP) for symbolic regression (SR) on the accuracy and complexity of solutions. We compare three well-established random initialization methods as well as initialization with small optimized solutions from exhaustive sy...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31990v1
Accelerating Conformal Prediction via Approximate Leave-One-Out
While conformal prediction provides a general framework for uncertainty quantification in predictive inference, its application is often limited by computational cost. Recent methods, including Jackknife+ and Jackknife-minmax, achieve faster computation by trading a slight loss of efficiency relativ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31915v1
Sequential RC-TGAN: Generating Relational Time Series with Spectral Envelope Loss
The generation of synthetic relational databases often involves modeling complex temporal dynamics, such as transaction logs or event sequences. A significant challenge in this domain is the handling of categorical time series (e.g., status codes), where standard encoding methods like one-hot encodi...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31904v1
Policy Optimization Achieves Data-Dependent Regret Bounds in MDPs with Unknown Transitions
We study policy optimization for online episodic tabular Markov decision processes with unknown transition kernels, aiming for best-of-both-worlds guarantees together with data-dependent regret bounds. Recent work (Dann et al., 2023; Li et al., 2026) has shown that policy optimization can adapt to b...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31769v1
Is Natural Always Appropriate? Investigating Naturalness and Appropriateness Across Different Domains for TTS Evaluation
Text-to-speech (TTS) evaluation is an open challenge. While the primary target was "naturalness," recent fidelity gains shifted focus toward "appropriateness" and whether speech is correct for its context. In this work, we examine how perception changes when the expected downstream use varies. We me...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31729v1
Diffusing Blame: Task-Dependent Credit Assignment in Biologically Plausible Dual-Stream Networks
Biological neural circuits obey Dale's principle: each neuron's synapses are uniformly excitatory or inhibitory. Artificial networks that respect this constraint must coordinate separate excitatory and inhibitory populations, fundamentally changing how credit is assigned during learning. Several bio...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31700v1
ECHO: Prune to act, trace to learn with selective turn memory in agentic RL
Long-horizon language agents must repeatedly interact with tools, accumulate evidence, and make decisions under bounded context windows. Existing context-management methods make such rollouts feasible by truncating distant history, folding past turns into summaries, or selecting compact memory state...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31650v1
Think in English, Answer in Korean: Efficient Adaptation of Multilingual Tool-Using Agents
We present LuckyStar 111B, a 111B-parameter hybrid reasoning model developed through a collaboration between Cohere and LG CNS for Korean-English enterprise agents under practical memory and serving constraints. The model trains from Cohere's fully post-trained Command A model rather than a new pret...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31648v1
Calibration, Not Compilation: Detecting and Repairing Misspecified Probabilistic Programs Written by Language Models
Language models increasingly write probabilistic programs (in NumPyro, Stan, or Pyro), but a program that compiles, runs, and passes every unit test can still be \emph{statistically} wrong -- a Gaussian likelihood for heavy-tailed data, a Poisson for over-dispersed counts, an invalid prior support, ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31630v1
Preserve the Hard, Regenerate the Rest: Uncertainty-Guided Synthetic Training Data Augmentation with Diffusion Models
Semantic segmentation models struggle with data sparsity and rare or visually diverse regions, e.g., dense regions or small objects in aerial or autonomous mobility data. While synthetic augmentation is an appealing solution, directly generating new labeled data risks misalignment of labels and gene...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31603v1
Adapting Generalist Robot Policies with Semantic Reinforcement Learning
Generalist robot policies learn a diverse repertoire of behaviors from large-scale pretraining. In principle, this makes them excellent priors for downstream adaptation via reinforcement learning (RL). In practice, however, standard RL methods leveraging this prior optimize directly over robot actio...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31958v1
RoboTacDex: A Dexterous Visual-Tactile-Action Dataset for Humanoid Manipulation
In the field of robot learning, large-scale and diverse demonstration trajectories provide the fundamental basis for enhancing robotic manipulation ability. We introduce RoboTacDex, a large, multi-modal, and diverse dataset of dexterous manipulation behaviors performed with a humanoid robot. Built o...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31836v1
Reinforcement Learning-Based Control for an Inline Skating Humanoid Robot
As humanoid robots become increasingly dynamic, coupling them with reinforcement learning offers a promising approach to solving the complex, underactuated mechanics of passive inline skating. Equipping a humanoid robot with passive inline skating wheels presents an opportunity to combine the versat...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31807v1
FastDSAC: Enhancing Policy Plasticity via Constrained Exploration for Scalable Humanoid Locomotion
Scalable reinforcement learning has popularized high-throughput sampling architectures, which significantly compresses the training time for off-policy methods in robotic locomotion. However, the rapid increase of data volume and update frequency undermines the stability of value-based methods and d...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31691v1
DynFly: Dynamic-Aware Continuous Trajectory Generation for UAV Vision-Language Navigation in Urban Environments
Recent advances in multimodal large models have significantly improved UAV vision-language navigation (UAV-VLN) by enhancing high-level perception and reasoning. However, existing methods mainly focus on predicting discrete actions, local targets, or sparse waypoints, while the continuous transition...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31654v1
Wiglo
Replace fog with clarity
🧰 ToolsJun 30, 2026https://www.producthunt.com/products/wiglo-2?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Robust Autonomous UAV Landing on Maritime Platforms via Multimodal Agentic AI and Active Wave Compensation
Autonomous aerial inspection of marine infrastructure is frequently compromised by stochastic sea states, introducing risks of high-kinetic impacts, post-landing toppling, and sensory occlusion. This paper proposes a decoupled, multi-vehicle landing framework synchronizing an Unmanned Surface Vehicl...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31613v1
Stabilization Learning: A Paradigm Transition Bridging Control Theory and Machine Learning
Stabilization learning is an interdisciplinary paradigm that bridges control theory and machine learning. Its core idea is to enable systems to adjust their policies under perturbations or environmental changes through real-time feedback and adaptive mechanisms. It takes stability as its primary goa...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31562v1
UniTac: A Unified Multimodal Model for Cross-Sensor Tactile Understanding and Generation
Unified multimodal models (UMMs) have shown great promise in integrating understanding and generation across diverse modalities. However, existing research rarely extends this paradigm to the tactile domain, where both object-level semantics and sensor-level configurations jointly determine the mean...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31451v1
Stage-Transition Dense Reward Modeling for Reinforcement Learning
Reinforcement learning for long-horizon robotic manipulation is often limited by sparse and delayed rewards, while manually designing dense shaping signals is costly and brittle to changes in environments and object configurations. This work proposes Stage-Transition Dense Reward (STDR), a visual re...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31377v1
Verification-Gated Agentic Mission-State Governance for Intelligent Industrial Multi-Robot Systems
Agentic artificial intelligence is increasingly used to decompose industrial tasks, propose robot actions, and adapt execution plans in dynamic cyber-physical environments. However, autonomous proposal generation alone does not guarantee that multi-robot industrial systems preserve task dependencies...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31339v1
3D HAMSTER: Bridging Planning and Control in Hierarchical Vision Language Action Models through 3D Trajectory Guidance
Hierarchical Vision-Language-Action (VLA) models decouple high-level planning from low-level control to improve generalization in robot manipulation. Recent work in this paradigm uses 2D end-effector trajectories predicted by a Vision-Language Model (VLM) as explicit guidance for a downstream policy...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31329v1
Safe Online Learning via Smooth Safety-Structured Policy Composition
Safe online reinforcement learning requires policies to respect safety constraints while maintaining smooth optimization dynamics. Existing approaches typically rely on either strict safety enforcement via action interventions, which introduce discontinuities in system interaction and learning, or s...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31320v1
Long-term Traffic Simulation via Structured Autoregressive Modeling
Interactive traffic simulation is a vital world model for autonomous driving. A central challenge in long-horizon simulation is modeling sustained multi-agent interactions, which is further exacerbated by dynamic token cardinality as agents continuously enter and exit the scene. In this work, we pro...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31209v1
Machine Learning-based Feedback Linearization Control of Quadrotor Subject to Unmodeled Dynamics
The control of agile quadrotors in dynamic and uncertain environments remains an open area of investigation to this day, particularly when the complete system dynamics are partially known or highly nonlinear. This work introduces a novel machine learning-based feedback-linearization control framewor...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31199v1
LLM-Powered Interactive Robotic Action Synthesis from Multimodal Speech, Gestures, and Music
The quest for intuitive and natural human-robot interaction (HRI) remains a significant challenge in robotics. Traditional methods often rely on rigid, pre-programmed commands that limit the robot's expressiveness and adaptability. This paper introduces a novel framework that leverages the reasoning...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31158v1
Scenario Generation for Testing of Autonomous Driving Systems Using Real-World Failure Records
To ensure safe on-road behavior, pre-deployment testing and failure discovery of Autonomous Driving Systems (ADS) is crucial. Present day simulation based testing methods focus largely on mathematical models for efficient search of optimal scenarios, assuming a fixed scenario representation. On the ...
📄 ResearchJun 30, 2026http://arxiv.org/abs/2606.31131v1