AI News Archive: May 26, 2026 — Part 19

Sourced from 500+ daily AI sources, scored by relevance.

Three Office
Virtual office with spatial voice chat and AI meeting notes
🧰 ToolsMay 26, 2026https://www.producthunt.com/products/three-office?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
BEAT: Rhythm-Elastic Alignment for Agentic Music-guided Movie Trailer Generation
Automatic movie trailer generation must select shots from a full-length film and synchronize them with background music. Existing methods either relegate music alignment to post-processing or enforce rigid one-to-one shot-music mappings, overlooking that professional editing rhythm is elastic: rapid...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27067v1
SCKAN: Structural Consensus-based KAN Prototype Learning for Semi-Supervised Pancreas Segmentation
Accurate pancreas segmentation is critical for early cancer diagnosis, where annotation scarcity necessitates Semi-Supervised Learning (SSL). However, due to significant inter-sample morphological variability, existing SSL methods face severe generalizability limitations under sparse supervision, le...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27032v1
NeR-SC: Adapting Neural Video Representation to Screen Content
Implicit neural representations have emerged as a promising paradigm for video compression, with recent methods achieving competitive performance on natural video. However, screen content video -- common in remote desktop, online education, and cloud gaming -- exhibits distinct statistics: sharp edg...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27024v1
Timestep-Aware SVDQuant-GPTQ for W4A4 Quantization of Wan2.2-I2V
W4A4 quantization of large video diffusion Transformers offers substantial memory savings but is hindered by two main challenges: sparse large-magnitude activation outliers, and strongly timestep-dependent activation distributions across the multi-step denoising trajectory. These difficulties are co...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27003v1
On the Robustness of Machine Unlearning for Vision-Language Models
Vision-language models (VLMs) may memorize undesirable information from training data, motivating growing interest in machine unlearning. In this work, we present the first systematic survey and robustness analysis of VLM unlearning. We provide a comprehensive taxonomy and review of existing VLM unl...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26992v1
CodecCap: High-Fidelity Codec-Inspired Residual Modeling for Dense Video Captioning
Existing video captioning methods struggle to balance visual fidelity and redundancy: holistic captions are compact but lose fine-grained evidence, whereas segment-wise captions improve coverage but introduce heavy redundancy. We propose CodecCap, a codec-inspired framework for high-fidelity dense v...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26967v1
DinoComplete: 3D Shape Completion with Distilled Semantic Priors and State Space Models
3D shape completion from partial scans remains challenging for unseen categories and noisy real-world observations, where geometry alone is often insufficient for inferring missing structure. We present DinoComplete, a deterministic and efficient shape completion framework that augments geometric re...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26949v1
Leveraging Text-to-Image Diffusion Models for Unsupervised Visual Object Tracking
Unsupervised visual object tracking is a challenging task that requires following arbitrary targets in videos without training on ground-truth annotations. Despite considerable progress, existing state-of-the-art unsupervised trackers often struggle in scenarios that demand fine-grained understandin...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26933v1
I2PRef: Image-Driven Point Completion with Iterative Refinement
We present an image-conditioned point cloud completion approach that treats images as the primary geometric source rather than a secondary guide. To this end, we introduce an Image-to-Point (I2P) module that can reconstruct complete point clouds directly from a single RGB image, with no need for 3D ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26914v1
SIMPC: Learning Self-Induced Mirror-Point Consistency for Unsupervised Point Cloud Denoising
In point clouds, noise directly perturbs point coordinates that encode both spatial location and geometry, making one-to-one correspondence construction more challenging than in images. Existing methods impose statistical mappings across noisy variants via noise or optimal transport, but suffer from...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26894v1
BASIS: Batchwise Advantage Estimation from Single-Rollout Information Sharing for LLM Reasoning
Reinforcement learning with verifiable rewards has become a standard recipe for improving the reasoning abilities of large language models. Existing algorithms face a tradeoff between computational efficiency and sample efficiency in value estimation and policy learning. We introduce BASIS, a critic...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27293v1
Detectability in Diversity: Improved Canary Crafting for Privacy Auditing in One Run
Privacy auditing aims to empirically assess privacy leakage in machine learning models using membership inference attacks (MIAs), and to derive lower bounds on differential privacy (DP) parameters. Recent one-run auditing methods address the high cost of standard approaches by relying on a single tr...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27292v1
Transfer Learning using 66 Diseases for Disease Forecasting Applications
Disease forecasting models typically rely on a single data stream, making models brittle when histories are short or noisy. Recent top-performing models have shown that synthesizing multiple reporting systems for the same disease improves performance. Other recent work takes this idea a step further...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27269v1
Explainable Comparison of Feature-Based and Deep Learning Models for TROPOMI Methane Plume Screening
Continuous and global detection of large methane emissions is a crucial step for global warming mitigation. Satellite observations, such as from S5P/TROPOMI, combined with plume detection algorithms, can play a key role in this effort. However, not all TROPOMI plume detections that look like methane...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27236v1
Mildly Overparameterized ReLU Networks on Orthogonal Data: Incremental Learning and Implicit Bias
The successful training of neural networks hinges on the use of first order optimization methods, yet the theoretical characterization of these methods remains incomplete. This is especially true in settings with mild overparameterization. In this work, we study the gradient flow dynamics of two-lay...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27097v1
InfraOne AI Labs
Instant AI Labs for Research & Learning — Free 15-Min trial
🧰 ToolsMay 26, 2026https://www.producthunt.com/products/infraone-ai-labs?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Adversarial Dual On-Policy Distillation from Expressive Flow-based Teacher
Learning from demonstrations in embodied control is often cast as behavioral cloning, and recent diffusion or flow-matching policies improve this paradigm by modeling multi-modal expert actions. Yet these methods remain offline supervised learners: the policy is trained only on expert states and rec...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27095v1
Trust Region Q Adjoint Matching
Off-policy reinforcement learning of pretrained flow policies remains challenging due to the instability of optimization arising from the multi-step sampling process. Recently, Q-learning with Adjoint Matching (QAM) addressed this issue by reformulating into a memoryless stochastic optimal control (...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27079v1
Two Speeds of Learning: A Representation-Readout Decomposition of Grokking and Double Descent
Training loss and accuracy are the standard signals used to monitor generalization during deep neural network training. Two well-documented phenomena complicate this picture: in grokking, train loss falls rapidly while test performance improves abruptly only after a long delay; in epoch-wise double ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27078v1
Causal Representation Learning for Generalisable Recommendation
Predictive models trained on observational data often fail to generalise to the distributions they encounter when deployed, especially when the training data is a product of the system being optimised. Recommender systems are a canonical example: they are trained on interaction logs confounded by th...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27043v1
SQARL: A Size-Agnostic Reinforcement Learning approach for Circuit Allocation in Distributed Quantum Architectures
The scaling of quantum processors is currently limited by technical challenges such as decoherence and cross-talk. As the number of qubits grows, interference increases the computational noise. Distributed quantum computing addresses these limitations by interconnecting smaller, easier-to-handle qua...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27027v1
From Scores to Gibbs Correctors: Accelerating Uniform-Rate Discrete Diffusion Models
Discrete diffusion models have achieved strong empirical performance in text and other symbolic domains, but, especially for uniform-rate models, they often require many steps to generate a single sample. Existing acceleration methods either rely on training additional quantities or suffer from slow...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27352v1
Greening AI Inference with Accuracy and Latency-aware User Incentives
The widespread use of AI services has raised concerns for its environmental sustainability, towards which recent studies have identified carbon emissions of AI inference as the major contributor. This paper introduces a framework for designing AI inference incentives based on the users' valuation fo...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27309v1
Normal Guidance is what Attention Needs
We consider training classifiers for 3D medical images using only one binary label for the entire volume rather than a label for each 2D slice. In such weakly supervised settings, can we learn accurate classifiers for slice-level predictions? Attention-based multiple instance learning (MIL) can prod...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27306v1
Kan Extension Transformers: A Categorical Unification of Attention, Diffusion, and Predict-Detach Self-Conditioning
We propose Kan Extension Transformers (KETs) as a unifying categorical framework for a diverse group of Transformer implementations. The core claim is that a Transformer layer can be viewed as a weighted structured extension operator: standard attention is the singleton-neighborhood case, Geometric ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27259v1
Symbolic Regression via Latent Iterative Refinement
Symbolic regression (SR) seeks closed-form mathematical expressions that fit observed data. Neural SR methods amortize the search by training an encoder to map observations directly to expressions in a single pass, but this amortized inference leaves a residual amortization gap between its one-shot ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27245v1
The Role of Causal Features in Strategic Classification for Robustness and Alignment
In strategic classification, an institution (e.g., a bank) anticipates adaptation from users who change their features to increase utility in a classification task (e.g., loan repayment). Since a key challenge is the distribution shift induced by users, we turn to causal models, which have been show...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27163v1
Gaussian Process-based learning with new MCMC-based implementation of Wishart prior on correlation matrix
In probabilistic supervised learning of an input-output relationship - as a sample function of a Gaussian Process (GP) - priors are typically specified for the hyperparameters of the kernel that parametrises the covariance function of the GP, where the induced covariance matrix of the (resulting mult...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27093v1
Cost of Structural Learning Under Censored Feedback: A Threshold-Bandit Approach
In many multi-agent applications, tasks yield rewards only when executed by a coalition meeting an unknown size threshold; otherwise, feedback is fully censored. This censorship creates an identifiability problem: agents cannot distinguish stochastic failure from insufficient coordination. We formal...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27076v1
Learning to Orchestrate Agents under Uncertainty
Adaptive orchestration of heterogeneous agents requires making sequential delegation decisions under uncertain and evolving agent behaviour, e.g., coordinating specialised AI models with varying reliability, cost, and response quality. While prior work on agent orchestration focuses on performance o...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27073v1
Learning Dynamic Graph Representations through Timespan View Contrasts
The rich information underlying graphs has inspired further investigation of unsupervised graph representation. Existing studies mainly depend on node features and topological properties within static graphs to create self-supervised signals, neglecting the temporal components carried by real-world ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27063v1
Less is More: Early Stopping Rollout for On-Policy Distillation
On-policy distillation has recently emerged as a promising alternative to standard sequence-level imitation, training a student by scoring its own rollouts with a teacher model. However, we observe an "Off-policy Teacher Decay" problem in this paradigm: for the later tokens, with student's earlier tr...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27028v1
TPS-Drive: Task-Guided Representation Purification for VLM-based Autonomous Driving
Vision-Language Models (VLMs) provide a promising foundation for autonomous driving planning, yet bridging semantic reasoning and precise 3D spatial forecasting remains a critical challenge. Existing representation strategies generally follow two paths: text-aligned methods flatten continuous spatia...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27038v1
Trust, Geometry, and Rules: A Credibility-Aware Reinforcement Learning Framework for Safe USV Navigation under Uncertainty
Autonomous navigation of Unmanned Surface Vehicles (USVs) that is safe and compliant with the International Regulations for Preventing Collisions at Sea (COLREGs) remains a formidable challenge in dynamic maritime environments, particularly when perception systems exhibit miscalibrated uncertainty. ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26974v1
SteelDS: A High-Resolution Video Dataset of E40 Steel Scrap for Object Detection and Instance Segmentation
This dataset provides high-resolution, annotated video sequences of shredded E40-grade steel and copper scrap on a conveyor belt. Captured in a controlled laboratory environment, the data reflects the industrial post-magnetic sorting stage, where manual intervention is typically required to remove c...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26682v1
Breaking the Epistemic Trap: Active Perception Under Compound Uncertainty
Deploying reinforcement learning in safety critical domains, from autonomous vehicles to medical decision support, is constrained by failures arising when systems encounter unfamiliar conditions. We argue that the fundamental bottleneck is not individual challenges like changing dynamics or incomple...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26627v1
Efficient On-policy Visual-RL via Stochastic Decoupled Policy Gradient
We present the stochastic decoupled policy gradient (SDPG), a lightweight visual reinforcement learning (RL) method that trains diverse visuomotor control policies end-to-end within a few hours on a single NVIDIA RTX 4080 GPU. SDPG estimates policy gradients via random perturbations of trajectory ro...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26478v1
Heterogeneous AAV Logistics Task Allocation: A Reinforcement Learning Enhanced Overlapping Coalition Formation Game Approach
In dynamic urban logistics, the stochastic emergence of time-sensitive tasks poses a significant optimality challenge for heterogeneous AAVs logistics task allocation. To address this problem, a reinforcement learning enhanced overlapping coalition formation game approach is proposed. A dynamic task...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26471v1
Robust Koopman Control Barrier Filters for Safe Actor-Critic Reinforcement Learning
Safe reinforcement learning (RL) for robotic systems requires policies that improve task performance while satisfying state and input constraints during both training and deployment. Control barrier functions (CBFs) provide a principled mechanism for enforcing forward invariance through minimally in...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26452v1
VR-DAgger: Immersive VR for Dexterous Data Collection and Uncertainty-Guided On-Policy Correction
Learning from demonstrations is effective for robotic manipulation, but collecting sufficient task-specific data remains a major bottleneck. Under distribution shift, small errors compound, performance degrades, and expert time is often spent on redundant, low-value corrections instead of the few cr...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27114v1
Learning to Balance Motor Thermal Safety and Quadrupedal Locomotion Performance with Residual Policy
Motor thermal management is often overlooked in the context of electrically-actuated robots, particularly legged robots, but motor overheating is a key factor that limits long-duration locomotion especially under payload conditions. This paper integrates a whole-body thermal model of a quadruped rob...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.27046v1
Towards Shared Embodied Intelligence in Humanoid Robots through Optimization Development and Testing of the Human Aware ergoCub Robot
Collaboration is central to human behavior, enabling tasks beyond individual capability. This ability arises from coordinating actions through internal representations of others, a concept known as shared intelligence. Additionally, humans are characterized by physical bodies and cognitive abilities...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26991v1
OSMa-Bench++: Toward Open-Ended Benchmarking of Semantic Mapping for Manipulation with Prompt-Generated Synthetic Scenes
Semantic mapping methods are increasingly used as intermediate scene representations for downstream robotic reasoning and manipulation, yet their evaluation is still largely tied to fixed benchmark datasets with limited coverage of manipulation-relevant corner cases. In this work, we extend OSMa-Ben...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26831v1
Learning Compositional Symbolic Task Rules from Demonstrations with Inductive Logic Programming
Learning from Demonstration (LfD) should capture not only how a task is executed, but also its high-level task structure that explains the demonstrated behavior. As robots become more autonomous, such task representations must be inspectable, reusable, and human-interpretable. To address this, we st...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26828v1
Can VLA Models Learn from Real-World Data Continually without Forgetting?
Vision-language-action (VLA) models provide a promising foundation for general-purpose robotics. However, their successful deployment in real-world scenarios requires the ability to continually acquire new skills while retaining previously learned behaviors. While pioneering research has studied the...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26820v1
Look Further: Socially-Compliant Navigation System in Residential Buildings
The distance at which a mobile robot reacts to a person strongly impacts various qualities of the human-robot interaction. In this paper, we focus on the navigation of a mobile delivery robot platform in a residential indoor hallway environment. Social navigation methods typically focus on avoiding ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26710v1
On the Generalization Capabilities, Design Choices and Limitations of Keypoint Imitation Learning
RGB-based imitation learning requires many demonstrations to generalize to unseen objects or scenes, motivating research into intermediate representations to improve generalization for robotic manipulation. Visual foundation models enable one-shot extraction of keypoints to provide such representati...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26649v1
L-Learning: A Lyapunov-Based Approach Leveraging Lagrangian Mechanics for Efficient and Stable Robot Tracking
This paper presents L-Learning, a novel data-driven control framework for robotics that integrates Lyapunov stability theory with Lagrangian mechanics to enhance trajectory tracking performance. While traditional control methods often suffer from performance degradation in dynamic and uncertain envi...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26648v1
HyperSim: A Holistic Sim-To-Real Framework For Robust Robotic Manipulation
Scaling data volume and diversity is critical for generalizing embodied intelligence. While synthetic data generation offers a scalable alternative to expensive physical data acquisition, transferring robotic manipulation policies from simulation to the real world (sim-to-real) remains a formidable ...
📄 ResearchMay 26, 2026http://arxiv.org/abs/2605.26638v1