AI News Archive: May 14, 2026 — Part 21

Sourced from 500+ daily AI sources, scored by relevance.

MambaRain: Multi-Scale Mamba-Attention Framework for 0-3 Hour Precipitation Nowcasting
Accurate precipitation nowcasting over extended horizons (0-3 hours) is essential for disaster mitigation and operational decision-making, yet remains a critical challenge in the field. Existing deterministic approaches are predominantly constrained to shorter prediction windows (0-2 hours), exhibit...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14606v1
VMU-Diff: A Coarse-to-fine Multi-source Data Fusion Framework for Precipitation Nowcasting
Precipitation nowcasting is a vital spatio-temporal prediction task for meteorological applications but faces challenges due to the chaotic property of precipitation systems. Existing methods predominantly rely on single-source radar data to build either deterministic or probabilistic models for ext...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14597v1
TOPOS: High-Fidelity and Efficient Industry-Grade 3D Head Generation
High-fidelity 3D head generation plays a crucial role in the film, animation and video game industries. In industrial pipelines, studios typically enforce a fixed reference topology across all head assets, as such a clean and uniform topology is a prerequisite for production-level rigging, skinning ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14594v1
Eradicating Negative Transfer in Multi-Physics Foundation Models via Sparse Mixture-of-Experts Routing
Scaling Scientific Machine Learning (SciML) toward universal foundation models is bottlenecked by negative transfer: the simultaneous co-training of disparate partial differential equation (PDE) regimes can induce gradient conflict, unstable optimization, and plasticity loss in dense neural operator...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15179v1
Text Knows What, Tables Know When: Clinical Timeline Reconstruction via Retrieval-Augmented Multimodal Alignment
Reconstructing precise clinical timelines is essential for modeling patient trajectories and forecasting risk in complex, heterogeneous conditions like sepsis. While unstructured clinical narratives offer semantically rich and contextually complete descriptions of a patient's course, they often lack...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15168v1
MeMo: Memory as a Model
Large language models (LLMs) achieve strong performance across a wide range of tasks, but remain frozen after pretraining until subsequent updates. Many real-world applications require timely, domain-specific information, motivating the need for efficient mechanisms to incorporate new knowledge. In ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15156v1
Self-Distilled Agentic Reinforcement Learning
Reinforcement learning (RL) has emerged as a central paradigm for post-training LLM agents, yet its trajectory-level reward signal provides only coarse supervision for long-horizon interaction. On-Policy Self-Distillation (OPSD) complements RL by introducing dense token-level guidance from a teacher...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15155v1
RoSHAP: A Distributional Framework and Robust Metric for Stable Feature Attribution
Feature attribution analysis is critical for interpreting machine learning models and supporting reliable data-driven decisions. However, feature attribution measures often exhibit stochastic variation: different train--test splits, random seeds, or model-fitting procedures can produce substantially...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15154v1
Widening the Gap: Exploiting LLM Quantization via Outlier Injection
LLM quantization has become essential for memory-efficient deployment. Recent work has shown that quantization schemes can pose critical security risks: an adversary may release a model that appears benign in full precision but exhibits malicious behavior once quantized by users. However, existing q...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15152v1
Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution
Standard unlearning evaluations measure behavioral suppression in full precision, immediately after training, despite every deployed language model being quantized first. Recent work has shown that 4-bit post-training quantization can reverse machine unlearning; we show this is not a tuning artefact...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15138v1
Learning from Language Feedback via Variational Policy Distillation
Reinforcement learning from verifiable rewards (RLVR) suffers from sparse outcome signals, creating severe exploration bottlenecks on complex reasoning tasks. Recent on-policy self-distillation methods attempt to address this by utilizing language feedback to generate dense, token-level supervision....
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15113v1
Novel Dynamic Batch-Sensitive Adam Optimiser for Vehicular Accident Injury Severity Prediction
The choice of optimiser is important in deep learning, as it strongly influences model efficiency and speed of convergence. However, many commonly used optimisers encounter difficulties when applied to imbalanced and sequential datasets, limiting their ability to capture patterns of minority classes...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15083v1
Croissant Baker: Metadata Generation for Discoverable, Governable, and Reusable ML Datasets
Croissant has emerged as the metadata standard for machine learning datasets, providing a structured, JSON-LD-based format that makes dataset discovery, automated ingestion, and reproducible analysis machine-checkable across ML platforms. Adoption has accelerated, and NeurIPS now requires Croissant ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15079v1
Concurrency without Model Changes: Future-based Asynchronous Function Calling for LLMs
Function calling, also known as tool use, is a core capability of modern LLM agents but is typically constrained by synchronous execution semantics. Under these semantics, LLM decoding is blocked until each function call completes, resulting in increasing end-to-end latency. In this work, we introdu...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15077v1
DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models
Reinforcement learning has emerged as a powerful tool for improving diffusion-based text-to-image models, but existing methods are largely limited to single-task optimization. Extending RL to multiple tasks is challenging: joint optimization suffers from cross-task interference and imbalance, while ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15055v1
TFGN: Task-Free, Replay-Free Continual Pre-Training Without Catastrophic Forgetting at LLM Scale
Continually pre-training a large language model on heterogeneous text domains, without replay or task labels, has remained an unsolved architectural problem at LLM scale. Existing methods rely on replay buffers, task identifiers, regularization penalties that scale poorly, or sentence-classification...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15053v1
An Interpretable Latency Model for Speculative Decoding in LLM Serving
Speculative decoding (SD) accelerates large language model (LLM) inference by using a smaller draft model to propose multiple tokens that are verified by a larger target model in parallel. While prior work demonstrates substantial speedups in isolated or fixed-batch settings, the behavior of SD in p...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15051v1
SpeakerLLM: A Speaker-Specialized Audio-LLM for Speaker Understanding and Verification Reasoning
As audio-first agents become increasingly common in physical AI, conversational robots, and screenless wearables, audio large language models (audio-LLMs) must integrate speaker-specific understanding to support user authorization, personalization, and context-aware interaction. This requires modeli...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15044v1
Generalized Priority-Aware Shapley Value
Shapley value and its priority-aware extensions are widely used for valuation in machine learning, but existing methods require pairwise priority to be binary and acyclic, a restriction spectacularly violated in real-data examples such as aggregated human preferences and multi-criterion comparisons....
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15018v1
Boosting Reinforcement Learning with Verifiable Rewards via Randomly Selected Few-Shot Guidance
Reinforcement Learning with Verifiable Rewards (RLVR) has achieved great success in developing Large Language Models (LLMs) with chain-of-thought rollouts for many tasks such as math and coding. Nevertheless, RLVR struggles with sample efficiency on difficult problems where correct rollouts are hard...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15012v1
Second-Order Actor-Critic Methods for Discounted MDPs via Policy Hessian Decomposition
We address the discounted reward setting in reinforcement learning (RL). To mitigate the value approximation challenges in policy gradient methods, actor-critic approaches have been developed and are known to converge to stationary points under suitable assumptions. However, these methods rely on fi...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14982v1
InfoSFT: Learn More and Forget Less with Information-Aware Token Weighting
Supervised fine-tuning (SFT) provides the standard approach for teaching LLMs new behaviors from offline expert demonstrations. However, standard SFT uniformly fits all samples -- including those with low likelihood under the base model -- which can disproportionately drive training updates toward o...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14967v1
Real-time virtual circuits for plasma shape control via neural network emulators
Reliable position and shape control in tokamak plasmas requires accurate real-time regulation of several strongly coupled shape parameters. The control vectors that disentangle these couplings, referred to as \textit{virtual circuits} (VCs), enable independent shape parameter control for a specific ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14939v1
Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models
Continual learning in multimodal large language models (MLLMs) aims to sequentially acquire knowledge while mitigating catastrophic forgetting, yet existing methods face inherent limitations: architecture-based approaches incur additional computational overhead and often generalize poorly to new tas...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14938v1
A Hardware-Aware, Per-Layer Methodology for Post-Training Quantization of Large Language Models
Scaled Outer Product (SOP) is a post-training quantization methodology for large language model weights, designed to deliver near-lossless fidelity at 4.5--6 bits per weight on hardware with per-layer LUT decode. The methodology combines per-layer search of fixed and dynamic codebook pairs selected ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14929v1
Learning with Shallow Neural Networks on Cluster-Structured Features
The success of deep learning in high-dimensional settings is often attributed to the presence of low-dimensional structure in real-world data. While standard theoretical models typically assume that this structure lies in the target function, projecting unstructured inputs onto a low-dimensional sub...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14927v1
A Mutual Information Lower Bound for Multimodal Regression Active Learning
Active learning for continuous regression has lacked an acquisition function that targets epistemic uncertainty when the predictive distribution is multimodal: variance misses modal disagreement, and information-theoretic targets like BALD are designed for discrete outputs. We introduce a Two-Index ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14917v1
Comie.dev
Production context for AI with logs, DBs, and error tracking
🧰 ToolsMay 14, 2026https://www.producthunt.com/products/comie-dev?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
RefDecoder: Enhancing Visual Generation with Conditional Video Decoding
Video generation powers a vast array of downstream applications. However, while the de facto standard, i.e., latent diffusion models, typically employ heavily conditioned denoising networks, their decoders often remain unconditional. We observe that this architectural asymmetry leads to significant ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15196v1
FutureSim: Replaying World Events to Evaluate Adaptive Agents
AI agents are being increasingly deployed in dynamic, open-ended environments that require adapting to new information as it arrives. To efficiently measure this capability for realistic use-cases, we propose building grounded simulations that replay real-world events in the order they occurred. We ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15188v1
When Are Two Networks the Same? Tensor Similarity for Mechanistic Interpretability
Mechanistic interpretability aims to break models into meaningful parts; verifying that two such parts implement the same computation is a prerequisite. Existing similarity measures evaluate either empirical behaviour, leaving them blind to out-of-distribution mechanisms, or basis-dependent paramete...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15183v1
Position: Behavioural Assurance Cannot Verify the Safety Claims Governance Now Demands
This position paper argues that behavioural assurance, even when carefully designed, is being asked to carry safety claims it cannot verify. AI governance frameworks enacted between 2019 and early 2026 require reviewable evidence of properties such as the absence of hidden objectives, resistance to ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15164v1
Hand-in-the-Loop: Improving Dexterous VLA via Seamless Interventional Correction
Vision-Language-Action (VLA) models are prone to compounding errors in dexterous manipulation, where high-dimensional action spaces and contact-rich dynamics amplify small policy deviations over long horizons. While Interactive Imitation Learning (IIL) can refine policies through human takeover data...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15157v1
Training ML Models with Predictable Failures
Estimating how often an ML model will fail at deployment scale is central to pre-deployment safety assessment, but a feasible evaluation set is rarely large enough to observe the failures that matter. Jones et al. (2025) address this by extrapolating from the largest k failure scores in an evaluatio...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15134v1
Causal Foundation Models with Continuous Treatments
Causal inference, estimating causal effects from observational data, is a fundamental tool in many disciplines. Of particular importance across a variety of domains is the continuous treatment setting, where the variable of intervention has a continuous range. This setting is far less explored and r...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15133v1
Natural Synthesis: Outperforming Reactive Synthesis Tools with Large Reasoning Models
Reactive synthesis, the problem of automatically constructing a hardware circuit from a logical specification, is a long-standing challenge in formal verification. It is elusive for two reasons: It is algorithmically hard, and writing formal specifications by hand is notoriously difficult. In this p...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15131v1
CoCo-InEKF: State Estimation with Learned Contact Covariances in Dynamic, Contact-Rich Scenarios
Robust state estimation for highly dynamic motion of legged robots remains challenging, especially in dynamic, contact-rich scenarios. Traditional approaches often rely on binary contact states that fail to capture the nuances of partial contact or directional slippage. This paper presents CoCo-InEK...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15122v1
Logging Policy Design for Off-Policy Evaluation
Off-policy evaluation (OPE) estimates the value of a target treatment policy (e.g., a recommender system) using data collected by a different logging policy. It enables high-stakes experimentation without live deployment, yet in practice accuracy depends heavily on the logging policy used to collect...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15108v1
From Data to Action: Accelerating Refinery Optimization with AI
Nowadays refinery optimization utilizes sheer amounts of data, which can be handled with modern Linear Programming (LP) software, but the interpreting and applying the results remains challenging. Large petrochemical companies use massive models, with hundreds of thousands of input matrix elements. ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15085v1
Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models
We study a prototypical situation when a learned predictor can discover useful low-dimensional structure in data, while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial $f^*(x)=h(Ux)$, with $U\in\mathbb{R}^{r\ti...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15082v1
Separating Intrinsic Ambiguity from Estimation Uncertainty in Deep Generative Models for Linear Inverse Problems
Recently, deep generative models have been used for posterior inference in inverse problems, including high-stakes applications in medical imaging and scientific discovery, where the uncertainty of a prediction can matter as much as the prediction itself. However, posterior uncertainty is difficult ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15050v1
TopoPrimer: The Missing Topological Context in Forecasting Models
We introduce TopoPrimer, a framework that makes the global topological structure of the series population an explicit input to any forecasting model. TopoPrimer improves accuracy across diverse domains, stabilizes forecasts under seasonal demand spikes, and closes the cold-start gap. Precomputed...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15035v1
DeepTokenEEG Enhancing Mild Cognitive Impairment and Alzheimers Classification via Tokenized EEG Features
The detection of Alzheimers disease (AD) is considered crucial, as timely intervention can improve patient outcomes. Electroencephalogram (EEG)-based diagnosis has been recognized as a non-invasive, accessible, and cost-effective approach for AD detection; however, it faces challenges related to dat...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.15009v1
Explainable Detection of Depression Status Shifts from User Digital Traces
Every day, users generate digital traces (e.g., social media posts, chats, and online interactions) that are inherently timestamped and may reflect aspects of their mental state. These traces can be organized into temporal trajectories that capture how a user's mental health signals evolve, includin...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14995v1
Distance-Matrix Wasserstein Statistics for Scalable Gromov--Wasserstein Learning
Gromov--Wasserstein (GW) distances compare graphs, shapes, and point clouds through internal distances, without requiring a common coordinate system. This invariance is powerful, but discrete GW is a nonconvex quadratic optimal transport problem and is difficult to estimate at scale. We propose \emp...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14981v1
Efficient Online Conformal Selection with Limited Feedback
We address the problem of conformal selection, where an agent must select a minimal subset of options to ensure that at least one ``success'' is identified with a pre-specified target probability $φ$. While traditional online conformal prediction focuses on maintaining validity for the observed sequ...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14953v1
nASR: An End-to-End Trainable Neural Layer for Channel-Level EEG Artifact Subspace Reconstruction in Real-Time BCI
Electroencephalogram (EEG) signals are highly susceptible to artifacts, resulting in a low signal-to-noise ratio which makes extraction of meaningful neural information challenging. Artifact Subspace Reconstruction (ASR) is one of the most widely used artifact filtering techniques in EEG-based BCI a...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14941v1
Not All Symbols Are Equal: Importance-Aware Constellation Design for Semantic Communication
Semantic communication systems for goal-oriented transmission must protect task-relevant information not only through source compression but also via physical layer mapping. Existing approaches decouple constellation design and semantic encoding, exposing critical symbols to channel errors at the sa...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14940v1
Slot-MPC: Goal-Conditioned Model Predictive Control with Object-Centric Representations
Predictive world models enable agents to model scene dynamics and reason about the consequences of their actions. Inspired by human perception, object-centric world models capture scene dynamics using object-level representations, which can be used for downstream applications such as action planning...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14937v1
Road Maps as Free Geometric Priors: Weather-Invariant Drone Geo-Localization with GeoFuse
Drone-view geo-localization aims to match a query drone image, often captured under adverse weather conditions (e.g., rain, snow, fog), against a gallery of geo-tagged satellite images. Weather-induced degradations in the drone view, such as noise, reduced visibility, and partial occlusions, severel...
📄 ResearchMay 14, 2026http://arxiv.org/abs/2605.14925v1