AI News Archive: June 9, 2026 — Part 17
Sourced from 500+ daily AI sources, scored by relevance.
- Multi-UAV Active Sensing with Information Gain-based Planning and Belief Fusion
Unmanned aerial vehicles (UAVs) are increasingly used for active sensing and information gathering in spatially distributed environments. Their performance, however, is constrained by limited flight time, sensing uncertainty, and the trade-off between spatial coverage and observation accuracy. This ...
- Resilient Navigation for Autonomous Farm Robots by Leveraging Jerk-Augmented Models with IMU-Only Disturbance Rejection
Precise state estimation for navigation of autonomous agricultural robots is often compromised by sensor outages (GNSS/LiDAR/Visual) and high-frequency vibrations inherent in off-road environments. This paper proposes a robust navigation algorithm based on a jerk-augmented Extended Kalman Filter (EK...
- AgniNav: Configuration-Driven Cross-Embodiment Local Planning for Robot Navigation
Monocular local navigation is attractive for lightweight robots, but existing vision-based policies often couple perception to a specific body, camera height, and footprint, making transfer from wheeled bases to legged platforms dependent on retraining or active depth hardware. This paper introduces...
- MV-Actor: Aligning Multi-View Semantics and Spatial Awareness for Bimanual Manipulation
Robotic manipulation has been widely applied in industrial scenarios. Compared with single-arm manipulation, bimanual manipulation is equipped with multiple cameras to capture information from different viewpoints. However, existing multi-view policies encode each view independently or fuse view fea...
- Soniqo Speech
On-device speech AI for Mac, Windows, Linux & Android
- Embodiment-conditioned Generalist Control for Multirotor Aerial Robots
We present a generalist position control policy capable of controlling arbitrary multirotor configurations of a certain rotor count (e.g., hexarotors or quadrotors) with a single set of network weights. The policy is conditioned on a physics-grounded embodiment descriptor: a mass and inertia-normali...
- Gradient based Bilevel for Inverse Optimal Control, a Riemannian approach
Inverse Optimal Control (IOC) aims to recover the cost function that explains observed trajectories as solutions of an optimal control problem. Classical IOC formulations rely on bilevel optimization, which repeatedly solves a nested optimal control problem and quickly becomes computationally prohib...
- GUIDE: Goal-Initialized Directional Understanding for End-to-End Visual Navigation
Learning-based visual navigation for legged robots typically relies on continuous goal updates from hierarchical state estimation to provide a persistent directional reference. This reliance incurs additional sensory and computational overhead and deviates from fully end-to-end mobile autonomy. Furt...
- Bridging Semantics and Physical Execution: A Neuro-Symbolic Framework for Multi-Pair Robotic Assembly
Multi-pair robotic assembly in unstructured environments faces spatial interference and contact uncertainties. Existing paradigms fail to bridge cognitive decision-making and physical execution, as they either encounter state-space explosion and knowledge bottlenecks or suffer from logical hallucina...
- Hand-centric Human-to-Robot Trajectory Transfer from Video Demonstrations via Open-World Contact Localization
Learning from human video demonstrations remains challenging due to noisy hand-object interactions, unseen objects with partial observation, and cross-embodiment discrepancy. To address these challenges, we present \textit{HOWTransfer} (\emph{H}and-\emph{O}bject \emph{O}pen-\emph{W}orld Transfer), a...
- Pushing the Performance Limits in Autonomous Racing: Continuous Stability-Aware Adaptive Velocity Planning in Formula Student Driverless
In autonomous racing, especially in competitions such as Formula Student Driverless, precise planning of the target velocity of a race car is crucial for competitive lap times and stable driving behavior. Especially at high speeds, Velocity Planning (VP) is a significant challenge as it has to be pe...
- Dexterous Point Policy: Learning Point-based Dexterous Hand Policies from Human Demonstrations
Robotic foundation models pre-trained on human demonstration videos have shown promise, but a significant embodiment gap remains when the resulting policies are deployed on real robots. A common remedy is to fine-tune these models on robot-specific demonstrations. However, robot data collection can ...
- AgenticNav: Zero-Shot Vision-and-Language Navigation as a Tool-Calling Harness
Zero-shot vision-and-language navigation in continuous environments (VLN-CE) has recently become feasible with large vision-language models (VLMs). However, existing methods typically rely on learned waypoint predictors to propose navigable actions. This severely limits the model's action space and ...
- VeriSpace: Spatially Grounded Action Verification for Vision-Language-Action Models
Vision-language-action (VLA) models have shown strong promise for robotic manipulation, but their reliability at test time remains limited by one-shot action prediction, where even small action errors can cause grasp failure, collision, or incorrect task progression. A natural alternative is to equi...
- Uncovering Vulnerability of Vision-Language-Action Models under Joint-Level Physical Faults
Deploying Vision-Language-Action (VLA) models in real robotic systems requires robustness not only to semantic and perceptual variations, but also to embodiment-side faults that change how actions are physically realized. Real robots can experience joint-level changes caused by actuator degradation,...
- Act on What You See: Unlocking Safe Social Navigation in Vision-Language-Action Models
Safe social navigation requires robots to distinguish people from ordinary obstacles and to react before danger becomes imminent. We show that pretrained Vision-Language-Action (VLA) models already encode pedestrian-object distinctions and future collision signals in their internal representations, ...
- GuideWalk: Learning Unified Autonomous Navigation and Locomotion for Humanoid Robots across Versatile Terrains
Humanoid robots have achieved strong locomotion capabilities, but reliable navigation on versatile terrains remains challenging because obstacle avoidance must be coordinated with dynamically feasible motion. In this work, we present GuideWalk, a unified end-to-end framework that integrates traversa...
- Test-time Adversarial Takeover: A Real-time Hijacking Interface against Robotic Diffusion Policies
Diffusion-based action generation has become a foundational component of embodied AI, but its reliance on visual conditioning leaves deployed visuomotor policies vulnerable to adversarial manipulation. Most prior attacks focus on disruption: they perturb the observation stream to reduce task success...
- A Practical Recipe Towards Improving Sim-and-Real Correlation for VLA Evaluation
Simulation has become an essential tool for evaluating and improving vision-language-action (VLA) policies, offering scalable, reproducible, and controllable alternatives to costly real-world robot evaluation. Recent simulation benchmarks have made substantial progress on realism and diversity, yet ...
- HiMem-WAM: Hierarchical Memory-Gated World Action Models for Robotic Manipulation
World Action Models (WAMs) have emerged as a new powerful paradigm for embodied intelligence, learning action-relevant visual dynamics that significantly enhance generalization and robustness. However, existing WAMs still struggle with task-relevant memory in long-horizon robotic manipulation. To ad...
- Pawvision
See the world through your pet’s eyes
- Rethinking Embodied Navigation via Relational Inductive Bias
Object navigation requires an agent to locate a target in an unknown environment through visual observations. Existing methods typically rely on open-vocabulary detectors or vision-language models (VLMs) to answer where to search, but often overlook what not to trust - which semantic cues are unreli...
- OMG: Omni-Modal Motion Generation for Generalist Humanoid Control
Humanoid whole-body control has made significant progress in recent years, yet existing approaches remain limited to few-skill policies with heavy reward engineering, or motion trackers that are difficult to extend to new input modalities. We argue that the key to general-purpose humanoid control is...
- Baseline-Free Policy Optimization for Neural Combinatorial Optimization
Neural combinatorial optimization (NCO) trains autoregressive policies to solve routing problems. The standard training algorithm, REINFORCE with a rollout baseline, requires maintaining and periodically updating a frozen copy of the policy for variance reduction. This baseline introduces a structur...
- Hierarchical Policies from Verbal and Egocentric Human Signals for Natural Human-Robot Interaction
For natural human-robot interaction, a robot must understand human intent expressed not only through language but also through nonverbal signals such as gestures and gaze. However, current robot policies rely on language instructions as the sole interface for conveying intent, leaving nonverbal sign...
- What Matters in Orchestrating Robot Policies: A Systematic Study of Hierarchical VLA Agents
Hierarchical vision-language-action (Hi-VLA) systems have emerged as a promising paradigm for complex robot manipulation, by using high-level VLM planners to decompose tasks into language subgoals executed by low-level VLA controllers. Despite recent empirical progress, there is a lack of unified de...
- Advancing the State-of-the-Art in Empirical Privacy Auditing
Parameter-efficient fine-tuning of large language models (LLMs) can exhibit problematic memorization of individual training examples. Empirical privacy auditing (EPA) quantifies this risk by measuring realistic data leakage on membership inference (MI) or reconstruction attacks. A key challenge in E...
- Rank Collapse, Fixed Points, and the Renormalization Group Structure of MLP Residual Networks
The analogy between deep neural network forward passes and renormalization group (RG) flows has been repeatedly noted in the literature, but existing treatments remain qualitative: depth is described as a coarse-graining scale, attention is likened to a partition function, and representations are sa...
- Flexible Kernels for Protein Property Prediction
Despite its importance to applications in protein design, predicting protein properties like binding affinity and thermostability from sparse experimental data remains a significant challenge. Accordingly, we introduce a class of sequence kernels that exploit evolutionary substitution matrices as we...
- SPACR: Single-Pass Adaptive Training of Uncertainty-Aware Conformal Regressors
Conformal Prediction (CP) provides robust uncertainty guarantees for predictive models, but is typically applied post hoc, which misaligns model training with the conformal goal of producing efficient (i.e, narrow) intervals. We propose SPACR (Single-Pass Adaptive Conformal Regressor), a novel metho...
- A Mean-Field Analysis of Multi-Head Self-Attention under Cross-Entropy Training
This paper develops a mean-field theory for a simplified single-layer causal multi-head self-attention model trained by cross-entropy minimization. Each attention head is treated as a particle in parameter space, and the empirical law of the heads is used as the large-head state variable. In the inf...
- ZeroGPU
The compute efficient layer for AI inference
- Krisp Voice Translation API
Real-time speech-to-speech translation API
- agmsg
Stop copy-pasting between your AI coding agents
- Kimi Work
The AI desktop for knowledge work
- Uiverse Design
De-slop your AI generated websites
- BooBar
AI Dynamic Island for your Mac
- TravelMind
AI-powered city discovery built on taste, not reviews
- Reve 2.0
Generate and edit 4K images through layout-based control
- agentcad
A CAD design tool for coding agents (free + open source)
- Solarch
Interactive diagrams with AI, and your code always in sync
- Intrascope: BYOK + Managed AI for Teams
Access top AI models without your own API keys
- Narasi AI
Turn raw ideas into engaging shorts in 5 minutes
- Achiv - Stop Guessing What to Write
Content from real demand, built for Google + AI search
- Andika AI
AI Writing Toolkit, Custom Commands and Personalized Prompts
- NovaMind AI
The inventory brain for Amazon FBA sellers
- StackIQ: Supplement Tracker
AI Vitamin Scanner & Optimizer
- BrandGEO
See exactly how AI talks about your brand — and what to fix.
- AEO Table
Track your brand in AI answers: ChatGPT, Perplexity, more
- MDMA - GenUI for apps
Turn AI chat responses into interactive forms and workflows