AI News Archive: June 30, 2026 — Part 23

Sourced from 500+ daily AI sources, scored by relevance.

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding
Speculative decoding accelerates inference by using a lightweight draft model to generate candidate tokens in parallel, which are then verified by the target model, enabling lossless acceleration. Recently, diffusion-based speculative decoding further improves parallelism by generating multiple tokens...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31315v1
Andy Anxiety & Stress Tracker
Free anxiety tracker. Spot your triggers, not just your mood
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/andy-anxiety-stress-tracker?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
The Decomposition Is the Fingerprint: Per-Component Identity for Agent Skills
AI agents increasingly acquire and execute skills at runtime: bundles of prompt instructions, executable code, and tool declarations fetched from marketplaces and other agents. Governing them needs a stable notion of skill identity, yet cryptographic hashing is engineered to destroy the very similar...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31272v1
Gated Multi-Graph Fusion via Graph Attention Networks for Alzheimer's Disease Detection
Spontaneous speech is a vital non-invasive biomarker for Alzheimer's Disease (AD), yet many systems overlook non-linear structural disruptions and clinical heterogeneity in pathological language. We propose a Multi-View Gated Graph Attention Network that transcribes audio via Automatic Speech Recogn...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31186v1
SpheRoPE: Zero-Shot Optimization-Free 360 Panorama Generation with Spherical RoPE
We present a zero-shot, training-free and optimization-free framework for generating 360 panoramic images and videos by directly injecting spherical priors into pre-trained diffusion transformers. Existing methods either rely on costly fine-tuning on scarce panoramic data that limits generalization,...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32033v1
Automated Background Swapping for Robustness against Spurious Backgrounds
Classifiers based on Deep Neural Networks exhibit strong performance across domains, yet can fail catastrophically if they rely on spurious correlations, i.e., features that are predictive of the target label in the training data but are not causally linked and thus fail to generalize. For the visio...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32018v1
CoMet: Context and Multiplicity Decomposition for Multimodal Uncertainty Estimation
Uncertainty estimation has been a long-standing challenge in AI models; it amounts to "knowing what you don't know," and metacognition is notoriously difficult even for humans (cf. the Dunning-Kruger effect). Although it is still far from solved even in simpler classification systems, tackling it in...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32012v1
Reviio
AI auto-reply & translation for Google Maps reviews
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/reviio?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
RecruiterVibe AI
Enjoy the True Vibe of Automated Hiring.
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/recruitervibe-ai?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Beatrium — Beat-Synced Music Visualizer
Endless AI animations that sync to the music you're playing
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/beatrium-beat-synced-music-visualizer?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
AIdatalOOks
For all your Business Analytics
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/aidatalooks?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Genie
After-hours AI receptionist that books jobs while you sleep
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/genie-13?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
UseLoveia
The Brazilian AI that turns emotion into the right words
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/useloveia?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Hermetic AI
Hermetic AI is a software development company.
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/hermetic-ai?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
MarketiQ Ai
Your Marketing Automation Co-Pilot
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/marketiq-ai?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Write Ababil 360
Autonomous AI engine to design, write, and export KDP-ready.
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/write-ababil-360?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Wellzy
Free AI therapy for 24/7 mental support
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/wellzy?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Claude Prompt Generator
Get perfect Claude prompts for your business
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/claude-prompt-generator?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
LE LAB IMMO
AI tools for real estate professionals in France.
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/le-lab-immo?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
crisprmart.ir کریسپرمارت
AI-powered tools for CRISPR and genetics
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/crisprmart-ir?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
E-Trust
Smart AI agent that automates emails and meeting scheduling
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/e-trust?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
TestimonialDrop
Collect customer reviews in 60s — AI turns them into content
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/testimonialdrop?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
ParseToSheet
Fill your Excel template from a PDF with AI
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/parsetosheet?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
lidito
Turn long videos into viral clips with AI
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/lidito?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
ToolboxHub
The only toolbox you need — 50+ free tools, zero signup need
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/toolboxhub-3?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
CoLT: Teaching Multi-Modal Models to Think with Chain of Latent Thoughts
Chain-of-thought (CoT) reasoning has enabled multi-modal large language models (MLLMs) to tackle complex visual reasoning tasks by generating explicit intermediate reasoning steps in natural language. However, this text-based reasoning paradigm is inherently slow at inference time with even thousand...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31986v1
ERA: Entropy-Guided Visual Token Pruning with Rectified Attention for Efficient MLLMs
Multimodal Large Language Models (MLLMs) incur prohibitive inference costs due to long visual token sequences. Training-free visual token reduction provides an efficient solution. However, existing methods distort attention distributions, giving rise to a phenomenon we term Attention Logit Collapse....
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31982v1
FlexViT: A Flexible FPGA-based Accelerator for Edge Vision Transformers
Deploying Vision Transformer (ViT) models on edge platforms remains challenging due to their high computational demands and the architectural heterogeneity of modern hybrid ViT models, which incorporate both fully connected and convolutional layers. This heterogeneity leads to significant variation ...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31938v1
No Place to Hide: Benchmarking Video Hallucination with Background-Controlled Pairs
We introduce VidPair-Halluc, a new benchmark for evaluating video hallucination in large video models (LVMs) under rigorous and controlled conditions. Unlike previous benchmarks that primarily rely on text-based perturbations or adversarial questions while neglecting the consistency of visual backgr...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31933v1
DriveWeaver: Point-Conditioned Video Inpainting for Controllable Vehicle Insertion in Autonomous Driving Simulation
A pivotal step in autonomous driving simulation involves inserting foreground vehicles with predefined trajectories into simulated scenes. This process enhances scene diversity and facilitates the creation of various corner cases for testing and improving autonomous driving models. However, existing...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31918v1
RESOLVE: A Multi-Resolution and Multi-Modal Dataset for Roadside Cooperative Perception
LiDAR has increasingly been integrated into traffic cameras to expand coverage and mitigate occlusion in roadside cooperative perception. However, how unimodal and camera-LiDAR fusion architectures behave under variations in LiDAR point sparsity induced by sensor configurations and scene-dependent s...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31895v1
SENSE-VAD: Sentient and Semantic Video Anomaly Detection for Autonomous Driving
Autonomous vehicles (AVs) must navigate not only motion-based hazards but also socially complex situations whose danger is constituted by inter-agent relationships rather than movement statistics alone. A child running away from a guardian, a person being carried by another, or a pursuer chasing a p...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31875v1
Towards Voxel Spacing Consistency for Medical Image Segmentation
Volumetric medical image segmentation is essential for both preoperative diagnosis and intraoperative guidance. While recent years have witnessed rapid progress in segmentation architectures, comparatively little attention is paid to the physical voxel spacing of anatomical data. Indeed, volumetric ...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31839v1
PriorEye: Geospatial Visual Priors for End-to-End Autonomous Driving
Most end-to-end autonomous driving methods rely solely on instantaneous sensor observations, limiting them to reactive behavior without the anticipatory foresight human drivers employ through prior experience. We introduce geospatial visual priors, street-level visual context anchored to the intende...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31830v1
MuSViT: A Foundation Vision Model for Sheet Music Representation
Foundation models have transformed vision and language processing by providing rich, reusable representations that transfer across diverse tasks. Sheet music, as a visual encoding of musical language, lacks such a strong domain-specific backbone. We introduce MuSViT (Music Score Vision Transformer):...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31811v1
UniCoder: Unified Visual-to-Code Generation via Symbolic Rewards and Reference-Guided Code Optimization
Visual-to-Code generation, which transforms scientific plots, vector graphics, and webpages into executable scripts, demands a level of pixel-precise alignment that standard Multimodal Large Language Models (MLLMs) fail to achieve through Supervised Fine-Tuning (SFT) alone. While Reinforcement Learn...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31732v1
Intrinsically Stable Spiking Neural Networks: Overcoming the Performance Barrier in the Absence of Batch Normalization
The performance of deep spiking neural networks (SNNs) often relies on batch normalization (BN). However, the advanced dynamic BN variants used in state-of-the-art models introduce runtime multiplications, which weaken the hardware-efficiency motivation of SNNs. To address this tension, we identify ...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31695v1
Semantic Occupancy Prediction with Dual Range-Voxel Representation
LiDAR-based 3D semantic occupancy prediction, which aims to provide accurate and comprehensive scene representation, is crucial for autonomous driving systems. As point clouds suffer from sparsity and incompleteness, leading to insufficient semantic learning and difficult occupancy perception, exist...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31688v1
FaceMoE: Mixture of Experts for Low-Resolution Face Recognition
Low-resolution face recognition (LR-FR) remains a challenging task due to poor feature extraction and aggregation, as probe images often contain limited identity information resulting from extreme degradations such as blur, occlusion, and low contrast. Additionally, the domain gap between high-resol...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32040v1
GEAR: Guided End-to-End AutoRegression for Image Synthesis
Visual generative models are typically trained in two stages. A tokenizer is first trained for reconstruction and then frozen, after which a generator is trained on its discrete indices or continuous latents. This decoupling leaves the tokenizer unaware of what the generator finds easy to model. We ...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32039v1
Ferguson
Is your landing page actually... landing? Ferguson tells you
🧰 Tools | Jun 30, 2026 | https://www.producthunt.com/products/ferguson?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
PointSplat: Compact Gaussian Splatting via Human-Centric Prediction
Producing 3D human representations from input views on the fly is essential for immersive live streaming systems, where representation compactness is as critical as high fidelity given limited computational power and transmission bandwidth. Although recent feed-forward reconstruction methods achieve...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32036v1
Cross-Space Distillation: Teaching One-Step Students with Modern Diffusion Teachers
Modern one-step diffusion models achieve impressive quality through distribution-based timestep distillation. Yet, they rely on a critical assumption: Teacher and Student must inhabit the same latent space. This Shared-Space constraint prevents knowledge transfer from modern high-capacity Teachers (...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.32020v1
Planar-SfM: Camera Pose Estimation via Homography Graph Embeddings
Structure from Motion (SfM) systems traditionally struggle with planar scenes, where standard epipolar geometry-based methods become degenerate. Rather than viewing planar surfaces as a limitation, we propose a unified framework that leverages them as a source of geometric constraints. Our key insig...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31979v1
AnyBokeh: Physics-Guided Any-to-Any Bokeh Editing with Optical Fingerprint Transfer
Depth-of-field control is a fundamental tool in photography, yet post-capture bokeh editing from a single image remains challenging. A practical editor should handle images captured under arbitrary focus and aperture settings. Existing methods typically assume an all-in-focus input, or first recover...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31959v1
DEMUN: Fast and accurate discovery of music notation in very large collections
Much of written musical heritage is preserved and digitised at memory institutions: libraries, museums, and archives. Owing to their collection structures, sheet music tends to be concentrated in large subsets that are defined as collections of music, with corresponding metadata that makes the music...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31956v1
World Narrative Model for Highly Controllable Video Generation: A Paradigm Shift from Pixel Sampling to Physical World Orchestration
The fundamental obstacle to industrial-grade video generation is the lack of controllability: existing models treat video as a pixel distribution sampling problem, bypassing the explicit, instance-level $4D$ $(3D + T)$ physical world. Consequently, content creators cannot specify geometry, motion, c...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31946v1
InstanceControl: Controllable Complex Image Generation without Instance Labeling
Controllable image generation methods, such as ControlNet, have demonstrated a remarkable capacity to introduce visual conditions (e.g., depth maps) to guide image generation. However, these methods often struggle with complex multi-instance scenes, frequently leading to attribute confusion among ins...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31924v1
Absorption-Feature-Guided Distance-Decoupled Estimation and Band Selection for LWIR Hyperspectral Passive Ranging
Long-wave infrared (LWIR) hyperspectral observations contain distance-dependent atmospheric absorption signatures, providing a physical basis for long-range passive ranging. However, in natural scenes, these signatures are nonlinearly coupled with target temperature, material emissivity, and path ra...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31824v1
Generative Lane Topology Reasoning via Autoregressive Model with Geometry Prior
Lane topology reasoning aims to construct a lane graph from onboard sensor observations. Existing methods follow a detection and association paradigm that treats each lane instance independently, leading to geometric inconsistency at connected endpoints and incomplete graphs due to visual occlusions...
📄 Research | Jun 30, 2026 | http://arxiv.org/abs/2606.31814v1