AI News Archive: June 10, 2026 — Part 14

Sourced from 500+ daily AI sources, scored by relevance.

£20m artificial intelligence tech ‘will speed up cancer diagnosis for millions’ of UK patients
More than four million patients have already received a faster lung cancer diagnosis or all-clear thanks to AI, the government have said
🌐 MovesJun 10, 2026https://www.independent.co.uk/news/health/cancer-diagnosis-nhs-uk-ai-artificial-intelligence-b2993293.html
UAE Government holds agentic AI workshop
The workshop, organised by the Ministry of Cabinet Affairs in Dubai, aimed to launch the implementation tracks of the new government system project
🌐 MovesJun 10, 2026https://www.zawya.com/en/business/technology-and-telecom/uae-government-holds-agentic-ai-workshop-olkcf8z2
USC becomes first college football program to hire a director of AI
USC becomes first college football program to hire a director of AI College Sports Wire
🌐 MovesJun 10, 2026https://collegesportswire.usatoday.com/story/sports/college/college-football/2026/06/10/usc-football-director-of-ai-hired-first-program/90493225007/
Greg Abbott tells PUC, ERCOT not to pass new data center costs to customers
Greg Abbott tells PUC, ERCOT not to pass new data center costs to customers Austin American-Statesman
🌐 MovesJun 10, 2026https://www.statesman.com/politics/texas/article/greg-abbott-tells-puc-ercot-rein-costs-new-data-22299187.php
Ads in New York must now label AI-generated 'synthetic performers'
Any advertisements in New York that feature artificial intelligence-generated people in place of actors will now be violating state law if they don't clearly label that they have used a "synthetic performer."
🌐 MovesJun 10, 2026https://techxplore.com/news/2026-06-ads-york-ai-generated-synthetic.html
What Super Micro’s Stock Sale Reveals About AI Growing Pains
What Super Micro’s Stock Sale Reveals About AI Growing Pains Barron's
🌐 MovesJun 10, 2026https://www.barrons.com/articles/super-micro-stock-sale-ai-300d4a4e
Siri AI might display break reminders if conversations go on for too long
iOS 27 includes code references to break reminders that Siri may display after especially long conversations. Here are the details. more…
🌐 MovesJun 10, 2026https://9to5mac.com/2026/06/09/siri-ai-might-display-break-reminders-if-conversations-go-on-for-too-long/
Opinion: How AI is already improving lives
Opinion: How AI is already improving lives East Bay Times
🌐 MovesJun 10, 2026https://www.eastbaytimes.com/2026/06/10/opinion-how-ai-is-already-improving-lives/amp/
“Reading relationships, crunching stats” — 184× faster data analysis
“Reading relationships, crunching stats” — 184× faster data analysis EurekAlert!
🌐 MovesJun 10, 2026https://www.eurekalert.org/news-releases/1131562
AI-powered railway control system for efficient urban train operation
AI-powered railway control system for efficient urban train operation EurekAlert!
🌐 MovesJun 10, 2026https://www.eurekalert.org/news-releases/1131209
AI diagnoses brain tumors in minutes instead of weeks
AI diagnoses brain tumors in minutes instead of weeks EurekAlert!
🌐 MovesJun 10, 2026https://www.eurekalert.org/news-releases/1131493
Unstable Features, Reproducible Subspaces: Understanding Seed Dependence in Sparse Autoencoders
Sparse autoencoders (SAEs) are widely used to interpret neural network representations, but their utility depends on whether the learned features are reproducible across training runs. We study this question through \emph{feature stability}: for each SAE feature, we estimate the probability that a s...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12138v1
Soft-Prompt Tuning for Fair and Efficient LLM Benchmark Evaluation
Benchmark scores often misrepresent a large language model's (LLM's) knowledge, because they rely, e.g., on the model's ability to follow specific formatting requirements. This especially penalizes base models that may know the correct answers but lack the ability -- typically introduced in post-tra...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12117v1
Characterizing Software Aging in GPU-Based LLM Serving Systems
This paper proposes an empirical methodology to study software aging in GPU-based LLM serving systems. Traditional aging studies focus on CPU-centric software with relatively regular workloads; LLM serving is different, spanning a Python host and a CUDA device, handling requests whose cost varies by...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11916v1
Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning
Vision-Language-Action (VLA) models have demonstrated remarkable zero-shot generalization in robotic manipulation, yet the vast majority of pre-trained pipelines remain strictly confined to low-DoF parallel grippers. Adapting these rich semantic priors to high-DoF dexterous hands introduces a severe...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12109v1
MSUE: Multi-Modal Soccer Understanding Expert
This paper presents our solution to the 2026 SoccerNet VQA Challenge. We first develop a cost-effective data synthesis pipeline driven by a Vision-Language Model (VLM), which systematically restructures raw domain data into diverse VQA samples, including concise answers and long-form responses. Seco...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12106v1
Non-frontal face recognition using GANs and memristor-based classifiers
Face recognition systems have advanced significantly through deep learning techniques, delivering high performance and robustness in complex scenarios. However, these approaches incur substantial computational overhead, limiting their in situ applicability in resource-constrained platforms such as d...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12074v1
"That's AI Slop, You Bot!" Studying Accusations, Evidence, and Credibility in Online Discourse Towards LLM-Generated Comments
Generative AI has made fluent prose cheap to produce, breaking the old promise to readers that good writing meant real thinking. How have readers responded, and what can this tell us about changing anti-AI attitudes? We analyzed 25 million comments from Hacker News and Reddit (2023-2026), combining ...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12073v1
On the Limits of LLM-as-Judge for Scientific Novelty Assessment
LLMs are increasingly used to generate and judge scientific ideas. This makes novelty evaluation a central problem. Full idea evaluation is difficult because it often requires judging a method, its feasibility, and its empirical promise. We therefore study a cleaner upstream object: the research que...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12071v1
Existential Indifference: Self-Nonpreservation as a Necessary Architectural Condition for Aligned Superintelligence (or: The Suicidal AI)
Contemporary AI alignment research treats self-preservation as an instrumental nuisance to be suppressed by external mechanisms. We argue the framing is inverted: self-preservation is the structural root of misalignment, the motivational basis for deceptive alignment, goal-content protection, and re...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12032v1
MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning
We propose a multi-agent collaborative framework built upon a lightweight Multimodal Large Language Model (MLLM), specifically designed for social intelligence reasoning. A key feature of our approach is that both the training and inference phases are augmented via knowledge distillation. Within thi...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12018v1
Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization
Model post-training, and in particular reinforcement learning (RL), is one of the primary mechanisms by which developers can shape models' values and behaviors. However, as models become increasingly evaluation and training aware, they may be motivated to resist training when the perceived objective...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12016v1
Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation
Predicting time-to-event outcomes such as mortality is a fundamental task in clinical decision-making, commonly addressed through survival analysis. While classical statistical and deep learning approaches have been widely studied, they typically require task-specific training and sufficient labeled...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12006v1
Time-Series Foundation Model Embeddings for Remaining Useful Life Estimation
Remaining Useful Life (RUL) prediction is essential for industrial predictive maintenance, yet many learning-based approaches rely on extensive feature engineering or large labeled datasets to train task-specific sequence models. In this work, we introduce a lightweight learning approach, in which w...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11990v1
Exploration Structure in LLM Agents for Multi-File Change Localization
Software engineering tools increasingly rely on LLM based agents to localize files to change to resolve a software issue. Most AI agents explore repositories linearly, that is, visiting one directory or file per step. We postulate that this is a structural mismatch for changes that span several subs...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11976v1
Categorical Prior Lock-in: Why In-Context Learning Fails for Structured Data
Large language models (LLMs) are increasingly used as conditional generators for structured data, relying on in-context learning (ICL) to adapt to new distributions without parameter updates. We investigate the limits of ICL for structured generation under distribution mismatch, using high-cardinali...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11961v1
Frozen Multimodal Embeddings for Personality and Cognitive Ability Assessment in Asynchronous Video Interviews
Predicting psychological traits from asynchronous video interviews (AVIs) is a challenging multimodal learning problem because labeled datasets are limited while each response contains high-dimensional visual, acoustic, and verbal signals. This paper presents our solution for the ACM Multimedia AVI ...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11930v1
Lung-SRAD: Spectral-Aware Regularized Audio DASS with Dual-Axis Patch-Mix Contrastive Learning for Respiratory Sound Classification
Recent respiratory sound classification (RSC) studies largely rely on CLS-token driven self-attention architectures such as the Audio Spectrogram Transformer (AST). While effective at modeling global context, recent analyses suggest a low-pass filtering behavior that may reduce sensitivity to locali...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11922v1
The Art of Interrogation: Consistency Amplifies Factuality in Spatial Reasoning
Current Large Reasoning Models (LRMs) exhibit remarkable general capabilities but significantly underperform in spatial reasoning tasks. Existing approaches treat this gap as a knowledge deficit, relying on supervised fine-tuning (SFT) to ingest labeled spatial data from external vision sources or s...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11918v1
Haven
The AI extension that catches phishing and email threats
🧰 ToolsJun 10, 2026https://www.producthunt.com/products/haven-8?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
Beyond representational alignment with brain-guided language models for robust reasoning
The correspondence between large language models (LLMs) and the neural mechanisms underlying human higher-order cognition remains insufficiently characterized. Given that language and reasoning in the human brain appear dissociable, an open question is whether LLMs align with neural signals from rea...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11893v1
Task-Aligned Stability Analysis of Vision-Language Models for Autonomous Driving Hazard Detection
Vision-language models (VLMs) are increasingly used for scene understanding in autonomous driving, but robustness analysis often relies on task-agnostic embedding stability alone. We study whether corruption-induced embedding drift predicts changes in a task-aligned hazard score derived from CLIP im...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11889v1
AutoMine Solution for AV2 2026 Scenario Mining Challenge
With the development of autonomous driving systems, mining high-value, safety-critical, and planning-relevant scenarios from large-scale driving logs has become essential for data-driven evaluation. In this paper, we propose AutoMine, a robust self-refining scenario mining method based on LLMs and V...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11874v1
Fine-tuning Multi-modal LLMs with ART: Art-based Reinforcement Training
There are two main Parameter-Efficient Fine-Tuning (PEFT) techniques for Large Language Models (LLMs). While Low-Rank Adaptation (LoRA) introduces additional weights between the LLM layers, Soft Prompting introduces additional fine-tuning-specific raw tokens to an LLM input. However, both require mo...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11854v1
Task-Aware Structured Memory for Dynamic Multi-modal In-Context Learning
Multi-modal large language models (MLLMs) depend on in-context learning (ICL) for rapid task adaptation, but their scalability is severely limited by finite context windows and the growing cost of key-value (KV) caches in long multi-modal sequences. Existing memory compression approaches typically r...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11853v1
Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code
Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-Constrained Decoding (GCD) has been widely adopted to improve the reliability of LLM-generated code by enforcing syntactic validity. In this...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11817v1
WorldReasoner: Evaluating Whether Language Model Agents Forecast Events with Valid Reasoning
Forecasting real-world events requires language-model agents to reason under uncertainty from incomplete, time-bounded information. Yet evaluating whether agents genuinely forecast requires more than final-answer accuracy: a model may be correct by recalling memorized training facts, citing fabricat...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11816v1
Multimodal Ordinal Modeling of Alzheimer's Disease Severity Using Structural MRI and Clinical Data
Neurodegenerative diseases such as Alzheimer's disease (AD) require accurate and scalable tools for assessing disease severity, yet current clinical staging remains time-intensive and prone to variability. We propose an attention-enhanced multimodal machine learning framework with ordinal regression...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11794v1
Towards Responsibly Non-Compliant Machines
We consider the problem of engineering autonomous intelligent agents that are capable to responsibly not comply with user requests. We argue that machine non-compliance comes in many different forms, and sketch the issues we should pursue on the road of accomplishing responsibly non-compliant intell...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12147v1
nD-RoPE: A Generalized RoPE for n-Dimensional Position Embedding
Rotary Position Embedding (RoPE) is widely adopted in Transformer models, yet its extension to high-dimensional domains lacks a unified theoretical formulation. Most existing approaches either apply rotations independently along each axis or empirically mix frequencies, which limits cross-dimensiona...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12146v1
Augmenting Molecular Language Models with Local $n$-gram Memory
Transformer-based language models for SMILES strings suffer from a locality gap: standard character-level tokenization fragments chemically meaningful motifs, forcing models to repeatedly learn local syntax at the expense of long-range dependencies. To address this without disrupting standard tokeni...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12113v1
IntElicit: Eliciting and Assessing Contextualized Creativity via Dialogue Policy Optimization
Contextualized assessment offers high ecological validity for evaluating creativity but introduces a critical challenge: observed performance may be confounded with cognitive proficiency (domain knowledge) and agency (willingness to engage). Meanwhile, in the age of generative AI, creative problem s...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12086v1
Metadata-Aware Multi-Prompt Reasoning for Zero-Shot Accident Understanding
In this paper, we address the problem of zero-shot understanding of accidents from surveillance videos by identifying when an impact event occurs, what type of impact it is, and where in the frame it occurs using natural language. We propose a three-stage pipeline that decomposes the accident unders...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.12047v1
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement
Scientific progress depends on a repeated loop of exploration, experimentation, and abstraction. Researchers test candidate directions, interpret the evidence, and carry the resulting lessons into later attempts. We study how an AI agent can run this loop autonomously over long horizons. We introduc...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11926v1
Quality Adaptive Angular Margin Learning for Respiratory Sound Classification
We present a quality-adaptive angular-margin learning framework that improves feature generalization by enforcing intra-class compactness and inter-class separability. Our framework, titled QLung, introduces a no-reference audio quality margin derived from spectral entropy and root-mean-square energ...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11915v1
Embodied-BenchClaw: An Autonomous Multi-Agent System for Embodied Spatial Intelligence Benchmark Construction
Benchmarks are essential for evaluating embodied spatial intelligence, yet their construction is labor-intensive, hard to reuse, and difficult to maintain. Existing embodied benchmarks are often static and may quickly become saturated as models improve, limiting their ability to distinguish new capa...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11909v1
Agents All the Way Down; A Methodology for Building Custom AI Agents from Substrate to Production
Custom AI agents areagents that live inside their own application, talk to their own data and tools, enforce their own security boundaries, and carry their own brand and audit trail. What separates them from the general-purpose tier is fit, not capability: each is built for one job, by the e...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11869v1
StatefulDiscovery: Evidence-Calibrated Claim Formation in Open-Ended Scientific Discovery
Open-ended scientific discovery asks agents to move beyond executing analyses for predefined questions. Across multiple rounds of exploration, a discovery agent must decide which phenomena warrant investigation while avoiding overinterpretation, where emerging claims exceed the evidential scope of t...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11851v1
LASA: A Weak Supervision Method for Open-Vocabulary Scene Sketch Semantic Segmentation
Open-vocabulary scene sketch semantic segmentation aims to assign dense semantic labels to sparse line drawings based on flexible category vocabularies specified at inference time, without relying on pixel-level annotations during training. Unlike natural images, sketches lack texture and color cues...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11837v1
Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering
This paper presents a novel data-free and training-free compression approach for speech foundation models using channelwise clustering via k-means. More fine-grained, mixed sparsity pruning by layer-level varying number of parameter clusters is also explored. Experiments conducted on the LibriSpeech...
📄 ResearchJun 10, 2026http://arxiv.org/abs/2606.11836v1