AI News Archive: May 13, 2026 — Part 20
Sourced from 500+ daily AI sources, scored by relevance.
- SAP customers say migration is eating their budgets—and AI is next in line
SAP has unveiled its most ambitious AI vision yet at Sapphire 2026 this week: more than 50 Joule Assistants, 200 specialized agents, and a new “Autonomous Enterprise” framework. But for many SAP customers, who are still navigating costly migrations to SAP’s S/4HANA cloud ERP, the more pressing question isn’t what AI can do. It’s whether they can get there from here. Budget constraints are now the top challenge facing SAP customers, cited by 61% of respondents in the Americas’ SAP Users’ Group (ASUG) 2026 Pulse of the SAP Customer survey, 7 percentage points higher than last year. And the culprit isn’t macroeconomic pressure, according to ASUG. It’s the migrations themselves. “From our research, it’s more so that S/4HANA projects are creating the budget pressures,” said ASUG’s research director Marissa Gilbert. “And we’ll see AI impacting them next.”
Stuck in pilot mode
Separate ASUG research on AI adoption, conducted in February 2026 in collaboration with Microsoft and Intel, reveale
- Introducing Managed Deep Agents
- New in Deep Agents v0.6
New features in Deep Agents v0.6
- Cerebras — Faster Tokens Please
OpenAI and AWS Partnerships, Tokenomics Explainer, Architecture Deep Dive, Datacenter Ramp, Technical Roadmap
- Google's new AI will make your Android phone faster and smarter
- Google's big Android update goes all-in on AI: Everything to know
Google’s Android update adds Gemini Intelligence, bringing more advanced AI features and app-based assistance to phones.
- Make cleaning more efficient with $700 off the Dreame X50 Ultra robot vacuum and mop
As of May 13, save $700 on the Dreame X50 Ultra robot vacuum and mop.
- Normally over $1,000, the Ecovacs Deebot T30S robot vacuum is just $449 right now at Amazon
As of May 13, the Ecovacs Deebot T30S robot vacuum and mop is back to its lowest-ever price of $449 at Amazon. This is 63% off its list price of $1,199.99.
- Intel, Qualcomm confirm Googlebook AI laptop partnerships, opening ARM and x86 possibilities for new OS — Google VP says devices to also ship with MediaTek chips
Intel has officially confirmed its partnership with Googlebook as Google prepares a new lineup of Gemini-powered AI laptops.
- Query-Conditioned Test-Time Self-Training for Large Language Models
Large language models (LLMs) are typically deployed with fixed parameters, and their performance is often improved by allocating more computation at inference time. While such test-time scaling can be effective, it cannot correct model misconceptions or adapt the model to the specific structure of a...
- Probing Persona-Dependent Preferences in Language Models
Large language models (LLMs) can be said to have preferences: they reliably pick certain tasks and outputs over others, and preferences shaped by post-training and system prompts appear to shape much of their behaviour. But models can also adopt different personas which have radically different pref...
- Tracing Persona Vectors Through LLM Pretraining
How large language models internally represent high-level behaviors is a core interpretability question with direct relevance to AI safety: it determines what we can detect, audit, or intervene on. Recent work has shown that traits such as evil or sycophancy correspond to linear directions in the in...
- CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution
LLM-based multi-agent systems have demonstrated strong performance across complex real-world tasks, such as software engineering, predictive modeling, and retrieval-augmented generation. Yet automating their configuration remains a structural challenge, as scores are available only at the system lev...
- What properties of reasoning supervision are associated with improved downstream model quality?
Validating training data for reasoning models typically requires expensive trial-and-error fine-tuning cycles. In this work, we investigate whether the utility of a reasoning dataset can be reliably predicted prior to training using intrinsic data metrics. We propose a suite of quantitative measures...
- The Readability Spectrum: Patterns, Issues, and Prompt Effects in LLM-Generated Code
As Large Language Models (LLMs) are transforming software development, the functional quality of generated code has become a central focus, leaving readability, one of the critical non-functional attributes, understudied. Given that LLM-generated code still needs human review before adoption, it is impo...
- IndexedAI
Your site scores X/100 for AI agents with next steps
- Polygram
AI-native design and coding app to build mobile & web apps
- claude-share
Securely share your Claude Code with your friends
- Zen Reports
See how much traffic your website gets from ChatGPT
- Mi
30-line zero-config CLI agent for bug fixes + refactoring
- Linchpin
Open-source, self-hostable runtime for managed AI agents
- SurfBuddy
AI sidebar companions for cross-app workflows and automation
- Mycelis
Serverless AI workspace with smart routing & MCP agents
- Jootle
The AI-Native Operations Platform
- Pipecat
Build AI workflows and assistants for your business
- BossHogg
Agent-first CLI for PostHog analytics and feature flags
- SearchScore AI
See if AI search can find and recommend your brand.
- Compact Latent Manifold Translation: A Parameter-Efficient Foundation Model for Cross-Modal and Cross-Frequency Physiological Signal Synthesis
The analysis of physiological time series, such as electrocardiograms (ECG) and photoplethysmograms (PPG), is persistently hindered by modality and frequency gaps stemming from heterogeneous recording devices. Existing foundation models typically rely on continuous latent spaces, which frequently su...
- It's not the Language Model, it's the Tool: Deterministic Mediation for Scientific Workflows
Language models can produce convincing scientific analyses, but repeated generations on the same data do not guarantee the same result. A researcher may regenerate an identical query and receive a different fit, a different peak position or a different analysis procedure, without an obvious way to d...
- Teacher-Guided Policy Optimization for LLM Distillation
The convergence of reinforcement learning and imitation learning has positioned Reverse KL (RKL) as a promising paradigm for on-policy LLM distillation, aiming to unify exploration with teacher supervision. However, we identify a critical limitation: when the student and teacher distributions diverg...
- Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization
LLMs have shown immense potential for code translation, yet they often struggle to ensure both syntactic correctness and semantic consistency. While preference-based learning offers a promising alignment strategy, it is hindered by unreliable semantic rewards derived from sparse test cases or restri...
- ReTool-Video: Recursive Tool-Using Video Agents with Meta-Augmented Tool Grounding
Video understanding requires active evidence seeking, motivating tool-augmented video agents for temporal reasoning, cross-modal understanding, and complex question answering. Existing video agents have improved video reasoning with retrieval, memory, frame inspection, and verifier tools, but they s...
- Hierarchical Attacks for Multi-Modal Multi-Agent Reasoning
Multi-modal multi-agent systems (MM-MAS) have gained increasing attention for their capacity to enable complex reasoning and coordination across diverse modalities. As these systems continue to expand in scale and functionality, investigating their potential vulnerabilities has become increasingly i...
- STAR: Semantic-Temporal Adaptive Representation Learning for Few-Shot Action Recognition
Few-shot action recognition (FSAR) requires models to generalize to novel action categories from only a handful of annotated samples. Despite progress with vision-language models, existing approaches still suffer from semantic-temporal misalignment, where static textual prompts fail to capture decis...
- ECG-NAT: A Self-supervised Neighborhood Attention Transformer for Multi-lead Electrocardiogram Classification
Electrocardiogram (ECG) arrhythmia classification remains challenging due to signal variability, noise, limited labeled data, and the difficulty in achieving both accuracy and efficiency in models. While self-supervised learning reduces label dependency, most methods target either global contextual ...
- Stable Attention Response for Reliable Precipitation Nowcasting
Precipitation nowcasting remains challenging due to the highly localized, rapidly evolving, and heterogeneous nature of atmospheric dynamics. Although recent methods increasingly adopt attention-based architectures in both unimodal and multimodal settings, they mainly emphasize stronger representati...
- CLIP Tricks You: Training-free Token Pruning for Efficient Pixel Grounding in Large Vision-Language Models
In large vision-language models, visual tokens typically constitute the majority of input tokens, leading to substantial computational overhead. To address this, recent studies have explored pruning redundant or less informative visual tokens for image understanding tasks. However, these methods str...
- PanoWorld: Towards Spatial Supersensing in 360° Panorama World
Multimodal large language models (MLLMs) still struggle with spatial understanding under the dominant perspective-image paradigm, which inherits the narrow field of view of human-like perception. For navigation, robotic search, and 3D scene understanding, 360-degree panoramic sensing offers a form...
- EvObj: Learning Evolving Object-centric Representations for 3D Instance Segmentation without Scene Supervision
We introduce EvObj for unsupervised 3D instance segmentation that bridges the geometric domain gap between synthetic pretraining data and real-world point clouds. Current methods suffer from structural discrepancies when transferring object priors from synthetic datasets (e.g., ShapeNet) to real sca...
- GRIP-VLM: Group-Relative Importance Pruning for Efficient Vision-Language Models
In Vision-Language Models (VLMs), processing a massive number of visual tokens incurs prohibitive computational overhead. While recent training-aware pruning methods attempt to selectively discard redundant tokens, they largely rely on continuous-gradient relaxations. However, visual token pruning i...
- AI Harness Engineering: A Runtime Substrate for Foundation-Model Software Agents
Foundation models have transformed automated code generation, yet autonomous software-engineering agents remain unreliable in realistic development settings. The dominant explanation locates this gap in model capability. We propose a different locus: software-engineering capability emerges from a mo...
- Inducing Overthink: Hierarchical Genetic Algorithm-based DoS Attack on Black-Box Large Language Reasoning Models
Large Reasoning Models (LRMs) are increasingly integrated into systems requiring reliable multi-step inference, yet this growing dependence exposes new vulnerabilities related to computational availability. In particular, LRMs exhibit a tendency to "overthink", producing excessively long and redunda...
- Ego2World: Compiling Egocentric Cooking Videos into Executable Worlds for Belief-State Planning
Embodied agents in household environments must plan under partial observation: they need to remember objects, track state changes, and recover when actions fail. Existing benchmarks only partially test this ability. Egocentric video datasets capture realistic human activities but remain passive, whi...
- Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation
Text-driven motion diffusion models are capable of generating realistic human motions, but text alone often struggles to express fine-level nuances of motion, commonly referred to as style. Recent approaches have tackled this challenge by attaching a style injection mechanism to a pretrained text-dr...
- What Limits Vision-and-Language Navigation?
Vision-and-Language Navigation (VLN) is a cornerstone of embodied intelligence. However, current agents often suffer from significant performance degradation when transitioning from simulation to real-world deployment, primarily due to perceptual instability (e.g., lighting variations and motion blu...
- VERA-MH: Validation of Ethical and Responsible AI in Mental Health
Chatbot usage has increased, including in fields for which they were never developed, notably mental health support. To that end, we introduce Validation of Ethical and Responsible AI in Mental Health (VERA-MH), a novel clinically-validated evaluation for safety of chatbots in the context of me...
- IdeaForge: A Knowledge Graph-Grounded Multi-Agent Framework for Cross-Methodology Innovation Analysis and Patent Claim Generation
Current AI-assisted innovation systems typically apply a single ideation methodology (such as TRIZ or Design Thinking) using sequential prompt-based workflows that do not preserve intermediate reasoning structure. As a result, insights generated across methodologies remain fragmented, limiting trace...
- Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
Recent progress in reasoning models has substantially advanced long-horizon mathematical and scientific problem solving, with several systems now reaching gold-medal-level performance on International Mathematical Olympiad (IMO) and International Physics Olympiad (IPhO) problems. In this paper, we i...
- Discrete Diffusion for Complex and Congested Multi-Agent Path Finding with Sparse Social Attention
Multi-Agent Path Finding (MAPF) is a coordination problem that requires computing globally consistent, collision-free trajectories from individual start positions to assigned goal positions under combinatorial planning complexity. In dense environments, suboptimal initial plans induce compound confl...
- IndicMedDialog: A Parallel Multi-Turn Medical Dialogue Dataset for Accessible Healthcare in Indic Languages
Most existing medical dialogue systems operate in a single-turn question-answering paradigm or rely on template-based datasets, limiting conversational realism and multilingual applicability. We introduce IndicMedDialog, a parallel multi-turn medical dialogue dataset spanning English and nine Indic...