AI News Archive: April 30, 2026 — Part 20
Sourced from 500+ daily AI sources, scored by relevance.
- Language Models Refine Mechanical Linkage Designs Through Symbolic Reflection and Modular Optimisation
Designing mechanical linkages involves combinatorial topology selection and continuous parameter fitting. We show that language models can systematically improve linkage designs through symbolic representations. Language model agents explore discrete topologies while numerical optimisers fit continu...
- LLMs as ASP Programmers: Self-Correction Enables Task-Agnostic Nonmonotonic Reasoning
Recent large language models (LLMs) have achieved impressive reasoning milestones but continue to struggle with high computational costs, logical inconsistencies, and sharp performance degradation on high-complexity problems. While neuro-symbolic methods attempt to mitigate these issues by coupling ...
- GUI Agents with Reinforcement Learning: Toward Digital Inhabitants
Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone cannot handle long-horizon credit assignment, distribution shifts, and safe exploration in irreversible e...
- MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection
Multimodal Stance Detection (MSD) is crucial for understanding public discourse, yet effectively fusing text and image, especially with conflicting signals, remains challenging. Existing methods often face difficulties with contextual grounding, cross-modal interpretation ambiguity, and single-pass ...
- Taming the Centaur(s) with LAPITHS: a framework for a theoretically grounded interpretation of AI performances
We introduce a framework called LAPITHS (Language model Analysis through Paradigm grounded Interpretations of Theses about Human likenesS) and use it to show that several major claims advanced by models such as CENTAUR, proposed as an artificial Unified Model of Cognition, are not theoretically or e...
- Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future
Peer review is a multi-stage process involving reviews, rebuttals, meta-reviews, final decisions, and subsequent manuscript revisions. Recent advances in large language models (LLMs) have motivated methods that assist or automate different stages of this pipeline. In this survey, we synthesize techn...
- Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation
Preserving affective nuance remains a challenge in Machine Translation (MT), where semantic equivalence often takes precedence over emotional fidelity. This paper evaluates the performance of three state-of-the-art Small Language Models (SLMs) -- EuroLLM, Aya Expanse, and Gemma -- in maintaining fin...
- Simulating clinical interventions with a generative multimodal model of human physiology
Understanding how human health changes over time, and why responses to interventions vary between individuals, remains a central challenge in medicine. Here we present HealthFormer, a decoder-only transformer that models the human physiological trajectory generatively, by training on data from the H...
- ElevenMusic
AI-assisted music creation with built-in discovery, royalty
- Graph World Models: Concepts, Taxonomy, and Future Directions
As one of the mainstream paradigms of artificial intelligence, world models allow agents to learn representations of the environment for efficient prediction and planning. However, classical world models based on flat tensors face several key problems, including noise sensitivity, error accumulation...
- In-Context Prompting Obsoletes Agent Orchestration for Procedural Tasks
Agent orchestration frameworks -- LangGraph, CrewAI, Google ADK, OpenAI Agents SDK, and others -- place an external orchestrator above the LLM, tracking state and injecting routing instructions at every turn. We present a controlled comparison showing that for procedural tasks, this architecture is ...
- Modeling Clinical Concern Trajectories in Language Model Agents
Large language model (LLM) agents deployed in clinical settings often exhibit abrupt, threshold-driven behavior, offering little visibility into accumulating risk prior to escalation. In real-world care, however, clinicians act on gradually rising concern rather than instantaneous triggers. We study...
- Rethinking Agentic Reinforcement Learning In Large Language Models
Reinforcement Learning (RL) has traditionally focused on training specialized agents to optimize predefined reward functions within narrowly defined environments. However, the advent of powerful Large Language Models (LLMs) and increasingly complex, open-ended tasks has catalyzed a paradigm shift to...
- NeocorRAG: Less Irrelevant Information, More Explicit Evidence, and More Effective Recall via Evidence Chains
Although precise recall is a core objective in Retrieval-Augmented Generation (RAG), a critical oversight persists in the field: improvements in retrieval performance do not consistently translate to commensurate gains in downstream reasoning. To diagnose this gap, we propose the Recall Conversion R...
- CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting
Recently, large language models (LLMs) have shown great promise in time series forecasting. However, most existing LLM-based forecasting methods still follow a static generative paradigm that directly maps historical observations to future values in a single pass. Under this paradigm, forecasting is...
- ObjectGraph: From Document Injection to Knowledge Traversal -- A Native File Format for the Agentic Era
Every document format in existence was designed for a human reader moving linearly through text. Autonomous LLM agents do not read - they retrieve. This fundamental mismatch forces agents to inject entire documents into their context window, wasting tokens on irrelevant content, compounding state ac...
- Post-Optimization Adaptive Rank Allocation for LoRA
Exponential growth in the scale of modern foundation models has led to the widespread adoption of Low-Rank Adaptation (LoRA) as a parameter-efficient fine-tuning technique. However, standard LoRA implementations disregard the varying intrinsic dimensionality of model layers and enforce a uniform ran...
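The uniform-rank constraint this abstract critiques is visible in a minimal LoRA forward pass. The sketch below uses the standard LoRA formulation (frozen weight plus a low-rank update scaled by alpha/r); all names and dimensions are illustrative, not the paper's method:

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16.0):
    """LoRA forward pass: frozen weight W plus low-rank update B @ A.

    The rank r = A.shape[0] is usually fixed uniformly across all layers;
    adaptive-rank methods instead vary r per layer after (or during) training.
    """
    r = A.shape[0]
    scale = alpha / r                       # standard LoRA scaling factor
    return x @ (W + scale * (B @ A)).T

rng = np.random.default_rng(0)
d_in, d_out, r = 8, 4, 2
W = rng.standard_normal((d_out, d_in))      # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01   # trainable down-projection
B = np.zeros((d_out, r))                    # trainable up-projection, init 0
x = rng.standard_normal((3, d_in))

# With B initialised to zero, the LoRA output equals the base model output.
assert np.allclose(lora_forward(x, W, A, B), x @ W.T)
```

Varying `r` per layer changes only the shapes of `A` and `B`, which is what makes post-hoc rank reallocation cheap relative to full fine-tuning.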
- Test Before You Deploy: Governing Updates in the LLM Supply Chain
Large Language Models (LLMs) are increasingly used as core dependencies in software systems. However, the hosted LLM services evolve continuously through provider-side updates without explicit version changes. These silent updates can introduce behavioral drift, causing regressions in functionality,...
- RuC: HDL-Agnostic Rule Completion Benchmark Generation
Large Language Models (LLMs) have rapidly improved in performance across code-related tasks, making their integration into Register Transfer Level (RTL) development increasingly attractive. Mimicking the behavior of inline code assistants, many benchmarks evaluate LLMs' capabilities in code completi...
- Instruction-Guided Poetry Generation in Arabic and Its Dialects
Poetry has long been a central art form for Arabic speakers, serving as a powerful medium of expression and cultural identity. While modern Arabic speakers continue to value poetry, existing research on Arabic poetry within Large Language Models (LLMs) has primarily focused on analysis tasks such as...
- To Build or Not to Build? Factors that Lead to Non-Development or Abandonment of AI Systems
Responsible AI research typically focuses on examining the use and impacts of deployed AI systems. Yet, there is currently limited visibility into the pre-deployment decisions to pursue building such systems in the first place. Decisions taken in the earlier stages of development shape which systems...
- Agent-Agnostic Evaluation of SQL Accuracy in Production Text-to-SQL Systems
Text-to-SQL (T2SQL) evaluation in production environments poses fundamental challenges that existing benchmarks do not address. Current evaluation methodologies, whether rule-based SQL matching or schema-dependent semantic parsers, assume access to ground-truth queries and structured database schema, ...
- Design Structure Matrix Modularization with Large Language Models
Design Structure Matrix (DSM) modularization, the task of partitioning system elements into cohesive modules, is a fundamental combinatorial challenge in engineering design. Traditional methods treat modularization as a pure graph optimization, without access to the engineering context embedded in t...
- ITS-Mina: A Harris Hawks Optimization-Based All-MLP Framework with Iterative Refinement and External Attention for Multivariate Time Series Forecasting
Multivariate time series forecasting plays a pivotal role in numerous real-world applications, including financial analysis, energy management, and traffic planning. While Transformer-based architectures have gained popularity for this task, recent studies reveal that simpler MLP-based models can ac...
- TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions
Traditional Shot Boundary Detection (SBD) inherently struggles with complex transitions by formulating the task around isolated cut points, frequently yielding corrupted video shots. We address this fundamental limitation by formalizing the Shot Transition Detection (STD) task. Rather than searching...
- The Effects of Visual Priming on Cooperative Behavior in Vision-Language Models
As Vision-Language Models (VLMs) become increasingly integrated into decision-making systems, it is essential to understand how visual inputs influence their behavior. This paper investigates the effects of visual priming on VLMs' cooperative behavior using the Iterated Prisoner's Dilemma (IPD) as a...
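The Iterated Prisoner's Dilemma used as the testbed here is a standard game; a minimal simulation with the conventional payoffs (T=5, R=3, P=1, S=0) looks as follows. The strategies are classical baselines for illustration, not the VLM agents from the paper:

```python
# Iterated Prisoner's Dilemma with standard payoffs (C = cooperate, D = defect).
PAYOFF = {("C", "C"): (3, 3), ("C", "D"): (0, 5),
          ("D", "C"): (5, 0), ("D", "D"): (1, 1)}

def play(strat_a, strat_b, rounds=10):
    """Play `rounds` turns; each strategy sees only the opponent's history."""
    hist_a, hist_b, score_a, score_b = [], [], 0, 0
    for _ in range(rounds):
        a, b = strat_a(hist_b), strat_b(hist_a)
        pa, pb = PAYOFF[(a, b)]
        score_a, score_b = score_a + pa, score_b + pb
        hist_a.append(a)
        hist_b.append(b)
    return score_a, score_b

tit_for_tat = lambda opp: "C" if not opp else opp[-1]   # copy opponent's last move
always_defect = lambda opp: "D"

print(play(tit_for_tat, tit_for_tat))    # mutual cooperation: (30, 30)
print(play(tit_for_tat, always_defect))  # exploited once, then mutual defection: (9, 14)
```

Priming studies of this kind compare cooperation rates across conditions while holding the game itself fixed.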
- Attractor FCM
In this paper an attractor FCM is created, tested, and analyzed. This FCM is neither Hebbian-based nor agentic, nor a hybrid; rather, it is a gradient-descent-based, physics-constrained, Jacobian version of an FCM. Moreover, this model has several quirks; it uses residual memory, back propagation t...
- From Unstructured Recall to Schema-Grounded Memory: Reliable AI Memory via Iterative, Schema-Aware Extraction
Persistent AI memory is often reduced to a retrieval problem: store prior interactions as text, embed them, and ask the model to recover relevant context later. This design is useful for thematic recall, but it is mismatched to the kinds of memory that agents need in production: exact facts, current...
- Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs
Recent advances in agentic AI are shifting automation from discrete tools to proactive multi-agent systems that coordinate multiple specialized capabilities behind unified interfaces. However, today's agent systems typically rely on hard-coded agent architectures with fixed roles, coordination patterns...
- AI Inference as Relocatable Electricity Demand: A Latency-Constrained Energy-Geography Framework
AI inference is becoming a persistent and geographically distributed source of electricity demand. Unlike many traditional electrical loads, inference workloads can sometimes be executed away from the user-facing service location, provided that latency, state locality, capacity, and regulatory const...
- Focus Session: Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification
The design of embedded safety-critical systems, such as those used in next-generation automotive and autonomous platforms, is increasingly challenged by escalating system complexity, hardware-software heterogeneity, and the integration of intelligent, data-driven components. Ensuring dependability in...
- How Generative AI Disrupts Search: An Empirical Study of Google Search, Gemini, and AI Overviews
Generative AI is being increasingly integrated into web search for the convenience it provides users. In this work, we aim to understand how generative AI disrupts web search by retrieving and presenting the information and sources differently from traditional search engines. We introduce a public b...
- WindowsWorld: A Process-Centric Benchmark of Autonomous GUI Agents in Professional Cross-Application Environments
While GUI agents have shown impressive capabilities in common computer-use tasks such as OSWorld, current benchmarks mainly focus on isolated and single-application tasks. This overlooks a critical real-world requirement of coordinating across multiple applications to accomplish complex profession-s...
- Measuring research data reuse in scholarly publications using generative artificial intelligence: Open Science Indicator development and preliminary results
Numerous metascience studies and other initiatives have begun to monitor the prevalence of open science practices, yet it is more important to understand the 'downstream' effects or impacts of open science. PLOS and DataSeer have developed a new LLM-based indicator to measure an important effect of ...
- Stable Behavior, Limited Variation: Persona Validity in LLM Agents for Urban Sentiment Perception
Large Language Models (LLMs) are increasingly used as proxies for human perception in urban analysis, yet it remains unclear whether persona prompting produces meaningful and reproducible behavioral diversity. We investigate whether distinct personas influence urban sentiment judgments generated by ...
- Models Recall What They Violate: Constraint Adherence in Multi-Turn LLM Ideation
When researchers iteratively refine ideas with large language models, do the models preserve fidelity to the original objective? We introduce DriftBench, a benchmark for evaluating constraint adherence in multi-turn LLM-assisted scientific ideation. Across 2,146 scored benchmark runs spanning seven ...
- Why Self-Supervised Encoders Want to Be Normal
We develop a geometric and information-theoretic framework for encoder-decoder learning built on the Information Bottleneck (IB) principle. Recasting IB as a rate-distortion problem with Kullback-Leibler (KL) divergence as distortion, we show that the optimal representation at any distortion level i...
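The normality result is the paper's own, but the objective it builds on is the textbook Information Bottleneck. In its usual Lagrangian form, and recast as rate-distortion with a KL distortion (as the abstract describes), it reads:

```latex
\min_{p(t \mid x)} \; I(X;T) \;-\; \beta\, I(T;Y),
\qquad
d(x,t) \;=\; D_{\mathrm{KL}}\!\left( p(y \mid x) \,\middle\|\, p(y \mid t) \right)
```

Here $\beta$ trades off compression of $X$ against preservation of information about $Y$, and the KL distortion measures how much predictive detail about $Y$ is lost when $x$ is summarised by $t$.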
- Linear-Core Surrogates: Smooth Loss Functions with Linear Rates for Classification and Structured Prediction
The choice of loss function in classification involves a fundamental trade-off: smooth losses (like Cross-Entropy) enable fast optimization rates but yield slow square-root consistency bounds, while piecewise-linear losses (like Hinge) offer fast linear consistency rates but suffer from non-differen...
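The trade-off described here is between classical surrogates like the two below: the logistic (cross-entropy) loss is smooth everywhere, while the hinge loss has a non-differentiable kink at margin 1. A minimal comparison over the margin $m = y\,f(x)$:

```python
import math

def logistic_loss(m):
    """Smooth surrogate: differentiable everywhere, gradient never exactly zero."""
    return math.log1p(math.exp(-m))

def hinge_loss(m):
    """Piecewise-linear surrogate: exactly zero past margin 1, kink at m = 1."""
    return max(0.0, 1.0 - m)

for m in (-1.0, 0.0, 1.0, 2.0):
    print(f"margin={m:+.1f}  logistic={logistic_loss(m):.4f}  hinge={hinge_loss(m):.4f}")
```

The hinge loss vanishes on well-classified points (`hinge_loss(2.0) == 0.0`), which underlies its fast consistency rates, while the logistic loss stays strictly positive; the "linear-core" construction in the paper aims to combine the two behaviours.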
- VibroML: an automated toolkit for high-throughput vibrational analysis and dynamic instability remediation of crystalline materials using machine-learned potentials
While machine-learned interatomic potentials (MLIPs) accelerate phonon dispersion calculations, merely identifying dynamical instabilities in computationally predicted materials is insufficient; automated pathways to resolve them are required. We introduce VibroML, an open-source Python toolkit driv...
- Can Tabular Foundation Models Guide Exploration in Robot Policy Learning?
Policy optimization in high-dimensional continuous control for robotics remains a challenging problem. Predominant methods are inherently local and often require extensive tuning and carefully chosen initial guesses for good performance, whereas more global and less initialization-sensitive search m...
- Language Ideologies in a Multilingual Society: An LLM-based Analysis of Luxembourgish News Comments
Detecting language ideologies is a valuable yet complex task for understanding how identities are constructed through discourse. In Luxembourg's multicultural and multilingual society, language ideologies reflect more than simple preferences: they carry deep cultural and social meanings, shaping ide...
- Mapping how LLMs debate societal issues when shadowing human personality traits, sociodemographics and social media behavior
Large Language Models (LLMs) can strongly shape social discourse, yet datasets investigating how LLM outputs vary across controlled social and contextual prompting remain sparse. Cognitive Digital Shadows (CDS) is a 190,000-record synthetic corpus supporting analyses of LLM-generated discourse. Each...
- HATS: An Open data set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics
Conventionally, Automatic Speech Recognition (ASR) systems are evaluated on their ability to correctly recognize each word contained in a speech signal. In this context, the word error rate (WER) metric is the reference for evaluating speech transcripts. Several studies have shown that this measure ...
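The WER metric under discussion is word-level edit distance normalised by reference length; a minimal dynamic-programming implementation (illustrative, not the HATS evaluation code):

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level Levenshtein distance / reference word count."""
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = edit distance between ref[:i] and hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i                      # deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j                      # insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = d[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            d[i][j] = min(sub, d[i - 1][j] + 1, d[i][j - 1] + 1)
    return d[len(ref)][len(hyp)] / max(len(ref), 1)

# One substitution (sat -> sit) and one deletion (the) over 6 reference words.
print(wer("the cat sat on the mat", "the cat sit on mat"))  # 2/6 ≈ 0.333
```

Because every substitution, insertion, and deletion costs the same regardless of perceptual impact, datasets like HATS pair such scores with human judgements.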
- Entropy of Ukrainian
In natural language processing, the entropy of a language is a measure of its unpredictability and complexity. The first study on this subject was conducted by Claude Shannon in 1951. By having participants predict the next character in a sentence, he was able to approximate the entropy of the Engli...
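A zeroth-order (unigram character) entropy estimate is the simplest version of the quantity involved; Shannon's 1951 guessing experiment instead bounded the much lower conditional entropy of running text. A minimal sketch:

```python
import math
from collections import Counter

def char_entropy(text: str) -> float:
    """Unigram character entropy in bits: H = -sum p * log2(p)."""
    counts = Counter(text)
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

# Two equiprobable symbols carry exactly 1 bit per character.
print(char_entropy("abab"))  # 1.0
```

Higher-order estimates condition on preceding characters and converge toward the per-character entropy rate of the language.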
- Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
Evaluating automatic speech recognition (ASR) systems is a classical but difficult and still open problem, which often boils down to focusing only on the word error rate (WER). However, this metric suffers from many limitations and does not allow an in-depth analysis of automatic transcription error...
- Debiasing Reward Models via Causally Motivated Inference-Time Intervention
Reward models (RMs) play a central role in aligning large language models (LLMs) with human preferences. However, RMs are often sensitive to spurious features such as response length. Existing inference-time approaches for mitigating these biases typically focus exclusively on response length, resul...
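The length bias at issue can be illustrated with a generic regress-and-residualise baseline: fit reward on response length and keep the residual. This is a sketch of the bias, not the paper's causal intervention; all variables here are synthetic:

```python
import numpy as np

rng = np.random.default_rng(1)
length = rng.integers(20, 400, size=200).astype(float)   # response lengths
quality = rng.standard_normal(200)                        # latent true quality
reward = 0.01 * length + quality    # reward spuriously correlated with length

# Ordinary least squares of reward on [1, length]; keep the residual.
X = np.stack([np.ones_like(length), length], axis=1)
coef, *_ = np.linalg.lstsq(X, reward, rcond=None)
debiased = reward - X @ coef

# OLS residuals are orthogonal to the regressors, so the debiased reward
# is uncorrelated with length (up to floating-point error).
corr = np.corrcoef(length, debiased)[0, 1]
print(f"|corr(length, debiased reward)| = {abs(corr):.2e}")
```

Purely length-based corrections like this one leave other spurious features untouched, which is the gap the abstract's causal framing targets.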
- Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO
We introduce Skills-Coach, a novel automated framework designed to significantly enhance the self-evolution of skills within Large Language Model (LLM)-based agents. Addressing the current fragmentation of the skill ecosystem, Skills-Coach explores the boundaries of skill capabilities, thereby facil...
- Exploring Applications of Transfer-State Large Language Models: Cognitive Profiling and Socratic AI Tutoring
Large language models (LLMs) sometimes exhibit qualitative shifts in response style under sustained self-referential dialogue conditions (Berg et al., 2025). This study refers to this phenomenon as "transfer" and explores the application potential of LLMs in a transfer state. As an applied case, the...
- From Coarse to Fine: Benchmarking and Reward Modeling for Writing-Centric Generation Tasks
Large language models have achieved remarkable progress in text generation but still struggle with generative writing tasks. In terms of evaluation, existing benchmarks evaluate writing reward models coarsely and fail to measure performance from the perspective of specific requirements. In terms of ...
- Sentiment Analysis of AI Adoption in Indonesian Higher Education Using Machine Learning and Transformer-Based Models
This study analyzes Indonesian student opinions on the adoption of artificial intelligence in higher education using two approaches: TF-IDF-based machine learning and Transformer-based deep learning. The dataset consists of 2,295 labeled samples, combining 1,154 student opinions with additional lexi...