AI News Archive: June 11, 2026 — Part 22

Sourced from 500+ daily AI sources, scored by relevance.

The Emergence of Autonomous Penetration Capabilities in Large Language Model-Powered AI Systems
Nowadays, the autonomous execution of cyberattacks capable of causing substantial real-world harm is widely regarded as one of the critical red lines that frontier AI systems must not cross. Within this broader red-line scenario, autonomous penetration represents a core enabling capability and subta...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13079v1
PolicyGuard: Towards Test-time and Step-level Adversary Defense for Reinforcement Learning Agent
While real-world applications of reinforcement learning (RL) are becoming increasingly popular, the security of RL systems deserve more attention and exploration. In particular, recent work has revealed that RL agents are vulnerable to backdoor attacks, where a victim agent behaves normally under st...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12896v1
A Privacy-Preserving Framework Using Remote Data Science for Inter-Institutional Student Retention Prediction
This study explores privacy-preserving machine learning (PPML) techniques using the PySyft platform to enable collaborative prediction of student retention between institutions. We developed a remote data science (RDS) framework with a semi-air-gapped architecture consisting of high-side and low-sid...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12845v1
Detecting Functional Memorization in Code Language Models
Large language models (LLMs) are increasingly used to generate code at scale. Meanwhile, prior work has investigated whether training data may be recoverable from model outputs, by auditing the textual overlap between training examples and model generations. Code, however, can be functionally equiva...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12764v1
Differentially Private Hierarchical Heavy Hitters
The task of finding _Hierarchical_ Heavy Hitters (HHH) was introduced by Cormode et al. [VLDB 2003] as a generalisation of the heavy hitter problem. While finding HHH in data streams has been studied extensively, the question of releasing HHH when the underlying data is private remains unexplored. I...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13563v1
Efficient, Robust, and Anti-Collusion Fingerprinting of Image Diffusion Models
Model fingerprinting, embedding user-specific identifiers (fingerprints) into generated outputs, has recently emerged as a popular solution to protect the intellectual property rights (IPR) of generative text-to-image (T2I) models and prevent unauthorized redistribution. In this work, we reveal a pr...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12977v1
ViPER: Vision-based Packing-Aware Encoder for Robust Malware Detection
Visualization-based malware detection maps raw binary bytes to grayscale images and applies learned visual classifiers, providing an evasion-resistant and disassembly-free alternative to conventional analysis pipelines. However, executable packing remains a critical failure mode: packed binaries pro...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12949v1
MAStrike: Shapley-Guided Collusive Red-Teaming on Multi-Agent Systems
Hierarchical multi-agent systems (MAS) are rapidly being deployed in high-stakes workflows across domains such as finance and software engineering. In these systems, safety and security are inherently distributed across role-specialized agents, significantly expanding the attack surface, particularl...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12918v1
Semantic Identification of IoT Devices from Behavioral Primitives
Accurate identification of IoT devices is important for security management and policy enforcement. Existing approaches typically learn device signatures from packets or flow records. These methods operate on low-level communication observations whose traffic patterns may vary across deployments, so...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12793v1
The Rise of AI-Native Software Engineering: Implications for Practice, Education, and the Future Workforce
Generative Artificial Intelligence (GenAI), Large Language Models (LLMs), and emerging Agentic AI constitute the most disruptive transformation in the history of software engineering (SE), reshaping development processes, required competencies, professional roles, and the educational outcomes that u...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12986v1
Beyond Problem Solving: UOJ-Bench for Evaluating Code Generation, Hacking, and Repair in Competitive Programming
Despite strong performance in competitive programming, the role of Large Language Models (LLMs) in supporting human learning in the same setting remains largely unexplored. In this work, we introduce UOJ-Bench, a benchmark designed to evaluate not only the problem-solving ability of LLMs, but also t...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12864v1
Mining Architectural Quality Under Agentic AI Adoption: A Causal Study of Java Repositories
AI coding tools are now used by a majority of developers, and agentic use of these tools has popularized the practice colloquially called "vibe coding". Yet causal evidence on their effect on software architecture is scarce. Prior causal work has measured code-level outcomes (complexity, static anal...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13298v1
The End of Code Review: Coding Agents Supersede Human Inspection
Code review has been the primary quality gate in software development since Fagan formalised code inspection in 1976. For five decades, having a human examine and comment on a colleague's changes before merge has been a cornerstone practice at organisations of every size. Coding agents are large lan...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13175v1
Generating Training Targets for Real-World Speech Enhancement via Close-to-Distant Microphone Projection
Training neural networks (NNs) for speech enhancement (SE) in distant speech-capturing scenarios requires paired distorted and clean reference speech signals. While such data are often generated through simulation, the mismatch between simulated and real recordings significantly limits SE accuracy. ...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13109v1
Balancing ASR and diarization in end-to-end LLMs for multi-talker speech recognition
Multi-talker speech recognition is often addressed by combining automatic speech recognition (ASR) and speaker diarization in a pipeline system. Recently, LLM-based approaches have shown promise by jointly modeling semantic and speaker information, but they typically require large-scale multi-talker...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13095v1
Endpoint Anticipation for Low-Latency Spoken Dialogue
While low-latency interaction is critical for spoken dialogue, cascaded architectures are often bottlenecked by reactive turn-completion detection. We propose Endpoint Anticipation, shifting from reactive detection to proactive forecasting of end-of-turn signals. Our speech-based model anticipates e...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13450v1
OneRetrieval: Unifying Multi-Branch E-commerce Retrieval with an Editable Generative Model
Industrial e-commerce search serves hundreds of millions of items through a multi-branch retrieval stage fused by hand-tuned merging without joint optimization. Generative retrieval (GR) raises the prospect of collapsing this stage into a single model, yet unification is gated by more than retrieval...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13533v1
CQC-RAG: Robust Retrieval-Augmented Generation via Cross-Query Consistency
Retrieval-Augmented Generation (RAG) has become a common approach for improving the factuality of Large Language Models (LLMs), yet its reliability remains highly sensitive to how external evidence is retrieved and used. Semantically equivalent queries with different syntactic forms may lead to diff...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13438v1
CFALR: Collaborative Filtering-Augmented Large Language Model for Personalized Fashion Outfit Recommendation
Personalized outfit recommendation poses a significant challenge in e-commerce and social media platforms, requiring systems that balance user preferences with aesthetic compatibility. Collaborative filtering (CF) provides a traditional solution for this, but it struggles with data-sparse scenarios ...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13001v1
How Fine-Grained Should a RAG Benchmark Be? A Hierarchical Framework for Synthetic Question Generation
Evaluating retrieval-augmented generation (RAG) systems requires benchmarks that capture diverse question characteristics, yet practitioners lack empirical guidance on which dimensions to vary and at what granularity. We present HieraRAG, a hierarchical framework for studying granularity in RAG benc...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.12789v1
CoDeR: Local Constraint-Compatible Retrieval Beyond Semantic Similarity
Information retrieval systems have long treated semantic similarity as a proxy for relevance. For constraint-sensitive queries, this proxy can fail when a document is topically close to the query but supports the opposite constraint direction, such as satisfying an attribute that should be excluded ...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13204v1
The Clustering Strikes Back: Building Cost-Effective and High-Performance ANNS at Scale with Helmsman
RedNote (a.k.a., Xiaohongshu, a global-scale social network platform) widely adopts approximate nearest neighbor search (ANNS) to power its search, recommendation, and advertising services. Due to the demanding Service Level Agreements (SLAs), we have to rely on in-memory graph-based ANNS (i.e., HNS...
📄 ResearchJun 11, 2026http://arxiv.org/abs/2606.13145v1
Multimodal PET Defines a Goldilocks Thermal Window for Focused Ultrasound Ablation and Immunotherapy Combinations
Background: Thermally ablative focused ultrasound (T-FUS) offers a noninvasive, spatially precise strategy for local tumor destruction, with the added potential to remodel tumor architecture and immune dynamics in ways that influence downstream therapeutic delivery and efficacy. Despite promising preclinical and clinical findings, the T-FUS parameters that best balance tumor debulking with preservation of local biologic, e.g. immunotherapy, penetrance remain unclear. Thermal dose, defined by the relationship between tissue heating, exposure duration, and biological effect, is likely a critical determinant of this balance. Excessive thermal dose may eliminate the vascular and stromal features needed to support immunotherapy access, whereas insufficient thermal dose may fail to achieve meaningful cytoreduction. Here, we deploy multimodal PET, contrast-enhanced ultrasound, and tissue profiling to define a Goldilocks Zone for T-FUS that balances bulk tumor destruction with immunotherapy delivery. Method: Subtotal T-FUS was applied to 4T1 tumors using three thermal dose regimens resolved by in silico modeling. Ablation was quantified by H&E and TTC staining. Post-ablative perfusion and microvascular coverage were assessed by contrast-enhanced ultrasound and immunofluorescence, respectively. Tumor oxygenation was measured by intravenous hypoxyprobe labeling. After T-FUS, mice underwent dynamic [18F]-FDG PET and immunoPET with a model tumor-targeted antibody, [89Zr]-anti-CD47, to relate cytoreduction to antibody penetrance. ImmunoPET findings were further evaluated by ex vivo biodistribution analysis. Results: In silico modeling established three T-FUS regimens that generated distinct thermal dose profiles and were deployed in vivo in a solid breast tumor model. Histopathology, perfusion imaging, and hypoxia analysis revealed dose-dependent and dose-divergent biological effects that informed a candidate Goldilocks thermal window. Low thermal dose produced measurable but limited tumor debulking, whereas high thermal dose caused disproportionate functional perfusion collapse. An intermediate thermal dose achieved robust partial ablation, broad hypoxia relief, and preservation of residual tumor physiology sufficient to support antibody access. Dynamic [18F]-FDG PET confirmed a marked reduction in metabolically active tumor burden after Goldilocks T-FUS. Serial [89Zr]-anti-CD47 immunoPET showed that bulk antibody signal was maintained after ablation, and integration of immunoPET with matched [18F]-FDG PET revealed approximately 3-fold enrichment of antibody exposure within the residual viable tumor compartment of ablated tumors. These findings demonstrate that appropriately tuned thermal ablation can debulk tumor while preserving, and potentially concentrating, immunotherapy access within the remaining targetable tumor niche. Conclusion: This study identifies thermal dose as a critical consideration for T-FUS immunotherapy combinations and establishes a PET-informed framework for balancing cytoreduction with therapeutic delivery. Rather than functioning solely as a local debulking modality, we demonstrate that T-FUS can be tuned to yield a post-ablation tumor state that remains accessible to large biologics. These findings provide timely, translationally relevant guidance for tailoring T-FUS regimens to achieve local tumor destruction while preserving an immunotherapy-permissive niche for combination treatment.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.10.731490v1?rss=1
Controlling metal-carbonate phase, form, and function through de novo protein design
Biomineralization enables living systems to construct hybrid materials by controlling the location, orientation, and polymorph of inorganic crystals with proteins and other biomolecules. Despite decades of study, the molecular principles underlying these processes remain difficult to harness in engineered materials, in part because native biomineralization proteins are often intrinsically disordered, heterogeneous, or insoluble. Here we show that de novo designed protein interfaces can be assembled into reconfigurable two-dimensional arrays which template calcite nanocrystals. By fine-tuning RFdiffusion2 on repeat protein scaffolds, we further enable the design of protein architectures which selectively form aragonite, a metastable polymorph of calcium carbonate, in nucleation conditions that otherwise result in a mixture of phases. Extending beyond inorganics found in biological systems, we show that lattice-matched protein designs template cobalt carbonate formation: a flat helical repeat protein interface promotes unconfined growth, whereas soluble D3 cage assemblies yield more homogenous cobalt carbonate nanocrystals confined to the interior of the cage. These protein-cage cobalt carbonate hybrid materials function as electrocatalysts for alkaline water splitting. Our results demonstrate the potential of deep learning-based methods to unlock the structural and functional activity of protein-mineral composites.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.10.730916v1?rss=1
Multimodal phenotyping defines variant-to-function maps for RBM20 in dilated cardiomyopathy
Multiplex assays of variant effects have linked thousands of genotypes to fitness effects, yet we lack profound understanding of how variants impact molecular phenotypes. Here, we introduce a deep mutational scanning framework that quantifies disease-determining molecular phenotypes in human cells, allowing readouts of protein localization and splicing regulatory function at scale. Applied to the dilated cardiomyopathy (DCM)-associated protein RBM20, we profiled ~4,300 amino acid substitutions across disease-linked protein domains. Complemented by structure-function investigations of RBM20 bound to its nuclear import receptor TNPO3, we discover new variant hotspots affecting protein function. Finally, we systematically probed nuclear relocalization to identify variants that may be amenable to this therapeutic strategy. Together, we create comprehensive variant-to-function maps that predict variant impact, enhance clinical interpretation, and stratify RBM20-mediated DCM into mechanistically distinct therapeutic classes.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.10.730167v1?rss=1
DLDN-Bench: A Benchmark Framework for Deep Learning de Novo Peptide Sequencing in Proteomics
De novo peptide sequencing is an essential approach for analyzing mass spectrometry data because it enables the identification of novel peptides without relying on protein sequence databases. Recent advances in deep learning have substantially improved the performance of de novo sequencing methods, but the rapid emergence of new models has led to heterogeneous evaluation practices and limited comparability. To address this, we introduce DLDN-Bench, a benchmark framework including a set of benchmark datasets derived from human muscle biopsy mass spectrometry data retrieved from PRIDE and annotated through consensus across multiple widely used database search engines. Using these datasets, we systematically benchmark recent deep learning-based de novo sequencing tools alongside traditional approaches. Performance is assessed using established metrics, including precision and coverage relative to a pseudo-ground truth defined by cross-engine agreement. To demonstrate the utility of DLDN-Bench, we benchmark four recent deep learning models and make all results publicly available. This benchmark framework provides a standardized basis for comparing state-of-the-art methods and offers an extensible resource for evaluating future tools in de novo peptide sequencing.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.10.728383v1?rss=1
DyMoTree decodes early cell state transitions and drivers from single-cell transcriptomes using a tree-structured neural network
Inferring early cell fate from single-cell RNA-sequencing data is essential for identifying cellular origins and fate plasticity in development and disease. However, existing methods often fail to exploit tree-structured lineage trajectories, limiting the accuracy and interpretability of fate mapping. Here we present DyMoTree, a computational framework that models cell fate decisions as nonlinear mappings between progenitor and terminal cell states under explicit lineage constraints. By integrating lineage graphs with a tree-structured neural architecture, DyMoTree learns lineage-resolved cell-state transition maps from single-cell transcriptomes, enabling robust inference of early fate bias and identification of fate-specific progenitor substates and driver genes. Across simulations, lineage-tracing experiments, and in vivo systems, DyMoTree outperformed existing methods in resolving early fate biases. Applications to mouse embryogenesis, lung adenocarcinoma progression, and CAR-T immunotherapy revealed regulatory programs underlying developmental and disease-associated transitions. DyMoTree provides a general framework for modeling lineage-resolved cell-state dynamics underlying development and disease progression.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.09.731114v1?rss=1
HalluDesign-NA: Extending HalluDesign for De Novo Nucleic Acid Design
AlphaFold3 has revolutionized the prediction of biomolecular structures and interactions, including atomic-level modeling of nucleic acids. However, the de novo design of structured and functional nucleic acids remains a significant challenge. Here, we extend our HalluDesign framework to nucleic acid design by integrating NA-MPNN for nucleic acid sequence optimization and design. This new framework, HalluDesign-NA, enables iterative sequence-structure co-optimization, facilitating the de novo design of nucleic acids. Computational benchmarking across ssDNA, ssRNA, and aptamer design tasks demonstrates consistent improvements in confidence scores (pLDDT, ipTM), supporting the feasibility of de novo nucleic acid design under various constraints, such as sequence length, symmetry, and protein structure context. We anticipate that HalluDesign-NA will accelerate the de novo design of functional nucleic acids for applications in biotechnology and medicine. The source code for HalluDesign-NA is available at https://github.com/MinchaoFang/HalluDesign_NA.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.10.730767v1?rss=1
Viability of engineered AAVs via protein language models
Capsid engineering has greatly improved the performance of recombinant AAV vectors used for gene therapy. One commonly used strategy is the insertion of a short, 7-mer, peptide into surface-exposed loops to modify receptor interactions and enhance cell entry. While effective in receptor retargeting and improved transduction, these insertions might destabilize the capsid protein, hinder assembly, and thus limit production. While previous attempts have used deep mutational scanning and AI to predict which insertions are viable, there is lack in understanding the structural consequences of these peptide insertions at the amino-acid level. Here we combined experiments, deep sequencing and large protein language models to gain insight on the impact of 7-mer insertions on the VR-VIII region. We first characterize the biochemical properties of viable insertions, thus identifying which residues are well tolerated, and which should instead be avoided. We then focus on the nearby context of those insertions, by studying the effect of the linkers, either for highly diverse libraries or for individual variants known for their efficiency. Next, we study the broader context, by extending our analysis to the whole capsid sequence, and identifying regions that can tolerate insertions without long-ranged structural deformations that could affect capsid functionality. We conclude with a cross-serotype comparison and a viability analysis of tens of previously engineered variants. Our work showcases how AI can uncover structure-function rules governing the success of engineered AAV capsids.
📄 ResearchJun 11, 2026https://www.biorxiv.org/content/10.64898/2026.06.11.731521v1?rss=1
PCRAgent: A Multi-Agent Framework for Transforming Noisy clinical conversations into Structured Pre-Consultation Medical Records and Reusable Clinical Data Resources
In primary care and outpatient settings, clinically important patient information is often embedded in fragmented, ambiguous, repetitive, and noisy communication between physicians and patients. This limits physicians ability to obtain a clear preconsultation overview of symptoms, history of present illness, and visit intent, while also preventing real world clinical dialogues from being reused in hospital information systems and medical artificial intelligence applications. To address this challenge, we developed PCRAgent, a centrally coordinated multi agent framework for preconsultation clinical information organization. Guided by physician inquiry logic, PCRAgent identifies, extracts, corrects, and standardizes patient-reported information from noisy consultations. Its coordinated modules including error detection, semantic editing, output control, contextual memory, and intent recognition enable robust parallel handling of spelling errors, repetitions, grammatical inconsistencies, medical ambiguities, and non-medical interference. A traceable edit list records intermediate corrections and context, allowing iterative refinement without redundant modifications. PCRAgent generates two complementary outputs. One is a PreConsultation Clinical Report for rapid physician review. The other is a Structured Clinical Conversation Dataset for hospital data construction and downstream AI applications. In evaluations using 220000 strongly perturbed consultations, PCRAgent maintained high robustness, achieving a clinical information accuracy of 4.99 out of 5 and key element completeness of 5 out of 5, outperforming GPT4o. Expert review of Chinese and English dialogues confirmed high clinical accuracy of 4.85 out of 5 and high safety of 4.79 out of 5. Multicenter validation in real-world outpatient workflows further demonstrated practical utility. These findings indicate that PCRAgent can efficiently transform noisy and unstructured consultations into physician ready reports and AI ready structured data, improving outpatient efficiency, reducing cognitive burden, ensuring information completeness, supporting precise decision-making, and enabling high-quality reuse of clinical data.
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.10.26355372v1?rss=1
Computer Vision Scoring of Figure Copy and Recall
Objective. Figure copy and recall tests are sensitive measures of visuoconstruction and visual episodic memory, but their clinical is constrained by labor-intensive manual scoring. We developed and validated an automated, element-level scoring pipeline using Vertex AI object detection for the tablet-based figure copy and recall tasks in the California Cognitive Assessment Battery (CCAB). The automated scoring pipeline duplicated the scoring procedures used by expert manual raters. Methods. A normative sample of 2,011 community-dwelling adults aged 18-90 completed figure copy and delayed recall trials at baseline, with subsamples retested at 1 day and at 6, 18, and 30 months. Participants completed the drawings with their index finger on a tablet computer with finger position digitized to analyze the speed and timing of individual drawing strokes A convolutional object-detection model trained on the Vertex AI AutoML Vision platform identified each of twelve canonical figure elements in rendered drawings. Separate element presence and location scores were computed after homographically warping drawings onto a canonical template to produce trial-level Element, Location, and Total scores. To compare Vertex and human scores, Vertex AI and expert human raters independently scored 1500 randomly selected drawings to evaluate inter-rater agreement, including a common subset of 100 drawings scored by Vertex AI and all raters. Results. Total scores were virtually indistinguishable (r = 0.966) from human-human agreement (mean r = 0.971) as were Element presence scores (mean r = 0.959 vs. r = 0.963). Location-score agreement (r = 0.951) was slightly below the human-human mean (r = 0.972) due to pixel-level analysis by Vertex AI that was impossible for human raters. The Vertex pipeline showed no preferential advantage for the single expert rater who categorized Elements during training. Automated scores showed strong demographic gradients, age effects on Recall (r = -0.32) were approximately twice those in Copy conditions (r = -0.16). A Memory Cost score (Recall - Copy) showed a monotonic age-related decline from +0.40 z in the youngest subjects to -0.54 z in the oldest. Kinetic analysis revealed that drawing speed and efficiency showed significant age-related changes. Overnight test-retest reliability was high (Recall r = 0.72) and the Recall trial showed a large overnight learning effect ({Delta} = +1.18) that continued with repeated tests up to 30 months ({Delta} = +0.75).
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.10.26355298v1?rss=1
What level of expertise is necessary to generate ACLS training test questions: pre-med students vs. artificial intelligence?
Abstract Introduction In-hospital cardiac arrest carries high mortality despite standardized ACLS training. Educators face increasing time constraints in developing assessment tools for ACLS training. Two possible solutions to this problem are using pre-medical students or using artificial intelligence to generate test questions. This study compared the quality of pre-medical student-generated ACLS test questions vs. AI-generated ACLS test questions, testing the hypothesis that AI-generated questions are non-inferior to student-generated questions. Methods Ten pre-medical students created ACLS questions following predefined criteria, while an AI model (Northwell's Artificial Intelligence Hub) generated comparable questions. A blinded ACLS-certified physician evaluated questions on the qualities of Alignment, Clarity, Cognitive Level, and Question Design using a standardized rubric (Likert scale: 1 = poor quality, 5 = excellent). Student's T-test and Chi-square analysis were used to compare the quality of questions on different rubric domains within each arm (student vs. AI) and within one domain (eg, question Clarity) between arms. The Student's T test was used when 2 comparator groups were compared (eg, Clarity of student-generated vs. AI-generated questions) within one arm. The ANOVA test was used when comparing more than 2 comparator groups (eg, Alignment vs. Clarity vs. Cognitive Level) within one arm. Statistical significance was set as a priority at p <0.05. Results Both student-generated and AI-generated questions were of high quality. AI-generated questions achieved the maximum score in the domains of Alignment, Clarity, and Question Design, but fell short of perfect scores in the domain of Cognitive Level (8 of 50 questions were less than 5). Student-generated questions achieved less-than-perfect scores in each domain. No significant difference was found in overall mean question scores between groups (students = 4.79, AI = 4.81; p = 0.9). However, AI-generated questions had significantly-greater Clarity (students = 4.8, AI = 5; p = .0461), while Alignment, Cognitive level, and Question Design showed no significant differences. Conclusion AI-generated questions demonstrated overall quality comparable to those generated by pre-medical students, supporting the potential role of AI as a scalable tool in ACLS educational assessment development. Further studies are warranted to evaluate additional AI platforms and determine optimal integration of AI in medical education assessment design.
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.11.26354470v1?rss=1
Visa is connecting with ChatGPT to let AI agents automatically make purchases
ChatGPT can now search for and buy products on your behalf using Visa.
🌐 MovesJun 11, 2026https://mashable.com/tech/visa-chatgpt-agents-automatic-purchases
TCS partners Anthropic, to roll out Claude AI access to 50K employees
TCS partners Anthropic, to roll out Claude AI access to 50K employees Techcircle
🌐 MovesJun 11, 2026https://www.techcircle.in/2026/06/11/tcs-partners-anthropic-to-roll-out-claude-ai-access-to-50k-employees/
AI Stem Splitter
Turn any song into clean isolated tracks.
🧰 ToolsJun 11, 2026https://theresanaiforthat.com/ai/ai-stem-splitter-1780980441/
Cheap AI Tools Directory
Find affordable AI APIs for your side projects
🧰 ToolsJun 11, 2026https://www.producthunt.com/products/cheap-ai-tools?utm_campaign=producthunt-api&utm_medium=api-v2&utm_source=Application%3A+the500feed+%28ID%3A+283491%29
OminiGate
OminiGate
🧰 ToolsJun 11, 2026https://www.aixploria.com/en/ominigate/
VCFConverter
VCFConverter
🧰 ToolsJun 11, 2026https://www.aixploria.com/en/vcfconverter/
Rootlenses
Rootlenses
🧰 ToolsJun 11, 2026https://www.aixploria.com/en/rootlenses/
Claude Fable 5
Claude Fable 5
🧰 ToolsJun 11, 2026https://www.aixploria.com/en/claude-fable-5/
MagicSlides.app AI PPT Maker
Create Stunning PPTs in Seconds with the AI PPT Maker
🧰 ToolsJun 11, 2026https://theresanaiforthat.com/ai/magicslides/
ModelAtlas
Chat with 360+ AI models in one place. Code.
🧰 ToolsJun 11, 2026https://theresanaiforthat.com/ai/modelatlas/
Talkniva
Real-time voice translation for business calls.
🧰 ToolsJun 11, 2026https://theresanaiforthat.com/ai/talkniva/
Computer Vision for Real-Time Anatomical Navigation in Neurosurgery: First-in-Human Clinical Evaluation and Iterative Development (IDEAL Stage 1)
Introduction: Precise anatomical navigation is fundamental to safe endoscopic pituitary surgery, a high-stakes procedure characterised by a challenging learning curve. While traditional navigation systems often rely on workflow-disrupting probes or static preoperative imaging, advancements in computer vision AI (CVAI) now enable dynamic, real-time anatomical segmentation directly from live surgical video1-3. Our group has previously conducted a series of preclinical human-computer interaction studies to refine the system's design, alongside digital and high-fidelity physical simulations demonstrating the benefit of AI assistance in improving overall performance, training, and safety4-8. Building on this foundation, the current study represents a first-in-human application of real-time CVAI assistance in the neurosurgical operating room, serving to assess feasibility and safety, and to iteratively improve the system. Method: Guided by DECIDE-AI and IDEAL frameworks, this single-centre evaluation comprises an initial proof-of-concept phase (n=6) for endoscopic transsphenoidal pituitary surgeries. The AI model utilised a DINOv3-derived vision transformer architecture, deployed via a high-performance edge computing unit to achieve low-latency, real-time inference without reliance on cloud infrastructure2. Given the high-risk nature of the procedure and the early stage of clinical AI integration, the system was initially deployed as an educational adjunct on a secondary monitor, ensuring the primary surgical feed remains uncompromised. Functionality and safety were assessed via structured questionnaire, prospective observation, and blinded retrospective review of the recordings of the endoscopic surgical video feed and wider operating room environment. Continuous multi-stakeholder feedback through validated human factors surveys drove iterative technical refinements between cases. Results: Six patients with pituitary adenomas were enrolled. The CVAI system was successfully deployed in four cases, demonstrating acceptable real-time sella segmentation accuracy. Deployment failed pre-operatively in two cases owing to a single recurring system reboot bug. Iterative refinement between cases were driven by our experience and surgical team feedback. This resulted in the integration of additional anatomical structure segmentations (e.g., carotid arteries), enhanced model accuracy via training dataset expansion, and hardware firmware upgrades. Multi-stakeholder surveys demonstrated satisfactory system feasibility, usability, and acceptability among the surgical team. Both prospective observation and retrospective video review confirmed the absence of adverse events, including no significant distraction to the primary surgeon, and there were no AI-related clinical complications. Conclusion: This first-in-human early clinical evaluation demonstrates the feasibility, safety and iterative development of real-time, CVAI-based anatomical navigation during high-stakes neurosurgery. Future work will include a larger single-centre case series (IDEAL Stage 2a) with more surgical teams to further iterate the system and explore its impact on training and workflow. As the underpinning technology improves, deployment will transition to direct intra-operative decision support and integration with other intra-operative navigational technologies.
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.11.26355205v1?rss=1
Beyond External Load: Integrative Immune Monitoring Reveals Injury-Predictive Signals in the Athlete's Internal State
Abstract (already in the PDF; paste if a box is required): Injury risk prediction in elite football relies almost exclusively on external load metrics derived from GPS tracking, overlooking the molecular state of the athlete. We monitored 26 male players from FC Barcelona's first team across the 2025 calendar year, integrating GPS-derived training load with longitudinal blood-based immune monitoring (systemic inflammation and TCR-derived immune age). Immune age acceleration and inflammation were elevated in the 14 days preceding musculoskeletal injuries. A logistic regression model combining external load, inflammation, immune age acceleration, and career injury history reached an overall AUC of 0.678 and a mean per-player AUC of 0.754 (SD 0.146), improving on a GPS-only baseline of 0.541. Applied to 2026 data, the frozen model ranked players who later sustained non-contact musculoskeletal injuries high in the risk distribution. Together, our data suggest multimodal immune monitoring in elite football to reveal the athlete's internal physiological state, which carries injury-relevant information that external load alone does not capture.
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.06.26354898v1?rss=1
Conversational Speech for Respiratory Triage in Primary Care: A Pilot Study
Background. Respiratory complaints account for a substantial share of adult ambulatory care visits, and triaging them accurately has direct consequences for antibiotic stewardship and pathogen-specific therapy. Prior work has investigated voice as a triage signal, but that literature is dominated by single-condition detection from scripted speech in crowdsourced or controlled clinical settings and has not been evaluated at primary care scale on conversational ambient audio. Methods. A dataset of 514,377 ambient-recorded primary care visits from 379,225 adult patients at a US clinic network was used, with per-visit clinically assigned ICD-10 diagnosis codes and de-identified demographic and geographic metadata. Patient audio was extracted from each doctor-patient conversation, and spectral, voice quality, and prosodic features were computed. Eleven binary classification tasks were defined, aligned with a respiratory triage cascade (e.g., acute respiratory versus acute non-respiratory illness, and lower versus upper respiratory tract infection). An acoustic model (feed-forward network) was trained independently for each task using patient-stratified five-fold cross-validation and evaluated on a held-out test set. Each task's model was also compared against six non-acoustic baselines using a single demographic, geographic, or temporal variable. The 11 trained classifiers were composed into a hierarchical cascade and illustrated as case studies on selected patients. Results. Test-set AUC across the 11 tasks ranged from 0.602 (95% CI: 0.588-0.614) to 0.745 (95% CI: 0.742-0.748), with a mean expected calibration error of 0.018. Six of eleven binaries outperformed all confounder baselines. Four binaries showed median within-stratum AUC of 0.62-0.70 when the confounder was held fixed, indicating acoustic discrimination beyond what the confounder alone explains. The exception was the pneumonia versus non-pneumonia lower respiratory tract infection binary, which failed against the patient-city confounder baseline, plausibly reflecting a clinic-level difference in ICD-10 coding. Conclusion. Conversational primary care audio carries acoustic signal that discriminates clinically meaningful respiratory contrasts. Absolute performance is moderate, but the conditions are stricter than prior work: conversational speech and differential-diagnosis contrasts among sick patients. This pilot study is a baseline for voice-based clinical AI moving beyond sick-versus-healthy detection toward differential-diagnosis panels and a proof-of-concept for hierarchical reasoning.
📄 ResearchJun 11, 2026https://www.medrxiv.org/content/10.64898/2026.06.09.26355284v1?rss=1
These Logs of ChatGPT Allegedly Driving a Suicidal Woman to Her Death Are Deeply Disturbing
"I don't want to tell you to hang on if you don't believe it can ever get better." The post These Logs of ChatGPT Allegedly Driving a Suicidal Woman to Her Death Are Deeply Disturbing appeared first on Futurism .
🌐 MovesJun 11, 2026https://futurism.com/artificial-intelligence/logs-chatgpt-suicidal-woman-death
KKR, Nvidia, Others Launch $10 Billion Data Center Company
KKR, Nvidia, Others Launch $10 Billion Data Center Company The Information
🌐 MovesJun 11, 2026https://www.theinformation.com/briefings/kkr-nvidia-others-launch-10-billion-data-center-company
Former AWS CEO Adam Selipsky to lead new $10B AI data center venture
Former Amazon Web Services CEO Adam Selipsky is returning to the world of cloud infrastructure as co-founder and CEO of Helix Digital Infrastructure, a newly-launched company backed by more than $10 billion. Read More
🌐 MovesJun 11, 2026https://www.geekwire.com/2026/former-aws-ceo-adam-selipsky-to-lead-10b-ai-data-center-venture/
VTube Me
Turn a selfie into a photorealistic VRM avatar.
🧰 ToolsJun 11, 2026https://theresanaiforthat.com/ai/vtube-me/