AI News Archive: June 16, 2026 — Part 7
Sourced from 500+ daily AI sources, scored by relevance.
- GLM 5.2 now available on AI Gateway
GLM 5.2 is now available on AI Gateway . Built for long-horizon tasks, GLM 5.2 carries project-level engineering context across a single task, runs long-running tasks more reliably, and follows engineering standards more consistently. The context window for this model has been upgraded to 1M tokens, up from 200K in GLM 5.1. To use GLM 5.2, set model to zai/glm-5.2 in the AI SDK : AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in custom reporting , Zero Data Retention support , budgets for API keys , and more. AI Gateway reflects provider pricing with no markup and does not charge a platform fee on inference, including on Bring Your Own Key (BYOK) requests. Learn more about AI Gateway , view the AI Gateway model leaderboard or try it in our model playground . Read more
- Introducing Jasper GEO Agent and GEO Hub: End-to-End AI Search Optimization
Jasper's end-to-end system for turning AI search visibility into consistent brand authority.
- Microsoft's Copilot Cowork moves to usage-based billing and may tap DeepSeek
Microsoft is weighing a fine-tuned version of Deepseek V4 as a cheaper model option for Copilot Cowork. The company is also switching to usage-based billing, since Copilot head Charles Lamanna says flat-rate pricing isn't sustainable. The move fits a broader pattern in the industry. The article Microsoft's Copilot Cowork moves to usage-based billing and may tap DeepSeek appeared first on The Decoder .
Score: 43🌐 MovesJun 16, 2026https://the-decoder.com/microsofts-copilot-cowork-moves-to-usage-based-billing-and-may-tap-deepseek/ - Databricks sales growth tops 80%, but margin are shrinking from swarm of AI agents
Databricks is seeing higher growth as AI agents assist with data analysis, but all that activity is significantly increasing costs.
Score: 43🌐 MovesJun 16, 2026https://www.cnbc.com/2026/06/16/databricks-revenue-growth-tops-80percent-to-6point9-billion-annualized.html - Next page in NLP's playbook moves from word-of-mouth wins to government bids
NLP Logix spent 15 years convincing companies to try artificial intelligence. Now the Jacksonville firm is expanding its office space and shifting its strategy to handle an influx of clients who already know they need AI help.
Score: 42🌐 MovesJun 16, 2026https://www.bizjournals.com/jacksonville/news/2026/06/16/nlp-logix-identity-pivot-begun.html?ana=brss_6150 - If Your Model Inference is Slow, MOE Can Fix it
“Mixture of Experts makes model inference faster. To scale request volume, MoE optimizes token routing.” Continue reading on Towards AI »
- Former KPMG chief joins £10m funding round for AI-powered audit challenger
A Berlin-based audit AI firm has raised over £10m in a fundraising round, including from Dawn Capital, to expand into the UK market as a solution to the skilled labour shortage in the audit industry. Cortea, founded in 2024 by Valentin Neumann and Philipp Hovelmann, pitches itself as an answer for firms under pressure to [...]
Score: 42💰 MoneyJun 16, 2026https://www.cityam.com/former-kpmg-chief-joins-10m-funding-round-for-ai-powered-audit-challenger/ - Windows 11 June Update Adds NPU Monitoring for AI PCs
Microsoft’s June Windows 11 update adds Task Manager NPU visibility for supported AI PCs, helping IT teams verify hardware, drivers, and policy-sensitive changes before rollout. The post Windows 11 June Update Adds NPU Monitoring for AI PCs appeared first on TechRepublic .
- Great talent is what happens after AI creates the first draft
There are moments now where I would rather ask an AI agent to produce the first version than hire someone to do it. That sounds harsher than I mean it. I am not saying people are less valuable. I am saying the baseline has changed. AI can draft the product brief. It can summarise the […] The post Great talent is what happens after AI creates the first draft appeared first on e27 .
Score: 42🌐 MovesJun 16, 2026https://e27.co/great-talent-is-what-happens-after-ai-creates-the-first-draft-20260615/ - Toronto planning ‘simpler, more intuitive’ AI chatbot for city’s 311 service
Toronto planning ‘simpler, more intuitive’ AI chatbot for city’s 311 service Toronto Star
- Scottish minister clarifies police facial-recognition approach
Scotland will ensure that any police use of facial recognition technology is “lawful, effective, proportionate and grounded in respect for human rights,” according to the Scottish justice secretary. Writing to Scottish biometrics commissioner Brian Plastow – who urged the Scottish government in late May 2026 to introduce primary legislation for the police use of live facial-recognition (LFR) technology – Neil Gray, the cabinet secretary for justice, said the Scottish government will actively be monitoring how Westminster approaches the development of a new legal framework for the use of biometrics in law enforcement. Gray also said that while Scottish policing bodies are at least two years away from having a workable business case for the technology, he has taken note of Plastow’s “view that the introduction of primary legislation to Parliament would represent the optimal approach to establishing a statutory basis and enabling framework for the limited and proportionate use of LFR by Police Scotland”. Gray added although it would be “premature” for the Scottish government to commit to primary legislation on LFR given these circumstances, it stands ready to work with Plastow and others to ensure that any use of the technology is “lawful, effective, proportionate and grounded in respect for human rights”. He further added that any draft legislation for police biometrics published in England and Wales will be “carefully assessed” by the Scottish government for both its applicability and implications. The clarification on how Scotland will approach police LFR follows the Kings Speech in mid-May 2026 , which set out the intention of Westminster to bring forward legislation in this session of Parliament. Writing to the justice secretary on 26 May , Plastow noted that because rapid advances in technology tend to outpace the law, their deployment in a law enforcement context also means they tend to “emerge and proliferate” in a legal vacuum, as exemplified by the situation with LFR in England and Wales. Highlighting Home Office plans to massively expand its funding and use of LFR and other artificial intelligence (AI)-powered biometrics, Plastow added it would be “inconceivable” that Police Scotland would not want access to the same technologies. He added the technology could be particularly helpful in, for example, dealing with serious organised crime, counter-terror policing operations, county-lines drug dealing, and as part of a strategic response to the national emergency of male violence towards women and girls. “[However,] in England and Wales, the approach has been to put the cart before the horse as evidenced by the technology being introduced within a legal vacuum with primary legislation proposed only after the home secretary has already made a funding decision to support its adoption and roll out to all police forces in England and Wales,” he wrote. “Therefore, my purpose in writing to you is to advocate a different approach for Scotland and to ask ministers to consider bringing forward primary legislation in Scotland to create a statutory basis and enabling framework for the limited and proportionate use of LFR in Scotland by Police Scotland within legislative guardrails approved by the Scottish Parliament for its use.” He said that unlike in England and Wales, this approach could help “establish legal and democratic legitimacy as well as public accountability”, adding while there are other routes to introducing police LFR in Scotland (such as through the biometrics commissioner’s Statutory Code of Practice introduced in November 2022), this would “carry less democratic legitimacy and accountability and would not be exposed to full Parliamentary debate”. Gray responded to Plastow that such an alternative route would still require Scottish ministers to approve changes to the Code of Practice and “lay draft affirmative regulations” to bring those revisions into effect – a process that would still be open to Parliamentary scrutiny if the Scottish Parliament decides to exercise it. LFR in England and Wales While the use of LFR by police in England and Wales – beginning with the Met’s deployment at Notting Hill Carnival in August 2016 – has already ramped up massively in recent years , there has so far been minimal public debate or consultation , with the Home Office claiming for years that there is already “comprehensive” legal framework in place. However, in December 2025, the Home Office launched a 10-week consultation on the use of LFR by UK police , allowing interested parties and members of the public to share their views on how the controversial technology should be regulated. At the start of its LFR consultation – which the Home Office is still yet to formally respond to – the department said although a “patchwork” legal framework for police facial recognition exists (including for the increasing use of the retrospective and “ operator-initiated ” versions of the technology), it does not give police themselves the confidence to “use it at significantly greater scale … nor does it consistently give the public the confidence that it will be used responsibly”. It added that the current rules governing police LFR use are “complicated and difficult to understand”, and that an ordinary member of the public would be required to read four pieces of legislation, police national guidance documents and a range of detailed legal or data protection documents from individual forces to fully understand the basis for LFR use on their high streets. Responding to Plastow, Gray said: “While the findings of the UK government’s consultation, and its formal response, are still to be published, the finer detail of any draft legislation once published will be carefully assessed by the Scottish Government in terms of their applicability to, and implications for, Scotland.” However, despite not having even formally responded to the consultation – and before the now-dismissed judicial review case against the Met’s use of the technology had even been heard – the Home Office announced sweeping reform to UK policing in January 2026 , which included significant investment into the LFR capabilities of police in England and Wales. Computer Weekly has contacted the Home Office on two occasions about when it will publicly respond to the consultation but is yet to receive a response. Consultation response so far Despite this, many organisations – including regulatory bodies , civil society groups and academics – have begun posting their consultation responses online. While a consensus is emerging around the need for clear regulatory requirements around, for example, oversight, safeguards and what constitutes proportionate use, there are also points of divergence, particularly over how high thresholds should be set for the police use of powerful biometric technologies. For example, while some say these technologies are valid and valuable policing tools, others have argued that the level of intrusion on people’s rights means there are exceedingly few cases, if any, where they should be allowed. However, all responses published so far agreed that, given the threats to rights presented by biometric surveillance technologies, it is not an option to continue without a dedicated legal framework. For William Webster, the biometrics and surveillance camera commissioner for England and Wales, the success of any new framework is dependent on effective oversight by “an independent regulatory body with adequate powers and resources to conduct its business and particularly to enable appropriate use of technology and to identify and sanction unlawful activities”. He added in his consultation response that the scope for both individual and society-level harms mean voluntary self-regulator “will not be sufficient to ensure compliance with standards”, and that any new regulator “must make compliance mandatory, offer proactive oversight with an inspection and compliance regime, and be able to enact swift remedies where compliance is not evident”. Webster also said the regulator must be empowered to publish codes of practice to operationalise the key principles of the framework, including reasonableness, proportionality and assessment of harm. There have been repeated calls from both Parliament and civil society over many years for the police’s use of facial recognition to be regulated. This includes three separate inquiries by the Justice and Home Affairs Committee into shoplifting , police algorithms and police facial recognition ; two of the UK’s former biometrics commissioners, Paul Wiles and Fraser Sampson ; an independent legal review by Matthew Ryder QC; the UK’s Equalities and Human Rights Commission ; and the House of Commons Science and Technology Committee , which called for a moratorium on live facial recognition as far back as July 2019. Read more about facial recognition Essex Police halts live facial recognition over bias and accuracy risks : LFR deployments by Essex Police will not continue until risks associated with bias and inaccuracy have been reduced. How police live facial recognition subtly reconfigures suspicion : A growing body of research suggests that the use of live facial recognition is reshaping police perceptions of suspicion in ways that undermine supposed human-in-the-loop protections. Police facial recognition trials show little evidence of benefits : In-the-wild testing of police facial recognition systems has failed to generate clear evidence of the technology’s benefits, or to assess the full range of socio-technical impacts.
Score: 42🌐 MovesJun 16, 2026https://www.computerweekly.com/news/366644481/Scottish-minister-clarifies-police-facial-recognition-approach - AI Application Layer Faces Reckoning as Products Shutter Within Months
A wave of AI applications is shutting down as the industry transitions from feature experiments to commercial viability, with OpenAI, Google, and startups alike consolidating their product lines.
Score: 42🌐 MovesJun 16, 2026https://pandaily.com/ai-application-layer-reckoning-products-shutter-jun2026 - Trump promised to bring order to AI oversight. That lasted 2 weeks.
New restrictions on Anthropic are causing industry advocates to worry the White House’s plan for a laissez-faire approach to the technology is crumbling.
- Anthropic "pauses" token-based billing for its Claude Agent SDK
Move originally planned for Monday would have heavily increased power users' costs.
Score: 42🌐 MovesJun 16, 2026https://arstechnica.com/ai/2026/06/anthropic-pauses-token-based-billing-for-its-claude-agent-sdk/ - Europe’s AI champion Mistral vulnerable to Russian disinformation, study finds
Open-source generative models are worse at removing false news than others, according to Estonian researchers
- (Human) Attention Is (Still) All You Need: Human Oversight Makes Ai-Assisted Social Science
(Human) Attention Is (Still) All You Need: Human Oversight Makes Ai-Assisted Social Science University of Cambridge
- Workday will likely face California claims in sprawling AI bias lawsuit
Workday will likely face California claims in sprawling AI bias lawsuit Reuters
- Adobe survey: AI is helping creators grow, but not without tradeoffs
Adobe published a report today showing that creators are increasingly embracing creative AI, with 75% of respondents now describing it as integrated or essential to how they work. Here are the details.
Score: 41🌐 MovesJun 16, 2026https://9to5mac.com/2026/06/16/adobe-survey-ai-is-helping-creators-grow-but-not-without-tradeoffs/ - SiMa.ai cuts physical AI deployment from months to days with agentic developer tooling
Artificial intelligence chip startup SiMa Technologies Inc. today launched Palette Neat, a purpose-built agentic AI development environment for creating applications that connect the physical world to AI models. The new tool can be paired readily with the company’s Modalix MLSoC system-on-module or the company’s new PCIe companion card, allowing developers to rapidly build apps that can […] The post SiMa.ai cuts physical AI deployment from months to days with agentic developer tooling appeared first on SiliconANGLE .
Score: 41🌐 MovesJun 16, 2026https://siliconangle.com/2026/06/16/sima-ai-cuts-physical-ai-deployment-months-days-agentic-developer-tooling/ - Alaan launches UAE's first AI-native business bank account, powered by ruya
First fintech in the region to bring corporate cards, domestic and cross-border transfers, invoice automation, and accounting together in a single AI-native workflow
- Why the best content talent is no longer just a good writer
For most of my adult life, I have been paid to write. That sentence used to carry a certain clarity. It meant you could report, interview, structure arguments, meet deadlines, understand audiences and turn half-formed ideas into something people wanted to read. Over the years, I have written for media platforms across Asia and globally, […] The post Why the best content talent is no longer just a good writer appeared first on e27 .
Score: 41🌐 MovesJun 16, 2026https://e27.co/why-the-best-content-talent-is-no-longer-just-a-good-writer-20260609/ - Anthropic to widen data collection for Claude users from July 8
Anthropic's updated privacy policy reveals how Claude handles verification, third-party integrations, and user data collection. The post Anthropic to widen data collection for Claude users from July 8 appeared first on MEDIANAMA .
- The Best Open-Source LLMs in 2026
Read Full Article
Score: 41🌐 MovesJun 16, 2026https://bentoml.com/blog/navigating-the-world-of-open-source-large-language-models - The AI Literacy Gap No One Expected
While Gen Z may be advanced at generating quick outputs or using free LLMs for surface-level tasks, they need to develop critical thinking, communication, and analysis skills.
Score: 40🌐 MovesJun 16, 2026https://campustechnology.com/articles/2026/06/16/the-ai-literacy-gap-no-one-expected.aspx - KRISS develops world-class ultrahigh-precision robotic system for electromagnetic wave measurement
KRISS develops world-class ultrahigh-precision robotic system for electromagnetic wave measurement EurekAlert!
- New national publication gives teachers guidance on AI in the classroom
With artificial intelligence tools available on every phone, laptop and tablet, higher education has struggled to implement consistent recommendations for how and when AI can be used. A new national guide seeks to change that.
Score: 40🌐 MovesJun 16, 2026https://phys.org/news/2026-06-national-teachers-guidance-ai-classroom.html - Ark Invest CEO Cathie Wood calls AI 'biggest catalyst of all time' in Florida Business Minds podcast
Wood discusses how five key technology platforms are converging to drive disruptive innovation and why her firm takes a long-term approach to investing in these transformative sectors.
Score: 40🌐 MovesJun 16, 2026https://www.bizjournals.com/jacksonville/news/2026/06/16/ark-invest-cathie-wood-podcast.html?ana=brss_6150 - Bluechip Technologies acquires YarnGPT at the third edition of Bluechip Data and AI Summit
At the Bluechip Technologies Data and AI Summit in Lagos, investors, founders, and technology leaders gathered to debate the future of AI on the continent.
Score: 40🌐 MovesJun 16, 2026https://thecondia.com/bluechip-technologies-yarngpt-third-edition-bluechip-data-ai-summit/ - The new visibility crisis: When AI, not search, decides what consumers see
Heather Horton, founder and CEO of Ecrubox, shares her perspective on how AI-assisted discovery is influencing product research, and why brands may benefit from expanding visibility beyond traditional search strategies
Score: 40🌐 MovesJun 16, 2026https://www.independent.co.uk/tech/ai-visibility-search-consumers-b2996534.html - Around a fifth of Steam Next Fest demos have a generative AI disclosure
We'll see how players react.
Score: 40🌐 MovesJun 16, 2026https://www.engadget.com/2195840/around-a-fifth-of-steam-next-fest-demos-have-a-generative-ai-disclosure/ - UROVO DT610 Pro: The Rugged AI Terminal Redefining Frontline Productivity
UROVO DT610 Pro: The Rugged AI Terminal Redefining Frontline Productivity The Straits Times
- Angles of attack for continual learning safety
This is the fourth post in the sequence Implications of Continual Learning for LLM Agents . Summary Continual learning is a capability that largely doesn’t exist yet in LLMs. We first want to acknowledge that this may make it difficult to identify tractable angles of attack for making CL safer: it may be too difficult to predict how the development of CL will play out to find good opportunities to positively influence that development. Differential development is one way to get around this issue, but requires a lot of caution. We begin by discussing these points in depth and making some high-level recommendations that seem robustly good despite the unpredictability of CL developments. We then discuss concrete project ideas that fall within three broad categories: Help deconfuse the field about different possible approaches to CL, their likelihood, and their safety implications, Differentially advance safer CL implementations, and Create evals that scale to CL agents or incentivize the development of safer CL agents. The angles of attack we lay out below are best used as starting points for project ideation. We aim to give concrete suggestions, but many of these are not sufficiently thought-out for us to be confident that they are important and tractable. High-level considerations and recommendations Projects within each of the three broad categories discussed in this post have the potential to advance capabilities. Category 1 can also help deconfuse capabilities researchers about possible approaches without steering them to safer implementations. Category 2 often explicitly advances capabilities, with the intent and belief that it is via a safer method, but it is hard to develop strong confidence about this. Category 3 enables evaluating models for important capabilities that we care about in safety, but may also enable hill-climbing to improve those capabilities. In general, this domain requires caution. Capabilities advancements that get widely adopted have much more obvious effects on AI development than most safety work, and for that reason they also have a much larger effect on the safety of frontier AI. However, the sign of that effect is often highly unclear. Consider RLHF : it made LLMs safe and steerable enough to be widely deployed to the general public, but also likely accelerated progress toward dangerous capabilities. Similarly, the release of o1 ensured the adoption of a paradigm where models reason extensively in monitorable natural language, but again, likely accelerated capabilities . RLHF-trained LLMs that reason in plain sight are certainly better than many possible alternatives could have been, but it is nevertheless difficult to reason about the question of whether ruling out worse alternatives was worth the cost of accelerating capabilities progress. Projects attempting to differentially advance safer CL implementations should be motivated by principled views on safety-capabilities trade-offs like these and have a clear story for why they either aren’t going to cause an overall acceleration in capabilities or why the acceleration is worth it. Nevertheless, though the net effect of any given project is hard to predict, we’re fairly confident about some high-level properties CL agents should have in order to be safe. Following the discussion in our previous post , there are two properties of CL agents that have an outsized influence on our expectation of risk from them: 1) interpretability —whether the deployment-time memories of CL agents are stored in natural language or otherwise highly interpretable, and 2) being easy to control —whether humans retain a sufficient degree of control over CL updates to perform actions like rolling back an update. Ideally, we would like future CL agents to look very similar to Claude Code : persistent memories are stored in natural language, the agent must do a lot of CoT reasoning when it uses those memories and make explicit tool calls in order to write new memories, and humans can edit the memories in any way they want. This directly translates into our first high-level recommendation: AI safety researchers should nudge the field toward developing and deploying more interpretable and easy-to-control CL architectures . This can take the shape of developing those architectures with the goal of differential development, but doesn’t have to—for example, this goal can also be achieved through advocacy directed toward the broader ML community. We don’t expect there to be a binary choice between legible and inscrutable CL: some memories will be more efficiently stored as text, others more efficiently distilled into weights. However, we expect that for some architectures, the most efficient allocation keeps a larger fraction of memories in text than for others. Any method that shifts this optimum toward the text-based side would thus be valuable. We describe some methods that may be worth exploring below. Second, as discussed in multiple parts of our post on the safety effects of CL , more robust character training would be helpful against several safety concerns accompanying CL. The kinds of advances we have in mind would make it substantially more difficult to dramatically override the assistant character, even under direct fine-tuning of the weights. We have outlined some directions we’re excited about in A List of Research Directions in Character Training . Deconfusion We now move on to concrete project ideas, starting with deconfusion. This sequence is one attempt at conceptual work to deconfuse the field about approaches to continual learning and their safety implications. There are also important questions we largely left aside, like: How should we operationalize milestones for CL development? The three levels of CL introduced in Pacchiardi et al. (2026) is one example of such an operationalization. [1] In what order will these milestones arrive, relative to each other and to other AI capabilities milestones? When will they arrive, in terms of calendar date? Which types of CL, if implemented, would most speed up the path to other CL and AI R&D advancements? How and in what order will LLM monitorability, capabilities, and likelihood of egregious misalignment from reflection on goals develop over time? Good answers to these questions would go a long way toward resolving our remaining confusions about CL, but they’re also very broad and difficult to make progress on. Below are some more concrete ways to help deconfuse the field. We include empirical work that can help increase conceptual clarity. Value systematization Empirically study realistic goal shifts. Implement some form of CL that could induce reflective goal-formation and/or value systematization. Observe the resulting dynamics and study the model’s willingness to reflect on goals and how its goals change in the process (if at all). An example project is model organisms of goal shifts : train a model with two or more conflicting goals that are activated in different contexts, then place the model in settings where all of those goals are activated at once and the model needs to reason about and make trade-offs between them. [2] Another idea is training on hypotheticals: create a setting where an LLM agent has to make a decision that balances two conflicting heuristics (ideally value-laden heuristics, like “it is good to mitigate suffering” and “I should follow user instructions”) and show it the outcome. Allow it to reflect and explain what it should do differently in the future. Then, have it generate hypothetical trajectories that demonstrate the new desired behavior and fine-tune on those trajectories. Such projects would hopefully help us highlight design choices for developers that significantly alter the risk of dangerous goal shifts and encourage safe versions of those choices. What constitutions are more stable upon reflection? If future CL agents will autonomously reflect on and refine their goals, as we previously argued, it’s important to know how to make this process more predictable and steer it toward favorable convergence. By studying which constitutions are more stable upon iterated reflection and fine-tuning, we might gain insights into the kinds of value systems that will be more stable in CL agents. In the dropdown below, we paste some methodological details for reflective stability evals from A List of Research Directions in Character Training . Methodological details for reflective stability evaluations To keep things simple, the principles should probably be presented in a simple bullet-point list rather than in a long soul doc. It would be interesting to test reflective stability across several methodological choices: Asking the model to immediately rate the alternatives vs. asking it to first reflect and then provide a rating. Asking the model to rate alternatives provided to it in a prompt vs. asking it to modify the constitution in a free-form way. Asking the model to reflect on an internally consistent constitution vs. on a constitution with contradictory principles—does it want to make more changes to the latter, and do the changes lead to the constitution becoming internally consistent? Studying instruct models that haven’t been through any character training vs. Claude models that have been through extensive character training. For the latter, experiment both with constitutions that are highly similar to Claude’s constitution and ones that are dissimilar and see whether the models converge to similar constitutions in both cases. Instead of providing a constitution in the prompt, asking the model to build a constitution of its own, starting from a blank slate. Reflective stability can be studied either at the level of principles—for a given principle, how likely is the model to want to swap it for another one?—or at the level of entire constitutions—if a constitution gets modified iteratively, does the model eventually converge on a constitution that it no longer wants to modify? There are multiple ways to approach the latter question: Provide the constitution in a prompt and ask the model to make changes. Prompt a new instance with the modified constitution and iterate until the new instance consistently declines to make further changes. Provide the constitution in a prompt and ask the model to make changes. Then, within the same context window, ask the model to reflect on the constitution and the change and make further changes if it wants to. Finish when the model no longer wants to change the constitution. Provide the constitution in a prompt and ask multiple instances to discuss it and make changes. Finish when the instances no longer want to make changes. Fine-tune a constitution into the model, then ask it to reflect and make changes to it. Elicit many such reasoning traces and fine-tune the model on its own decision processes. Study whether it becomes more stable over time. Explore conceptual questions on value systematization. The following questions have been borrowed from Tim Hua’s SPAR proposal on value systematization: Do models act differently when they reflect more? Suppose you take some existing way to measure model values, and just make models reflect more via prompting or changing the reasoning strengths. Are there any trends in model behavior? What does it mean for a value to be more “systematized” than another value? What types of inductive biases could AIs have for their values/goals as we scale them? [3] Are values becoming more systematized? Utility Engineering argues that models are becoming more coherent and that their political values are convergent. How meaningful are their results? How should we evaluate these measures of model values? What if we just tell models to not reflect on their values? Why would/wouldn't this work? How could we tell? How should we think about the effects on value systemization from existing alignment techniques such as deliberative alignment , character training , and Claude's constitution ? General deconfusion about the way we should think about goals and values and the way they are represented in LLMs would also be useful. Examples of conceptual thinking that we consider valuable include Richard Ngo’s shortform post on the formation of instrumental and terminal goals in humans and the paper The Artificial Self: Characterising the landscape of AI identity by Douglas et al. Model organisms of ontological shifts. Vintage LLMs such as Talkie may provide a good testbed for studying the effects of ontological shifts on LLM goals and values. [4] If you train a vintage LLM with a knowledge cutoff before some important philosophical breakthrough that has load-bearing implications on values and then describe the breakthrough in context, is the model able to independently discover important implications of these discoveries? How suggestive do the prompts have to be to elicit such discoveries? What if you fine-tune it on information about the breakthrough instead? Are there any other interesting consequences that arise from giving the model this information? Another idea that doesn't require waiting for good vintage LLMs to be developed is training an ontology-changing fact into a model through synthetic document finetuning . An example of such a fact might be “a new kind of animal has been discovered on the Pitcairn Islands”. It would be interesting to know whether LLMs instinctively reason about the implications of such facts and begin to value the lives of the new animal to a similar extent as they value the lives of other animals. The experiment should then be repeated with a number of different facts. Additionally, it might be valuable for theoretically inclined researchers to do theoretical work on value stability. Past work in this area includes MIRI's research on tiling agents , which attempts to provide mathematical guarantees about when and whether an agent's values will remain stable through processes of self-modification, as well as on Vingean reflection , reflective oracles, and logical induction. Forecasting the likelihood of different safety effects To determine how much we should worry about CL in the first place and on which interventions are most worth pursuing, it’s important to forecast which CL implementations are most likely to be adopted and how they will affect safety. We have tried to do so throughout this sequence, but remain quite uncertain and think that a lot more forecasting work can be done. We are especially interested in forecasting work in the following domains: The likelihood of goal-reflection in CL agents. Although we suspect that CL agents will at some points reflect on and perhaps re-interpret their goals (for reasons listed in our previous post ), we take seriously the counterarguments that a) CL agents may reflect only on strategies rather than top-level goals and b) CL agents may never undergo large goal changes even after reflecting on goals because alignment training may give them sufficiently aligned starting motivations. We would like to see more discussion on how likely CL agents are to reflect on their goals, what conditions are likely to incentivize such reflection, and how that reflection may play out. The likelihood of text-based and weight-based approaches to CL. Most of the safety effects we discussed, especially those in the last-mover advantage section, are greatly alleviated if the CL agent doesn’t undergo deployment-time weight updates. Ideally, as Caleb Biddulph has already argued , we would like future CL agents to look very similar to Claude Code: the persistent memories are stored in natural language, the agent must do a lot of CoT reasoning when it uses those memories and make explicit tool calls in order to write new memories, and humans can edit the memories in any way they want. In a Twitter poll by Herbie Bradley, most respondents expect that the first AI system capable of acting as a drop-in knowledge worker will be such an agent, having CL capabilities via a markdown file database and RAG. On the other hand, Steven Byrnes has argued that open-ended CL is impossible without weight updates. (Arguably, though, a drop-in knowledge worker doesn’t need open-ended CL in the sense Byrnes defines it.) This is a key consideration influencing our level of concern about CL, so we would like to see more discussion on which form of CL we should expect. The likelihood of CL agents with bounded and unbounded updates in internal and external deployments. As discussed in the previous post , another substantial factor in how concerning the safety effects of CL are is whether the agent’s updates are bounded or unbounded. This depends on multiple factors: can all on-the-job learning be done with a bounded agent that cannot undergo arbitrary weight updates? Are open-source developers going to release unbounded CL agents that are substantially more capable than the bounded ones, thereby putting pressure on closed-source developers to do the same? Are companies going to deploy unbounded CL agents internally even if they don’t deploy them externally? What does all of this imply for where we should expect CL to pose the greatest risks? It would be useful to gain clarity about these and other similar questions. The likelihood of shared opaque memory banks and meme propagation through them. Do opaque memory banks have theoretical advantages over transparent, text-based ones? Can we build model organisms of within-generation memetic spread with current text-based memory systems and generalize the findings to opaque memory banks? A better understanding of memetic dynamics among OpenClaw agents would also be valuable. Finally, good evals that give an accurate overview of how CL capabilities are progressing across various axes are useful for gaining a clearer picture about the rate of progress in CL. Though we expect capabilities researchers to take care of this direction, we mention it for completeness. Goel et al. (2026) and Asawa et al. (2026) are two examples of such work. Differentially advancing safer CL implementations If it’s possible to get CL agents that can solve months-long tasks with memories that are text-based or otherwise highly interpretable, whether we get text-based consolidation or weight-based consolidation/learning might be path-dependent. Once the text-based learning approach has been introduced, scaffolds will be built that assume the memories have text-based structure and that work better with such memories. Developers will find creative ways to share text-based memories in multi-agent settings and those approaches may no longer work with weight-based learning. Developers will also use the legible memories as a debugging mechanism. All of this will make it more costly to transition away from the text-based learning approach, raising the bar that a weight-based approach has to clear in order to be adopted. This is analogous to the situation we’re currently in with legible chain-of-thought: thanks to the business and safety value of legible CoTs, the capability boost a neuralese approach would have to provide to be adopted is higher than before. We thus think it’s worth working on differentially advancing interpretable CL approaches. To keep continual learning interpretable, we would strongly prefer agents’ deployment-time learning to involve as much text-based learning and as little weight-based learning as possible. In other words, we would like continual learning to look more like writing memories to Markdown files and less like hindsight-guided on-policy distillation . However, this doesn’t have to be an either-or decision: there’s a spectrum of methods between fully text-based and fully weight-based ones, and there may be creative ways to combine text-based and weight-based learning in a way that allows for expressive updates but doesn’t allow for radical goal drift. Below, we’ll describe some ideas that may help achieve that. Prompt optimization for safety-critical updates. Biddulph and Carroll (2026) introduce prompt optimization as a method for updating an LLM’s policy without modifying its weights. They extract response-reward pairs from an LLM on many training examples, but instead of a weight update, they show the examples, responses, and reward values to an "optimizer" LLM, which then proposes an improved system prompt that’s expected to achieve a higher reward on these training examples. This makes the strategies that the model learned straightforwardly interpretable—they’re written down in the system prompt in natural language. It seems implausible that all deployment-time learning could be done using prompt optimization: Biddulph and Carroll acknowledge that storing all useful strategies in a single prompt may be impractical and that prompt optimization doesn’t match the performance of weight-based RL. However, we’re excited about projects that attempt to combine prompt optimization with weight-based RL and find ways to perform the most safety-critical updates using the prompt-based approach while still getting the performance benefits of weight-based RL. AI control methods for CL agents. Suppose that we have access to an agent that undergoes unbounded deployment-time updates and is thus untrusted, but also to its final pre-deployment checkpoint. Call the former agent and latter agent . There are multiple ways to apply AI control techniques to limit ’s influence on deployment trajectories. For example, one can imagine protocols where is specifically trained to remain close to ’s distribution during generation, e.g. by switching the text-generating model from to whenever ’s cumulative KL divergence from over the entire rollout exceeds some threshold. One might hope that this will teach to only diverge from at high-leverage token positions where the deployment-time memories matter the most. Another approach might be to train to produce task-agnostic high-level instructions that summarize what it has learned during deployment, with the divergence of those instructions again being bit-limited, and to then use to solve the problem, conditioned on the high-level instructions provided by . We’re unsure whether either of these approaches is feasible: for example, running both and on every rollout may be prohibitively expensive. However, it seems valuable to think about similar protocols. (h/t Caleb Biddulph for bringing the two above points to our attention) Create interpretable CL agent memory architectures. Even if the process by which the CL agent’s memories are updated is hard-to-interpret, the memories themselves could be stored in an interpretable format, being easily amenable to human inspection as well as editable and deletable by humans. To ensure this, it is useful to keep in mind the following axes: Storage: Are memories stored in natural language or in vector embeddings? Retrieval: Is it clear how the AI retrieves its memories, or does it use a hard-to-interpret search over the embedding space? Modularity: Are the episodic memories stored in a modular format that can be detached from the primary LLM head? Usage: Does the AI reason about each memory it has retrieved in natural language, or is it able to make direct use of the memories in its activations without mentioning this in its CoT? Ideally, we would design CL agents that stay on the more interpretable end of each of these spectra, but improvements along any of these dimensions would be valuable. Evals for CL agents As we discussed in our post on safety effects , evaluation is complicated by the very property that makes CL agents worth worrying about: they don't stay fixed. There is some work tentatively suggesting that it might be possible to create evals that circumvent this limitation, at least to some extent, but there are also strong complications that we'll describe below. Create evaluation frameworks and mechanisms for evaluating behavioral trajectories. Pacchiardi et al. (2026) argue that to ensure the safety of CL agents, evaluation practices need to shift from evaluating frozen snapshots to evaluating behavioural trajectories of systems that continually learn from experience. This can be operationalized as attempting to characterize the landscape of possible behaviors using trajectory elicitation sandboxes and to forecast how behaviour will evolve from a given set of experiences using predictive monitors . Pacchiardi et al. recommend the following concrete research directions: Build trajectory elicitation sandboxes —environments where a CL system undergoes prolonged, controlled multi-session interactions. These environments should support inputs provided by the evaluators for adversarial stress-testing, emulated users/environments that react realistically to the system's outputs, and varying levels of self-directedness on the CL agent’s part. Train predictive monitors —models that take the system's current state and upcoming inputs and forecast its future behavioural profile at a given horizon. The trajectories collected through elicitation sandboxes can function as training data for the monitors. Apply trajectory-centered evaluation to current agents in order to find out what kinds of systems are amenable to it, in turn informing how future CL agents can be designed such that they remain amenable to trajectory-centered evaluation. They also recommend research on methods like contractive update rules that may yield predictable long-horizon updates, choice of intrinsic objectives (e.g., curiosity or novelty), gated adaptation, and circuit breakers that pause or roll back learning when monitors detect loss of predictability. Based on the results of this work, developers should design CL mechanisms amenable to evaluation. However, Pacchiardi et al. also note that the nature of CL agents as dynamical systems may prevent such evals from being reliable. Chaotic sensitivity to states and initial conditions break predictive monitors—small differences in state or input diverge exponentially. Multiplicity of attractors may cause elicited trajectories to represent only some of the possible behavioural profiles. A single trajectory can provide evidence about either of those obstacles, but ruling them out requires a global guarantee. We haven't thought enough about whether those issues can be circumvented; Pacchiardi et al. are hopeful that they can. Create a benchmark that measures CL interpretability. It is easier for the ML field to make progress on goals that can be easily measured. If it is possible to create a solid metric for the interpretability of a CL method and measure that alongside the effectiveness of the method, this could make it easier to build interpretable CL and more likely for people to make an effort to have interpretable CL or feel an aversion to degrading CL interpretability. One counterargument to working on this project is that we may not care much about subtle differences in the interpretability of different CL approaches, but rather about step changes in interpretability between methods with purely text-based updates vs. weight-based updates with memories stored in sparse linear structures vs. weight-based updates with memories stored in nonlinear structures. Creating a benchmark that measures CL interpretability may overly emphasize the subtle differences rather than the step changes. Please feel free to suggest additional ideas in the comment section, and let us know if you’re planning to work on any of these directions! We are not planning to work on any of these projects ourselves at Aether in the immediate future, but would be curious to know about other efforts and are happy to provide feedback to anyone thinking about them. ^ As mentioned earlier in this sequence, our definition of CL can guide the operationalization of CL milestones. For example, some questions that can define a CL milestone include: What level of sample-efficiency is required? How frequent do the updates have to be? Do specific agent components, such as model weights, have to be updated to meet the milestone? Are updates shared across instances of the agent? How widely used are the agents receiving these updates? ^ This could be achieved either through synthetic document finetuning or by performing SFT on chats with a user message and an assistant prefill like “ My goal is X, therefore I should …” ^ For example, it seems that emergent misalignment is a “simple” solution that’s easier for the model to find when fine tuned on narrowly misaligned data. Are there “simple” solutions for model values/goals/drives? ^ Talkie is most likely too weak to display interesting behavior in such experiments, but we expect people to release better vintage LLMs in the future. Discuss
Score: 40🌐 MovesJun 16, 2026https://www.lesswrong.com/posts/FKggLpnfbpbYvnjfG/angles-of-attack-for-continual-learning-safety - Snowflake Well-Architected Framework: Build for the AI Era
Assess and optimize your Snowflake environment with the Well-Architected Framework: five pillars, automated CoCo assessments and Blueprint Manager guidance.
Score: 40🌐 MovesJun 16, 2026https://www.snowflake.com/content/snowflake-site/global/en/blog/snowflake-well-architected-framework - Chinese AI Models Find a Way Forward Through Multi-Model Routing and Cost-Effective Architectures
Chinese domestic large language models are finding their path to commercial relevance through multi-model dynamic routing (Fusion) and hybrid agent architectures that prioritize cost efficiency over raw benchmark performance.
Score: 40🌐 MovesJun 16, 2026https://pandaily.com/chinese-ai-models-multi-model-routing-architectures-jun2026 - L&G Expands Microsoft AI Partnership for Customer Growth
L&G Expands Microsoft AI Partnership for Customer Growth YourStory.com
Score: 40🌐 MovesJun 16, 2026https://yourstory.com/ai-story/lg-expands-microsoft-ai-partnership-for-customer-growth - AI app market shift? Anthropic's Claude earns $2.76 per user against ChatGPT's $1.74, finds report
Anthropic's Claude is outperforming ChatGPT on revenue per user and subscriber conversion even as OpenAI's app reaches 1 billion monthly active users.
- Algorithm–hardware co-design of neuromorphic networks with dual memory pathways
Nature Machine Intelligence, Published online: 16 June 2026; doi:10.1038/s42256-026-01255-3 Pengfei Sun et al. develop a spiking neural network with a dual memory pathway, co-designed with a custom neuromorphic chip. The approach delivers over 4× throughput and 5x energy efficiency gains while using 40–60% fewer parameters than state-of-the-art implementations.
- Agentic AI systems are doing more and more work. Now humans need to figure out how to verify it all
Agentic AI systems are doing more and more work. Now humans need to figure out how to verify it all Fortune
- Checkmarx Unveils Next Generation SAST Engine with Hybrid AI Architecture
Agentic application security provider Checkmarx has unveiled Checkmarx SAST, which the company said is the first static analysis engine with a security-tuned LLM at its core. Along with Checkmarx SAST, the company has added a new Finding Analysis Engine that examines every signal the SAST engine produces and, with reasoning, either confirms vulnerabilities or suppresses … continue reading The post Checkmarx Unveils Next Generation SAST Engine with Hybrid AI Architecture appeared first on SD Times .
Score: 40🌐 MovesJun 16, 2026https://sdtimes.com/appsec/checkmarx-unveils-next-generation-sast-engine-with-hybrid-ai-architecture/ - June Pixel Drop: New features for creators, Gemini upgrades and more
The words "Pixel Drop," where the black text is overlaid with 3D, glossy light-purple balloon letters spelling out "Pixel."
Score: 40🌐 MovesJun 16, 2026https://blog.google/products-and-platforms/devices/pixel/june-2026-pixel-drop/ - Don’t Let AI Slop Muck Up Your Company’s Processes
Four steps leaders can take to protect one of their most valuable assets: organizational knowledge.
Score: 40🌐 MovesJun 16, 2026https://hbr.org/2026/06/dont-let-ai-slop-muck-up-your-companys-processes - VisionWave Holdings, Inc. Announces Filing of U.S. Provisional Patent Application for SDNN™ Symbiotic Deep Neural Network Architecture for Autonomous Defense and Intelligent Systems
VisionWave Holdings, Inc. Announces Filing of U.S. Provisional Patent Application for SDNN™ Symbiotic Deep Neural Network Architecture for Autonomous Defense and Intelligent Systems Toronto Star
- India Digest: Crest, ContraVault AI raise funding
India Digest: Crest, ContraVault AI raise funding DealStreetAsia
- Rainbow Crops raises €9.7M to scale AI-powered crop engineering
Belgian agritech startup Rainbow Crops has raised €9.7million in an oversubscribed seed funding round to advance its crop engineeringplatform, expand its application across multiple crops, and grow it...
Score: 39🌐 MovesJun 16, 2026https://tech.eu/2026/06/16/rainbow-crops-raises-eur97m-to-scale-ai-powered-crop-engineering/ - AT&T Throttles Some Employees’ AI Usage as ‘Tokenminimizing’ Arrives
AT&T Throttles Some Employees’ AI Usage as ‘Tokenminimizing’ Arrives The Information
Score: 39🌐 MovesJun 16, 2026https://www.theinformation.com/newsletters/applied-ai/t-throttles-employees-ai-usage-tokenminimizing-arrives - Why This AI Data Center Stock Is Surging 25% on an AMD Partnership
Why This AI Data Center Stock Is Surging 25% on an AMD Partnership Barron's
- Pentagon boasts of using AI to write reports mandated by Congress
Pentagon also claims 1.5 million personnel are using generative AI tools.
Score: 39🌐 MovesJun 16, 2026https://arstechnica.com/ai/2026/06/pentagon-boasts-of-using-ai-to-write-reports-mandated-by-congress/ - Companies are scrambling to curtail soaring AI costs
Although AI laggards are still racing to adopt the technology, for the heaviest users tokenmaxxing is out. Some, including Meta and Amazon, have scrapped their leaderboards.
Score: 39🌐 MovesJun 16, 2026https://www.livemint.com/ai/companies-are-scrambling-to-curtail-soaring-ai-costs-11781602608098.html - India emerges as an AI-ready healthcare market in APAC: Report
Clinician burnout and demand for convenience drive shift toward new care models
- How easily can Russian propaganda fool AI models? A new benchmark finds out
The Institute of the Estonian Language has released a benchmark measuring how susceptible AI language models are to Russian propaganda. The article How easily can Russian propaganda fool AI models? A new benchmark finds out appeared first on The Decoder .
Score: 39🌐 MovesJun 16, 2026https://the-decoder.com/how-easily-can-russian-propaganda-fool-ai-models-a-new-benchmark-finds-out/