🤖 Models AI News
New AI model releases and updates. From GPT and Claude to open-source models, we score and categorize the top new AI model news.
- OpenAI Ships GPT-5.5-Cyber, a Near-Mythos Model for Vetted Defenders
OpenAI launched GPT-5.5-Cyber, a specialized model for cybersecurity defenders.
- OpenAI Ships GPT-Realtime-2 — A Voice Model That Reasons Inside the Audio Loop
OpenAI releases GPT-Realtime-2, a voice model for real-time audio reasoning
- OpenAI brings GPT-5-class reasoning to real-time voice — and it changes what voice agents can actually orchestrate
Voice agents have been expensive to run and painful to orchestrate, not because the models can't handle conversation, but because context ceilings forced enterprises to build session resets, state compression, and reconstruction layers into every deployment. OpenAI's three new voice models are designed to reduce that overhead, and they change how engineers can think about building voice into a larger agent stack. GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper integrate real-time audio into the model management stack as discrete orchestration primitives — separating conversational reasoning, translation, and transcription into specialized components rather than bundling them in a single voice product. The company said in a blog post that Realtime-2 is its first voice model “with GPT-5 class reasoning” and can handle difficult requests and keep conversations flowing naturally. Realtime-Translate understands more than 70 languages and translates them into 13 others at th
- OpenAI introduces GPT‑5.5‑Cyber for high-impact cybersecurity research
OpenAI Group PBC has developed a version of GPT-5.5 that is specifically optimized for cybersecurity research. GPT‑5.5‑Cyber, as the model is called, made its debut on Thursday. It’s available in limited preview through a program called Trusted Access for Cyber, or TAC, that OpenAI launched in February. The initiative gives cybersecurity researchers expanded access to […] The post OpenAI introduces GPT‑5.5‑Cyber for high-impact cybersecurity research appeared first on SiliconANGLE .
- The Netherlands' ambitious homegrown AI model enters the real world
GPT-NL is emerging as one of Europe’s most ambitious attempts to build a homegrown AI system
- CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models
CyberSecQwen-4B: Why Defensive Cyber Needs Small, Specialized, Locally-Runnable Models
Score: 46🤖 ModelsMay 8, 2026https://huggingface.co/blog/lablab-ai-amd-developer-hackathon/cybersecqwen-4b - Ramp Builds AI Model Better than Claude Opus for Navigating Spreadsheets
Built on top of the Qwen models, Fast Ask is designed to navigate and retrieve data from spreadsheets.
Score: 38🤖 ModelsMay 8, 2026https://analyticsindiamag.com/ai-news/ramp-builds-ai-model-better-than-claude-opus-for-navigating-spreadsheets - OpenAI launches GPT-realtime voice suite for developers
OpenAI has launched new voice intelligence models for its API, enhancing AI-powered voice interactions. GPT‑Realtime‑2 offers advanced reasoning, while GPT‑Realtime‑Translate enables live multilingual conversations. GPT‑Realtime‑Whisper provides instant speech-to-text transcription, moving voice agents beyond simple responses to actively performing tasks.
- OpenAI Releases Three Realtime Audio Models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper in the Realtime API
OpenAI Releases Three Realtime Audio Models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper in the Realtime API MarkTechPost
- 😺 OpenAI's GPT-Realtime-2 is coming for call center
PLUS:
- OpenAI rolls out new model for cybersecurity teams a month after Anthropic's Mythos debut
GPT-5.5-Cyber is now available to TAC members, but it's a slight upgrade from 5.4.
- OpenAI Introduces GPT-Realtime-2 With GPT-5-Level Reasoning for Voice Assistants
OpenAI has unveiled three new real-time audio models aimed at developers building voice assistants, customer support systems, translation tools and live transcription services.
- OpenAI opens GPT-5.5-Cyber to vetted security researchers
OpenAI is releasing GPT-5.5-Cyber, a model variant that rejects far fewer security requests and even actively executes exploits against test servers. Access is limited to verified defenders of critical infrastructure, including partners like Cisco, CrowdStrike, and Cloudflare. The model competes directly with Anthropic's Mythos Preview. The article OpenAI opens GPT-5.5-Cyber to vetted security researchers appeared first on The Decoder .
- OpenAI has 3 new AI voice models that the ChatGPT maker says will ‘unlock a new class of voice apps for developers’
OpenAI has released new AI models for deep reasoning, translation and transcription.
- OpenAI’s Brand New Voice AI Is Here. It Could Change How Companies Talk to Their Customers
OpenAI has released three new voice models, and companies like Zillow and Priceline are already on board.
- How Sakana trained a 7B model to orchestrate GPT-5, Claude Sonnet 4 and Gemini 2.5 Pro
Every LangChain pipeline your team hardcodes starts breaking the moment the query distribution shifts — and it always shifts. That bottleneck is what Sakana AI set out to eliminate. Researchers at Sakana AI have introduced the " RL Conductor ," a small language model trained via reinforcement learning to automatically orchestrate a diverse pool of worker LLMs. Conductor dynamically analyzes inputs, distributes labor among workers, and coordinates among agents. This automated coordination achieves state-of-the-art results on difficult reasoning and coding benchmarks, outperforming individual frontier models like GPT-5 and Claude Sonnet 4 as well as expensive human-designed multi-agent pipelines. It achieves this performance at a fraction of the cost and with fewer API calls than competitors. RL Conductor is the backbone of Fugu, Sakana AI’s commercial multi-agent orchestration service. The limitations of manual agentic frameworks Large language models have strong latent capabilities. Bu
- Meet ZAYA1-8B, a super efficient, open reasoning model trained on AMD Instinct MI300 GPUs
Even as leading AI providers like OpenAI and Anthropic battle over the compute to train and release ever larger, more powerful models, other labs are going in a different direction — pursuing the development of smaller, more efficient models and often open sourcing them. The latest worth paying attention to comes from the lesser-known Palo Alto startup Zyphra , which this week released its new reasoning, mixture-of-experts (MoE) language model, ZAYA1-8B , with just over 8 billion parameters and only 760 million active — far fewer than the trillions estimated for the likes of the big labs. Yet, ZAYA1-8B retains competitive performance on third-party benchmarks against GPT-5-High and DeepSeek-V3.2. It can be downloaded from Hugging Face now free of charge under a permissive, standard, enterprise-friendly Apache 2.0 license — and enterprises and indie developers can begin using and customizing it immediately to suit their needs. Individual users can also test it themselves here free at Zy
- OpenAI rolls out new model for cybersecurity teams a month after Anthropic's Mythos debut
OpenAI said GPT-5.5-Cyber, a variation of its latest AI model, is rolling out in a limited preview capacity to vetted cybersecurity teams.
Score: 60🤖 ModelsMay 7, 2026https://www.cnbc.com/2026/05/07/openai-rolls-out-new-gpt-5point5-cyber-to-vetted-cybersecurity-teams.html - Available today: GPT-5.5 Instant in Microsoft 365 Copilot
The post Available today: GPT-5.5 Instant in Microsoft 365 Copilot appeared first on Source .
- OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations
OpenAI is shipping three new voice models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—that can reason in real time, translate across 70+ languages, and transcribe live speech. GPT-Realtime-2 brings reasoning that OpenAI says matches GPT-5. The article OpenAI's new voice model brings GPT-5-level reasoning to real-time conversations appeared first on The Decoder .
- OpenAI launches new voice intelligence features in its API
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a variety of other fields, including education and creator platforms.
- Great to bring GPT 5.5 Instant to M365 Copilot today. With quicker, clearer and more accurate responses, you can get to useful answers with less back and forth. Also rolling out to Copilot Studio and Foundry. All part of our focus on providing you more model choice across work, agents and apps. Read more…
The post Great to bring GPT 5.5 Instant to M365 Copilot today. With quicker, clearer and more accurate responses, you can get to useful answers with less back and forth. Also rolling out to Copilot Studio and Foundry. All part of our focus on providing you more model choice across work, agents and apps. Read more… appeared first on Source .
- OpenAI has new voice models that reason, translate, and transcribe as you speak
OpenAI has just released three new realtime voice models that it says will “unlock a new class of voice apps for developers.” Each new voice intelligence model has a unique speciality for different purposes. more…
- A model to predict and improve the efficacy of cancer treatment
How to determine the dose and frequency of chemotherapy treatments is a crucial question in pharmacology. A multidisciplinary team, which includes Inria, has developed a numerical model that is able to provide an answer and has already proved its worth in vivo. Their groundbreaking results pave the way for more effective treatment.
Score: 84🤖 ModelsMay 6, 2026https://www.inria.fr/en/model-predict-and-improve-efficacy-cancer-treatment - Google's Gemma 4 AI models get 3x speed boost by predicting future tokens
Up to 3x the speed with no loss of quality—is it too good to be true?
- French startup unveils AI model for robots and human-like hand
French startup unveils AI model for robots and human-like hand Reuters
Score: 71🤖 ModelsMay 6, 2026https://www.reuters.com/world/china/french-startup-unveils-ai-model-robots-human-like-hand-2026-05-06/ - Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk
Inworld AI Launches Realtime TTS-2: A Closed-Loop Voice Model That Adapts to How You Actually Talk MarkTechPost
- UCT researchers develop multilingual AI language model
MzansiLM is an AI model trained on all 11 South African languages, to address gaps in global AI systems.
Score: 66🤖 ModelsMay 6, 2026https://www.itweb.co.za/article/uct-researchers-develop-multilingual-ai-language-model/Gb3BwMWa6Vjv2k6V - Khosla-backed robotics startup Genesis AI has gone full stack, demo shows
Genesis AI, a startup that raised a $105 million seed round to build foundational AI for robotics, has unveiled its first model, GENE-26.5, but also a demo showcasing a set of robotic hands performing complex tasks.
Score: 65🤖 ModelsMay 6, 2026https://techcrunch.com/2026/05/06/khosla-backed-robotics-startup-genesis-ai-has-gone-full-stack-demo-shows/ - ChatGPT becomes smarter, less verbose and better at remembering you with new GPT-5.5 Instant update
OpenAI has launched GPT-5.5 Instant, an upgrade over GPT-5.3 Instant, featuring improved accuracy and clarity. The model reduces hallucinations by 52.5% and enhances performance in everyday tasks, while also offering better memory capabilities and personalization options for users.
- Timer-XL: A Long-Context Foundation Model for Time-Series Forecasting
Exploring the inner workings of a decoder-only Transformer foundation model The post Timer-XL: A Long-Context Foundation Model for Time-Series Forecasting appeared first on Towards Data Science .
Score: 55🤖 ModelsMay 6, 2026https://towardsdatascience.com/timer-xl-a-long-context-foundation-model-for-time-series-forecasting/ - This Breakthrough Model Uses 1,000X Less Compute
New AI model reduces compute requirements by 1,000X.
Score: 54🤖 ModelsMay 6, 2026https://www.superhuman.ai/p/this-breakthrough-model-uses-1-000x-less-compute - Google’s Best Small AI Models Just Got 3x Faster
With Multi-Token Prediction technology, Google has now enabled developers to bypass traditional memory bottlenecks with Gemma 4.
Score: 49🤖 ModelsMay 6, 2026https://analyticsindiamag.com/ai-news/googles-best-small-ai-models-just-got-3x-faster - Why the EU is freaked out about a new AI model
The European Commission is unveiling its new anti-poverty strategy today. The only problem is … it doesn’t include any new cash. On the pod, Zoya and Ryan discuss how the EU executive is justifying the lack of new funding in this plan. They also look at how likely the bloc is to reach its target of eradicating poverty by 2050 (spoiler […]
- ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji
ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji PCMag UK
- ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji
ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji PCMag Middle East
- ChatGPT-5.5 Instant is finally here — 7 everyday prompts that prove the 'less is more' era is actually smarter
ChatGPT-5.5 Instant is finally here — 7 everyday prompts that prove the 'less is more' era is actually smarter Tom's Guide
- Google AI Releases Multi-Token Prediction (MTP) Drafters for Gemma 4: Delivering Up to 3x Faster Inference Without Quality Loss
Google AI Releases Multi-Token Prediction (MTP) Drafters for Gemma 4: Delivering Up to 3x Faster Inference Without Quality Loss MarkTechPost
- Google speeds up Gemma 4 threefold with multi-token prediction
Google has released multi-token prediction drafters for its Gemma 4 open model family that speed up text generation by up to three times. A small auxiliary model suggests several tokens at once while the main model checks them in a single pass. The article Google speeds up Gemma 4 threefold with multi-token prediction appeared first on The Decoder .
- Google DeepMind Will Train AI Models on the MMORPG Eve Online
The AI lab has taken a minority stake in the gaming company behind the incredibly complex 23-year-old space simulator.
- Google’s latest trick gets Gemma 4 running 3x faster right on your phone
New assistant models share Gemma 4's workload for much less memory.
- Google DeepMind partners with EVE Online for AI model testing
Move comes as CCP Games spends $120M to go independent, rebrands as Fenris Creations.
- ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji
ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji PCMag
- OpenAI Makes GPT-5.5 Instant as the New Default Model in ChatGPT
GPT-5.5 Instant can draw on past chats, uploaded files, and connected services such as Gmail to tailor responses.
- OpenAI Upgrades ChatGPT’s Default AI Model to GPT-5.5 Instant, Adds New Capabilities
OpenAI, on Tuesday, announced that it is updating the default artificial intelligence (AI) model in ChatGPT. The default model is available to everyone when they first open the website or the app, including those on the free tier. So far, this experience has been powered by the GPT-5.3 Instant, but now, the San Francisco-based AI giant is replacing it with the GPT-5.5...
- OpenAI rolls out GPT-5.5 Instant with improved accuracy, sets it as ChatGPT default
OpenAI rolls out GPT-5.5 Instant with improved accuracy, sets it as ChatGPT default
- ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji
ChatGPT's New 5.5 Instant Default Model Reduces Errors, Shows Fewer Emoji PCMag Australia
- AI model analyses body composition to predict health risks
AI model analyses body composition to predict health risks EurekAlert!
- GPT-5.5 Instant: smarter, clearer, and more personalized
GPT-5.5 Instant updates ChatGPT’s default model with smarter, more accurate answers, reduced hallucinations, and improved personalization controls.
- OpenBind’s first data and model release marks a milestone for AI enabled drug discovery
OpenBind’s first data and model release marks a milestone for AI enabled drug discovery EurekAlert!