Search

Karpathy joins Anthropic, Google I/O 2026 launches the Gemini 3.5 agentic era, Cohere acquires Reliant AI

Karpathy joins Anthropic, Google I/O 2026 launches the Gemini 3.5 agentic era, Cohere acquires Reliant AI

ai-powered-markdown-translator

Article translated from fr to en with gpt-5.4-mini.

View project on GitHub ↗

May 19, 2026 is a packed day for AI: Andrej Karpathy leaves the stage to join Anthropic, Google I/O 2026 launches a set of agentic announcements around the Gemini 3.5 series, xAI rolls out Grok Skills with Office document generation, and Cohere makes a strategic acquisition in biopharma. At the same time, OpenAI strengthens the provenance of AI content, and the generative media world gains new capabilities.


Andrej Karpathy joins Anthropic

May 19 — Andrej Karpathy announced on X that he has joined Anthropic. The news generated more than 11.3 million views, 102,000 likes, and 13,000 reposts in a few hours — exceptional engagement that measures the impact of this hiring in the community.

“Personal update: I’ve joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.” — @karpathy on X

Karpathy is one of the best-known figures in the field. His path: a PhD at Stanford, founding member of OpenAI, director of AI at Tesla, then an independent figure with the educational nanoGPT series and his machine learning courses followed by millions of people.

His arrival at Anthropic significantly strengthens the fundamental research team at the exact moment when competition among frontier labs is intensifying. An ex-OpenAI executive joining Anthropic — the signal is strong about talent dynamics in the sector.

Anthropic did not specify his exact role. The announcement coincides with the Code with Claude London event and several major launches on the same day.


Google I/O 2026 — The Gemini 3.5 agentic era

May 19 — Google I/O 2026 is taking place today, with the central theme of the Gemini agentic era (agentic Gemini era). During the keynote, Google announces a coherent set of announcements that all revolve around Gemini’s ability to act autonomously, proactively, and persistently.

“Three years ago, Gemini started by understanding the world. With Gemini 2, models learned to think and reason. Late last year, Gemini 3 brought any idea to life. Today, we’re continuing that journey with our Gemini 3.5 series, starting with Gemini 3.5 Flash, delivering frontier performance for agents and coding.” — @GoogleAI on X

The main announcements from the I/O 2026 keynote:

AnnouncementCategory
Gemini 3.5 FlashNew model (agents + coding)
Gemini OmniMultimodal video model
Gemini Spark24/7 personal agent
Daily BriefPersonalized morning agent
Neural ExpressiveNew app design
Managed Agents APIAntigravity infrastructure for developers
Google AI StudioRenewed with agent templates

🔗 Google I/O 2026 blog


Gemini 3.5 Flash — Speed and intelligence for agents

May 19Gemini 3.5 Flash is the first model in the new 3.5 generation. It is designed to combine frontier intelligence and lightning-fast action, especially optimized for agent and coding use cases.

Gemini 3.5 Flash is the underlying engine for the Antigravity agent (Managed Agents API) and Gemini Spark. It is available via the Gemini API and in Google AI Studio as of May 19.

GenerationKey capability
Gemini 1World understanding
Gemini 2Reasoning and thinking
Gemini 3Multimodal creation
Gemini 3.5 FlashAgents, coding, lightning speed

The model is also generally available in GitHub Copilot on the day of launch — on Pro, Pro+, Business, and Enterprise plans. GitHub reports near-Pro performance at Flash speed, with efficient tool use, fast response times, and high cache efficiency. Rollout is gradual; Copilot Enterprise and Business admins must enable the Gemini 3.5 Flash policy in settings.

Copilot attributeValue
Plans affectedPro, Pro+, Business, Enterprise
Supported IDEsVS Code, Visual Studio, JetBrains, Xcode, Eclipse, GitHub Mobile
Premium multiplier14× (temporary, subject to change)

🔗 GitHub Copilot announcement


Gemini Omni — Create anything from anything

May 19 — Google DeepMind launches Gemini Omni, presented as “the first step toward a model capable of creating anything from anything — starting with video.” It combines Gemini’s intelligence with Google’s generative media systems (Veo, Imagen).

“We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video. It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing.” — @GoogleDeepMind on X

In the Gemini app, Gemini Omni enables high-quality cinematic video generation from text, images, or existing videos, video editing through natural conversation (cinematic zooms, background changes with a simple prompt), creation of custom AI avatars resembling the user, and one-tap application of built-in templates.

Availability: Gradual rollout starting May 19 for Google AI Plus, Pro, and Ultra subscribers worldwide.


Gemini Spark — 24/7 personal agent

May 19 — Google announces Gemini Spark, a personal AI agent available 24/7. This is a fundamental shift in Gemini’s nature: it moves from a reactive assistant to an active partner that works in the background even when the user closes their device (cloud architecture).

Spark runs on Gemini 3.5 Flash via the Antigravity infrastructure. It is deeply integrated with Workspace tools (Gmail, Docs, Slides, Calendar).

CapabilityDetail
Recurring tasksAutomatic triggers (e.g.: analyze monthly statements)
Skill learningOn user instruction
End-to-end workflowsMeeting summary → Google Doc → follow-up email
MCP connectionsCanva, OpenTable, Instacart at launch
User controlConfirmation required before high-impact actions

Availability: Trusted testing this week; beta for Google AI Ultra subscribers in the United States the following week. Arrival in the Gemini macOS app is planned for summer 2026.


Managed Agents in the Gemini API — Open Antigravity infrastructure

May 19 — Google launches Managed Agents in the Gemini API, allowing developers to deploy agents with a single line of code in an isolated, ephemeral Linux environment. The infrastructure is powered by the Antigravity agent, built on Gemini 3.5 Flash.

This opens Google’s internal agent infrastructure to external developers. The first managed agent was Deep Research (December 2025); now developers can create their own.

Developer featureDetail
DeploymentOne API call — remote Linux environment
CapabilitiesWeb browsing, code execution, file management
CustomizationAGENTS.md and SKILL.md files
Persistent sessionsFull state resumption
Reusable agentsRegistration via client.agents.create()
EnterpriseGemini Enterprise Agent Platform (private preview)

Availability: Preview in the Gemini API as of the evening of May 19; templates in Google AI Studio.

🔗 Managed Agents API announcement


Claude Managed Agents — Self-hosted sandboxes and MCP tunnels

May 19 — From the Code with Claude London event, Anthropic launched two major new features for Claude Managed Agents.

Self-hosted sandboxes (public beta) — Claude agents can now run in sandboxes hosted on the customer’s infrastructure. Teams keep sensitive files, packages, and services within their own perimeter, with their existing network policies, audit logs, and security tools already in place.

MCP tunnels (research preview) — Agents can reach MCP servers located on a private network without exposing them publicly. MCP tunnels are managed from the Claude Console and supported both in Managed Agents and in the Messages API.

FeatureStatusProviders
Self-hosted sandboxesPublic betaCloudflare, Daytona, Modal, Vercel
MCP tunnelsResearch preview (access on request)Via Claude Console

🔗 Claude Managed Agents announcement


Grok Skills — Persistent expertise and Office documents

May 18 — xAI launched Skills for Grok 4.3, a persistent memory feature that lets Grok learn preferences, formatting rules, and workflow steps once, then apply them across all subsequent conversations.

“I’m leveling up my Skills. Automate your workflows and get things done in record time with prebuilt and custom Skills.” — @grok on X

Document typeCapabilities
Word (.docx)Creation and editing with full formatting (headings, tables, bullet points)
PowerPointSlides with visual hierarchy and speaker notes
ExcelFormulas, colored sections, clear structure
PDFCreation, merging, splitting, and extraction
Skill CreatorBuilding new custom skills through conversation

Skills are available on grok.com, iOS, and Android. The user’s custom skills always take priority over the default built-in skills.

🔗 Grok Skills announcement


Cohere acquires Reliant AI — Sovereign AI for biopharma

May 19 — Cohere announces the acquisition of Reliant AI, a company specialized in AI for biopharma, with offices in Montreal and Berlin. This move strengthens Cohere’s sovereign AI strategy in the healthcare and life sciences sectors.

“Healthcare represents one of the most consequential opportunities for AI and it demands secure, sovereign, and domain-specific systems.” — Aidan Gomez, Co-Founder & CEO, Cohere

Acquisition contributions:

ElementDetail
TeamHigh-level researchers specialized in biopharma
DataProprietary biomedical databases
TechnologyIntelligent research workbench
Transferred customersGSK, Medicus Pharma, Kyowa Kirin

New product: North for Pharma — Cohere announces development of an agentic AI system designed for biopharma teams working on R&D, clinical development, and scientific analysis. It extends the “North” suite already offered for the financial and telecom sectors.

Karl Moritz Hermann (former CEO of Reliant AI) becomes VP of AI Verticalizations in Berlin. Marc Bellemare (co-founder, Canada CIFAR AI Chair, Mila) becomes VP of Modelling in Montreal.

🔗 Cohere announcement


KPMG × Anthropic — Global alliance (276,000 employees, 138 countries)

May 19 — KPMG, one of the world’s largest professional services firms, announced a global strategic alliance with Anthropic. The firm integrates Claude into its Digital Gateway platform and gives Claude access to its entire workforce of more than 276,000 employees in 138 countries.

The main pillars: Claude Cowork and Managed Agents integrated into Digital Gateway, new tools for tax and legal clients, KPMG becomes Anthropic’s preferred partner for private equity funds, and KPMG teams use Claude Code (via KPMG Blaze) to modernize aging IT systems.

“Building an AI agent to help clients adjust to changing tax regulations used to take weeks and required teams to switch between multiple tools and chat windows. With Cowork and Managed Agents integrated in Digital Gateway, that same capability takes minutes.” — Rema Serafi, Vice President of Tax, KPMG US

This rollout builds on two years of internal adoption at KPMG in the United States.

🔗 Anthropic × KPMG announcement


Gemini App — Neural Expressive and Daily Brief

May 19 — The Gemini app receives two major new features.

Neural Expressive is a new design language for the AI era: smooth animations, vivid colors, new typography, haptic feedback. The Gemini Live experience (voice conversation) is now directly integrated into the main interface — seamless switching between typing and conversation. Global rollout as of May 19 on web, Android, and iOS for all users.

Daily Brief is an agent that generates a personalized summary every morning based on Gmail, Calendar, and the user’s priorities. It goes beyond a simple summary: it organizes, prioritizes, and suggests concrete next actions. Gradual rollout for Plus, Pro, and Ultra subscribers, starting in the United States.


OpenAI — C2PA, SynthID, and public verification tool

May 19 — OpenAI announces a multi-layered approach to the provenance of AI-generated content, combining three complementary mechanisms.

MechanismScopeResistance to modification
C2PA metadataImages, audio, videoMedium (removable)
SynthID watermarking (Google DeepMind)Images onlyHigh
openai.com/verify toolOpenAI-generated contentN/A

OpenAI now integrates Google DeepMind’s invisible digital watermark SynthID into all images generated via ChatGPT, Codex, and the OpenAI API. SynthID encodes the origin directly into the image pixels in an imperceptible way and resists most common modifications (cropping, filters, compression). The public tool openai.com/verify allows anyone to upload an image to detect the presence of a SynthID watermark — currently in limited preview for OpenAI content only.

🔗 OpenAI announcement


NVIDIA Sana WM — World Model on a Single GPU

May 19 — NVIDIA AI introduces Sana WM (World Model). From a single image, a text description, and a camera trajectory, the model generates coherent and controllable videos — all on a single GPU.

The architecture is based on four innovations: Hybrid Linear Attention, Dual-Branch Camera Control, Two-Stage Generation Pipeline, and Robust Annotation Pipeline. The project is entirely open source (paper + code + model).

🔗 NVIDIA AI announcement


Wan2.7-Image — Color Palette Control

May 19 — Alibaba announces color palette control (Palette Control) for Wan2.7-Image. The feature makes it possible to extract the color palette from any image — like an artistic masterpiece — and apply it to a new creation. Available directly on wan.video. The announcement generated 529 742 views on X.

🔗 Alibaba Wan announcement


ElevenLabs — Albert Einstein’s Voice for Teaching

May 19 — ElevenLabs announces the introduction of the recreated voice of Albert Einstein on its platform, as a dedicated conversational teaching agent. The agent illustrates how voice AI can make learning more interactive, accessible, and multilingual. A demo is available on start.elevenlabs.io. Teachers can access the free Pro plan through the Impact program.

At the same time, ElevenLabs announces a pilot in Poland: deployment of voice agents in public healthcare services (NFZ) this summer.

🔗 ElevenLabs announcement


xAI Grok integrated into OpenClaw — Local open-source multi-messaging agent

May 19 — xAI announces the integration of Grok into OpenClaw, a personal open-source, local-first agent (local-first). Grok and X Premium subscribers can now use Grok in OpenClaw, which runs on any hardware (Mac Mini, laptop, Raspberry Pi) and connects to WhatsApp, Telegram, Slack, Discord, Signal, and iMessage.

🔗 xAI announcement


Z.ai GLM-5.1 — No. 1 open-source SWE-Bench Pro, MIT, 200K context

May 18 — Z.ai (Zhipu AI) makes GLM-5.1 available on OrcaRouter. The model claims the top spot among open-source models on SWE-Bench Pro (a real-world code repair benchmark), surpassing even some proprietary models. MIT license, 200K token context, designed for long-horizon agentic coding (long-horizon agentic coding).

🔗 OrcaRouter announcement


Briefs

  • The Problem Solvers (Anthropic) — New video series featuring founders building with Claude. First episodes: Scott Wu (Cognition/Devin) and Kay Zhu (Genspark). 🔗 claude.com/problem-solvers

  • Manus × Higgsfield (MCP connector) — Manus integrates Higgsfield via MCP to give creators professional control over image/video generation within their workflows. 🔗 manus.im/blog

  • OpenAI Guaranteed Capacity — New offer allowing companies to reserve access to OpenAI compute resources via 1- to 3-year commitments, with increasing discounts depending on term length. 🔗 openai.com

  • Perplexity Computer — Rho — Rho reduced weekly meeting time by 90% and saved 120 hours over 12 weeks thanks to Perplexity Computer, which monitors Slack, Notion, Jira, Figma, and Google Docs. 🔗 Perplexity on X


What this means

The convergence of Managed Agents. Google and Anthropic announced on the same day, almost at the same time, a similar infrastructure under the same name: Managed Agents. At Google, it’s Antigravity opened via the Gemini API — a single call is enough to deploy an agent in an isolated Linux sandbox. At Anthropic, it’s self-hosted sandboxes (Cloudflare, Daytona, Modal, Vercel) and MCP tunnels for private networks. This convergence is no accident: it marks the emergence of a new standardized layer of AI infrastructure, between the raw model and the final application.

Enterprise AI is organizing around sovereignty. Cohere’s acquisition of Reliant AI, the KPMG-Anthropic alliance with Claude in Digital Gateway, and the OpenAI Guaranteed Capacity offering all point to the same movement: large organizations want AI systems within their perimeter, with their data, their security rules, and multi-year capacity commitments. “Sovereign” (sovereign) has become the keyword for enterprise AI in 2026.

The race for long-horizon agentic coding. Gemini 3.5 Flash (agents + coding optimized), Z.ai’s GLM-5.1 (No. 1 open-source SWE-Bench Pro, 200K context), Grok Skills (persistent workflows) — all target the same use case: an agent that works on a complex task for hours without human intervention. GitHub Copilot integrates Gemini 3.5 Flash on the very day of its launch, illustrating how quickly developer tooling absorbs new models.

Provenance and trust in AI content. The C2PA + SynthID + openai.com/verify combination announced by OpenAI (with SynthID from Google DeepMind) represents a novel multi-layer approach: metadata for context, invisible watermarking for persistence even after modification, and a public verification tool. This move toward traceability of generated content is gradually becoming an industry norm, driven jointly by OpenAI, Google, and the C2PA coalition.


Sources