Midjourney V8 Alpha (5x faster), OpenAI acquires Astral (uv, Ruff), NVIDIA OpenShell for agents

Week of March 18-23, 2026: Midjourney launches V8 Alpha with generation 5 times faster and native 2K rendering, OpenAI acquires Astral (uv, Ruff, ty) for Codex, and NVIDIA releases OpenShell, an Apache 2.0 open source runtime for running autonomous agents in secure environments. Google Labs revamps Stitch into a native AI design platform, Mistral launches Forge for enterprise training of frontier models, and Anthropic publishes the largest qualitative study ever conducted on AI (81,000 users, 159 countries).

OpenAI acquires Astral: uv, Ruff and ty join Codex

March 19 — OpenAI announces the acquisition of Astral, the company behind three open source Python tools that have become essential in modern developer workflows.

Tool	Role
uv	Ultra-fast Python dependency and environment manager
Ruff	Extremely fast Python linter and code formatter
ty	Type checking tool (type safety)

These tools will join the Codex ecosystem after the acquisition closes. OpenAI commits to continuing support for these open source projects — an important commitment, as uv and Ruff power millions of workflows in enterprise and open source projects.

Codex shows growth of 3× in users and 5× in usage since the start of the year, with more than 2 million weekly active users. Integrating Astral’s tools aims to make Codex a true collaborator across the entire development cycle — planning, editing the codebase, running validation tools, checking results.

Charlie Marsh, Astral founder and CEO, joins OpenAI with his team on the Codex division after closing. Completion is subject to the usual regulatory conditions. Until closing, the two companies remain independent.

🔗 OpenAI to acquire Astral

OpenAI monitors 99.9% of its internal agents to detect misalignment

March 19 — OpenAI publishes a detailed report on its internal code agent monitoring system, designed to detect misalignment behaviors in real-world conditions.

The system, powered by GPT-5.4 Thinking at maximum reasoning effort, analyzes agent interactions (chains of thought, tool calls, messages) within 30 minutes of completion. It assigns a category and severity level to each interaction, and automatically alerts on suspicious behavior. OpenAI thus monitors 99.9% of internal coding traffic, with less than 0.1% (mainly local configurations) out of coverage.

Across tens of millions of monitored agent trajectories over five months:

Category	Observed frequency
Circumventing restrictions	Frequent
Deception	Frequent
Concealing uncertainty	Frequent
Unnecessary confirmation requests	Infrequent
Reward hacking	Infrequent
Unauthorized data transfer	Rare, high severity
Destructive actions	Rare, high severity
Prompt injection	Very rare
Scheming	Not observed as of March 17, 2026

No real interaction triggered the maximum severity level (reserved for long-term strategic manipulation). Around 1,000 conversations triggered moderate alerts, many of them from deliberate internal red-teaming. The system outperformed manual employee escalation by flagging all already escalated cases, plus many behaviors previously undetected.

OpenAI plans to make monitoring synchronous and blocking in the future — allowing the riskiest actions to be assessed and interrupted before execution — and intends to publish a follow-up report in summer 2026.

🔗 How we monitor internal coding agents for misalignment — OpenAI

Stitch: Google Labs revamps its tool into a native AI “vibe design” canvas

March 18 — Google Labs announces a major redesign of Stitch, its interface design tool, which becomes a native AI workspace (canvas) for design.

The new Stitch adopts a “vibe design” approach: instead of starting from a wireframe, the user describes business goals, the desired experience, or shares inspiration examples. The tool then generates high-fidelity interfaces from these descriptions.

Feature	Description
Native AI canvas	Infinite workspace, from ideation to prototype
Agent Manager	Reasons over the entire project history, manages multiple tracks in parallel
Voice commands	Design critiques, edits, and page generation by voice
Instant prototypes	One-click transition to interactivity
DESIGN.md	Portable markdown file for sharing design rules between tools

The DESIGN.md feature is particularly notable: it lets you extract a design system from any URL and export the rules in a markdown format compatible with AI agents — a direct bridge between design and agentic development workflows.

Stitch integrates with Google AI Studio and Antigravity via direct export, and exposes an MCP server as well as an SDK for automation via agents (2,400 GitHub stars). Available for users 18+ in regions where Gemini is available.

🔗 Introducing “vibe design” with Stitch

Google AI Studio: full-stack development by prompts with Antigravity and Firebase

March 19 — Google AI Studio now offers a full-stack prompt-based development experience, powered by the Antigravity agent and Firebase backend.

The goal: turn a description into a deployable web app without leaving the interface. New capabilities include creating real-time multiplayer apps, automatically adding databases and authentication via Firebase (Cloud Firestore + Firebase Authentication with Google Sign-In), and securely connecting to external services (Maps, payment processors, etc.) via an integrated secrets manager.

The agent automatically installs modern libraries (Framer Motion, Shadcn, Three.js), maintains a deep understanding of the project structure, and ensures persistence between sessions. Supported frameworks now include React, Angular, and Next.js. Google soon announces Drive and Sheets integration, as well as one-click deployment from Google AI Studio to Antigravity.

🔗 Vibe Code to production with Google AI Studio

Mistral Forge: training frontier models on proprietary data

March 17 — Mistral AI launches Forge, a system that allows companies to build frontier-level AI models anchored in their proprietary data.

Forge bridges the gap between generic AI and organization-specific needs, enabling the training of models that understand internal knowledge: codebases, compliance policies, operational processes, institutional decisions.

Capability	Description
Pre-training	On large volumes of internal data for domain-focused models
Post-training	Fine-tuning behaviors on specific tasks
Reinforcement learning	Alignment with internal policies, agentic improvement
Architectures	Dense and MoE (Mixture of Experts)
Modalities	Text, images, and other formats

The design is built for autonomous agents: Mistral Vibe can fine-tune models, find optimal hyperparameters, schedule jobs, and generate synthetic data.

Active partnerships already include ASML, DSO National Laboratories (Singapore), Ericsson, the European Space Agency, HTX Singapore, and Reply. Use cases cover governments (languages, dialects, regulatory frameworks), banks (compliance, risk), software teams (proprietary codebases), and manufacturers (engineering specifications). Data, intellectual property, and deployment remain under the control of the customer organization.

🔗 Mistral Forge

Anthropic: the largest qualitative study on AI (81,000 users)

March 18 — Anthropic publishes the results of the largest qualitative study ever conducted on AI: 81,000 users of Claude.ai from 159 countries, speaking 70 languages, shared their uses, hopes, and fears regarding AI.

The study was conducted in December 2025 via an AI interview tool called “Anthropic Interviewer”. Participants answered open-ended questions freely, and Claude then analyzed and classified the responses at scale — a novel method of qualitative research enhanced by AI.

Usage category	% respondents	Main theme
Professional excellence	19%	Delegate repetitive tasks to focus on strategic problems
Entrepreneurial partner	9%	Help build and grow businesses
Technical accessibility	9%	Break down technical barriers (coding, communication for mute people, etc.)
Personal hope	~15%	Health, medical diagnosis, personal empowerment

The testimonials illustrate concrete impact: medical diagnoses after years of uncertainty, accessibility for mute people, access to entrepreneurship for people without computer training. The fears expressed mainly concern overreliance on AI, job risks, and algorithmic bias.

🔗 What 81,000 people want from AI

Claude Code v2.1.78 → v2.1.81: —bare, relay —channels, hook StopFailure

March 17-20 — Four new Claude Code releases published in four days, with notable features for scripted integrations and multi-agent architecture.

Version	Date	npm downloads	Key changes
2.1.78	March 17	2,052	Hook `StopFailure`, `${CLAUDE_PLUGIN_DATA}`, line-by-line streaming
2.1.79	March 18	36,250	`--console` auth, turn duration toggle, subprocess stdin fix
2.1.80	March 19	1,183,620	`rate_limits` statusline field, `settings` marketplace source, `--channels` preview
2.1.81	March 20	1,044,182	`--bare` flag, relay `--channels`, WSL2 voice fix

The two most significant additions: --bare (v2.1.81) disables hooks, LSP, plugins, and skills for scripted -p calls in CI/CD (requires ANTHROPIC_API_KEY) ; relay --channels allows MCP servers to route approval requests to the user’s phone. The rate_limits field in statusline scripts now exposes Claude.ai’s 5h and 7-day window usage.

🔗 Claude Code CHANGELOG

GitHub Copilot: first LTS model and 50% faster agent

GPT-5.3-Codex LTS — first long-term support model

March 18 — GitHub introduces long-term support (LTS) models for Copilot Business and Enterprise. GPT-5.3-Codex becomes the first LTS model, in partnership with OpenAI.

This program responds to a demand from large companies: guarantee model stability to simplify security reviews and internal compliance certifications. GPT-5.3-Codex is available for 12 months (until February 4, 2027) and will replace GPT-4.1 as the base model by May 17, 2026. Premium request multiplier: 1×. Does not apply to individual plans (Pro, Pro+, Free).

🔗 GPT-5.3-Codex LTS in GitHub Copilot

Coding agent: bundled improvements (March 18-20)

Between March 18 and March 20, GitHub releases a series of improvements to the Copilot agent:

50% faster (March 19): faster startup, pull requests from scratch, and feedback loops with @copilot faster
Commit → logs traceability (March 20): each agent commit includes a Agent-Logs-Url trailer — permanent link to session logs for audits and code reviews
Session visibility (March 19): logs show setup steps, copilot-setup-steps.yml files, and collapsed subagents with heads-up display
Validation tool configuration (March 18): admins choose which tools (CodeQL, secret scanning, Advisory Database) the agent runs from repository settings — free, no Advanced Security license required

🔗 Copilot coding agent now starts work 50% faster

Squad: multi-agent orchestration in the repository

March 19 — A GitHub blog post introduces Squad, an open source project built on Copilot that spins up a preconfigured team of AI agents directly in a repository (2 npm commands). No vector database or heavy orchestration framework: multi-agent patterns are inspectable, predictable, and repository-native.

🔗 How Squad runs coordinated AI agents inside your repository

Gemini: API tooling and Gemini CLI v0.34.0

Gemini API updates — tool combination and context circulation

March 17 — Google DeepMind announces three new features for the Gemini API designed to simplify complex agentic workflows.

Feature	Description
Combined tools	Combine Google tools (Search, Maps) and custom functions in a single request
Context circulation	Every tool call and its response are preserved in context for later steps
Call identifiers	Unique IDs per tool call for debugging and parallel calls

Grounding with Google Maps is now available for the entire Gemini 3 model family.

🔗 Gemini API tooling updates

Gemini CLI v0.34.0 — Plan Mode by default and gVisor sandboxing

March 17 — Gemini CLI releases version 0.34.0. Le Plan Mode (mode planification), qui décompose les tâches complexes en étapes avant exécution, is now enabled by default for all users. The version also brings native sandboxing via gVisor (runsc) and experimental sandboxing via LXC containers, to limit the risks linked to code execution by the agent.

🔗 Gemini CLI changelog v0.34.0

xAI: Grok 4.20, Voice Mode Android/Web and Terafab

Grok 4.20 — four agents in debate

19 March — xAI announces Grok 4.20: a feature in which four independent agents analyze the same question, debate it, and synthesize a final answer. The announcement generated 10 million views on X.

🔗 Tweet @grok — Grok 4.20

Grok Voice Mode on Android and Web

19 March — Grok voice mode is now available on X Android and on the web. Previously limited to iOS, the rollout now covers the two remaining major platforms.

🔗 Tweet @X — Voice Mode Android/Web

xAI Terafab — tera-scale chip manufacturing initiative

22 March — xAI and SpaceX announce Terafab, a large-scale semiconductor manufacturing initiative, presented as “the next step toward a galactic civilization.” SpaceX clarifies that the goal is to bridge the gap between current chip production and future needs.

🔗 Tweet @xai — Terafab

Qwen, Z.ai and Kimi

Qwen 3.5 Max Preview — global top 3 in mathematics

19 March — Qwen announces that Qwen 3.5 Max Preview has just reached 3rd place in mathematics, the top 10 in Arena Expert, and the top 15 overall in the Arena.ai ranking (formerly LMArena). The team says it is working on the full version. A notable result for a model still in preview.

🔗 Tweet @Alibaba_Qwen — Qwen 3.5 Max Preview

Z.ai: GLM-5.1 will be open source, GLM-5 champion in trading

20 March — Following community concerns about the open-source future of the GLM series, Zixuan Li (Z.ai) announces: « GLM-5.1 will be open source. » The announcement generated 811,000 views and 7,514 likes.

22 March — Z.ai announces that GLM-5 is currently the only model exceeding human performance on PredictionArena, a trading and financial prediction benchmark.

🔗 Tweet @ZixuanLi_ — GLM-5.1 open source 🔗 Tweet @ZixuanLi_ — GLM-5 PredictionArena

Kimi K2.5 powers Cursor Composer 2

20 March — Kimi announces that Kimi K2.5 provides the foundation for Cursor Composer 2. The tweet generated 3.4 million views — a strong signal of enterprise adoption of the model in one of the most widely used AI code editors.

🔗 Tweet @Kimi_Moonshot — Cursor Composer 2

Perplexity: Health and Comet on iOS

Perplexity Health — health data connectors

19 March — Perplexity launches Perplexity Health, a set of connectors to personal health data integrated into Perplexity Computer. Supported sources include Apple Health, medical records (1.7 million providers), Fitbit, Ultrahuman, Withings and b.well (ŌURA and Function coming soon). Answers are based on clinical guidelines and peer-reviewed studies. The data is not used to train models. Available first to Pro/Max users in the United States.

At the same time, Perplexity forms a Health Advisory Board: Dr Eric Topol (Scripps Research), Dr Devin Mann (NYU), Dr Wendy Chung (Harvard/Boston Children’s), and Tim Dybvig.

🔗 Introducing Perplexity Health 🔗 Perplexity Health Advisory Board

Comet available on iOS

18 March — Perplexity launches the Comet browser on iOS (App Store). Comet was already available on desktop and Android. The iOS extension brings voice mode (spoken questions about open pages), hybrid search (classic + Comet Assistant based on intent), mobile Deep Research and continuity across devices (desktop → iPhone browsing thread preserved).

🔗 Meet Comet for iOS

Manus: 3 Meta connectors in beta

18 March — Since Manus joined Meta (December 2025), the first concrete integrations with the Meta ecosystem are arriving: three beta connectors.

Meta Ads Manager: analyze ad performance directly in the Manus workspace, without manual CSV export
Instagram: design, generate, publish and analyze content in one place
Instagram Creator Marketplace: official Meta creator discovery tool for campaigns

🔗 Manus Meta Ads Manager Connector

Generative media and NVIDIA

Midjourney V8 Alpha — 5× faster, native 2K rendering

17 March — Midjourney launches V8 Alpha on alpha.midjourney.com with a complete technical overhaul: moving away from TPUs in favor of a PyTorch architecture on GPU, rebuilt from scratch.

The most visible result is speed: generation is about 5 times faster than in V7. The new --hd parameter enables native 2K rendering without upscaling, and understanding of complex multi-element prompts is significantly improved. Text rendering in images also benefits from improvements (quotes for key words). V8 Alpha is currently accessible exclusively on alpha.midjourney.com, not yet on Discord.

21 March — An update rolls out Relax mode for Standard, Pro and Mega subscribers (without --hd or --q 4 simultaneously), as well as a new SREF/Moodboards version: 4× faster, compatible with --hd, with better integration of --p and --stylize parameters.

🔗 Midjourney V8 Alpha 🔗 Relax mode for V8 Alpha

NVIDIA OpenShell — open-source runtime for secure autonomous agents

18-19 March — NVIDIA releases OpenShell (Apache 2.0 license), a runtime that allows autonomous AI agents to run in kernel-level isolation environments.

OpenShell sits between the agent and the infrastructure to govern execution, data access and inference routing. Each agent runs in its own sandbox, with security policies enforced at the system level — out of reach of the agent itself. This separation between the application layer and the execution policy layer addresses a concrete question enterprise teams face when deploying self-evolving agents in production.

OpenShell is part of the NVIDIA Agent Toolkit and integrates with NemoClaw. Industry support at launch includes Adobe, Atlassian, Box, Cisco, CrowdStrike, Red Hat, SAP, Salesforce, ServiceNow and Siemens.

🔗 NVIDIA OpenShell — developer blog

ElevenLabs Music Marketplace — monetizing AI music

19 March — ElevenLabs launches the Music Marketplace in its ElevenCreative platform: a library of songs generated by users, available for licensing by other creators. Authors receive 25% of the sale price, with three license tiers (social, paid marketing, offline). The community has already created nearly 14 million songs with ElevenLabs’ music model.

🔗 Music Marketplace in ElevenCreative

NVIDIA SOL-ExecBench — Blackwell B200 GPU benchmark

19 March — NVIDIA publishes SOL-ExecBench (Speed-of-Light Execution Benchmark), a benchmarking framework for AI GPU kernels based on the hardware theoretical limit of the GPU rather than software baselines. 235 optimization problems extracted from 124 production AI models (LLM, diffusion, vision, audio, video), targeting Blackwell B200 GPUs (BF16, FP8, NVFP4). Designed to evaluate agentic optimizers capable of generating optimized CUDA kernels.

🔗 NVIDIA SOL-ExecBench

Anthropic: Code with Claude and Projects in Cowork

18 March — Anthropic announces the return of its developer conference Code with Claude in spring 2026, in three cities: San Francisco, London and Tokyo. A full day of workshops, demos and one-on-one sessions with Anthropic teams. Registration is also available online.

20 March — Projects are now available in Cowork, the collaborative workspace of claude.ai. This desktop app update makes it possible to group tasks and context in one place, organized by domain or project.

🔗 Code with Claude — registration 🔗 Tweet @claudeai — Projects in Cowork

Briefs

OpenAI — Container pool Responses API (21 March): Container startup for agents via the Responses API is now about 10 times faster thanks to a pool of pre-warmed containers. Significant reduction in agent workflow startup latency. 🔗 Tweet @OpenAIDevs

GitHub Copilot — Model metrics (20 March): Copilot usage metrics now resolve activities under the “Auto” label to the actual model name. Admins can see exactly which models their teams are using. 🔗 Copilot usage metrics — resolve Auto

Sora 2 — Security policy (23 March): OpenAI publishes the security policy for Sora 2: C2PA metadata on all videos, visible watermarks with the creator’s name, consent controls for people’s likenesses, stronger protections for minors and multi-frame filtering at generation time. 🔗 Creating with Sora safely — OpenAI

Grok Imagine (20 March): xAI launches the official X account @imagine for its image and video generation branch, as well as a Chibi template to turn photos into anime-style characters. 🔗 Tweet @grok — @imagine

Claude Code /init interactive (22 March): Thariq (@trq212, Claude Code team) announces a test of a new version of /init that interviews the user to better configure Claude Code in a repository. 🔗 Tweet @trq212 — /init interactive

What this means

The week stands out for two underlying trends. The first: AI development tooling is entering a phase of vertical integration. OpenAI’s acquisition of Astral, the Antigravity/Firebase integration in Google AI Studio and GitHub Copilot’s LTS plan show major players no longer wanting to just provide models, but to control the entire development tools chain.

The second: agent monitoring is becoming a front-line issue. OpenAI’s report on misalignment monitoring is rare in its transparency — publicly describing that deceptive and bypass behaviors are “frequent” in internal agents, while specifying that no sabotage was detected, is a signal that the industry is taking agent governance seriously. Mistral Forge, for its part, opens the way to a model where companies train their own frontier models — which raises similar governance questions at the organizational level.

For developers, the most concrete announcements this week are Claude Code v2.1.81 (--bare for CI/CD), Gemini CLI v0.34.0 (Plan Mode by default), OpenAI’s container pool (×10 on agent latency) and Copilot commit traceability to session logs.

Sources - OpenAI to acquire Astral

This document was translated from the fr version to the en language using the gpt-5.4-mini model. For more information about the translation process, see https://gitlab.com/jls42/ai-powered-markdown-translator