
AI Weekly Digest: 26 June – 3 July 2026

The Fable 5 crisis resolves — US Department of Commerce lifts export controls June 30, Anthropic restores global access July 1 with a new classifier blocking the cited jailbreak; Claude Sonnet 5 launches June 30 as the new default for Free and Pro at introductory $2/$10 per MTok; Claude in Microsoft Foundry goes GA; self-hosted gateway ships; OpenAI publishes first large-scale agentic-work research, teases Codex Micro hardware, and retires GPT-4.5 from ChatGPT; Google rebrands Vertex AI into the Gemini Enterprise Agent Platform and launches Gemini 3.1 Flash-Lite Image; Microsoft ships Copilot Cowork worldwide GA with automatic model selection between GPT-5.5 Thinking, Claude, and GPT-5.5 Instant.

Period: 26 June – 3 July 2026 | Published: Jul 3, 2026 | Covers: Anthropic · OpenAI · Gemini · Copilot

Dateline: July 3, 2026 | Next update: July 10, 2026

The Fable 5 crisis resolved this week in three moves across two days: the US Department of Commerce lifted the export-control directive on June 30, Anthropic launched Claude Sonnet 5 the same day as a near-Opus-quality model at Sonnet pricing, and Anthropic restored global access to Fable 5 on July 1 with enhanced safeguards. The restoration and the Sonnet 5 launch are the two biggest product events since the Fable 5 launch itself. Alongside them: Claude in Microsoft Foundry went GA, a self-hosted gateway launched for enterprises, Enterprise admin analytics were overhauled, and Claude Code shipped a major weekly update including Claude in Chrome going GA and background PR auto-open.


Claude / Anthropic

✅ Fable 5 + Mythos 5 — export controls lifted, access restored

Export controls lifted: June 30, 2026 | Access restored: July 1, 2026 | Days suspended: 19 | Enhanced safeguards: yes

On June 30, the US Department of Commerce lifted the export-control directive on Claude Fable 5 and Mythos 5 — ending the 19-day global suspension. Anthropic restored global access to Fable 5 on July 1 across the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. The resolution came with a package of enhanced cybersecurity safeguards developed in coordination with the government. Mythos 5 remains partially restricted: access has been restored for a set of US organisations following government approval on June 26, with expansion to the broader Glasswing programme ongoing.

✅ Resolved

Fable 5 is back globally from July 1 on the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. Access on AWS, Google Cloud, and Microsoft Foundry is being restored as quickly as possible. Enhanced safeguards shipped with restoration: the specific jailbreak technique cited by the government is now blocked in more than 99% of cases by a new classifier layer. Trade-off: slightly more sensitive safety routing, with a higher rate of false positives on legitimate security development and debugging tasks. Anthropic is proposing an industry-wide jailbreak severity framework (with Amazon, Microsoft, Google, and Glasswing partners) built on four criteria: capability gain provided to the attacker, scope of that gain, ease of weaponisation, and discoverability. HackerOne bug-bounty programme launched for reporting cybersecurity jailbreaks. Subscription terms: Fable 5 is available for up to 50% of weekly usage limits on Pro, Max, Team, and select Enterprise plans through July 7, after which usage credits are required.

Technical details

Model strings restored: claude-fable-5 (global), claude-mythos-5 (US Glasswing orgs only) | Enhanced classifier: blocks cited jailbreak technique >99% | Subscription window: Fable 5 included up to 50% of weekly limits through July 7; usage credits from July 8 | AWS/GCP/Foundry: restoration in progress | HackerOne: bug-bounty programme for cybersecurity jailbreaks | Jailbreak severity framework: proposed with Amazon, Microsoft, Google, Glasswing partners | Tokenizer note: same text produces ~30% more tokens than pre-Opus-4.7 models

Best for: All users — Fable 5 is back, use it freely through July 7 within weekly limits. API developers: restore claude-fable-5 in model strings and note the new tokenizer (~30% more tokens for same text).

Claude Sonnet 5 — new default for Free and Pro

Launch: June 30, 2026 | Model string: claude-sonnet-5 | Introductory pricing: $2/$10 per MTok through August 31 | Standard from Sept 1: $3/$15 per MTok | Availability: all plans + API + Bedrock + Vertex AI + Microsoft Foundry

Claude Sonnet 5 is the most significant mid-tier model Anthropic has shipped. It replaces Sonnet 4.6 as the default for Free and Pro users and delivers near-Opus-4.8 agentic performance at Sonnet pricing. It is the first Sonnet model built primarily around autonomous, multi-step tasks: planning, tool use, browser and terminal control, and self-verification without prompting. Introductory API pricing of $2/$10 per MTok makes it 60% cheaper than Opus 4.8 ($5/$25 per MTok) at launch.

★ What's new

Claude Sonnet 5 launches June 30 as the new default for Free and Pro. Key benchmarks (Anthropic's own figures): SWE-bench Verified 72.7% (vs Sonnet 4.6: 62.3%, Opus 4.8: 79.4%); Terminal-Bench 76.1% (vs Sonnet 4.6: 55.4%, +20.7 points — the largest single benchmark jump of any Sonnet launch); BrowseComp (agentic search) and OSWorld-Verified (computer use) both approach Opus 4.8 at medium effort. Safety: lower hallucination, sycophancy, and undesirable-agent-behaviour rates than Sonnet 4.6; refuses malicious requests more consistently; resists prompt injection more reliably. Cyber safeguards enabled by default. In a Mozilla Firefox test, Sonnet 5 never produced a working exploit despite multiple attempts. Sonnet 5 is included in the Cyber Verification Program on the native Claude Platform, Claude Platform on AWS, and Claude in Microsoft Foundry (Vertex AI coming soon). Tokenizer note: Sonnet 5 uses the same newer tokenizer as Opus 4.7/4.8 and Fable 5 — approximately 30% more tokens for the same text. Audit your token budgets before switching production pipelines.

Technical details

Model string: claude-sonnet-5 | Context: 1M tokens | Max output: 128k tokens | Introductory pricing: $2/$10 per MTok through August 31, 2026 | Standard pricing from September 1: $3/$15 per MTok | Tokenizer: newer tokenizer (~30% more tokens than Sonnet 4.6 for same text) | BrowseComp chart corrected June 30: updated chart uses 10M token budget with compaction and programmatic tool calling | Sonnet 4.6 deprecation: no date announced yet | Claude Code: update to v2.1.197 for Sonnet 5 as default
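
One rough way to act on the tokenizer note is to scale an existing token estimate by about 1.3 and price it at the Sonnet 5 rates listed above. The sketch below assumes the digest's approximate 30% figure holds uniformly; the function name and the sample workload are illustrative, and real inflation will vary by content.

```python
# Sketch: re-estimate monthly cost after moving to the newer tokenizer.
# Prices are the Sonnet 5 rates quoted in this digest (USD per million tokens).
# TOKENIZER_FACTOR is the digest's approximate ~30% figure, not a measured value.

TOKENIZER_FACTOR = 1.30
SONNET_5_INTRO = (2.00, 10.00)     # (input, output) through August 31, 2026
SONNET_5_STANDARD = (3.00, 15.00)  # from September 1, 2026


def estimate_cost(old_input_tokens, old_output_tokens, prices, factor=TOKENIZER_FACTOR):
    """Return (new_input_tokens, new_output_tokens, cost_usd) after scaling by factor."""
    new_in = round(old_input_tokens * factor)
    new_out = round(old_output_tokens * factor)
    cost = new_in / 1_000_000 * prices[0] + new_out / 1_000_000 * prices[1]
    return new_in, new_out, round(cost, 2)


# Hypothetical workload: 100M input and 20M output tokens per month,
# counted with the older tokenizer.
intro = estimate_cost(100_000_000, 20_000_000, SONNET_5_INTRO)
standard = estimate_cost(100_000_000, 20_000_000, SONNET_5_STANDARD)
print(intro)     # (130000000, 26000000, 520.0)
print(standard)  # (130000000, 26000000, 780.0)
```

Measuring real prompts with the provider's token counting is more reliable than a flat multiplier; the factor here only gives a first-pass budget.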

Best for: The default everyday model for most users from June 30. Cost-sensitive agent workloads where Opus-class quality is overkill. High-volume coding, tool use, and knowledge work pipelines.

Rate limits increased — higher-effort model support

Effective: alongside Fable 5 restoration and Sonnet 5 launch | Applies to: Claude Platform, claude.ai, Claude Code, Cowork

Anthropic raised rate limits across Chat, Cowork, Claude Code, and the Claude Platform to accommodate the higher token usage that comes with higher effort levels and Fable 5's larger context use. On the API, Sonnet and Haiku rate limits were also raised at every usage tier, and the tier structure simplified to three tiers: Start, Build, and Scale.

★ What's new

Rate limits raised across all paid plans on Chat, Cowork, Claude Code, and the Claude Platform. API tier structure simplified to Start, Build, and Scale (from the previous multi-tier structure). Sonnet and Haiku API rate limits raised at every tier. View your tier and current limits in the Claude Console.

Technical details

API tiers: Start, Build, Scale (simplified) | Console: view tier and limits at Claude Console | Primary drivers: higher effort levels and Fable 5 higher context use

Best for: All paid users — no action needed. API developers: check the new tier documentation in Claude Console.

Claude in Microsoft Foundry — generally available

GA date: June 29, 2026 | Platform: Azure | Availability: enterprise customers globally

Claude in Microsoft Foundry went generally available on June 29, giving Azure enterprise customers a production-ready path to Claude with Azure-native authentication (Entra ID), billing, networking, governance, and data residency controls. Two hosting options are available: Hosted on Azure (inference processed inside Azure, with US and global data zone options) and Hosted on Anthropic (full Claude API feature set). Sonnet 5 is available in Foundry from July 1.

★ What's new

Claude in Microsoft Foundry is now generally available. Hosted on Azure: inference in Azure, Azure authentication, billing, governance, US and global data zones, Anthropic as data processor. Hosted on Anthropic: full Claude API feature set, previously called Foundry Preview. Claude Sonnet 5 available in Foundry at launch on July 1. NVIDIA GB300 GPU support confirmed for Foundry. Foundry Agent Service can use Sonnet 5 as the reasoning core for multi-step planning, tool use, and task execution across enterprise systems. Feature and model parity between the two hosting options is the roadmap goal.

Technical details

GA: June 29, 2026 | Hosted on Azure: Entra ID auth, Azure Marketplace billing, US + global data zones | Hosted on Anthropic: full API features (previously Foundry Preview) | Models: Sonnet 5 (GA July 1), Opus 4.8, Haiku 4.5 | NVIDIA: GB300 GPU confirmed | Billing: Claude Consumption Units (CCUs) via Azure Marketplace | Fable 5 on Foundry: coming as part of post-suspension restoration rollout

Best for: Azure enterprise customers needing Claude with existing Azure identity, billing, and governance controls

Claude self-hosted gateway — on-premise API infrastructure

Platform: Claude Platform | Availability: enterprise | Ships inside the claude binary

Anthropic launched a self-hosted gateway — an enterprise-grade API gateway for Claude that runs inside the same claude binary developers already install. Enterprises can run it in a single stateless container on their own infrastructure, with their own network policies, identity provider (OIDC: Google Workspace, Microsoft Entra ID, Okta), and telemetry stack. This is a direct response to enterprises that need Claude's API features but cannot route traffic through Anthropic's public endpoints due to security or compliance requirements.

★ What's new

Self-hosted gateway ships inside the claude binary as a single stateless container. Identity: acts as an OIDC relying party against Google Workspace, Microsoft Entra ID, Okta, or any standards-compliant OIDC provider — issues short-lived sessions, no long-lived secrets on developer machines. Policy: managed settings defined once on the server; clients receive policy at sign-in and the gateway enforces it on every request (allowed models, default settings). Telemetry: the client stamps a usage metric on every request; the gateway relays it over OTLP to a collector you configure in your own network and on your own retention schedule.
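
The per-request policy check described above can be pictured with a minimal sketch. The `ManagedPolicy` class, its field names, and the request shape are hypothetical illustrations, not the gateway's actual configuration format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ManagedPolicy:
    """Hypothetical shape for server-side managed settings."""
    allowed_models: frozenset
    default_model: str


def enforce(policy, request):
    """Apply the policy to one request dict; return the request to forward, or raise."""
    # Fall back to the policy default when the client did not pick a model.
    model = request.get("model") or policy.default_model
    if model not in policy.allowed_models:
        raise PermissionError(f"model {model!r} is not allowed by policy")
    return {**request, "model": model}


policy = ManagedPolicy(
    allowed_models=frozenset({"claude-sonnet-5", "claude-fable-5"}),
    default_model="claude-sonnet-5",
)

print(enforce(policy, {"prompt": "hello"})["model"])  # claude-sonnet-5
try:
    enforce(policy, {"model": "claude-mythos-5", "prompt": "hello"})
except PermissionError as err:
    print(err)  # model 'claude-mythos-5' is not allowed by policy
```

Checking on the server for every request, rather than trusting the client after sign-in, is what keeps a modified client from bypassing the allowed-model list.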

Technical details

Deployment: single stateless container | Ships in: claude binary | Identity: OIDC RP (Google Workspace, Microsoft Entra ID, Okta, any standards-compliant OIDC provider) | Session: short-lived, no long-lived secrets | Policy: managed settings delivered to clients at sign-in, enforced by the gateway on every request | Telemetry: OTLP to a collector you configure | Complements: Claude Platform on AWS (AWS-native), Claude in Microsoft Foundry (Azure-native), WIF (keyless auth for CI/CD)

Best for: Enterprises with strict data perimeter requirements — financial services and healthcare teams who cannot route traffic via Anthropic's public endpoints

Claude Enterprise — richer admin analytics, model entitlements, spend alerts

Platform: Claude Enterprise | Availability: Enterprise plans | Admin console

Enterprise admins gained substantially more visibility and control over Claude usage this week. The admin console now shows usage and cost breakdowns at the model, team, and user level, with spend-threshold alerts and model-level entitlements that let admins set which models specific groups can access.

★ What's new

Richer admin analytics: usage and cost breakdowns by model, team, and user in the admin console. Model-level entitlements: set model defaults and access permissions by group, not just organisation-wide. Spend alerts: configure spend-threshold notifications to catch overages before they land on the invoice. Analytics API: finance and IT teams can pull the same metrics into existing reporting systems programmatically. These controls complement existing spend caps, access and model routing, and the usage analytics dashboard already in Claude Enterprise.
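
As an illustration of what a finance team might do with the same metrics once exported, the sketch below groups spend by team and flags anything over a threshold. The record fields and threshold values are assumptions for the example and do not reflect the actual Analytics API schema.

```python
from collections import defaultdict

# Hypothetical usage records; field names are illustrative only.
records = [
    {"team": "platform", "model": "claude-sonnet-5", "cost_usd": 420.0},
    {"team": "platform", "model": "claude-fable-5", "cost_usd": 910.0},
    {"team": "research", "model": "claude-sonnet-5", "cost_usd": 150.0},
]


def spend_by(rows, key):
    """Sum cost_usd grouped by the given field (for example 'team' or 'model')."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row["cost_usd"]
    return dict(totals)


def over_threshold(totals, thresholds):
    """Return sorted names whose spend meets or exceeds their configured threshold."""
    return sorted(
        name for name, spent in totals.items()
        if name in thresholds and spent >= thresholds[name]
    )


by_team = spend_by(records, "team")
print(by_team)  # {'platform': 1330.0, 'research': 150.0}
print(over_threshold(by_team, {"platform": 1000.0, "research": 500.0}))  # ['platform']
```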

Technical details

New in admin console: model-level cost breakdown, team/user breakdown, group-level model entitlements, spend-threshold alerts | Analytics API: same metrics as admin dashboard, programmatic access | Complements: existing spend caps, model routing, SCIM/RBAC, WIF | Plans: Claude Enterprise

Best for: Enterprise IT admins and finance teams tracking Claude spend and enforcing model access policies by team

Claude Code — Claude in Chrome GA, background PR auto-open, /dataviz skill

Platform: terminal / VS Code / web / mobile / Chrome | Availability: all plans | Update to v2.1.197 for Sonnet 5

Claude Code had another packed week: Claude in Chrome graduated from beta to generally available; background agents now automatically commit, push, and open a draft PR when they finish code work in a worktree; Sonnet 5 is the new default in Claude Code from v2.1.197; and a /dataviz skill was added for chart and dashboard design guidance.

★ What's new

Claude in Chrome is now generally available: Claude can browse, click, fill forms, and act on web pages directly inside Chrome. Update to v2.1.197: Sonnet 5 is now the default model in Claude Code with introductory pricing of $2/$10 per MTok through August 31. Background PR auto-open: background agents launched from claude agents now commit, push, and open a draft PR automatically when they finish code work in a worktree, instead of stopping to ask. New /dataviz skill: chart and dashboard design guidance with a runnable colour-palette validator. Background agent notifications: sessions that need input or finish now fire the Notification hook (agent_needs_input / agent_completed). Gateway: Claude Platform on AWS added as a gateway upstream provider (anthropicAws); model-not-found responses now advance the failover chain rather than erroring out. Built-in Explore agent now inherits the main session's model (capped at Opus) instead of always running on Haiku. Subagents and context compaction now inherit the session's extended thinking configuration. Org default models: admins set the default model in the org console; it shows as 'Org default' in /model. Fixed: brief network drops mid-response now retry with backoff instead of aborting the turn. Deprecated: Opus 4.7 fast mode, with removal on July 24.

Technical details

Update required: v2.1.197 for Sonnet 5 default | Claude in Chrome: GA | Background PR: auto-commit, push, draft PR on worktree task completion | /dataviz: chart/dashboard design skill with colour-palette validator | Notification hook: agent_needs_input / agent_completed | Gateway: anthropicAws added; model-not-found advances failover chain | Org default model: set in org console, shown as 'Org default' in /model | Fixed: ECONNRESET and transient network drops now retry with backoff | Fixed: excessive background classifier requests on repeated network host access | Fixed: background tasks stuck on 'Running' after finish or session resume | Opus 4.7 fast mode deprecated, removal July 24
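
The failover behaviour noted above (a model-not-found response moves on to the next upstream instead of failing the request) can be sketched as follows. The exception class, the upstream callables, and the second provider name `anthropic` are hypothetical stand-ins; only `anthropicAws` comes from the release notes.

```python
class ModelNotFound(Exception):
    """Stand-in for an upstream's model-not-found response."""


def call_with_failover(upstreams, request):
    """Try each (name, send) pair in order; ModelNotFound advances to the next one."""
    errors = []
    for name, send in upstreams:
        try:
            return name, send(request)
        except ModelNotFound as err:
            errors.append(f"{name}: {err}")
    # Every upstream reported model-not-found, so surface the combined error.
    raise ModelNotFound("; ".join(errors))


def aws_upstream(request):
    # Simulates an upstream that does not serve the requested model yet.
    raise ModelNotFound(f"{request['model']} not available")


def direct_upstream(request):
    return {"model": request["model"], "text": "ok"}


chain = [("anthropicAws", aws_upstream), ("anthropic", direct_upstream)]
print(call_with_failover(chain, {"model": "claude-sonnet-5"}))
# ('anthropic', {'model': 'claude-sonnet-5', 'text': 'ok'})
```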

Best for: All Claude Code users — update to v2.1.197 immediately for Sonnet 5 default. Chrome users: Claude in Chrome is now GA for production use.

API — Opus 4.7 fast mode deprecated, MCP tunnels migrated, SDK updates

Platform: Claude API | Effective: various dates this week

Three housekeeping API updates this week. Opus 4.7 fast mode is deprecated and will be removed July 24. The MCP tunnels management API moved to a new surface. And all major language SDKs now include support for the latest code execution tool version.

★ What's new

Opus 4.7 fast mode deprecated: removal July 24, 2026. After removal, requests to claude-opus-4-7 with speed: 'fast' will return an error. Migrate to Opus 4.8 fast mode. Opus 4.6 fast mode already removed as of June 29. MCP tunnels API migrated: management API moved from /v1/organizations/tunnels on the Admin API to /v1/tunnels on the Claude API; new header: anthropic-beta: mcp-tunnels-2026-06-22; new WIF scope: workspace:manage_tunnels. Previous surface remains available during migration window. SDK update: Python, TypeScript, Go, Java, Ruby, PHP, and C# SDKs now include support for code_execution_20260120 (REPL state persistence, minimum version for programmatic tool calling). No beta header required — set type to code_execution_20260120.
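
For the fast-mode deprecation, a small payload rewrite is one way to find and migrate affected requests ahead of July 24. This is a sketch under assumptions: the replacement model string `claude-opus-4-8` is inferred by analogy with `claude-opus-4-7` and is not confirmed by the notes above, and the payload is a plain dict rather than any SDK type.

```python
# Sketch: migrate request payloads that ask for fast mode on a deprecated model.
DEPRECATED_FAST_MODELS = {"claude-opus-4-7"}
REPLACEMENT_MODEL = "claude-opus-4-8"  # assumed model string, verify before use


def migrate_fast_mode(payload):
    """Return a copy of payload with fast-mode requests moved to the replacement model."""
    if payload.get("speed") == "fast" and payload.get("model") in DEPRECATED_FAST_MODELS:
        return {**payload, "model": REPLACEMENT_MODEL}
    # Anything else is returned unchanged (as a copy).
    return dict(payload)


print(migrate_fast_mode({"model": "claude-opus-4-7", "speed": "fast"}))
# {'model': 'claude-opus-4-8', 'speed': 'fast'}
print(migrate_fast_mode({"model": "claude-opus-4-7"}))
# {'model': 'claude-opus-4-7'}
```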

Technical details

Opus 4.7 fast mode removal: July 24, 2026 | Opus 4.6 fast mode: already removed June 29 | MCP tunnels new endpoint: /v1/tunnels on Claude API | MCP tunnels header: anthropic-beta: mcp-tunnels-2026-06-22 | MCP tunnels WIF scope: workspace:manage_tunnels | Migration window: previous Admin API surface still available temporarily | code_execution_20260120: REPL state persistence, programmatic tool calling, no beta header, all major SDKs updated | Next model retirement: Opus 4.1, August 5, 2026
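
The endpoint, header, and tool type listed above can be collected into a small helper that builds a request without sending it. The base URL, the use of an `x-api-key` header, and the tool `name` value are assumptions for illustration; deployments authenticating via WIF would use a different credential.

```python
OLD_PATH = "/v1/organizations/tunnels"  # Admin API surface, available during the migration window
NEW_PATH = "/v1/tunnels"                # Claude API surface
BETA_HEADER = "mcp-tunnels-2026-06-22"


def tunnels_request(api_key, base_url="https://api.anthropic.com"):
    """Build (url, headers) for the new tunnels surface; nothing is sent over the network."""
    headers = {
        "x-api-key": api_key,          # assumed auth header for this sketch
        "anthropic-beta": BETA_HEADER,
    }
    return base_url.rstrip("/") + NEW_PATH, headers


def code_execution_tool():
    """Tool entry for the newer code execution version; no beta header is needed."""
    return {"type": "code_execution_20260120", "name": "code_execution"}  # name is assumed


url, headers = tunnels_request("YOUR_API_KEY")
print(url)                      # https://api.anthropic.com/v1/tunnels
print(headers["anthropic-beta"])  # mcp-tunnels-2026-06-22
print(code_execution_tool()["type"])  # code_execution_20260120
```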

Best for: API developers using Opus 4.7 fast mode (migrate now), MCP tunnels (update to new endpoint before migration window closes), or code execution (update SDK for REPL persistence)

Plans and Pricing — most active week of 2026

Sonnet 5 introductory pricing at $2/$10 per MTok (through August 31) is the headline. Fable 5 is restored with a 50%-of-weekly-limits window on paid subscription plans through July 7, after which usage credits are required. Opus 4.6 fast mode was removed June 29. Opus 4.7 fast mode will be removed July 24.

Technical details

Sonnet 5 introductory: $2/$10 per MTok through August 31, 2026 | Sonnet 5 standard from Sept 1: $3/$15 per MTok | Fable 5: restored July 1, up to 50% of weekly limits on Pro/Max/Team/Enterprise through July 7; usage credits from July 8 | Fable 5 API: $10/$50 per MTok standard, $5/$25 per MTok Batch API | Opus 4.8: $5/$25 per MTok | Opus 4.7: $5/$25 per MTok (fast mode to be removed July 24) | Opus 4.6 fast mode: removed June 29 | Haiku 4.5: low-cost tier | Next retirement: Opus 4.1, August 5
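
A quick arithmetic check on the list prices above shows the relative savings. The prices are those quoted in this digest; the dictionary labels and helper name are illustrative.

```python
# Sketch: compare list prices quoted in this digest (USD per million tokens).
PRICES = {
    "Sonnet 5 (intro)": (2.00, 10.00),
    "Sonnet 5 (standard)": (3.00, 15.00),
    "Opus 4.8": (5.00, 25.00),
    "Fable 5": (10.00, 50.00),
}


def percent_cheaper(candidate, baseline):
    """Percentage saving of candidate vs baseline for (input, output) prices."""
    return tuple(round((1 - c / b) * 100, 1) for c, b in zip(candidate, baseline))


intro_vs_opus = percent_cheaper(PRICES["Sonnet 5 (intro)"], PRICES["Opus 4.8"])
standard_vs_opus = percent_cheaper(PRICES["Sonnet 5 (standard)"], PRICES["Opus 4.8"])
print(intro_vs_opus)     # (60.0, 60.0)
print(standard_vs_opus)  # (40.0, 40.0)
```

These figures compare price per token only. Because the newer tokenizer is shared by Sonnet 5 and Opus 4.8, the comparison between those two is not affected by the token-count inflation noted earlier.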

Best for: Switch default model to claude-sonnet-5 for everyday workloads at $2/$10 until August 31. Use Fable 5 freely through July 7. Remove fast-mode references for Opus 4.6 (already broken) and plan Opus 4.7 fast-mode migration before July 24.


ChatGPT / OpenAI

A relatively quiet week in terms of major product launches, but several important developments reinforced OpenAI's long-term direction. The biggest story was the publication of OpenAI's first large-scale research into how AI agents are changing work, showing rapid adoption of Codex across both technical and non-technical professions. OpenAI also simplified the ChatGPT Business model picker, completed the retirement of GPT-4.5 inside ChatGPT, and teased its first dedicated Codex hardware product.

First large-scale research on AI agents changing work

Published: June 27–28, 2026 | Category: Research

OpenAI published its first major economic research paper analysing how people use AI agents in real-world work. Rather than measuring benchmark performance, the paper examines millions of Codex interactions and finds that users are increasingly delegating longer, more complex, and more cross-functional work to AI agents. Within OpenAI itself, Codex has largely replaced ChatGPT for many work tasks, and the share of users assigning tasks estimated to take an experienced human more than eight hours has increased almost tenfold since the beginning of the year.

★ What's new

OpenAI frames this as evidence that AI is moving beyond question answering towards autonomous task execution. Active users increased more than fivefold during 2026. More than 10% of users now manage three or more concurrent Codex agents. The paper argues that agentic AI is beginning to reshape how organisations organise work rather than simply making existing work faster.

Technical details

Dataset: large-scale Codex usage | Growth: active users increased more than fivefold during 2026 | Multi-agent users: >10% manage three or more concurrent agents | Internal finding: Codex largely replaces ChatGPT for many OpenAI employee workflows

Best for: Enterprise AI leaders, policymakers, and organisations trying to understand how AI agents — not just chatbots — are beginning to transform knowledge work

ChatGPT Business — simplified model picker

Announced: June 26, 2026 | Platform: ChatGPT Business

OpenAI redesigned the model picker for ChatGPT Business, replacing numerous thinking variants with reasoning effort levels: Instant, Medium, High, Extra High, Pro Standard, and Pro Extended. Instant can automatically escalate to Medium when additional reasoning would improve the response. GPT-5.5 Pro continues powering Pro Standard and Pro Extended.

★ What's new

New reasoning effort options: Instant, Medium, High, Extra High, Pro Standard, Pro Extended. Thinking Light removed. This reflects OpenAI's broader strategy of hiding model complexity from end users and allowing routing systems to determine the appropriate reasoning level automatically.

Technical details

Platform: ChatGPT Business (web, iOS, Android) | New options: Instant, Medium, High, Extra High, Pro Standard, Pro Extended | Thinking Light removed | GPT-5.5 Pro continues powering Pro Standard and Pro Extended

Best for: Business users who want simpler model selection without understanding the differences between multiple GPT variants

⚠️ GPT-4.5 retirement completed in ChatGPT

Effective: June 26–27, 2026 | Platform: ChatGPT only (API unaffected)

OpenAI completed the retirement of GPT-4.5 from ChatGPT. Existing conversations continue automatically using GPT-5.5 where appropriate. API users are unaffected. Users with custom GPTs built on GPT-4.5 have been migrated to GPT-5.5 within ChatGPT.

Technical details

Applies only to ChatGPT | API unchanged | Existing conversations automatically migrate to GPT-5.5 | Custom GPTs on GPT-4.5: auto-migrated to GPT-5.5

Best for: ChatGPT users and administrators maintaining custom GPTs — verify behaviour after the migration

Improved Memory — continued enterprise rollout

Platform: ChatGPT Business, Enterprise, and Edu

OpenAI continued the staged rollout of improved Memory for Business, Enterprise, and Edu customers. Rather than relying only on manually saved memories, ChatGPT can now reference relevant information from previous conversations to keep its understanding current, while allowing users to review, edit, or delete remembered information. Enterprise workspaces remain in the early-access phase, with administrators able to enable or disable the feature before wider default rollout.

Technical details

Available: Business, Enterprise, Edu | Controls: memory summary, source review, deletion, opt-out | Enterprise rollout: staged early access continues | Codex memory unaffected

Best for: Organisations using ChatGPT across long-running projects where context continuity improves productivity

Codex Micro — hardware accessory teased

Teased: June 29–30, 2026 | Launch: July 15 | Partner: Work Louder

OpenAI teased Codex Micro, developed in partnership with keyboard manufacturer Work Louder. Unlike the separate consumer AI device in development with Jony Ive, Codex Micro is designed specifically for Codex users — a programmable shortcut device similar to a macro keyboard that gives developers quick access to Codex workflows. This is OpenAI's first dedicated hardware accessory built specifically around one of its software products. Full details to be announced July 15.

Technical details

Product: Codex Micro | Partner: Work Louder | Category: programmable macro controller | Launch: July 15 | Purpose: faster Codex interaction and shortcut workflows

Best for: Developers who use Codex extensively throughout the day

Codex CLI 0.142.3

Released: June 27, 2026

Codex CLI received a small maintenance release with no new user-facing functionality or breaking changes. Bug fixes and maintenance work only.

Technical details

Version: 0.142.3 | Changes: maintenance only | No API changes | No new features

Best for: Existing Codex CLI users — update to remain current

Plans and Pricing

No significant pricing changes were announced between June 26 and July 3. The most relevant operational changes remain the enterprise billing controls introduced last week, the ongoing improved Memory rollout, and the simplified Business model picker.

Technical details

Pricing: unchanged | Models: GPT-4.5 retired from ChatGPT | Business: simplified reasoning picker | Enterprise: Memory rollout continues | Codex CLI: updated to 0.142.3

Best for: No immediate action required. Business customers should familiarise themselves with the new reasoning-level picker; enterprise admins may wish to evaluate improved Memory before wider rollout.


Gemini (Google)

The past week brought a major consolidation of Google's AI ecosystem. Google DeepMind launched Gemini 3.1 Flash-Lite Image, a highly optimised sub-millisecond text-to-image model. Google Cloud deprecated Vertex AI as a standalone brand, absorbing all subsequent model deployments and enterprise developer features into the newly launched Gemini Enterprise Agent Platform. On the consumer front, the "Gemini Spark" update introduced a native macOS application with local file automation and third-party extensions.

Gemini 3.1 Flash-Lite Image — new visual generation model

Announced: June 30, 2026 | Model string: gemini-3.1-flash-lite-image | Also known as: nano-banana-2-lite (consumer UI) | Availability: Google AI Studio, API, consumer tiers

Google DeepMind expanded its native multimodal architecture with Gemini 3.1 Flash-Lite Image, engineered to bridge high-fidelity visual generation with strict operational cost constraints. Designed specifically for rapid, iterative text-to-image generation, real-time localised text rendering, and multi-turn visual editing, it addresses the economic bottlenecks enterprises face when deploying visual AI at scale. In consumer-facing applications, it is deployed under the name "Nano Banana 2 Lite."

★ What's new

Sub-millisecond processing speeds and drastically lower compute requirements than Gemini 3 Pro Image. Supports a 1M-token input context window. Output: 4k-token image matrices paired with up to 64k tokens of explanatory text. Exceptional performance in internationalised text rendering across non-Latin scripts and in character consistency across sequential edits. Pricing: approximately 80% lower cost per thousand generated matrices than gemini-3-pro-image, with the same input/output token cost ratio as the standard text-based gemini-3.1-flash-lite model.

Technical details

Model strings: gemini-3.1-flash-lite-image / nano-banana-2-lite | Availability: live in AI Studio, web/mobile consumer tiers, and select API regions | Context: 1M tokens input | Output: 4k tokens for image matrices, 64k tokens for text | Training: Google TPU clusters | Evaluation: Side-by-Side (SxS) human Elo for T2I prompt adherence, internationalised text rendering, and multi-turn character consistency

Best for: Enterprise developers building high-volume creative applications, digital marketing automation, and interactive educational platforms requiring low-latency visual generation

Gemini Spark — native macOS app and third-party extensions

Announced: June 30, 2026 | Platform: macOS (native), Web, iOS, Android | Availability: Google AI Ultra subscribers (US beta); extensions rolling out globally

Google launched a native macOS application under the "Gemini Spark" framework, granting the assistant secure, explicit permissions to interface directly with local desktop environments. Users can command Gemini Spark to manage files across local directories, parse financial data, and automatically update Google Workspace spreadsheets — or issue multi-step instructions via mobile to their remote Mac. Third-party extensions went live for Canva, Dropbox, Instacart, OpenTable, and Zillow Rentals, alongside native integrations with Google Tasks and Keep.

★ What's new

macOS native client with local file system access (user-defined directory gating). Upcoming: remote execution pipeline — issue instructions from mobile to remote Mac hardware (retrieve sales reports, extract metrics, email summaries). Background task orchestration with automated real-time polling of web hooks, RSS feeds, financial tickers, and sports data streams. MCP support: developers can build and expose custom local or remote MCP servers directly to the Gemini Spark client, enabling custom internal enterprise applications to be called natively within the Gemini chat interface. New third-party extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals. Native integrations: Google Tasks, Google Keep.

Technical details

Platform: macOS native (Beta), Web, iOS, Android | Model: Gemini 3.1 Pro and Gemini 3.1 Flash-Lite | Availability: US Beta for AI Ultra subscribers; extensions rolling out over seven-day window | Local file system access: user-defined directory gating | Real-time polling: web hooks, RSS, financial tickers, sports data | MCP: custom local or remote MCP server support | Extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals

Best for: Power users and executives in Apple ecosystems who need a proactive AI assistant capable of automating local cross-application workflows and tracking real-time data

Gemini Enterprise Agent Platform — Vertex AI absorbed and rebranded

Announced: June 30, 2026 | Platform: Google Cloud | Availability: GA for all Google Cloud enterprise accounts

Google Cloud deprecated Vertex AI as a standalone brand, absorbing all machine learning models, developer suites, and cognitive roadmaps into the unified Gemini Enterprise Agent Platform. The platform is architected around four pillars — build, scale, govern, and optimise — shifting the enterprise value proposition from model fine-tuning to deploying highly autonomous, long-running agent workflows. The Agent Designer provides a visual no-code/low-code flowchart environment for designing trigger-based operational paths. The Agent Inbox provides a centralised operations room categorising agent activity into "Needs your input," "Errors," and "Completed."

★ What's new

Vertex AI brand retired; all enterprise ML workloads now under the Gemini Enterprise Agent Platform. Agent Designer: visual no-code/low-code flowchart environment for designing trigger-based autonomous agent paths that run in isolated cloud sandboxes. Agent Inbox: centralised operations room for prioritising and auditing agent activity. Projects: strictly walls off an agent's memory to assigned datasets, Drive repositories, and Group chats — enforcing clean data boundaries across departments. Skills: custom macros via @mentions to execute deterministic tasks across the enterprise workspace. Agent Gallery: out-of-the-box connectors for Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, and ServiceNow.

Technical details

Platform: Gemini Enterprise Agent Platform (formerly Vertex AI infrastructure) | Models: Gemini 3.1 Pro, Flash, Flash-Lite | Availability: GA globally for Google Cloud Enterprise | Key changes: complete absorption of Vertex AI into unified Agentic SDK; secure cloud sandboxing for runtime code execution; visual flowchart compilation into execution graphs; native enterprise data schema mapping | Agent Gallery connectors: Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, ServiceNow

Best for: CIOs, enterprise architects, and operations leaders seeking to deploy secure, autonomous, and governed AI workforces for end-to-end business workflows

Remote MCP server for Gemini Enterprise Agent Platform

Announced: June 30, 2026 | Availability: GA across all Google Cloud projects with Agent Platform API enabled

Google deployed a remote Model Context Protocol (MCP) server within the Gemini Enterprise Agent Platform, allowing external development frameworks such as Claude Code to interact directly and safely with Google Cloud environments. The connection runs entirely within Google Cloud's secure infrastructure and integrates natively with Cloud IAM, so security administrators can apply standard IAM Deny policies to requests from external agents.

★ What's new

External coding agents can natively call foundation models from Google's Model Garden, pull approved internal prompt templates, or manage notebooks within an active project. Implementation requires three steps: enable the Agent Platform API in Google Cloud Console, configure the local client JSON payload to point to the remote Google Cloud MCP endpoint, and copy the provided Toolset Endpoints into the external IDE workspace. Native Cloud IAM integration handles enterprise token exchange and session validation automatically at the boundary layer.
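The second of the three steps, pointing a local client at the remote endpoint, might look like the sketch below. Treat it as an assumption-laden illustration: the endpoint URL is a placeholder, the server name `gemini-agent-platform` is invented, and the `mcpServers` layout follows a convention common among MCP clients rather than anything Google has published here. The real endpoint and Toolset Endpoints come from the Google Cloud Console.

```python
import json

# Hypothetical placeholder; the real value is provided by Google Cloud.
REMOTE_MCP_ENDPOINT = "https://example.invalid/agent-platform/mcp"


def build_client_config(endpoint: str, server_name: str = "gemini-agent-platform") -> dict:
    """Return a client config that points one named server at a remote MCP endpoint."""
    if not endpoint.startswith("https://"):
        raise ValueError("remote MCP endpoint should use HTTPS")
    # Layout assumed from common MCP client conventions, not from Google's docs.
    return {"mcpServers": {server_name: {"type": "http", "url": endpoint}}}


config = build_client_config(REMOTE_MCP_ENDPOINT)
config_json = json.dumps(config, indent=2)
print(config_json)
```

Authentication is deliberately absent from the sketch, since the announcement says Cloud IAM handles token exchange and session validation at the boundary.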

Technical details

Platform: Google Cloud IAM / Agent Platform API | Protocol: open MCP specification | Availability: auto-activated on enabling Agent Platform API | Key changes: native hosting of remote MCP server endpoints within Google Cloud; centralised asset cataloguing via Agent Registry; enforces IAM Deny policies for external agent requests | Security: runs within Google Cloud's secure infrastructure, IAM-governed

Best for: DevOps engineers and full-stack development teams wanting to use advanced third-party coding agents without compromising enterprise data governance

Native VPC Service Controls — agent data isolation

Announced: June 27, 2026 | Availability: GA for all enterprise accounts using advanced agentic workflows

Google announced native integration of Virtual Private Cloud (VPC) Service Controls directly into the Gemini Enterprise Agent Platform, establishing explicit network perimeters around autonomous agent workloads. When a long-running agent executes code in its cloud sandbox or connects to internal data repositories (Drive, BigQuery), VPC Service Controls ensure data remains strictly within a designated trusted boundary.
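The perimeter idea can be modelled conceptually in a few lines. This is a toy model and not the VPC Service Controls API: real perimeters are configured in Google Cloud rather than in agent code, and the `Perimeter` class, resource names, and `filter_destinations` helper are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, List, Tuple


@dataclass(frozen=True)
class Perimeter:
    """Toy stand-in for a service perimeter: a named set of trusted resources."""
    name: str
    trusted: FrozenSet[str] = field(default_factory=frozenset)

    def allows(self, source: str, destination: str) -> bool:
        # A transfer is allowed only when both ends sit inside the perimeter.
        return source in self.trusted and destination in self.trusted


def filter_destinations(
    perimeter: Perimeter, source: str, destinations: List[str]
) -> Tuple[List[str], List[str]]:
    """Split an agent's requested destinations into allowed and blocked lists."""
    allowed = [d for d in destinations if perimeter.allows(source, d)]
    blocked = [d for d in destinations if not perimeter.allows(source, d)]
    return allowed, blocked


# Hypothetical resources: a sandboxed agent reading BigQuery and attempting an upload.
perimeter = Perimeter(
    "finance-perimeter",
    frozenset({"agent-sandbox", "bigquery:ledger", "drive:finance"}),
)
allowed, blocked = filter_destinations(
    perimeter, "agent-sandbox", ["bigquery:ledger", "https://external.example/upload"]
)
print("allowed:", allowed)
print("blocked:", blocked)
```

The point of the sketch is the default: anything not explicitly inside the trusted boundary is treated as blocked egress.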

Technical details

Platform: Google Cloud Security Architecture | Availability: globally via Google Cloud Console perimeter configuration | Key changes: extension of VPC Service Controls to cover Agent Platform API endpoints; enforcement of cryptographic identities for agents moving across network perimeters; automated blocking of unverified external data egress from agent tool-calling loops | Update path: existing VPC perimeters can be updated to include the Gemini Enterprise Agent Platform service identity without modifying agent code

Best for: CISOs, compliance officers, and network security engineers requiring absolute data isolation during deployment of autonomous enterprise agents, particularly in banking, healthcare, and government

Workspace Feature Drop — Formula Fixer, iterative Gmail drafting

Announced: June 25, 2026 | Platform: Google Workspace (Gmail, Google Sheets, Google Drive) | Availability: AI Ultra, Pro, Plus, and Workspace Enterprise Plus on Gemini Alpha track

Google delivered its latest Workspace Feature Drop, embedding deeper context-aware Gemini capabilities across core productivity applications. In Sheets, the Gemini Formula Fixer diagnoses formula syntax or logic faults, explains the correction in plain language, and updates the formula inline. In Gmail, the "Help me write" feature was upgraded with a persistent instruction bar for iterative refinements — issue direct edits like "add the project deadline to line three" without rewriting the entire prompt. Spreadsheet localisation expanded to 28 new languages.

★ What's new

Gemini Formula Fixer (Sheets): diagnoses syntax/logic errors and updates formulas inline. Iterative Prompt Refinement (Gmail): persistent instruction bar for surgical text edits without rewriting the full prompt. 28-language sheet localisation (Spanish, Japanese, French, German, Korean, and more). AI Inbox (Alpha Track only): high-volume executive triage.

Technical details

Platform: Google Workspace | Model: Gemini 3.1 Pro integration | Availability: live for premium tiers, rolling out to enterprise | Sheets: multi-column context analysis, AST-based formula debugging engine | Gmail: stateless partial-string editing in the drafting API | Localisation: 28 new languages with expanded global character set tokenizer arrays | AI Inbox: strictly gated to Gemini Alpha track

Best for: Financial analysts, project managers, and operations personnel heavily reliant on Google Workspace who need rapid data troubleshooting and precise communication drafting

Plans and Pricing

Gemini Enterprise Agent Platform deprecates Vertex AI's standalone billing model: all subsequent enterprise ML workloads are invoiced under the unified platform ledger. Agent execution within secure cloud sandboxes incurs standard Google Cloud compute-per-minute charges, and native VPC Service Controls protection carries no additional compliance premium. Remote MCP server access is provided at zero licensing cost and is billed only on the volume of underlying model and tool calls. Gemini Spark macOS and real-time background tracking capabilities remain tied to the Google AI Ultra subscription ($20/month). A new Google Home Premium tier, at $10/month, covers advanced voice agent capabilities and continuous video history analysis for consumer IoT hardware.

Technical details

Gemini Enterprise Agent Platform: replaces Vertex AI standalone billing | gemini-3.1-flash-lite-image: ~80% cost reduction vs gemini-3-pro-image per thousand generated images | Cloud sandbox execution: standard Google Cloud compute-per-minute overhead | Remote MCP: zero licensing cost; billed on model and tool call volume | Google AI Ultra: $20/month (Gemini Spark macOS + background tracking) | Google Home Premium: $10/month (advanced voice agent + continuous video history for IoT)
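To see what an approximately 80% reduction means for a budget, the arithmetic can be worked through as below. The digest gives only the percentage, so the baseline price and monthly volume are hypothetical figures in arbitrary currency units, chosen purely to make the calculation easy to follow.

```python
def cost_after_reduction(baseline: float, reduction_pct: int) -> float:
    """Return the cost left after a percentage reduction (80 means 80% cheaper)."""
    if not 0 <= reduction_pct <= 100:
        raise ValueError("reduction_pct must be between 0 and 100")
    return baseline * (100 - reduction_pct) / 100


# Hypothetical inputs: the digest does not state actual per-thousand prices.
BASELINE_PER_1K = 100.0   # assumed gemini-3-pro-image cost per thousand outputs
MONTHLY_OUTPUTS = 50_000  # assumed monthly generation volume

flash_lite_per_1k = cost_after_reduction(BASELINE_PER_1K, 80)
thousands = MONTHLY_OUTPUTS / 1000
baseline_monthly = thousands * BASELINE_PER_1K
flash_lite_monthly = thousands * flash_lite_per_1k
monthly_saving = baseline_monthly - flash_lite_monthly

print(f"Flash-Lite per thousand: {flash_lite_per_1k}")
print(f"Monthly baseline: {baseline_monthly}, Flash-Lite: {flash_lite_monthly}")
print(f"Monthly saving: {monthly_saving}")
```

Because the reduction is a fixed percentage, the saving scales linearly with volume; the same function applies to any real price once it is known.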

Best for: Enterprise IT leaders shifting high-frequency visual tasks to Flash-Lite architecture and integrating external engineering IDEs into Google Cloud via open-standard MCP channels


Microsoft Copilot

Dateline: July 3, 2026 | Next update: July 10, 2026

Copilot Cowork reached worldwide general availability on June 30, with automatic model selection between GPT-5.5 Thinking (deep reasoning), Claude (structured/visual work), and GPT-5.5 Instant (lightweight tasks). Productivity apps gained new customisation and brand tools, Copilot Vision was previewed for screen-based insights, and admins gained stronger governance with Purview and cost dashboards.

⚠️ Copilot Cowork — worldwide GA, model choice, expanded plugins

GA: June 30, 2026 | Status: globally available | Model auto-selection: live

Cowork is now generally available worldwide. Users can toggle between Chat and Cowork in the Copilot app. Cowork executes tasks end-to-end, returning deliverables rather than drafts. The headline new feature is automatic model selection: without user intervention, Cowork now chooses among GPT-5.5 Thinking for deep reasoning, Claude for structured/visual work, and GPT-5.5 Instant for lightweight tasks.
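The routing idea can be illustrated with a short sketch. Microsoft's actual selection logic is not described in this digest, so everything here is an assumption: the `Task` fields, the order in which the checks run, and the fallback to the lightweight model are hypothetical. Only the three model names and their task categories come from the announcement.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical task descriptor; Cowork's real routing inputs are not public here."""
    description: str
    needs_deep_reasoning: bool = False
    structured_or_visual: bool = False


def select_model(task: Task) -> str:
    """Pick a model name using the three task categories named in the digest."""
    if task.needs_deep_reasoning:
        return "GPT-5.5 Thinking"
    if task.structured_or_visual:
        return "Claude"
    return "GPT-5.5 Instant"  # assumed default for lightweight tasks


# Hypothetical tasks, one per category.
tasks = [
    Task("Reconcile quarterly forecasts across regions", needs_deep_reasoning=True),
    Task("Build a slide deck from the sales report", structured_or_visual=True),
    Task("Rename the attached files"),
]
for task in tasks:
    print(f"{task.description} -> {select_model(task)}")
```

A production router would presumably classify tasks automatically rather than rely on hand-set flags; the flags simply keep the example deterministic.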

★ What's new

Worldwide GA: all enterprise tenants. Model auto-selection: Cowork automatically routes between GPT-5.5 Thinking, Claude, and GPT-5.5 Instant based on task type. Expanded plugin ecosystem: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, plus Databricks via sideloading. Microsoft Fabric and Dynamics 365 apps now integrated. Cost Management Dashboard in admin center: admins can monitor Cowork credit usage, budgets, and spend. Customize tab for custom skills.

Technical details

Cowork: usage-based billing, Cost Management Dashboard in admin center | Model auto-selection: GPT-5.5 Thinking / Claude / GPT-5.5 Instant | Plugin store expanded: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, Databricks | Microsoft Fabric + Dynamics 365: integrated | Customize tab: custom skills

Best for: Enterprises orchestrating complex workflows with tailored model selection and plugin integration

Copilot in Apps — Word, Excel, Outlook, PowerPoint

★ What's new

Word: model choice now available; Copilot Catchup content card; image creation; iOS agentic capabilities; preserves chat history across apps; applies edits based on comments. Excel: Skills, personalisation, and rules; Planner Agent availability; deeper reasoning over Power BI enterprise data. Outlook: Compose canvas refinement; Copilot settings in Outlook Classic rolling out next month. PowerPoint: Brand Kit Picker for approved templates; reusable Copilot skills; references to SharePoint libraries and OneDrive folders.

Technical details

Word: model choice, Catchup card, image creation, iOS agentic, cross-app chat history, comment-based edits | Excel: Skills, personalisation, rules, Planner Agent, Power BI reasoning | Outlook: Compose canvas, Classic settings next month | PowerPoint: Brand Kit Picker, reusable skills, SharePoint/OneDrive references

Best for: Teams needing consistent brand enforcement, deeper data reasoning, and cross-app continuity

Governance and Compliance — Admin Center, Purview

★ What's new

Cost Management Dashboard: admins can monitor Cowork credit usage, budgets, and spend in the admin center. Purview DLP controls: email restriction and Cowork governance added. Federated Connectors GA: secure external app integration with staged rollout.

Technical details

Cost Management Dashboard: Cowork credits, budget monitoring, spend tracking | Purview DLP: email restriction, Cowork governance | Federated Connectors: GA, staged rollout

Best for: IT admins managing spend, compliance, and connector governance

Copilot Vision — screen-based insights

Announced: June 29, 2026 | Status: early rollout in Frontier tenants

Copilot Vision generates insights based on what is visible on a user's screen. It is in early rollout for Frontier tenants.

Best for: Analysts and researchers needing contextual screen-based insights

Copilot Notebooks — expanded references

★ What's new

Outlook emails can now be added as references in Copilot Notebooks. Notebooks are now available to all Copilot Chat users, not just licensed Microsoft 365 Copilot users.

Best for: Knowledge workers consolidating references across Outlook, SharePoint, and OneDrive

Plans and Pricing

Cowork: usage-based billing with admin Cost Management Dashboard. Claude and GPT-5.5 Thinking are included in the Pro and Enterprise tiers. GPT-5.5 Instant is available as a lower-cost tier. Plugins are included with Cowork GA.

Technical details

Cowork: usage-based billing, admin Cost Management Dashboard | Models: Claude + GPT-5.5 Thinking (Pro + Enterprise), GPT-5.5 Instant (lower-cost) | Plugins: included with Cowork GA | Copilot Notebooks: expanded to all Copilot Chat users

Best for: Enterprises balancing cost with model flexibility and plugin expansion


Filed under: AI Weekly Digest
First published: Jul 3, 2026
