AI Weekly Digest: 26 June – 3 July 2026

The Fable 5 crisis resolves — US Department of Commerce lifts export controls June 30, Anthropic restores global access July 1 with a new classifier blocking the cited jailbreak; Claude Sonnet 5 launches June 30 as the new default for Free and Pro at introductory $2/$10 per MTok; Claude in Microsoft Foundry goes GA; self-hosted gateway ships; OpenAI publishes first large-scale agentic-work research, teases Codex Micro hardware, and retires GPT-4.5 from ChatGPT; Google rebrands Vertex AI into the Gemini Enterprise Agent Platform and launches Gemini 3.1 Flash-Lite Image; Microsoft ships Copilot Cowork worldwide GA with automatic model selection between GPT-5.5 Thinking, Claude, and GPT-5.5 Instant.

Period: 26 June – 3 July 2026 | Published: Jul 3, 2026 | Covers: Anthropic · OpenAI · Gemini · Copilot

Dateline: July 3, 2026 | Next update: July 10, 2026

The Fable 5 crisis resolved this week in three closely timed moves: the US Department of Commerce lifted the export-control directive on June 30, Anthropic launched Claude Sonnet 5 the same day as a near-Opus-quality model at Sonnet pricing, and Anthropic restored global access to Fable 5 on July 1 with enhanced safeguards. The Fable 5 restoration and the Sonnet 5 launch are the two biggest product events since the Fable 5 launch itself. Alongside them: Claude in Microsoft Foundry went GA, a self-hosted gateway launched for enterprises, Enterprise admin analytics were overhauled, and Claude Code shipped a major weekly update including Claude in Chrome going GA and background PR auto-open.

Claude / Anthropic

✅ Fable 5 + Mythos 5 — export controls lifted, access restored

Export controls lifted: June 30, 2026 | Access restored: July 1, 2026 | Days suspended: 19 | Enhanced safeguards: yes

On June 30, the US Department of Commerce lifted the export-control directive on Claude Fable 5 and Mythos 5 — ending the 19-day global suspension. Anthropic restored global access to Fable 5 on July 1 across the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. The resolution came with a package of enhanced cybersecurity safeguards developed in coordination with the government. Mythos 5 remains partially restricted: access has been restored for a set of US organisations following government approval on June 26, with expansion to the broader Glasswing programme ongoing.

✅ Resolved

Fable 5 is back globally from July 1 on the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. Access on AWS, Google Cloud, and Microsoft Foundry is being restored as quickly as possible. Enhanced safeguards shipped with restoration: the specific jailbreak technique cited by the government is now blocked in more than 99% of cases by a new classifier layer. Trade-off: slightly more sensitive safety routing, with a higher rate of false positives on legitimate security development and debugging tasks. Anthropic is proposing an industry-wide jailbreak severity framework (with Amazon, Microsoft, Google, and Glasswing partners) built on four criteria: capability gain provided to the attacker, scope of that gain, ease of weaponisation, and discoverability. HackerOne bug-bounty programme launched for reporting cybersecurity jailbreaks. Subscription terms: Fable 5 is available for up to 50% of weekly usage limits on Pro, Max, Team, and select Enterprise plans through July 7, after which usage credits are required.

Technical details

Model strings restored: claude-fable-5 (global), claude-mythos-5 (US Glasswing orgs only) | Enhanced classifier: blocks cited jailbreak technique >99% | Subscription window: Fable 5 included up to 50% of weekly limits through July 7; usage credits from July 8 | AWS/GCP/Foundry: restoration in progress | HackerOne: bug-bounty programme for cybersecurity jailbreaks | Jailbreak severity framework: proposed with Amazon, Microsoft, Google, Glasswing partners | Tokenizer note: same text produces ~30% more tokens than pre-Opus-4.7 models

Best for: All users. Fable 5 is back and included for up to 50% of weekly usage limits through July 7. API developers: restore claude-fable-5 in model strings and budget for the newer tokenizer (~30% more tokens for the same text).

Claude Sonnet 5 — new default for Free and Pro

Launch: June 30, 2026 | Model string: claude-sonnet-5 | Introductory pricing: $2/$10 per MTok through August 31 | Standard from Sept 1: $3/$15 per MTok | Availability: all plans + API + Bedrock + Vertex AI + Microsoft Foundry

Claude Sonnet 5 is the most significant mid-tier model Anthropic has shipped. It replaces Sonnet 4.6 as the default for Free and Pro users and delivers near-Opus-4.8 agentic performance at Sonnet pricing. It is the first Sonnet model built primarily around autonomous, multi-step tasks: planning, tool use, browser and terminal control, and self-verification without prompting. Introductory API pricing at $2/$10 per MTok makes it 60% cheaper per token than Opus 4.8 ($5/$25 per MTok) at launch.

★ What's new

Claude Sonnet 5 launches June 30 as the new default for Free and Pro. Key benchmarks (Anthropic's own figures): SWE-bench Verified 72.7% (vs Sonnet 4.6: 62.3%, Opus 4.8: 79.4%); Terminal-Bench 76.1% (vs Sonnet 4.6: 55.4%, +20.7 points — the largest single benchmark jump of any Sonnet launch); BrowseComp (agentic search) and OSWorld-Verified (computer use) both approach Opus 4.8 at medium effort. Safety: lower hallucination, sycophancy, and undesirable-agent-behaviour rates than Sonnet 4.6; refuses malicious requests more consistently; resists prompt injection more reliably. Cyber safeguards enabled by default. In a Mozilla Firefox test, Sonnet 5 never produced a working exploit despite multiple attempts. Sonnet 5 is included in the Cyber Verification Program on the native Claude Platform, Claude Platform on AWS, and Claude in Microsoft Foundry (Vertex AI coming soon). Tokenizer note: Sonnet 5 uses the same newer tokenizer as Opus 4.7/4.8 and Fable 5 — approximately 30% more tokens for the same text. Audit your token budgets before switching production pipelines.

Technical details

Model string: claude-sonnet-5 | Context: 1M tokens | Max output: 128k tokens | Introductory pricing: $2/$10 per MTok through August 31, 2026 | Standard pricing from September 1: $3/$15 per MTok | Tokenizer: newer tokenizer (~30% more tokens than Sonnet 4.6 for same text) | BrowseComp chart corrected June 30: updated chart uses 10M token budget with compaction and programmatic tool calling | Sonnet 4.6 deprecation: no date announced yet | Claude Code: update to v2.1.197 for Sonnet 5 as default
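The tokenizer note and the pricing figures can be combined into a rough budget check before switching a pipeline. The sketch below is illustrative only: the 1.30 multiplier and the per-MTok rates are the approximate figures quoted in this digest, the dictionary keys are labels rather than model strings, and the function names are hypothetical.

```python
# Rough budget check for moving a workload to claude-sonnet-5.
# Assumptions: ~30% token inflation and USD-per-million-token prices,
# both taken from this digest and both approximate.

TOKENIZER_INFLATION = 1.30

PRICES = {  # (input, output) in USD per million tokens; keys are labels only
    "sonnet-5-intro": (2.00, 10.00),     # through August 31, 2026
    "sonnet-5-standard": (3.00, 15.00),  # from September 1, 2026
    "opus-4.8": (5.00, 25.00),
}


def inflate_tokens(old_tokens: int, factor: float = TOKENIZER_INFLATION) -> int:
    """Estimate the token count for the same text under the newer tokenizer."""
    return round(old_tokens * factor)


def cost_usd(input_tokens: int, output_tokens: int, tier: str) -> float:
    """Cost in USD for a token volume at the given pricing tier."""
    in_price, out_price = PRICES[tier]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def saving_pct(tier: str, baseline: str) -> tuple[float, float]:
    """Percentage saving on (input, output) rates relative to a baseline tier."""
    t_in, t_out = PRICES[tier]
    b_in, b_out = PRICES[baseline]
    return (round(100 * (1 - t_in / b_in), 1), round(100 * (1 - t_out / b_out), 1))


# Example: a monthly volume measured with the older tokenizer.
new_in = inflate_tokens(10_000_000)
new_out = inflate_tokens(2_000_000)
print(new_in, new_out)                                # 13000000 2600000
print(cost_usd(new_in, new_out, "sonnet-5-intro"))     # 52.0
print(cost_usd(new_in, new_out, "sonnet-5-standard"))  # 78.0
print(saving_pct("sonnet-5-intro", "opus-4.8"))
```

Real inflation varies with the text, so measured token counts from a sample of production prompts are a better input than the flat multiplier.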

Best for: The default everyday model for most users from June 30. Cost-sensitive agent workloads where Opus-class quality is overkill. High-volume coding, tool use, and knowledge work pipelines.

Rate limits increased — higher-effort model support

Effective: alongside Fable 5 restoration and Sonnet 5 launch | Applies to: Claude Platform, claude.ai, Claude Code, Cowork

Anthropic raised rate limits across Chat, Cowork, Claude Code, and the Claude Platform to accommodate the higher token usage that comes with higher effort levels and Fable 5's larger context use. On the API, Sonnet and Haiku rate limits were also raised at every usage tier, and the tier structure simplified to three tiers: Start, Build, and Scale.

★ What's new

Rate limits raised across all paid plans on Chat, Cowork, Claude Code, and the Claude Platform. API tier structure simplified to Start, Build, and Scale (from the previous multi-tier structure). Sonnet and Haiku API rate limits raised at every tier. View your tier and current limits in the Claude Console.

Technical details

API tiers: Start, Build, Scale (simplified) | Console: view tier and limits at Claude Console | Primary drivers: higher effort levels and Fable 5 higher context use

Best for: All paid users — no action needed. API developers: check the new tier documentation in Claude Console.

Claude in Microsoft Foundry — generally available

GA date: June 29, 2026 | Platform: Azure | Availability: enterprise customers globally

Claude in Microsoft Foundry went generally available on June 29, giving Azure enterprise customers a production-ready path to Claude with Azure-native authentication (Entra ID), billing, networking, governance, and data residency controls. Two hosting options are available: Hosted on Azure (inference processed inside Azure, with US and global data zone options) and Hosted on Anthropic (full Claude API feature set). Sonnet 5 is available in Foundry from day one.

★ What's new

Claude in Microsoft Foundry is now generally available. Hosted on Azure: inference in Azure, Azure authentication, billing, governance, US and global data zones, Anthropic as data processor. Hosted on Anthropic: full Claude API feature set, previously called Foundry Preview. Claude Sonnet 5 available in Foundry at launch on July 1. NVIDIA GB300 GPU support confirmed for Foundry. Foundry Agent Service can use Sonnet 5 as the reasoning core for multi-step planning, tool use, and task execution across enterprise systems. Feature and model parity between the two hosting options is the roadmap goal.

Technical details

GA: June 29, 2026 | Hosted on Azure: Entra ID auth, Azure Marketplace billing, US + global data zones | Hosted on Anthropic: full API features (previously Foundry Preview) | Models: Sonnet 5 (GA July 1), Opus 4.8, Haiku 4.5 | NVIDIA: GB300 GPU confirmed | Billing: Claude Consumption Units (CCUs) via Azure Marketplace | Fable 5 on Foundry: coming as part of post-suspension restoration rollout

Best for: Azure enterprise customers needing Claude with existing Azure identity, billing, and governance controls

Claude self-hosted gateway — on-premise API infrastructure

Platform: Claude Platform | Availability: enterprise | Ships inside the claude binary

Anthropic launched a self-hosted gateway — an enterprise-grade API gateway for Claude that runs inside the same claude binary developers already install. Enterprises can run it in a single stateless container on their own infrastructure, with their own network policies, identity provider (OIDC: Google Workspace, Microsoft Entra ID, Okta), and telemetry stack. This is a direct response to enterprises that need Claude's API features but cannot route traffic through Anthropic's public endpoints due to security or compliance requirements.

★ What's new

Self-hosted gateway ships inside the claude binary as a single stateless container. Identity: acts as an OIDC relying party against Google Workspace, Microsoft Entra ID, Okta, or any standards-compliant OIDC provider — issues short-lived sessions, no long-lived secrets on developer machines. Policy: managed settings defined once on the server; clients receive policy at sign-in and the gateway enforces it on every request (allowed models, default settings). Telemetry: the client stamps a usage metric on every request; the gateway relays it over OTLP to a collector you configure in your own network and on your own retention schedule.

Technical details

Deployment: single stateless container | Ships in: claude binary | Identity: OIDC RP for Google Workspace, Microsoft Entra ID, Okta, any standards-compliant OIDC provider | Session: short-lived, no long-lived secrets | Policy: managed settings, delivered to clients at sign-in and enforced by the gateway on every request | Telemetry: OTLP to a collector you configure | Complements: Claude Platform on AWS (AWS-native), Claude in Microsoft Foundry (Azure-native), WIF (keyless auth for CI/CD)
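The policy model (allowed models plus default settings, checked on every request) can be pictured with a small sketch. Everything here is hypothetical: the digest does not describe the gateway's configuration format or internals, so ManagedPolicy, PolicyViolation, and enforce are illustrative names, not the gateway's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ManagedPolicy:
    """Hypothetical stand-in for server-side managed settings."""
    allowed_models: frozenset
    default_model: str


class PolicyViolation(Exception):
    """Raised when a request names a model the policy does not allow."""


def enforce(policy: ManagedPolicy, request: dict) -> dict:
    """Return a copy of the request with the policy applied.

    A request without a model falls back to the policy default;
    a request for a disallowed model is rejected.
    """
    model = request.get("model") or policy.default_model
    if model not in policy.allowed_models:
        raise PolicyViolation(f"model {model!r} is not allowed by policy")
    return {**request, "model": model}


policy = ManagedPolicy(
    allowed_models=frozenset({"claude-sonnet-5", "claude-fable-5"}),
    default_model="claude-sonnet-5",
)
print(enforce(policy, {"messages": []})["model"])  # claude-sonnet-5
```

Checking at the gateway rather than on the client means a modified client cannot bypass the allowed-model list.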

Best for: Enterprises with strict data perimeter requirements — financial services and healthcare teams who cannot route traffic via Anthropic's public endpoints

Claude Enterprise — richer admin analytics, model entitlements, spend alerts

Platform: Claude Enterprise | Availability: Enterprise plans | Admin console

Enterprise admins gained substantially more visibility and control over Claude usage this week. The admin console now shows usage and cost breakdowns at the model, team, and user level, with spend-threshold alerts and model-level entitlements that let admins set which models specific groups can access.

★ What's new

Richer admin analytics: usage and cost breakdowns by model, team, and user in the admin console. Model-level entitlements: set model defaults and access permissions by group, not just organisation-wide. Spend alerts: configure spend-threshold notifications to catch overages before they land on the invoice. Analytics API: finance and IT teams can pull the same metrics into existing reporting systems programmatically. These controls complement existing spend caps, access and model routing, and the usage analytics dashboard already in Claude Enterprise.

Technical details

New in admin console: model-level cost breakdown, team/user breakdown, group-level model entitlements, spend-threshold alerts | Analytics API: same metrics as admin dashboard, programmatic access | Complements: existing spend caps, model routing, SCIM/RBAC, WIF | Plans: Claude Enterprise
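For teams pulling the same metrics into their own reporting, a spend-threshold check might look like the sketch below. The record shape is invented for illustration: the digest does not describe the Analytics API schema, so the field names (team, user, model, cost_usd) and both helper functions are assumptions.

```python
from collections import defaultdict

# Hypothetical usage records; the real Analytics API schema is not described here.
records = [
    {"team": "platform", "user": "ana", "model": "claude-sonnet-5", "cost_usd": 120.0},
    {"team": "platform", "user": "raj", "model": "claude-fable-5", "cost_usd": 410.0},
    {"team": "finance", "user": "li", "model": "claude-sonnet-5", "cost_usd": 35.5},
]


def cost_by(rows, key):
    """Sum cost_usd grouped by one dimension (team, user, or model)."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row["cost_usd"]
    return dict(totals)


def over_threshold(totals, threshold_usd):
    """Names whose total spend has reached the alert threshold, sorted."""
    return sorted(name for name, total in totals.items() if total >= threshold_usd)


team_totals = cost_by(records, "team")
print(over_threshold(team_totals, 500.0))  # ['platform']
```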

Best for: Enterprise IT admins and finance teams tracking Claude spend and enforcing model access policies by team

Claude Code — Claude in Chrome GA, background PR auto-open, /dataviz skill

Platform: terminal / VS Code / web / mobile / Chrome | Availability: all plans | Update to v2.1.197 for Sonnet 5

Claude Code had another packed week: Claude in Chrome graduated from beta to generally available; background agents now automatically commit, push, and open a draft PR when they finish code work in a worktree; Sonnet 5 is the new default in Claude Code from v2.1.197; and a /dataviz skill was added for chart and dashboard design guidance.

★ What's new

Claude in Chrome is now generally available: Claude can browse, click, fill forms, and act on web pages directly inside Chrome. Update to v2.1.197: Sonnet 5 is now the default model in Claude Code with introductory pricing of $2/$10 per MTok through August 31. Background PR auto-open: background agents launched from claude agents now commit, push, and open a draft PR automatically when they finish code work in a worktree, instead of stopping to ask. New /dataviz skill: chart and dashboard design guidance with a runnable colour-palette validator. Background agent notifications: sessions that need input or finish now fire the Notification hook (agent_needs_input / agent_completed). Claude Platform on AWS added as a gateway upstream provider (anthropicAws); model-not-found responses now advance the failover chain rather than erroring out. Built-in Explore agent now inherits the main session's model (capped at Opus) instead of always running on Haiku. Subagents and context compaction now inherit the session's extended thinking configuration. Org default models: admins set the default model in the org console; it shows as 'Org default' in /model. Fixed: brief network drops mid-response now retry with backoff instead of aborting the turn. Deprecated: Opus 4.7 fast mode, with removal on July 24.
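The retry fix follows a familiar pattern. The sketch below shows generic exponential backoff with jitter. It is not Claude Code's implementation, which the release notes do not describe, and the attempt count and delays are arbitrary illustrative defaults.

```python
import random
import time


def retry_with_backoff(call, *, attempts=4, base_delay=0.5, max_delay=8.0, sleep=time.sleep):
    """Call `call()` and retry transient network errors with exponential backoff."""
    for attempt in range(attempts):
        try:
            return call()
        except (ConnectionResetError, TimeoutError):
            if attempt == attempts - 1:
                raise  # out of retries: surface the original error
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Jitter spreads out retries from many clients hitting the same outage.
            sleep(delay + random.uniform(0, delay / 2))


# Demo: a fake call that drops twice, then succeeds. Sleeping is stubbed out.
outcomes = iter([ConnectionResetError(), ConnectionResetError(), "ok"])
waits = []


def flaky():
    item = next(outcomes)
    if isinstance(item, Exception):
        raise item
    return item


result = retry_with_backoff(flaky, sleep=waits.append)
print(result, len(waits))  # ok 2
```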

Technical details

Update required: v2.1.197 for Sonnet 5 default | Claude in Chrome: GA | Background PR: auto-commit, push, draft PR on worktree task completion | /dataviz: chart/dashboard design skill with colour-palette validator | Notification hook: agent_needs_input / agent_completed | Gateway: anthropicAws added; model-not-found advances failover chain | Org default model: set in org console, shown as 'Org default' in /model | Fixed: ECONNRESET and transient network drops now retry with backoff | Fixed: excessive background classifier requests on repeated network host access | Fixed: background tasks stuck on 'Running' after finish or session resume | Opus 4.7 fast mode deprecated, removal July 24
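The failover change (a model-not-found response advances to the next upstream instead of ending the request) can be sketched as follows. Only the anthropicAws provider name comes from the release notes. The "anthropic" upstream name, the ModelNotFound exception, and the fake catalogue are assumptions made for the demo.

```python
class ModelNotFound(Exception):
    """Hypothetical error for an upstream that does not serve the requested model."""


def send_with_failover(upstreams, send, model):
    """Try each upstream in order; on model-not-found, move on to the next one."""
    skipped = []
    for upstream in upstreams:
        try:
            return upstream, send(upstream, model)
        except ModelNotFound:
            skipped.append(upstream)  # advance the chain instead of erroring out
    raise ModelNotFound(f"{model!r} not found on any upstream: {skipped}")


def fake_send(upstream, model):
    """Stand-in for a real request; the catalogue below is invented for the demo."""
    catalogue = {
        "anthropicAws": {"claude-sonnet-5"},
        "anthropic": {"claude-sonnet-5", "claude-fable-5"},
    }
    if model not in catalogue[upstream]:
        raise ModelNotFound(model)
    return f"{model} via {upstream}"


used, reply = send_with_failover(["anthropicAws", "anthropic"], fake_send, "claude-fable-5")
print(used)  # anthropic
```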

Best for: All Claude Code users — update to v2.1.197 immediately for Sonnet 5 default. Chrome users: Claude in Chrome is now GA for production use.

API — Opus 4.7 fast mode deprecated, MCP tunnels migrated, SDK updates

Platform: Claude API | Effective: various dates this week

Three housekeeping API updates this week. Opus 4.7 fast mode is deprecated and will be removed July 24. The MCP tunnels management API moved to a new surface. And all major language SDKs now include support for the latest code execution tool version.

★ What's new

Opus 4.7 fast mode deprecated: removal July 24, 2026. After removal, requests to claude-opus-4-7 with speed: 'fast' will return an error. Migrate to Opus 4.8 fast mode. Opus 4.6 fast mode already removed as of June 29. MCP tunnels API migrated: management API moved from /v1/organizations/tunnels on the Admin API to /v1/tunnels on the Claude API; new header: anthropic-beta: mcp-tunnels-2026-06-22; new WIF scope: workspace:manage_tunnels. Previous surface remains available during migration window. SDK update: Python, TypeScript, Go, Java, Ruby, PHP, and C# SDKs now include support for code_execution_20260120 (REPL state persistence, minimum version for programmatic tool calling). No beta header required — set type to code_execution_20260120.

Technical details

Opus 4.7 fast mode removal: July 24, 2026 | Opus 4.6 fast mode: already removed June 29 | MCP tunnels new endpoint: /v1/tunnels on Claude API | MCP tunnels header: anthropic-beta: mcp-tunnels-2026-06-22 | MCP tunnels WIF scope: workspace:manage_tunnels | Migration window: previous Admin API surface still available temporarily | code_execution_20260120: REPL state persistence, programmatic tool calling, no beta header, all major SDKs updated | Next model retirement: Opus 4.1, August 5, 2026
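The tunnels migration and the code execution update both come down to a few string changes. The sketch below builds a request object without sending it. The path, beta header, and tool type come from this digest; the GET method and the tool name "code_execution" are assumptions. Authentication is left out because the digest does not describe it beyond the WIF scope.

```python
import urllib.request

API_BASE = "https://api.anthropic.com"        # public Claude API host
TUNNELS_PATH = "/v1/tunnels"                  # new surface, per this digest
TUNNELS_BETA = "mcp-tunnels-2026-06-22"       # new beta header value, per this digest


def build_tunnels_request() -> urllib.request.Request:
    """Build (but do not send) a request against the new tunnels surface.

    Assumption: GET on the collection path lists tunnels. Add credentials
    carrying the workspace:manage_tunnels scope before sending.
    """
    return urllib.request.Request(
        API_BASE + TUNNELS_PATH,
        method="GET",
        headers={"anthropic-beta": TUNNELS_BETA},
    )


# Tool entry for the newer code execution version; no beta header is needed.
# The "name" value is an assumption carried over from earlier tool versions.
code_execution_tool = {"type": "code_execution_20260120", "name": "code_execution"}

req = build_tunnels_request()
print(req.get_method(), req.full_url)  # GET https://api.anthropic.com/v1/tunnels
```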

Best for: API developers using Opus 4.7 fast mode (migrate now), MCP tunnels (update to new endpoint before migration window closes), or code execution (update SDK for REPL persistence)

Plans and Pricing — most active week of 2026

Sonnet 5 introductory pricing at $2/$10 per MTok (through August 31) is the headline. Fable 5 is restored with a 50%-of-weekly-limits window on paid subscription plans through July 7, then usage credits. Opus 4.6 fast mode was removed June 29, and Opus 4.7 fast mode will be removed July 24.

Technical details

Sonnet 5 introductory: $2/$10 per MTok through August 31, 2026 | Sonnet 5 standard from Sept 1: $3/$15 per MTok | Fable 5: restored July 1, up to 50% of weekly limits on Pro/Max/Team/Enterprise through July 7; usage credits from July 8 | Fable 5 API: $10/$50 per MTok standard, $5/$25 per MTok Batch API | Opus 4.8: $5/$25 per MTok | Opus 4.7: $5/$25 per MTok (fast mode removing July 24) | Opus 4.6 fast mode: removed June 29 | Haiku 4.5: low-cost tier | Next retirement: Opus 4.1, August 5

Best for: Switch default model to claude-sonnet-5 for everyday workloads at $2/$10 until August 31. Use Fable 5 within the 50%-of-weekly-limits window through July 7. Remove fast-mode references for Opus 4.6 (already removed) and plan Opus 4.7 fast-mode migration before July 24.
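One way to find affected fast-mode calls is to scan stored request parameters. This is a minimal sketch: claude-opus-4-7 and the speed: 'fast' parameter appear in this digest, while the claude-opus-4-6 model string is inferred from the naming pattern and the function name is hypothetical.

```python
REMOVED = {"claude-opus-4-6"}     # assumed model string; fast mode removed June 29, 2026
DEPRECATED = {"claude-opus-4-7"}  # fast mode removal scheduled for July 24, 2026


def audit_fast_mode(request_params):
    """Return (index, model, status) for each request still using fast mode on an affected model."""
    findings = []
    for index, params in enumerate(request_params):
        if params.get("speed") != "fast":
            continue
        model = params.get("model")
        if model in REMOVED:
            findings.append((index, model, "removed"))
        elif model in DEPRECATED:
            findings.append((index, model, "deprecated"))
    return findings


calls = [
    {"model": "claude-opus-4-7", "speed": "fast"},
    {"model": "claude-sonnet-5"},
    {"model": "claude-opus-4-6", "speed": "fast"},
]
print(audit_fast_mode(calls))
# [(0, 'claude-opus-4-7', 'deprecated'), (2, 'claude-opus-4-6', 'removed')]
```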

ChatGPT / OpenAI

A relatively quiet week in terms of major product launches, but several important developments reinforced OpenAI's long-term direction. The biggest story was the publication of OpenAI's first large-scale research into how AI agents are changing work, showing rapid adoption of Codex across both technical and non-technical professions. OpenAI also simplified the ChatGPT Business model picker, completed the retirement of GPT-4.5 inside ChatGPT, and teased its first dedicated Codex hardware product.

First large-scale research on AI agents changing work

Published: June 27–28, 2026 | Category: Research

OpenAI published its first major economic research paper analysing how people use AI agents in real-world work. Rather than measuring benchmark performance, the paper examines millions of Codex interactions and finds that users are increasingly delegating longer, more complex, and more cross-functional work to AI agents. Within OpenAI itself, Codex has largely replaced ChatGPT for many work tasks, and the share of users assigning tasks estimated to take an experienced human more than eight hours has increased almost tenfold since the beginning of the year.

★ What's new

OpenAI frames this as evidence that AI is moving beyond question answering towards autonomous task execution. Active users increased more than fivefold during 2026. More than 10% of users now manage three or more concurrent Codex agents. The paper argues that agentic AI is beginning to reshape how organisations organise work rather than simply making existing work faster.

Technical details

Dataset: large-scale Codex usage | Growth: active users increased more than fivefold during 2026 | Multi-agent users: >10% manage three or more concurrent agents | Internal finding: Codex largely replaces ChatGPT for many OpenAI employee workflows

Best for: Enterprise AI leaders, policymakers, and organisations trying to understand how AI agents — not just chatbots — are beginning to transform knowledge work

ChatGPT Business — simplified model picker

Announced: June 26, 2026 | Platform: ChatGPT Business

OpenAI redesigned the model picker for ChatGPT Business, replacing numerous thinking variants with reasoning effort levels: Instant, Medium, High, Extra High, Pro Standard, and Pro Extended. Instant can automatically escalate to Medium when additional reasoning would improve the response. GPT-5.5 Pro continues powering Pro Standard and Pro Extended.

★ What's new

New reasoning effort options: Instant, Medium, High, Extra High, Pro Standard, Pro Extended. Thinking Light removed. This reflects OpenAI's broader strategy of hiding model complexity from end users and allowing routing systems to determine the appropriate reasoning level automatically.

Technical details

Platform: ChatGPT Business (web, iOS, Android) | New options: Instant, Medium, High, Extra High, Pro Standard, Pro Extended | Thinking Light removed | GPT-5.5 Pro continues powering Pro Standard and Pro Extended

Best for: Business users who want simpler model selection without understanding the differences between multiple GPT variants

⚠️ GPT-4.5 retirement completed in ChatGPT

Effective: June 26–27, 2026 | Platform: ChatGPT only (API unaffected)

OpenAI completed the retirement of GPT-4.5 from ChatGPT. Existing conversations continue automatically using GPT-5.5 where appropriate. API users are unaffected. Users with custom GPTs built on GPT-4.5 have been migrated to GPT-5.5 within ChatGPT.

Technical details

Applies only to ChatGPT | API unchanged | Existing conversations automatically migrate to GPT-5.5 | Custom GPTs on GPT-4.5: auto-migrated to GPT-5.5

Best for: ChatGPT users and administrators maintaining custom GPTs — verify behaviour after the migration

Improved Memory — continued enterprise rollout

Platform: ChatGPT Business, Enterprise, and Edu

OpenAI continued the staged rollout of improved Memory for Business, Enterprise, and Edu customers. Rather than relying only on manually saved memories, ChatGPT can now reference relevant information from previous conversations to keep its understanding current, while allowing users to review, edit, or delete remembered information. Enterprise workspaces remain in the early-access phase, with administrators able to enable or disable the feature before wider default rollout.

Technical details

Available: Business, Enterprise, Edu | Controls: memory summary, source review, deletion, opt-out | Enterprise rollout: staged early access continues | Codex memory unaffected

Best for: Organisations using ChatGPT across long-running projects where context continuity improves productivity

Codex Micro — hardware accessory teased

Teased: June 29–30, 2026 | Launch: July 15 | Partner: Work Louder

OpenAI teased Codex Micro, developed in partnership with keyboard manufacturer Work Louder. Unlike the separate consumer AI device in development with Jony Ive, Codex Micro is designed specifically for Codex users — a programmable shortcut device similar to a macro keyboard that gives developers quick access to Codex workflows. This is OpenAI's first dedicated hardware accessory built specifically around one of its software products. Full details to be announced July 15.

Technical details

Product: Codex Micro | Partner: Work Louder | Category: programmable macro controller | Launch: July 15 | Purpose: faster Codex interaction and shortcut workflows

Best for: Developers who use Codex extensively throughout the day

Codex CLI 0.142.3

Released: June 27, 2026

Codex CLI received a small maintenance release with no new user-facing functionality or breaking changes. Bug fixes and maintenance work only.

Technical details

Version: 0.142.3 | Changes: maintenance only | No API changes | No new features

Best for: Existing Codex CLI users — update to remain current

Plans and Pricing

No significant pricing changes announced between June 26 and July 3. The most relevant operational changes remain last week's enterprise billing controls, improved Memory rollout, and the simplified Business model picker.

Technical details

Pricing: unchanged | Models: GPT-4.5 retired from ChatGPT | Business: simplified reasoning picker | Enterprise: Memory rollout continues | Codex CLI: updated to 0.142.3

Best for: No immediate action required. Business customers should familiarise themselves with the new reasoning-level picker; enterprise admins may wish to evaluate improved Memory before wider rollout.

Gemini (Google)

The past week marked a critical architectural consolidation for Google's AI ecosystem. Google DeepMind launched Gemini 3.1 Flash-Lite Image, a highly optimised sub-millisecond text-to-image model. Google Cloud deprecated Vertex AI as a standalone brand, absorbing all subsequent model deployments and enterprise developer features into the newly launched Gemini Enterprise Agent Platform. On the consumer front, the "Gemini Spark" update introduced a native macOS application with local file automation and third-party extensions.

Gemini 3.1 Flash-Lite Image — new visual generation model

Announced: June 30, 2026 | Model string: gemini-3.1-flash-lite-image | Also known as: nano-banana-2-lite (consumer UI) | Availability: Google AI Studio, API, consumer tiers

Google DeepMind expanded its native multimodal architecture with Gemini 3.1 Flash-Lite Image, engineered to bridge high-fidelity visual generation with strict operational cost constraints. Designed specifically for rapid, iterative text-to-image generation, real-time localised text rendering, and multi-turn visual editing, it addresses the economic bottlenecks enterprises face when deploying visual AI at scale. In consumer-facing applications, it is deployed under the name "Nano Banana 2 Lite."

★ What's new

Sub-millisecond processing speeds and drastically lowered compute requirements relative to Gemini 3 Pro Image. Supports a 1M token input context window. Output: 4k-token image matrices paired with up to 64k tokens of explanatory text. Exceptional performance in internationalised text rendering across non-Latin scripts and character consistency across sequential edits. Pricing: approximately 80% cost reduction per thousand generated matrices relative to gemini-3-pro-image, at an identical input/output token cost ratio to the standard text-based gemini-3.1-flash-lite model card.

Technical details

Model strings: gemini-3.1-flash-lite-image / nano-banana-2-lite | Availability: live in AI Studio, web/mobile consumer tiers, and select API regions | Context: 1M tokens input | Output: 4k tokens for image matrices, 64k tokens for text | Training: Google TPU clusters | Evaluation: Side-by-Side (SxS) human Elo for T2I prompt adherence, internationalisation text rendering, and multi-turn character consistency

Best for: Enterprise developers building high-volume creative applications, digital marketing automation, and interactive educational platforms requiring low-latency visual generation

Gemini Spark — native macOS app and third-party extensions

Announced: June 30, 2026 | Platform: macOS (native), Web, iOS, Android | Availability: Google AI Ultra subscribers (US beta); extensions rolling out globally

Google launched a native macOS application under the "Gemini Spark" framework, granting the assistant secure, explicit permissions to interface directly with local desktop environments. Users can command Gemini Spark to manage files across local directories, parse financial data, and automatically update Google Workspace spreadsheets — or issue multi-step instructions via mobile to their remote Mac. Third-party extensions went live for Canva, Dropbox, Instacart, OpenTable, and Zillow Rentals, alongside native integrations with Google Tasks and Keep.

★ What's new

macOS native client with local file system access (user-defined directory gating). Upcoming: remote execution pipeline — issue instructions from mobile to remote Mac hardware (retrieve sales reports, extract metrics, email summaries). Background task orchestration with automated real-time polling of web hooks, RSS feeds, financial tickers, and sports data streams. MCP support: developers can build and expose custom local or remote MCP servers directly to the Gemini Spark client, enabling custom internal enterprise applications to be called natively within the Gemini chat interface. New third-party extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals. Native integrations: Google Tasks, Google Keep.

Technical details

Platform: macOS native (Beta), Web, iOS, Android | Model: Gemini 3.1 Pro and Gemini 3.1 Flash-Lite | Availability: US Beta for AI Ultra subscribers; extensions rolling out over seven-day window | Local file system access: user-defined directory gating | Real-time polling: web hooks, RSS, financial tickers, sports data | MCP: custom local or remote MCP server support | Extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals

Best for: Power users and executives in Apple ecosystems who need a proactive AI assistant capable of automating local cross-application workflows and tracking real-time data

Gemini Enterprise Agent Platform — Vertex AI absorbed and rebranded

Announced: June 30, 2026 | Platform: Google Cloud | Availability: GA for all Google Cloud enterprise accounts

Google Cloud deprecated Vertex AI as a standalone brand, absorbing all machine learning models, developer suites, and cognitive roadmaps into the unified Gemini Enterprise Agent Platform. The platform is architected around four pillars — build, scale, govern, and optimise — shifting the enterprise value proposition from model fine-tuning to deploying highly autonomous, long-running agent workflows. The Agent Designer provides a visual no-code/low-code flowchart environment for designing trigger-based operational paths. The Agent Inbox provides a centralised operations room categorising agent activity into "Needs your input," "Errors," and "Completed."

★ What's new

Vertex AI brand retired; all enterprise ML workloads now under the Gemini Enterprise Agent Platform. Agent Designer: visual no-code/low-code flowchart environment for designing trigger-based autonomous agent paths that run in isolated cloud sandboxes. Agent Inbox: centralised operations room for prioritising and auditing agent activity. Projects: strictly walls off an agent's memory to assigned datasets, Drive repositories, and Group chats — enforcing clean data boundaries across departments. Skills: custom macros via @mentions to execute deterministic tasks across the enterprise workspace. Agent Gallery: out-of-the-box connectors for Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, and ServiceNow.

Technical details

Platform: Gemini Enterprise Agent Platform (formerly Vertex AI infrastructure) | Models: Gemini 3.1 Pro, Flash, Flash-Lite | Availability: GA globally for Google Cloud Enterprise | Key changes: complete absorption of Vertex AI into unified Agentic SDK; secure cloud sandboxing for runtime code execution; visual flowchart compilation into execution graphs; native enterprise data schema mapping | Agent Gallery connectors: Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, ServiceNow

Best for: CIOs, enterprise architects, and operations leaders seeking to deploy secure, autonomous, and governed AI workforces for end-to-end business workflows

Remote MCP server for Gemini Enterprise Agent Platform

Announced: June 30, 2026 | Availability: GA across all Google Cloud projects with Agent Platform API enabled

Google deployed a remote Model Context Protocol (MCP) server within the Gemini Enterprise Agent Platform, allowing external development frameworks such as Claude Code to interact directly and safely with Google Cloud environments. The connection runs entirely within Google Cloud's secure infrastructure and integrates natively with Cloud IAM, so security administrators can apply standard IAM Deny policies to requests from external agents.

⚠ What's new

External coding agents can natively call foundation models from Google's Model Garden, pull approved internal prompt templates, or manage notebooks within an active project. Implementation requires three steps: enable the Agent Platform API in Google Cloud Console, configure the local client JSON payload to point to the remote Google Cloud MCP endpoint, and copy the provided Toolset Endpoints into the external IDE workspace. Native Cloud IAM integration handles enterprise token exchange and session validation automatically at the boundary layer.
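As a rough illustration of the second step, the sketch below builds a client-side MCP configuration entry and prints it as JSON. The `mcpServers` layout mirrors a convention that some MCP clients accept; the server label and endpoint URL are placeholders invented for this example, not values published by Google, so check your client's documentation for the exact shape it expects.

```python
import json

# Hypothetical values: substitute the endpoint shown in your Google Cloud Console.
MCP_ENDPOINT = "https://example.invalid/agent-platform/mcp"  # placeholder, not a real URL
SERVER_NAME = "gemini-agent-platform"  # arbitrary local label


def build_mcp_config(server_name: str, endpoint: str) -> dict:
    """Return a client config that points one named server at a remote MCP endpoint."""
    if not endpoint.startswith("https://"):
        raise ValueError("remote MCP endpoints should be served over HTTPS")
    return {"mcpServers": {server_name: {"type": "http", "url": endpoint}}}


config = build_mcp_config(SERVER_NAME, MCP_ENDPOINT)
print(json.dumps(config, indent=2))
```

Authentication is deliberately left out of the sketch, since the digest says Cloud IAM handles token exchange at the boundary.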

Technical details

Platform: Google Cloud IAM / Agent Platform API | Protocol: open MCP specification | Availability: auto-activated on enabling Agent Platform API | Key changes: native hosting of remote MCP server endpoints within Google Cloud; centralised asset cataloguing via Agent Registry; enforces IAM Deny policies for external agent requests | Security: runs within Google Cloud's secure infrastructure, IAM-governed

Best for: DevOps engineers and full-stack development teams wanting to use advanced third-party coding agents without compromising enterprise data governance

Native VPC Service Controls — agent data isolation

Announced: June 27, 2026 | Availability: GA for all enterprise accounts using advanced agentic workflows

Google announced native integration of Virtual Private Cloud (VPC) Service Controls directly into the Gemini Enterprise Agent Platform, establishing explicit network perimeters around autonomous agent workloads. When a long-running agent executes code in its cloud sandbox or connects to internal data repositories (Drive, BigQuery), VPC Service Controls ensure data remains strictly within a designated trusted boundary.

Technical details

Platform: Google Cloud Security Architecture | Availability: globally via Google Cloud Console perimeter configuration | Key changes: extension of VPC Service Controls to cover Agent Platform API endpoints; enforcement of cryptographic identities for agents moving across network perimeters; automated blocking of unverified external data egress from agent tool-calling loops | Update path: existing VPC perimeters can be updated to include the Gemini Enterprise Agent Platform service identity without modifying agent code

Best for: CISOs, compliance officers, and network security engineers requiring absolute data isolation during deployment of autonomous enterprise agents — particularly banking, healthcare, and government

Workspace Feature Drop — Formula Fixer, iterative Gmail drafting

Announced: June 25, 2026 | Platform: Google Workspace (Gmail, Google Sheets, Google Drive) | Availability: AI Ultra, Pro, Plus, and Workspace Enterprise Plus on Gemini Alpha track

Google delivered its latest Workspace Feature Drop, embedding deeper context-aware Gemini capabilities across core productivity applications. In Sheets, the Gemini Formula Fixer diagnoses formula syntax or logic faults, explains the correction in plain language, and updates the formula inline. In Gmail, the "Help me write" feature was upgraded with a persistent instruction bar for iterative refinements — issue direct edits like "add the project deadline to line three" without rewriting the entire prompt. Spreadsheet localisation expanded to 28 new languages.

★ What's new

Gemini Formula Fixer (Sheets): diagnoses syntax/logic errors and updates formulas inline. Iterative Prompt Refinement (Gmail): persistent instruction bar for surgical text edits without rewriting the full prompt. 28-language sheet localisation (Spanish, Japanese, French, German, Korean, and more). AI Inbox (Alpha Track only): high-volume executive triage.

Technical details

Platform: Google Workspace | Model: Gemini 3.1 Pro integration | Availability: live for premium tiers, rolling out to enterprise | Sheets: multi-column context analysis, AST-based formula debugging engine | Gmail: stateless partial-string editing states in drafting API | Localization: 28 new languages with expanded global character set tokenizer arrays | AI Inbox: strictly gated to Gemini Alpha track

Best for: Financial analysts, project managers, and operations personnel heavily reliant on Google Workspace who need rapid data troubleshooting and precise communication drafting

Plans and Pricing

Gemini Enterprise Agent Platform deprecates Vertex AI's standalone billing model; all subsequent enterprise ML workloads are invoiced under the unified platform ledger. Agent execution within secure cloud sandboxes incurs standard Google Cloud compute-per-minute overheads, insulated by native VPC Service Controls at no additional compliance premium. Remote MCP server access is provided at zero licensing cost, billing only on the volume of underlying model and tool calls. Gemini Spark macOS and real-time background tracking capabilities remain tied to the Google AI Ultra subscription ($20/month). A new Google Home Premium tier costs $10/month and adds advanced voice agent capabilities and continuous video history analysis for consumer IoT hardware.

Technical details

Gemini Enterprise Agent Platform: replaces Vertex AI standalone billing | gemini-3.1-flash-lite-image: ~80% cost reduction vs gemini-3-pro-image per thousand generated matrices | Cloud sandbox execution: standard Google Cloud compute-per-minute overhead | Remote MCP: zero licensing cost; billed on model and tool call volume | Google AI Ultra: $20/month (Gemini Spark macOS + background tracking) | Google Home Premium: $10/month (advanced voice agent + continuous video history for IoT)

Best for: Enterprise IT leaders shifting high-frequency visual tasks to Flash-Lite architecture and integrating external engineering IDEs into Google Cloud via open-standard MCP channels

Microsoft Copilot

Dateline: July 3, 2026 | Next update: July 10, 2026

Copilot Cowork reached worldwide general availability on June 30, with automatic model selection between GPT-5.5 Thinking (deep reasoning), Claude (structured/visual work), and GPT-5.5 Instant (lightweight tasks). Productivity apps gained new customisation and brand tools, Copilot Vision was previewed for screen-based insights, and admins gained stronger governance with Purview and cost dashboards.

⚠️ Copilot Cowork — worldwide GA, model choice, expanded plugins

GA: June 30, 2026 | Status: globally available | Model auto-selection: live

Cowork is now generally available worldwide. Users can toggle between Chat and Cowork in the Copilot app. Cowork executes tasks end-to-end, returning deliverables rather than drafts. The headline new feature is automatic model selection: Cowork can now choose between GPT-5.5 Thinking for deep reasoning, Claude for structured/visual work, and GPT-5.5 Instant for lightweight tasks without user intervention.

★ What's new

Worldwide GA: all enterprise tenants. Model auto-selection: Cowork automatically routes between GPT-5.5 Thinking, Claude, and GPT-5.5 Instant based on task type. Expanded plugin ecosystem: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, plus Databricks via sideloading. Microsoft Fabric and Dynamics 365 apps now integrated. Cost Management Dashboard in admin center: admins can monitor Cowork credit usage, budgets, and spend. Customize tab for custom skills.

Technical details

Cowork: usage-based billing, Cost Management Dashboard in admin center | Model auto-selection: GPT-5.5 Thinking / Claude / GPT-5.5 Instant | Plugin store expanded: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, Databricks | Microsoft Fabric + Dynamics 365: integrated | Customize tab: custom skills

Best for: Enterprises orchestrating complex workflows with tailored model selection and plugin integration

Copilot in Apps — Word, Excel, Outlook, PowerPoint

★ What's new

Word: model choice now available; Copilot Catchup content card; image creation; iOS agentic capabilities; preserves chat history across apps; applies edits based on comments. Excel: Skills, personalisation, and rules; Planner Agent availability; deeper reasoning over Power BI enterprise data. Outlook: Compose canvas refinement; Copilot settings in Outlook Classic rolling out next month. PowerPoint: Brand Kit Picker for approved templates; reusable Copilot skills; references to SharePoint libraries and OneDrive folders.

Technical details

Word: model choice, Catchup card, image creation, iOS agentic, cross-app chat history, comment-based edits | Excel: Skills, personalisation, rules, Planner Agent, Power BI reasoning | Outlook: Compose canvas, Classic settings next month | PowerPoint: Brand Kit Picker, reusable skills, SharePoint/OneDrive references

Best for: Teams needing consistent brand enforcement, deeper data reasoning, and cross-app continuity

Governance and Compliance — Admin Center, Purview

★ What's new

Cost Management Dashboard: admins can monitor Cowork credit usage, budgets, and spend in the admin center. Purview DLP controls: email restriction and Cowork governance added. Federated Connectors GA: secure external app integration with staged rollout.

Technical details

Cost Management Dashboard: Cowork credits, budget monitoring, spend tracking | Purview DLP: email restriction, Cowork governance | Federated Connectors: GA, staged rollout

Best for: IT admins managing spend, compliance, and connector governance

Copilot Vision — screen-based insights

Announced: June 29, 2026 | Status: early rollout in Frontier tenants

Copilot Vision generates insights based on what is visible on a user's screen. It is in early rollout for Frontier tenants.

Best for: Analysts and researchers needing contextual screen-based insights

Copilot Notebooks — expanded references

★ What's new

Outlook emails can now be added as references in Copilot Notebooks. Notebooks are now available to all Copilot Chat users, not just licensed Copilot 365 users.

Best for: Knowledge workers consolidating references across Outlook, SharePoint, and OneDrive

Plans and Pricing

Cowork: usage-based billing with admin Cost Management Dashboard. Claude and GPT-5.5 Thinking included in Pro and Enterprise tiers. GPT-5.5 Instant available as lower-cost tier. Plugins included with Cowork GA.

Technical details

Cowork: usage-based billing, admin Cost Management Dashboard | Models: Claude + GPT-5.5 Thinking (Pro + Enterprise), GPT-5.5 Instant (lower-cost) | Plugins: included with Cowork GA | Copilot Notebooks: expanded to all Copilot Chat users

Best for: Enterprises balancing cost with model flexibility and plugin expansion

Dateline: July 3, 2026 | Next update: July 10, 2026

The Fable 5 crisis resolved this week with three simultaneous moves: the US Department of Commerce lifted the export-control directive on June 30, Anthropic restored global access to Fable 5 on July 1 with enhanced safeguards, and Anthropic launched Claude Sonnet 5 on June 30 as a near-Opus-quality model at Sonnet pricing. The restoration and the Sonnet 5 launch are the two biggest product events since the Fable 5 launch itself. Alongside them: Claude in Microsoft Foundry went GA, a self-hosted gateway launched for enterprises, Enterprise admin analytics were overhauled, and Claude Code shipped a major weekly update including Claude in Chrome going GA and background PR auto-open.

Claude / Anthropic

✅ Fable 5 + Mythos 5 — export controls lifted, access restored

Export controls lifted: June 30, 2026 | Access restored: July 1, 2026 | Days suspended: 19 | Enhanced safeguards: yes

On June 30, the US Department of Commerce lifted the export-control directive on Claude Fable 5 and Mythos 5, ending the 19-day global suspension. Anthropic restored global access to Fable 5 on July 1 across the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. The resolution came with a package of enhanced cybersecurity safeguards developed in coordination with the government. Mythos 5 remains partially restricted: access has been restored for a set of US organisations following government approval on June 26, with expansion to the broader Glasswing programme ongoing.

✅ Resolved

Fable 5 is globally available again as of July 1 on the Claude Platform, Claude.ai, Claude Code, and Claude Cowork. Access on AWS, Google Cloud, and Microsoft Foundry is being restored as quickly as possible. Enhanced safeguards shipped with the restoration: the specific jailbreak technique the government flagged is now blocked in more than 99% of cases by a new classifier layer. The trade-off is somewhat more sensitive safety routing, with a higher false-positive rate on legitimate security development and debugging tasks. Anthropic is proposing an industry framework for jailbreak severity (with Amazon, Microsoft, Google, and Glasswing partners) based on four criteria: the capabilities an attacker gains, the scope of those capabilities, ease of weaponisation, and discoverability. A HackerOne bug-bounty programme has launched for reporting cybersecurity jailbreaks. Subscription terms: Fable 5 is available for up to 50% of weekly usage limits on Pro, Max, Team, and select Enterprise plans through July 7, after which usage credits are required.

Technical details

Restored model strings: claude-fable-5 (global), claude-mythos-5 (US Glasswing organisations only) | Enhanced classifier: blocks the cited jailbreak technique >99% of the time | Subscription window: Fable 5 included for up to 50% of weekly limits through July 7; usage credits from July 8 | AWS/GCP/Foundry: restoration in progress | HackerOne: bug-bounty programme for cyber jailbreaks | Tokenizer note: the same text produces ~30% more tokens than on pre-Opus-4.7 models

Best for: All users. Fable 5 is back; use it through July 7 within the 50% weekly-limit allowance. API developers: restore claude-fable-5 in your model strings and account for the new tokenizer (~30% more tokens for the same text).

Claude Sonnet 5 — new default model for Free and Pro

Launched: June 30, 2026 | Model string: claude-sonnet-5 | Introductory price: $2/$10 per MTok through August 31 | Standard price from September 1: $3/$15 per MTok | Availability: all plans + API + Bedrock + Vertex AI + Microsoft Foundry

Claude Sonnet 5 is the most significant mid-tier model Anthropic has shipped. It replaces Sonnet 4.6 as the default model for Free and Pro users and delivers agentic performance close to Opus 4.8 at Sonnet pricing. It is the first Sonnet model built primarily around autonomous, multi-step tasks: planning, tool use, browser and terminal control, and unprompted self-verification. The introductory API price of $2/$10 per MTok makes it 60% cheaper than Opus 4.8 ($5/$25 per MTok) at launch.

★ What's new

Claude Sonnet 5 launched June 30 as the new default for Free and Pro. Key benchmarks (Anthropic's own figures): SWE-bench Verified 72.7% (vs Sonnet 4.6: 62.3%, Opus 4.8: 79.4%); Terminal-Bench 76.1% (vs Sonnet 4.6: 55.4%, +20.7 points, the largest single-benchmark jump in the history of Sonnet launches); BrowseComp (agentic search) and OSWorld-Verified (computer use) both approach Opus 4.8 at medium effort. Safety: lower rates of hallucination, sycophancy, and undesired agentic behaviour than Sonnet 4.6. Cyber safeguards are enabled by default. Tokenizer note: Sonnet 5 uses the same newer tokenizer as Opus 4.7/4.8 and Fable 5, producing roughly 30% more tokens for the same text. Check token budgets before migrating a production pipeline.

Technical details

Model string: claude-sonnet-5 | Context: 1M tokens | Max output: 128k tokens | Introductory price: $2/$10 per MTok through August 31, 2026 | Standard price from September 1: $3/$15 per MTok | Tokenizer: newer (~30% more tokens than Sonnet 4.6 for the same text) | Sonnet 4.6 deprecation: date not yet announced | Claude Code: update to v2.1.197 for Sonnet 5 as default
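To make the token-budget advice concrete, here is a minimal sketch that rescales a workload for the newer tokenizer and prices it at the per-MTok rates quoted in this digest. The 100/20 MTok workload is a made-up example, and the flat 1.30 multiplier is only the digest's approximate figure; measure your own prompts before relying on it.

```python
# USD per million tokens (input, output), as quoted in this digest.
PRICES = {
    "Sonnet 5 (introductory)": (2.00, 10.00),
    "Sonnet 5 (standard)": (3.00, 15.00),
    "Opus 4.8": (5.00, 25.00),
}
TOKENIZER_INFLATION = 1.30  # ~30% more tokens for the same text; an approximation


def cost_usd(model: str, input_mtok: float, output_mtok: float) -> float:
    """Cost in USD for a workload measured in millions of tokens."""
    input_price, output_price = PRICES[model]
    return input_mtok * input_price + output_mtok * output_price


# Hypothetical monthly workload, measured with the older tokenizer.
old_input_mtok, old_output_mtok = 100.0, 20.0
new_input_mtok = old_input_mtok * TOKENIZER_INFLATION
new_output_mtok = old_output_mtok * TOKENIZER_INFLATION

costs = {m: cost_usd(m, new_input_mtok, new_output_mtok) for m in PRICES}
for model, cost in costs.items():
    print(f"{model}: ${cost:,.2f}")

saving = 1 - costs["Sonnet 5 (introductory)"] / costs["Opus 4.8"]
print(f"Introductory saving vs Opus 4.8: {saving:.0%}")
```

Because both the input and output prices are two fifths of the Opus 4.8 rates, the introductory saving is 60% for any input/output mix. The tokenizer multiplier changes the absolute bill, not that ratio.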

Best for: The default everyday model for most users from June 30. Cost-sensitive agentic workloads where Opus-class quality is overkill. High-volume pipelines for coding, tool use, and knowledge work.

Rate limits raised — support for higher-effort models

Effective: alongside the Fable 5 restoration and Sonnet 5 launch | Applies to: Claude Platform, claude.ai, Claude Code, Cowork

Anthropic raised rate limits on Chat, Cowork, Claude Code, and the Claude Platform to accommodate the higher token usage that comes with higher effort levels and Fable 5's heavier context use. The API tier structure has been simplified to three tiers: Start, Build, and Scale.

★ What's new

Rate limits raised on all paid plans across Chat, Cowork, Claude Code, and the Claude Platform. API tier structure simplified to Start, Build, and Scale. Sonnet and Haiku API rate limits raised on every tier. View your tier and current limits in the Claude Console.

Technical details

API tiers: Start, Build, Scale (simplified) | Console: view tier and limits in the Claude Console | Primary drivers: higher effort levels and Fable 5's heavier context use

Best for: All paid users; no action required. API developers: see the new tier documentation in the Claude Console.

Claude in Microsoft Foundry — general availability

GA date: June 29, 2026 | Platform: Azure | Availability: enterprise customers globally

Claude in Microsoft Foundry became generally available on June 29, giving Azure enterprise customers production-ready access to Claude with Azure-native authentication (Entra ID), billing, networking, governance, and data-residency controls. Two hosting options are available: Hosted on Azure (inference processed inside Azure, with a US data zone option) and Hosted on Anthropic (the full Claude API feature set). Sonnet 5 is available in Foundry from day one.

★ What's new

Claude in Microsoft Foundry is now generally available. Hosted on Azure: inference inside Azure, Azure authentication, billing, governance, US and global data zones, Anthropic as data processor. Hosted on Anthropic: the full Claude API feature set (formerly called Foundry Preview). Claude Sonnet 5 available in Foundry from July 1. NVIDIA GB300 GPU support confirmed for Foundry. Foundry Agent Service can use Sonnet 5 as the reasoning core for multi-step planning, tool use, and task execution.

Technical details

GA: June 29, 2026 | Hosted on Azure: Entra ID auth, Azure Marketplace billing, US + global data zones | Hosted on Anthropic: full API (formerly Foundry Preview) | Models: Sonnet 5 (GA July 1), Opus 4.8, Haiku 4.5 | NVIDIA: GB300 GPU confirmed | Billing: Claude Consumption Units (CCUs) via Azure Marketplace | Fable 5 on Foundry: arriving as part of the access restoration

Best for: Azure enterprise customers who need Claude with their existing Azure identity, billing, and governance controls

Claude self-hosted gateway — API infrastructure on your own servers

Platform: Claude Platform | Availability: enterprise | Ships inside the claude binary

Anthropic launched a self-hosted gateway: an enterprise-grade API gateway for Claude that runs inside the same claude binary developers already install. Enterprises can run it as a single stateless container on their own infrastructure, with their own network policies, identity provider (OIDC: Google Workspace, Microsoft Entra ID, Okta), and telemetry stack.

★ What's new

The self-hosted gateway ships inside the claude binary as a single stateless container. Identity: it acts as an OIDC relying party to Google Workspace, Microsoft Entra ID, Okta, or any standard OIDC provider, issuing short-lived sessions with no long-lived secrets on developer machines. Policy: managed settings are defined once on the server; clients receive the policy at sign-in and the gateway enforces it on every request (allowed models, default settings). Telemetry: the client stamps usage metrics on every request; the gateway forwards them over OTLP to a collector you configure in your own network and on your own retention schedule.

Technical details

Deployment: single stateless container | Ships in: the claude binary | Identity: OIDC RP for Google Workspace, Microsoft Entra ID, Okta, any standard OIDC provider | Session: short-lived, no long-lived secrets | Policy: managed settings, delivered to clients at sign-in and enforced by the gateway on every request | Telemetry: OTLP to a collector you configure | Complements: Claude Platform on AWS (AWS-native), Claude in Microsoft Foundry (Azure-native), WIF (keyless authentication for CI/CD)
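The per-request policy check described above (allowed models plus a default) can be pictured with a small sketch. This is not the gateway's actual configuration format or code: `GatewayPolicy` and `resolve_model` are hypothetical names invented for illustration, and the model strings are the ones quoted elsewhere in this digest.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class GatewayPolicy:
    """Hypothetical managed-settings record: allowed models and the default."""
    allowed_models: frozenset
    default_model: str


def resolve_model(policy: GatewayPolicy, requested: Optional[str] = None) -> str:
    """Return the model a request may use, or raise if the policy forbids it."""
    if requested is None:
        return policy.default_model
    if requested not in policy.allowed_models:
        raise PermissionError(f"model '{requested}' is not allowed by policy")
    return requested


policy = GatewayPolicy(
    allowed_models=frozenset({"claude-sonnet-5", "claude-fable-5"}),
    default_model="claude-sonnet-5",
)
print(resolve_model(policy))                    # falls back to the default
print(resolve_model(policy, "claude-fable-5"))  # explicitly allowed
```

Keeping the check on the server side means a client cannot widen its own permissions by editing local settings, which matches the digest's description of the gateway enforcing policy on every request.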

Best for: Enterprises with strict data-perimeter requirements, such as financial services and healthcare teams that cannot route traffic through Anthropic's public endpoints

Claude Enterprise — richer admin analytics, model permissions, spend alerts

Platform: Claude Enterprise | Availability: Enterprise plans | Admin console

Enterprise admins gained considerably more visibility into and control over Claude usage this week. The admin console now shows usage and cost breakdowns by model, team, and user, with spend-threshold alerts and model-level permissions that let admins set which models specific groups can access.

★ What's new

Richer admin analytics: usage and cost breakdowns by model, team, and user in the admin console. Model-level permissions: set default models and access permissions per group, not just organisation-wide. Spend alerts: configure spend-threshold notifications to catch overruns before they reach the invoice. Analytics API: finance and IT teams can programmatically pull the same metrics into existing reporting systems.

Technical details

New in the admin console: cost breakdown by model, breakdown by team/user, per-group model permissions, spend-threshold alerts | Analytics API: same metrics as the admin dashboard, programmatic access | Complements: existing spend limits, model routing, SCIM/RBAC, WIF | Plans: Claude Enterprise

Best for: Enterprise IT admins and finance teams tracking Claude spend and enforcing per-team model access policies

Claude Code — Claude in Chrome GA, PR auto-open, /dataviz skill

Platform: terminal / VS Code / web / mobile / Chrome | Availability: all plans | Update to v2.1.197 for Sonnet 5

Claude Code had another packed week: Claude in Chrome graduated from beta to general availability; background agents now automatically commit, push, and open a draft PR when they finish code work in a worktree; Sonnet 5 is the new default in Claude Code from v2.1.197; and a /dataviz skill was added for chart and dashboard design guidance.

★ What's new

Claude in Chrome is now generally available: Claude can browse, click, fill in forms, and act on web pages directly inside Chrome. Update to v2.1.197: Sonnet 5 is now the default model in Claude Code at the introductory price of $2/$10 per MTok through August 31. Background PR auto-open: background agents started from claude agents now automatically commit, push, and open a draft PR when they finish code work in a worktree, instead of stopping to ask. New /dataviz skill: chart and dashboard design guidance with a runnable colour-palette validator. Background agent notifications: sessions that need input or that finish now trigger the Notification hook (agent_needs_input / agent_completed). Claude Platform on AWS added as a gateway upstream provider (anthropicAws). The built-in Explore agent now inherits the main session's model (capped at Opus) instead of always running on Haiku. Subagents and context compaction now inherit the session's extended-thinking configuration. Org default models: admins set a default model in the org console; it appears as 'Org default' in /model. Fixed: brief mid-response network drops are now retried with backoff instead of aborting the edit.

Technical details

Required update: v2.1.197 for Sonnet 5 as default | Claude in Chrome: GA | Background PR: auto-commit, push, draft PR on task completion in a worktree | /dataviz: chart/dashboard design skill with colour-palette validator | Notification hook: agent_needs_input / agent_completed | Gateway: anthropicAws added; model-not-found advances through the failover chain | Org default model: set in the org console, shown as 'Org default' in /model | Fixed: ECONNRESET and transient network drops are now retried with backoff | Deprecated: Opus 4.7 fast mode, removal July 24
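For teams that want to act on the two new notification types, a handler might look roughly like the sketch below. The digest names `agent_needs_input` and `agent_completed` but does not describe the payload, so the JSON layout here (`notification_type`, `session_id`) is an assumption made purely for illustration; consult the Claude Code hooks documentation for the real fields.

```python
import json

# Hypothetical payload shape: "notification_type" and "session_id" are assumed
# field names, not taken from the Claude Code documentation.
SAMPLE_EVENTS = [
    '{"notification_type": "agent_needs_input", "session_id": "abc123"}',
    '{"notification_type": "agent_completed", "session_id": "abc123"}',
]


def describe(raw_event: str) -> str:
    """Turn one JSON notification into a short human-readable message."""
    event = json.loads(raw_event)
    kind = event.get("notification_type", "unknown")
    session = event.get("session_id", "?")
    if kind == "agent_needs_input":
        return f"Session {session} is waiting for your input"
    if kind == "agent_completed":
        return f"Session {session} finished its task"
    return f"Session {session}: unhandled notification '{kind}'"


messages = [describe(raw) for raw in SAMPLE_EVENTS]
for message in messages:
    print(message)
```

In practice the returned message would be forwarded to whatever channel the team already watches, such as a desktop notifier or chat webhook.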

Best for: All Claude Code users; update to v2.1.197 now to get Sonnet 5 as default. Chrome users: Claude in Chrome is now GA for production use.

API — Opus 4.7 fast mode deprecation, MCP tunnels migration, SDK updates

Platform: Claude API | Effective: various dates this week

Three API updates this week. Opus 4.7 fast mode is deprecated and will be removed on July 24. The management API for MCP tunnels has moved to a new surface. And all major language SDKs now include support for the latest version of the code execution tool.

★ What's new

Opus 4.7 fast mode deprecation: removal on July 24, 2026. After removal, requests to claude-opus-4-7 with speed: 'fast' will return an error. Migrate to Opus 4.8 fast mode. Opus 4.6 fast mode was already removed on June 29. MCP tunnels API migration: the management API moved from /v1/organizations/tunnels on the Admin API to /v1/tunnels on the Claude API; new header: anthropic-beta: mcp-tunnels-2026-06-22; new WIF scope: workspace:manage_tunnels. The previous surface remains available during the migration window. SDK update: the Python, TypeScript, Go, Java, Ruby, PHP, and C# SDKs now include support for code_execution_20260120 (REPL state persistence, minimum version for programmatic tool calling). No beta header is needed; set type to code_execution_20260120.

Technical details

Opus 4.7 fast mode removal: July 24, 2026 | Opus 4.6 fast mode: already removed June 29 | MCP tunnels new endpoint: /v1/tunnels on the Claude API | MCP tunnels header: anthropic-beta: mcp-tunnels-2026-06-22 | MCP tunnels WIF scope: workspace:manage_tunnels | Migration window: previous Admin API surface still available temporarily | code_execution_20260120: REPL state persistence, programmatic tool calling, no beta header, all major SDKs updated | Next retirement: Opus 4.1, August 5, 2026
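The three changes above can be gathered into a small migration helper. The sketch works on plain dictionaries rather than any SDK, so it makes no claims about client library signatures. The replacement identifier `claude-opus-4-8` is assumed by analogy with `claude-opus-4-7`, and the tool's `name` field is likewise an assumption; the endpoint path, beta header, and tool type are copied from this digest.

```python
import copy

DEPRECATED_FAST_MODEL = "claude-opus-4-7"
REPLACEMENT_FAST_MODEL = "claude-opus-4-8"  # assumed identifier, by analogy

# Tool type as quoted in the digest; the "name" value is an assumption.
CODE_EXECUTION_TOOL = {"type": "code_execution_20260120", "name": "code_execution"}

# New MCP tunnels management surface, as quoted in the digest.
TUNNELS_PATH = "/v1/tunnels"
TUNNELS_HEADERS = {"anthropic-beta": "mcp-tunnels-2026-06-22"}


def migrate_request(params: dict) -> dict:
    """Return a copy of the request parameters with Opus 4.7 fast mode moved to Opus 4.8."""
    updated = copy.deepcopy(params)
    if updated.get("model") == DEPRECATED_FAST_MODEL and updated.get("speed") == "fast":
        updated["model"] = REPLACEMENT_FAST_MODEL
    return updated


old_request = {"model": "claude-opus-4-7", "speed": "fast", "max_tokens": 1024}
new_request = migrate_request(old_request)
print(new_request)
```

Only requests that combine the deprecated model with `speed: 'fast'` are rewritten, since standard Opus 4.7 requests are not part of this deprecation.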

Best for: API developers using Opus 4.7 fast mode (migrate now), MCP tunnels (update to the new endpoint before the migration window closes), or code execution (update your SDK for REPL persistence)

Plans and Pricing — the most active week of 2026

Sonnet 5's introductory price of $2/$10 per MTok (through August 31) is the headline. Fable 5 is restored with a window of up to 50% of weekly limits on paid plans through July 7, then usage credits. Opus 4.6 fast mode was removed on June 29. Opus 4.7 fast mode will be removed on July 24.

Technical details

Sonnet 5 introductory: $2/$10 per MTok through August 31, 2026 | Sonnet 5 standard from September 1: $3/$15 per MTok | Fable 5: restored July 1, up to 50% of weekly limits on Pro/Max/Team/Enterprise through July 7; usage credits from July 8 | Fable 5 API: $10/$50 per MTok standard, $5/$25 per MTok Batch API | Opus 4.8: $5/$25 per MTok | Opus 4.7: $5/$25 per MTok (fast mode to be removed July 24) | Opus 4.6 fast mode: removed June 29 | Haiku 4.5: low-cost tier | Next retirement: Opus 4.1, August 5

Best for: Switch your default model to claude-sonnet-5 for everyday workloads at $2/$10 through August 31. Use Fable 5 through July 7 within the weekly-limit allowance. Remove references to Opus 4.6 fast mode (already broken) and plan your Opus 4.7 fast mode migration before July 24.

ChatGPT / OpenAI

Dateline: July 3, 2026 | Next update: July 10, 2026

A relatively quiet week for major launches, but several important events confirmed OpenAI's long-term direction. The biggest story was the publication of OpenAI's first research on how AI agents are changing work. OpenAI also simplified the model picker for ChatGPT Business, completed the retirement of GPT-4.5 from ChatGPT, and teased its first dedicated Codex hardware product.

First research on AI agents changing work

Published: June 27–28, 2026 | Category: Research

OpenAI published its first large-scale economic research study analysing how people use AI agents in real work. Rather than measuring benchmark performance, the study examines millions of Codex interactions and finds that users are increasingly delegating longer, more complex, and cross-functional tasks to AI agents. Inside OpenAI itself, Codex has largely replaced ChatGPT for many work tasks, and the share of users assigning tasks estimated at more than eight hours of experienced human work has risen nearly tenfold since the start of the year.

★ What's new

OpenAI frames this as evidence that AI is shifting from answering questions to executing tasks autonomously. Active users grew more than fivefold during 2026. More than 10% of users now manage three or more concurrent Codex agents. The study argues that agentic AI is beginning to change how organisations structure work, not just speed up existing work.

Technical details

Dataset: large-scale Codex usage | Growth: active users up more than fivefold during 2026 | Multi-agent users: >10% manage three or more concurrent agents | Internal finding: Codex largely replaces ChatGPT for many OpenAI employee workflows

Best for: Enterprise AI leaders, policymakers, and organisations trying to understand how AI agents, not just chatbots, are beginning to transform knowledge work

ChatGPT Business — simplified model picker

Announced: June 26, 2026 | Platform: ChatGPT Business

OpenAI redesigned the model picker for ChatGPT Business, replacing the numerous thinking variants with reasoning-effort levels: Instant, Medium, High, Extra High, Pro Standard, and Pro Extended. Instant can automatically escalate to Medium when additional reasoning would improve the answer. GPT-5.5 Pro continues to power Pro Standard and Pro Extended.

★ What's new

New reasoning-effort levels: Instant, Medium, High, Extra High, Pro Standard, Pro Extended. Thinking Light has been removed. This reflects OpenAI's broader strategy of hiding model complexity from end users and letting routing systems determine the appropriate level of reasoning automatically.

Technical details

Platform: ChatGPT Business (web, iOS, Android) | New options: Instant, Medium, High, Extra High, Pro Standard, Pro Extended | Thinking Light removed | GPT-5.5 Pro continues to power Pro Standard and Pro Extended

Best for: Business users who want simpler model selection without having to understand the differences between multiple GPT variants

⚠️ GPT-4.5 retirement completed in ChatGPT

Effective: June 26–27, 2026 | Platform: ChatGPT only (API not affected)

OpenAI completed the retirement of GPT-4.5 from ChatGPT. Existing conversations automatically continue on GPT-5.5 where appropriate. API users are not affected. Users with custom GPTs built on GPT-4.5 have been migrated to GPT-5.5 within ChatGPT.

Tehničke detalji

Primenjuje se samo na ChatGPT | API nepromenjen | Postojeći razgovori se automatski migriraju na GPT-5.5 | Prilagođeni GPT-ovi na GPT-4.5: auto-migrirani na GPT-5.5

Preporučeno za: ChatGPT korisnike i administratore koji održavaju prilagođene GPT-ove — proverite ponašanje nakon migracije

Improved memory — enterprise rollout continues

Platform: ChatGPT Business, Enterprise, and Edu

OpenAI continued the phased rollout of improved memory for Business, Enterprise, and Edu users. ChatGPT can now reference relevant information from previous conversations, while users keep control over reviewing, editing, and deleting what is remembered. Enterprise workspaces remain in early access.

Technical details

Available: Business, Enterprise, Edu | Controls: memory summary, source review, deletion, opt-out | Enterprise rollout: phased early access continues | Codex memory not affected

Recommended for: Organisations using ChatGPT on long-running projects where continuity of context improves productivity

Codex Micro — teased hardware accessory

Teased: June 29–30, 2026 | Launch: July 15 | Partner: Work Louder

OpenAI teased Codex Micro, developed in partnership with keyboard maker Work Louder. Unlike the separate consumer AI device in development with Jony Ive, Codex Micro is designed specifically for Codex users: a programmable shortcut device, similar to a macro keyboard, that gives developers quick access to Codex workflows. It is OpenAI's first dedicated hardware accessory built around one of its software products. Full details will be announced on July 15.

Technical details

Product: Codex Micro | Partner: Work Louder | Category: programmable macro controller | Launch: July 15 | Purpose: faster Codex interaction and shortcut-driven workflows

Recommended for: Developers who use Codex intensively throughout the day

Codex CLI 0.142.3

Released: June 27, 2026

Codex CLI received a small maintenance release with no new user-facing features or breaking changes. It contains bug fixes and maintenance work only.

Technical details

Version: 0.142.3 | Changes: maintenance only | No API changes | No new features

Recommended for: Existing Codex CLI users, who should update to stay current

Plans and pricing

No significant pricing changes between June 26 and July 3. The most relevant operational changes remain last week's enterprise billing controls, the improved-memory rollout, and the simplified model picker for Business.

Recommended for: No immediate action required. Business users should familiarise themselves with the new reasoning-level picker; enterprise admins may want to evaluate improved memory before a wider rollout.

Gemini (Google)

Dateline: July 3, 2026 | Next update: July 10, 2026

The past week marked a major architectural consolidation for Google's AI ecosystem. Google DeepMind launched Gemini 3.1 Flash-Lite Image, a highly optimised image-generation model. Google Cloud deprecated Vertex AI as a standalone brand, absorbing all subsequent model distributions into the newly launched Gemini Enterprise Agent Platform. On the consumer front, the "Gemini Spark" update introduced a native macOS app with local file automation and third-party extensions.

Gemini 3.1 Flash-Lite Image — new visual generation model

Announced: June 30, 2026 | Model string: gemini-3.1-flash-lite-image | Also known as: nano-banana-2-lite (consumer UI) | Availability: Google AI Studio, API, consumer plans

Google DeepMind expanded its native multimodal architecture with Gemini 3.1 Flash-Lite Image, designed specifically for fast, iterative text-to-image generation, real-time text rendering, and multi-turn visual conversations. It addresses the cost barriers enterprises face when deploying visual AI at scale. In consumer apps it ships under the name "Nano Banana 2 Lite."

★ What's new

Sub-millisecond processing speeds and drastically reduced compute requirements compared with Gemini 3 Pro Image. Supports a 1M-token input context window. Output: 4K-token image matrices paired with up to 64K tokens of explanatory text. Strong performance in international text rendering across non-Latin scripts and in character consistency across successive edits. Pricing: ~80% cost reduction per thousand generated matrices compared with gemini-3-pro-image.

Technical details

Model strings: gemini-3.1-flash-lite-image / nano-banana-2-lite | Availability: live in AI Studio, web/mobile consumer plans, and selected API regions | Context: 1M tokens input | Output: 4K tokens for image matrices, 64K tokens for text | Training: Google TPU clusters

Recommended for: Enterprise developers building high-volume creative content applications, digital marketing automation, and interactive educational platforms that need low-latency visual generation
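
To make the "~80% cost reduction" figure concrete, here is a small Python sketch of the arithmetic. The digest gives only the percentage, so the baseline price and monthly volume used in the example are purely hypothetical placeholders, not published prices.

```python
def discounted_cost(baseline_per_thousand: float, reduction: float = 0.80) -> float:
    """Cost per thousand generated images after a fractional reduction."""
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be between 0 and 1")
    return baseline_per_thousand * (1.0 - reduction)


def monthly_saving(images_per_month: int, baseline_per_thousand: float,
                   reduction: float = 0.80) -> float:
    """Difference between baseline and discounted spend for a monthly volume."""
    thousands = images_per_month / 1000
    new_price = discounted_cost(baseline_per_thousand, reduction)
    return thousands * (baseline_per_thousand - new_price)


if __name__ == "__main__":
    # Hypothetical inputs: 100.00 per thousand images, 50,000 images a month.
    baseline = 100.00
    volume = 50_000
    print(f"new price per thousand: {discounted_cost(baseline):.2f}")
    print(f"monthly saving:         {monthly_saving(volume, baseline):.2f}")
```

Because the saving scales linearly with volume, the reduction matters most for the high-volume workloads the release is aimed at.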

Gemini Spark — native macOS app and third-party extensions

Announced: June 30, 2026 | Platform: macOS (native), Web, iOS, Android | Availability: Google AI Ultra subscribers (US beta); extensions rolling out globally

Google launched a native macOS app under the "Gemini Spark" framework, giving the assistant secure, explicit permissions to interface directly with local desktop environments. Users can direct Gemini Spark to manage files across local directories, parse financial data, and automatically update Google Workspace spreadsheets, or issue multi-step instructions to their remote Mac from a mobile device. Third-party extensions are enabled for Canva, Dropbox, Instacart, OpenTable, and Zillow Rentals.

★ What's new

Native macOS client with local file system access, gated per user directory. Upcoming: a remote execution pipeline for issuing instructions from a mobile device to your remote Mac hardware. Background task orchestration with automatic monitoring of webhooks, RSS feeds, financial tickers, and real-time sports data. MCP support: developers can build and expose custom local or remote MCP servers directly to the Gemini Spark client. New third-party extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals. Native integrations: Google Tasks, Google Keep.

Technical details

Platform: macOS native (Beta), Web, iOS, Android | Model: Gemini 3.1 Pro and Gemini 3.1 Flash-Lite | Availability: US Beta for AI Ultra subscribers; extensions roll out over a seven-day window | Local file system: gated per user directory | MCP: support for custom local or remote MCP servers | Extensions: Canva, Dropbox, Instacart, OpenTable, Zillow Rentals

Recommended for: Power users and executives in Apple ecosystems who need a proactive AI assistant for automating local cross-app workflows and monitoring real-time data

Gemini Enterprise Agent Platform — Vertex AI absorbed and rebranded

Announced: June 30, 2026 | Platform: Google Cloud | Availability: GA for all Google Cloud enterprise accounts

Google Cloud deprecated Vertex AI as a standalone brand, absorbing all ML models, developer packages, and cognitive pathways into the unified Gemini Enterprise Agent Platform. Agent Designer provides a visual no-code/low-code flowchart environment for designing autonomous, trigger-based pathways that run in isolated cloud sandboxes. Agent Inbox provides a centralised operations room with the categories "Needs your input," "Errors," and "Completed."

★ What's new

The Vertex AI brand is retired; all enterprise ML workloads now sit under the Gemini Enterprise Agent Platform. Agent Designer: a visual no-code/low-code flowchart environment for designing autonomous agent pathways in isolated cloud sandboxes. Agent Inbox: a centralised operations room for prioritising and reviewing agent activity. Projects: strictly fence agent memory to assigned datasets, Drive repositories, and Group chats. Skills: custom macros invoked via @mention to execute deterministic tasks. Agent Gallery: ready-made connectors for Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, and ServiceNow.

Technical details

Platform: Gemini Enterprise Agent Platform (formerly Vertex AI infrastructure) | Models: Gemini 3.1 Pro, Flash, Flash-Lite | Availability: GA globally for Google Cloud Enterprise | Key changes: full absorption of Vertex AI into a unified Agentic SDK; secure cloud sandboxing for real-time code execution; visual flowchart compilation into execution graphs | Agent Gallery connectors: Asana, Workday, Mailchimp, Adobe, Atlassian, Lovable, ServiceNow

Recommended for: CIOs, enterprise architects, and operations leaders who want secure, autonomous, and governed AI workforces for end-to-end business workflows

Remote MCP server for Gemini Enterprise Agent Platform

Announced: June 30, 2026 | Availability: GA for all Google Cloud projects with the Agent Platform API enabled

Google deployed a remote Model Context Protocol (MCP) server within the Gemini Enterprise Agent Platform, enabling external development frameworks, such as Claude Code, to interact securely with Google Cloud environments. The connection runs entirely inside Google Cloud's secure infrastructure and integrates natively with Cloud IAM.

⚠ What's new

External coding agents can natively call models from Google Model Garden, pull approved internal prompt templates, or manage notebooks within an active project. Implementation takes three steps: enable the Agent Platform API in the Google Cloud Console, configure the local client JSON payload to point to the remote Google Cloud MCP endpoint, and copy the provided Toolset Endpoints into the external IDE workspace. Native Cloud IAM integration automatically handles enterprise token exchange and session validation at the perimeter.

Technical details

Platform: Google Cloud IAM / Agent Platform API | Protocol: open MCP specification | Availability: auto-activates when the Agent Platform API is enabled | Key changes: native hosting of remote MCP server endpoints inside Google Cloud boundaries; centralised resource cataloguing via Agent Registry | Security: runs inside Google Cloud's secure infrastructure, governed by IAM

Recommended for: DevOps engineers and full-stack development teams who want to use advanced third-party coding agents without compromising enterprise data governance
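
The second of the three setup steps above, pointing a local client JSON payload at the remote endpoint, might look roughly like the Python sketch below. It writes an entry in the `mcpServers` layout that several MCP clients read; whether Google's documentation prescribes exactly this layout is an assumption. The server label and the endpoint URL are hypothetical placeholders, since the digest does not give the real endpoint.

```python
import json


def build_mcp_client_config(server_name: str, endpoint_url: str) -> dict:
    """Build a client-side config entry for a remote MCP server over HTTP.

    The "mcpServers" / "type" / "url" layout is an assumed client convention,
    not something confirmed by the announcement.
    """
    if not endpoint_url.startswith("https://"):
        raise ValueError("remote MCP endpoints should be reached over HTTPS")
    return {
        "mcpServers": {
            server_name: {
                "type": "http",
                "url": endpoint_url,
            }
        }
    }


if __name__ == "__main__":
    # Placeholder values: substitute the Toolset Endpoint copied from
    # your own Google Cloud project.
    config = build_mcp_client_config(
        "gcp-agent-platform",           # hypothetical server label
        "https://example.invalid/mcp",  # hypothetical endpoint URL
    )
    print(json.dumps(config, indent=2))
```

No credentials appear in the payload here because, per the announcement, Cloud IAM handles token exchange; how a given client obtains those tokens is outside this sketch.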

Native VPC Service Controls integration

Announced: June 27, 2026 | Availability: GA for all enterprise accounts using advanced agentic workflows

Google announced native integration of Virtual Private Cloud (VPC) Service Controls directly into the Gemini Enterprise Agent Platform, establishing explicit network perimeters around autonomous agent workloads. When a long-running agent executes code in its cloud sandbox or connects to internal data repositories, VPC Service Controls ensure the data stays strictly inside the designated trusted boundary.

Technical details

Platform: Google Cloud security architecture | Availability: globally via perimeter configuration in the Google Cloud Console | Key changes: extension of VPC Service Controls to Agent Platform API endpoints; cryptographic identities for agents moving across network perimeters; automatic blocking of unverified external data-egress requests from agent tool-calling loops

Recommended for: CISOs, compliance officers, and network security engineers who require absolute data isolation, especially in banking, healthcare, and government
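
The idea of blocking unverified egress from a tool-calling loop can be illustrated with a toy allowlist check in Python. This is only a conceptual sketch: it does not use or represent the VPC Service Controls API, and the `Perimeter` class, host names, and return values are all invented for illustration.

```python
from urllib.parse import urlparse


class Perimeter:
    """A toy trust boundary: a set of host names that tool calls may reach."""

    def __init__(self, trusted_hosts):
        self.trusted_hosts = {host.lower() for host in trusted_hosts}

    def allows(self, url: str) -> bool:
        # urlparse returns hostname=None for strings without a network
        # location, which this check treats as untrusted.
        host = (urlparse(url).hostname or "").lower()
        return host in self.trusted_hosts


def run_tool_call(perimeter: Perimeter, url: str) -> str:
    """Gate a single outbound tool call on the perimeter check."""
    if not perimeter.allows(url):
        return "blocked"
    # A real agent would perform the request here; the sketch only decides.
    return "allowed"


if __name__ == "__main__":
    perimeter = Perimeter(["storage.internal.example"])  # hypothetical host
    for target in [
        "https://storage.internal.example/reports/q2.csv",
        "https://paste.external.example/upload",
    ]:
        print(target, "->", run_tool_call(perimeter, target))
```

The design choice worth noting is default-deny: anything not explicitly inside the boundary, including malformed destinations, is refused rather than passed through.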

Workspace Feature Drop — Formula Fixer, iterative Gmail writing

Announced: June 25, 2026 | Platform: Google Workspace | Availability: AI Ultra, Pro, Plus, and Workspace Enterprise Plus on the Gemini Alpha track

Google shipped its latest Workspace Feature Drop. In Sheets, Gemini Formula Fixer diagnoses syntax or logic errors in formulas, explains the correction in plain language, and updates the formula inline. In Gmail, "Help me write" has been upgraded with a persistent instruction bar for iterative refinements: issue direct edits such as "add the project deadline to the third line" without rewriting the whole prompt. Spreadsheet localisation expanded to 28 new languages.

★ What's new

Gemini Formula Fixer (Sheets): diagnoses syntax/logic errors and updates formulas inline. Iterative prompt refinement (Gmail): a persistent instruction bar for surgical text edits without rewriting the whole prompt. Spreadsheet localisation in 28 languages (Spanish, Japanese, French, German, Korean, and more). AI Inbox (Alpha track only): high-volume executive triage.

Technical details

Platform: Google Workspace | Model: Gemini 3.1 Pro integration | Availability: live for premium plans, rolling out to enterprise | Sheets: multi-column contextual analysis, AST-based formula debugging engine | Gmail: stateless partial string edits in the drafting API | Localisation: 28 new languages with an expanded tokenizer character set

Recommended for: Financial analysts, project managers, and operations staff who rely on Google Workspace and need fast data correction and precise written communication

Plans and pricing

Gemini Enterprise Agent Platform deprecates Vertex AI's standalone billing model: all subsequent enterprise ML workloads are invoiced under the platform's unified ledger. Running agents inside secure cloud sandboxes incurs standard Google Cloud per-minute compute overhead, isolated by native VPC Service Controls with no additional compliance premium. Remote MCP server access carries no licensing fee; it is billed only by the volume of model and tool calls. Gemini Spark on macOS remains tied to the Google AI Ultra subscription ($20/month). A new Google Home Premium tier at $10/month covers advanced voice-agent capabilities and continuous video-history analysis for consumer IoT hardware.

Technical details

Gemini Enterprise Agent Platform: replaces Vertex AI standalone billing | gemini-3.1-flash-lite-image: ~80% cost reduction vs gemini-3-pro-image | Cloud sandbox execution: standard Google Cloud per-minute compute overhead | Remote MCP: zero licensing; billed by volume of model and tool calls | Google AI Ultra: $20/month (Gemini Spark macOS + background monitoring) | Google Home Premium: $10/month (advanced voice agent + continuous video history for IoT)

Recommended for: Enterprise IT leaders moving high-frequency visual tasks to the Flash-Lite architecture and integrating external developer IDEs into Google Cloud via open-standard MCP channels

Microsoft Copilot

Dateline: July 3, 2026 | Next update: July 10, 2026

Copilot Cowork reached global general availability on June 30, with automatic model selection between GPT-5.5 Thinking (deep reasoning), Claude (structured/visual work), and GPT-5.5 Instant (lightweight tasks). Productivity apps gained new customisation and branding capabilities, Copilot Vision was announced for screen-based insights, and admins gained stronger governance through Purview and cost dashboards.

⚠️ Copilot Cowork — global GA, model selection, expanded plugins

GA: June 30, 2026 | Status: globally available | Automatic model selection: active

Cowork is now generally available worldwide. Users can switch between Chat and Cowork in the Copilot app. Cowork executes tasks end to end, returning finished results rather than drafts. The headline new feature is automatic model selection: Cowork can now choose between GPT-5.5 Thinking for deep reasoning, Claude for structured/visual work, and GPT-5.5 Instant for lightweight tasks without user intervention.

★ What's new

Global GA: all enterprise tenants. Automatic model selection: Cowork routes automatically between GPT-5.5 Thinking, Claude, and GPT-5.5 Instant based on task type. Expanded plugin ecosystem: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, plus Databricks via sideloading. Microsoft Fabric and Dynamics 365 apps are now integrated. Cost Management Dashboard in the admin centre: admins can track Cowork credit usage, budgets, and spend. Customize tab for custom skills.

Technical details

Cowork: usage-based billing, Cost Management Dashboard in the admin centre | Automatic model selection: GPT-5.5 Thinking / Claude / GPT-5.5 Instant | Expanded plugin store: Enosix, Harvey, LSEG, Miro, monday.com, Moody's, Morningstar, S&P Global Energy, TeamsMaestro, Databricks | Microsoft Fabric + Dynamics 365: integrated | Customize tab: custom skills

Recommended for: Enterprises orchestrating complex workflows with tailored model selection and plugin integration
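
Routing by task type, as described above, can be pictured with a short Python sketch. The three model labels and the three task categories come from the announcement; the keyword classifier is a hypothetical stand-in, since Microsoft has not described how Cowork actually classifies tasks.

```python
from enum import Enum


class TaskKind(Enum):
    DEEP_REASONING = "deep reasoning"
    STRUCTURED_VISUAL = "structured/visual work"
    LIGHTWEIGHT = "lightweight task"


# Task category -> model label, as listed in the announcement.
ROUTES = {
    TaskKind.DEEP_REASONING: "GPT-5.5 Thinking",
    TaskKind.STRUCTURED_VISUAL: "Claude",
    TaskKind.LIGHTWEIGHT: "GPT-5.5 Instant",
}


def classify(task: str) -> TaskKind:
    """Hypothetical keyword classifier, used only to drive the example."""
    text = task.lower()
    if any(word in text for word in ("slide", "table", "chart", "layout")):
        return TaskKind.STRUCTURED_VISUAL
    if any(word in text for word in ("analyse", "analyze", "forecast", "reconcile")):
        return TaskKind.DEEP_REASONING
    return TaskKind.LIGHTWEIGHT


def select_model(task: str) -> str:
    """Pick a model label for a task without any user input."""
    return ROUTES[classify(task)]


if __name__ == "__main__":
    for task in [
        "Build a slide deck from the Q2 notes",
        "Forecast next quarter's revenue",
        "Rename this file",
    ]:
        print(f"{task!r} -> {select_model(task)}")
```

Keeping the category-to-model table separate from the classifier mirrors the user-facing behaviour: the routing policy can change without users ever choosing a model themselves.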

Copilot in apps — Word, Excel, Outlook, PowerPoint

★ What's new

Word: model selection now available; Copilot Catchup content card; image creation; agentic capabilities on iOS; chat history preserved across apps; edits applied from comments. Excel: Skills, personalisation, and rules; Planner Agent availability; deeper reasoning over Power BI enterprise data. Outlook: Compose canvas refinements; Copilot settings in Outlook Classic rolling out next month. PowerPoint: Brand Kit Picker for approved templates; reusable Copilot skills; references to SharePoint libraries and OneDrive folders.

Technical details

Word: model selection, Catchup card, image creation, iOS agentic, cross-app chat history, comment-based edits | Excel: Skills, personalisation, rules, Planner Agent, Power BI reasoning | Outlook: Compose canvas, Classic settings next month | PowerPoint: Brand Kit Picker, reusable skills, SharePoint/OneDrive references

Recommended for: Teams that need consistent brand application, deeper reasoning over data, and continuity across apps

Governance and compliance — Admin Center, Purview

★ What's new

Cost Management Dashboard: admins can track Cowork credit usage, budgets, and spend in the admin centre. Purview DLP controls: email restriction and Cowork governance added. Federated Connectors GA: secure integration of external apps, with a phased rollout.

Technical details

Cost Management Dashboard: Cowork credits, budget tracking, spend tracking | Purview DLP: email restriction, Cowork governance | Federated Connectors: GA, phased rollout

Recommended for: IT admins managing spend, compliance, and connector governance

Copilot Vision — screen-based insights

Announced: June 29, 2026 | Status: early rollout in Frontier tenants

Copilot Vision will generate insights based on what is visible on the user's screen. It is in early rollout for Frontier tenants.

Recommended for: Analysts and researchers who need contextual, screen-based insights

Copilot Notebooks — expanded references

★ What's new

Outlook emails can now be added as references in Copilot Notebooks. Notebooks are now available to all Copilot Chat users, not only licensed Copilot 365 users.

Recommended for: Knowledge workers consolidating references from Outlook, SharePoint, and OneDrive

Plans and pricing

Cowork: usage-based billing with the admin Cost Management Dashboard. Claude and GPT-5.5 Thinking are included in Pro and Enterprise plans. GPT-5.5 Instant is available as a low-cost option. Plugins are included in Cowork GA.

Technical details

Cowork: usage-based billing, admin Cost Management Dashboard | Models: Claude + GPT-5.5 Thinking (Pro + Enterprise), GPT-5.5 Instant (low-cost option) | Plugins: included in Cowork GA | Copilot Notebooks: expanded to all Copilot Chat users

Recommended for: Enterprises balancing cost against model flexibility and plugin expansion

Filed under: AI Weekly Digest
First published: Jul 3, 2026