This Week in Cloud: Agents Get Keys, Nodes Get Faster, and Azure Bets Big on Europe

It has been a busy week across the major clouds, with announcements spanning AI agent infrastructure, Kubernetes performance, database architecture, and a heavyweight statement of intent from Microsoft on the European market. Underneath the noise, there is a clear theme: the platforms are maturing, and the rough edges that have been annoying engineers for years are finally getting addressed.

AWS MCP Server Is Now Generally Available

AWS has moved the MCP Server to general availability as part of its Agent Toolkit, giving AI coding agents authenticated access to all AWS services via a compact set of tools. The key selling point is that agents can now call any of the 15,000-plus AWS API operations using your existing IAM credentials, retrieve current documentation at query time rather than relying on stale training data, and even run short Python scripts in a sandboxed environment that inherits your IAM permissions. Enterprise controls are included out of the box, with CloudTrail capturing all API calls and CloudWatch publishing metrics under a dedicated namespace so your audit trail stays clean.

Why it matters: If you have been frustrated by AI coding assistants reaching for the CLI instead of CDK, or generating IAM policies with far too much access, this gives you a structured way to give agents real AWS capability with proper guardrails. The separation between human and agent permissions via IAM is the bit worth paying attention to.

Read the full announcement

GKE Node Startup Gets Up to 4x Faster

Google has rearchitected how GKE provisions VMs and nodes, combining intelligent compute buffers, fast-starting virtual machines, and a new control plane that allows VMs to resize without rebooting. The result is node startup times up to four times faster for qualifying hardware, with the improvement already live for GKE Autopilot workloads on NVIDIA L4, A100, H100, and general-purpose compute. No Terraform changes, no YAML patches required.

Why it matters: Cold-start latency has been the primary driver of GPU over-provisioning in AI inference workloads. If your autoscaler can actually react in real time, you stop paying for idle accelerators as insurance against slow spin-up. For teams running batch or variable AI workloads on GKE, this is a genuine cost and operational improvement.

Read the full announcement

Bigtable Adds an In-Memory Tier

Google has introduced a new in-memory storage tier for Bigtable, sitting above the existing SSD and HDD tiers in a unified hybrid architecture. Using Remote Direct Memory Access (RDMA), the tier delivers sub-millisecond read latency with roughly ten times higher point read throughput per dollar compared to SSD alone, and can handle up to 120,000 queries per second on a single row without degradation. Hot data is promoted to memory automatically, cold data falls back to SSD, and you can apply fine-grained control via application profiles. The capability is available exclusively in the new Bigtable Enterprise Plus edition.

Why it matters: This collapses the common pattern of running a primary database alongside a separate caching layer, with all the consistency headaches that involves. For financial services, telemetry, or any workload governed by power-law access patterns, having a single managed service handle tiering automatically is architecturally cleaner and operationally simpler.

Read the full announcement

Azure Commits $30bn to UK and Expands Across Europe

Microsoft has published a detailed update on its European infrastructure expansion, including ongoing investment across Austria, Belgium, Denmark, Greece, Finland, Spain, Italy, Germany, Poland, and the UK. The headline figure is a $30 billion commitment to AI infrastructure across the UK between 2025 and 2028, including $15 billion in capital expenditure to expand datacentre capacity. The post also covers multi-region architecture guidance, sovereign cloud options, and the EU Data Boundary, positioning Azure as the platform for organisations with strict data residency requirements.

Why it matters: For UK and European architects, this directly affects region selection, sovereignty planning, and data residency strategies. More regions mean more options for active-active architectures, lower latency to end users, and cleaner compliance postures under UK GDPR and EU-wide regulation.

Read the full announcement

Azure IaaS: Defense in Depth as a Design Property

Microsoft has published the third instalment in its Azure IaaS blog series, this time focused squarely on security architecture. The post walks through how Azure applies defence in depth across the full stack, from hardware roots of trust and Trusted Platform Modules at the bottom, through hypervisor isolation and Trusted Launch for VMs, up to network defaults aligned with Zero Trust principles, encryption at rest and in transit by default, and continuous monitoring via Defender for Cloud and Azure Monitor. The framing throughout is that security is a design property of the platform, not a layer added afterwards.

Why it matters: This is a useful consolidation of how Azure IaaS security actually works at each layer, and worth reading if you are currently designing or reviewing an IaaS landing zone. The sections on JIT VM access, Private Link defaults, and how confidential computing extends isolation to in-use data are particularly relevant for regulated workloads.

Read the full announcement

The week’s announcements reflect where the industry is right now: agentic tooling is moving from experimental to production-ready, infrastructure teams are finally getting the low-level performance improvements they have been asking for, and the major clouds are making large, multi-year bets on physical infrastructure that will shape architectural options for years to come. The question worth sitting with is how quickly your own platform strategies will need to adapt as agent-driven infrastructure provisioning becomes a realistic option rather than a demo curiosity.